Query lcl|Aclame:protein:vir:9875|NCBI_annot:hypothetical protein|genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Match_columns 296 No_of_seqs 55 out of 60 Neff 4.3 Searched_HMMs 1612 Date Sat Nov 30 11:14:27 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_33 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_33_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:9875 Length: 296 # 100.0 1E-149 7E-153 837.3 27.2 295 1-296 1-296 (296) 2 protein:vir:9927 Length: 295 # 100.0 2E-135 1E-138 759.6 25.3 286 7-296 1-289 (295) 3 protein:vir:106647 Length: 303 100.0 1E-133 7E-137 749.4 24.3 287 6-296 1-297 (303) 4 protein:vir:739 Length: 231 # 100.0 1.2E-40 7.4E-44 239.4 15.7 217 46-295 1-231 (231) 5 protein:vir:95107 Length: 270 100.0 4.9E-36 3E-39 214.2 18.9 255 1-296 1-266 (270) 6 protein:vir:96833 Length: 275 100.0 2.5E-35 1.5E-38 210.3 20.3 260 7-296 1-272 (275) 7 protein:vir:105334 Length: 276 100.0 7E-34 4.3E-37 202.4 19.7 261 1-296 1-271 (276) 8 protein:vir:95898 Length: 274 100.0 3.6E-32 2.2E-35 193.0 21.2 258 1-296 1-271 (274) 9 protein:vir:96262 Length: 274 100.0 3.6E-32 2.2E-35 193.0 21.2 258 1-296 1-271 (274) 10 protein:vir:97433 Length: 274 100.0 5E-32 3.1E-35 192.2 21.6 258 1-296 1-271 (274) 11 protein:vir:94494 Length: 274 100.0 5E-32 3.1E-35 192.2 21.6 258 1-296 1-271 (274) 12 protein:vir:93742 Length: 274 100.0 5.9E-32 3.7E-35 191.8 21.9 258 1-296 1-271 (274) 13 protein:vir:3613 Length: 272 # 100.0 1.2E-31 7.7E-35 190.0 19.2 259 1-295 1-272 (272) 14 protein:vir:1239 Length: 274 # 100.0 1.4E-31 8.4E-35 189.8 19.0 258 1-296 1-271 (274) 15 protein:vir:96123 Length: 274 100.0 5.4E-31 3.4E-34 186.5 21.8 259 1-296 1-271 (274) 16 protein:vir:3033 Length: 272 # 99.9 1.5E-29 9.5E-33 178.5 20.4 260 1-296 1-270 (272) 17 protein:vir:9820 Length: 272 # 99.9 1.5E-29 9.5E-33 178.5 20.4 260 1-296 1-270 (272) 18 protein:vir:80930 Length: 278 99.9 3.3E-28 2.1E-31 171.2 20.4 262 1-296 1-278 (278) 19 protein:vir:5974 Length: 324 # 99.9 1.8E-23 1.1E-26 145.3 18.1 278 1-294 1-324 (324) 20 protein:vir:102944 Length: 330 99.8 4.3E-23 2.7E-26 143.2 18.0 273 1-296 1-330 (330) 21 protein:vir:1583 Length: 351 # 99.8 1.3E-21 7.9E-25 135.1 19.1 266 1-296 1-298 (351) 22 protein:vir:108211 Length: 318 99.6 3E-18 1.8E-21 116.7 13.7 270 1-295 6-318 (318) 23 protein:vir:7990 Length: 273 # 99.6 1.6E-17 9.7E-21 112.7 17.2 245 22-295 1-273 (273) 24 protein:vir:105822 Length: 273 99.5 9.7E-16 6E-19 102.9 16.7 247 22-295 1-273 (273) 25 protein:vir:102605 Length: 273 99.5 9.7E-16 6E-19 102.9 16.7 247 22-295 1-273 (273) 26 protein:vir:99749 Length: 324 99.3 2.1E-13 1.3E-16 90.1 18.1 266 1-296 18-316 (324) 27 protein:vir:97148 Length: 324 99.3 3.7E-13 2.3E-16 88.8 17.7 276 1-296 18-316 (324) 28 protein:vir:80180 Length: 381 99.3 3.6E-13 2.2E-16 88.8 16.8 282 1-296 1-346 (381) 29 protein:vir:101607 Length: 379 99.3 4.2E-13 2.6E-16 88.4 16.3 264 1-295 98-379 (379) 30 protein:vir:95763 Length: 297 99.3 2.7E-13 1.6E-16 89.5 14.8 274 2-296 1-297 (297) 31 protein:vir:94622 Length: 341 99.3 8.4E-13 5.2E-16 86.8 17.4 278 1-296 1-341 (341) 32 protein:vir:103955 Length: 324 99.3 2.7E-13 1.7E-16 89.5 14.5 266 1-296 18-316 (324) 33 protein:vir:2344 Length: 397 # 99.2 5E-13 3.1E-16 88.0 15.1 276 1-296 10-334 (397) 34 protein:vir:9410 Length: 415 # 99.2 8E-13 5E-16 86.9 15.5 273 1-296 112-405 (415) 35 protein:vir:100135 Length: 418 99.2 1.7E-12 1E-15 85.1 16.8 255 1-296 135-416 (418) 36 protein:vir:94142 Length: 304 99.2 3.2E-12 2E-15 83.6 16.8 277 1-294 1-304 (304) 37 protein:vir:105905 Length: 304 99.2 3.2E-12 2E-15 83.6 16.8 277 1-294 1-304 (304) 38 protein:vir:96392 Length: 324 99.2 5.1E-12 3.2E-15 82.5 17.9 267 1-296 18-316 (324) 39 protein:vir:78830 Length: 324 99.2 5.1E-12 3.2E-15 82.5 17.9 267 1-296 18-316 (324) 40 protein:vir:9309 Length: 324 # 99.2 4.7E-12 2.9E-15 82.7 17.3 267 1-296 18-316 (324) 41 protein:vir:99075 Length: 392 99.2 2.3E-12 1.4E-15 84.4 14.9 275 1-296 1-306 (392) 42 protein:vir:97053 Length: 390 99.2 7.3E-12 4.5E-15 81.6 17.6 252 1-293 113-390 (390) 43 protein:vir:1886 Length: 385 # 99.2 1.6E-12 1E-15 85.2 13.7 257 1-296 104-385 (385) 44 protein:vir:191 Length: 385 # 99.2 1.6E-12 1E-15 85.2 13.7 257 1-296 104-385 (385) 45 protein:vir:7771 Length: 330 # 99.1 3.2E-12 2E-15 83.6 14.6 284 1-296 1-324 (330) 46 protein:vir:96223 Length: 324 99.1 3.3E-12 2E-15 83.5 14.6 274 1-296 18-316 (324) 47 protein:vir:4339 Length: 395 # 99.1 1.2E-11 7.7E-15 80.4 17.3 261 1-295 113-395 (395) 48 protein:vir:4700 Length: 415 # 99.1 4E-12 2.5E-15 83.1 14.4 269 1-296 112-405 (415) 49 protein:vir:4600 Length: 415 # 99.1 4E-12 2.5E-15 83.1 14.4 269 1-296 112-405 (415) 50 protein:vir:10364 Length: 390 99.1 3.4E-12 2.1E-15 83.4 13.7 251 1-293 113-390 (390) 51 protein:vir:79987 Length: 415 99.1 7.1E-12 4.4E-15 81.7 14.9 267 1-296 112-405 (415) 52 protein:vir:81100 Length: 415 99.1 7.1E-12 4.4E-15 81.7 14.9 267 1-296 112-405 (415) 53 protein:vir:98339 Length: 415 99.1 7.1E-12 4.4E-15 81.7 14.9 267 1-296 112-405 (415) 54 protein:vir:81070 Length: 390 99.1 6.4E-12 3.9E-15 82.0 13.8 257 1-293 113-390 (390) 55 protein:vir:9704 Length: 394 # 99.1 4.9E-12 3.1E-15 82.6 13.1 263 1-296 119-391 (394) 56 protein:vir:81227 Length: 413 99.1 1.5E-11 9.1E-15 80.0 15.4 266 1-296 105-411 (413) 57 protein:vir:104085 Length: 320 99.1 8.4E-12 5.2E-15 81.3 14.1 281 1-296 1-319 (320) 58 protein:vir:95376 Length: 425 99.1 1E-11 6.2E-15 80.9 14.5 255 1-296 137-422 (425) 59 protein:vir:2430 Length: 318 # 99.1 6.1E-12 3.8E-15 82.0 12.9 283 1-296 1-314 (318) 60 protein:vir:9759 Length: 303 # 99.1 2.5E-12 1.6E-15 84.2 10.7 272 1-296 1-303 (303) 61 protein:vir:41 Length: 299 # N 99.0 1.6E-11 9.7E-15 79.8 14.2 267 5-296 1-299 (299) 62 protein:vir:94673 Length: 419 99.0 1E-11 6.3E-15 80.8 11.9 263 1-296 116-418 (419) 63 protein:vir:4856 Length: 293 # 99.0 3.3E-11 2E-14 78.1 14.7 256 1-296 1-282 (293) 64 protein:vir:1328 Length: 392 # 99.0 5.7E-11 3.5E-14 76.7 15.7 260 1-296 110-392 (392) 65 protein:vir:4092 Length: 390 # 99.0 1.1E-10 6.5E-14 75.3 16.3 260 1-296 83-369 (390) 66 protein:vir:6242 Length: 390 # 98.9 5.7E-11 3.6E-14 76.7 14.1 261 1-296 98-390 (390) 67 protein:vir:4226 Length: 326 # 98.9 7.1E-11 4.4E-14 76.2 14.6 282 1-296 1-324 (326) 68 protein:vir:94771 Length: 298 98.9 1.5E-10 9.4E-14 74.4 16.2 265 1-294 1-298 (298) 69 protein:vir:81160 Length: 371 98.9 1.9E-10 1.2E-13 73.9 16.5 258 1-295 90-371 (371) 70 protein:vir:101650 Length: 497 98.9 6.2E-11 3.8E-14 76.6 12.4 270 1-296 141-494 (497) 71 protein:vir:7855 Length: 497 # 98.9 6.2E-11 3.8E-14 76.6 12.4 270 1-296 141-494 (497) 72 protein:vir:1638 Length: 298 # 98.9 8.9E-11 5.5E-14 75.7 13.0 260 1-294 1-298 (298) 73 protein:vir:8102 Length: 543 # 98.9 8.3E-11 5.2E-14 75.8 12.2 271 1-296 249-543 (543) 74 protein:vir:9574 Length: 300 # 98.8 2.3E-10 1.5E-13 73.4 14.1 273 1-295 1-300 (300) 75 protein:vir:3870 Length: 400 # 98.8 2.8E-10 1.7E-13 73.0 14.1 255 1-296 133-400 (400) 76 protein:vir:100172 Length: 394 98.8 8.2E-10 5.1E-13 70.4 16.0 269 1-296 103-385 (394) 77 protein:vir:3991 Length: 404 # 98.8 5E-10 3.1E-13 71.6 14.1 269 1-296 100-394 (404) 78 protein:vir:4830 Length: 397 # 98.8 3.9E-10 2.4E-13 72.2 13.3 262 1-296 97-386 (397) 79 protein:vir:8187 Length: 311 # 98.8 1.6E-09 1E-12 68.8 16.6 267 1-296 1-309 (311) 80 protein:vir:7409 Length: 408 # 98.8 5E-10 3.1E-13 71.6 13.6 262 1-296 116-394 (408) 81 protein:vir:102873 Length: 392 98.8 1E-09 6.3E-13 69.9 14.9 266 1-296 88-385 (392) 82 protein:vir:102082 Length: 392 98.8 1E-09 6.3E-13 69.9 14.9 266 1-296 88-385 (392) 83 protein:vir:105004 Length: 392 98.8 1E-09 6.3E-13 69.9 14.9 266 1-296 88-385 (392) 84 protein:vir:107593 Length: 392 98.8 1E-09 6.3E-13 69.9 14.9 266 1-296 88-385 (392) 85 protein:vir:1268 Length: 397 # 98.7 3.5E-10 2.2E-13 72.4 12.2 254 1-295 123-397 (397) 86 protein:vir:2504 Length: 305 # 98.7 1.3E-09 8E-13 69.3 15.3 268 1-296 1-301 (305) 87 protein:vir:1025 Length: 408 # 98.7 1.2E-09 7.6E-13 69.5 13.6 254 1-296 115-394 (408) 88 protein:vir:80684 Length: 315 98.7 1.1E-08 7E-12 64.2 18.5 263 1-296 1-308 (315) 89 protein:vir:4953 Length: 397 # 98.6 1.3E-09 8E-13 69.3 12.3 261 1-296 97-386 (397) 90 protein:vir:1383 Length: 421 # 98.6 5.7E-09 3.6E-12 65.8 15.5 264 1-296 104-387 (421) 91 protein:vir:100884 Length: 389 98.6 9E-09 5.6E-12 64.7 16.6 259 1-296 108-383 (389) 92 protein:vir:1433 Length: 435 # 98.6 5.6E-09 3.5E-12 65.8 15.4 269 1-296 131-425 (435) 93 protein:vir:3845 Length: 395 # 98.6 1.8E-09 1.1E-12 68.5 12.5 270 1-296 98-384 (395) 94 protein:vir:4997 Length: 397 # 98.6 6.9E-09 4.3E-12 65.3 15.6 261 1-296 97-386 (397) 95 protein:vir:104256 Length: 458 98.6 2.4E-09 1.5E-12 67.8 12.8 273 1-295 143-458 (458) 96 protein:vir:94576 Length: 347 98.6 2E-08 1.2E-11 62.8 17.9 281 1-295 1-347 (347) 97 protein:vir:6212 Length: 434 # 98.6 6E-09 3.7E-12 65.7 14.8 275 1-296 130-432 (434) 98 protein:vir:962 Length: 397 # 98.6 3.8E-09 2.4E-12 66.7 13.6 250 1-295 123-397 (397) 99 protein:vir:485 Length: 407 # 98.6 4.8E-09 3E-12 66.2 13.4 264 1-296 90-401 (407) 100 protein:vir:78223 Length: 333 98.6 6.6E-09 4.1E-12 65.4 14.1 279 1-295 1-333 (333) 101 protein:vir:78523 Length: 338 98.6 1.4E-08 8.9E-12 63.6 15.8 278 1-296 1-337 (338) 102 protein:vir:99920 Length: 311 98.5 7.8E-09 4.9E-12 65.0 14.0 268 1-295 1-311 (311) 103 protein:vir:80376 Length: 435 98.5 2E-08 1.2E-11 62.8 16.2 271 1-296 131-432 (435) 104 protein:vir:102119 Length: 404 98.5 6.1E-09 3.8E-12 65.6 12.3 263 1-296 101-401 (404) 105 protein:vir:8885 Length: 347 # 98.5 1.9E-07 1.2E-10 57.5 20.1 274 1-296 1-347 (347) 106 protein:vir:96762 Length: 632 98.5 1.4E-08 8.5E-12 63.7 13.9 254 1-294 357-632 (632) 107 protein:vir:94711 Length: 347 98.5 1.8E-07 1.1E-10 57.5 20.0 280 1-296 1-347 (347) 108 protein:vir:10450 Length: 344 98.5 1.4E-07 8.5E-11 58.2 18.9 282 1-295 1-344 (344) 109 protein:vir:4456 Length: 401 # 98.5 1.2E-08 7.6E-12 64.0 13.0 259 1-295 106-401 (401) 110 protein:vir:1541 Length: 347 # 98.4 7.4E-08 4.6E-11 59.7 17.2 283 1-296 1-345 (347) 111 protein:vir:80213 Length: 334 98.4 6E-08 3.7E-11 60.2 16.7 277 5-295 1-334 (334) 112 protein:vir:5739 Length: 366 # 98.4 3.3E-08 2.1E-11 61.6 14.7 270 1-296 64-365 (366) 113 protein:vir:8420 Length: 477 # 98.4 6.8E-08 4.2E-11 59.9 16.2 266 1-296 156-472 (477) 114 protein:vir:100247 Length: 425 98.4 1.4E-08 9E-12 63.6 12.4 255 1-296 129-425 (425) 115 protein:vir:4511 Length: 409 # 98.4 1.1E-07 6.8E-11 58.7 16.7 273 1-296 99-407 (409) 116 protein:vir:80128 Length: 466 98.4 2.5E-08 1.6E-11 62.2 13.1 261 1-296 148-452 (466) 117 protein:vir:78739 Length: 332 98.4 3.7E-08 2.3E-11 61.3 13.9 273 1-295 1-332 (332) 118 protein:vir:105038 Length: 428 98.3 8.3E-08 5.1E-11 59.4 14.9 272 1-296 125-426 (428) 119 protein:vir:9361 Length: 402 # 98.3 2.7E-08 1.7E-11 62.1 12.1 250 1-296 132-397 (402) 120 protein:vir:3364 Length: 347 # 98.3 3.5E-07 2.2E-10 56.0 17.1 283 1-296 1-345 (347) 121 protein:vir:1084 Length: 437 # 98.2 9.2E-08 5.7E-11 59.2 13.3 263 1-296 147-428 (437) 122 protein:vir:96978 Length: 387 98.2 4.6E-08 2.8E-11 60.8 11.4 251 1-296 117-382 (387) 123 protein:vir:94424 Length: 387 98.2 4.6E-08 2.8E-11 60.8 11.4 251 1-296 117-382 (387) 124 protein:vir:2685 Length: 387 # 98.2 4.6E-08 2.8E-11 60.8 11.4 251 1-296 117-382 (387) 125 protein:vir:2201 Length: 345 # 98.2 1.2E-06 7.6E-10 53.0 18.8 282 1-295 1-345 (345) 126 protein:vir:78640 Length: 352 98.2 8E-08 5E-11 59.5 11.6 249 1-296 82-347 (352) 127 protein:vir:9643 Length: 377 # 98.2 2.4E-07 1.5E-10 56.8 14.2 254 1-295 78-377 (377) 128 protein:vir:93881 Length: 387 98.1 1.4E-07 8.6E-11 58.2 11.9 254 1-296 117-382 (387) 129 protein:vir:9509 Length: 381 # 98.0 4.7E-07 2.9E-10 55.3 13.3 267 1-296 64-373 (381) 130 protein:vir:101291 Length: 381 98.0 4.7E-07 2.9E-10 55.3 13.3 267 1-296 64-373 (381) 131 protein:vir:99675 Length: 324 98.0 3.3E-06 2.1E-09 50.6 17.4 241 43-296 1-297 (324) 132 protein:vir:105645 Length: 400 97.9 5.2E-06 3.2E-09 49.6 17.5 273 1-296 1-338 (400) 133 protein:vir:103323 Length: 364 97.9 4.5E-06 2.8E-09 49.9 17.2 277 1-296 1-340 (364) 134 protein:vir:108303 Length: 418 97.9 2.6E-06 1.6E-09 51.2 15.2 269 1-296 1-311 (418) 135 protein:vir:98635 Length: 377 97.9 1.5E-06 9.5E-10 52.5 13.8 255 1-295 78-377 (377) 136 protein:vir:100057 Length: 375 97.8 1.4E-05 8.5E-09 47.3 18.3 282 1-296 1-369 (375) 137 protein:vir:93616 Length: 645 97.8 4.3E-06 2.7E-09 50.0 15.3 269 1-296 337-637 (645) 138 protein:vir:3158 Length: 321 # 97.8 4.4E-06 2.7E-09 50.0 15.1 266 2-296 1-313 (321) 139 protein:vir:97031 Length: 402 97.7 1.6E-05 9.9E-09 46.9 16.9 271 1-296 1-338 (402) 140 protein:vir:100632 Length: 381 97.7 4.4E-06 2.7E-09 49.9 13.2 257 1-296 76-374 (381) 141 protein:vir:95963 Length: 395 97.7 5.4E-06 3.4E-09 49.5 13.7 265 1-296 66-380 (395) 142 protein:vir:94800 Length: 319 97.7 1.4E-05 8.8E-09 47.2 15.8 275 1-296 1-297 (319) 143 protein:vir:97331 Length: 319 97.7 1.4E-05 8.8E-09 47.2 15.8 275 1-296 1-297 (319) 144 protein:vir:4197 Length: 314 # 97.6 3E-05 1.9E-08 45.3 17.5 271 1-296 1-312 (314) 145 protein:vir:80446 Length: 367 97.6 1E-05 6.4E-09 47.9 14.8 271 7-296 1-339 (367) 146 protein:vir:6324 Length: 335 # 97.6 3.8E-05 2.3E-08 44.8 17.9 277 1-296 1-331 (335) 147 protein:vir:107120 Length: 329 97.6 2.1E-05 1.3E-08 46.3 15.4 274 1-296 5-307 (329) 148 protein:vir:3525 Length: 423 # 97.5 1.4E-05 8.8E-09 47.2 14.3 267 1-296 1-317 (423) 149 protein:vir:78935 Length: 335 97.4 6.9E-05 4.3E-08 43.4 17.9 262 1-296 1-331 (335) 150 protein:vir:78350 Length: 383 97.4 5E-06 3.1E-09 49.7 9.9 267 1-296 71-380 (383) 151 protein:vir:3136 Length: 322 # 97.3 1.7E-05 1E-08 46.8 12.1 273 1-296 1-321 (322) 152 protein:vir:102655 Length: 322 97.3 0.00011 6.6E-08 42.4 16.6 272 7-296 1-322 (322) 153 protein:vir:79928 Length: 393 97.3 7.9E-06 4.9E-09 48.5 9.9 253 1-296 58-359 (393) 154 protein:vir:8324 Length: 410 # 97.2 7.7E-05 4.8E-08 43.1 14.6 250 1-293 110-410 (410) 155 protein:vir:7019 Length: 401 # 97.1 0.00016 1E-07 41.3 17.7 273 1-296 1-340 (401) 156 protein:vir:174 Length: 423 # 97.1 9.9E-05 6.1E-08 42.5 14.5 272 1-296 1-317 (423) 157 protein:vir:78920 Length: 290 97.1 0.00013 8.3E-08 41.8 15.2 257 22-296 1-290 (290) 158 protein:vir:105374 Length: 423 97.0 0.00018 1.1E-07 41.1 14.8 276 1-296 1-333 (423) 159 protein:vir:97397 Length: 517 96.8 0.00011 6.7E-08 42.3 12.2 260 1-296 225-515 (517) 160 protein:vir:79008 Length: 299 96.7 0.00044 2.7E-07 39.0 15.0 260 20-296 1-298 (299) 161 protein:vir:78387 Length: 349 96.5 0.00053 3.3E-07 38.6 17.7 271 1-296 1-338 (349) 162 protein:vir:105464 Length: 346 95.4 0.0021 1.3E-06 35.3 14.7 254 22-296 1-300 (346) 163 protein:vir:105522 Length: 423 95.3 0.0023 1.5E-06 35.0 16.3 273 1-296 1-333 (423) 164 protein:vir:94989 Length: 349 95.3 0.0024 1.5E-06 34.9 18.0 270 1-296 1-338 (349) 165 protein:vir:4159 Length: 315 # 94.7 0.0037 2.3E-06 33.9 13.0 267 1-294 1-315 (315) 166 protein:vir:1781 Length: 221 # 94.7 0.0021 1.3E-06 35.3 10.9 182 78-296 1-219 (221) 167 protein:vir:102335 Length: 312 94.4 0.0044 2.7E-06 33.5 14.8 253 20-296 1-311 (312) 168 protein:vir:78090 Length: 302 94.2 0.0051 3.2E-06 33.1 14.7 251 20-296 1-301 (302) 169 protein:vir:96442 Length: 418 91.4 0.0036 2.2E-06 34.0 7.1 268 1-296 63-411 (418) 170 protein:vir:79712 Length: 285 91.0 0.018 1.1E-05 30.1 14.6 254 22-296 1-285 (285) 171 protein:vir:99523 Length: 311 90.6 0.02 1.2E-05 29.9 16.2 268 14-295 1-311 (311) 172 protein:vir:103370 Length: 418 83.4 0.031 1.9E-05 28.8 7.1 273 1-296 63-411 (418) 173 protein:vir:95131 Length: 325 81.0 0.09 5.6E-05 26.3 13.4 259 1-296 1-297 (325) 174 protein:vir:3424 Length: 341 # 80.3 0.096 5.9E-05 26.2 15.8 269 10-293 1-341 (341) 175 protein:vir:6378 Length: 346 # 73.2 0.17 0.00011 24.8 10.6 258 10-293 1-346 (346) 176 protein:vir:106590 Length: 349 71.5 0.19 0.00012 24.5 13.1 267 9-293 1-349 (349) 177 protein:vir:8843 Length: 317 # 69.3 0.22 0.00014 24.1 13.0 258 1-296 1-317 (317) 178 protein:vir:99424 Length: 360 58.9 0.4 0.00025 22.7 13.8 264 2-296 1-359 (360) 179 protein:vir:80491 Length: 467 56.6 0.34 0.00021 23.2 6.2 241 1-296 25-288 (467) 180 protein:vir:63741 Length: 468 54.8 0.39 0.00024 22.9 6.2 238 1-296 26-289 (468) 181 protein:vir:393 Length: 341 # 37.4 1.1 0.00069 20.3 15.7 266 10-293 1-341 (341) 182 protein:vir:10324 Length: 320 35.9 1.2 0.00075 20.1 14.6 251 24-296 1-318 (320) 183 protein:vir:80068 Length: 301 28.0 1.8 0.0011 19.2 11.6 245 1-287 1-301 (301) 184 protein:vir:95875 Length: 401 26.7 1.9 0.0012 19.0 15.8 285 1-296 1-401 (401) 185 protein:vir:93696 Length: 364 23.2 2.3 0.0015 18.6 10.7 272 1-296 1-361 (364) 186 protein:vir:2770 Length: 318 # 21.6 2.6 0.0016 18.3 9.3 213 2-231 1-318 (318) No 1 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=100.00 E-value=1.1e-149 Score=837.32 Aligned_cols=295 Identities=97% Similarity=1.330 Sum_probs=292.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeee-eeeeecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYA-GYDVTLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk-~~yig~A~gdVaEGe~Ipls 79 (296) ||||||+||+||++++||+++|||||+|||++||++|||+|||+|++||..|++||+|| |.|++++. ||+|||+|||| T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~-dVaEGe~Ipls 79 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEG-NVPEGEVIPLS 79 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccc-cccCCcccchh Confidence 99999999999999999999999999999999999999999999999999999998876 99999996 99999999999 Q ss_pred heeeeecceeEEEEeecccccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAW 159 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~ 159 (296) ||+|+++++++++||||||+|||||||+||||+||+|+|+||+++||+|||||||++|+++|+++++++++||+||+++| T Consensus 80 kvt~~~~~t~t~~ikK~rK~tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~t~~~lQ~Ala~~~ 159 (296) T protein:vir:98 80 KVERKIHSEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAW 159 (296) T ss_pred hheeeecceEEEEeeccccccCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeeechhhHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcc Q lcl|Aclame:pro 160 GKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNN 239 (296) Q Consensus 160 ~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~ 239 (296) +++.++|||||++++|+||||+|+|+|||+++|++|++||++||+||||++||||+|||+|++|+|++||||+||+|||+ T Consensus 160 ~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~fG~tyl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~ 239 (296) T protein:vir:98 160 GKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNN 239 (296) T ss_pred hhhhhhccccCCCceEEEEehHHHHHHhcCCccchhheechhhhhhccccEEEEcCcCCCceEEEeeecceEEEeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 240 SELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 240 g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) |||+++|++++|+||||||+|+++++||||||+++||++||||++||||++||+++| T Consensus 240 ~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 240 SELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred cchhhhhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=100.00 E-value=1.6e-135 Score=759.60 Aligned_cols=286 Identities=34% Similarity=0.532 Sum_probs=276.5 Q ss_pred cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhheeeeec Q lcl|Aclame:pro 7 YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIH 86 (296) Q Consensus 7 ~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~ 86 (296) -||+||++++||+.++||||++||++||++|+|+|||+|++||..|+|||+|||.|+|+|. ||+|||+||||||+|+++ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~-dVaEGe~Iplskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQT-DPGEGETIPLSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccc-cccCCcccchhhheeeee Confidence 7999999999999999999999999999999999999999999999999999999999996 999999999999999999 Q ss_pred ceeEEEEeecccccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 87 SEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKLQVLF 166 (296) Q Consensus 87 ~t~~~tikK~~K~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~~~~F 166 (296) ++++++||||||++||||||+|||||||+|+|+||+++||+|||||||++|+|+|++ +++++||.|++++|+++..+| T Consensus 80 ~t~t~kikK~rK~tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t--~tg~~lq~a~a~~~~al~~f~ 157 (295) T protein:vir:99 80 KDYTVKWFKKRRATTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK--VKGVGLQKALSASWAKLATFN 157 (295) T ss_pred eeeEEEeeeecccccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee--eehhhHHHHHHHhhhhhhhcc Confidence 999999999999999999999999999999999999999999999999999998865 678999999999999999977 Q ss_pred ccccCcceEEEEcHHHHHHHhcCCcccccee--echhhhhhhheeE-EEEeccCCCceEEEEcccceEEEEecCcchhhh Q lcl|Aclame:pro 167 EDYGSERAIVFANSLDVAEYIAKAGITTQTA--FGLTYLVDFTGTV-IISTNDVTKGEIWATVPENIIFAYINPNNSELA 243 (296) Q Consensus 167 eded~~~~VlFvNP~Daa~~l~~a~i~~q~~--fg~tyl~nfLG~~-II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~ 243 (296) |+++ +++|+||||+|+++||++|.++.|.+ ||++||+|||||+ ||||+|||+|++|+|++||||+||+||++|||+ T Consensus 158 Ee~~-~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~ 236 (295) T protein:vir:99 158 EFEG-SPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLG 236 (295) T ss_pred cccC-CceEEEEehHHHHHHHhccccccchhhhhhhhhhhhhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhh Confidence 7754 79999999999999999999988765 9999999999997 999999999999999999999999999999999 Q ss_pred hhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 244 KEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 244 ~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ++|++++|+||||||+|+++++||||||+++||++||||++||||++||+.|- T Consensus 237 ~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~ 289 (295) T protein:vir:99 237 GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAA 289 (295) T ss_pred hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCc Confidence 99999999999999999999999999999999999999999999999995554 No 3 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=100.00 E-value=1.1e-133 Score=749.42 Aligned_cols=287 Identities=37% Similarity=0.585 Sum_probs=276.2 Q ss_pred ccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeee---eeeecccCcccCCceechhhee Q lcl|Aclame:pro 6 TYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAG---YDVTLAEGNVPEGEVIPLSKVE 82 (296) Q Consensus 6 ~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~---~yig~A~gdVaEGe~Iplskv~ 82 (296) -.+|+||++++||++++||||+|||++||++|||+|||+|++||..|++||+||| .|++++. ||+|||+||||||+ T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~-dVaEGe~Iplskvt 79 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNG-DVAEGDVIPLTKVT 79 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccc-cccCCcccchhhhe Confidence 7899999999999999999999999999999999999999999999999999987 6999995 99999999999999 Q ss_pred eeecceeEEEEeecccccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------eecchhhHHHHHH Q lcl|Aclame:pro 83 RKIHSEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------QDALGAGLQGALA 156 (296) Q Consensus 83 ~~~~~t~~~tikK~~K~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------~~~t~~~lQ~Ala 156 (296) |+++++++++||||||+|||||||+||||+||+|+|+||+++||+|||||||++|+++|++ +++++++||+||+ T Consensus 80 ~~~~~t~~~~~kK~rK~tTdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s~~glq~Al~ 159 (303) T protein:vir:10 80 REQVDITELQFAKYRKSTSAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKLSAENLQGALS 159 (303) T ss_pred eeecceEEEEeecccccccHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceeecHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999999999854 4678999999999 Q ss_pred HHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccc-eeechhhhhhhheeEEEEeccCCCceEEEEcccceEEEEe Q lcl|Aclame:pro 157 SAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQ-TAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYI 235 (296) Q Consensus 157 ~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q-~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~ 235 (296) .+|+++...||| +.++|+||||+|+++||++|++..+ ++||++||+||||++||||+|||+|++|+|++||||+||+ T Consensus 160 ~~~~kl~~~~ed--~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~ 237 (303) T protein:vir:10 160 KGRANLSVLLDD--EITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLTPYVGVKIVEFADVPQGEVWMTVAENLNVAYA 237 (303) T ss_pred hhhhhccccccc--cccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhhhhhcceEEEeccCCCceEEEeeccceEEEEe Confidence 999999998875 4789999999999999999999987 7999999999999999999999999999999999999999 Q ss_pred cCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 236 NPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 236 ~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ||+ |||+++|++++|+||||||+|+++++||||||+++||++||||++||||++||++.- T Consensus 238 ~~~-g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e 297 (303) T protein:vir:10 238 NPR-GELSRAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDE 297 (303) T ss_pred cCc-hhhhhhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccc Confidence 997 899999999999999999999999999999999999999999999999999996554 No 4 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=1.2e-40 Score=239.44 Aligned_cols=217 Identities=19% Similarity=0.249 Sum_probs=163.0 Q ss_pred ccccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHH Q lcl|Aclame:pro 46 KISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVR 123 (296) Q Consensus 46 ~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~ 123 (296) .--...|+||++|+| ||+|+ +++||++||+++|+++ +.+++|||++|++ ||||. ++|||||++|+.+||++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~-~v~eG~~i~~~~l~~t---~~~atIk~~gk~~~itD~a~-l~~~gDp~~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAA-DVAEGGEISLDKIGTT---TKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQLGL 73 (231) T ss_pred CccccCCceEEeccc--ccchh-hhcCCCcCChhhcccc---ceeeeEeeeccceeeeHHHH-hhccCchHHHHHHHHHH Confidence 334678999999998 99998 9999999999999986 7899999999997 99996 99999999999999999 Q ss_pred HHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc-------ccce Q lcl|Aclame:pro 124 QLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI-------TTQT 196 (296) Q Consensus 124 ~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i-------~~q~ 196 (296) +|++|+|+|++++|++++++.+. .. -++++.++.++|+||++.+.|+||||+|++++|+.++. +... T Consensus 74 ~iA~kvD~di~~~~~~a~l~~~~-~~-----t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i 147 (231) T protein:vir:73 74 SLANKVDDDLLKAAKTTSQTVST-KA-----NVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANA 147 (231) T ss_pred HHHHhhhHHHHHhhccccccccc-cc-----cHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhhhhccce Confidence 99999999999999999876432 11 14455566689999999999999999999999997642 3345 Q ss_pred eechhhhhhhheeEEEEeccCCCceEEEEc----ccceEEEEe-cCcchhhhhhhccccccccceEEEeccccceeehhh Q lcl|Aclame:pro 197 AFGLTYLVDFTGTVIISTNDVTKGEIWATV----PENIIFAYI-NPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQT 271 (296) Q Consensus 197 ~fg~tyl~nfLG~~II~S~kV~~G~~~~t~----~~Nl~~ay~-~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et 271 (296) ++.|.+++ |+|++||+|+++|+|+.+.++ ++.|.++.= +++ +....+.-.-.+=+.+-.|.. T Consensus 148 ~~~G~iG~-i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~---vEtdRd~~~k~~~i~~~~~y~--------- 214 (231) T protein:vir:73 148 LINGTYAD-VLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQ---VETDRDIVTKTTVITADEHYA--------- 214 (231) T ss_pred eeecccce-EcceEEEEcCCCCCCceeeeeEEeeccceeeeecccce---eeccccccccccEEEEeEEEE--------- Confidence 56777776 999999999999999997654 555555442 111 111112222223333333421 Q ss_pred hhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 272 LLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 272 ~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) +.|. +..+||+.|++.- T Consensus 215 -----v~l~--~~~~vv~~t~~g~ 231 (231) T protein:vir:73 215 -----AYLY--DLTKVVNITFTGV 231 (231) T ss_pred -----EEEE--cCccEEEEEeecC Confidence 1111 2356777777655 No 5 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=4.9e-36 Score=214.17 Aligned_cols=255 Identities=18% Similarity=0.237 Sum_probs=184.7 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-- |--.+++..+=+.+.. .+++. +..++....-+.+.++.+||++|++|+|+++|+++ +++||++||..+ T Consensus 1 Ma~---T~~~d~I~Pev~~~~V----~e~~~-~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae-~~~eg~~i~~~~ 71 (270) T protein:vir:95 1 MTQ---TKKANLINPEVLANVV----SAQMQ-NAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAE-DLQEGVAMDTTQ 71 (270) T ss_pred CCc---eehhhhcchHHHHHHH----HHHHH-hHHhhccccccccccCCCCCCEEEeeeecCCCccc-cccCCCccchhh Confidence 321 1122444443333221 33332 23334444556678999999999999999999998 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcccee--cchhhHHHHHH Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD--ALGAGLQGALA 156 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~--~t~~~lQ~Ala 156 (296) ++++ +.+++|||++|++ |||+. +++||||++++.+|+++++++++|+|+++.|++++++.+ .+.+.+. T Consensus 72 lt~~---~~~a~i~~~gk~~~itD~a~-~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~~~~t~~~~~---- 143 (270) T protein:vir:95 72 MSMT---TTKVTVKETGKAVEVTQTAI-ITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTATVSADATGIL---- 143 (270) T ss_pred cccc---hheeeeehhhCcceecHHHH-hhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHH---- Confidence 9975 5789999999987 99996 899999999999999999999999999999999987643 3444444 Q ss_pred HHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccc-----cceeechhhhhhhheeE-EEEeccCCCceEEEEcccce Q lcl|Aclame:pro 157 SAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT-----TQTAFGLTYLVDFTGTV-IISTNDVTKGEIWATVPENI 230 (296) Q Consensus 157 ~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~-----~q~~fg~tyl~nfLG~~-II~S~kV~~G~~~~t~~~Nl 230 (296) ++..+|+|+.+...+++|||++++++++++.+. .+.+..+.+.+ ++|++ ||.|+++++|+.|+..++.| T Consensus 144 ----dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~-~~G~~Viv~s~~~~~~~~~l~~~gAi 218 (270) T protein:vir:95 144 ----DAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVE-IVGVSDIVKSKRVSENTAFLQRYGAM 218 (270) T ss_pred ----HHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccce-ecceeEEEeCCCCCceeEEEEeccce Confidence 444789999888999999999999999988542 22334455555 99987 57788999999999999999 Q ss_pred EEEEec-CcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 231 IFAYIN-PNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 231 ~~ay~~-~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .++.-. ++ +...-+--...+=+++-.|... ++. +..+||++|+.++= T Consensus 219 ~~~~~~~~~---vEtdRd~~~~~d~i~~~~~y~v--~~~--------------~~skvv~~t~~~a~ 266 (270) T protein:vir:95 219 EIVNKKKPE---AYTDFDILKRTHLLSTNYHYSV--NLK--------------DETGVVKVTFKPSG 266 (270) T ss_pred eeeecCCce---eeeccchhhcccEEEeeeEEEE--EEE--------------ccceEEEEEecCCC Confidence 865522 22 2222222234445555555422 111 33578888887766 No 6 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=2.5e-35 Score=210.30 Aligned_cols=260 Identities=20% Similarity=0.230 Sum_probs=189.7 Q ss_pred cccccceehhhhhhh--hhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhheeee Q lcl|Aclame:pro 7 YPEENLIKSTDLKYP--ITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERK 84 (296) Q Consensus 7 ~ae~nl~~~~dl~~a--~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~ 84 (296) -|-.|.|+-.|+-.+ -+-=..+++. .-..+....-+.+.++.+||++|++|+|+++|+++ ++.||+.||.++++.+ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~-~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELD-KKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAK-VVPEGEEIPIDLIETK 78 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHH-HhhhhcccceecccccCCCCCEEEeeeeccCCccc-cccCCCCcchhhcccc Confidence 344555555555221 1111222332 12222233335788999999999999999999998 9999999999999975 Q ss_pred ecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHH Q lcl|Aclame:pro 85 IHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKL 162 (296) Q Consensus 85 ~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~ 162 (296) +.+++|+|++|++ +||+. +.+++||++++.+|++.++++++|+|+++.|++++++..++..+ ++++.++ T Consensus 79 ---~~~~~i~~~~~~~~i~D~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~-----~d~i~dA 149 (275) T protein:vir:96 79 ---KRQATIRKIGKGTVLTDEAL-LSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITK-----LAGLQTA 149 (275) T ss_pred ---eeeEEeehhcccccccHHHH-HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC-----HHHHHHH Confidence 5779999999986 99996 89999999999999999999999999999999998776543322 3455566 Q ss_pred HHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcccceEEEE Q lcl|Aclame:pro 163 QVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAY 234 (296) Q Consensus 163 ~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay 234 (296) ..+|+|+++...+++|||++++.+++++.+ +.+....+.+. .++|++||+|+++|+|+.|+..++.+.++. T Consensus 150 ~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig-~~~G~~Vi~s~~~p~~t~~i~~~gA~~~~~ 228 (275) T protein:vir:96 150 IDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFG-EALGAIIVRSNKIKEGEAILAKRGAVKLIT 228 (275) T ss_pred HHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccc-eecCeeEEEeCCCCcceEEEEeccceeeee Confidence 688998888888999999999999988622 11222333444 499999999999999999999999888643 Q ss_pred ecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 235 INPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 235 ~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) -.+ =.+....+--...+-+.|-.|... .+. +.++||+++.+|+= T Consensus 229 ~~~--~~vE~~Rd~~~~~d~i~~~~~y~~--------------~~~--~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 229 KRD--FFLETERHASHKSTALFSDKHYVA--------------YLY--DESKVVKITKSASG 272 (275) T ss_pred cCC--cccccccchhhcCcEEEEeEEEEE--------------EEE--cCccEEEEEecccc Confidence 221 123333333345666777777522 222 56789999998876 No 7 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=7e-34 Score=202.36 Aligned_cols=261 Identities=19% Similarity=0.192 Sum_probs=185.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.. +|--.+++...=+.. =..+++.+. ..+-..--+.+.++.+||++|++|+|+++|+++ +++||+.||.++ T Consensus 1 Ma~~-~T~l~d~i~Pev~~~----~v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~-~~~eg~~i~~~~ 73 (276) T protein:vir:10 1 MAQG-TTTKSTQIVPEVLAP----MMQAELDKK-LRFAQFADIDSTLVGQPGDTLTFPAFVYSGDAT-VVPEGQKIPVDK 73 (276) T ss_pred CCcc-eeehhhhhchHHHHH----HHHHHHHhh-hhhcccceecccccCCCCCEEEeeeecCCCccc-cccCCCccCccc Confidence 5422 233334443332221 133344222 333344445678999999999999999999998 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHH Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASA 158 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~ 158 (296) ++.+ +.+++|+|++|++ |||+. +.+++||++++.+|+++++++++|+|+++.|++++++..+...+ +++ T Consensus 74 lt~~---~~~a~i~~~~k~~~~tD~a~-~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t-----~d~ 144 (276) T protein:vir:10 74 IETN---RREAKIHKIGKGTDITDEAL-LSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGT-----LAG 144 (276) T ss_pred cccc---eeeEEeehccccccccHHHH-HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC-----HHH Confidence 9975 6889999999986 99996 89999999999999999999999999999999998765542222 344 Q ss_pred HHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcccce Q lcl|Aclame:pro 159 WGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENI 230 (296) Q Consensus 159 ~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl 230 (296) +.++..+|+|++....+++|||++++.+++++.+ +.+....+.+.+ ++|++||+|+++|+|+.|+..++.+ T Consensus 145 i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~Vi~s~~~p~~t~~l~~~gAi 223 (276) T protein:vir:10 145 LEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGE-ALGAVIVRSKKLDEGEAILAKRGAV 223 (276) T ss_pred HHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccce-ecceeEEEcCCCCcceEEEEeccce Confidence 5556678998887889999999999999886522 222334555554 8999999999999999999999988 Q ss_pred EEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 231 IFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 231 ~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .+..-.+ =.+..-.+--...+-+.|-.|... .+. +.++|++++..+.. T Consensus 224 ~~~~~~~--~~vE~dRd~~~~~d~i~~~~~y~~--------------~~~--~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 224 KLITKRD--FFLETDRDPSTKTTALYSDKHYVA--------------YLY--DESKAVKVTKGAGT 271 (276) T ss_pred eeeecCC--ceeecccchhhcccEEEEeeEEEE--------------EEE--cCcceEEEecCCcC Confidence 8643221 112222222334555666666421 111 34678888865544 No 8 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=3.6e-32 Score=192.98 Aligned_cols=258 Identities=20% Similarity=0.192 Sum_probs=184.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHH---hCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEM---LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~---LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-.- +|--.+++...= |.....+.+.+.+.. .-+.+.++.+||+||++|+|.++|+++ ++.||+.|+ T Consensus 1 m~~~-~T~l~d~i~Pev--------~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~ 70 (274) T protein:vir:95 1 MAQG-MTKLTNQIVPEV--------LAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK-VVAEGEKIP 70 (274) T ss_pred CCcc-eeehhheechHH--------HHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccc-cccCCCccc Confidence 4321 222333333322 333333344444433 235788999999999999999999998 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGAL 155 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Al 155 (296) .++++.+ +.+++|+|++|++ +||+. +.+++||+++..+|+++++++++|+++++.|++++++....... T Consensus 71 ~~~lt~~---~~~~~i~~~~~a~~i~D~~~-~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~----- 141 (274) T protein:vir:95 71 TDILETK---KREAKIRKIAKGTSISDEAL-LSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITK----- 141 (274) T ss_pred hhhcccc---eeEEEeeeeecceeehHHHH-hhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC----- Confidence 9999975 6789999999986 99995 89999999999999999999999999999999998776543222 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) ++++.++..+|+|+++...+++|||+.++.+++++.+ +.+....+.+. .++|++||+|+++|+|+.|+..+ T Consensus 142 ~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig-~~~G~~Vi~s~~~~~~t~~l~~~ 220 (274) T protein:vir:95 142 LTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFG-EALGAVIVRSNKLEAGTAILAKK 220 (274) T ss_pred HHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccc-eecCeEEEEeCCCCCceEEEEec Confidence 4455666688999888888999999999999997622 22333344444 49999999999999999999999 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +.+.++.=.+ =.+....+--...+=+.|-.|... .+. +.+++|++|-...- T Consensus 221 gA~~~~~~~~--~~vE~~Rd~~~~~d~i~~~~~y~~--------------~~~--~~~~~v~~tk~~~~ 271 (274) T protein:vir:95 221 GAVKLITKRD--FFLETDRDPSTKTTALYSDKHYVA--------------YLY--DESKAVKITKGSGS 271 (274) T ss_pred cceeeeecCC--cccccccccccccCEEEEeEEEEE--------------EEE--cCCcEEEEEcCCcc Confidence 9888643221 123333343345555666666421 111 34567776644433 No 9 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=3.6e-32 Score=192.98 Aligned_cols=258 Identities=20% Similarity=0.192 Sum_probs=184.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHH---hCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEM---LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~---LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-.- +|--.+++...= |.....+.+.+.+.. .-+.+.++.+||+||++|+|.++|+++ ++.||+.|+ T Consensus 1 m~~~-~T~l~d~i~Pev--------~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~ 70 (274) T protein:vir:96 1 MAQG-MTKLTNQIVPEV--------LAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK-VVAEGEKIP 70 (274) T ss_pred CCcc-eeehhheechHH--------HHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccc-cccCCCccc Confidence 4321 222333333322 333333344444433 235788999999999999999999998 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGAL 155 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Al 155 (296) .++++.+ +.+++|+|++|++ +||+. +.+++||+++..+|+++++++++|+++++.|++++++....... T Consensus 71 ~~~lt~~---~~~~~i~~~~~a~~i~D~~~-~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~----- 141 (274) T protein:vir:96 71 TDILETK---KREAKIRKIAKGTSISDEAL-LSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITK----- 141 (274) T ss_pred hhhcccc---eeEEEeeeeecceeehHHHH-hhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC----- Confidence 9999975 6789999999986 99995 89999999999999999999999999999999998776543222 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) ++++.++..+|+|+++...+++|||+.++.+++++.+ +.+....+.+. .++|++||+|+++|+|+.|+..+ T Consensus 142 ~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig-~~~G~~Vi~s~~~~~~t~~l~~~ 220 (274) T protein:vir:96 142 LTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFG-EALGAVIVRSNKLEAGTAILAKK 220 (274) T ss_pred HHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccc-eecCeEEEEeCCCCCceEEEEec Confidence 4455666688999888888999999999999997622 22333344444 49999999999999999999999 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +.+.++.=.+ =.+....+--...+=+.|-.|... .+. +.+++|++|-...- T Consensus 221 gA~~~~~~~~--~~vE~~Rd~~~~~d~i~~~~~y~~--------------~~~--~~~~~v~~tk~~~~ 271 (274) T protein:vir:96 221 GAVKLITKRD--FFLETDRDPSTKTTALYSDKHYVA--------------YLY--DESKAVKITKGSGS 271 (274) T ss_pred cceeeeecCC--cccccccccccccCEEEEeEEEEE--------------EEE--cCCcEEEEEcCCcc Confidence 9888643221 123333343345555666666421 111 34567776644433 No 10 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.96 E-value=5e-32 Score=192.17 Aligned_cols=258 Identities=21% Similarity=0.198 Sum_probs=185.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHH---HHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLL---EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~---~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-.. .|+..|+-.+ -=|.....+.+.+.+ ...-+.+.++.+||++|++|+|.++|+++ +++||+.|| T Consensus 1 ma~~-------~T~~~d~iiP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~-~~~~g~~i~ 70 (274) T protein:vir:97 1 MPQG-------LTKTSDQIIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGEKIP 70 (274) T ss_pred CCcc-------ceehhheech--HHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccc-cccCCCccc Confidence 5432 3333333211 113333333333333 34445678899999999999999999998 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGAL 155 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Al 155 (296) .++++.+ ..+++|+|++|+. +||+. +.+++||+++..+|+++++++++|+++++.|++++++..+.... T Consensus 71 ~~~lt~~---~~~~~i~~~~~~~~i~D~~~-~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~----- 141 (274) T protein:vir:97 71 TDILETK---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK----- 141 (274) T ss_pred ccccccc---eeEEEeeeecceecccHHHH-HhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccC----- Confidence 9999975 6789999999875 99996 78899999999999999999999999999999988765443222 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) ++++.++..+|+|++....+++|||.+++.++++..+ +...+..+.+. .++|++||+|+++|+|+.|+..+ T Consensus 142 ~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig-~~~G~~Vi~s~~~p~~t~~l~~~ 220 (274) T protein:vir:97 142 LNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFG-EALGAIIVRTNKLEAGTAILAKK 220 (274) T ss_pred HHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccc-eecCeeEEEcCCCCcceEEEEeC Confidence 3455566688999888889999999999999987522 22233444444 49999999999999999999999 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +++.++.-.+- .+...-+--.-.+-+.|-.|... -+. +..|||+++...+- T Consensus 221 gA~~~~~~~~~--~vE~~Rd~~~~~d~i~~~~~y~~-------------~~~---~~~~vv~~t~~~~~ 271 (274) T protein:vir:97 221 GAVKLILKRDF--FLEVARDASTKTTALYSDKHYVA-------------YLY---DESKAVKITKGSGS 271 (274) T ss_pred cceEeeecCCc--eeccccchhhcccEEEEEEEEEE-------------EEE---cCCceEEEecCccc Confidence 99986432211 12222233334567777777522 111 34577777766665 No 11 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.96 E-value=5e-32 Score=192.17 Aligned_cols=258 Identities=21% Similarity=0.198 Sum_probs=185.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHH---HHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLL---EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~---~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-.. .|+..|+-.+ -=|.....+.+.+.+ ...-+.+.++.+||++|++|+|.++|+++ +++||+.|| T Consensus 1 ma~~-------~T~~~d~iiP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~-~~~~g~~i~ 70 (274) T protein:vir:94 1 MPQG-------LTKTSDQIIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGEKIP 70 (274) T ss_pred CCcc-------ceehhheech--HHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccc-cccCCCccc Confidence 5432 3333333211 113333333333333 34445678899999999999999999998 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGAL 155 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Al 155 (296) .++++.+ ..+++|+|++|+. +||+. +.+++||+++..+|+++++++++|+++++.|++++++..+.... T Consensus 71 ~~~lt~~---~~~~~i~~~~~~~~i~D~~~-~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~----- 141 (274) T protein:vir:94 71 TDILETK---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK----- 141 (274) T ss_pred ccccccc---eeEEEeeeecceecccHHHH-HhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccC----- Confidence 9999975 6789999999875 99996 78899999999999999999999999999999988765443222 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) ++++.++..+|+|++....+++|||.+++.++++..+ +...+..+.+. .++|++||+|+++|+|+.|+..+ T Consensus 142 ~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig-~~~G~~Vi~s~~~p~~t~~l~~~ 220 (274) T protein:vir:94 142 LNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFG-EALGAIIVRTNKLEAGTAILAKK 220 (274) T ss_pred HHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccc-eecCeeEEEcCCCCcceEEEEeC Confidence 3455566688999888889999999999999987522 22233444444 49999999999999999999999 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +++.++.-.+- .+...-+--.-.+-+.|-.|... -+. +..|||+++...+- T Consensus 221 gA~~~~~~~~~--~vE~~Rd~~~~~d~i~~~~~y~~-------------~~~---~~~~vv~~t~~~~~ 271 (274) T protein:vir:94 221 GAVKLILKRDF--FLEVARDASTKTTALYSDKHYVA-------------YLY---DESKAVKITKGSGS 271 (274) T ss_pred cceEeeecCCc--eeccccchhhcccEEEEEEEEEE-------------EEE---cCCceEEEecCccc Confidence 99986432211 12222233334567777777522 111 34577777766665 No 12 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.96 E-value=5.9e-32 Score=191.79 Aligned_cols=258 Identities=21% Similarity=0.192 Sum_probs=189.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHH---HHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLL---EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~---~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-. +.|+..|+-.+ -=|.....+.+.+.+ ...-+.+.++.+||++|++|+|+++|+++ +++||+.|| T Consensus 1 ma~-------~~T~~~~~iiP--ev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~-~~~eg~~i~ 70 (274) T protein:vir:93 1 MPQ-------GITKTSNQIIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGEKIP 70 (274) T ss_pred CCc-------cceehhheech--HHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcc-cccCCCccc Confidence 433 33334443211 123333334443333 33334577899999999999999999998 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGAL 155 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Al 155 (296) .++++.. ..+++++|++|+. +||+. +.+++||+++..+|+++++++++|+++++.|++++.+....... T Consensus 71 ~~~it~~---~~~~~i~~~~~~~~i~D~~~-~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~----- 141 (274) T protein:vir:93 71 TDILETK---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK----- 141 (274) T ss_pred ccccccc---eeEEEeeeecccccccHHHH-HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC----- Confidence 9999975 6789999999875 99996 67889999999999999999999999999999988765432222 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) ++++.++..+|+|++....+++|||.+++.++++..+ +......+.+. .++|++||+|+++|+|+.|+..+ T Consensus 142 ~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig-~~~G~~Vi~s~~~p~~t~~l~~~ 220 (274) T protein:vir:93 142 LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFG-EALGAIIVRTNKLEAGTAILAKK 220 (274) T ss_pred HHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccc-eecCeeEEEcCCCCcceEEEEeC Confidence 3445566678888877888999999999999987632 11222333333 49999999999999999999999 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +++.++.-.+ -.+....+--...+.+.|..|..- -++ +..+||+++..++- T Consensus 221 gai~~~~~~~--~~vE~~Rd~~~~~d~i~~~~~y~~-------------~~~---~~~~~v~~t~~~~s 271 (274) T protein:vir:93 221 GAVKLILKRD--FFLEVARDASTKTTALYSDKHYVA-------------YLY---DESKAVKITKGSGS 271 (274) T ss_pred CeEEEEecCC--cccccccchhhcccEEEEEEEEEE-------------EEE---cCCceEEEeeCccc Confidence 9999765432 235556666667788899888532 222 34577888877776 No 13 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.96 E-value=1.2e-31 Score=190.04 Aligned_cols=259 Identities=20% Similarity=0.250 Sum_probs=175.7 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.. +|--.|++..+=+.+. ..++|.+. ..+...--+.+.+..+||++|++|+|+++|+++ +++||++||.++ T Consensus 1 ma~~-~T~~~d~iiPev~~~~----v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~-~~~eg~~i~~~~ 73 (272) T protein:vir:36 1 MSKQ-KTTLADLVNPEVLAPI----VSYELNKA-LRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAA-DVAEGGEISLDK 73 (272) T ss_pred CCCc-ceehhhhhchHHHHHH----HHHHHHhh-hhhccccccccccccCCCCEEEEeeeccCcccc-ccCCCCccChhh Confidence 5432 2223334433322211 23344322 222333445678899999999999999999997 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHH Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASA 158 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~ 158 (296) ++.+ ..++++|+++|++ |||+. +.+++||+++..+|+++++++++|+|+++.|++++++.+.. . -+++ T Consensus 74 lt~~---~~~~~i~~~~k~~~vtD~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~~~-~-----~~d~ 143 (272) T protein:vir:36 74 IGTT---TKSVTIKKAAKGTEITDEAA-LSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVSTK-A-----NVDG 143 (272) T ss_pred cCCc---ceeEeeehhhccccccHHHH-hhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-c-----cHHH Confidence 9975 5789999999975 99996 78999999999999999999999999999999887654321 1 1334 Q ss_pred HHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccc-------eeechhhhhhhheeEEEEeccCCCceEEEEc----c Q lcl|Aclame:pro 159 WGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQ-------TAFGLTYLVDFTGTVIISTNDVTKGEIWATV----P 227 (296) Q Consensus 159 ~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q-------~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~----~ 227 (296) +.++..+|+|++....+++|||++++.+|+++.+... ....+.+. .++|++||+|+++|+|+.+.+. + T Consensus 144 i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig-~~~G~~Vv~s~~~p~~~~~~~~~~~~~ 222 (272) T protein:vir:36 144 VQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYA-DVLGAQIVRSKKLAEGSALMFKIVSNS 222 (272) T ss_pred HHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccc-eecCeeEEEeCCCCCCceeEEEEEecc Confidence 4555578998888888999999999999998865321 12333333 4899999999999999985443 5 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) +.+.++. . ..-.+...-+--...+-++|-.|.. +-++ +.++||++|++.- T Consensus 223 gA~~~~~-~-~~~~vE~~R~~~~~~d~i~~~~~y~-------------~~v~---~~~~vv~~t~~g~ 272 (272) T protein:vir:36 223 PALKLVL-K-RGVQVETDRDIVTKTTVITADEHYA-------------AYLY---DLTKVVNITFTGV 272 (272) T ss_pred cceeeee-c-CCcccccccchhhcCcEEEEEEEEE-------------EEEE---cCccEEEEeecCC Confidence 5554321 1 1111221112222344566665531 1122 2357888888665 No 14 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.96 E-value=1.4e-31 Score=189.80 Aligned_cols=258 Identities=20% Similarity=0.179 Sum_probs=182.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHH---HHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKL---LEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L---~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-.. +|--.+++...= |.....+.+.+. ....-+.+.++.+||++|++|+|+++|+++ ++.||+.|+ T Consensus 1 ma~~-~T~l~d~iiPev--------~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~ 70 (274) T protein:vir:12 1 MAQG-LTKTSNQIIPEV--------LAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGEKIP 70 (274) T ss_pred CCcc-eeehhhhhchHH--------HHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccc-cccCCCccc Confidence 4332 222223333322 222223333332 234445677899999999999999999998 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGAL 155 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Al 155 (296) .++++.. +.+++|+|++|++ +||+. +.+++||+++..+|++.++++++|+++++.+++++++...+... T Consensus 71 ~~~lt~~---~~~~~i~~~~~~~~i~D~~~-~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~----- 141 (274) T protein:vir:12 71 TDILETK---KREAKIRKIAKGTSITDEAL-LSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITK----- 141 (274) T ss_pred hhhcccc---eeeEEeeeecceeeecHHHH-HhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC----- Confidence 9999975 5789999999986 99995 89999999999999999999999999999999998776543322 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) ++++.++..+|+|+++...+++|||.+++.+++++.+ +.+....|.+. .++|++||+|+++|+++.|+..+ T Consensus 142 ~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig-~~~G~~Vi~s~~~p~~t~~l~~~ 220 (274) T protein:vir:12 142 LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFG-EALGAIIVRSNKLEAGTAILAKK 220 (274) T ss_pred HHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccce-eecCeeEEEeCCCCcceEEEEec Confidence 3455566688998887888999999999999987522 22223344444 48999999999999999999999 Q ss_pred cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 228 ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 228 ~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .++.++.-.+ =.+....+--...+=+.|-.|. |+.+. +.++||++|-..+- T Consensus 221 gA~~~~~~~~--~~vE~~Rd~~~~~d~i~~~~~y--------------~~~~~--~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 221 GAVKLILKRD--FFLEVARDASTKTTALYSDKHY--------------VAYLY--DESKAVKITKGSGS 271 (274) T ss_pred cceeeeecCC--ceeccccchhhcccEEEeeeEE--------------EEEEE--cCCceEEEEcCCcc Confidence 9988643221 1133333333344555665553 22222 45677777754444 No 15 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.96 E-value=5.4e-31 Score=186.52 Aligned_cols=259 Identities=20% Similarity=0.228 Sum_probs=183.4 Q ss_pred Cccccccccccceehhhhhh--hhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceech Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKY--PITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~--a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ipl 78 (296) |-+-. |+..||-. .-+--+.++|.+.+ .+-...-+.+.++.+||++|++|+|.++|+++ +++||+.||. T Consensus 1 ma~~~-------T~~~d~i~Pev~s~~v~~~~~~~~-~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~-~~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQGT-------TKVSNLIVPEVLAPMMQAELDKKL-RFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ-VIAEGEKIPV 71 (274) T ss_pred CCccc-------cchhhhhhhHHHHHHHHHHHHhhh-hhcccccccccccCCCCCEEEEEeeccCCCcc-ccCCCCcCch Confidence 44322 22233321 11222333443222 22233446677899999999999999999998 9999999999 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHH Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALA 156 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala 156 (296) ++++.. +.+++|+|++|++ +||+. +.+++||+++..+|++.++++++|+++++.|++++.+..+...+ + T Consensus 72 ~~it~~---~~~~~i~~~~~~~~i~D~~~-~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~~-----~ 142 (274) T protein:vir:96 72 DQIGTS---KREAKVRKIGKGTELTDEAV-LSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITK-----L 142 (274) T ss_pred hhcccc---eeEEEEEeeeceeeecHHHH-HhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCccccc-----H Confidence 999975 6789999999975 99996 78899999999999999999999999999999998765543322 3 Q ss_pred HHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc--cc------ceeechhhhhhhheeEEEEeccCCCceEEEEccc Q lcl|Aclame:pro 157 SAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI--TT------QTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPE 228 (296) Q Consensus 157 ~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i--~~------q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~ 228 (296) +++.++..+|+|++....+++|||.+++.++++..+ .. ..+..+.+ ..++|++||+|+++|+|+.|+..++ T Consensus 143 d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~i-g~~~G~~Vi~s~~~p~~t~~l~~~g 221 (274) T protein:vir:96 143 DGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAF-GEALGAVIVRSNKLNKGEALLAKKG 221 (274) T ss_pred HHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeeccc-ceecCeeEEEcCCCCcceEEEEeCc Confidence 445566678888877888999999999999987632 11 11222233 3599999999999999999999999 Q ss_pred ceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ++.++.--+ -.+....+--...+-++|-.|.... +. +.++||+.|..++= T Consensus 222 A~~~~~~~~--~~vE~~Rd~~~~~d~i~~~~~yg~~--------------~~--~~~~vv~~t~~~~~ 271 (274) T protein:vir:96 222 AVKLITKRD--FFLEKDRDASRKSTALYSDKHYVAY--------------LY--DESKVVKITKGAGD 271 (274) T ss_pred ceeeeecCC--cccccccchhhcccEEEEeeEEEEE--------------EE--cCccEEEEEcCccc Confidence 988654322 1233333434456677777773221 11 34677887776665 No 16 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.95 E-value=1.5e-29 Score=178.55 Aligned_cols=260 Identities=19% Similarity=0.265 Sum_probs=180.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.+-|+.. +++..+-+. --+.+++.+. ..+....-+.+.++.+||++|++|+|..++.++ +++||+.||.++ T Consensus 1 MA~~~T~~~-~~~iPev~s----~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~-~v~eg~~i~~~~ 73 (272) T protein:vir:30 1 MAVGTTKMA-QMLDPEVLA----DMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE-DVAEGEAIPMTQ 73 (272) T ss_pred CCCccccch-heechHHHH----HHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCcc-cccCCCcccccc Confidence 654333222 333332221 1122333222 223334445567889999999999999999997 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHH Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASA 158 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~ 158 (296) ++.. ..++++||+++++ |||++ ....+|++.+..+|++.++++++|+++++.+++++.+.+.. .+ +++ T Consensus 74 ~~~~---~~~~~~~~~~~~~~itd~~~-~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~-~t-----~d~ 143 (272) T protein:vir:30 74 LGFK---KTTMTIKKAGKGVEITDEAI-LSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEAT-AT-----VDG 143 (272) T ss_pred cccc---eEEEEeeeeeeeeeecHHHH-hhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-cC-----HHH Confidence 9974 6889999999875 99997 56677899999999999999999999999998887654321 11 233 Q ss_pred HHHHHHhhccccCcceEEEEcHHHHHHHhcCCccc--------cceeechhhhhhhheeEEEEeccCCCceEEEEcccce Q lcl|Aclame:pro 159 WGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT--------TQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENI 230 (296) Q Consensus 159 ~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~--------~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl 230 (296) +.++..+|+++.....+++|||.+++.+++...+. .+....+.+. .++|..||.|+.+|+|++|+..++++ T Consensus 144 i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig-~i~G~~Vi~s~~~p~~t~~~~~~~a~ 222 (272) T protein:vir:30 144 VSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG-EVLGVQIVRSRKCPKGTAYMVRKGAL 222 (272) T ss_pred HHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccch-hhcCeeEEEcCCCCcceEEEEcCCeE Confidence 44455678887777789999999999998765321 1222233333 58999999999999999999999988 Q ss_pred EEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 231 IFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 231 ~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .++.-.. -++....+...+.+-+.+-.|..-. .+ +.++||++|+.++= T Consensus 223 ~~~~~~~--~~ve~~r~~~~~~~~i~~~~~~~~~-------------v~---~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 223 RIMLKRN--TMVETDRDITKAINQIVANKHYGVY-------------LY---KAEKAVKITLKDAA 270 (272) T ss_pred EEEecCC--ceeeeccccccceeEEEEEEEEEEE-------------EE---cCCceEEEEecccc Confidence 7764322 1233333333355566665553211 11 46699999998877 No 17 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.95 E-value=1.5e-29 Score=178.55 Aligned_cols=260 Identities=19% Similarity=0.265 Sum_probs=180.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.+-|+.. +++..+-+. --+.+++.+. ..+....-+.+.++.+||++|++|+|..++.++ +++||+.||.++ T Consensus 1 MA~~~T~~~-~~~iPev~s----~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~-~v~eg~~i~~~~ 73 (272) T protein:vir:98 1 MAVGTTKMA-QMLDPEVLA----DMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE-DVAEGEAIPMTQ 73 (272) T ss_pred CCCccccch-heechHHHH----HHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCcc-cccCCCcccccc Confidence 654333222 333332221 1122333222 223334445567889999999999999999997 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHH Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASA 158 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~ 158 (296) ++.. ..++++||+++++ |||++ ....+|++.+..+|++.++++++|+++++.+++++.+.+.. .+ +++ T Consensus 74 ~~~~---~~~~~~~~~~~~~~itd~~~-~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~-~t-----~d~ 143 (272) T protein:vir:98 74 LGFK---KTTMTIKKAGKGVEITDEAI-LSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEAT-AT-----VDG 143 (272) T ss_pred cccc---eEEEEeeeeeeeeeecHHHH-hhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-cC-----HHH Confidence 9974 6889999999875 99997 56677899999999999999999999999998887654321 11 233 Q ss_pred HHHHHHhhccccCcceEEEEcHHHHHHHhcCCccc--------cceeechhhhhhhheeEEEEeccCCCceEEEEcccce Q lcl|Aclame:pro 159 WGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT--------TQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENI 230 (296) Q Consensus 159 ~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~--------~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl 230 (296) +.++..+|+++.....+++|||.+++.+++...+. .+....+.+. .++|..||.|+.+|+|++|+..++++ T Consensus 144 i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig-~i~G~~Vi~s~~~p~~t~~~~~~~a~ 222 (272) T protein:vir:98 144 VSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG-EVLGVQIVRSRKCPKGTAYMVRKGAL 222 (272) T ss_pred HHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccch-hhcCeeEEEcCCCCcceEEEEcCCeE Confidence 44455678887777789999999999998765321 1222233333 58999999999999999999999988 Q ss_pred EEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 231 IFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 231 ~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .++.-.. -++....+...+.+-+.+-.|..-. .+ +.++||++|+.++= T Consensus 223 ~~~~~~~--~~ve~~r~~~~~~~~i~~~~~~~~~-------------v~---~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 223 RIMLKRN--TMVETDRDITKAINQIVANKHYGVY-------------LY---KAEKAVKITLKDAA 270 (272) T ss_pred EEEecCC--ceeeeccccccceeEEEEEEEEEEE-------------EE---cCCceEEEEecccc Confidence 7764322 1233333333355566665553211 11 46699999998877 No 18 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.93 E-value=3.3e-28 Score=171.22 Aligned_cols=262 Identities=17% Similarity=0.150 Sum_probs=184.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHH---HHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKL---LEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L---~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |-.- +|.-.+++..+=+ .....+.+.+. ....-+.+.++.+||++|++|+|+.+|+++ ++.||+.|| T Consensus 1 Ma~~-~T~~~~~iiPev~--------s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~-~~~~g~~i~ 70 (278) T protein:vir:80 1 MADL-TTKLANLIDPEVM--------GPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQ-DVAEGAAID 70 (278) T ss_pred CCCc-ceehhheecHHHH--------HHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcce-eecCCCcCc Confidence 5432 2222444444322 22222233332 233346778899999999999999999997 999999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcccee--cchhhHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD--ALGAGLQG 153 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~--~t~~~lQ~ 153 (296) .++++.+ ..+++|++++|++ +|++. +...+||+++..+|++.++++++|+++++.|++++.+.+ .+.++... T Consensus 71 ~~~lt~~---~~~~~i~~~~~a~~v~D~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~ 146 (278) T protein:vir:80 71 YSALETE---SVKHGIKKAGKGVKLTDESV-LSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDK 146 (278) T ss_pred ccccccc---eeeEeeehhhccccccHHHH-hhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhh Confidence 9999975 5789999999986 99995 788999999999999999999999999999998876543 23344443 Q ss_pred HHHHHHHHHHHhhccccC-cceEEEEcHHHHHHHhcCCcc--------ccceeechhhhhhhheeEEEEeccCCCceEEE Q lcl|Aclame:pro 154 ALASAWGKLQVLFEDYGS-ERAIVFANSLDVAEYIAKAGI--------TTQTAFGLTYLVDFTGTVIISTNDVTKGEIWA 224 (296) Q Consensus 154 Ala~~~~~~~~~Feded~-~~~VlFvNP~Daa~~l~~a~i--------~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~ 224 (296) + +++++++.+++++++. ...+++|||.+.+.++++..+ +.+....+.+. .++|++|++|+++|+|+.|+ T Consensus 147 ~-~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig-~~~G~~Vi~s~~~p~~t~~l 224 (278) T protein:vir:80 147 I-ENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFG-ELLGWEIVRTKKLADGNALA 224 (278) T ss_pred H-HHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccce-eecceeEEEcCCCCcceEEE Confidence 3 4678888899988753 356899999999999887532 12222233333 48999999999999999999 Q ss_pred EcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ..++++.+..--+- .+...-+--...+-+.|-.|.. +.+. +.+++|++|..++- T Consensus 225 ~~~gAi~~~~~~~~--~vE~~Rd~~~~~d~i~~~~~yg--------------~~v~--~~~~~v~it~~a~~ 278 (278) T protein:vir:80 225 VKAGALKTFLKRNL--LAESGRDMDHKLTKFNADQHYA--------------VALV--DETKAVKVVPVAGN 278 (278) T ss_pred EeccceeeeecCCc--ccccccchhhccceeeeeeEEE--------------EEEE--cCcceEEEeeccCC Confidence 99998764322211 1222122222456677776642 2211 45789999998888 No 19 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.85 E-value=1.8e-23 Score=145.34 Aligned_cols=278 Identities=13% Similarity=0.059 Sum_probs=168.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHH------hCcccccc-cCCCCeeeeeeeeee-ecccCcccC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEM------LGVTRKIS-VSEGMTLKTYAGYDV-TLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~------LgVtr~~~-~~pG~tIt~pk~~yi-g~A~gdVaE 72 (296) |-+++ -.+++..+=+++-.+....++ ++|++- -.+...+. ..||++|++|+|+++ |+++ +|.| T Consensus 1 MA~T~---lsd~i~peVf~~yv~~~~~~~-----~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~-~v~~ 71 (324) T protein:vir:59 1 MAYTK---ISDVIVPELFNPYVINTTTQL-----SAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQ-VLND 71 (324) T ss_pred CCcee---eeceechhHHHHHHHhhhHHH-----HHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCccc-ccCC Confidence 65432 234444333332212122111 111110 00112222 358999999999999 8887 9999 Q ss_pred CceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-------- Q lcl|Aclame:pro 73 GEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG-------- 142 (296) Q Consensus 73 Ge~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~-------- 142 (296) |..|+.+|++.. +...++++.+|+. +|+|. +++++||+++..+||++++++++++++++.|+...+ T Consensus 72 ~~~i~~~~l~t~---~~~a~i~~~~k~~~~tD~a~-~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~ 147 (324) T protein:vir:59 72 TDDLVPQKINAG---QDKAVLILRGNAWSSHDLAA-TLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNK 147 (324) T ss_pred Ccccchhhcccc---eeeEEEEeecCceeehhhhh-hhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccce Confidence 999999999975 5778889999986 89994 899999999999999999999999999999974311 Q ss_pred -ceecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc-c-eeechhhhhhhheeEEEEeccCC- Q lcl|Aclame:pro 143 -TQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT-Q-TAFGLTYLVDFTGTVIISTNDVT- 218 (296) Q Consensus 143 -t~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~-q-~~fg~tyl~nfLG~~II~S~kV~- 218 (296) ..++..+.. .-+.++.++..+|+|+.+.-.+++|||.+++++++++.+.- + ..=+.++. .++|.+||++.++| T Consensus 148 ~dvsa~~~~~--~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~-~~~G~~VivdD~~p~ 224 (324) T protein:vir:59 148 LDISGTADGI--YSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFP-TYMNKRVIVDDSMPV 224 (324) T ss_pred eeeeccccce--ecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhccccccCceee-eecccEEEEeCCCCc Confidence 111111110 11344555557899998888999999999999998764321 1 11122333 48999999999987 Q ss_pred --------CceEEEEcccceEEEEecC-cchhhhhh-----hccccccccceEEE---ec---cccceeehhhhhhHHH- Q lcl|Aclame:pro 219 --------KGEIWATVPENIIFAYINP-NNSELAKE-----FNLYGDPTGYIGMN---HF---QENTTLTIQTLLVSGM- 277 (296) Q Consensus 219 --------~G~~~~t~~~Nl~~ay~~~-~~g~~~~~-----f~~~td~tGliGv~---h~---~~~~~~t~et~~~~~~- 277 (296) +.+.|..+++.+.+..-.+ ..=|..|- =-+++|.+-.+++. -. ....+-|-+-+.-.+. T Consensus 225 ~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~~~L~~~~NW 304 (324) T protein:vir:59 225 ETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTDEELANGANW 304 (324) T ss_pred cccCCCCceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEeeeEEecccccCCCCCChhhhcCCccc Confidence 3568888888887766432 11222221 22556776655542 11 1122233333333332 Q ss_pred Hh-hhhccceEE--EEEecC Q lcl|Aclame:pro 278 LM-YPERIDGIV--KVTLTP 294 (296) Q Consensus 278 ~l-fpE~~dgvv--~~tI~~ 294 (296) .+ |.=+.=.|| +..|.+ T Consensus 305 ~~v~~~k~i~i~~~~~~~~~ 324 (324) T protein:vir:59 305 QRVYDPKKIRIVQFKHRLQA 324 (324) T ss_pred ccccCccccceEEEEeeccC Confidence 11 111111222 333444 No 20 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.85 E-value=4.3e-23 Score=143.20 Aligned_cols=273 Identities=14% Similarity=0.111 Sum_probs=157.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCc-------ccccccCCCCeeeeeeeeee-ecccCcccC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGV-------TRKISVSEGMTLKTYAGYDV-TLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgV-------tr~~~~~pG~tIt~pk~~yi-g~A~gdVaE 72 (296) |-. -+|--.+++...-+++ ++.+--...++|++- |+ ...+. .||++|++|.|+++ |+++ ++.| T Consensus 1 Ma~-~~T~l~d~i~pevf~~-----yv~~~~~~~~~l~qS-G~i~~~~~i~~~~~-~~G~~i~~P~~~~l~G~~~-~~~d 71 (330) T protein:vir:10 1 MAN-ELTKILDTITPQQYNA-----YMQQYTAAKSAFVQS-GIAVSDERVSKNIT-SGGLLVNMPFWNDLTGDSE-VLGN 71 (330) T ss_pred CCC-CceEeeeeechhHHHH-----HHHHHhHHhhhhhhc-ccccccHHHHHHhh-cCCCEEEecccccCCCccc-ccCC Confidence 432 1122233444433332 222211122222221 21 22233 39999999999988 8887 9999 Q ss_pred Cc-eechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----- Q lcl|Aclame:pro 73 GE-VIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----- 144 (296) Q Consensus 73 Ge-~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----- 144 (296) |+ .|+.+|+++. +...++++++|+. ||+| +++|++||+++..+||++..+++.++++++.|+.--+.. T Consensus 72 g~~~i~~~ki~t~---~~~a~i~~~~k~~~~tD~a-~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~ 147 (330) T protein:vir:10 72 GDKALETGKITAG---ADIACVLYRGRGWAANELT-GVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEK 147 (330) T ss_pred Cccccchhhcccc---eeEEEEEeecceeeehhhh-hhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccc Confidence 97 6999999975 6788999999986 9999 599999999999999999999999999999887332110 Q ss_pred ---------e--cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc--ceeechhhhhhhheeEE Q lcl|Aclame:pro 145 ---------D--ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT--QTAFGLTYLVDFTGTVI 211 (296) Q Consensus 145 ---------~--~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~--q~~fg~tyl~nfLG~~I 211 (296) . .....+ -+.++.++..+|+|+.+.-.+++|||.+++++++++.+.. ....++++. .++|++| T Consensus 148 ~~~~~~~~~~~~~~~a~~---s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~-~~~G~~V 223 (330) T protein:vir:10 148 GALEETHVSDQSKASTGI---DAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIP-TYLGYRV 223 (330) T ss_pred hhhhhhheeccccccccc---CHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhcccccCcccc-cccceEE Confidence 0 000111 1345556668999998888999999999999998765432 222344444 4899999 Q ss_pred EEeccCC----CceEEEEcccceEEEEecCcc---hhhhhh-----hccccccccceEEEeccccceeehhhhhhHHHHh Q lcl|Aclame:pro 212 ISTNDVT----KGEIWATVPENIIFAYINPNN---SELAKE-----FNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLM 279 (296) Q Consensus 212 I~S~kV~----~G~~~~t~~~Nl~~ay~~~~~---g~~~~~-----f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~l 279 (296) |++.++| +.+.|...++.+.+..-.|.. =|..|- =.++++....+.+ .-.++..-.+..... T Consensus 224 ivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~~hp------~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 224 IIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALVMHP------YGVKWTGAEVDAGNI 297 (330) T ss_pred EEeCCCCCCCCceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEEeee------eeeeecccccccCcC Confidence 9999997 456778888888665422210 011111 1133333322221 001111000000111 Q ss_pred hhhccc----------------eEEEEEecCCC Q lcl|Aclame:pro 280 YPERID----------------GIVKVTLTPGV 296 (296) Q Consensus 280 fpE~~d----------------gvv~~tI~~~v 296 (296) +|-.-| .||...-+=.- T Consensus 298 sPt~~~L~~~~NW~~v~~~k~i~iv~~~~~~~~ 330 (330) T protein:vir:10 298 TPSNADLAKFKNWKRVYEPKNIGIIALKHKIGK 330 (330) T ss_pred CcChHHhcCCcCcccccChhhcceEEEEEecCC Confidence 121111 00000000000 No 21 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.81 E-value=1.3e-21 Score=135.13 Aligned_cols=266 Identities=14% Similarity=0.106 Sum_probs=152.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHH--h----CcccccccCCCCeeeeeeeeee-ecccCcccCC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEM--L----GVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~--L----gVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEG 73 (296) |-+++ -.+++..+=+++-.+-.+. ..++|++- + -+...+. .||++|++|+|+++ |+++ ++.|| T Consensus 1 MA~T~---lsd~i~PEvf~~yv~~~~~-----~~~~l~qSG~i~~~~~l~~~~~-~~G~~it~P~~~~l~Gd~~-~~~~~ 70 (351) T protein:vir:15 1 MAETH---LSDLIVPEVFGNYVVNQII-----KTNRFVQSGILTPDPDLGPHLL-EAGTRITVPFLNDLTGDPD-NWTDS 70 (351) T ss_pred CCcee---eeeeechhHHHHHHhhhhH-----HhhhHhhcccccccHHHHHHhh-cCCCEEEecccccCCCccc-ccCCC Confidence 66433 2445554433322121221 12233221 0 0222222 49999999999999 8998 99999 Q ss_pred ceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------- Q lcl|Aclame:pro 74 EVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------- 144 (296) Q Consensus 74 e~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------- 144 (296) ..|+.+|++.. ....++++++|+. ||+|. ++++|||+++..+||+++++++.++++++.|+...+.. T Consensus 71 ~~i~~~kitt~---~~~a~i~~~~kg~~~tD~a~-~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~ 146 (351) T protein:vir:15 71 DDIDVNNLTSG---KQQGIKFYQTKAYGYTDLGT-MISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKV 146 (351) T ss_pred cccchheeccc---ceeEEEEeeccceehhhhhH-hhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccce Confidence 99999999975 5778899999986 89995 89999999999999999999999999999997431110 Q ss_pred -ecchh--hHHHHHHHHHHHHHHhhccccCc-ceEEEEcHHHHHHHhcCCcccc-ceeec-hhhhhhhheeEEEEeccCC Q lcl|Aclame:pro 145 -DALGA--GLQGALASAWGKLQVLFEDYGSE-RAIVFANSLDVAEYIAKAGITT-QTAFG-LTYLVDFTGTVIISTNDVT 218 (296) Q Consensus 145 -~~t~~--~lQ~Ala~~~~~~~~~Feded~~-~~VlFvNP~Daa~~l~~a~i~~-q~~fg-~tyl~nfLG~~II~S~kV~ 218 (296) ..+.. .-..--+.++.++..+|.|+.+. -.+++|||..+++++++..+.- +...| ..+. .++|++||++.++| T Consensus 147 ~d~t~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~-t~~G~~VivdD~~p 225 (351) T protein:vir:15 147 YDQTKVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGATPFE-AYNGLRIVLDDDIE 225 (351) T ss_pred eccccccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhccccccCcccc-eecceEEEEcCCCc Confidence 00000 00000134455556789987444 4788889999999998764422 22222 2333 48999999999997 Q ss_pred C---------ceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEE Q lcl|Aclame:pro 219 K---------GEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVK 289 (296) Q Consensus 219 ~---------G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~ 289 (296) . .+.|...++.+.+.=-++. -....|....-|-.--...++. .+-.-|+.| +...+ T Consensus 226 ~~~~~~~~~~ytsyl~~~GAi~~~~~~~~-------ve~~rd~~~~~g~d~l~~r~~~---~~hp~G~s~-----~~~~~ 290 (351) T protein:vir:15 226 IDLTDKTKPVSTSYIFAPGAVRYSTNMRS-------TETKYDPLINGGQDVIVQKRVG---TIHVAGTSI-----KASFS 290 (351) T ss_pred cccCCCCCceeEEEEEecceeeeecCCcC-------cceeecccCCCCceEEEEeeee---eeeeeeeee-----ccccc Confidence 3 2466777777763211110 0111222221111100011110 111112211 00000 Q ss_pred -EEecCCC Q lcl|Aclame:pro 290 -VTLTPGV 296 (296) Q Consensus 290 -~tI~~~v 296 (296) .-...|- T Consensus 291 ~~~~~sPt 298 (351) T protein:vir:15 291 PSKASFPT 298 (351) T ss_pred ccCcCCcC Confidence 0011122 No 22 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.64 E-value=3e-18 Score=116.68 Aligned_cols=270 Identities=11% Similarity=0.002 Sum_probs=144.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCe----eee----eeeeeeecccCcccC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMT----LKT----YAGYDVTLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~t----It~----pk~~yig~A~gdVaE 72 (296) -++| +... |-++-.||-. .=+| ...-+.+|++-+=|-.++=.+.|++ +++ |.| -.++++ +|+| T Consensus 6 ~i~s-~~~~-~~itv~~ll~--~P~~---I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~-~~~d~e-~VaE 76 (318) T protein:vir:10 6 GIVS-VSDG-PAITVRELVG--NPLW---IPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSF-LEDDVA-DVAE 76 (318) T ss_pred ccee-eecC-CceehHHhhC--Cchh---HHHHHHHHHhccchhhhhhhcccccccceeEEEeccccc-ccCcHh-hccC Confidence 1111 1112 3333333311 1123 3344556666665666555554443 333 223 358888 9999 Q ss_pred CceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC-ccceecchh Q lcl|Aclame:pro 73 GEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG-TGTQDALGA 149 (296) Q Consensus 73 Ge~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta-t~t~~~t~~ 149 (296) |++||++..++. +.+..+.+|+++++ |+|++ +.++.++|++.-+||.++|.+++|+.++++|..+ +++..+++. T Consensus 77 ggEiP~~~~~~G--~~~ia~~~K~G~~~~vS~Em~-~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~ 153 (318) T protein:vir:10 77 FGEIPVSAGARG--LPRTAFAVKKALGVRVSKEMI-DENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTA 153 (318) T ss_pred cccccccCCCCC--chhhhhhehhccceeccHHHH-hhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcC Confidence 999999998874 34555678999985 99996 8999999999999999999999999999999544 333322111 Q ss_pred ---------hHHHHHHHHHHHHHHhh------c--cccCcceEEEEcHHHHHHHhcCCcccc------cee-echhhhhh Q lcl|Aclame:pro 150 ---------GLQGALASAWGKLQVLF------E--DYGSERAIVFANSLDVAEYIAKAGITT------QTA-FGLTYLVD 205 (296) Q Consensus 150 ---------~lQ~Ala~~~~~~~~~F------e--ded~~~~VlFvNP~Daa~~l~~a~i~~------q~~-fg~tyl~n 205 (296) +.-.|....-+...+.+ + .++...-++++||.+.+.++++..+.. +.. .+..|..+ T Consensus 154 w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~ 233 (318) T protein:vir:10 154 WDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGN 233 (318) T ss_pred CCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccccc Confidence 11111111111111211 1 123345689999999999988876522 111 12222222 Q ss_pred ----hheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhh Q lcl|Aclame:pro 206 ----FTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) Q Consensus 206 ----fLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfp 281 (296) +||.+||.|+.+|.|++|+.-.+|+= +|.|+.. |. +=.+| +|-| +.+-=--++...-+.-++. T Consensus 234 ~~g~~lGl~vi~s~~~p~~~alvlq~g~vG-~~~d~~p--l~-~t~~~-~egg--------~~~g~~~~s~~~~~~~~~~ 300 (318) T protein:vir:10 234 FPGSVMGLNVIRSRTFPIDRVLIMERGTVG-FYSDTRP--LQ-FTALY-PEGN--------GPNGGPTESYRADASHKRA 300 (318) T ss_pred ccceeeceEEeecCccCCCeeEEEecCCcc-eeecccc--ce-eeecc-cCCC--------CCCCCcchhhheehheeee Confidence 68999999999999999988865544 4444320 00 00000 0000 0000000111111111111 Q ss_pred h---ccceEEEEE-ecCC Q lcl|Aclame:pro 282 E---RIDGIVKVT-LTPG 295 (296) Q Consensus 282 E---~~dgvv~~t-I~~~ 295 (296) = .+=+|++.| |-.| T Consensus 301 ~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 301 LAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeeeCcceeEEEeeccCC Confidence 1 111333322 2222 No 23 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.64 E-value=1.6e-17 Score=112.71 Aligned_cols=245 Identities=9% Similarity=0.053 Sum_probs=142.3 Q ss_pred hhhhh--HHHHhhhHHHHHHHhCcc-------cccccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecceeEEE Q lcl|Aclame:pro 22 ITIDV--TNKFQENISKLLEMLGVT-------RKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIE 92 (296) Q Consensus 22 ~siDf--~~~f~~~i~~L~~~LgVt-------r~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~t~~~t 92 (296) +++.+ -+.|++-+.+.++---+. ......+|+||++|+|..++.++ ..++|..|+...+... ..+++ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d-~~~~~~~~~~~~~~~~---~~~~t 76 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKD-YKAAGRQTSADAISDT---GVDLL 76 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccc-cccCCCccCccccccc---eEEEE Confidence 33332 255555444322222121 22256789999999999999887 7889999999988864 68899 Q ss_pred Eeecc-cc--cCH-HHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 93 LKKYR-KA--TTG-EDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKLQVLFED 168 (296) Q Consensus 93 ikK~~-K~--vTd-EAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~~~~Fed 168 (296) |+|++ ++ ++| |.. ++.|+ +.+..+|+..++++++|+++++.+.++..........-.....+.+.++...|++ T Consensus 77 id~~~~~~~~i~d~d~~-~~~~~--~~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:79 77 IDQEKSIDFLVDDIDRV-QVAGS--LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTK 153 (273) T ss_pred EeeecccceeeccHHHH-hhccc--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhh Confidence 98853 34 365 343 34443 6788999999999999999999997765332211111111234566677777776 Q ss_pred cc--CcceEEEEcHHHHHHHhcCCc-ccc------cee-echhhhhhhheeEEEEeccCCCceE---EEEcccceEEEEe Q lcl|Aclame:pro 169 YG--SERAIVFANSLDVAEYIAKAG-ITT------QTA-FGLTYLVDFTGTVIISTNDVTKGEI---WATVPENIIFAYI 235 (296) Q Consensus 169 ed--~~~~VlFvNP~Daa~~l~~a~-i~~------q~~-fg~tyl~nfLG~~II~S~kV~~G~~---~~t~~~Nl~~ay~ 235 (296) .+ ....+++|+|...+.+|+..+ +.. +.. ..|..+ +++|++|++|+.+|.++. ++..++.+-++=- T Consensus 154 ~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig-~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~ 232 (273) T protein:vir:79 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIG-NLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred ccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEee-EEeceEEEecccccccCceEEEEEeccceeeeee Confidence 54 234689999999999988653 321 111 233333 589999999999997653 3334444322110 Q ss_pred cCc--chhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 236 NPN--NSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 236 ~~~--~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) ... .+--.+.|. +-.-|.+| ..+-.+=|| ||+..+=+.. T Consensus 233 ~~~~e~~r~~~~~~-----~~v~~~~~-------------yg~~v~~p~---~vv~~~~~g~ 273 (273) T protein:vir:79 233 IDTVEALRDQDSFS-----DRIRALHV-------------YGGKVVRPT---GVVVFNKTGS 273 (273) T ss_pred hhhhhcccCcccce-----eeeeeeee-------------eeeEEecCc---eEEEEeccCC Confidence 000 000000110 00111111 222223244 7777665555 No 24 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.52 E-value=9.7e-16 Score=102.89 Aligned_cols=247 Identities=10% Similarity=0.080 Sum_probs=143.9 Q ss_pred hhhhh--HHHHhhhHHHHHHHhCcccc-------cccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecceeEEE Q lcl|Aclame:pro 22 ITIDV--TNKFQENISKLLEMLGVTRK-------ISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIE 92 (296) Q Consensus 22 ~siDf--~~~f~~~i~~L~~~LgVtr~-------~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~t~~~t 92 (296) +++.+ .+.|++-+.+.++---+... ..+.+|+||++|+|..++.++ ...+|..|+...++.. ..+++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d-~~~~~~~~~~~~~~~~---~~~~t 76 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKD-YKAAGRQTSADAISDT---GVDLL 76 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccc-cccCCCccCccccccc---eEEEE Confidence 44443 35565555433333223222 247889999999999999886 6788999998888864 58899 Q ss_pred Eeec-ccc--cCH-HHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 93 LKKY-RKA--TTG-EDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKLQVLFED 168 (296) Q Consensus 93 ikK~-~K~--vTd-EAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~~~~Fed 168 (296) |+|. ..+ ++| |..|.+ + + +.+..+|...++++++|+++++.+.++..+...+...-...+...+.++...|++ T Consensus 77 id~~~~~~~~i~d~d~~~~~-~-~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVA-G-S-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EeeeeecceEeecHHHhhhh-c-c-HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 9875 334 354 444444 3 3 5678899999999999999999987764332211111112234556666677766 Q ss_pred cc--CcceEEEEcHHHHHHHhcCCc-cc------cceee-chhhhhhhheeEEEEeccCCCce---EEEEcccceEEEEe Q lcl|Aclame:pro 169 YG--SERAIVFANSLDVAEYIAKAG-IT------TQTAF-GLTYLVDFTGTVIISTNDVTKGE---IWATVPENIIFAYI 235 (296) Q Consensus 169 ed--~~~~VlFvNP~Daa~~l~~a~-i~------~q~~f-g~tyl~nfLG~~II~S~kV~~G~---~~~t~~~Nl~~ay~ 235 (296) .+ ....+++|+|...+.+|+..+ +. .+... .|. +.+++|++|++|+.+|.++ +++..++.+-++-- T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~-ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q 232 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGT-IGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeee-eeEEeceEEEEecccccCCccEEEEEeccceeeeee Confidence 53 235689999999999988653 32 12222 233 2358899999999999764 33444444332210 Q ss_pred cCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 236 NPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 236 ~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) ... ....-|+..+ | .+-+--..+.+-.+=| +|+++.+=+.. T Consensus 233 ~~~-------~e~~r~~~~~-~--------~~v~~~~~yg~~v~~~---~~~~~l~~~g~ 273 (273) T protein:vir:10 233 IDT-------VEALRDQDSF-S--------DRIRALHVYGGKVVRP---TGVVVFNKTGS 273 (273) T ss_pred eeh-------hhcccCCCcc-e--------eeeeeeeeeeeeEecc---ceEEEEeccCC Confidence 000 0000011110 0 0000001122233334 47777666666 No 25 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.52 E-value=9.7e-16 Score=102.89 Aligned_cols=247 Identities=10% Similarity=0.080 Sum_probs=143.9 Q ss_pred hhhhh--HHHHhhhHHHHHHHhCcccc-------cccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecceeEEE Q lcl|Aclame:pro 22 ITIDV--TNKFQENISKLLEMLGVTRK-------ISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIE 92 (296) Q Consensus 22 ~siDf--~~~f~~~i~~L~~~LgVtr~-------~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~t~~~t 92 (296) +++.+ .+.|++-+.+.++---+... ..+.+|+||++|+|..++.++ ...+|..|+...++.. ..+++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d-~~~~~~~~~~~~~~~~---~~~~t 76 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKD-YKAAGRQTSADAISDT---GVDLL 76 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccc-cccCCCccCccccccc---eEEEE Confidence 44443 35565555433333223222 247889999999999999886 6788999998888864 58899 Q ss_pred Eeec-ccc--cCH-HHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 93 LKKY-RKA--TTG-EDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKLQVLFED 168 (296) Q Consensus 93 ikK~-~K~--vTd-EAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~~~~Fed 168 (296) |+|. ..+ ++| |..|.+ + + +.+..+|...++++++|+++++.+.++..+...+...-...+...+.++...|++ T Consensus 77 id~~~~~~~~i~d~d~~~~~-~-~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~ 153 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVA-G-S-LEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTK 153 (273) T ss_pred EeeeeecceEeecHHHhhhh-c-c-HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhh Confidence 9875 334 354 444444 3 3 5678899999999999999999987764332211111112234556666677766 Q ss_pred cc--CcceEEEEcHHHHHHHhcCCc-cc------cceee-chhhhhhhheeEEEEeccCCCce---EEEEcccceEEEEe Q lcl|Aclame:pro 169 YG--SERAIVFANSLDVAEYIAKAG-IT------TQTAF-GLTYLVDFTGTVIISTNDVTKGE---IWATVPENIIFAYI 235 (296) Q Consensus 169 ed--~~~~VlFvNP~Daa~~l~~a~-i~------~q~~f-g~tyl~nfLG~~II~S~kV~~G~---~~~t~~~Nl~~ay~ 235 (296) .+ ....+++|+|...+.+|+..+ +. .+... .|. +.+++|++|++|+.+|.++ +++..++.+-++-- T Consensus 154 ~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~-ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q 232 (273) T protein:vir:10 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGT-IGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) T ss_pred cCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeee-eeEEeceEEEEecccccCCccEEEEEeccceeeeee Confidence 53 235689999999999988653 32 12222 233 2358899999999999764 33444444332210 Q ss_pred cCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 236 NPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 236 ~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) ... ....-|+..+ | .+-+--..+.+-.+=| +|+++.+=+.. T Consensus 233 ~~~-------~e~~r~~~~~-~--------~~v~~~~~yg~~v~~~---~~~~~l~~~g~ 273 (273) T protein:vir:10 233 IDT-------VEALRDQDSF-S--------DRIRALHVYGGKVVRP---TGVVVFNKTGS 273 (273) T ss_pred eeh-------hhcccCCCcc-e--------eeeeeeeeeeeeEecc---ceEEEEeccCC Confidence 000 0000011110 0 0000001122233334 47777666666 No 26 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.34 E-value=2.1e-13 Score=90.10 Aligned_cols=266 Identities=14% Similarity=0.062 Sum_probs=158.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ++-..++-..|.+.+.+-+...--.+++++-+.+.+-.-++..-+.+|+. |.++++|+++....+. -|+||+.||.++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:99 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccee-EeccCccccccc Confidence 34333433344444443333334456777777777777778888899977 5579999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc------------ceec Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------------TQDA 146 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~------------t~~~ 146 (296) ++.. ..+++.+|+++.+ |.|.++.+ .-+..+.-.++|.++|++++++.++.--.++.. +... T Consensus 96 ~~~~---~v~~~~~k~~~~~~iS~ell~ds-~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 171 (324) T protein:vir:99 96 ATWV---NATMRAFKLGVILPVTKEFLNYT-YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) T ss_pred ccee---EEEEeeEEEEEeehhhHHHHhcc-hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceecc Confidence 9875 5788999999875 99998644 346789999999999999999999843221110 0011 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEeccCC--CceEE Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTNDVT--KGEIW 223 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~kV~--~G~~~ 223 (296) .... ++.+.++.....+.+...-+.++||.+...+++-.+-..+..| ++... .++|..|+.+..++ +|.++ T Consensus 172 ~~~~-----~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~-~l~G~PVv~~~~~~~~~~~~i 245 (324) T protein:vir:99 172 GDFT-----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-TLDGLPVVNLKSSNLKRGELI 245 (324) T ss_pred ccCC-----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCc-cccceeEEeecCCCCCcceEE Confidence 1111 1222333344444443445789999998877654332223333 22222 48899988887755 56666 Q ss_pred EEcccceEEEEecCcchhh--hhh-------------hc-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceE Q lcl|Aclame:pro 224 ATVPENIIFAYINPNNSEL--AKE-------------FN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGI 287 (296) Q Consensus 224 ~t~~~Nl~~ay~~~~~g~~--~~~-------------f~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgv 287 (296) +.-..++. |.+..+=.+ ++. ++ +..|++.+-...+. .+. +.+.+++ T Consensus 246 ~gd~~~~~--~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-------------d~~---v~~~~a~ 307 (324) T protein:vir:99 246 TGDFDKLI--YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-------------ALH---IADDKAF 307 (324) T ss_pred EEecccEE--EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEE-------------ccE---Eecccce Confidence 65555543 222211001 000 00 11122222222111 111 2355678 Q ss_pred EEEEecCCC Q lcl|Aclame:pro 288 VKVTLTPGV 296 (296) Q Consensus 288 v~~tI~~~v 296 (296) +++++..++ T Consensus 308 ~~lt~a~~~ 316 (324) T protein:vir:99 308 AKLVPADKK 316 (324) T ss_pred EEEEeccCC Confidence 888887666 No 27 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.31 E-value=3.7e-13 Score=88.76 Aligned_cols=276 Identities=12% Similarity=0.056 Sum_probs=155.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ++-...+...|.+.+++-+...--++.+++-+.+.+-.-++..-+.+|+. |.++++|+++....|. -|+||+.||.++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:97 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEEecCccee-EeccCccccccc Confidence 22222232333333333333333455666666666666677778899976 5679999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc------------cceec Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT------------GTQDA 146 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat------------~t~~~ 146 (296) ++.. ..+++.||++..+ |.|.++.+. -+..+.-.++|..+|+.++++.|+.--.++. ..... T Consensus 96 ~~f~---~v~~~~~k~~~~~~is~ell~ds~-~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~ 171 (324) T protein:vir:97 96 ATWV---NATMRAFKLGVILPVTKEFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) T ss_pred ccee---EEEEeeEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceecc Confidence 8875 5888999999875 999986554 4678899999999999999999984322111 01111 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEecc--CCCceEE Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND--VTKGEIW 223 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~k--V~~G~~~ 223 (296) +..+ ++.+.++.....+.....-+.++||.+...++.-.+-.-+..| ++... .++|..|+.+.. +++|.++ T Consensus 172 ~~~~-----~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~~~~-tl~G~PV~~~~~~~~~~~~~~ 245 (324) T protein:vir:97 172 GDFT-----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-TLDGLPVVNLKSSNLKRGELI 245 (324) T ss_pred ccCC-----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCc-cccceeeEeecCCCCCcceEE Confidence 1111 1223333344444444455789999999877654332223333 23322 378999888776 4466677 Q ss_pred EEcccceEEEEecCcch--hhhhh--hccccccccc--eEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 224 ATVPENIIFAYINPNNS--ELAKE--FNLYGDPTGY--IGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 224 ~t~~~Nl~~ay~~~~~g--~~~~~--f~~~td~tGl--iGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +--..++. |.+..+= ++.+. +....|+.|- --+.|+...-++.. . +.+. +-+.+++++.++..|+ T Consensus 246 ~gd~~~~~--i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~--r-~d~~---v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 246 TGDFDKLI--YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM--H-VALH---IADDKAFAKLVPADKK 316 (324) T ss_pred EEecccEE--EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEE--E-eccE---EecccceEEEEeccCC Confidence 65554433 2222110 01110 0111111111 11111111111100 0 0111 2345678888887776 No 28 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.29 E-value=3.6e-13 Score=88.81 Aligned_cols=282 Identities=9% Similarity=0.031 Sum_probs=143.5 Q ss_pred CccccccccccceehhhhhhhhhhhhH-HHHhhhHHHHHHH----hC--cccccccCCCCeeeeeeeeeeecccCcccCC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVT-NKFQENISKLLEM----LG--VTRKISVSEGMTLKTYAGYDVTLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~-~~f~~~i~~L~~~----Lg--Vtr~~~~~pG~tIt~pk~~yig~A~gdVaEG 73 (296) |-+=.-. |--.+..+.....--|+ +.|+.-+.+.|+- +. ..+..+.++|+||++|++... .+. ++.+| T Consensus 1 ~~~~~~~---~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~-~a~-d~~~g 75 (381) T protein:vir:80 1 MATIQGT---GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRA-AVY-DKQPQ 75 (381) T ss_pred Cceeccc---ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcc-eee-eecCC Confidence 3322110 11222222221122333 4444444433322 22 223457789999999999865 454 78999 Q ss_pred ceechhheeeeecceeEEEEeeccc---ccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------ Q lcl|Aclame:pro 74 EVIPLSKVERKIHSEKKIELKKYRK---ATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------ 144 (296) Q Consensus 74 e~Iplskv~~~~~~t~~~tikK~~K---~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------ 144 (296) ..|+...++.. ..+++|.|++. .++|++- .--..|+..+..+|+..+|++++|++++..+....... T Consensus 76 ~~i~~~~~~~~---~~~itID~~~~~~~~Idd~D~-~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t 151 (381) T protein:vir:80 76 TPVNLQARTDS---EFTFTVTKYKESSFMIEDIVN-TQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYS 151 (381) T ss_pred CcccccccCCc---eEEEEEeeeeecceeechHHH-HhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 99999998864 57788866543 4566553 23333999999999999999999999998775321100 Q ss_pred ----------e--cchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCcccc------ceeechhhhh Q lcl|Aclame:pro 145 ----------D--ALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITT------QTAFGLTYLV 204 (296) Q Consensus 145 ----------~--~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~------q~~fg~tyl~ 204 (296) . .++.+ +....+.+-++...|++.+ ....+++|+|...+.+|++.++.. +....+. +. T Consensus 152 ~~~~i~~~~~~~~~t~~~-~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~-Ig 229 (381) T protein:vir:80 152 YDTTLGDGTVNAHLTGTP-APLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGV-VG 229 (381) T ss_pred ccccccccccccccccch-hhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhcee-ee Confidence 0 11111 2233455666777776653 234599999999999998765432 1122222 33 Q ss_pred hhheeEEEEeccCCCceEEEEcccceEEEEecCc--chhhhhhhccccccccceEEEecccccee--------------- Q lcl|Aclame:pro 205 DFTGTVIISTNDVTKGEIWATVPENIIFAYINPN--NSELAKEFNLYGDPTGYIGMNHFQENTTL--------------- 267 (296) Q Consensus 205 nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~--~g~~~~~f~~~td~tGliGv~h~~~~~~~--------------- 267 (296) +++|++|++|+.+|.+.....+..+---+.+-|- +......| +.+ ...++..|..+.... T Consensus 230 ~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~--s~~-a~av~~~k~yd~~~~~~~~~~~~~~g~~~~ 306 (381) T protein:vir:80 230 TILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQ--AGT-ANVVNTGSASDLAVSLSYFGLPVFSGAGAT 306 (381) T ss_pred EEcceEEEeecccccccccceeeecccccccccccccccccccc--ccc-eeeeeeeeeeceeeeeeeccceeeecceee Confidence 6899999999999986554333222211111110 00001101 100 122222222222111 Q ss_pred ------ehhhh-----hhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 268 ------TIQTL-----LVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 268 ------t~et~-----~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) |+-++ -..|+.--|...-..+.+-.+..+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (381) T protein:vir:80 307 AADGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSES 346 (381) T ss_pred ecCCCceeeeehhhhhhhhhcccccccccccceeEeeccc Confidence 11111 001222122221111111111111 No 29 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.28 E-value=4.2e-13 Score=88.43 Aligned_cols=264 Identities=11% Similarity=0.007 Sum_probs=149.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeec-ccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTL-AEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~-A~gdVaEGe~Ipls 79 (296) +.+....+...++++++.+...--++...+-+.+..-.-++...+..|+.. .++++|+++-.+. +-..|+||+.+|.+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~~~~~~ 176 (379) T protein:vir:10 98 GKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISG-GTYTFVRENGAGEGAIGAQVEGATKGQK 176 (379) T ss_pred hhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccC-CceEEEEeecCCCcccccccCCcccccc Confidence 333333333344555555443334444444444444444555566677764 4589998764432 33468999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc-cce-e-cchhhHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT-GTQ-D-ALGAGLQGA 154 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat-~t~-~-~t~~~lQ~A 154 (296) +.+.. ..++..||++..+ |.|.++.+. .-...-.++|+..++++++..|+.-+.+.. ... . .....+. . T Consensus 177 ~~~f~---~i~~~~~k~~~~~~iS~ell~D~~--~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~d-~ 250 (379) T protein:vir:10 177 DYDIS---MIDVNTDFIAGFTRYSKKMANNLP--FLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIITNKNKVE-M 250 (379) T ss_pred cccee---eeEeeeeeEEeeehhhHHHHhhHH--HHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccCcccHH-H Confidence 98865 5889999999865 999997654 467778899999999999999887664331 111 1 1111221 2 Q ss_pred HHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-------chhhhhhhheeEEEEeccCCCceEEEEcc Q lcl|Aclame:pro 155 LASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-------GLTYLVDFTGTVIISTNDVTKGEIWATVP 227 (296) Q Consensus 155 la~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-------g~tyl~nfLG~~II~S~kV~~G~~~~t~~ 227 (296) +.+++.+ .+.......+.++||.+.+.+++-.+-.-+..+ ++..- .++|..|+.|+.+|+|++|+--- T Consensus 251 i~~~~~~----~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~-~l~G~pvv~s~~~~ag~~~~gdf 325 (379) T protein:vir:10 251 LINEIAK----QENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVL-RINGIPLFRATWLAANKYYVGDW 325 (379) T ss_pred HHHHHHh----hhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcc-eecceeeEecCCCCCCceEEeec Confidence 2222222 233233445788999998875532211111111 11111 37799999999999999875333 Q ss_pred cceEEEEecCc-chh--hhh--hhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 228 ENIIFAYINPN-NSE--LAK--EFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 228 ~Nl~~ay~~~~-~g~--~~~--~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) .. +++-.. +.. ++. ...|.+|.+.+....|. .+..+-| +++|++++++= T Consensus 326 ~~---~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~-------------~~~v~~p---~a~v~~~~~~~ 379 (379) T protein:vir:10 326 TR---VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQV-------------ALAVEQP---AALIFGDFTAV 379 (379) T ss_pred cc---EEEEEEeceEEEEeecccccccCCcEEEEEEEEe-------------ccEEecC---ccEEEEEecCC Confidence 22 222111 111 111 11234455555554443 2334444 67899999876 No 30 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.27 E-value=2.7e-13 Score=89.53 Aligned_cols=274 Identities=16% Similarity=0.121 Sum_probs=160.1 Q ss_pred ccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhhe Q lcl|Aclame:pro 2 VTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKV 81 (296) Q Consensus 2 ~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv 81 (296) .|-..+.-.|.+.+.+-+...--.+.++|-+.+.+---+++.-+.+|+..+..+++|+......+. -|+||+.||.++. T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~Eg~~~~~~~~ 79 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAY-WVNETEKIKTDKP 79 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeE-EeecCcccccccc Confidence 344444444555566666555566777777777776667777788899888888888876666664 8999999999998 Q ss_pred eeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc--------C-c--cceecch Q lcl|Aclame:pro 82 ERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT--------G-T--GTQDALG 148 (296) Q Consensus 82 ~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt--------a-t--~t~~~t~ 148 (296) +.. ..+++.+|++..+ |.|.++.+. -+....-.++|+++|+++++..++.-..+ . . .+..+.. T Consensus 80 ~f~---~v~l~~~k~~~~~~is~ell~ds~-~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~ 155 (297) T protein:vir:95 80 EVV---PVTLKAHKLGIILVTSREALNYTW-KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGP 155 (297) T ss_pred cee---EEEEeeEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccc Confidence 875 5788889999875 999986444 45678888999999999999999842110 0 0 0111111 Q ss_pred hhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEec--cCCCceEEEEc Q lcl|Aclame:pro 149 AGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTN--DVTKGEIWATV 226 (296) Q Consensus 149 ~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~--kV~~G~~~~t~ 226 (296) .++. .+.++.....+.+....+.++||.+.+.+++-.+-.-+..|.+... .++|..++.+. .+++|++++.- T Consensus 156 ~t~~-----~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~~~-~l~G~Pv~~~~~~~~~~~~~~~gd 229 (297) T protein:vir:95 156 INYD-----NILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKAAN-TIDGITTVDLKSARFEKGDLLAGD 229 (297) T ss_pred cCHH-----HHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCCCC-cccceeeEeecCCCCCCceEEEEe Confidence 1222 2223333444444445678999999887764222222334544332 47798877654 56788888766 Q ss_pred ccceEEEEecCcchhhhhhhccccccccc------eEEEecc-ccceeehhh-hhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 227 PENIIFAYINPNNSELAKEFNLYGDPTGY------IGMNHFQ-ENTTLTIQT-LLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 227 ~~Nl~~ay~~~~~g~~~~~f~~~td~tGl------iGv~h~~-~~~~~t~et-~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ..++. |.+. +++.= .+. ++.++ -|-.|+. ..+..-+-. .-+.+. +-+.+++++++...|| T Consensus 230 ~s~~~--~~~~--~~~~i--~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~---v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 230 FDNLI--YGVP--YNITY--KIS-EEGQISTITNADGTPINLFEQEMIAIRATMDIAVM---ITKTDAFAKLTPAERV 297 (297) T ss_pred cccEE--EEEe--cCeEE--EEe-eccccccccccCccchhhhhcCcEEEEEEEEeccE---eecccceEEEeecCCC Confidence 55543 2221 11110 000 11100 0000100 000000000 011222 2345788999999999 No 31 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.27 E-value=8.4e-13 Score=86.78 Aligned_cols=278 Identities=12% Similarity=0.061 Sum_probs=148.5 Q ss_pred CccccccccccceehhhhhhhhhhhhH-HHHhhhHHHHHH----HhCcccccc--cCCCCeeeeeeeeeeecccCcccCC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVT-NKFQENISKLLE----MLGVTRKIS--VSEGMTLKTYAGYDVTLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~-~~f~~~i~~L~~----~LgVtr~~~--~~pG~tIt~pk~~yig~A~gdVaEG 73 (296) |--+.|.--.+++++ -.--|. +.|+..+.+.|+ .++..|..+ ..+|+||++|++.-. .+. +..+| T Consensus 1 ~~~~~~~~~~~~~t~------~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~-~~~-d~~~~ 72 (341) T protein:vir:94 1 MALGNTITGPSINTQ------RGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISEL-GVE-DKATD 72 (341) T ss_pred Ccchhhhccccccch------hHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcc-eee-eecCC Confidence 433333222222111 001111 222333333332 133334433 467999999998644 354 78999 Q ss_pred ceechhheeeeecceeEEEEeecc-cc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------ Q lcl|Aclame:pro 74 EVIPLSKVERKIHSEKKIELKKYR-KA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------ 144 (296) Q Consensus 74 e~Iplskv~~~~~~t~~~tikK~~-K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------ 144 (296) ..|+...+... ..+++|.|++ .+ ++|++- ..-..|+..+.-+|...+|++++|++++..+..++... T Consensus 73 ~~i~~~~~~~~---~~~itiD~~~~~~~~i~d~d~-~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~ 148 (341) T protein:vir:94 73 VPVGVQPVNDT---DFVITVDTDRTTAVALDDLLE-IQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFS 148 (341) T ss_pred CccccccccCc---eEEEEEeeeeecceeechHHH-HhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcccc Confidence 99999998864 5789997754 33 477664 34455899999999999999999999998886554211 Q ss_pred ----ecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccce------eechhhhhhhheeEEE Q lcl|Aclame:pro 145 ----DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQT------AFGLTYLVDFTGTVII 212 (296) Q Consensus 145 ----~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~------~fg~tyl~nfLG~~II 212 (296) +.+++. +....+.+..+...|++.+ ....+++|+|...+.+|++.++.... ...|. +.+++|++|+ T Consensus 149 ~~~~~~t~~~-~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~-ig~i~G~~V~ 226 (341) T protein:vir:94 149 SSNGAITGNG-QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQ-IGSLMGVRVI 226 (341) T ss_pred CccccccCch-hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheee-eeeEeceEEE Confidence 011111 1112234445556666542 23568899999999999876553321 12222 2358899999 Q ss_pred EeccCCCceEEEEcccceEEEEecCcchhhhhhh------ccccccccceE---EEeccccce----------------- Q lcl|Aclame:pro 213 STNDVTKGEIWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIG---MNHFQENTT----------------- 266 (296) Q Consensus 213 ~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f------~~~td~tGliG---v~h~~~~~~----------------- 266 (296) +|+.+|.+.....+..--..+..... ..+...- .++..--||+| ..|.....+ T Consensus 227 ~Sn~lp~~~~~~~~~~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~ 305 (341) T protein:vir:94 227 RTSLIGNNSATGWRNGAPTIAPAEAT-PGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQS 305 (341) T ss_pred Eeccccccccccccccccceeccccc-ccccccccccccccccccEEEEEEecccccceeeecchhhhcccccccccccc Confidence 99999988776544433333322221 1111111 12223334432 111111001 Q ss_pred --eehhh------hhhHHHHhhhhccceEEEEEe-cCCC Q lcl|Aclame:pro 267 --LTIQT------LLVSGMLMYPERIDGIVKVTL-TPGV 296 (296) Q Consensus 267 --~t~et------~~~~~~~lfpE~~dgvv~~tI-~~~v 296 (296) .-++. ++|.+-.|=||- +|..-- .+.| T Consensus 306 ~~~~~~~~~i~~~~~~G~~~lrp~~---~v~~~~~~~~~ 341 (341) T protein:vir:94 306 FENREQVWLMVGRQAYGARLYRPLH---AVNIHTTGDTV 341 (341) T ss_pred chhhhhhhhhhhhhhhcccccCcce---eEEEecCcCCC Confidence 11122 233444455554 443332 3334 No 32 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.26 E-value=2.7e-13 Score=89.48 Aligned_cols=266 Identities=13% Similarity=0.063 Sum_probs=159.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |+...++-..|.+.+.+-+...--++++++-+.+.+-.-++..-+.+|+. |.++++|++.....+. -|+||+++|-++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:10 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCccee-EeccCccccccc Confidence 55555555555555554443334566777777777777777888889977 4569999998777785 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc------------ceec Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------------TQDA 146 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~------------t~~~ 146 (296) .+.. ..+++.||++..+ |.|.++.+. -+..+.-.++|.++|++++++.++.--.++.. +... T Consensus 96 ~~~~---~v~~~~~k~~~~~~iS~ell~ds~-~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~ 171 (324) T protein:vir:10 96 ATWV---NATMRAFKLGVILPVTKEFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) T ss_pred ccee---EEEEeeEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceecc Confidence 8875 4778899999875 999986443 46788999999999999999999843221110 0000 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEecc--CCCceEE Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND--VTKGEIW 223 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~k--V~~G~~~ 223 (296) ...+ ++.+.++.....+.+...-++++||.+...+++-.+-.-+..| ++... .++|..|+.+.. +++|.++ T Consensus 172 ~~~t-----~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~-~l~G~PV~~~~~~~~~~~~~~ 245 (324) T protein:vir:10 172 GDFT-----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-TLDGLPVVNLKSSNLKRGELI 245 (324) T ss_pred ccCC-----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCCCCc-cccceeEEeecCCCCCcceEE Confidence 1111 1222333344444333445789999998877643332223333 33322 388998888765 4466677 Q ss_pred EEcccceEEEEecCcchhh--hhh--hccc------------cccccceEEEeccccceeehhhhhhHHHHhhhhccceE Q lcl|Aclame:pro 224 ATVPENIIFAYINPNNSEL--AKE--FNLY------------GDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGI 287 (296) Q Consensus 224 ~t~~~Nl~~ay~~~~~g~~--~~~--f~~~------------td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgv 287 (296) +.-..++.+ .+..+-.+ .+. +... .|++.+-+..+. ++. +-+.+++ T Consensus 246 ~gd~~~~~~--~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~--------------d~~--v~~~~A~ 307 (324) T protein:vir:10 246 TGDFDKLIY--GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV--------------ALH--IADDKAF 307 (324) T ss_pred EEecccEEE--EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEE--------------ccE--Eecccce Confidence 766665433 22111001 000 0011 122222211111 111 2345678 Q ss_pred EEEEecCCC Q lcl|Aclame:pro 288 VKVTLTPGV 296 (296) Q Consensus 288 v~~tI~~~v 296 (296) ++.+...++ T Consensus 308 ~~l~~a~~~ 316 (324) T protein:vir:10 308 AKLVPADKK 316 (324) T ss_pred EEEEeccCC Confidence 888876665 No 33 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.24 E-value=5e-13 Score=88.02 Aligned_cols=276 Identities=10% Similarity=0.068 Sum_probs=161.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |..+.++...+++..... -++.++..+. .-++.+-+.+||. +.++++|+++....+. -|+||++||-++ T Consensus 10 ~~~~~t~~~~g~l~~~~~-----~~ii~~l~~~----s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~-wv~Eg~~~~~s~ 78 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQA-----KDYFAEAEKT----SIVQRVAQKIPMG-ATGIVIPHWTGDVSAQ-WIGEGDMKPITK 78 (397) T ss_pred HhhccCCCCccccchhHH-----HHHHHHHHhc----cchhhhcceeecc-CCceEEEEEcCCcceE-EecCCccccccc Confidence 555555555555544322 2333332222 2234455778875 5669999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc----------ceecch Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG----------TQDALG 148 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~----------t~~~t~ 148 (296) .+.. ..+++.||++..+ |.|.++.+ .-+...+-.++|.++|++++++.|+.-..++++ +..... T Consensus 79 ~~f~---~v~l~~~k~~~~v~iS~ell~ds-~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~ 154 (397) T protein:vir:23 79 GNMT---KRDVHPAKIATIFVASAETVRAN-PANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISP 154 (397) T ss_pred ccee---EEEEeeEEEEEeehhhHHHHhcc-hHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecc Confidence 8875 5889999999975 99998544 456789999999999999999999853222111 111111 Q ss_pred hhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccccee----------echhhhhhhheeEEEEeccCC Q lcl|Aclame:pro 149 AGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTA----------FGLTYLVDFTGTVIISTNDVT 218 (296) Q Consensus 149 ~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~----------fg~tyl~nfLG~~II~S~kV~ 218 (296) ......+. ++...+........+.++||.+...+++-.+-.-+.. +++.- ..++|..++.++.+| T Consensus 155 ~~~~~~~~----~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~-~tl~G~Pv~~s~~~~ 229 (397) T protein:vir:23 155 NAYQGLGV----SGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFRE-GRILGRPTILSDHVA 229 (397) T ss_pred cchhHHHH----HHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccC-ceeeeeeEEEeCCCC Confidence 11111111 2222233333345689999999988775322111111 11111 237899999999999 Q ss_pred CceEEEEcccceEEEEecCcchhhh---------------hhhc-cccccccce-------EEEeccccceeehhhhhhH Q lcl|Aclame:pro 219 KGEIWATVPENIIFAYINPNNSELA---------------KEFN-LYGDPTGYI-------GMNHFQENTTLTIQTLLVS 275 (296) Q Consensus 219 ~G~~~~t~~~Nl~~ay~~~~~g~~~---------------~~f~-~~td~tGli-------Gv~h~~~~~~~t~et~~~~ 275 (296) +|++.+...|==+++|.+..+=.+. +..+ +..|++.|- .+.|...-..++..+.... T Consensus 230 ~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~ 309 (397) T protein:vir:23 230 EGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTT 309 (397) T ss_pred CCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccce Confidence 9998654333212223322110110 1111 222445442 2344444455566666666 Q ss_pred HHHhhhhccceEEEEEe----cCCC Q lcl|Aclame:pro 276 GMLMYPERIDGIVKVTL----TPGV 296 (296) Q Consensus 276 ~~~lfpE~~dgvv~~tI----~~~v 296 (296) .....|--..|=++.++ +.+. T Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (397) T protein:vir:23 310 YALDLDGASAGNFTLSLDGKTSANI 334 (397) T ss_pred eeecccccCcceEEEEecCccccCc Confidence 66677777788888887 3333 No 34 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.23 E-value=8e-13 Score=86.90 Aligned_cols=273 Identities=10% Similarity=-0.017 Sum_probs=158.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceec-h Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ip-l 78 (296) .+-++.....+.+.+.+-+...-.++.+++-+.+..-.-++..-+.+||+.|. ++.+|++.-...+. .|+||+.+| . T Consensus 112 ~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~Eg~~~~~~ 190 (415) T protein:vir:94 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE-KVEELEENPEL 190 (415) T ss_pred HhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccce-ecccccccccc Confidence 11122222233333333444444566777766666666677777788887664 77888887666675 999999999 5 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----------ec Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----------DA 146 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----------~~ 146 (296) +..+.+ ..++..||++.-+ |.|.++.+.+ +-.+.-.++|+.+++.+++..|+.-+.+++... .. T Consensus 191 ~~~~~~---~i~~~~~k~~~~~~is~ell~ds~~-~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:94 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccce---eeEeeheeeeeechhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc Confidence 554443 5778889999875 9999876654 456788999999999999999998775542210 00 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhh----hhhhheeEEEEeccCCCceE Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTY----LVDFTGTVIISTNDVTKGEI 222 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~ty----l~nfLG~~II~S~kV~~G~~ 222 (296) .+.+ ...+..+.++...+.+......+.++||.+.+.+++-.+-.-+..|.-.+ -..++|..|+.+..+|.|.. T Consensus 267 ~~~~--~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~ 344 (415) T protein:vir:94 267 EVKK--AKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK 344 (415) T ss_pred cccc--ccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCC Confidence 0000 01122333444455554444568899999988776532222222221111 11378999999998886652 Q ss_pred EE--EcccceEEEEecCcchhhhhh-hccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 223 WA--TVPENIIFAYINPNNSELAKE-FNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 223 ~~--t~~~Nl~~ay~~~~~g~~~~~-f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) -- ....|+.-+|+-...+++.=. .++.++++++.+..+ +.+. +-+.++++++++++++ T Consensus 345 ~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r-------------~d~~---~~~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 345 GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR-------------QDCR---ILDYKSAIVIEYDDSE 405 (415) T ss_pred CccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEE-------------eccE---EeccccEEEEEEeccC Confidence 10 011122222221111222211 223445565554332 1122 2356899999999999 No 35 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.22 E-value=1.7e-12 Score=85.12 Aligned_cols=255 Identities=11% Similarity=0.012 Sum_probs=144.7 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipls 79 (296) .++ .+++..+.. .--++.+.|-+.+..-.-+++.-+..|+. |.++++|.+.-. ..+ .-|+||+++|-+ T Consensus 135 ~~~-~~~~~~g~l--------vp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a-~~v~E~~~~~~~ 203 (418) T protein:vir:10 135 TVG-SGVSGSNSL--------VVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNA-AAVAEGAQKPTS 203 (418) T ss_pred hcc-CCCCCCccc--------cchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCce-eeeccCcccccc Confidence 111 112222221 12234444544555555566666778876 567899997664 344 489999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------c--ceec Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------G--TQDA 146 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------~--t~~~ 146 (296) +.+.. ..+++.+|++..+ |.|.++.++ +-.+.-.++|..+|+.++++.|+.--.++. . +.+. T Consensus 204 ~~~f~---~v~~~~~k~~~~~~is~ell~ds~--~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~ 278 (418) T protein:vir:10 204 DLKFN---LKNQPVRTIAHLFKASRQILDDAP--ALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSI 278 (418) T ss_pred cccee---eEEEeeeeEEEeehhhHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc Confidence 98764 5788889988865 999987664 677888889999999999999985321110 0 0001 Q ss_pred ---chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCC Q lcl|Aclame:pro 147 ---LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTK 219 (296) Q Consensus 147 ---t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~ 219 (296) +...++. +..++ ...+..+....++++||.+...+++-.+-.-+..| +++-. .++|..|+.|+.+|. T Consensus 279 ~~~~~~~~~~-i~~~~----~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~-~l~G~pV~~~~~~p~ 352 (418) T protein:vir:10 279 TLANATPIDK-IRLAL----LQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTP-RLWNLPVVETQAMTA 352 (418) T ss_pred cccccccHHH-HHHHH----HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCc-eecceeeEEcCCCCC Confidence 1111221 11112 22233233455788999998866532211111122 22222 378999999999999 Q ss_pred ceEEEEcccceEEEEecCcchhhhhhh------ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 220 GEIWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 220 G~~~~t~~~Nl~~ay~~~~~g~~~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) |++++--..+..+-+.. +++.=.+ ++..|.+.|.+..+ +.+.... .++++.++++ T Consensus 353 ~~~~~gd~s~~~~~~~~---~~~~i~~~~~~~~~f~~~~~~~r~~~~-------------~d~~~~~---~~a~~~~~~~ 413 (418) T protein:vir:10 353 NEFLVGAFSMAAQIFDR---MEIEVLLSTENVDDFEKNMVSIRAEER-------------LALAVYR---PESFVTGALV 413 (418) T ss_pred CcEEEeeccceEEEEEe---cceEEEEecccchhhhcCceEEEEEEe-------------eccEEec---ccceEEEEec Confidence 99876544432111111 1121111 12233333333222 1222333 4789999999 Q ss_pred CCC Q lcl|Aclame:pro 294 PGV 296 (296) Q Consensus 294 ~~v 296 (296) +|+ T Consensus 414 ~~~ 416 (418) T protein:vir:10 414 EQA 416 (418) T ss_pred cCC Confidence 999 No 36 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.19 E-value=3.2e-12 Score=83.63 Aligned_cols=277 Identities=12% Similarity=0.045 Sum_probs=154.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.++-++ .|.+++++-+...--.+.++|-+.+.+-.-++..-+.+|+. +..+++|+|.-...+. -|+||+++|-++ T Consensus 1 ma~~~~~~-~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~-~v~E~~~~~~~~ 77 (304) T protein:vir:94 1 MATPTYTP-GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAY-WVSETERIQTSK 77 (304) T ss_pred Cccccccc-ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceE-EeecCccccccc Confidence 65555433 34555555444333445666666666555566677888876 4568999997666664 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc-------------Ccccee Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT-------------GTGTQD 145 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt-------------at~t~~ 145 (296) .+.+ ..+++.+|++..+ |.|.++.+. -+-.+.-.++|.++++++++..|+.--.+ ...+.. T Consensus 78 ~~~~---~i~~~~~k~~~~~~iS~ell~ds~-~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 153 (304) T protein:vir:94 78 PEYA---QAEMEAKKIGVIIPLSKEFLKWTA-KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG 153 (304) T ss_pred ceee---EEEEEEEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc Confidence 8875 5788999999875 999986454 45668889999999999999999842111 000000 Q ss_pred cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCC----Cce Q lcl|Aclame:pro 146 ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVT----KGE 221 (296) Q Consensus 146 ~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~----~G~ 221 (296) .+..+ -...++.+.++...++.......+.++||.+.+++++-.+-.-+..|...-. .++|..|+.++.+| +|. T Consensus 154 ~~~~~-~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~-~l~G~PV~~~~~~~~~~~~~~ 231 (304) T protein:vir:94 154 NVVTD-TNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGN-EIMGLPLSYTGADVYDKKKSL 231 (304) T ss_pred ccccc-ccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCc-cccceeeEEecccccCCCCcE Confidence 00000 0112333444445555544455678999999998775333222344543222 37899999999986 445 Q ss_pred EEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccce-----eehhhhhh---HHHHhhhhccceEEEEEec Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTT-----LTIQTLLV---SGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~-----~t~et~~~---~~~~lfpE~~dgvv~~tI~ 293 (296) +++.-..+..+.. . +++ .+-.+.-+-|++.++.+..- ....-+.+ .-+=+=+.+.+++++++.. T Consensus 232 ~~~gd~~~~~~~~--~--~~~----~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a 303 (304) T protein:vir:94 232 ALMGDWDYARYGI--L--QGI----EYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPT 303 (304) T ss_pred EEEEehhhEEEEE--e--cce----EEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 5554444432211 1 111 01011111122222111110 00000000 0011123445667777776 Q ss_pred C Q lcl|Aclame:pro 294 P 294 (296) Q Consensus 294 ~ 294 (296) . T Consensus 304 ~ 304 (304) T protein:vir:94 304 E 304 (304) T ss_pred C Confidence 6 No 37 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.19 E-value=3.2e-12 Score=83.63 Aligned_cols=277 Identities=12% Similarity=0.045 Sum_probs=154.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.++-++ .|.+++++-+...--.+.++|-+.+.+-.-++..-+.+|+. +..+++|+|.-...+. -|+||+++|-++ T Consensus 1 ma~~~~~~-~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~-~v~E~~~~~~~~ 77 (304) T protein:vir:10 1 MATPTYTP-GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAY-WVSETERIQTSK 77 (304) T ss_pred Cccccccc-ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceE-EeecCccccccc Confidence 65555433 34555555444333445666666666555566677888876 4568999997666664 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc-------------Ccccee Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT-------------GTGTQD 145 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt-------------at~t~~ 145 (296) .+.+ ..+++.+|++..+ |.|.++.+. -+-.+.-.++|.++++++++..|+.--.+ ...+.. T Consensus 78 ~~~~---~i~~~~~k~~~~~~iS~ell~ds~-~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 153 (304) T protein:vir:10 78 PEYA---QAEMEAKKIGVIIPLSKEFLKWTA-KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG 153 (304) T ss_pred ceee---EEEEEEEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc Confidence 8875 5788999999875 999986454 45668889999999999999999842111 000000 Q ss_pred cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCC----Cce Q lcl|Aclame:pro 146 ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVT----KGE 221 (296) Q Consensus 146 ~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~----~G~ 221 (296) .+..+ -...++.+.++...++.......+.++||.+.+++++-.+-.-+..|...-. .++|..|+.++.+| +|. T Consensus 154 ~~~~~-~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~-~l~G~PV~~~~~~~~~~~~~~ 231 (304) T protein:vir:10 154 NVVTD-TNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGN-EIMGLPLSYTGADVYDKKKSL 231 (304) T ss_pred ccccc-ccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCc-cccceeeEEecccccCCCCcE Confidence 00000 0112333444445555544455678999999998775333222344543222 37899999999986 445 Q ss_pred EEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccce-----eehhhhhh---HHHHhhhhccceEEEEEec Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTT-----LTIQTLLV---SGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~-----~t~et~~~---~~~~lfpE~~dgvv~~tI~ 293 (296) +++.-..+..+.. . +++ .+-.+.-+-|++.++.+..- ....-+.+ .-+=+=+.+.+++++++.. T Consensus 232 ~~~gd~~~~~~~~--~--~~~----~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a 303 (304) T protein:vir:10 232 ALMGDWDYARYGI--L--QGI----EYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPT 303 (304) T ss_pred EEEEehhhEEEEE--e--cce----EEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 5554444432211 1 111 01011111122222111110 00000000 0011123445667777776 Q ss_pred C Q lcl|Aclame:pro 294 P 294 (296) Q Consensus 294 ~ 294 (296) . T Consensus 304 ~ 304 (304) T protein:vir:10 304 E 304 (304) T ss_pred C Confidence 6 No 38 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.18 E-value=5.1e-12 Score=82.49 Aligned_cols=267 Identities=13% Similarity=0.079 Sum_probs=150.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ++-..++...|.+.+.+-+...--++.+++-+.+..-.-++...+.+|+. |.++++|++.....+. -|+||++||.++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:96 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccee-EecCCccccccc Confidence 33222332334443333333333456666666666666677777888865 6679999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------cce--ecc Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------GTQ--DAL 147 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------~t~--~~t 147 (296) ++.. ..+++.+|++..+ |.|.++.+. -+..+.-.++|+.++++++++-++.--.++. ... ... T Consensus 96 ~~~~---~v~~~~~k~~~~~~is~ell~ds~-~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~ 171 (324) T protein:vir:96 96 ATWV---NATMRAFKLGVILPVTKEFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) T ss_pred ccee---EEEEeeEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceecc Confidence 9875 5888999999865 999986554 4677888899999999999999884321111 000 000 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEecc--CCCceEEE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND--VTKGEIWA 224 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~k--V~~G~~~~ 224 (296) ++.. ++.+.++...+.......-+.++||.+..++++-.+-.-+..+ ++... .++|..|+.+.. +++|.+|+ T Consensus 172 ~~~t----~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~-~l~G~PV~~~~~~~~~~~~~~~ 246 (324) T protein:vir:96 172 GDFT----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-SLDGLPVVNLKSSNLKRGELIT 246 (324) T ss_pred cccc----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCC-cccceeeEeeCCCCCCcceEEE Confidence 1110 1222333333444333445789999998876643322222222 22222 378988887665 56677777 Q ss_pred EcccceEEEEecCcchhh--hh-hh------------c-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSEL--AK-EF------------N-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~--~~-~f------------~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) .-..++.+ .+..+=.+ .+ ++ + +..|++.|-...+ +.+..+.| ++++ T Consensus 247 gd~~~~~~--g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r-------------~d~~v~~~---~A~~ 308 (324) T protein:vir:96 247 GDFDKLIY--GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-------------VALHIADD---KAFA 308 (324) T ss_pred EecceEEE--EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE-------------EccEEecc---cceE Confidence 65555432 22110001 00 00 0 1123333222211 12222333 4566 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) +++.-.++ T Consensus 309 ~l~~a~~~ 316 (324) T protein:vir:96 309 KLVPADKR 316 (324) T ss_pred EEeccccc Confidence 66664444 No 39 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.18 E-value=5.1e-12 Score=82.49 Aligned_cols=267 Identities=13% Similarity=0.079 Sum_probs=150.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ++-..++...|.+.+.+-+...--++.+++-+.+..-.-++...+.+|+. |.++++|++.....+. -|+||++||.++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:78 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccee-EecCCccccccc Confidence 33222332334443333333333456666666666666677777888865 6679999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------cce--ecc Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------GTQ--DAL 147 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------~t~--~~t 147 (296) ++.. ..+++.+|++..+ |.|.++.+. -+..+.-.++|+.++++++++-++.--.++. ... ... T Consensus 96 ~~~~---~v~~~~~k~~~~~~is~ell~ds~-~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~ 171 (324) T protein:vir:78 96 ATWV---NATMRAFKLGVILPVTKEFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) T ss_pred ccee---EEEEeeEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceecc Confidence 9875 5888999999865 999986554 4677888899999999999999884321111 000 000 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEecc--CCCceEEE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND--VTKGEIWA 224 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~k--V~~G~~~~ 224 (296) ++.. ++.+.++...+.......-+.++||.+..++++-.+-.-+..+ ++... .++|..|+.+.. +++|.+|+ T Consensus 172 ~~~t----~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~-~l~G~PV~~~~~~~~~~~~~~~ 246 (324) T protein:vir:78 172 GDFT----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-SLDGLPVVNLKSSNLKRGELIT 246 (324) T ss_pred cccc----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCC-cccceeeEeeCCCCCCcceEEE Confidence 1110 1222333333444333445789999998876643322222222 22222 378988887665 56677777 Q ss_pred EcccceEEEEecCcchhh--hh-hh------------c-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSEL--AK-EF------------N-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~--~~-~f------------~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) .-..++.+ .+..+=.+ .+ ++ + +..|++.|-...+ +.+..+.| ++++ T Consensus 247 gd~~~~~~--g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r-------------~d~~v~~~---~A~~ 308 (324) T protein:vir:78 247 GDFDKLIY--GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-------------VALHIADD---KAFA 308 (324) T ss_pred EecceEEE--EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE-------------EccEEecc---cceE Confidence 65555432 22110001 00 00 0 1123333222211 12222333 4566 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) +++.-.++ T Consensus 309 ~l~~a~~~ 316 (324) T protein:vir:78 309 KLVPADKR 316 (324) T ss_pred EEeccccc Confidence 66664444 No 40 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.18 E-value=4.7e-12 Score=82.68 Aligned_cols=267 Identities=14% Similarity=0.085 Sum_probs=148.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) .+--+++--.|.+.+.+-+...--++++++-+.+.+-.-++..-+.+|+. |.++++|+++....+. -|+||+.||.++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:93 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccee-eecCCccccccc Confidence 11112221222222222222223356677777777666677777888866 5568999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc----------ceec-c Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG----------TQDA-L 147 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~----------t~~~-t 147 (296) .+.. ..+++.+|+++-+ |+|.++.+ .-+..+.-.++|..+|++++++.++.--.+... .... . T Consensus 96 ~~f~---~i~~~~~k~~~~~~iS~ell~ds-~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~ 171 (324) T protein:vir:93 96 ATWV---NATMRAFKLGVILPVTKEFLNYT-YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) T ss_pred ccee---EEEEEeEEEEEeehhhHHHHhcc-hHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceecc Confidence 8875 5888999999875 99999644 346678889999999999999998742211100 0000 0 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEecc--CCCceEEE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND--VTKGEIWA 224 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~k--V~~G~~~~ 224 (296) +... ++.+.++....++......+.++||.+...+++-.+-.-+..| ++... .++|..|+.+.. .++|.+++ T Consensus 172 ~~~~----~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~~~~-~l~G~PVv~~~~~~~~~~~i~~ 246 (324) T protein:vir:93 172 GDFT----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-SLDGLPVVNLKSSNLKRGELIT 246 (324) T ss_pred cccc----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCC-cccceeeEeecCCCCCcceEEE Confidence 1111 1222223333343333345789999998877643222222222 22222 478998888665 55677776 Q ss_pred EcccceEEEEecCcchhhhh--h--hc------------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSELAK--E--FN------------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~~~--~--f~------------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) .-..++.+. +...-++.. . +. +-.|++.|-...+ +.+. +-+.++++ T Consensus 247 gdfs~~~~~--~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r-------------~d~~---v~~~~a~~ 308 (324) T protein:vir:93 247 GDFDKLIYG--IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-------------VALH---IADDKAFA 308 (324) T ss_pred EecceEEEE--EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE-------------eccE---EecccceE Confidence 665554322 221111110 0 00 1112222222111 1122 33445677 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) +++.-.++ T Consensus 309 ~l~~a~~~ 316 (324) T protein:vir:93 309 KLVPADKR 316 (324) T ss_pred EEeccccc Confidence 77665555 No 41 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=99.16 E-value=2.3e-12 Score=84.43 Aligned_cols=275 Identities=9% Similarity=0.051 Sum_probs=143.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhC-ccccc----ccCCCCeeeeeeeeeeecccC---cccC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLG-VTRKI----SVSEGMTLKTYAGYDVTLAEG---NVPE 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~Lg-Vtr~~----~~~pG~tIt~pk~~yig~A~g---dVaE 72 (296) |- |-+-... ..+=.....|.+++- +.. |.|.. ..++|+||++|++....-.+- ..++ T Consensus 1 Ma--------~~~~~p~---~~a~~~l~~l~~~lv----~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~ 65 (392) T protein:vir:99 1 MA--------NAFSKPT---AVVDTAIQMLQNELI----LTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGA 65 (392) T ss_pred Cc--------cccccHH---HHHHHHHHHHHhhcc----chhhhccccccccccCCCCeEEEeecccccceeeecccccc Confidence 21 1000000 111122333333322 222 33432 347899999999865533321 2467 Q ss_pred CceechhheeeeecceeEEEEeecc-cc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchh Q lcl|Aclame:pro 73 GEVIPLSKVERKIHSEKKIELKKYR-KA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGA 149 (296) Q Consensus 73 Ge~Iplskv~~~~~~t~~~tikK~~-K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~ 149 (296) |..|..+.++.+ ..+++|.|++ ++ ++|++- ....+|...+.-+|..++|++++|.+++..+..+......... T Consensus 66 ~~~~~~~~~~~~---~~~~~id~~k~~~~~i~d~e~-~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~ 141 (392) T protein:vir:99 66 ERNLTVSDFTED---SFPVTLTDVAYHLGVLTDEEL-TFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVH 141 (392) T ss_pred CCcccccccccc---eEEEEEeeeeecceeechHHH-hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 888999998865 5788885433 34 488884 6899999999999999999999999999988765432111111 Q ss_pred hH-HHHHHHHHHHHHHhhcccc-CcceEEEEcHHHHHHHhcCCcccccee--------echhhhhhhheeEEEEeccCCC Q lcl|Aclame:pro 150 GL-QGALASAWGKLQVLFEDYG-SERAIVFANSLDVAEYIAKAGITTQTA--------FGLTYLVDFTGTVIISTNDVTK 219 (296) Q Consensus 150 ~l-Q~Ala~~~~~~~~~Feded-~~~~VlFvNP~Daa~~l~~a~i~~q~~--------fg~tyl~nfLG~~II~S~kV~~ 219 (296) .+ ....++.|.++...+++.+ ...-+++++|...+.+|++..+..... +--..+.+++|++|+.|+.+|. T Consensus 142 ~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~ 221 (392) T protein:vir:99 142 EVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPH 221 (392) T ss_pred ccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccccc Confidence 11 2223455666667776642 123589999999999998875532111 1112234688999999999999 Q ss_pred ceEEEEcccceEEEEecCcchhhhhhhccccccccceE--EEeccccceeehhhhh--hHHHH-----hhhhccceEEEE Q lcl|Aclame:pro 220 GEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIG--MNHFQENTTLTIQTLL--VSGML-----MYPERIDGIVKV 290 (296) Q Consensus 220 G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliG--v~h~~~~~~~t~et~~--~~~~~-----lfpE~~dgvv~~ 290 (296) +..++..+..+.++.--|... ....+.-.+.-.+-+. ++-+.+....+..+.+ ..|.. -.+...... .+ T Consensus 222 ~t~~a~~~~a~~~at~a~v~~-~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~-~~ 299 (392) T protein:vir:99 222 GDAYLYHPTAFIMATRAPAPP-MGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRAR-KI 299 (392) T ss_pred ccceeeecccccccccccccc-ccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeee-ee Confidence 999888777776655433211 1111111122222221 1111111111111100 00000 000000000 00 Q ss_pred Ee-cCCC Q lcl|Aclame:pro 291 TL-TPGV 296 (296) Q Consensus 291 tI-~~~v 296 (296) +. ..++ T Consensus 300 ~~~~~~v 306 (392) T protein:vir:99 300 HLIPGSI 306 (392) T ss_pred eeeccee Confidence 00 0000 No 42 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.16 E-value=7.3e-12 Score=81.65 Aligned_cols=252 Identities=12% Similarity=0.057 Sum_probs=140.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipls 79 (296) +.++.+.....|+..+ + +-++++... .-.-+++.-+.+|+. +.++++|.+.-. +.+ .-|+||+++|-+ T Consensus 113 ~~~~~~~~~g~lip~~-~----~~~ii~~~~----~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~a-~~v~Eg~~~~~~ 181 (390) T protein:vir:97 113 ASTDAAGSAGALTTPN-R----LPGFITPPD----ARLTVRDLIGSGRTD-SALIEYVQETGFVNNA-AIVAEGALKPES 181 (390) T ss_pred hhcccccccccccchh-h----hHHHHHHHh----hhhhhHhhcceeecc-CCceEEEEEecCCcce-eeecCCcccccc Confidence 3333333333333222 1 223344333 333344455677776 557889998654 345 489999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------c-c---e Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------G-T---Q 144 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------~-t---~ 144 (296) +.+.. ..+++.||++.-+ |.|.++.+ .+....-.++|+.+++.+++.-|+.--.++. . . . T Consensus 182 ~~~~~---~i~~~~~k~~~~~~is~ell~ds--~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~ 256 (390) T protein:vir:97 182 SLKFA---KKTDTTHVIAHTMKATRQILSDA--PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT 256 (390) T ss_pred cccee---EEEEeeeeEEEeehhhHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccc Confidence 98865 5889999999875 99998755 3577788899999999999998885311110 0 0 0 Q ss_pred ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCC Q lcl|Aclame:pro 145 DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTK 219 (296) Q Consensus 145 ~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~ 219 (296) ..+++.. +..+.++....+.......+++|||.+.+.+.+-.+-.-+..| +.++ .++|..|+.|+.+|+ T Consensus 257 ~~~~~~~----~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~--~l~G~pV~~~~~~~~ 330 (390) T protein:vir:97 257 TIAGATR----VDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTP--TLWGLPVVATQAMAP 330 (390) T ss_pred cccccch----HHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCc--eecceeeEEcCCCCC Confidence 0111111 1223333334444433456789999998866532211111112 1111 378999999999999 Q ss_pred ceEEEEcccceEEEEecCcchhhhhhh-----ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 220 GEIWATVPENIIFAYINPNNSELAKEF-----NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 220 G~~~~t~~~Nl~~ay~~~~~g~~~~~f-----~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) |++++-...+-.+ +++. +++.-.+ .+..|.+++.+..+. .+. +=..++++++++. T Consensus 331 ~~~~~gd~~~~~~-~~~~--~~~~i~~~~~~~~f~~~~~~~r~~~r~-------------d~~---v~~~~a~v~~~~a 390 (390) T protein:vir:97 331 GEFLVGAFDLAAQ-IFDQ--WDARVEIGYVNDDFQRNMVTVLAEERL-------------ALV---VYRPEALITGSFA 390 (390) T ss_pred CcEEEEeccceEE-EEEe--cceEEEEeecccccccCcEEEEEEEee-------------ccE---EeccccEEEEEeC Confidence 9988654433111 1221 1121111 122233333222211 111 2244788888888 No 43 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.15 E-value=1.6e-12 Score=85.24 Aligned_cols=257 Identities=14% Similarity=0.041 Sum_probs=141.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ..++.+...-.++.. -.+-+++++..+. .-++..-+.+|++. ..+++|.+...+.+-.-|+||+.||-++ T Consensus 104 ~~~~~~~~~g~~i~~-----~~~~~ii~~~~~~----~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~ 173 (385) T protein:vir:18 104 SLGSDADSAGSLIQP-----MQIPGIIMPGLRR----LTIRDLLAQGRTSS-NALEYVREEVFTNNADVVAEKALKPESD 173 (385) T ss_pred hhccccccCCceecc-----hhhhHHHHHhhhc----cchhhhcceecccC-cceEEEEEecCCcceeeeccCccccccc Confidence 122222222223221 1122333333332 23344456667764 4799999865433334789999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-----------c--ee Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG-----------T--QD 145 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~-----------t--~~ 145 (296) .+.. ..+++.+|++..+ |.|.++.+ .+-.+.-.++|+.+++.+++..|+.--.++.. + .. T Consensus 174 ~~~~---~~~~~~~k~~~~~~is~ell~d~--~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~ 248 (385) T protein:vir:18 174 ITFS---KQTANVKTIAHWVQASRQVMDDA--PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN 248 (385) T ss_pred ccee---EEEEeeeeEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc Confidence 8865 5788999999875 99998754 35677889999999999999998863211110 0 01 Q ss_pred cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCCce Q lcl|Aclame:pro 146 ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTKGE 221 (296) Q Consensus 146 ~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~G~ 221 (296) ..++. .+..+.++....+.......+.++||.+...+++-.+-.-+..| +++= ..++|..|+.|..+|+|+ T Consensus 249 ~~~~~----~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~-~~l~G~pV~~~~~~p~~~ 323 (385) T protein:vir:18 249 ATGDT----RADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTS-NIMWGLPVVPTKAQAAGT 323 (385) T ss_pred ccccc----hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCC-ceecceeeEEcCcCCCCc Confidence 11111 23334444445555455567899999999876653221112222 2211 136799999999999999 Q ss_pred EEEEcccceEEEEecCcchhhhhhh------ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~g~~~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) +++--..+ +|.-...+++.=.. .+..|.+++.... -+.+..+ +.+++++++++++ T Consensus 324 ~~~gd~~~---~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-------------r~~~~v~---~~~a~~~~~~~aa 384 (385) T protein:vir:18 324 FTVGGFDM---ASQVWDRMDATVEVSREDRDNFVKNMLTILCEE-------------RLALAHY---RPTAIIKGTFSSG 384 (385) T ss_pred EEEeeccc---EEEEEEecceEEEEeccccchhhcCcEEEEEEE-------------eeccEEe---cccceEEEEeccC Confidence 88643322 22211111111000 0111222221111 1122223 3478999999887 Q ss_pred C Q lcl|Aclame:pro 296 V 296 (296) Q Consensus 296 v 296 (296) - T Consensus 385 ~ 385 (385) T protein:vir:18 385 S 385 (385) T ss_pred C Confidence 7 No 44 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.15 E-value=1.6e-12 Score=85.24 Aligned_cols=257 Identities=14% Similarity=0.041 Sum_probs=141.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ..++.+...-.++.. -.+-+++++..+. .-++..-+.+|++. ..+++|.+...+.+-.-|+||+.||-++ T Consensus 104 ~~~~~~~~~g~~i~~-----~~~~~ii~~~~~~----~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~ 173 (385) T protein:vir:19 104 SLGSDADSAGSLIQP-----MQIPGIIMPGLRR----LTIRDLLAQGRTSS-NALEYVREEVFTNNADVVAEKALKPESD 173 (385) T ss_pred hhccccccCCceecc-----hhhhHHHHHhhhc----cchhhhcceecccC-cceEEEEEecCCcceeeeccCccccccc Confidence 122222222223221 1122333333332 23344456667764 4799999865433334789999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-----------c--ee Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG-----------T--QD 145 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~-----------t--~~ 145 (296) .+.. ..+++.+|++..+ |.|.++.+ .+-.+.-.++|+.+++.+++..|+.--.++.. + .. T Consensus 174 ~~~~---~~~~~~~k~~~~~~is~ell~d~--~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~ 248 (385) T protein:vir:19 174 ITFS---KQTANVKTIAHWVQASRQVMDDA--PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN 248 (385) T ss_pred ccee---EEEEeeeeEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc Confidence 8865 5788999999875 99998754 35677889999999999999998863211110 0 01 Q ss_pred cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCCce Q lcl|Aclame:pro 146 ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTKGE 221 (296) Q Consensus 146 ~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~G~ 221 (296) ..++. .+..+.++....+.......+.++||.+...+++-.+-.-+..| +++= ..++|..|+.|..+|+|+ T Consensus 249 ~~~~~----~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~-~~l~G~pV~~~~~~p~~~ 323 (385) T protein:vir:19 249 ATGDT----RADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTS-NIMWGLPVVPTKAQAAGT 323 (385) T ss_pred ccccc----hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCC-ceecceeeEEcCcCCCCc Confidence 11111 23334444445555455567899999999876653221112222 2211 136799999999999999 Q ss_pred EEEEcccceEEEEecCcchhhhhhh------ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~g~~~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) +++--..+ +|.-...+++.=.. .+..|.+++.... -+.+..+ +.+++++++++++ T Consensus 324 ~~~gd~~~---~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-------------r~~~~v~---~~~a~~~~~~~aa 384 (385) T protein:vir:19 324 FTVGGFDM---ASQVWDRMDATVEVSREDRDNFVKNMLTILCEE-------------RLALAHY---RPTAIIKGTFSSG 384 (385) T ss_pred EEEeeccc---EEEEEEecceEEEEeccccchhhcCcEEEEEEE-------------eeccEEe---cccceEEEEeccC Confidence 88643322 22211111111000 0111222221111 1122223 3478999999887 Q ss_pred C Q lcl|Aclame:pro 296 V 296 (296) Q Consensus 296 v 296 (296) - T Consensus 385 ~ 385 (385) T protein:vir:19 385 S 385 (385) T ss_pred C Confidence 7 No 45 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.13 E-value=3.2e-12 Score=83.59 Aligned_cols=284 Identities=13% Similarity=0.077 Sum_probs=146.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-+.+ ....+.+.+.+-+...--++.++|-+.+.+-.-++..-+..||.. ..+++|++.....+. -|+||++||-++ T Consensus 1 m~~~~-~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 77 (330) T protein:vir:77 1 MAGST-VPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSAS-WTGEAERKPITK 77 (330) T ss_pred Ccccc-cchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCccee-EecCCCcccccc Confidence 43333 222233333333332222344444444444445666667788775 458999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------------- Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------------- 142 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------------- 142 (296) .+.. ..+++.||++.-+ |.|.++.+++ +..+.-.++|.++|++++++.|+.-=.++.+ T Consensus 78 ~~f~---~i~~~~~k~~~~~~is~ell~ds~~-~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~ 153 (330) T protein:vir:77 78 GSFG---KQELEPVKITTIFAESAEVVRLNPL-NYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLAD 153 (330) T ss_pred ceee---EEEEeEEEEEEeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeec Confidence 8864 5788999999864 9999865543 5778899999999999999999842110000 Q ss_pred ceecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chh----hhhhhheeEEEE Q lcl|Aclame:pro 143 TQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLT----YLVDFTGTVIIS 213 (296) Q Consensus 143 t~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~t----yl~nfLG~~II~ 213 (296) +...+...........+.++.......+....+.++||.+...+++-.+-.-+..| ++. .-..++|..|+. T Consensus 154 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~ 233 (330) T protein:vir:77 154 TNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYV 233 (330) T ss_pred ccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEE Confidence 00011111111122233333444444444556889999999877742211111111 111 111378999999 Q ss_pred eccCCCce------EEEEcccceEEEEecCcchhhh--hhhccccccc---cceEEEecccccee-ehhh-hhhHHHHhh Q lcl|Aclame:pro 214 TNDVTKGE------IWATVPENIIFAYINPNNSELA--KEFNLYGDPT---GYIGMNHFQENTTL-TIQT-LLVSGMLMY 280 (296) Q Consensus 214 S~kV~~G~------~~~t~~~Nl~~ay~~~~~g~~~--~~f~~~td~t---GliGv~h~~~~~~~-t~et-~~~~~~~lf 280 (296) +..+|.|. +|+....+.. +.+..+-++. +..-+.+++. +.-+..++.-.++. .+-. .-+.+. T Consensus 234 ~~~~p~~~~~~~~~~~~gd~s~~~--i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~--- 308 (330) T protein:vir:77 234 ADNVVNGTVGNRVVGVMGDFSQVI--WGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFM--- 308 (330) T ss_pred eccccCCCCCCccEEEEEecceEE--EEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccE--- Confidence 99999865 3332222221 1111110110 0000001000 00000000000000 0000 011122 Q ss_pred hhccceEEEEEecCCC Q lcl|Aclame:pro 281 PERIDGIVKVTLTPGV 296 (296) Q Consensus 281 pE~~dgvv~~tI~~~v 296 (296) +-+.+++++++...|. T Consensus 309 v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 309 VNDKDAFVKLTDQVAG 324 (330) T ss_pred EecccceEEEEeccCC Confidence 2344677888777766 No 46 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.13 E-value=3.3e-12 Score=83.53 Aligned_cols=274 Identities=13% Similarity=0.086 Sum_probs=149.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ++-..++-..|.+.+.+-+...--++++++-+.+.+-.-++..-+.+|+. |.++++|++.....+. -|+||+.+|.++ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 95 (324) T protein:vir:96 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAY-WVGEGQKIETSK 95 (324) T ss_pred hhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccee-eecCCccccccc Confidence 33333333333333333332223456677766666666677777888875 5679999987666674 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC---------cc--ceecc Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG---------TG--TQDAL 147 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta---------t~--t~~~t 147 (296) ++.. ..+++.+|++.-+ |.|.++.+ ..+..+.-.++|..+|+++++.-+|.--.++ +. ..... T Consensus 96 ~~f~---~v~~~~~k~~~~~~is~ell~ds-~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 171 (324) T protein:vir:96 96 ATWV---NATMRAFKLGVILPVTKEFLNYT-YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIK 171 (324) T ss_pred ccee---EEEEEeEEEEEeehhhHHHHhcc-hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecc Confidence 8875 5888999999875 99998644 3567889999999999999999988421111 00 00000 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-chhhhhhhheeEEEEecc--CCCceEEE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLTYLVDFTGTVIISTND--VTKGEIWA 224 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-g~tyl~nfLG~~II~S~k--V~~G~~~~ 224 (296) +... ++.+.++....++.....-..++||.+...++.-.+-.-+..| ++.-. .++|..|+.+.. .++|.+++ T Consensus 172 ~~~~----~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~~~~-~l~G~PV~~~~~~~~~~~~~~~ 246 (324) T protein:vir:96 172 GDFT----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-SLDGLPVVNLKSSNLKRGELIT 246 (324) T ss_pred cccc----hHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCC-cccceeeEeecCCCCCcceEEE Confidence 1111 1122222233333333344789999998877643322222223 22222 378988887665 55677887 Q ss_pred EcccceEEEEecCcchhhhhhhccccccccceEEEecccc-------ceee-hhhh-hhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQEN-------TTLT-IQTL-LVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~-------~~~t-~et~-~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) .-..++.+. +. +++ .+-.+.-..++...+... ++.+ +-.. -+.+. +-+.+++++++...+ T Consensus 247 gd~s~~~~~--~~--~~~----~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~---v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 247 GDFDKLIYG--IP--QLI----EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH---IADDKAFAKLVPADK 315 (324) T ss_pred EecceEEEE--Ee--cCc----EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE---EecccceEEEecccc Confidence 766665432 22 111 111111111111000000 0000 0000 01122 233456777776666 Q ss_pred C Q lcl|Aclame:pro 296 V 296 (296) Q Consensus 296 v 296 (296) + T Consensus 316 ~ 316 (324) T protein:vir:96 316 R 316 (324) T ss_pred c Confidence 6 No 47 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.12 E-value=1.2e-11 Score=80.38 Aligned_cols=261 Identities=10% Similarity=-0.017 Sum_probs=140.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipls 79 (296) ..++.+. ..+ ...--++.+.+-+.+..-.-++..-+.+|+. |.++++|.+.-. ..+ .-|+||+.+|-+ T Consensus 113 ~~~~~~~-~~g--------~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~-~~~~~~~~~~~~~~~a-~~v~E~~~~~~~ 181 (395) T protein:vir:43 113 AITSIDG-SGG--------ALVAPDRRPGVVAAPQRRLTIRDLVAPGTTE-SNSVEYVRETGFVNNA-APVSEGTQKPYS 181 (395) T ss_pred hhcccCC-CCc--------cccchhhHHHHHHHHHhhhhHHhhccceecC-CCceEEEEEecCCCce-eeecCCcccccc Confidence 2221111 111 1112233444444444444456666777775 567899987554 345 379999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------c--eec Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------T--QDA 146 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t--~~~ 146 (296) +.+.+ ..+++.+|++..+ |.|.++.++ +-.+.-.++|+.+++.+++..|+.--.++.. . .+. T Consensus 182 ~~~~~---~i~~~~~k~~~~~~is~ell~d~~--~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~ 256 (395) T protein:vir:43 182 DLTFE---LENAPVRTIAHLFKASRQILDDAS--ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPS 256 (395) T ss_pred cccee---EEEEeeeeEEEeehhhHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc Confidence 98875 4788999999875 999987553 5667888999999999999999852111100 0 000 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc----ceeechhhhhhhheeEEEEeccCCCceE Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT----QTAFGLTYLVDFTGTVIISTNDVTKGEI 222 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~----q~~fg~tyl~nfLG~~II~S~kV~~G~~ 222 (296) +...--......+.++.......+....+.++||.+...+++-.+-.- +..++++-. .++|..|+.|+.+|+|++ T Consensus 257 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~-~l~G~pVv~~~~~~~~~~ 335 (395) T protein:vir:43 257 GVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTP-TLWRLPVVETQAITQDEF 335 (395) T ss_pred ccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCc-eecceeeEEcCCCCCCcE Confidence 000000112233333333344433345689999999887653221111 112223222 378999999999999998 Q ss_pred EEEcccceEEEEecCcchhhh--h--hhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 223 WATVPENIIFAYINPNNSELA--K--EFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 223 ~~t~~~Nl~~ay~~~~~g~~~--~--~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) ++--..+..+. .+-.+..+. . ..++..|.+++.... -+.+..+.| ++++.++++++ T Consensus 336 ~~gd~~~~~~~-~~~~~~~i~~~~~~~~~f~~~~~~~r~~~-------------r~d~~v~~~---~a~~~~~~taa 395 (395) T protein:vir:43 336 LTGAFSLGAQI-FDRMDIEVLVSTENDKDFENNMVTIRAEE-------------RLAFAVYRP---EAFVTGSLTAS 395 (395) T ss_pred EEEeccceEEE-EEecceEEEEeccccchhhcCcEEEEEEE-------------eeccEEecc---cceEEEEeccC Confidence 75433331111 111111110 0 001111222222111 122333334 48999999999 No 48 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.12 E-value=4e-12 Score=83.07 Aligned_cols=269 Identities=11% Similarity=0.014 Sum_probs=152.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceec-h Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ip-l 78 (296) ..-++.-...+.+.+.+=+...-.++.+++-+.+..-.-+++.-+.+||+.|+ ++.++++.-...+. .|+||+.+| . T Consensus 112 ~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~Eg~~~~~~ 190 (415) T protein:vir:47 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE-KVEELEENPEL 190 (415) T ss_pred HHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCccee-ecccccccccc Confidence 01011111112222223333444566777777777777777778889998885 56677765555564 899999999 4 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------------e Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------------Q 144 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------------~ 144 (296) +..+.+ ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.-+.++... . T Consensus 191 ~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:47 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccee---eEEeeeeeeEeeehhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee Confidence 555543 5788899999875 9999876665 45778999999999999999999877543211 0 Q ss_pred -ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCC Q lcl|Aclame:pro 145 -DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVT 218 (296) Q Consensus 145 -~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~ 218 (296) ..+..++. .+.++.....+.....-+.++||.+.+.+++-.+-.-+..| +++-. .++|..|+.+...| T Consensus 267 ~~~~~~~~~-----~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~-~l~G~pV~~~~~~~ 340 (415) T protein:vir:47 267 EVKKAKSLD-----DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQ-RLLGAKIEILPDEV 340 (415) T ss_pred ccccccchH-----HHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCc-cccceeeEEecccc Confidence 01112221 22223333333333456789999999876432211111112 12111 37899999988887 Q ss_pred CceE--EEEcccceEEEEecCcchhhhhh-hccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 219 KGEI--WATVPENIIFAYINPNNSELAKE-FNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 219 ~G~~--~~t~~~Nl~~ay~~~~~g~~~~~-f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) .|.. ....-.|..-+|.-...+++.=. .++.++.+++++..+ +.+..+ ..++++.++++++ T Consensus 341 ~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r-------------~d~~v~---~~~a~~~~~~~~~ 404 (415) T protein:vir:47 341 LGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR-------------QDCRIL---DYKSAIVIEYDDS 404 (415) T ss_pred ccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEE-------------eccEEe---ccccEEEEEeecc Confidence 5531 00111222222222111222211 223445555544321 112222 4588999999999 Q ss_pred C Q lcl|Aclame:pro 296 V 296 (296) Q Consensus 296 v 296 (296) + T Consensus 405 ~ 405 (415) T protein:vir:47 405 E 405 (415) T ss_pred C Confidence 9 No 49 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.12 E-value=4e-12 Score=83.07 Aligned_cols=269 Identities=11% Similarity=0.014 Sum_probs=152.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceec-h Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ip-l 78 (296) ..-++.-...+.+.+.+=+...-.++.+++-+.+..-.-+++.-+.+||+.|+ ++.++++.-...+. .|+||+.+| . T Consensus 112 ~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~Eg~~~~~~ 190 (415) T protein:vir:46 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE-KVEELEENPEL 190 (415) T ss_pred HHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCccee-ecccccccccc Confidence 01011111112222223333444566777777777777777778889998885 56677765555564 899999999 4 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------------e Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------------Q 144 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------------~ 144 (296) +..+.+ ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.-+.++... . T Consensus 191 ~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:46 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccee---eEEeeeeeeEeeehhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee Confidence 555543 5788899999875 9999876665 45778999999999999999999877543211 0 Q ss_pred -ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCC Q lcl|Aclame:pro 145 -DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVT 218 (296) Q Consensus 145 -~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~ 218 (296) ..+..++. .+.++.....+.....-+.++||.+.+.+++-.+-.-+..| +++-. .++|..|+.+...| T Consensus 267 ~~~~~~~~~-----~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~-~l~G~pV~~~~~~~ 340 (415) T protein:vir:46 267 EVKKAKSLD-----DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQ-RLLGAKIEILPDEV 340 (415) T ss_pred ccccccchH-----HHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCc-cccceeeEEecccc Confidence 01112221 22223333333333456789999999876432211111112 12111 37899999988887 Q ss_pred CceE--EEEcccceEEEEecCcchhhhhh-hccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 219 KGEI--WATVPENIIFAYINPNNSELAKE-FNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 219 ~G~~--~~t~~~Nl~~ay~~~~~g~~~~~-f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) .|.. ....-.|..-+|.-...+++.=. .++.++.+++++..+ +.+..+ ..++++.++++++ T Consensus 341 ~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r-------------~d~~v~---~~~a~~~~~~~~~ 404 (415) T protein:vir:46 341 LGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR-------------QDCRIL---DYKSAIVIEYDDS 404 (415) T ss_pred ccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEE-------------eccEEe---ccccEEEEEeecc Confidence 5531 00111222222222111222211 223445555544321 112222 4588999999999 Q ss_pred C Q lcl|Aclame:pro 296 V 296 (296) Q Consensus 296 v 296 (296) + T Consensus 405 ~ 405 (415) T protein:vir:46 405 E 405 (415) T ss_pred C Confidence 9 No 50 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.11 E-value=3.4e-12 Score=83.43 Aligned_cols=251 Identities=12% Similarity=0.081 Sum_probs=140.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipls 79 (296) +.++.+.+...++..+.+ -+++++..+. ..|++ .-+.+|+. +..+++|.|.-. +.+ .-|+||+++|-+ T Consensus 113 ~~~~~~~~~g~~~~~~~~-----~~ii~~~~~~-~~l~~---~~~~~~~~-~~~~~~~~~~~~~~~a-~~v~Eg~~~~~~ 181 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRL-----PGFITQPDAR-LTVRD---LIGSGRTD-SALIEYVQETGFVNNA-AIVAEGALKPES 181 (390) T ss_pred hhcccccccccccchhHH-----HHHHHHHHhh-chhhh---hcceeecc-CCceEEEEEecCCcce-eeecCCcccccc Confidence 444444445555544333 1334433332 22444 34556665 447999998754 345 489999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------c----ce Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------G----TQ 144 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------~----t~ 144 (296) ..+.. ..+++.+|++..+ |.|.++.+. +-...-.++|+.+++.+++..|+.--.++. . +. T Consensus 182 ~~~~~---~i~~~~~k~~~~~~is~ell~d~~--~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~ 256 (390) T protein:vir:10 182 SLKFA---KKTDTTHVIAHTMKATRQILSDAP--QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT 256 (390) T ss_pred cccee---EEEEeeEEEEEeehhhHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccc Confidence 98865 5788899998865 999987553 678888899999999999998885311110 0 00 Q ss_pred ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCCc Q lcl|Aclame:pro 145 DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTKG 220 (296) Q Consensus 145 ~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~G 220 (296) ...+...-..+. ++....+.......+.++||.+.+.+++-.+-.-+..| ++.-. .++|..|+.++.+|+| T Consensus 257 ~~~~~~~~~~~~----~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~-~l~G~pv~~~~~~p~~ 331 (390) T protein:vir:10 257 TIAGATRVDQLR----LAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTP-TLWGLPVVATQAMAPG 331 (390) T ss_pred cccccchHHHHH----HHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCc-eecceeeEEcCCCCCC Confidence 111112212222 22233333333455788999998876642221111121 11111 2789999999999999 Q ss_pred eEEEEcccceEEEE--ecCcchhhhhhh-----ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 221 EIWATVPENIIFAY--INPNNSELAKEF-----NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 221 ~~~~t~~~Nl~~ay--~~~~~g~~~~~f-----~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) ++|+--. .-+| .+. +++.-.+ .+..|.+.+.+..+.- +=+-+.++++++++- T Consensus 332 ~~~~gdf---~~~~~~~~~--~~~~i~~~~~~~~~~~~~~~~r~~~r~d----------------~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 332 EFLVGAF---DLAAQIFDQ--WDARVEIGYVNDDFQRNMVTVLAEERLA----------------LVVYRPEALISGSFA 390 (390) T ss_pred cEEEEec---cceEEEEEe--cceEEEEeecccccccCcEEEEEEEeec----------------cEEeccccEEEEEeC Confidence 9875332 2222 121 1121111 1222333333222211 123455788888888 No 51 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.10 E-value=7.1e-12 Score=81.71 Aligned_cols=267 Identities=10% Similarity=-0.006 Sum_probs=152.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceech- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIPL- 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ipl- 78 (296) ...++.-...+.+++.+=+...--+|.+++-+.+..-.-++..-+.+||..|. ++.+|++.-...+. .|+||+.+|- T Consensus 112 ~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~E~~~~~~~ 190 (415) T protein:vir:79 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE-KVEELEENPEL 190 (415) T ss_pred HHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccce-eeccccccCcc Confidence 11111111112222222222233466677766666666677777888887664 77788886666665 8999999994 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----------ec Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----------DA 146 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----------~~ 146 (296) +..+.+ ..+++.+|++.-+ |.|.++.+.+ +-.+.-.++|..+++++++..++.-+.+++... +. T Consensus 191 ~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:79 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccee---eEEeeeeeeEeeehhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc Confidence 444443 5788899999875 9999876665 366778999999999999999998775442110 00 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCce Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGE 221 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~ 221 (296) ++. ....++.+.++...+.+......+.++||.+...+++-.+-.-+..| +++- ..++|..|+.+...|.|. T Consensus 267 ~~~--~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~l~G~pV~~~~~~~~~~ 343 (415) T protein:vir:79 267 EVK--KAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ-QRLLGAKIEILPDEVLGQ 343 (415) T ss_pred ccc--cccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC-ceecceeeEEecccccCC Confidence 000 00112333344444555444456789999999877542221112222 1111 137898999988887654 Q ss_pred EEEEcccceEEEEecCc-------chhhhhhh-ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 222 IWATVPENIIFAYINPN-------NSELAKEF-NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~-------~g~~~~~f-~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) . ++..++|.|.+ .+++.=.+ ++..+.+++++..+ +.+. +-+.++++.++++ T Consensus 344 ~-----~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r-------------~d~~---v~~~~a~~~~~~~ 402 (415) T protein:vir:79 344 K-----GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR-------------QDCR---ILDYKSAIVIEYD 402 (415) T ss_pred C-----CccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEE-------------eccE---EeccccEEEEEEe Confidence 2 22223333321 12222111 12234455444321 1111 2357889999999 Q ss_pred CCC Q lcl|Aclame:pro 294 PGV 296 (296) Q Consensus 294 ~~v 296 (296) +++ T Consensus 403 ~~~ 405 (415) T protein:vir:79 403 DSE 405 (415) T ss_pred ccC Confidence 999 No 52 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.10 E-value=7.1e-12 Score=81.71 Aligned_cols=267 Identities=10% Similarity=-0.006 Sum_probs=152.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceech- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIPL- 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ipl- 78 (296) ...++.-...+.+++.+=+...--+|.+++-+.+..-.-++..-+.+||..|. ++.+|++.-...+. .|+||+.+|- T Consensus 112 ~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~E~~~~~~~ 190 (415) T protein:vir:81 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE-KVEELEENPEL 190 (415) T ss_pred HHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccce-eeccccccCcc Confidence 11111111112222222222233466677766666666677777888887664 77788886666665 8999999994 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----------ec Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----------DA 146 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----------~~ 146 (296) +..+.+ ..+++.+|++.-+ |.|.++.+.+ +-.+.-.++|..+++++++..++.-+.+++... +. T Consensus 191 ~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:81 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccee---eEEeeeeeeEeeehhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc Confidence 444443 5788899999875 9999876665 366778999999999999999998775442110 00 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCce Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGE 221 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~ 221 (296) ++. ....++.+.++...+.+......+.++||.+...+++-.+-.-+..| +++- ..++|..|+.+...|.|. T Consensus 267 ~~~--~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~l~G~pV~~~~~~~~~~ 343 (415) T protein:vir:81 267 EVK--KAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ-QRLLGAKIEILPDEVLGQ 343 (415) T ss_pred ccc--cccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC-ceecceeeEEecccccCC Confidence 000 00112333344444555444456789999999877542221112222 1111 137898999988887654 Q ss_pred EEEEcccceEEEEecCc-------chhhhhhh-ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 222 IWATVPENIIFAYINPN-------NSELAKEF-NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~-------~g~~~~~f-~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) . ++..++|.|.+ .+++.=.+ ++..+.+++++..+ +.+. +-+.++++.++++ T Consensus 344 ~-----~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r-------------~d~~---v~~~~a~~~~~~~ 402 (415) T protein:vir:81 344 K-----GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR-------------QDCR---ILDYKSAIVIEYD 402 (415) T ss_pred C-----CccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEE-------------eccE---EeccccEEEEEEe Confidence 2 22223333321 12222111 12234455444321 1111 2357889999999 Q ss_pred CCC Q lcl|Aclame:pro 294 PGV 296 (296) Q Consensus 294 ~~v 296 (296) +++ T Consensus 403 ~~~ 405 (415) T protein:vir:81 403 DSE 405 (415) T ss_pred ccC Confidence 999 No 53 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.10 E-value=7.1e-12 Score=81.71 Aligned_cols=267 Identities=10% Similarity=-0.006 Sum_probs=152.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceech- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIPL- 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ipl- 78 (296) ...++.-...+.+++.+=+...--+|.+++-+.+..-.-++..-+.+||..|. ++.+|++.-...+. .|+||+.+|- T Consensus 112 ~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~E~~~~~~~ 190 (415) T protein:vir:98 112 YLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALE-KVEELEENPEL 190 (415) T ss_pred HHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccce-eeccccccCcc Confidence 11111111112222222222233466677766666666677777888887664 77788886666665 8999999994 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----------ec Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----------DA 146 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----------~~ 146 (296) +..+.+ ..+++.+|++.-+ |.|.++.+.+ +-.+.-.++|..+++++++..++.-+.+++... +. T Consensus 191 ~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:98 191 AVKPFF---QLAYDINTHRGYFRISREAIEDAKV-NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccee---eEEeeeeeeEeeehhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc Confidence 444443 5788899999875 9999876665 366778999999999999999998775442110 00 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCce Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGE 221 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~ 221 (296) ++. ....++.+.++...+.+......+.++||.+...+++-.+-.-+..| +++- ..++|..|+.+...|.|. T Consensus 267 ~~~--~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~l~G~pV~~~~~~~~~~ 343 (415) T protein:vir:98 267 EVK--KAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ-QRLLGAKIEILPDEVLGQ 343 (415) T ss_pred ccc--cccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC-ceecceeeEEecccccCC Confidence 000 00112333344444555444456789999999877542221112222 1111 137898999988887654 Q ss_pred EEEEcccceEEEEecCc-------chhhhhhh-ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 222 IWATVPENIIFAYINPN-------NSELAKEF-NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~-------~g~~~~~f-~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) . ++..++|.|.+ .+++.=.+ ++..+.+++++..+ +.+. +-+.++++.++++ T Consensus 344 ~-----~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r-------------~d~~---v~~~~a~~~~~~~ 402 (415) T protein:vir:98 344 K-----GNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVR-------------QDCR---ILDYKSAIVIEYD 402 (415) T ss_pred C-----CccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEE-------------eccE---EeccccEEEEEEe Confidence 2 22223333321 12222111 12234455444321 1111 2357889999999 Q ss_pred CCC Q lcl|Aclame:pro 294 PGV 296 (296) Q Consensus 294 ~~v 296 (296) +++ T Consensus 403 ~~~ 405 (415) T protein:vir:98 403 DSE 405 (415) T ss_pred ccC Confidence 999 No 54 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.08 E-value=6.4e-12 Score=81.97 Aligned_cols=257 Identities=12% Similarity=0.058 Sum_probs=140.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipls 79 (296) +.++.+.....++....+ -+++++..+. .-++..-+.+|+. +.++++|.+... +.+. -|+||+.+|-+ T Consensus 113 ~~~~~~~~~g~~~~~~~~-----~~ii~~~~~~----~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~-~v~Eg~~~~~~ 181 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRL-----PGFITPPDAR----LTVRDLIGSGRTD-SALIEYVQETGFVNNAA-IVAEGALKPES 181 (390) T ss_pred hccccccCCcceechhhh-----HHHHHHHhhh----hhhhhhcceeecc-CCceEEEEEecCCccee-eecCCcccccc Confidence 333333333334333222 2344444333 2233444667776 457899998654 3453 79999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------ceecch Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------TQDALG 148 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t~~~t~ 148 (296) +.+.. ..+++++|++..+ |.|.++.+. +..+.-.++|+.+++.+++..|+.--.++.. ....+. T Consensus 182 ~~~~~---~i~~~~~k~~~~~~is~ell~d~~--~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~ 256 (390) T protein:vir:81 182 SLKFA---KKTDTTHVIAHTMKATRQILSDAP--QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT 256 (390) T ss_pred cceee---EEEEeeeEEEEeehhhHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeeccccccccc Confidence 98865 5889999999875 999987664 6788888999999999999988853211110 000000 Q ss_pred hhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCCceEEE Q lcl|Aclame:pro 149 AGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTKGEIWA 224 (296) Q Consensus 149 ~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~G~~~~ 224 (296) ..-....+..+.++...++..+...-+.++||.+.+.+++-.+-.-+..| +++-. .++|..|+.++.+|+|++++ T Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~l~G~pv~~~~~~p~~~~~~ 335 (390) T protein:vir:81 257 TIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTP-TLWGLPVVATQAMAPGEFLV 335 (390) T ss_pred ccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCc-eecceeeEEcCCCCCCcEEE Confidence 00000111222223333444333445789999998877642211111122 11111 36899999999999999886 Q ss_pred EcccceEEEEecCcchhhhhhhc-----cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSELAKEFN-----LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~~~~f~-----~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) -...+.-. .++ .+++.=.+. +.+|.+++....+. .+ =+-..++++++|+. T Consensus 336 gd~~~~~~-~~~--~~~~~v~~~~~~~~~~~~~v~~r~~~r~-------------d~---~v~~~~a~v~~t~a 390 (390) T protein:vir:81 336 GAFDLAAQ-IFD--QWDARVEIGYVGEDFQRNMITVLAEERL-------------AL---VVYRPEALISGSFA 390 (390) T ss_pred EehhceEE-EEE--ecceEEEEecccchhhcCcEEEEEEEee-------------cc---EEecccceEEEEeC Confidence 55443211 111 122221111 12233333222221 11 13456778888888 No 55 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.07 E-value=4.9e-12 Score=82.58 Aligned_cols=263 Identities=13% Similarity=0.026 Sum_probs=145.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) -..++.......++..+-+...--+|.+++-+.+..-.-+++..+.+|+..|+ .++|.+...+..-.-|+||+++| ++ T Consensus 119 ~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~~~~~~ 197 (394) T protein:vir:97 119 NETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPALA 197 (394) T ss_pred HhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceecccccccccc Confidence 00011111122223333333233455555555555555566777888887665 78888755443335899999999 45 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALAS 157 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~ 157 (296) ..+.. ..+++.+|++.-+ |.|.++.+++ +-.+.-.++|+.++....+..++..+.+++.....+.+.+..++.. T Consensus 198 ~~~~~---~v~l~~~k~~~~i~is~ell~ds~~-~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~~~~~~~~ 273 (394) T protein:vir:97 198 KPDFK---DVAWNIDTYRGAIPLSQESIDDADV-DLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNG 273 (394) T ss_pred cccce---eEEeehhheeeehhhHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHh Confidence 55543 4778889999875 9999987765 3667788999999999999999988877665444455555444422 Q ss_pred HHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhh----hhhhheeEEEEec--cCCCceEEEEcccceE Q lcl|Aclame:pro 158 AWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTY----LVDFTGTVIISTN--DVTKGEIWATVPENII 231 (296) Q Consensus 158 ~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~ty----l~nfLG~~II~S~--kV~~G~~~~t~~~Nl~ 231 (296) .. +. .-..+.++||.+.+.++.=.+-.-+..|.-.+ ...++|..|+.+. -++.+.+++ .|.. T Consensus 274 ~~--------~~-~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~---gd~~ 341 (394) T protein:vir:97 274 GF--------DP-AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFI---GDFK 341 (394) T ss_pred hh--------hh-hhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEE---eecc Confidence 11 11 12346789999987654322111122231110 0137898777754 455555553 2222 Q ss_pred EEEecCcchhhhhhhc-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 232 FAYINPNNSELAKEFN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 232 ~ay~~~~~g~~~~~f~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) -+|.-...+++.=.+. ...+.+++.+.... .+. +-..++++++++++.. T Consensus 342 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~-------------d~~---v~~~~a~~~~~~~~~~ 391 (394) T protein:vir:97 342 RGVLFADRKDLGLRWADNEIYGQYLQAVLRF-------------GVS---KVDDKAGYYVTFTPEP 391 (394) T ss_pred ccEEEEEecceEEEEecccccceeEEEEEEE-------------ccE---EecccceEEEEecccc Confidence 2221111122221111 11233444433211 111 2246889999998766 No 56 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.07 E-value=1.5e-11 Score=79.99 Aligned_cols=266 Identities=15% Similarity=0.074 Sum_probs=136.5 Q ss_pred Cccccc----cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee----ecccCcccC Q lcl|Aclame:pro 1 MVTSRT----YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV----TLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~----~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi----g~A~gdVaE 72 (296) +...+. ........+++-+...--++.++|-+-+..-.-++.+-+.+|+. |.++++|...-. +.+. -|+| T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~-~v~E 182 (413) T protein:vir:81 105 YVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMT-NTTIKYLMEKANRVVEGGFK-TVAE 182 (413) T ss_pred hhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEeccccccccccc-eecC Confidence 000000 00001111122222222334444444444444444555667774 556777765432 2343 7899 Q ss_pred Cceechhhee-eeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC--------- Q lcl|Aclame:pro 73 GEVIPLSKVE-RKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG--------- 140 (296) Q Consensus 73 Ge~Iplskv~-~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta--------- 140 (296) |+++|-+.+. .. ..+++++|++..+ |.|.++.+. .-...-.++|+.++++++++.|+.--.++ T Consensus 183 g~~~~~~~~~~f~---~i~~~~~k~~~~~~iS~ell~ds~--~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~ 257 (413) T protein:vir:81 183 GGKKPYMRFADFD---IVTESLSKIAGLTKITDEMIEDYD--FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKR 257 (413) T ss_pred cccccccCcccce---eeEeeeeeEEEeehhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccc Confidence 9999977653 32 4788889998864 999987774 36777888899999999999888521111 Q ss_pred ccc--e-ecchhhHHHHHHHHHHHHHHhhccc-cCcceEEEEcHHHHHHHhc--CCc--------cccceeechhhh-hh Q lcl|Aclame:pro 141 TGT--Q-DALGAGLQGALASAWGKLQVLFEDY-GSERAIVFANSLDVAEYIA--KAG--------ITTQTAFGLTYL-VD 205 (296) Q Consensus 141 t~t--~-~~t~~~lQ~Ala~~~~~~~~~Fede-d~~~~VlFvNP~Daa~~l~--~a~--------i~~q~~fg~tyl-~n 205 (296) ++. . ..+...+-..+..++.. .... +.....++|||.+.+.+++ +++ +......+..+. .. T Consensus 258 ~~~~~~~~~~~~~~~~~i~~~~~~----~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~ 333 (413) T protein:vir:81 258 DGIQTLAVSNKDELADSIYKAMTN----ISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPA 333 (413) T ss_pred cccccccccccchhHHHHHHHHHH----hhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCce Confidence 000 0 11122222223333222 2221 2223357899999987653 221 111112222221 13 Q ss_pred hheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHh Q lcl|Aclame:pro 206 FTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLM 279 (296) Q Consensus 206 fLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~l 279 (296) ++|..|+.|..+|.|++++--.. .+|.-...+++.-... +.+|++++.+.... .+. T Consensus 334 l~G~pv~~s~~~~~~~~~~gd~~---~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~-------------d~~-- 395 (413) T protein:vir:81 334 PWGLRTVQSQVVPVGKPVVGAFR---SAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERV-------------GLM-- 395 (413) T ss_pred ecceeeEEcCCCCcccEEEEecc---cEEEEEEecceEEEEeccccchhhcCcEEEEEEEee-------------ccE-- Confidence 77999999999999998753332 2222111112211111 22233333322211 122 Q ss_pred hhhccceEEEEEecCCC Q lcl|Aclame:pro 280 YPERIDGIVKVTLTPGV 296 (296) Q Consensus 280 fpE~~dgvv~~tI~~~v 296 (296) +-..+++++++++++| T Consensus 396 -~~~~~a~~~l~~~~~~ 411 (413) T protein:vir:81 396 -VTFPEAIVQLDVAEVV 411 (413) T ss_pred -EecccceEEEEecCCC Confidence 3445889999999999 No 57 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.07 E-value=8.4e-12 Score=81.31 Aligned_cols=281 Identities=10% Similarity=0.044 Sum_probs=139.8 Q ss_pred Cccccccc-cc---cceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCcee Q lcl|Aclame:pro 1 MVTSRTYP-EE---NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVI 76 (296) Q Consensus 1 ~~~~~~~a-e~---nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~I 76 (296) |--...+. |. ..+.+++-+...-.++.+++-+.+.+-.-++..-+.+||. |.++++|++.....+. -|+||++| T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~-~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWIGDVSAQ-WIGEGDMK 78 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceE-EecCCccc Confidence 32222221 11 1111111111112223344433344444455556777775 5679999998777774 89999999 Q ss_pred chhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc------------ Q lcl|Aclame:pro 77 PLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------------ 142 (296) Q Consensus 77 plskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~------------ 142 (296) |-++.+.+ ..+++.+|++..+ |.|.++.+.. +-.+.-.++|.+++++++++.||.--.++.. T Consensus 79 ~~~~~~f~---~v~~~~~k~~~~~~is~ell~ds~~-~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~ 154 (320) T protein:vir:10 79 PITKGNMT---SQNIAPHKIATIFVASAETVRANPA-NYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSL 154 (320) T ss_pred ccccccee---EEEEeeEEEEEeehhhHHHHhcChH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccc Confidence 99998875 4788999999975 9999965553 6778899999999999999999852211110 Q ss_pred --ceecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc-----cccceeechh----hhhhhheeEE Q lcl|Aclame:pro 143 --TQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG-----ITTQTAFGLT----YLVDFTGTVI 211 (296) Q Consensus 143 --t~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~-----i~~q~~fg~t----yl~nfLG~~I 211 (296) +...+.+.+.. +...+.++............+.++||.+...+++-.+ +.....+++. ....++|..| T Consensus 155 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv 233 (320) T protein:vir:10 155 ADPGGATASDLTA-YDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPT 233 (320) T ss_pred eeccccccccccc-HHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeee Confidence 00011122211 1111222333333434456799999999887764221 1111111110 0113679999 Q ss_pred EEeccCCCceEEEEcccceEEEEecCcchhhh----hhhccc--cccccceE--EEeccccceeehhhhhhHHHHhhhhc Q lcl|Aclame:pro 212 ISTNDVTKGEIWATVPENIIFAYINPNNSELA----KEFNLY--GDPTGYIG--MNHFQENTTLTIQTLLVSGMLMYPER 283 (296) Q Consensus 212 I~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~----~~f~~~--td~tGliG--v~h~~~~~~~t~et~~~~~~~lfpE~ 283 (296) +.+..+|.|+.+..-.|-=++++.+. +++. +...+. +++.+... +.++.-.-++.. -+.+ =+.+ T Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~---~~d~---~v~~ 305 (320) T protein:vir:10 234 ILSDHVADGTTVGYMGDFRNVIWGQV--GGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEA---EYAF---HNND 305 (320) T ss_pred EecCCCCCCceEEEEeecceEEEEEe--cCeEEEEeecceeeeccccccccchhhhcCcEEEEEEE---eecc---EEec Confidence 99999999986532211111112121 1110 000000 00000000 000000000000 0111 1344 Q ss_pred cceEEEEE-ecCCC Q lcl|Aclame:pro 284 IDGIVKVT-LTPGV 296 (296) Q Consensus 284 ~dgvv~~t-I~~~v 296 (296) .+++++++ +++|= T Consensus 306 ~~a~~~l~~~~ap~ 319 (320) T protein:vir:10 306 KDAFVKLTNVVTPD 319 (320) T ss_pred ccceEEEEeccCCC Confidence 56666665 44444 No 58 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.07 E-value=1e-11 Score=80.88 Aligned_cols=255 Identities=15% Similarity=0.118 Sum_probs=145.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ....++++.-+.+. --++.+++-+.+..-.-+++..+.+|++ |+ .++|+....+.+. -|+||+++|.+. T Consensus 137 ~~~~~~~~~gg~~v--------P~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~-~v~E~~~~~~~~ 205 (425) T protein:vir:95 137 FRNLRAVAGGELTI--------PEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPAT-WIEQSGALPTGD 205 (425) T ss_pred HHhhcccccCceec--------cHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCcccc-cccccccccccc Confidence 11112222222222 2234555555555555567777888874 54 5899987777775 899999999887 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc-----------cc--ee Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT-----------GT--QD 145 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat-----------~t--~~ 145 (296) ..+- +..+++.+|++.-+ |.|.++.+.. +-...-.++|+..|+.++++.++.-=.+++ .. +. T Consensus 206 ~~~f--~~i~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~ 282 (425) T protein:vir:95 206 VGTI--ASIDFDGFKVGKVTFVDNYLLQDSII-NLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVT 282 (425) T ss_pred cccc--ceeeeeheeeeeeehhhHHHHhccHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccc Confidence 5321 35788889998864 9999865544 456677899999999999998885321110 00 00 Q ss_pred --cchhhHHHHHHHHHHHHHHhhcc-c-cCcceEEEEcHHHHHHHhc------CC--ccccceeechhhhhhhheeEEEE Q lcl|Aclame:pro 146 --ALGAGLQGALASAWGKLQVLFED-Y-GSERAIVFANSLDVAEYIA------KA--GITTQTAFGLTYLVDFTGTVIIS 213 (296) Q Consensus 146 --~t~~~lQ~Ala~~~~~~~~~Fed-e-d~~~~VlFvNP~Daa~~l~------~a--~i~~q~~fg~tyl~nfLG~~II~ 213 (296) +....+ ..+.++...... + .....+.++||.+....|. ++ +.-.+...+.+. .++|..|+. T Consensus 283 ~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~--~l~G~pvv~ 355 (425) T protein:vir:95 283 VEADNNLL-----KNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTP--DLLGLRVVF 355 (425) T ss_pred cccccchH-----HHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCc--cccceeeEE Confidence 011111 112222222221 1 1235678899998643332 22 111111111111 377999999 Q ss_pred eccCCCceEEEEcccceEEEEecCcchhhhhhh----ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEE Q lcl|Aclame:pro 214 TNDVTKGEIWATVPENIIFAYINPNNSELAKEF----NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVK 289 (296) Q Consensus 214 S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f----~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~ 289 (296) |..+|++.+++- |...|++... +++.-.. .+..|.+++.+....- |... ..++++. T Consensus 356 ~~~~~~~~i~~G---d~~~~~~~~~-~~~~i~~~~~~~f~~~~~~~~~~~r~d-------------~~~~---~~~a~~~ 415 (425) T protein:vir:95 356 NNFLDDDTVLFG---EFEQYTLVER-ENITIDSSTHVKFTEDQTAFRGKGRFD-------------GKPV---KPEAFVL 415 (425) T ss_pred cCcCCCccEEEE---ecccEEEEee-cceEEEeecccccccCceEEEEEEeeC-------------cEee---cccceEE Confidence 999999988763 4444555443 3332222 2334555555543221 2222 3478999 Q ss_pred EEecCCC Q lcl|Aclame:pro 290 VTLTPGV 296 (296) Q Consensus 290 ~tI~~~v 296 (296) .+|+.|| T Consensus 416 ~~i~~~~ 422 (425) T protein:vir:95 416 VTITDPV 422 (425) T ss_pred EEecCcC Confidence 9999999 No 59 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.05 E-value=6.1e-12 Score=82.05 Aligned_cols=283 Identities=9% Similarity=0.008 Sum_probs=139.8 Q ss_pred Cccccccccccc----eehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCcee Q lcl|Aclame:pro 1 MVTSRTYPEENL----IKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVI 76 (296) Q Consensus 1 ~~~~~~~ae~nl----~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~I 76 (296) |.-+..+.-+|. +.+++-+...-..+.+++-+.+.+-.-++..-+.+||. +.++++|+++..+.+. -|+||++| T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~-~v~Eg~~~ 78 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDVSAQ-WIGEGDMK 78 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceE-EecCCccc Confidence 333333322221 11122111111233344433333333445556778875 6679999998888885 89999999 Q ss_pred chhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc--------ceec Q lcl|Aclame:pro 77 PLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG--------TQDA 146 (296) Q Consensus 77 plskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~--------t~~~ 146 (296) |.++.+.+ ..+++.||++..+ |.|.++.+.. +..+.-.++|.++++++++..|+.--.++.. .... T Consensus 79 ~~~~~~f~---~i~~~~~k~~~~~~iS~e~l~ds~~-~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~ 154 (318) T protein:vir:24 79 PITKGNMT---SQTIAPHKIATIFVASAETVRANPA-NYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISI 154 (318) T ss_pred ccccccee---EEEEeeEEEEEeehhhHHHhhcChH-HHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccc Confidence 99998875 4778889999865 9999975654 6888999999999999999999853321110 0000 Q ss_pred c-hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccc----------cceeechhhhhhhheeEEEEec Q lcl|Aclame:pro 147 L-GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT----------TQTAFGLTYLVDFTGTVIISTN 215 (296) Q Consensus 147 t-~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~----------~q~~fg~tyl~nfLG~~II~S~ 215 (296) . ..+........+.++....+..+....+.++||.+...+++-.+-. .....++. ...++|..++.+. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~-~~~i~g~pv~~~~ 233 (318) T protein:vir:24 155 ADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFR-SGRIVARPTILSD 233 (318) T ss_pred cccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCcccccc-CceEEEEeeEEeC Confidence 0 0000000011111222222333445568999999998776422111 11111111 1136688899999 Q ss_pred cCCCceEEEEcccceEEEEecCcchhhhh----hhccccccccceEEEeccccce-eehhhhh-hHHHHhhhhccceEEE Q lcl|Aclame:pro 216 DVTKGEIWATVPENIIFAYINPNNSELAK----EFNLYGDPTGYIGMNHFQENTT-LTIQTLL-VSGMLMYPERIDGIVK 289 (296) Q Consensus 216 kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~----~f~~~td~tGliGv~h~~~~~~-~t~et~~-~~~~~lfpE~~dgvv~ 289 (296) .++.|+.++...|==.++|.+. +++.= ...+. ..+.--|..|+.-.++ ..+-... +.+. +.+.+++++ T Consensus 234 ~~~~~~~~~~~gdfs~~~~~~~--~~l~i~~~~~~~~~-~~~~~~~~~~~~f~~~~~~~r~~~r~d~~---v~~~~a~~~ 307 (318) T protein:vir:24 234 HVVEGTTVGFMGDFSQLIWGQI--GGLSFDVTDQATLN-LGTVESPNFVSLWQHNLVAVRVEAEYAFH---CNDAEAFVA 307 (318) T ss_pred CCCCCccEEEEeecceEEEEEe--cCeEEEEeecccee-ccccccccchhhhhcCcEEEEEEEEEccE---EecccceEE Confidence 9999987543222111223322 11210 00000 0001111111110001 0011010 1111 233455666 Q ss_pred EEecCCC Q lcl|Aclame:pro 290 VTLTPGV 296 (296) Q Consensus 290 ~tI~~~v 296 (296) ++...+= T Consensus 308 i~~~~a~ 314 (318) T protein:vir:24 308 LTNVVSG 314 (318) T ss_pred EEeeccC Confidence 5543222 No 60 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.05 E-value=2.5e-12 Score=84.17 Aligned_cols=272 Identities=11% Similarity=0.050 Sum_probs=139.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-|+.+. +.+..+.+ +-++.++.. .-.-++..-+.+||.. .++++|+++..+.|. -|+||+++|.++ T Consensus 1 m~t~t~g---g~liP~~~----~~~ii~~l~----~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~-wv~E~~~~~~s~ 67 (303) T protein:vir:97 1 MGTETSK---ASLFDKHL----VSDLINKVK----GHSSLAKLSSQKPIPF-NGSKEFTFTLDSDID-VVAENGKKTHGG 67 (303) T ss_pred CcccCCC---CeEcchhH----HHHHHHHHH----hhchhhhhcceeecCC-CceEEEEEecCcceE-EeecCccccccc Confidence 6655332 22222222 223333322 2222444446677764 568999998888886 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhc--CCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------------ Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYG--SNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------------ 144 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsG--ygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------------ 144 (296) ++.+ ..+++.||.+.-+ |.|-++.+. ..+-.++-.++|+++++++++..|+.-...++++. T Consensus 68 ~~f~---~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~ 144 (303) T protein:vir:97 68 LSLE---PVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSK 144 (303) T ss_pred ccee---eEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccc Confidence 8875 5788889998875 999875332 22345678999999999999999996543222211 Q ss_pred --ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccc------cceeechhhhhhhheeEEEEecc Q lcl|Aclame:pro 145 --DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT------TQTAFGLTYLVDFTGTVIISTND 216 (296) Q Consensus 145 --~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~------~q~~fg~tyl~nfLG~~II~S~k 216 (296) .....+--...+..+.++...+.+.+...-..++||.+...+++-.+-. .+...|++.. .++|..|+.|+. T Consensus 145 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~-~l~G~Pv~~s~~ 223 (303) T protein:vir:97 145 VTQVVKFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPD-SINGLKSSVNTT 223 (303) T ss_pred cccccccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCc-eecceeeEEecc Confidence 0000000011234455555566554444557899999999887422111 1111222222 378999999999 Q ss_pred CCCceEEEEcc-----cce-EEEEecCcchhhhhhhccccccccceEEEeccccceeehh-hhhhHHHHhhhhccceEEE Q lcl|Aclame:pro 217 VTKGEIWATVP-----ENI-IFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQ-TLLVSGMLMYPERIDGIVK 289 (296) Q Consensus 217 V~~G~~~~t~~-----~Nl-~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~e-t~~~~~~~lfpE~~dgvv~ 289 (296) ||.+...+... .|. +.+++..+ +++.=....++|++|- ++ |-...+-..+- +.-+.+..+-|+- +++ T Consensus 224 v~~~~~~~~~~~~~~~Gdf~~~~~~~~~-~~~~~~~~~~~~~d~~-~~-~~~~~n~~~~r~~~r~~~~v~~p~a---f~~ 297 (303) T protein:vir:97 224 VGAGADEAESKDLVIIGDFESMFKWGYA-KQIPMEIIKYGDPDNS-GK-DLKGYNQIYLRAEAYIGWGILDAKS---FAR 297 (303) T ss_pred cCCccccCCCccEEEEeeccccEEEEEe-cCcEEEEeeccCCCCc-ch-hhhhcCcEEEEEEEEeccEeecccc---eEE Confidence 99765332221 121 11222222 2222222223333221 00 00000000000 0111222223322 222 Q ss_pred EEecCCC Q lcl|Aclame:pro 290 VTLTPGV 296 (296) Q Consensus 290 ~tI~~~v 296 (296) ++ .++| T Consensus 298 l~-~~~~ 303 (303) T protein:vir:97 298 VT-KGEV 303 (303) T ss_pred ee-CCCC Confidence 22 2223 No 61 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.03 E-value=1.6e-11 Score=79.82 Aligned_cols=267 Identities=15% Similarity=0.074 Sum_probs=144.9 Q ss_pred cccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhheeee Q lcl|Aclame:pro 5 RTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERK 84 (296) Q Consensus 5 ~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~ 84 (296) --+-.++.+++.+-+...-..+.++|-+.+.+-.-++.+-+.+|+. |.+.++|.+.- ..+ .-|+||+++|.++.+.+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~~~~~~-~~a-~~v~E~~~~~~~~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMT-KPEEEFTFMSG-VGA-FWVDEAERIQTSKPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecC-CCcEEEEEEcC-Cce-eeeecCcccccccccee Confidence 1111222333333333333456666666666666677777899985 56678999864 445 48999999999998875 Q ss_pred ecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHH---------HH---hcCccceecchhh Q lcl|Aclame:pro 85 IHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVT---------AL---KTGTGTQDALGAG 150 (296) Q Consensus 85 ~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~---------aL---ktat~t~~~t~~~ 150 (296) ..++..||++..+ |.|.++.+. -+-.+.-.++|.+++++++++.++. .+ ..++.+....... T Consensus 78 ---~v~l~~~k~~~~~~is~ell~ds~-~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~ 153 (299) T protein:vir:41 78 ---KAKMRSKKMGVIIPTTKENLNYSV-TNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANK 153 (299) T ss_pred ---EEEEeeEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeecccccc Confidence 5788899999875 999996455 3567888999999999999998874 11 1111111111111 Q ss_pred HHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCceEEEE Q lcl|Aclame:pro 151 LQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGEIWAT 225 (296) Q Consensus 151 lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~~~~t 225 (296) + +.+.++....++......+.++||.+...+++-.+-.-+..| ++. . .++|..|+.+..+|.|+ T Consensus 154 ~-----~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-~-~l~G~PV~~~~~~~~~~---- 222 (299) T protein:vir:41 154 Y-----DDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV-D-DVLGLPIAYTPKYTFGD---- 222 (299) T ss_pred H-----HHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-c-eecceeeEEecccCCCC---- Confidence 1 222233334455444566899999998877742211111111 211 1 37899999999999874 Q ss_pred cccceEEEEecCcc------hhhhhh------hccccccccceEEEeccccceee-hhhhhhHHHHhhhhccceEEEEEe Q lcl|Aclame:pro 226 VPENIIFAYINPNN------SELAKE------FNLYGDPTGYIGMNHFQENTTLT-IQTLLVSGMLMYPERIDGIVKVTL 292 (296) Q Consensus 226 ~~~Nl~~ay~~~~~------g~~~~~------f~~~td~tGliGv~h~~~~~~~t-~et~~~~~~~lfpE~~dgvv~~tI 292 (296) ++..+++.|.+. +++.=. +....|+.|- .|+.-.++.+ +-. ..-+=+-+.+.+++++++. T Consensus 223 --~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~r~--~~~~d~~v~~~~A~~~l~~ 295 (299) T protein:vir:41 223 --KDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGK---PLNLAERDMAAIKA--TFEVGFMVVKDEAFSAVQP 295 (299) T ss_pred --CceEEEEEecccEEEEEecCcEEEEeeccccccccccccc---chhhhhcCcEEEEE--EEEeccEEecccceEEEEe Confidence 222233333210 111000 0111111111 1110000000 000 0000112334567777776 Q ss_pred cCCC Q lcl|Aclame:pro 293 TPGV 296 (296) Q Consensus 293 ~~~v 296 (296) +++= T Consensus 296 ~aa~ 299 (299) T protein:vir:41 296 KAGN 299 (299) T ss_pred ccCC Confidence 6666 No 62 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.00 E-value=1e-11 Score=80.84 Aligned_cols=263 Identities=11% Similarity=-0.002 Sum_probs=132.7 Q ss_pred Ccccccccccc-ceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeee--------eeeeecccCccc Q lcl|Aclame:pro 1 MVTSRTYPEEN-LIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYA--------GYDVTLAEGNVP 71 (296) Q Consensus 1 ~~~~~~~ae~n-l~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk--------~~yig~A~gdVa 71 (296) .....+....+ ++....... --.+.+.+......-.-+.+.-+.+|+. +..+++|+ |...+.+. -|+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~--p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~-~v~ 191 (419) T protein:vir:94 116 NRLLSRDAPAGTITNPNVPHL--PQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAA-VVP 191 (419) T ss_pred HHhhccccccccccCCccccc--chhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeeccccccccccCcccc-eec Confidence 00000111111 111110000 0001111111111111123334445554 44455554 33444554 889 Q ss_pred CCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC--------- Q lcl|Aclame:pro 72 EGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG--------- 140 (296) Q Consensus 72 EGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta--------- 140 (296) ||+.+|.++++.. ..++++||++..+ |.|.++.++ +-...-.++|+.+++.+++..|+.-=.++ T Consensus 192 Eg~~~~~~~~~~~---~i~~~~~k~~~~~~is~ell~d~~--~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~ 266 (419) T protein:vir:94 192 EGTAKPQSTLSFD---TITTTLKTVAHWLPITRQAADDNS--QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTP 266 (419) T ss_pred CCcccccccccee---eEEeeeeeEEEeehhhHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccc Confidence 9999999998875 5889999999865 999997553 56677888999999999999998410000 Q ss_pred --------ccceecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccce------eechhhhhhh Q lcl|Aclame:pro 141 --------TGTQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQT------AFGLTYLVDF 206 (296) Q Consensus 141 --------t~t~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~------~fg~tyl~nf 206 (296) ..+...+.... ...+.++............+.++||.+...+++-..-+.+. ..++.-. .+ T Consensus 267 ~~~~~~~~~~~~~~t~~~~----~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~-~l 341 (419) T protein:vir:94 267 GIGTYQQPKPTAPATDEPP----LVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATP-RI 341 (419) T ss_pred ccccccccccccccccchh----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCc-cc Confidence 00001111111 22222333333333334458999999988876433211111 1122222 37 Q ss_pred heeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhh------ccccccccceEEEeccccceeehhhhhhHHHHhh Q lcl|Aclame:pro 207 TGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMY 280 (296) Q Consensus 207 LG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lf 280 (296) +|..|+.+..+|+|++++--..+ +|.-...+++.-.. .+..|.+++....+ +.+.. T Consensus 342 ~G~pV~~~~~~~~~~~~~gd~~~---~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r-------------~d~~v-- 403 (419) T protein:vir:94 342 WGLNVVSTVAIAQGTALVGGFRQ---GATLWSRQGITVLMTDSHADFFTANTLVILAEFR-------------ANLAV-- 403 (419) T ss_pred cceeeEEcCCCCCccEEEeeccc---eEEEEEecceEEEEeccccchhhcCcEEEEEEEe-------------eccEE-- Confidence 89999999999999987543322 12111111121111 12234444333322 12222 Q ss_pred hhccceEEEEEecCCC Q lcl|Aclame:pro 281 PERIDGIVKVTLTPGV 296 (296) Q Consensus 281 pE~~dgvv~~tI~~~v 296 (296) -..++++++++++++ T Consensus 404 -~~~~a~~~~~~~aa~ 418 (419) T protein:vir:94 404 -YQPKAFVRVTFAAAT 418 (419) T ss_pred -eccccEEEEEeccCC Confidence 345799999999999 No 63 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.00 E-value=3.3e-11 Score=78.06 Aligned_cols=256 Identities=9% Similarity=0.029 Sum_probs=141.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeee-ecccCcccCCceec- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDV-TLAEGNVPEGEVIP- 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yi-g~A~gdVaEGe~Ip- 77 (296) |+.+++.- ++++=+...--+|.+++-+.+..-.-+++.-+.+|+..+. ++.+|+|... +.|. -|+||+++| T Consensus 1 ~l~~~~~~-----t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~-~v~Eg~~~~~ 74 (293) T protein:vir:48 1 MLDSKTDH-----SGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLAN-IDDEAGKIAD 74 (293) T ss_pred Cceeeccc-----ccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCccee-eecCCccccc Confidence 54444321 1111111112244444444444445556666778887664 6778888644 4564 899999999 Q ss_pred hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-ecchhhHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-DALGAGLQGA 154 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-~~t~~~lQ~A 154 (296) .++.+.. ..+++.||++..+ |.|.++.+++ +-.+.-.++|+++++++.+..|+.-+.+.+... ..+.+.|.. T Consensus 75 ~~~~~~~---~i~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~d~i~~- 149 (293) T protein:vir:48 75 IDDPKLS---LIKYTIKRYAGISTVTNSLLADSAE-NILAWLSGWIAKKVVVTRNKAILGVVDKLPTKPTLTKWDDIID- 149 (293) T ss_pred cccccee---EEEEeeeEEEEeehhhHHHHhhhhH-HHHHHHHHHHHHHHHHHHHhHHhhccccccccccccCHHHHHH- Confidence 5777764 4788999999875 9999977766 467889999999999999999998886654322 223334433 Q ss_pred HHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEe--ccCCCceEEEEcc Q lcl|Aclame:pro 155 LASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIIST--NDVTKGEIWATVP 227 (296) Q Consensus 155 la~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S--~kV~~G~~~~t~~ 227 (296) .+.++. .......+.++||.+.+.+++-.+-.-+..| +++- ..++|..|+.+ .-++.+. . T Consensus 150 ---~~~~l~----~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~-~~l~G~Pv~~~~~~~~~~~~-----~ 216 (293) T protein:vir:48 150 ---LEAKVD----PAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTG-YSIAGFAVKEISDRWLPNAS-----S 216 (293) T ss_pred ---HHHhhh----hhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCC-ceecceeeEEecccccCCcc-----C Confidence 333332 2223456889999998876442221212222 1111 13788766543 3344321 1 Q ss_pred cceEEEEecCc-------chhhhhhh------ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecC Q lcl|Aclame:pro 228 ENIIFAYINPN-------NSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTP 294 (296) Q Consensus 228 ~Nl~~ay~~~~-------~g~~~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~ 294 (296) ++..+++.|.+ .+++.=.. .+..|++++.+... +.+. +-+.++|+++++++ T Consensus 217 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r-------------~d~~---~~~~~a~~~l~~~~ 280 (293) T protein:vir:48 217 GVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDR-------------FDVV---ATDTEAFVPASFKA 280 (293) T ss_pred CceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEe-------------eCcE---EecccceEEEEeec Confidence 22223332221 01111000 01123333333221 1222 33447889999987 Q ss_pred CC Q lcl|Aclame:pro 295 GV 296 (296) Q Consensus 295 ~v 296 (296) ++ T Consensus 281 ~~ 282 (293) T protein:vir:48 281 IA 282 (293) T ss_pred cc Confidence 77 No 64 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.99 E-value=5.7e-11 Score=76.75 Aligned_cols=260 Identities=16% Similarity=0.102 Sum_probs=139.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) .....+....+++.. ...-+++.+ .+.+..-+..+.+..|+..|..+++|.+.....+. -|+||+++|.++ T Consensus 110 ~~~~t~~~~g~~~~~-----~~~~~~i~~---~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~E~~~~~~~~ 180 (392) T protein:vir:13 110 KRDGTKAGNPNVLSR-----TLYGQLIAQ---AVERSAIMRGGASTFTTSDANPMDFTVITGRATAG-IVGETAEIPESY 180 (392) T ss_pred hhcccccCCCccccc-----cchHHHHHH---HHhhhhhhhhcceeeecCCCceeEEEEEcCCccee-eecccccccccc Confidence 111111111122211 111122211 12222123345677888899999999987766664 799999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------HhcCccce----e Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA---------LKTGTGTQ----D 145 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~a---------Lktat~t~----~ 145 (296) .+.. ..++..+|++.-+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.- |...+... + T Consensus 181 ~~f~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~ 256 (392) T protein:vir:13 181 PATT---QRSMGGFKYGFASVVSYEFATDQVL-DLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGE 256 (392) T ss_pred ccee---eEEeeeeeEEeeehhHHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccc Confidence 8764 5788889998865 9999975544 4556788999999999999999851 11111000 0 Q ss_pred cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec--hhhh--hhhheeEEEEeccCCCce Q lcl|Aclame:pro 146 ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG--LTYL--VDFTGTVIISTNDVTKGE 221 (296) Q Consensus 146 ~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg--~tyl--~nfLG~~II~S~kV~~G~ 221 (296) +++..+ .++.+.++............+.++||.+.+.+++=.+=.-+..|. .+-. ..++|..|+.+..+|.++ T Consensus 257 ~~~~~~---~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~ 333 (392) T protein:vir:13 257 ADADSK---VSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADK 333 (392) T ss_pred cccccc---cHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCCCCCc Confidence 111110 011111221112222223467889999998765321111122221 1111 137899999999999999 Q ss_pred EEEEcccceEEEEecCcchhh--hhhhc--cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNNSEL--AKEFN--LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~g~~--~~~f~--~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +++--..+ |++... +++ ...-. +..|.+++.+..+.- |. +-..++++.++++++- T Consensus 334 i~~Gdf~~---~~i~~~-~~~~i~~~~~~~~~~~~~~~r~~~r~d-------------~~---~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 334 VLFADLSK---YRVRFA-GSLRVDRSVDAKFSTDQIVYRFLQRAD-------------GL---LVDARGAKVLTVTPAA 392 (392) T ss_pred EEEeeccc---eeEEee-cceEEEeeccccccCCcEEEEEEEEec-------------cE---EecccceEEEEeeccC Confidence 88644332 333222 222 11111 233555555443321 11 2345677777887766 No 65 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.96 E-value=1.1e-10 Score=75.28 Aligned_cols=260 Identities=15% Similarity=0.072 Sum_probs=151.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) +...+++ ++-+...--+|.+++-+.+.+..-+++.-|.+|+..| ..++|++...+.+. -++||+.+| .+ T Consensus 83 ~~~~~~~--------~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~-~~~i~~~~~~~~a~-~~~E~~~~~~~~ 152 (390) T protein:vir:40 83 VIAGNGF--------AGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTAT-TEWIISVGDVATAW-WGPLCAEIKEVL 152 (390) T ss_pred HHhccCc--------ccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eeEEEEEcCCccee-eeccccccCccc Confidence 2222222 2334444456677777777777778888899998654 46789987777775 899999997 45 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------Hhc---Ccc--c Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA---------LKT---GTG--T 143 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~a---------Lkt---at~--t 143 (296) +.+.. ..+++.||++.-+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.- |.. .+. . T Consensus 153 ~~~f~---~i~l~~~k~~~~i~iS~ell~ds~~-~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~ 228 (390) T protein:vir:40 153 DNGFD---KIQTGMYKLSAYIPVCNAMLDLGPS-WLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEH 228 (390) T ss_pred cccce---eeEeeeeeEEEeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccccccc Confidence 65543 5888999998854 9999976655 3567788999999999999988851 110 000 0 Q ss_pred eecchhhHHH-HHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCc-c-ccceeechhhhhhhheeEEEEeccC Q lcl|Aclame:pro 144 QDALGAGLQG-ALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAG-I-TTQTAFGLTYLVDFTGTVIISTNDV 217 (296) Q Consensus 144 ~~~t~~~lQ~-Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~-i-~~q~~fg~tyl~nfLG~~II~S~kV 217 (296) ...++..+-. .....+.++...+.+. .....+.+|||.+.+.+|+.-. + ..++.+-... .++|..||.|+.+ T Consensus 229 ~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~--~~~g~pvv~~~~~ 306 (390) T protein:vir:40 229 PVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI--LPVPLEIVQSVAV 306 (390) T ss_pred ccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc--CCCceeEEEcCCC Confidence 0001111111 0112222333333221 1235688999999887776321 1 1222221111 1468899999999 Q ss_pred CCceEEEEcccceEEEEecCcchhhhhh----hccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 218 TKGEIWATVPENIIFAYINPNNSELAKE----FNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 218 ~~G~~~~t~~~Nl~~ay~~~~~g~~~~~----f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) |+|++++-...+ |++-.+ +++.=. -.+..|+++|.+..+.--. +-...++++..|+ T Consensus 307 p~~~i~~Gd~s~---~~i~~~-~~~~v~~~~~~~f~~~~~~~r~~~r~dg~----------------v~~~~A~~~l~~~ 366 (390) T protein:vir:40 307 PVGKAVAGRAKD---YFMGIG-SEQVIRTSTEYRLLDDETLYYAKQYANGR----------------PKDNSSFLVFDIT 366 (390) T ss_pred CCCcEEEEeece---EEEEee-cceEEEecchhhhhcCcEEEEEEEEeCCE----------------EecccceEEEEee Confidence 999988755544 333222 333211 1233466666665443211 1122356666665 Q ss_pred CCC Q lcl|Aclame:pro 294 PGV 296 (296) Q Consensus 294 ~~v 296 (296) +.. T Consensus 367 ~~~ 369 (390) T protein:vir:40 367 GLE 369 (390) T ss_pred ccC Confidence 553 No 66 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.94 E-value=5.7e-11 Score=76.73 Aligned_cols=261 Identities=18% Similarity=0.111 Sum_probs=140.2 Q ss_pred Cccccc---ccc-ccceehhhhhhhhhhhhHHHHhhhHHHHHHH----hCcccccccCCCCeeeeeeeeeeecccCcccC Q lcl|Aclame:pro 1 MVTSRT---YPE-ENLIKSTDLKYPITIDVTNKFQENISKLLEM----LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~---~ae-~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~----LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaE 72 (296) ....|. -++ ..++.+.+=+....- .+++-|.++++- ..+.+..++..|..+++|+++-...+. -|+| T Consensus 98 ~~~~r~~~~~~~~~~~t~~~~g~~~~~~----~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~-wv~E 172 (390) T protein:vir:62 98 LGEARSFEFAPEKRDGTKAGNPNVLSRT----LYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSAS-IVGE 172 (390) T ss_pred hhhhHHHHhhhhhhcccccCCCcccccc----chHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCccee-eecc Confidence 000000 000 011111110001011 122223333322 234466788888889999987766774 7999 Q ss_pred CceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHH-------HHhcC--- Q lcl|Aclame:pro 73 GEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVT-------ALKTG--- 140 (296) Q Consensus 73 Ge~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~-------aLkta--- 140 (296) |+.||-+..+.. ..+++.||++.-+ |.|.++.+.+ +-...-.++|..+|+.+++..|+. .+... T Consensus 173 ~~~~~~~~~~f~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~ 248 (390) T protein:vir:62 173 TAEIPESYPATA---QRSMGGFKYGFASVVSYEFATDQVL-DLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPA 248 (390) T ss_pred ccccccccccee---eeEeeeeeEEeehHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHhhhhccCCcccccccccccc Confidence 999999998864 5888999999865 9999976654 456678899999999999999874 11110 Q ss_pred cccee-cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHH--hcCCccccceee-----chhhhhhhheeEEE Q lcl|Aclame:pro 141 TGTQD-ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEY--IAKAGITTQTAF-----GLTYLVDFTGTVII 212 (296) Q Consensus 141 t~t~~-~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~--l~~a~i~~q~~f-----g~tyl~nfLG~~II 212 (296) +.+.. +..+.+ -++.+.++............+.++||.....+ |++++ -+..| ++.- ..++|..|+ T Consensus 249 ~~~~~~~~~~~~---~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~--g~~l~~~~~~~g~~-~~l~G~Pv~ 322 (390) T protein:vir:62 249 TATFLATDTDSK---VSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDAN--GQYLWQSGLTVGAP-SLFNGKVVE 322 (390) T ss_pred ccceeccccccc---chHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccC--CCeeecCCcCCCcc-ceecccceE Confidence 00000 011100 01111111111122122345789999998765 44442 11111 1111 137899999 Q ss_pred EeccCCCceEEEEcccceEEEEecCcchhh--hhhhc--cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 213 STNDVTKGEIWATVPENIIFAYINPNNSEL--AKEFN--LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 213 ~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~--~~~f~--~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) .+..+|.+++++- |..-|++..+ +++ ...-. +..|++++....+.- |. +-..++|+ T Consensus 323 ~~~~~p~~~i~~g---d~s~~~i~~~-~~~~v~~~~~~~~~~~~~~~~~~~r~d-------------~~---~~~~~A~~ 382 (390) T protein:vir:62 323 TDDGMPADKILFA---DLSKYRVRFA-GSLRVDRSVDAKFSTDQIVYRFLQRAD-------------GL---LVDARGAK 382 (390) T ss_pred EecCCCCccEEEe---eccceeEEee-cceEEEeeccccccCCcEEEEEEEEeC-------------cE---eechhheE Confidence 9999999987753 3444444433 222 11111 223555555442211 11 44556788 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) ..+++++- T Consensus 383 ~l~~~~~a 390 (390) T protein:vir:62 383 VLTVTPGA 390 (390) T ss_pred EEEeecCC Confidence 88888877 No 67 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.94 E-value=7.1e-11 Score=76.21 Aligned_cols=282 Identities=11% Similarity=0.079 Sum_probs=135.7 Q ss_pred Cccccccccc--------cceehhh-hhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCccc Q lcl|Aclame:pro 1 MVTSRTYPEE--------NLIKSTD-LKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVP 71 (296) Q Consensus 1 ~~~~~~~ae~--------nl~~~~d-l~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVa 71 (296) |.+.+.-+-+ .++++++ -+...-.++++.+-+.+.+-.-++++-+.+||. +.++++|+++....+. -|+ T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~-~v~ 78 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSAS-WIG 78 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceE-Eec Confidence 2222211100 1111111 000111223333333333333455666778876 5679999998777775 899 Q ss_pred CCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc---------- Q lcl|Aclame:pro 72 EGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT---------- 139 (296) Q Consensus 72 EGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt---------- 139 (296) ||++||-++.+.+ ..+++.+|++..+ |.|.++.+. -+..+.-.++|.+++++++++.++.--.+ T Consensus 79 Eg~~~~~~~~~f~---~i~~~~~k~~~~v~iS~ell~~s~-~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~ 154 (326) T protein:vir:42 79 EGDMKPITKGNMT---SQTIAPHKIATIFVASAETVRANP-ANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTT 154 (326) T ss_pred CCcccccccccee---EEEEeeEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc Confidence 9999999998875 5889999999976 999997665 45778899999999999999999842110 Q ss_pred -CccceecchhhHHHH--HHHH-HHHHHHhhccccCcceEEEEcHHHHHHHhc--CCc---cccceeech----hhhhhh Q lcl|Aclame:pro 140 -GTGTQDALGAGLQGA--LASA-WGKLQVLFEDYGSERAIVFANSLDVAEYIA--KAG---ITTQTAFGL----TYLVDF 206 (296) Q Consensus 140 -at~t~~~t~~~lQ~A--la~~-~~~~~~~Feded~~~~VlFvNP~Daa~~l~--~a~---i~~q~~fg~----tyl~nf 206 (296) ........+.+..+. .+.. +..+............+.++||.+.+++++ +++ +-.....++ ...-.+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l 234 (326) T protein:vir:42 155 KEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRI 234 (326) T ss_pred cccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCcee Confidence 000000010110000 0000 111112222223334578899999998763 221 111111111 111137 Q ss_pred heeEEEEeccCCCceEEEEcccceEEE-EecCcchhhhhhhccccccccce------EEEec-cccceeehhhhhhHHHH Q lcl|Aclame:pro 207 TGTVIISTNDVTKGEIWATVPENIIFA-YINPNNSELAKEFNLYGDPTGYI------GMNHF-QENTTLTIQTLLVSGML 278 (296) Q Consensus 207 LG~~II~S~kV~~G~~~~t~~~Nl~~a-y~~~~~g~~~~~f~~~td~tGli------Gv~h~-~~~~~~t~et~~~~~~~ 278 (296) +|..|+.+..+|.|++++..- |..-+ |.+. +++. +.. .+|..+- +-.|+ ...+...+-...-.+. T Consensus 235 ~G~pv~~~~~~~~~~~~~~~G-d~s~~~~~~~--~~~~--v~~-~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~- 307 (326) T protein:vir:42 235 VARPTILSDHVASGTVVGYQG-DFRQLVWGQV--GGLS--FDV-TDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAF- 307 (326) T ss_pred eeeeEEEcCCCCCCceEEEEe-ecceEEEEEe--cceE--EEE-eecceeeecccccccchhhhhcCcEEEEEEEEecc- Confidence 799999999999999875422 22222 2222 1110 000 0111000 00000 0000010000111111 Q ss_pred hhhhccceEEEEEecCCC Q lcl|Aclame:pro 279 MYPERIDGIVKVTLTPGV 296 (296) Q Consensus 279 lfpE~~dgvv~~tI~~~v 296 (296) =+.+.+++++.+-.++= T Consensus 308 -~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 308 -HCNDKDAFVKLTNVDAT 324 (326) T ss_pred -EEecccceEEEeecccc Confidence 12344555554433222 No 68 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.94 E-value=1.5e-10 Score=74.43 Aligned_cols=265 Identities=11% Similarity=0.063 Sum_probs=133.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |.+|--+ +.. -+|++.|-+.+.+-.-++..-+.+|+..| .+++|++.....|. -|+||+++|.++ T Consensus 1 ma~~gG~-----lip--------~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~-~v~Eg~~~~~~~ 65 (298) T protein:vir:94 1 MVLNKGT-----LFD--------PELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEID-VVAESGKKTHGG 65 (298) T ss_pred Ceecccc-----ccC--------hhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceE-EeeCCccccccc Confidence 5443311 111 13344443334433345555567777665 57899987776774 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCc---hhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----------- Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNE---AVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----------- 144 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygd---av~etd~QL~~~iq~kIdnD~~~aLktat~t~----------- 144 (296) .+.. ..+++.+|++..+ |.|.++. -.++ -.++-.++|+.+|+++++..|+.-...++++. T Consensus 66 ~~f~---~v~l~~~k~~~~~~iS~ell~~-~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~ 141 (298) T protein:vir:94 66 VTLA---PQTMVPIKVEYGARISDEFMYA-SDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDS 141 (298) T ss_pred ccee---EEEEeeeEEEEeeehhHHHhcc-CCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccc Confidence 8864 5788889999865 9999743 3333 34668899999999999999996422111100 Q ss_pred --e--cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccccee-----echhhhhhhheeEEEEec Q lcl|Aclame:pro 145 --D--ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTA-----FGLTYLVDFTGTVIISTN 215 (296) Q Consensus 145 --~--~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~-----fg~tyl~nfLG~~II~S~ 215 (296) + .....-.......+.++...++..+...-+.++||.+.+.+++-.+-.-+.. .++... .++|..|+.++ T Consensus 142 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-tl~G~PV~~~~ 220 (298) T protein:vir:94 142 KVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPD-TINGLPVDVNK 220 (298) T ss_pred ccccccccccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCc-eecceeeEEec Confidence 0 0000000111234455556666654455689999999997765221111122 223222 37899999999 Q ss_pred cCCCce------EEEEcccceEEEEecCcchhhhhhhccccccccceEEEecc-ccceeehhh-hhhHHHHhhhhccceE Q lcl|Aclame:pro 216 DVTKGE------IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQ-ENTTLTIQT-LLVSGMLMYPERIDGI 287 (296) Q Consensus 216 kV~~G~------~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~-~~~~~t~et-~~~~~~~lfpE~~dgv 287 (296) .||.+. +++---.+.. .|-.. +++.=...-+.|+.| .+|+- ..+...+-. .-+.+...-|+ ++ T Consensus 221 ~v~~~~~~~~~~~~~Gdfs~~~-~~~~~--~~~~~~~~~~~~~d~---~~~~~f~~~~v~~r~~~r~~~~~~~~~---a~ 291 (298) T protein:vir:94 221 TVSDMSLTQRDRAIIGDFANGF-KWGYA--KEVPLEVIQYGDPDN---SGLDLKGYNQVYIRAELFLGWGILDAT---KF 291 (298) T ss_pred ccccccCCCccEEEEeeccceE-EEEEe--cCceEEEeecCCCcC---cchhhhhcCcEEEEEEEEeccEeeccc---ce Confidence 998642 2221111111 11111 112111111222111 00100 000000000 00112222232 33 Q ss_pred EEEEecC Q lcl|Aclame:pro 288 VKVTLTP 294 (296) Q Consensus 288 v~~tI~~ 294 (296) ++++--- T Consensus 292 ~~l~~~t 298 (298) T protein:vir:94 292 ARVTEAN 298 (298) T ss_pred EEEEecC Confidence 3332111 No 69 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.93 E-value=1.9e-10 Score=73.92 Aligned_cols=258 Identities=12% Similarity=-0.011 Sum_probs=147.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceec-h Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ip-l 78 (296) -++..+++.-+.+..+ ++.+.+-+.+..-.-+++.-+..||+.++ ++.+|+....+.+. -|+||+++| . T Consensus 90 a~~~~t~~~gg~~vP~--------~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~Eg~~~~~~ 160 (371) T protein:vir:81 90 AMSEGSNQDGGYTVPQ--------DIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFV-EVAEGAAIGEK 160 (371) T ss_pred hhccCCCccCceeecH--------hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccee-eeccccccccc Confidence 2233333333333333 33444444444444466666777775443 45566554455553 899999998 6 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHH Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALA 156 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala 156 (296) +..+.. ..+++.+|++.-+ |.|.++.+.. +-...-.++|+.+++.+++..|+.-..+++.+...+.+++..++. T Consensus 161 ~~~~f~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~~~i~~~~~ 236 (371) T protein:vir:81 161 ATPQFT---LLQYQVKKYAGFFRVTNELLNDSTE-AIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADLDGLKQIIN 236 (371) T ss_pred ccccee---eEEeeeeEEEEeehhhHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccHHHHHHHHH Confidence 777764 4788899999865 9999865543 456778889999999999999998876665554455556654442 Q ss_pred HHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCceEE--EEcccc Q lcl|Aclame:pro 157 SAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGEIW--ATVPEN 229 (296) Q Consensus 157 ~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~~~--~t~~~N 229 (296) . .+... .....+.++||.+.+.+++-.+-.-+..| ++... -++|..|+.+..+|.|..+ .+..+. T Consensus 237 ~---~l~~~----~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~~~~~~~~~~ 308 (371) T protein:vir:81 237 V---QLDPV----FRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGR-QLLGLPVVIVSNKVLANRVDGGTGAQF 308 (371) T ss_pred h---hcchh----hhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCc-eecceeEEEecccccCccccccccCCc Confidence 2 11111 22346889999998876543222222222 12112 2779999999998877544 334444 Q ss_pred eEEEEecCcch-------hhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 230 IIFAYINPNNS-------ELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 230 l~~ay~~~~~g-------~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) -.+.+-|.+.+ ++.=... +-.|++++.+..+ +.+. +-+.+++++++++.+ T Consensus 309 ~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r-------------~d~~---~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 309 APIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIER-------------MDVK---MRDDEAFVFGEVQLA 371 (371) T ss_pred ceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEe-------------eccE---EecccceEEEEEecC Confidence 45555443210 0100000 0112222222211 1122 234589999999999 No 70 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.89 E-value=6.2e-11 Score=76.56 Aligned_cols=270 Identities=15% Similarity=0.121 Sum_probs=140.8 Q ss_pred Cccccc-cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceech Q lcl|Aclame:pro 1 MVTSRT-YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~-~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipl 78 (296) ....+. .....+..+.+-+...--++.+++-+.+.+-.-++..-+.+|+..| .+++|+..-. +.+ +-|+||+.+|- T Consensus 141 ~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a-~wv~E~~~~~~ 218 (497) T protein:vir:10 141 GETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNA-AAVAEAGTYPF 218 (497) T ss_pred hhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcc-eeeccCccccc Confidence 000000 0001122223344444445555554444444455666677888776 5899987443 345 48999999999 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------HhcCcc-ceec Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA---------LKTGTG-TQDA 146 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~a---------Lktat~-t~~~ 146 (296) +..+.. ..++..||++--+ |.|.++.+. +-.+.-.++|+..|+.++|..|+.= +..++. +.+. T Consensus 219 s~~~f~---~i~~~~~k~a~~~~iS~ell~d~~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~ 293 (497) T protein:vir:10 219 SSEEFA---RVYEQVGKVANALTITDEGLRDAP--ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) T ss_pred ccccce---eeEeeeeeeEeecHhHHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccc Confidence 998764 5788889988865 999987554 3677888999999999999988751 100000 0000 Q ss_pred ----ch--------------------------hhHHHHH-HHH------------------HHHHH----HhhccccCcc Q lcl|Aclame:pro 147 ----LG--------------------------AGLQGAL-ASA------------------WGKLQ----VLFEDYGSER 173 (296) Q Consensus 147 ----t~--------------------------~~lQ~Al-a~~------------------~~~~~----~~Feded~~~ 173 (296) .. ..++.+. ..+ ...+. .......... T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) T protein:vir:10 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) T ss_pred cccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCC Confidence 00 0000000 000 00000 0000111122 Q ss_pred eEEEEcHHHHHHHhcCCcccc----ceeechhhh------hhhheeEEEEeccCCCceEEEEcccceEE-EEecCcchhh Q lcl|Aclame:pro 174 AIVFANSLDVAEYIAKAGITT----QTAFGLTYL------VDFTGTVIISTNDVTKGEIWATVPENIIF-AYINPNNSEL 242 (296) Q Consensus 174 ~VlFvNP~Daa~~l~~a~i~~----q~~fg~tyl------~nfLG~~II~S~kV~~G~~~~t~~~Nl~~-ay~~~~~g~~ 242 (296) .+.++||.|...++.-.+-.- +..+++.+. ..++|..|+.+..+|.|++|+ .+.+- ||.=...+++ T Consensus 374 ~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~---Gd~~~~~~~i~~r~~~ 450 (497) T protein:vir:10 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV---GHFAPSVIQTARREGV 450 (497) T ss_pred CeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEE---eecccceEEEEEeccc Confidence 368899999987653222111 111122111 126799999999999999864 33332 2210001222 Q ss_pred hhhh------ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 243 AKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 243 ~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .=.+ .|.+|.+++.+.... .+... +.++++++++++++ T Consensus 451 ~v~~~~~~~~~f~~n~v~~r~~~r~-------------~~~v~---~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 451 TMQMTNSNGTDFVDGKVTVRAEERL-------------GLLVY---RPSAFQLIQLKKGA 494 (497) T ss_pred EEEeecccchhhhcCcEEEEEEEee-------------cceee---ccccEEEEEecCCc Confidence 2111 133444444433321 22233 45789999999999 No 71 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.89 E-value=6.2e-11 Score=76.56 Aligned_cols=270 Identities=15% Similarity=0.121 Sum_probs=140.8 Q ss_pred Cccccc-cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceech Q lcl|Aclame:pro 1 MVTSRT-YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~-~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipl 78 (296) ....+. .....+..+.+-+...--++.+++-+.+.+-.-++..-+.+|+..| .+++|+..-. +.+ +-|+||+.+|- T Consensus 141 ~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a-~wv~E~~~~~~ 218 (497) T protein:vir:78 141 GETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNA-AAVAEAGTYPF 218 (497) T ss_pred hhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcc-eeeccCccccc Confidence 000000 0001122223344444445555554444444455666677888776 5899987443 345 48999999999 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------HhcCcc-ceec Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA---------LKTGTG-TQDA 146 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~a---------Lktat~-t~~~ 146 (296) +..+.. ..++..||++--+ |.|.++.+. +-.+.-.++|+..|+.++|..|+.= +..++. +.+. T Consensus 219 s~~~f~---~i~~~~~k~a~~~~iS~ell~d~~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~ 293 (497) T protein:vir:78 219 SSEEFA---RVYEQVGKVANALTITDEGLRDAP--ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) T ss_pred ccccce---eeEeeeeeeEeecHhHHHHHHhHH--HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccc Confidence 998764 5788889988865 999987554 3677888999999999999988751 100000 0000 Q ss_pred ----ch--------------------------hhHHHHH-HHH------------------HHHHH----HhhccccCcc Q lcl|Aclame:pro 147 ----LG--------------------------AGLQGAL-ASA------------------WGKLQ----VLFEDYGSER 173 (296) Q Consensus 147 ----t~--------------------------~~lQ~Al-a~~------------------~~~~~----~~Feded~~~ 173 (296) .. ..++.+. ..+ ...+. .......... T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) T protein:vir:78 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) T ss_pred cccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCC Confidence 00 0000000 000 00000 0000111122 Q ss_pred eEEEEcHHHHHHHhcCCcccc----ceeechhhh------hhhheeEEEEeccCCCceEEEEcccceEE-EEecCcchhh Q lcl|Aclame:pro 174 AIVFANSLDVAEYIAKAGITT----QTAFGLTYL------VDFTGTVIISTNDVTKGEIWATVPENIIF-AYINPNNSEL 242 (296) Q Consensus 174 ~VlFvNP~Daa~~l~~a~i~~----q~~fg~tyl------~nfLG~~II~S~kV~~G~~~~t~~~Nl~~-ay~~~~~g~~ 242 (296) .+.++||.|...++.-.+-.- +..+++.+. ..++|..|+.+..+|.|++|+ .+.+- ||.=...+++ T Consensus 374 ~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~---Gd~~~~~~~i~~r~~~ 450 (497) T protein:vir:78 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV---GHFAPSVIQTARREGV 450 (497) T ss_pred CeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEE---eecccceEEEEEeccc Confidence 368899999987653222111 111122111 126799999999999999864 33332 2210001222 Q ss_pred hhhh------ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 243 AKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 243 ~~~f------~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .=.+ .|.+|.+++.+.... .+... +.++++++++++++ T Consensus 451 ~v~~~~~~~~~f~~n~v~~r~~~r~-------------~~~v~---~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 451 TMQMTNSNGTDFVDGKVTVRAEERL-------------GLLVY---RPSAFQLIQLKKGA 494 (497) T ss_pred EEEeecccchhhhcCcEEEEEEEee-------------cceee---ccccEEEEEecCCc Confidence 2111 133444444433321 22233 45789999999999 No 72 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.88 E-value=8.9e-11 Score=75.68 Aligned_cols=260 Identities=12% Similarity=0.057 Sum_probs=130.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |+++-.+ +..+ ++.+.|-+.+.+-.-++.+-+.+||.. ..+++|.++....|. -|+||++||.++ T Consensus 1 ma~~gG~-----lvp~--------~~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~-~v~E~~~~~~~~ 65 (298) T protein:vir:16 1 MVLNKGT-----LFDP--------TLVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEID-VVAESGKKTHGG 65 (298) T ss_pred CcccCcc-----eech--------hHHHHHHHHHHhhhhhhhhcceeeccC-CceEEEEEecCcceE-EecCCccccccc Confidence 7765533 1222 222333222222223344456677765 447899998777775 999999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCc---hhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----------- Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNE---AVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----------- 144 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygd---av~etd~QL~~~iq~kIdnD~~~aLktat~t~----------- 144 (296) ++.. ..+++.+|++..+ |.|.++ .-.++ -.++-.++|+.+|+++++..|+.-...++++. T Consensus 66 ~~f~---~v~l~~~k~a~~~~iS~ell~-~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~ 141 (298) T protein:vir:16 66 VTLA---PQTMVPIKVEYGARISDEFMY-ASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDS 141 (298) T ss_pred ccee---EEEEeeeeEEEeehhhHHHhh-cCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccc Confidence 8864 5788999999865 999974 33333 34578899999999999999996532111100 Q ss_pred ----ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEec Q lcl|Aclame:pro 145 ----DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTN 215 (296) Q Consensus 145 ----~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~ 215 (296) ..........+...+.++...++..+...-..++||.+.+.+++-.+-.-+..| ++.-. .++|..|+.++ T Consensus 142 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-~l~G~PV~~~~ 220 (298) T protein:vir:16 142 KVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPD-TINGLPVDVNK 220 (298) T ss_pred ccccccccccccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCc-eecceeeEEec Confidence 001111111122334444455555433344688999999987753322222222 22212 37899999999 Q ss_pred cCCCce----EEEEcccceEEEEecCcchhhhhhhccccccc---------cceEEEeccccceeehhhhhhHHHHhhhh Q lcl|Aclame:pro 216 DVTKGE----IWATVPENIIFAYINPNNSELAKEFNLYGDPT---------GYIGMNHFQENTTLTIQTLLVSGMLMYPE 282 (296) Q Consensus 216 kV~~G~----~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~t---------GliGv~h~~~~~~~t~et~~~~~~~lfpE 282 (296) .||.+. ..+..-|-=+.+.+.++ +++.=...-..|+. |.|++.-. .-+.+..+-| T Consensus 221 ~v~~~~~~~~~~~~~GDfs~~~~~~~~-~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~----------~r~d~~v~~~- 288 (298) T protein:vir:16 221 TVSDMSLTQRDRAIIGDFANGFKWGYA-KEVPLEVIQYGDPDNSGLDLKGYNQVYIRAE----------LFLGWGILDA- 288 (298) T ss_pred ccccccCCCccEEEEeeccceEEEEEe-cCceEEEeeccCCcCcchhhhhcCcEEEEEE----------EEEccEeecc- Confidence 998642 11111110011111111 11111111111111 11111100 0011222223 Q ss_pred ccceEEEEEecC Q lcl|Aclame:pro 283 RIDGIVKVTLTP 294 (296) Q Consensus 283 ~~dgvv~~tI~~ 294 (296) +++++++--- T Consensus 289 --~a~~~l~~at 298 (298) T protein:vir:16 289 --TKFARVTEAN 298 (298) T ss_pred --cceEEEeecC Confidence 2333332211 No 73 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.86 E-value=8.3e-11 Score=75.84 Aligned_cols=271 Identities=10% Similarity=-0.006 Sum_probs=144.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHH-HhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNK-FQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~-f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ipls 79 (296) ...+.|+........+++ .+. |...+...--+..+.+..++ +| .+.+|.-.....+. -|+||+.+|.+ T Consensus 249 ~~~~~t~~~gg~lip~~~--------~~~ii~~~~~~~~~l~~~~~~~~~-~g-~~~~~~~~~~~~a~-~v~Eg~~~~~~ 317 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQL--------DPTVIITSNGSLNDIRRFARQVVA-TG-DVWHGVSSAAVQWS-WDAEFEEVSDD 317 (543) T ss_pred hhcccccccCcccCchhh--------hhHHHHHHHhhhchhhhhcccccC-Cc-ceEEEEecCCccee-ecccCcccccc Confidence 112222222222222222 221 22222222233334455444 45 45677765556664 89999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC----------c---cc- Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG----------T---GT- 143 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta----------t---~t- 143 (296) +++.. ..+++.+|++..+ |.|.++.+ . +-.+.-.++|..+++.+++.-|+.--.++ + .. T Consensus 318 ~~~~~---~i~~~~~k~~~~~~is~ell~d~-~-~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~ 392 (543) T protein:vir:81 318 SPEFG---QPEIPVKKAQGFVPISIEALQDE-A-NVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEI 392 (543) T ss_pred ccccc---eeeeeeeeeEeeehhhHHHHhcc-H-HHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccccc Confidence 98864 5889999999876 99998644 4 78889999999999999999887421111 0 00 Q ss_pred eecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCC Q lcl|Aclame:pro 144 QDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTK 219 (296) Q Consensus 144 ~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~ 219 (296) ++.+...+ .+..+.++...+........+.++||.+...+++-.+-.-+..| ++.-. -++|..|+.+..+|. T Consensus 393 ~~~~~~~~---~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~-~l~G~pv~~~~~~~~ 468 (543) T protein:vir:81 393 APVTAETF---ALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPS-QLLGRPVGEAEAMDA 468 (543) T ss_pred cccccccc---cHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCc-cccceeeEEeccccc Confidence 01111111 12334444445554444557889999998876542211111122 11111 278999999999999 Q ss_pred ceEEEEcccceEEEEecCcchhhhhhhc--cccccccceEEEeccccceeehhh-hhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 220 GEIWATVPENIIFAYINPNNSELAKEFN--LYGDPTGYIGMNHFQENTTLTIQT-LLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 220 G~~~~t~~~Nl~~ay~~~~~g~~~~~f~--~~td~tGliGv~h~~~~~~~t~et-~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +......+++..++|.|.+.==++...+ +-.|.-++.+. +...+...+-. .-+.+..+ +.++++.+++.++- T Consensus 469 ~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~--~~~~~~~~~~~~~r~d~~v~---~~~A~~~l~~~~~a 543 (543) T protein:vir:81 469 NWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTN--RRPNGSRGWFAYYRMGADVV---NPNAFRLLNVETAS 543 (543) T ss_pred cccccccCCcceEEEeeccceeEEeecccEEEEeccccccc--hhhcCceEEEEEEeeccEee---cccceEEEEecccC Confidence 9988888888888887753100000000 11111111000 00000001100 01222333 34677888886666 No 74 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.84 E-value=2.3e-10 Score=73.38 Aligned_cols=273 Identities=11% Similarity=0.067 Sum_probs=139.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-.+.+.. -.|+ .+.+ .+.+-+.+..-.-++.+-+.+|+..| .+++|++.....|. -|+||+++|.++ T Consensus 1 ma~~t~~~-G~li-p~~~--------~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~-wv~Eg~~~~~s~ 68 (300) T protein:vir:95 1 MSEAQLSK-GNLF-NPEL--------VTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDID-IVAENGKKTHGG 68 (300) T ss_pred CcccccCC-ccee-chhh--------HHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceE-EeeCCccccccc Confidence 65544432 2232 2222 22222222221122233345565544 68899987776774 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhh--cCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------------ Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMY--GSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------------ 144 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqls--Gygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------------ 144 (296) .+.. ..+++.||++.-+ |.|-++.+ .+-+-.++-.++|+.+|++++|..|+.-...++++. T Consensus 69 ~~f~---~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 145 (300) T protein:vir:95 69 VSLD---PVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKK 145 (300) T ss_pred ccce---eeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccc Confidence 8875 5778889999865 99987543 355677888999999999999999996532211110 Q ss_pred -ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc-----cccceeechhhhhhhheeEEEEeccCC Q lcl|Aclame:pro 145 -DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG-----ITTQTAFGLTYLVDFTGTVIISTNDVT 218 (296) Q Consensus 145 -~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~-----i~~q~~fg~tyl~nfLG~~II~S~kV~ 218 (296) +....+-....+..+.++...+++.+...-+.++||.+...+++-.+ +-.+...++... .++|..|+.|+.+| T Consensus 146 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~-~l~G~Pv~~s~~v~ 224 (300) T protein:vir:95 146 VTQTVPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPD-AINGLAVDKNRTVS 224 (300) T ss_pred cceeecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCc-eecceeeEEecCCC Confidence 00000001111234445555565544444578999999997764322 212223344444 38899999999998 Q ss_pred Cce----EEEEcccceE-EEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 219 KGE----IWATVPENII-FAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 219 ~G~----~~~t~~~Nl~-~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) .+. ..+.. .+.. .++..++ +++.-...-+.|..|- |+ +-...+...+-...-.+ +=..+.+++++++=. T Consensus 225 ~~~~~~~~~~~~-GDf~~~~~~~~~-~~~~~~v~~~~~~d~~-~~-~~f~~~~v~~r~~~r~d--~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 225 YSQTDPKNTAIV-GDFETMFKWGYA-KEVPMEIIKYGDPDNS-GR-DLKGYNQIYIRCEAYIG--WGIMDAASFARIVKT 298 (300) T ss_pred CCCCCCccEEEE-eeccceEEEEEe-cccEEEEeeccCCCCc-ch-hhhhcCcEEEEEEEeec--ceeecccceEEEecC Confidence 754 11111 1111 1112222 2222222222222210 00 00011111111000001 112234556666555 Q ss_pred CC Q lcl|Aclame:pro 294 PG 295 (296) Q Consensus 294 ~~ 295 (296) +. T Consensus 299 ~g 300 (300) T protein:vir:95 299 GG 300 (300) T ss_pred CC Confidence 55 No 75 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.83 E-value=2.8e-10 Score=72.97 Aligned_cols=255 Identities=13% Similarity=0.007 Sum_probs=138.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) +.+..+.+..+. ..--++.+++-+-+....-+++.-+.+|+..| ++++|.+...+.+-.-|+||..+| .+ T Consensus 133 ~~~~~~~~~gg~--------~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~ 203 (400) T protein:vir:38 133 VNAGVKAADAAS--------TIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAELEKNPAMA 203 (400) T ss_pred HhhcccccCCcc--------cccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCccccccccccccccc Confidence 222222222222 22234445555555555556667777888655 567787765534335899999998 45 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALAS 157 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~ 157 (296) ..+.. ..+++.+|+++-+ |.|.++.+++ +-.+.-.++|+..+..+++..++.-..+++.+...+.+.+..++.. T Consensus 204 ~~~f~---~i~~~~~k~~~~~~is~ell~ds~~-~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 279 (400) T protein:vir:38 204 KPEFK---PVNWSVETYRQALPVSQESIDDSAI-DLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTISSVDDLKHINNV 279 (400) T ss_pred cccce---eeEeehhheeeehhhHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHhhhhccccccccccccHHHHHHHHHh Confidence 55553 5778889999875 9999976665 4667888999999999999999888776665444444445443321 Q ss_pred HHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCceEEEEcccceEE Q lcl|Aclame:pro 158 AWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGEIWATVPENIIF 232 (296) Q Consensus 158 ~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ 232 (296) . -+. ....+.++||.+...+.+-.+-.-+..| +++-. .++|..|+.+...|.+. .++-.+ T Consensus 280 ~--------~~~-~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-~l~G~pv~~~~~~~~~~-----~g~~~~ 344 (400) T protein:vir:38 280 D--------LDP-AYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGK-SVLGMPIAVVSDDTLGA-----AGEAHA 344 (400) T ss_pred h--------hhh-hhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCcc-ccccceeEEecccccCC-----CCceEE Confidence 1 111 1246889999998876542211112223 11111 37899999988877542 122233 Q ss_pred EEecCcchhhhhhhccccccccceEEEeccccceeehhhh-----hhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 233 AYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTL-----LVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 233 ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~-----~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .|.|. ++..-+. |-.| +.+.+..+ ..+.+. -+.+. |-..++++++++++.- T Consensus 345 ~~gd~-----s~~~~~~-~~~~-~~~~~~~~---~~~~~~~~~~~r~d~~---~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 345 FLGDI-----KRAILFA-NRAD-FMVRWVDD---QIYGQFLQAGMRFGVS---VADEKAGYFLTYTPKA 400 (400) T ss_pred EEEec-----cccEEEE-eecc-eEEEEecc---cccceeEEEEEEeccE---EecccceEEEEeecCC Confidence 33332 2211111 1011 11111110 011110 01111 1234567788886666 No 76 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.81 E-value=8.2e-10 Score=70.41 Aligned_cols=269 Identities=10% Similarity=0.000 Sum_probs=136.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) +...+..+ .+..++.+=+...--++.+++-+.+..-.-+++.-+..|++.| ++++|.....+....-|+||+++| .+ T Consensus 103 ~~~~~~~~-~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~ 180 (394) T protein:vir:10 103 HGKVIDNA-AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAENPALA 180 (394) T ss_pred cchhhhhh-hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEecCCCccccccccccccccc Confidence 11111000 0111222222222234555555555555555666677777554 566766554433335899999999 56 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALAS 157 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~ 157 (296) ..+.. ..++++||++.-+ |.|.++.+.+ +-.+.-.+.|+..+..+++..++..+.+++.....+..++. T Consensus 181 ~~~~~---~v~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~~d----- 251 (394) T protein:vir:10 181 EPEFE---QVDWSVSTYRGAIPLSEEAIADSAV-DLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTDTLVD----- 251 (394) T ss_pred cccce---eEEeeeeeeEeeehhHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccHH----- Confidence 66653 5788999999865 9999875544 45778889999999999999999888766543322222222 Q ss_pred HHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----c---hhhhhhhheeEEEEeccCCCceEEEEcccc Q lcl|Aclame:pro 158 AWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----G---LTYLVDFTGTVIISTNDVTKGEIWATVPEN 229 (296) Q Consensus 158 ~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g---~tyl~nfLG~~II~S~kV~~G~~~~t~~~N 229 (296) .+.++.....+.. -..+.++||.+...+++=.+-.-+..| + +..-..++|..|+.+.....+. ..++ T Consensus 252 ~l~~~~~~~~~~~-~~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~----~~~~ 326 (394) T protein:vir:10 252 SLKHILNVDLDPA-YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGS----AAGD 326 (394) T ss_pred HHHHHHHhhhhhh-ccCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCC----CCCc Confidence 1112222222221 135899999998876642211111112 1 1111137898876654332111 1233 Q ss_pred eEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHH---hhhhccceEEEEEecCCC Q lcl|Aclame:pro 230 IIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGML---MYPERIDGIVKVTLTPGV 296 (296) Q Consensus 230 l~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~---lfpE~~dgvv~~tI~~~v 296 (296) ..++|.|. ++..-+. |..| +-+.+.. +.++.+.+ -+.. .=+-..++|+.+++++++ T Consensus 327 ~~i~~gd~-----s~~~~~~-~~~~-~~v~~~~---~~~~~~~~-~~~~r~d~~~~~~~ai~~~~~~~~~ 385 (394) T protein:vir:10 327 QKAFVGDL-----KRGVLFA-DRQQ-VTLAWED---SKIYGRYL-GAAFRFGVKQADSNAGYFVTNTDAA 385 (394) T ss_pred eEEEEeec-----cccEEEE-eecc-eEEEEec---ccccceeE-EEEEEeccEEeccccEEEEEeeccc Confidence 34444443 2211111 1111 1122211 11122211 0000 012335567788888888 No 77 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.78 E-value=5e-10 Score=71.56 Aligned_cols=269 Identities=12% Similarity=0.057 Sum_probs=135.1 Q ss_pred Ccc----ccccccc---cceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeee-ecccCccc Q lcl|Aclame:pro 1 MVT----SRTYPEE---NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDV-TLAEGNVP 71 (296) Q Consensus 1 ~~~----~~~~ae~---nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yi-g~A~gdVa 71 (296) ++. .....|. +..+..+=+...-.++.+.+-+.+.+-.-+++.-+..|++.+. ++.+|++... +.+. .|+ T Consensus 100 ~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~ 178 (404) T protein:vir:39 100 MVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTV-MDA 178 (404) T ss_pred HHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCcccee-eec Confidence 000 0000000 1111122122223355555555555555566666778876653 4444554322 3343 799 Q ss_pred CCceec-hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-ecc Q lcl|Aclame:pro 72 EGEVIP-LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-DAL 147 (296) Q Consensus 72 EGe~Ip-lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-~~t 147 (296) ||+.+| .++.+.. ..+++.+|+++.+ |.|.++.+.+ +..+.-.++|...++.+++..++.-..+++... ..+ T Consensus 179 Eg~~~~~~~~~~f~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~~~~~~ 254 (404) T protein:vir:39 179 EDGKIPDLDNPRLT---IIKYLIKRYAGIITATNTLLKDTAE-NILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIAK 254 (404) T ss_pred Ccccccccccccee---eEEeeeeeEEeeehhHHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Confidence 999999 5777764 5788999999875 9999875554 457888999999999999999998776554322 123 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEecc--CCCc Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTND--VTKG 220 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~k--V~~G 220 (296) .+.+..+++. ..........+.++||.+.+.+++-.+-.-+..| +++-- .++|..|+.+.. +|.+ T Consensus 255 ~~~i~~~~~~-------~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~ 326 (404) T protein:vir:39 255 FDDVITMINT-------SVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSY-LIKGKKVIVVADRWLPNS 326 (404) T ss_pred HHHHHHHHHH-------hhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcc-eecceeEEEecccccCcc Confidence 3444433321 1222222456899999998877642221112222 22211 367987766543 3322 Q ss_pred eEEEEcccceEEEEecCcchhhhhhhccccccccc-eEEEecc----ccceeehhhh-hhHHHHhhhhccceEEEEEecC Q lcl|Aclame:pro 221 EIWATVPENIIFAYINPNNSELAKEFNLYGDPTGY-IGMNHFQ----ENTTLTIQTL-LVSGMLMYPERIDGIVKVTLTP 294 (296) Q Consensus 221 ~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGl-iGv~h~~----~~~~~t~et~-~~~~~~lfpE~~dgvv~~tI~~ 294 (296) . .....+++.|+ ++.+.+.. ..|+ |-+.... ..+...+-.. -+.+. +-+.++++++++++ T Consensus 327 ~-----~~~~~~~~gd~-----~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~---~~~~~a~~~~~~~~ 392 (404) T protein:vir:39 327 G-----STVYPLYYGDM-----SQAITLFD-RENMSLLPTNIGAGAFETDTTKIRVIDRFDVK---TTDSEALVAGSFTA 392 (404) T ss_pred C-----CCccEEEEEec-----cccEEEEe-ecceEEEEeccchhhhhhceeeEEEEeeeccE---EecccceEEEEeec Confidence 1 12222334333 22222211 1110 1111100 0000000000 11222 33456778888765 Q ss_pred CC Q lcl|Aclame:pro 295 GV 296 (296) Q Consensus 295 ~v 296 (296) .- T Consensus 393 ~a 394 (404) T protein:vir:39 393 IA 394 (404) T ss_pred cc Confidence 54 No 78 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.77 E-value=3.9e-10 Score=72.18 Aligned_cols=262 Identities=10% Similarity=0.045 Sum_probs=136.6 Q ss_pred Cccccccc---cccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeee--eeeee-ecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYP---EENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTY--AGYDV-TLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~a---e~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~p--k~~yi-g~A~gdVaEGe 74 (296) ++..+... .-+..++++=+...--++.+++-+.+..-.-+++.-+.+|+.. .+.+.| ++... +.+. .|+||+ T Consensus 97 ~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~-~v~E~~ 174 (397) T protein:vir:48 97 LVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTT-LTGSRVYEKWADITGLAK-LDDEAG 174 (397) T ss_pred HHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccC-CcceEEEEeecCCCccee-eecccc Confidence 11111100 0122223333333344566666666655556666667777653 334444 44322 3353 799999 Q ss_pred eechh-heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcccee-cchhh Q lcl|Aclame:pro 75 VIPLS-KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD-ALGAG 150 (296) Q Consensus 75 ~Ipls-kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~-~t~~~ 150 (296) .+|-+ +.+.. ..+++.+|++..+ |.|.++.+.+ +..+.-.++|+.+++.+++..|++-..+++.... .+.+. T Consensus 175 ~~~~~~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~~~~~~d~ 250 (397) T protein:vir:48 175 SIGTNDDPKLY---PIRYAIKRYAGISTVTNSLLADSAE-NILAWLSGWIAKKVVVTRNKAILEAIATLPTKPTLTKWDD 250 (397) T ss_pred cccccccccee---eEEeeheeeeeehhhHHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHH Confidence 99965 45653 4778889999875 9999976654 5778899999999999999999987765543221 22233 Q ss_pred HHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEE--eccCCCce-- Q lcl|Aclame:pro 151 LQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIIS--TNDVTKGE-- 221 (296) Q Consensus 151 lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~--S~kV~~G~-- 221 (296) +. +.+.++. .......+.++||.+.+.+++=.+-.-+..| +++-. .++|..|+. +.-++.+. T Consensus 251 i~----~~~~~l~----~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~-~l~G~PV~~~~~~~~~~~~~~ 321 (397) T protein:vir:48 251 II----DLQAKVD----PAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGY-SIDGFAVKEVADRWLANASSG 321 (397) T ss_pred HH----HHHHHhh----hhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCc-eeccceeEEecccccCCcCCC Confidence 32 2333333 2223456889999998876543222222222 11111 267865544 33343322 Q ss_pred ---EEEEcccceEEEEecCcchhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEe Q lcl|Aclame:pro 222 ---IWATVPENIIFAYINPNNSELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTL 292 (296) Q Consensus 222 ---~~~t~~~Nl~~ay~~~~~g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI 292 (296) +++ .|+.-+|.-...+++.-... +..|.+++.+..+ +.+..+.| ++++++++ T Consensus 322 ~~~~~~---gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r-------------~d~~~~~~---~a~~~~~~ 382 (397) T protein:vir:48 322 AMPLYF---GDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDR-------------FDVVATDT---ESFVPASF 382 (397) T ss_pred ceEEEE---EeccceEEEEeecceEEEEeccchhhhhcCceeEEEEee-------------eccEEecc---cceEEEEe Confidence 221 22222221111111211111 1122222222211 23334444 68899999 Q ss_pred cCCC Q lcl|Aclame:pro 293 TPGV 296 (296) Q Consensus 293 ~~~v 296 (296) +++. T Consensus 383 ~~~~ 386 (397) T protein:vir:48 383 KAIA 386 (397) T ss_pred cccc Confidence 8877 No 79 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.77 E-value=1.6e-09 Score=68.78 Aligned_cols=267 Identities=10% Similarity=-0.029 Sum_probs=139.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |.|..+. +....+. |.+++=+.+..-.-++.+-+.+||..| .+++|++.....|. -|+||+++|.++ T Consensus 1 mat~~~g---g~lvP~~--------~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~-wv~Eg~~~~~~~ 67 (311) T protein:vir:81 1 MVALATG---TFQLPKH--------LVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGE-VVGEGAQKSEST 67 (311) T ss_pred CceecCC---ceEcchh--------HHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeE-EeecCccccccc Confidence 7665543 2333322 334443333333334455566777655 68999997777774 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCc---hhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------------ Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNE---AVTNTDNALVRQLQKKIRTDFVTALKTGTGT------------ 143 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygd---av~etd~QL~~~iq~kIdnD~~~aLktat~t------------ 143 (296) .+.. ..++..||++.-+ |.|.+|.+ ..+ -.+.-.++|+++|+++++.-|+.--..++++ T Consensus 68 ~~f~---~v~l~~~kl~~~~~iS~ell~~~-~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~ 143 (311) T protein:vir:81 68 ATFA---PVTAIPRKVQVTQRFSQEVKWAD-ESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDT 143 (311) T ss_pred ceee---EEEEeeEEEEEeehhhHHHhhcC-cccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccccccccccc Confidence 8864 5778889988754 99987533 333 3577889999999999999998653211110 Q ss_pred ---eecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEec Q lcl|Aclame:pro 144 ---QDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTN 215 (296) Q Consensus 144 ---~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~ 215 (296) ...+ ..=.......+.++...+...+......++||.+...+++-.+-+-+..| ++.-. .++|..|+.++ T Consensus 144 ~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-tl~G~Pv~~~~ 221 (311) T protein:vir:81 144 TNIVELT-TGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVA-SFAGLNAAVSD 221 (311) T ss_pred ceeeeec-ccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCc-eecceeEEecc Confidence 0000 00001223334455556665443344588899999877653222222222 22222 27899999999 Q ss_pred cCCCceEEEEcc--------cceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhh--hhcc- Q lcl|Aclame:pro 216 DVTKGEIWATVP--------ENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMY--PERI- 284 (296) Q Consensus 216 kV~~G~~~~t~~--------~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lf--pE~~- 284 (296) .+|.+....... .+..+. .||+++.+-...++ +-+....+...-..-.+...+++.| =+|. T Consensus 222 ~i~~~~~~~~~~~~~~~~~~~~~~~~-----~gDfs~~~i~~~~~---~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d 293 (311) T protein:vir:81 222 TVRGGPEAVTASTGVYRTTNPNVKAI-----AGDFSAFRWGVQVS---IPLELIEFGDPDGLGDLKRQNQIAIRAEVVYG 293 (311) T ss_pred cccccccccccccchhcccCCccEEE-----EEecccEEEEEecc---ceEEEeccCCCCcchhhhhcCcEEEEEEEEec Confidence 999776443322 122222 23444422111111 1111111100000001122222222 1233 Q ss_pred ------ceEEEEEecCCC Q lcl|Aclame:pro 285 ------DGIVKVTLTPGV 296 (296) Q Consensus 285 ------dgvv~~tI~~~v 296 (296) +++++ ++.++ T Consensus 294 ~~v~~~~a~~~--l~~a~ 309 (311) T protein:vir:81 294 IGIMSTDAFAV--VRDAD 309 (311) T ss_pred cEeecccceEE--EEeec Confidence 33444 33344 No 80 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.76 E-value=5e-10 Score=71.58 Aligned_cols=262 Identities=12% Similarity=0.049 Sum_probs=135.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceec-h Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ip-l 78 (296) +. +.+....+.+. --+|.+++-+.+..-.-+++.-+..|++.++ .+.+|+|...+....-|+||+.+| . T Consensus 116 ~~-~~~~~~gg~~v--------P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~ 186 (408) T protein:vir:74 116 ET-SGSDSAAGLTI--------PQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDL 186 (408) T ss_pred hc-ccccCCCceee--------chhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccc Confidence 11 11222222222 2244555555555555566777788888765 677888876655545899999999 5 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-ecchhhHHHHH Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-DALGAGLQGAL 155 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-~~t~~~lQ~Al 155 (296) ++.+.. ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|..++..+++..|+.-..+++... ..+.+++..++ T Consensus 187 ~~~~~~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~~~~~~~~i~~~~ 262 (408) T protein:vir:74 187 DNPRLT---IIKYLIKRYAGIITATNTLLKDTAE-NILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIANFDDVITMI 262 (408) T ss_pred ccccee---eEEeeeeeEEeeehhHHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHH Confidence 767764 4788999999975 9999864444 468889999999999999999987654443221 12323333222 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEecc--CCCceEEEEccc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTND--VTKGEIWATVPE 228 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~k--V~~G~~~~t~~~ 228 (296) +. .+ ........+.++||.+...++.-..-.-+..| +++-. .++|..|+.+.. +|... .+ T Consensus 263 ~~---~l----~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~~-----~~ 329 (408) T protein:vir:74 263 NT---SV----DPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSY-LIKGKQVIVVADRWLPNSG-----ST 329 (408) T ss_pred HH---hh----hhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCc-eecceeeEEecCccccccc-----CC Confidence 11 11 11112346788999998877642211112222 11111 378987766543 33221 11 Q ss_pred ceEEEEecCcchhhhhhhccccccccceEEEecc----ccceeehhh-hhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQ----ENTTLTIQT-LLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~----~~~~~t~et-~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .-.++|.| +++++.+...+---+-+.... ..+..++-. .-+.+..+. .++++++++++.. T Consensus 330 ~~~i~~gd-----~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~---~~a~~~~~~~~~~ 394 (408) T protein:vir:74 330 VYPLYYGD-----MSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATD---SEALVAGSFTAIA 394 (408) T ss_pred cceEEEEe-----hhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEec---ccceEEEEeeccc Confidence 12222322 222221111100001010000 000000000 012333444 4677888885433 No 81 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.75 E-value=1e-09 Score=69.89 Aligned_cols=266 Identities=9% Similarity=-0.027 Sum_probs=138.0 Q ss_pred Cc---------cccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcc Q lcl|Aclame:pro 1 MV---------TSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNV 70 (296) Q Consensus 1 ~~---------~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdV 70 (296) +. .-.-...-+.++.++=+...--++.+.+-+.+..-.-+++.-+..|+..++ ++.+|+..-...+. -| T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v 166 (392) T protein:vir:10 88 PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFA-EI 166 (392) T ss_pred cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccce-ee Confidence 00 000000011112222222222344555555555545556666777776544 45677765555564 89 Q ss_pred cCCceech-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 71 PEGEVIPL-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 71 aEGe~Ipl-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) +||+++|- +..+.. ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+|+++++.-|+....+++.+...+ T Consensus 167 ~E~~~~~~~~~~~~~---~v~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~ 242 (392) T protein:vir:10 167 TEMGEIPETDNPKFS---NVQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS 242 (392) T ss_pred cccccccccccccce---eEEeeeeeEEEeehhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC Confidence 99999984 445543 4778889998875 9999876654 357788899999999999999998887766555455 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec----hhhhhhhheeEEEE--eccCCCce Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG----LTYLVDFTGTVIIS--TNDVTKGE 221 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg----~tyl~nfLG~~II~--S~kV~~G~ 221 (296) .+.+..++.. .........-+.++||.+.+.+++-.+-.-+..|- ...-..++|..+|. ++..+.. T Consensus 243 ~d~i~~~~~~-------~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~- 314 (392) T protein:vir:10 243 LDDIKDVLNV-------KLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS- 314 (392) T ss_pred HHHHHHHHHH-------hhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCC- Confidence 5666555532 22322234567899999988765422111122221 01111267865433 2222221 Q ss_pred EEEEcccceEEEEecCcc-------hhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNN-------SELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~-------g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) ..+..+...+.|.|.+. +++.=.++ +..|++++.+..+. .+. +=..++++ T Consensus 315 -~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-------------d~~---v~~~~a~~ 377 (392) T protein:vir:10 315 -KGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRD-------------DVQ---MWDNEAAV 377 (392) T ss_pred -CcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-------------ccE---EecccceE Confidence 12233333344433221 11110010 11122222222211 122 22357899 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) ++++++.. T Consensus 378 ~l~~~~~a 385 (392) T protein:vir:10 378 YGEIDLSA 385 (392) T ss_pred EEEecccc Confidence 98885544 No 82 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.75 E-value=1e-09 Score=69.89 Aligned_cols=266 Identities=9% Similarity=-0.027 Sum_probs=138.0 Q ss_pred Cc---------cccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcc Q lcl|Aclame:pro 1 MV---------TSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNV 70 (296) Q Consensus 1 ~~---------~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdV 70 (296) +. .-.-...-+.++.++=+...--++.+.+-+.+..-.-+++.-+..|+..++ ++.+|+..-...+. -| T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v 166 (392) T protein:vir:10 88 PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFA-EI 166 (392) T ss_pred cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccce-ee Confidence 00 000000011112222222222344555555555545556666777776544 45677765555564 89 Q ss_pred cCCceech-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 71 PEGEVIPL-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 71 aEGe~Ipl-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) +||+++|- +..+.. ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+|+++++.-|+....+++.+...+ T Consensus 167 ~E~~~~~~~~~~~~~---~v~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~ 242 (392) T protein:vir:10 167 TEMGEIPETDNPKFS---NVQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS 242 (392) T ss_pred cccccccccccccce---eEEeeeeeEEEeehhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC Confidence 99999984 445543 4778889998875 9999876654 357788899999999999999998887766555455 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec----hhhhhhhheeEEEE--eccCCCce Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG----LTYLVDFTGTVIIS--TNDVTKGE 221 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg----~tyl~nfLG~~II~--S~kV~~G~ 221 (296) .+.+..++.. .........-+.++||.+.+.+++-.+-.-+..|- ...-..++|..+|. ++..+.. T Consensus 243 ~d~i~~~~~~-------~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~- 314 (392) T protein:vir:10 243 LDDIKDVLNV-------KLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS- 314 (392) T ss_pred HHHHHHHHHH-------hhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCC- Confidence 5666555532 22322234567899999988765422111122221 01111267865433 2222221 Q ss_pred EEEEcccceEEEEecCcc-------hhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNN-------SELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~-------g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) ..+..+...+.|.|.+. +++.=.++ +..|++++.+..+. .+. +=..++++ T Consensus 315 -~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-------------d~~---v~~~~a~~ 377 (392) T protein:vir:10 315 -KGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRD-------------DVQ---MWDNEAAV 377 (392) T ss_pred -CcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-------------ccE---EecccceE Confidence 12233333344433221 11110010 11122222222211 122 22357899 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) ++++++.. T Consensus 378 ~l~~~~~a 385 (392) T protein:vir:10 378 YGEIDLSA 385 (392) T ss_pred EEEecccc Confidence 98885544 No 83 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.75 E-value=1e-09 Score=69.89 Aligned_cols=266 Identities=9% Similarity=-0.027 Sum_probs=138.0 Q ss_pred Cc---------cccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcc Q lcl|Aclame:pro 1 MV---------TSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNV 70 (296) Q Consensus 1 ~~---------~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdV 70 (296) +. .-.-...-+.++.++=+...--++.+.+-+.+..-.-+++.-+..|+..++ ++.+|+..-...+. -| T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v 166 (392) T protein:vir:10 88 PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFA-EI 166 (392) T ss_pred cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccce-ee Confidence 00 000000011112222222222344555555555545556666777776544 45677765555564 89 Q ss_pred cCCceech-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 71 PEGEVIPL-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 71 aEGe~Ipl-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) +||+++|- +..+.. ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+|+++++.-|+....+++.+...+ T Consensus 167 ~E~~~~~~~~~~~~~---~v~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~ 242 (392) T protein:vir:10 167 TEMGEIPETDNPKFS---NVQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS 242 (392) T ss_pred cccccccccccccce---eEEeeeeeEEEeehhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC Confidence 99999984 445543 4778889998875 9999876654 357788899999999999999998887766555455 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec----hhhhhhhheeEEEE--eccCCCce Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG----LTYLVDFTGTVIIS--TNDVTKGE 221 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg----~tyl~nfLG~~II~--S~kV~~G~ 221 (296) .+.+..++.. .........-+.++||.+.+.+++-.+-.-+..|- ...-..++|..+|. ++..+.. T Consensus 243 ~d~i~~~~~~-------~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~- 314 (392) T protein:vir:10 243 LDDIKDVLNV-------KLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS- 314 (392) T ss_pred HHHHHHHHHH-------hhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCC- Confidence 5666555532 22322234567899999988765422111122221 01111267865433 2222221 Q ss_pred EEEEcccceEEEEecCcc-------hhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNN-------SELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~-------g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) ..+..+...+.|.|.+. +++.=.++ +..|++++.+..+. .+. +=..++++ T Consensus 315 -~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-------------d~~---v~~~~a~~ 377 (392) T protein:vir:10 315 -KGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRD-------------DVQ---MWDNEAAV 377 (392) T ss_pred -CcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-------------ccE---EecccceE Confidence 12233333344433221 11110010 11122222222211 122 22357899 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) ++++++.. T Consensus 378 ~l~~~~~a 385 (392) T protein:vir:10 378 YGEIDLSA 385 (392) T ss_pred EEEecccc Confidence 98885544 No 84 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.75 E-value=1e-09 Score=69.89 Aligned_cols=266 Identities=9% Similarity=-0.027 Sum_probs=138.0 Q ss_pred Cc---------cccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcc Q lcl|Aclame:pro 1 MV---------TSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNV 70 (296) Q Consensus 1 ~~---------~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdV 70 (296) +. .-.-...-+.++.++=+...--++.+.+-+.+..-.-+++.-+..|+..++ ++.+|+..-...+. -| T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v 166 (392) T protein:vir:10 88 PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFA-EI 166 (392) T ss_pred cccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccce-ee Confidence 00 000000011112222222222344555555555545556666777776544 45677765555564 89 Q ss_pred cCCceech-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 71 PEGEVIPL-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 71 aEGe~Ipl-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) +||+++|- +..+.. ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+|+++++.-|+....+++.+...+ T Consensus 167 ~E~~~~~~~~~~~~~---~v~l~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~ 242 (392) T protein:vir:10 167 TEMGEIPETDNPKFS---NVQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS 242 (392) T ss_pred cccccccccccccce---eEEeeeeeEEEeehhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC Confidence 99999984 445543 4778889998875 9999876654 357788899999999999999998887766555455 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec----hhhhhhhheeEEEE--eccCCCce Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG----LTYLVDFTGTVIIS--TNDVTKGE 221 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg----~tyl~nfLG~~II~--S~kV~~G~ 221 (296) .+.+..++.. .........-+.++||.+.+.+++-.+-.-+..|- ...-..++|..+|. ++..+.. T Consensus 243 ~d~i~~~~~~-------~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~- 314 (392) T protein:vir:10 243 LDDIKDVLNV-------KLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS- 314 (392) T ss_pred HHHHHHHHHH-------hhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCC- Confidence 5666555532 22322234567899999988765422111122221 01111267865433 2222221 Q ss_pred EEEEcccceEEEEecCcc-------hhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNN-------SELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~-------g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) ..+..+...+.|.|.+. +++.=.++ +..|++++.+..+. .+. +=..++++ T Consensus 315 -~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-------------d~~---v~~~~a~~ 377 (392) T protein:vir:10 315 -KGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRD-------------DVQ---MWDNEAAV 377 (392) T ss_pred -CcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-------------ccE---EecccceE Confidence 12233333344433221 11110010 11122222222211 122 22357899 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) ++++++.. T Consensus 378 ~l~~~~~a 385 (392) T protein:vir:10 378 YGEIDLSA 385 (392) T ss_pred EEEecccc Confidence 98885544 No 85 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.75 E-value=3.5e-10 Score=72.44 Aligned_cols=254 Identities=12% Similarity=0.034 Sum_probs=136.7 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceec-h Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ip-l 78 (296) |.+ .+++..+.. .--.|.++|-+.+..-.-++..-+.+|+++++ ++.+|+..-...+. -|+||+++| . T Consensus 123 ~~~-~~~~~gg~l--------vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~Eg~~~~~~ 192 (397) T protein:vir:12 123 MSG-INDEDGGIL--------IPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFS-PVEELGNLPEI 192 (397) T ss_pred ccc-cccccCccc--------CchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCccee-eeccccccccc Confidence 211 122222221 12234455544444444555666777887653 67788776666674 999999998 5 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHH Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALA 156 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala 156 (296) +..+.. ..+++.+|++..+ |.|.++.+++ +-.+.-.++|+.+++++++.-|+.-..+++.+...+.+++..++. T Consensus 193 ~~~~~~---~v~~~~~k~~~~~~is~e~l~ds~~-~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~~~~i~~~~~ 268 (397) T protein:vir:12 193 DQPRFT---KVSYSIIDYGGIMTLSNSMLNDSDQ-AIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDIDGLDGIKKALN 268 (397) T ss_pred ccccce---eEEeeheeeEeeehhhHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHHHHHHHHh Confidence 655553 4678889998876 9999876665 456778899999999999999998776555433344444544442 Q ss_pred HHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCC----ce--EEEE Q lcl|Aclame:pro 157 SAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTK----GE--IWAT 225 (296) Q Consensus 157 ~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~----G~--~~~t 225 (296) . ..........+.++||.+.+.+++-.+-.-+..| +++- ..++|..|+.++.... |+ +++ T Consensus 269 ~-------~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~-~~l~G~pv~~~~~~~~~~~~~~~~~~~- 339 (397) T protein:vir:12 269 V-------TLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTK-KLLDGRPVVPFTNRVLKTQKGKAPLII- 339 (397) T ss_pred h-------ccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCC-ccccceeeEEecccccccCCCccEEEE- Confidence 1 1222233456889999998876542111111122 1111 1278988776654222 22 221 Q ss_pred cccceEEEEecCcchhhhh-----hhc-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 226 VPENIIFAYINPNNSELAK-----EFN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 226 ~~~Nl~~ay~~~~~g~~~~-----~f~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) .|..-+|+-...+++.= .+. +..|.++|.+..+. .+..+ +.+++++++++.- T Consensus 340 --gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~-------------d~~~~---~~~a~~~~~~t~~ 397 (397) T protein:vir:12 340 --GNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIERE-------------DVRKW---DEDAVVFGQITVE 397 (397) T ss_pred --EehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-------------ccEEe---cccceEEEEEeeC Confidence 11221221110011110 010 11233333333221 22222 3467777777777 No 86 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.75 E-value=1.3e-09 Score=69.31 Aligned_cols=268 Identities=12% Similarity=0.008 Sum_probs=139.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCce----- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEV----- 75 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~----- 75 (296) |-++.++.-..|+ .+++ .+++-+.+.+-.-++..-+.+|+. +.++++|+++....|. -|+||+. T Consensus 1 ma~~t~~~gg~li-P~~~--------~~~Ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~-wv~E~~~~~~~~ 69 (305) T protein:vir:25 1 MADISRAEVASLI-QEAY--------SDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEAD-WVGESATDPKGV 69 (305) T ss_pred CCCccCCccceec-CHHH--------HHHHHHHHHhhchhhhhcceeecc-CCcEEEEEEeCCcceE-Eeeccccccccc Confidence 5444333333332 3233 333333333333345556788885 4479999998777774 8999985 Q ss_pred echhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHh--------------- Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALK--------------- 138 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLk--------------- 138 (296) ||.++.+.. ..+++.+|++..+ |.|.++.+.+ +....-.++|+++++++++..||.=-. T Consensus 70 ~~~s~~~f~---~i~~~~~k~~~~~~is~ell~ds~~-~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~ 145 (305) T protein:vir:25 70 KPTSKVTWA---NRTLVAEEIAVIIPVHENVIDDATV-AVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAV 145 (305) T ss_pred cccccccee---eEEeeeEEEEEeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccc Confidence 677776664 4678889988865 9999865554 567888999999999999999984211 Q ss_pred cCccce--ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEecc Q lcl|Aclame:pro 139 TGTGTQ--DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTND 216 (296) Q Consensus 139 tat~t~--~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~k 216 (296) .+.+.. ......... +...+.++...........--.++||.+.+.+++-.+-.-+..|.-. .++|..++.++. T Consensus 146 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~---~l~G~Pv~~~~~ 221 (305) T protein:vir:25 146 TAGQAVEVVGGVANESD-IVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD---SFAGFRTFFNRN 221 (305) T ss_pred cccccccccccchhhhH-HHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC---cccccceEEcCc Confidence 110100 011111111 22333333333443333333478899999987653332223334221 267988888888 Q ss_pred CCC----ceEEEEcccceEEEEecCcch---hhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEE Q lcl|Aclame:pro 217 VTK----GEIWATVPENIIFAYINPNNS---ELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVK 289 (296) Q Consensus 217 V~~----G~~~~t~~~Nl~~ay~~~~~g---~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~ 289 (296) ++. +.+++-...+. ++-..+| +..+...+..+++..--+.++.-.-|+... +...+ -+..++++ T Consensus 222 ~~~~~~~~~~~~gd~s~~---~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r---~~~~v---~~p~a~v~ 292 (305) T protein:vir:25 222 GAWDADAAIEVIADSSRV---KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKAR---FAYVL---GVSATAQG 292 (305) T ss_pred cCCCCCccEEEEEecceE---EEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEe---eccee---eCcccEEE Confidence 763 44554443332 2222111 111111222233322212211111111110 01112 23447788 Q ss_pred EEecC--CC Q lcl|Aclame:pro 290 VTLTP--GV 296 (296) Q Consensus 290 ~tI~~--~v 296 (296) ++..+ +| T Consensus 293 ~~~~~~~~~ 301 (305) T protein:vir:25 293 ANKTPVAVV 301 (305) T ss_pred Ecccccccc Confidence 87753 24 No 87 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.69 E-value=1.2e-09 Score=69.45 Aligned_cols=254 Identities=11% Similarity=0.020 Sum_probs=136.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeee-ecccCcccCCceech Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDV-TLAEGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yi-g~A~gdVaEGe~Ipl 78 (296) -.++.+.+.-+.+. -.++.+.+-+.+..-.-+++.-+.+|++.+. .+.+|++.-. +.+. -|+||+.+|- T Consensus 115 a~~~~t~~~gg~~v--------P~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~E~~~~~~ 185 (408) T protein:vir:10 115 TETSGSDSAAGLTI--------PQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTV-MDAEDGKIPD 185 (408) T ss_pred hhhcccccCCceec--------cHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecccccccee-eecCcccccc Confidence 11112222222222 2344455555555555566667777876543 3556665433 3443 7999999994 Q ss_pred -hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-ecchhhHHHH Q lcl|Aclame:pro 79 -SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-DALGAGLQGA 154 (296) Q Consensus 79 -skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-~~t~~~lQ~A 154 (296) +..+.. ..+++.||++..+ |.|.++.+++ +-.+.-.++|+..++.+++..|++-..+++... ..+.+.+..+ T Consensus 186 ~~~~~~~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~~~~~~~~l~~~ 261 (408) T protein:vir:10 186 LDNPQLT---IIKYLIKRYAGIITATNTSLKDTAE-NILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITM 261 (408) T ss_pred ccCccee---eEEeeeeeEEeeehhHHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHH Confidence 545543 4788899999875 9999865544 457788999999999999999998887665432 2233344333 Q ss_pred HHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEec--cCCCceEEEEcc Q lcl|Aclame:pro 155 LASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTN--DVTKGEIWATVP 227 (296) Q Consensus 155 la~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~--kV~~G~~~~t~~ 227 (296) +.. ..........+.++||.+...+++-.+-.-+..| +++- ..++|..|+.+. -+|... . T Consensus 262 ~~~-------~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~-~~l~G~PV~~~~~~~~~~~~-----~ 328 (408) T protein:vir:10 262 INT-------AVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-YLIKGKQVIVVADRWLPNTG-----S 328 (408) T ss_pred HHH-------hhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCC-ceecceeeEEecccccCccC-----C Confidence 321 1222222346889999998876653322222222 1111 137897766643 344321 2 Q ss_pred cceEEEEecCcc-------hhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecC Q lcl|Aclame:pro 228 ENIIFAYINPNN-------SELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTP 294 (296) Q Consensus 228 ~Nl~~ay~~~~~-------g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~ 294 (296) ++..++|.|++. +++.=.+. +..|.++|.+..+ +.+..+. .++++++++++ T Consensus 329 ~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r-------------~d~~v~~---~~a~~~~~~~~ 392 (408) T protein:vir:10 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR-------------FDVKATD---SEALVAGSFSA 392 (408) T ss_pred CceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEe-------------eccEEec---cccEEEEEeec Confidence 233344444321 11110000 1112222222111 2333444 47788888877 Q ss_pred CC Q lcl|Aclame:pro 295 GV 296 (296) Q Consensus 295 ~v 296 (296) +. T Consensus 393 ~~ 394 (408) T protein:vir:10 393 IA 394 (408) T ss_pred cc Confidence 54 No 88 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.68 E-value=1.1e-08 Score=64.16 Aligned_cols=263 Identities=13% Similarity=0.037 Sum_probs=132.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-+..+.. -+.+..+.+ .+++-+.+.+-.-++.+-+.+||. +..+++|++.....|. -|+||++||.++ T Consensus 1 Ma~~~~~~-gg~~vP~~~--------~~~ii~~l~~~s~i~~l~~~i~~~-~~~~~ip~~~~~~~a~-wv~Eg~~~~~s~ 69 (315) T protein:vir:80 1 MADDFLSA-GKLELPGSM--------IGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAK-IVGEGEVKPSAS 69 (315) T ss_pred CCCCcCCc-CceEcchHH--------HHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceE-EeeCCccccccc Confidence 76554332 334433333 333322222222234445667776 4578999998777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchh----HHHHHHHHHHHHhhhhHHHHHHHhcCccc----------- Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAV----TNTDNALVRQLQKKIRTDFVTALKTGTGT----------- 143 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav----~etd~QL~~~iq~kIdnD~~~aLktat~t----------- 143 (296) .+.. ..+++.||++.-+ |.|.++. ...+.+ ..-.++|+++|++++|.-|+.--..+++. T Consensus 70 ~~f~---~v~l~~~kl~~~~~iS~ell~~-s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~ 145 (315) T protein:vir:80 70 VDVS---AFTAQPIKVVTQQRVSDEFMWA-DADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNK 145 (315) T ss_pred ccee---eeEeeeeeEEeeehhhHHHhhc-CchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccc Confidence 8764 5778889988764 9999854 444443 45567888888999998888542211110 Q ss_pred ----eecchhhHHHHHHHHHHHHHHhhccc-cCcceEEEEcHHHHHHHhcCC-----ccccceee----chhhhhhhhee Q lcl|Aclame:pro 144 ----QDALGAGLQGALASAWGKLQVLFEDY-GSERAIVFANSLDVAEYIAKA-----GITTQTAF----GLTYLVDFTGT 209 (296) Q Consensus 144 ----~~~t~~~lQ~Ala~~~~~~~~~Fede-d~~~~VlFvNP~Daa~~l~~a-----~i~~q~~f----g~tyl~nfLG~ 209 (296) ...+... +..+.++...+... ....-..++||...+.+++-. +...+..+ .+... .++|. T Consensus 146 ~~~~~~~~~~~-----~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~-tl~G~ 219 (315) T protein:vir:80 146 TKNIVDATDSA-----TADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLD-NWRGL 219 (315) T ss_pred ccceeeccccc-----hHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCc-eecce Confidence 0111111 11223333344332 223346889999988776432 11112221 11112 38899 Q ss_pred EEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhc-------c----cc--ccccceEEEeccccceeehhhhhhHH Q lcl|Aclame:pro 210 VIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFN-------L----YG--DPTGYIGMNHFQENTTLTIQTLLVSG 276 (296) Q Consensus 210 ~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~-------~----~t--d~tGliGv~h~~~~~~~t~et~~~~~ 276 (296) .|+.++.+|.+.......... +++ ||+++.+- + +. |.++.==+.|+.-.-+... -+.+ T Consensus 220 PV~~~~~~~~~~~~~~~~~~~-~~~-----GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~---r~~~ 290 (315) T protein:vir:80 220 NVGASSTVSGAPEMSPASGVK-AIV-----GDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEA---VLYV 290 (315) T ss_pred eeEecCcCCcccccccccccE-EEE-----eecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEE---Eecc Confidence 999999999765432222211 111 22222110 0 00 1111000000000000000 0112 Q ss_pred HHhhhhccceEEEEEecC-CC Q lcl|Aclame:pro 277 MLMYPERIDGIVKVTLTP-GV 296 (296) Q Consensus 277 ~~lfpE~~dgvv~~tI~~-~v 296 (296) . +.+.+++++++... |. T Consensus 291 ~---v~~~~a~~~l~~~~a~~ 308 (315) T protein:vir:80 291 A---IESLDSFAVVKEKAAPK 308 (315) T ss_pred e---eecccceEEEeeccCCC Confidence 2 23445677776533 44 No 89 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.64 E-value=1.3e-09 Score=69.31 Aligned_cols=261 Identities=10% Similarity=0.031 Sum_probs=136.0 Q ss_pred Ccccccccc---ccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeee-ecccCcccCCce Q lcl|Aclame:pro 1 MVTSRTYPE---ENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDV-TLAEGNVPEGEV 75 (296) Q Consensus 1 ~~~~~~~ae---~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yi-g~A~gdVaEGe~ 75 (296) ++.++--.+ -+..++++=+...--++.+++-+.+..-.-+++.-+..|+.+++ .+.+|+|.-. +.+. .|+||+. T Consensus 97 ~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~E~~~ 175 (397) T protein:vir:49 97 LVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLAN-IDDEAGK 175 (397) T ss_pred HHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCccee-eecCccc Confidence 111110000 01111222222222234445544444444555566677775432 4567777543 4464 8999999 Q ss_pred ec-hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcccee-cchhhH Q lcl|Aclame:pro 76 IP-LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD-ALGAGL 151 (296) Q Consensus 76 Ip-lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~-~t~~~l 151 (296) +| .+..+.. ..+++.+|++..+ |.|.++.+.+ +-.+.-.++|+.+++.+++..++.-..+++.... .+.+.+ T Consensus 176 ~~~~~~~~~~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~~~~~d~i 251 (397) T protein:vir:49 176 IADVDDPKLS---LIKYTIKRYAGISTVTNSLLADSAE-NILAWLSGWIAKKVVVTRNKAILEAIAALPTKPTLTKWDDI 251 (397) T ss_pred ccccccccee---eEEeeeeeEEeeehhHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccHHHH Confidence 99 5776664 4788899999875 9999976654 4677889999999999999999988765553221 233333 Q ss_pred HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEE--eccCCCceEEE Q lcl|Aclame:pro 152 QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIIS--TNDVTKGEIWA 224 (296) Q Consensus 152 Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~--S~kV~~G~~~~ 224 (296) .. .+.++.. ......+.++||.+...+++-.+-.-+..| +++-. .++|..|+. +.-++.+. T Consensus 252 ~~----~~~~l~~----~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-~l~G~PV~~~~~~~~~~~~--- 319 (397) T protein:vir:49 252 ID----LEAKVDP----AIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGY-SIDGFAVKEVADRWLANGT--- 319 (397) T ss_pred HH----HHHhhhh----hhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCc-eecceeeEEeccccccccc--- Confidence 32 2223322 223456889999998877543221112222 22222 277876654 33344332 Q ss_pred EcccceEEEEecCc-------chhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEE Q lcl|Aclame:pro 225 TVPENIIFAYINPN-------NSELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVT 291 (296) Q Consensus 225 t~~~Nl~~ay~~~~-------~g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~t 291 (296) .++..+.|.|.+ .+++.=.+. +..|.+++.+.. -+.+..+ +.+++++++ T Consensus 320 --~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~-------------r~d~~~~---~~~a~~~~~ 381 (397) T protein:vir:49 320 --GGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVID-------------RFDVVAT---DTEAFVPAS 381 (397) T ss_pred --CCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEe-------------eeCcEEe---cccceEEEE Confidence 122333333321 111111110 111222222221 1223333 447889999 Q ss_pred ecCCC Q lcl|Aclame:pro 292 LTPGV 296 (296) Q Consensus 292 I~~~v 296 (296) ++++. T Consensus 382 ~~~~~ 386 (397) T protein:vir:49 382 FKAIA 386 (397) T ss_pred eeccc Confidence 87766 No 90 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.63 E-value=5.7e-09 Score=65.76 Aligned_cols=264 Identities=11% Similarity=0.027 Sum_probs=136.5 Q ss_pred Cccccccc-cccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecc-cCcccCCceech Q lcl|Aclame:pro 1 MVTSRTYP-EENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLA-EGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~~a-e~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A-~gdVaEGe~Ipl 78 (296) +..-.... +.+.+++++=+...-.++.+.+-+-+....-+++.-+.+||..| ++++|.+.....+ -+.++||.+||- T Consensus 104 ~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~ 182 (421) T protein:vir:13 104 IRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKDTELVK 182 (421) T ss_pred hhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeeccccccccc Confidence 11111111 11223333333333445556655555555555666677888776 4677766554332 135899999999 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-ecchhhHHHHH Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-DALGAGLQGAL 155 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-~~t~~~lQ~Al 155 (296) ++++.. ..+++++|++.-+ |.|.++.+.+ +-.+.-.++|+.++..+++.+++..+++..... ..+.+.+-.+ T Consensus 183 s~~~f~---~i~~~~~k~~~~v~iS~ell~ds~~-~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~~~~~~~~d~i~~~- 257 (421) T protein:vir:13 183 AMLKTQ---PMAYDIDDYGLLAPIDNSLLEDSEI-NFLEFVNEEFAEFAVNTENAEIVKQAKAVLAEETINDYAGLVKT- 257 (421) T ss_pred ccccee---EEEeeeeeeEeehhhhHHHHhhhHH-HHHHHHHHHHHHHHHHHhhhhHhhhhhhccccccccchHHHHHH- Confidence 988864 4788899999875 9999876655 356778899999999999999998876553221 1222333322 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee----chhhhhhhheeEEEEeccCCCceEEEEcccceE Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF----GLTYLVDFTGTVIISTNDVTKGEIWATVPENII 231 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f----g~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~ 231 (296) +.+ .........+.++||.+...+++-.+-.-+..| +++-. -++|..|+.+...+.+. .++-. T Consensus 258 ---~~~----l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~-tl~G~pV~~~~~~~~~~-----~~~~~ 324 (421) T protein:vir:13 258 ---INS----LVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDL-VFKGRPVIELEESIFDV-----GDETK 324 (421) T ss_pred ---HHH----hhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCc-eecceeeEEeccccccC-----CCceE Confidence 223 333333456889999998876642222222222 11111 27898888888776542 12233 Q ss_pred EEEecCcc-------hhhhhhh----ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 232 FAYINPNN-------SELAKEF----NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 232 ~ay~~~~~-------g~~~~~f----~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ++|.|.+. +++.=.. .+..|.+++-+..+ +.+..+-||.....+..+..+-| T Consensus 325 ~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r-------------~d~~~~~~~a~~~~~~~~~~a~v 387 (421) T protein:vir:13 325 FIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIER-------------FDVNSPLDKSSDAEKIRKFGVIV 387 (421) T ss_pred EEEEeccccEEEEEecceEEEeecccccccCeeEEEEEee-------------ecceeecchhhheeeecccceee Confidence 33333221 1111000 01111111111110 11122223333322222211111 No 91 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=98.63 E-value=9e-09 Score=64.68 Aligned_cols=259 Identities=10% Similarity=-0.016 Sum_probs=132.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) ..+..+++.-+.+..+ ++.+.+-+.+..-.-++..-+.+||+.| ++++|.+...+.....++||.++| .+ T Consensus 108 ~~~~~t~~~gg~~vP~--------~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~ 178 (389) T protein:vir:10 108 ATSKVTSTEAGVLIPE--------EIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAENPKLA 178 (389) T ss_pred hhcccccCCcceeehH--------HHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCccccccccccccccc Confidence 2222222333333333 3334444444444445566677788655 477777765544445899999998 56 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALAS 157 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~ 157 (296) ..+.. ..+++++|+++-+ |.|.++.+.+ +-.+.-.++|+..+....+..|+..+.+++.....+..+.. .+.. T Consensus 179 ~~~~~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~~d-~l~~ 253 (389) T protein:vir:10 179 EPEFN---KVDWSVATYRGAIPLSEEAIADSAV-DLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKKTTTDTLVD-SLKH 253 (389) T ss_pred cccce---eeeeeheeeEeeehhhHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccHH-HHHH Confidence 66543 5788899999865 9999876665 35567788999999999999999888766543222222222 1222 Q ss_pred HHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhh--------hhhhheeEEEEecc-CCCceEEEEccc Q lcl|Aclame:pro 158 AWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTY--------LVDFTGTVIISTND-VTKGEIWATVPE 228 (296) Q Consensus 158 ~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~ty--------l~nfLG~~II~S~k-V~~G~~~~t~~~ 228 (296) .+... -+.. ..-+.++||.+...+++-.+-.-+..|...+ -.-++|..|+.+.. .+.+. .+ T Consensus 254 ~~~~~----~~~~-~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-----~~ 323 (389) T protein:vir:10 254 ILNVD----LDPA-YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSL-----AG 323 (389) T ss_pred HHHhh----hhhh-hCcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCC-----CC Confidence 22111 1111 1357889999988776432211122221111 01278987755443 23211 12 Q ss_pred ceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhh-----hHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLL-----VSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~-----~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +..++|.|. ++++.+.. ..| +-+.- +++..+.+.+ +.+..+ ..+++++.++++.. T Consensus 324 ~~~~~~gd~-----~~~~~~~~-~~~-~~i~~---~~~~~~~~~~~~~~r~d~~~~---~~~a~~~~~~~~~~ 383 (389) T protein:vir:10 324 DQKAFVGDL-----KRGVLFTD-RQQ-VTLAW---EDSKIYGKYLGAAFRFGVQKA---DSKAGYFVTNTDVP 383 (389) T ss_pred ceEEEEeec-----cccEEEEe-ecc-eEEEe---eccccccceEEEEEEeccEEe---cccceEEEEeeccC Confidence 223333332 22211111 111 00100 0111122211 112222 23467788887666 No 92 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.62 E-value=5.6e-09 Score=65.82 Aligned_cols=269 Identities=11% Similarity=0.078 Sum_probs=134.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCc-ccccccCCCCeeeeeeeeeeecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGV-TRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgV-tr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ipls 79 (296) .+++.+...-..+..+++ +=+|+++.. .-.-++.+ .|.+|+..| .+++|.++....+. -|+||+.+|.+ T Consensus 131 ~~~~~t~~~gg~~vP~~~----~~~ii~~l~----~~~~i~~~~~~~~~~~~~-~~~~p~~~~~~~a~-~v~E~~~~~~~ 200 (435) T protein:vir:14 131 SLNTLSPGAGGVLVPENL----SSEVIELLR----PKSVVRKLGARTLPLSNG-NITIPRLKGGAIVG-YIGADTDIPTT 200 (435) T ss_pred hcccCCcCCCccccchhH----HHHHHHHHh----hhchhhhhcceeeecCCC-ceEEEEEeCCccee-eeccCcccccc Confidence 223333222222333332 112333222 11112222 567788877 58999997666664 89999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCc-hhHHHHHHHHHHHHhhhhHHHHHHHhcCc-----------cc-- Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNE-AVTNTDNALVRQLQKKIRTDFVTALKTGT-----------GT-- 143 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygd-av~etd~QL~~~iq~kIdnD~~~aLktat-----------~t-- 143 (296) ..+.. ..+++.+|++..+ |.|.++.++++. ..+.-.++|..+|+.+++..|+.--.++. +. T Consensus 201 ~~~f~---~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~ 277 (435) T protein:vir:14 201 QQQFD---DLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVI 277 (435) T ss_pred cccee---EEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccee Confidence 98764 4778889999875 999998888864 44667899999999999999984221110 00 Q ss_pred eecchhhHHHHHHHHHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCccccceeec-hhhhhhhheeEEEEeccCCCc Q lcl|Aclame:pro 144 QDALGAGLQGALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITTQTAFG-LTYLVDFTGTVIISTNDVTKG 220 (296) Q Consensus 144 ~~~t~~~lQ~Ala~~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg-~tyl~nfLG~~II~S~kV~~G 220 (296) ....+.+... +...+.++...+..- .....+.++||.+...+++-.+-.-+..|. .+-. -++|..|+.++.+|.. T Consensus 278 ~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g-~l~G~Pv~~~~~~p~~ 355 (435) T protein:vir:14 278 TASDASTLQK-IETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANG-MLKGYPVGKTTQVPIN 355 (435) T ss_pred ccccccchhh-HHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCCCCC-eeecceeEeecccccc Confidence 0111222222 223445555555432 223567899999998765433222223331 1111 2679999999999875 Q ss_pred eEEEEcccceEEEEecCcchhhhhhhccccccccceEEEecccccee----ehhhhhhHH--HHhhhhccceEEEEEecC Q lcl|Aclame:pro 221 EIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTL----TIQTLLVSG--MLMYPERIDGIVKVTLTP 294 (296) Q Consensus 221 ~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~----t~et~~~~~--~~lfpE~~dgvv~~tI~~ 294 (296) ...... .-.++|.|. ++.+ + .|. |=|.+.-+...... +.-.++... .+..=+|.|+-+ .-.. T Consensus 356 ~~~~~~--~~~i~~gd~-----s~~~-i-~~~-~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~--~~~~ 423 (435) T protein:vir:14 356 LGETGK--ESEIYFTDF-----GDVF-I-GEE-ETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGP--RHVE 423 (435) T ss_pred ccCCCc--cceEEEeec-----ccEE-E-EEe-cccEEEEeccccccccccchhhhhhcChhheeeeeeeCcee--eccc Confidence 322211 112334332 2221 0 000 00111110000000 000000000 112234555522 2222 Q ss_pred CC Q lcl|Aclame:pro 295 GV 296 (296) Q Consensus 295 ~v 296 (296) ++ T Consensus 424 a~ 425 (435) T protein:vir:14 424 SI 425 (435) T ss_pred ce Confidence 22 No 93 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.62 E-value=1.8e-09 Score=68.52 Aligned_cols=270 Identities=8% Similarity=-0.014 Sum_probs=135.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee---ecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV---TLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi---g~A~gdVaEGe~Ip 77 (296) +---+.......+.+.+-+...--+|.+++-+.+..-.-++++-+..||..+. .++|.|... ..+ .-|+||+.+| T Consensus 98 ~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a-~~v~E~~~~~ 175 (395) T protein:vir:38 98 VKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSH-GSRVYEKLADITPLK-DLDDESALIG 175 (395) T ss_pred HHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCc-ceEEEEeeccCCccc-cccccccccc Confidence 00000000001111222222223345555555555555566666677775543 234444222 233 3699999999 Q ss_pred h-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc-eecchhhHHH Q lcl|Aclame:pro 78 L-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT-QDALGAGLQG 153 (296) Q Consensus 78 l-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t-~~~t~~~lQ~ 153 (296) - +..+.. ..+++.+|++..+ |.|.++.++++ -.+.-.++|+..+..+++..|+.-..+++.. ...+.+.+.. T Consensus 176 ~~~~~~f~---~v~~~~~k~~~~~~iS~ell~ds~~~-l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~~i~~ 251 (395) T protein:vir:38 176 DNDDPELT---VVKYLIHRYAGITTVTNTLLKDTVDN-IIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPTISQFDNIKD 251 (395) T ss_pred ccccccee---eEEeeeeeeEeehhhHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHH Confidence 4 556654 4678889999875 99999766663 3678899999999999999999877555432 2223344443 Q ss_pred HHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCceEEEEccc Q lcl|Aclame:pro 154 ALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGEIWATVPE 228 (296) Q Consensus 154 Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~~~~t~~~ 228 (296) ++.. ..........+.++||.+...+++-.+-.-+..| +++-. .++|..|+.+..++.+.. .+ T Consensus 252 ~~~~-------~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~----~~ 319 (395) T protein:vir:38 252 LENN-------TLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKY-LIDGKPVIRIADKWLPDV----SG 319 (395) T ss_pred HHHH-------hhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcc-eeccceeEEecccccCcC----CC Confidence 3321 1112222456889999998876543221112222 11111 278988888876654431 12 Q ss_pred ceEEEEecCcchhhhhhhccccccccceEEEecc----ccceeehhhhh-hHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQ----ENTTLTIQTLL-VSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~----~~~~~t~et~~-~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .-.++|.|.+ +.+.+...+-.-|-+.... ..+...+-... +.+. +-+.+++++++++++. T Consensus 320 ~~~i~~gd~~-----~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~---~~~~~a~~~~~~~~~~ 384 (395) T protein:vir:38 320 SHPLYFGDLK-----QGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQ---LIDDGAFAAASFKTVA 384 (395) T ss_pred cceEEEEecc-----ccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccE---EecccceEEEEeeccc Confidence 2233444432 1111111000001111000 00000000000 1122 2345788999998777 No 94 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.61 E-value=6.9e-09 Score=65.32 Aligned_cols=261 Identities=11% Similarity=0.048 Sum_probs=136.4 Q ss_pred Cccccccc---cccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeee-ecccCcccCCce Q lcl|Aclame:pro 1 MVTSRTYP---EENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDV-TLAEGNVPEGEV 75 (296) Q Consensus 1 ~~~~~~~a---e~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yi-g~A~gdVaEGe~ 75 (296) ++.+.-.. .....++++=+...--++.+.+-+.+..-.-++..-+..||..++ ++.+|+|... +.+. -|+||+. T Consensus 97 ~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~E~~~ 175 (397) T protein:vir:49 97 LVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAK-LDDEGGQ 175 (397) T ss_pred HhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCccee-eeccccc Confidence 11111000 000111111111112234445544444444555566777777664 4667777543 3453 7999999 Q ss_pred echhh-eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-ecchhhH Q lcl|Aclame:pro 76 IPLSK-VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-DALGAGL 151 (296) Q Consensus 76 Iplsk-v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-~~t~~~l 151 (296) +|-+. .+.. ..+++.+|+++-+ |.|.++.+.+ +-.+.-.++|..+++.+++.-|+.-..+++... ..+.+++ T Consensus 176 ~~~~~~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~~~~~~~d~i 251 (397) T protein:vir:49 176 IGQNDDPKLS---LIRYAIKRYAGISTVTNSLLADSAE-NILAWLSGWIAKKVVVTRNKAILEAIGTLPNKPTLAKWDDI 251 (397) T ss_pred ccccccccee---eeEeeeeeeEeehhhHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccCHHHH Confidence 99765 3443 4678899999875 9999876665 467789999999999999999998775554321 1233333 Q ss_pred HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEE--eccCCCceEEE Q lcl|Aclame:pro 152 QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIIS--TNDVTKGEIWA 224 (296) Q Consensus 152 Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~--S~kV~~G~~~~ 224 (296) .. ++.++ +.......+.++||.+.+.+++=.+-.-+..| +++.. .++|..|+. +.-+|.+.. T Consensus 252 ~~----~~~~l----~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~-~l~G~pV~~~~~~~~~~~~~-- 320 (397) T protein:vir:49 252 ID----LQAKV----DPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGY-SIDGFVVKEISDRFLPNGTG-- 320 (397) T ss_pred HH----HHHhh----hhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCc-eecceeeEEecccccccccC-- Confidence 32 23333 33233456899999998866432211112222 12211 378876554 444453321 Q ss_pred EcccceEEEEecCc-------chhhhhhhc------cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEE Q lcl|Aclame:pro 225 TVPENIIFAYINPN-------NSELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVT 291 (296) Q Consensus 225 t~~~Nl~~ay~~~~-------~g~~~~~f~------~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~t 291 (296) ++..++|.|.+ .+++.=.+. +..|.+++.+..+ +.+. +-+.+++++++ T Consensus 321 ---~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r-------------~d~~---~~~~~a~~~~~ 381 (397) T protein:vir:49 321 ---GAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDR-------------FDVV---STDTEAFVPAS 381 (397) T ss_pred ---CceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEe-------------eccE---EecccceEEEE Confidence 22223333321 111111110 1122233322211 2222 23458999999 Q ss_pred ecCCC Q lcl|Aclame:pro 292 LTPGV 296 (296) Q Consensus 292 I~~~v 296 (296) +++++ T Consensus 382 ~~~~~ 386 (397) T protein:vir:49 382 FKAIA 386 (397) T ss_pred ecccc Confidence 98888 No 95 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.60 E-value=2.4e-09 Score=67.84 Aligned_cols=273 Identities=12% Similarity=0.039 Sum_probs=135.2 Q ss_pred Cccccccccc---------cc-eehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcc Q lcl|Aclame:pro 1 MVTSRTYPEE---------NL-IKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNV 70 (296) Q Consensus 1 ~~~~~~~ae~---------nl-~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdV 70 (296) ++..+...++ |- +...+.+...--.+.+.+-+.+..-.-++..-+.+|+.. ...++|.....+.|. -| T Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~a~-~v 220 (458) T protein:vir:10 143 VMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSS-KILTMLVEPDAGKAT-WV 220 (458) T ss_pred HHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCC-cceEEEEecCCccee-ec Confidence 1111111110 00 001111112122344444444444444556667778864 456677776666664 78 Q ss_pred cCCceechhheeeee---cceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC----- Q lcl|Aclame:pro 71 PEGEVIPLSKVERKI---HSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG----- 140 (296) Q Consensus 71 aEGe~Iplskv~~~~---~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta----- 140 (296) +||+.+|-+...... -...+++.+|++.-+ |.|.++.+.+ +-.+.-.++|+.+|..+++..|+.--.++ T Consensus 221 ~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~-~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi 299 (458) T protein:vir:10 221 AASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIF-SLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGL 299 (458) T ss_pred ccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccee Confidence 999988854321100 123567778888864 9999854443 46788999999999999999998521110 Q ss_pred ----c---cce--ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhc--CCc---ccc----ceeechhh Q lcl|Aclame:pro 141 ----T---GTQ--DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIA--KAG---ITT----QTAFGLTY 202 (296) Q Consensus 141 ----t---~t~--~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~--~a~---i~~----q~~fg~ty 202 (296) + ... ..++......-++.+.++...+........+.++||.+...++. +++ |.. ....+++- T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~ 379 (458) T protein:vir:10 300 LTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQV 379 (458) T ss_pred eecccccccceeecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcC Confidence 0 000 00111111111233344444444433345688999999886543 221 111 11111111 Q ss_pred hhhhheeEEEEeccCCCce----EEEE-cccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHH Q lcl|Aclame:pro 203 LVDFTGTVIISTNDVTKGE----IWAT-VPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGM 277 (296) Q Consensus 203 l~nfLG~~II~S~kV~~G~----~~~t-~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~ 277 (296) . .++|..|+.+..+|.+. +++- -.++.. .+| .+++.-..+-|+ .+|++++.-. .-+... T Consensus 380 ~-~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~--~~~--~~~~~v~~d~~~-~~~~~~~~~~----------~r~~~~ 443 (458) T protein:vir:10 380 G-RIYGLPVVVSEYFPAKANSAEFAVIVYKDNFV--MPR--QRAVTVERERQA-GKQRDAYYVT----------QRVNLQ 443 (458) T ss_pred c-eecceeeEEccccccccCCcceEEEEecccEE--EEE--eeceEEEeeccc-CCCceEEEEE----------EEecce Confidence 1 27899999999998752 1110 011110 111 122222222222 1333333211 113445 Q ss_pred HhhhhccceEEEEEecCC Q lcl|Aclame:pro 278 LMYPERIDGIVKVTLTPG 295 (296) Q Consensus 278 ~lfpE~~dgvv~~tI~~~ 295 (296) .++| +|+|+.|..+. T Consensus 444 v~~~---~a~v~~~~aa~ 458 (458) T protein:vir:10 444 RYFA---NGVVSGTYAAS 458 (458) T ss_pred Eecc---cceEEEeeccC Confidence 5566 69999999888 No 96 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.60 E-value=2e-08 Score=62.79 Aligned_cols=281 Identities=9% Similarity=0.015 Sum_probs=156.6 Q ss_pred Cccccccccccceehhhhhhhhhhh--hHHHHhhhHHHHHH----HhCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD--f~~~f~~~i~~L~~----~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |--..+-..-|+..-.- +..-+.| |.+.|+..+.+=++ .++..|...+..|+++++|.-.-+..+ ...+|+ T Consensus 1 ma~~~~~~~~~t~~g~~-~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~--~~~~G~ 77 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKG-MSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAA--YLQPGE 77 (347) T ss_pred CCccccccccccccccC-CcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEe--eeecCc Confidence 32111111111111000 1233556 99999998865443 455566667789999999975454443 677999 Q ss_pred eechh--heeeeecceeEEEEeec--cc-ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce--- Q lcl|Aclame:pro 75 VIPLS--KVERKIHSEKKIELKKY--RK-ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ--- 144 (296) Q Consensus 75 ~Ipls--kv~~~~~~t~~~tikK~--~K-~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~--- 144 (296) .|+-+ -+.. ...+++|.++ .. .|- ||+ | .-.|...|..+|...+++++.|.-++..|..+.... T Consensus 78 ~l~~~~~~~~~---~e~~ltID~~~y~~~~VddiD~~-q--~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:94 78 NLDDKRKDMKH---TEKTINIDGLLTADVLIYDIEDA-M--NHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTAN 151 (347) T ss_pred CCCCCcCCccc---cceEEEEcchhhhhhhhhhHHHH-h--cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 98654 2333 3467888764 33 353 344 3 345689999999999999999999987664321100 Q ss_pred ---------------------ecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccce----- Q lcl|Aclame:pro 145 ---------------------DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQT----- 196 (296) Q Consensus 145 ---------------------~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~----- 196 (296) ..+......++..++-++...+.+.+ ....+++|+|...+.+|+........ T Consensus 152 ~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~ 231 (347) T protein:vir:94 152 NENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALI 231 (347) T ss_pred ccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccc Confidence 00001123445566777777776543 23579999999999999754332211 Q ss_pred eechhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhh----hhhhcccc-ccccceEEEeccc-------- Q lcl|Aclame:pro 197 AFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL----AKEFNLYG-DPTGYIGMNHFQE-------- 263 (296) Q Consensus 197 ~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~----~~~f~~~t-d~tGliGv~h~~~-------- 263 (296) .+.-..+.+++|.+|+.|+.+|.+.......+- ++.. .+.. +..++-|. |-++-+|+.-.+. T Consensus 232 ~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~---~~~~--~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~ 306 (347) T protein:vir:94 232 DPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEE---GVAP--TNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLK 306 (347) T ss_pred ccccceeEEeeceEEEEcCccccccCccccccc---cccc--ccccccccccccccccccccceEEEEechhhhhhhhhc Confidence 111122445889999999999986643332221 1111 1110 11122221 3344444432111 Q ss_pred ------cceeehhhhhhHHHHhh---hhccceEEEEEecCC Q lcl|Aclame:pro 264 ------NTTLTIQTLLVSGMLMY---PERIDGIVKVTLTPG 295 (296) Q Consensus 264 ------~~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~ 295 (296) .+..-+++-+|=|-..| |=|+|+.++++.+++ T Consensus 307 ~~~~e~~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 307 DMALERARRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ccceeeeechhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 12233333333333333 557888999999999 No 97 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.60 E-value=6e-09 Score=65.65 Aligned_cols=275 Identities=11% Similarity=0.014 Sum_probs=141.0 Q ss_pred Ccccc-ccccc---cceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccC--cccCCc Q lcl|Aclame:pro 1 MVTSR-TYPEE---NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEG--NVPEGE 74 (296) Q Consensus 1 ~~~~~-~~ae~---nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~g--dVaEGe 74 (296) ++... ...|. ++ .+.+=+...--+|.++|-+-+..-.-+..+-+.+|+. ..+++|.+...+.+.. ..+||. T Consensus 130 ~l~~~~~~~e~~a~~~-~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~--~~~~~p~~~~~~~a~~~~~~~e~~ 206 (434) T protein:vir:62 130 YIVGNIDEKEARALGL-VTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK--ENIKYPVLVKKAEAQGHKNERTNN 206 (434) T ss_pred Hhccccchhhhhhhcc-cccccceecchhhHHHHHHhhhhhhhhhhhcceeccC--CceEEEEEecCCcccceecccccc Confidence 11100 00000 11 1112222223345555555555444455555666654 3588888766555542 246788 Q ss_pred eechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc---eecchh Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT---QDALGA 149 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t---~~~t~~ 149 (296) .+|.+..+.. ..+++.||++.-+ |.|.++.++++ -.+.-.++|+..+..+++..|+.--.++..+ ....+. T Consensus 207 ~~~~~~~~f~---~v~~~~~k~~~~~~iS~ell~ds~~~-l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~ 282 (434) T protein:vir:62 207 EMPETDIEFD---EIELSPTEFDALATVTKKLLARTGLP-IEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAV 282 (434) T ss_pred ccccccccee---eEEeeheeeEeehhhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccc Confidence 9999987764 4778889988865 99998766553 4677889999999999999988421111000 000011 Q ss_pred hHH---HHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-------chhhhhhhheeEEEEeccCCC Q lcl|Aclame:pro 150 GLQ---GALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-------GLTYLVDFTGTVIISTNDVTK 219 (296) Q Consensus 150 ~lQ---~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-------g~tyl~nfLG~~II~S~kV~~ 219 (296) .+. ....+.+.++............+.++||.+...+++-.+-.-+..| +++-- .++|..|+.+..+|. T Consensus 283 ~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~-tl~G~pV~~~~~~~~ 361 (434) T protein:vir:62 283 EFKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGY-TLLGFPVEEEDAIDI 361 (434) T ss_pred cccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCc-eecceeeEEecCccC Confidence 111 1112222233222332222456789999999866432111112222 12111 288999999999986 Q ss_pred ceE---EEEcccceEEEEecCcch--hhhhhhc--cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEe Q lcl|Aclame:pro 220 GEI---WATVPENIIFAYINPNNS--ELAKEFN--LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTL 292 (296) Q Consensus 220 G~~---~~t~~~Nl~~ay~~~~~g--~~~~~f~--~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI 292 (296) +.. ....-.+..-||+-.+.| ++..... +.+|++||.+.... -|.+++.+.-=.|+++++ T Consensus 362 ~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~-------------Dgk~i~~~~~~~~~~~~~ 428 (434) T protein:vir:62 362 PDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLL-------------DAQLIHSPFEVPVYKYVL 428 (434) T ss_pred ccCCCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeee-------------cceeecCcccceEEEEEe Confidence 542 112223333333321111 1111111 23355665554433 133345333334888999 Q ss_pred cCCC Q lcl|Aclame:pro 293 TPGV 296 (296) Q Consensus 293 ~~~v 296 (296) ++|. T Consensus 429 ~~~~ 432 (434) T protein:vir:62 429 KAPT 432 (434) T ss_pred ccCC Confidence 9999 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=98.59 E-value=3.8e-09 Score=66.72 Aligned_cols=250 Identities=16% Similarity=0.080 Sum_probs=128.3 Q ss_pred Ccc-------c-cccccccceehhhhhhhhhhhhHHHHh--hhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcc Q lcl|Aclame:pro 1 MVT-------S-RTYPEENLIKSTDLKYPITIDVTNKFQ--ENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNV 70 (296) Q Consensus 1 ~~~-------~-~~~ae~nl~~~~dl~~a~siDf~~~f~--~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdV 70 (296) .+. + .+....+....+++ .+++- .....|+.. .+.+|+..+ ..++|.....+...+.+ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~vp~~~--------~~~i~~~~~~~~l~~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~ 190 (397) T protein:vir:96 123 FVKSKGAEKRDGFTSVEGGALIPQEL--------LQPQLEPKDIVDLSKY---VRSVPVNSA-SGKFPVISKSGSKMATV 190 (397) T ss_pred HHHhhhhhhhhcccccccccchhHHH--------HHHHHHhhhhhhHHHh---hhhcccccc-ceeEEEEeccCCccccc Confidence 111 1 11112222222222 22211 122233333 334445443 35555544444444579 Q ss_pred cCCceec-hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 71 PEGEVIP-LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 71 aEGe~Ip-lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) +||+.+| .+..+.. ..+++++|++..+ |.|.++.+.++ -.+.-.++|+..++...+..++.-..+++.+...+ T Consensus 191 ~E~~~~~~~~~~~~~---~i~~~~~~~~~~~~~s~ell~ds~~~-l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~ 266 (397) T protein:vir:96 191 QQLEKNPQLANPKMV---EIDYSVATRRGYIPISQEMIDDASYD-VTGLIADEIQDQSLNTKNADIAAVLKTATAKSVVG 266 (397) T ss_pred ccccccccccccccc---ceeecHhHhhcchhhHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 9999998 4666553 4678888888765 89998766653 56677889999999999999998876666555555 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEeccCCCceE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTNDVTKGEI 222 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~kV~~G~~ 222 (296) .+.+..+++... . . .-..+.++||.+...+++=.+-.-+..| +++-. .++|..|+.++....|. T Consensus 267 ~d~~~~~~~~~~----~----~-~~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~-~l~G~pv~~~~~~~~~~- 335 (397) T protein:vir:96 267 VDGLKDLINKEI----K----K-VYDVKLFISASMYSELDKLKDKNGRYLLQDSITAASGK-QLLGKEVVVLDDDVIGK- 335 (397) T ss_pred hHHHHHHHHHhh----h----h-hcCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcc-cccccceEEecccccCC- Confidence 566655443221 1 1 1235899999998876652221112233 11111 27798887766544332 Q ss_pred EEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccce-------EEEEEecCC Q lcl|Aclame:pro 223 WATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDG-------IVKVTLTPG 295 (296) Q Consensus 223 ~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dg-------vv~~tI~~~ 295 (296) ..++..++|-| +++...++ |..| +-+.+. +..++++. +.-| +|.|| +++++++.+ T Consensus 336 ---~~~~~~~~~gd-----~~~~~~~~-~~~~-~~~~~~---~~~~~~~~----~~~~-~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 336 ---SVGNVVGFIGD-----AKAFASFF-DRKQ-VSVSWV---DNNIYGQL----LAGI-IRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ---CCCceEEEEee-----hhcceEeE-eecc-eEEEEe---ccccccee----EEEE-EEEccEEecccceEEEEeecC Confidence 12333344433 33322111 1111 111111 11222221 1111 34554 778888888 No 99 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.56 E-value=4.8e-09 Score=66.20 Aligned_cols=264 Identities=13% Similarity=0.094 Sum_probs=135.5 Q ss_pred Cccccc---cc-cc---cceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCC Q lcl|Aclame:pro 1 MVTSRT---YP-EE---NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~---~a-e~---nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEG 73 (296) |-+... .. |. +..+.++=+...--+|.+.+-+.+.+..-++++-+.+|+..| .+++|.......+ +-|+|| T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a-~~v~E~ 167 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTS-GWVGET 167 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcce-eeeccc Confidence 100000 00 00 011111112222234555565555555666777778888765 7888887555555 479999 Q ss_pred ceechhhe-eeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------ Q lcl|Aclame:pro 74 EVIPLSKV-ERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------ 144 (296) Q Consensus 74 e~Iplskv-~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------ 144 (296) +.+|-++. +. ...+++++|++.-+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.- ++++.. T Consensus 168 ~~~~~~~~~~f---~~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~~~a~l~G--~G~~~p~Gil~~ 241 (407) T protein:vir:48 168 DARPETATSKL---GLIEPFMGEIYGNPQATQKMLDDAFF-NVEDWINSELALEFAEQEEIAFTSG--DGSKKPKGFLAY 241 (407) T ss_pred ccccccccccc---eeEEeeeeeeEeehhhHHHHHhcchH-HHHHHHHHHHHHHHHHHHHhhhhcc--CCCCccceeeec Confidence 99996553 33 24788889999854 9999865553 5678889999999999999987741 111000 Q ss_pred -------------------ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec--hhhh Q lcl|Aclame:pro 145 -------------------DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG--LTYL 203 (296) Q Consensus 145 -------------------~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg--~tyl 203 (296) ......+ -++.+.++............+.++||.+...+++-.+-.-+..|. .+.. T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g 318 (407) T protein:vir:48 242 ESTDEDDKTRAFGKLQHIASGAASGV---TADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELG 318 (407) T ss_pred cccccccccccccccccccccccccc---ChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC Confidence 0000000 011222222222222234567889999988764322111122221 1111 Q ss_pred --hhhheeEEEEeccCCCceEEEEcccceEEEEecCcc-------hhhhhhhccc--cccccceEEEeccccceeehhhh Q lcl|Aclame:pro 204 --VDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNN-------SELAKEFNLY--GDPTGYIGMNHFQENTTLTIQTL 272 (296) Q Consensus 204 --~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~-------g~~~~~f~~~--td~tGliGv~h~~~~~~~t~et~ 272 (296) ..++|..|+.+..+|.. ..++-.++|.|.+. .++.=..+-| .|.++|.+..+ T Consensus 319 ~~~~l~G~PV~~~~~~p~~-----~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~~~r------------ 381 (407) T protein:vir:48 319 QPSSLAGYGIVENEQMPDI-----AADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKR------------ 381 (407) T ss_pred CCceecceeeEEecCcCCc-----cCCccEEEEEeccccEEEEEeeceEEEeeccccCCcEEEEEEEE------------ Confidence 13789999999999852 23333344444321 1111001111 12222222211 Q ss_pred hhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 273 LVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 273 ~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +.+.... .+++++.+++++. T Consensus 382 -~d~~v~~---~~a~~~l~~~aa~ 401 (407) T protein:vir:48 382 -TGGMLVD---SQAIKLMKIGAAT 401 (407) T ss_pred -eccEEec---ccceEEEEeeccC Confidence 1223333 3477888888777 No 100 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.56 E-value=6.6e-09 Score=65.45 Aligned_cols=279 Identities=13% Similarity=0.024 Sum_probs=137.0 Q ss_pred Cccccc--------ccccccee-hhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeeccc---- Q lcl|Aclame:pro 1 MVTSRT--------YPEENLIK-STDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAE---- 67 (296) Q Consensus 1 ~~~~~~--------~ae~nl~~-~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~---- 67 (296) |-+.+. ..+.+.+. ..++- --.+.++|-+.+.+-.-++..-+.+|+. +..+++|+++....|. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~li---P~~~~~~ii~~l~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLL---PKEIVGPIFDKAQESSLVLRMGEQIPIS-YGETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHhhhhcccccccCceecCCcccc---chhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCceeEeecC Confidence 222222 22222221 22221 2234566655555555566667788876 5667899987654432 Q ss_pred ---CcccCCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc Q lcl|Aclame:pro 68 ---GNVPEGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG 142 (296) Q Consensus 68 ---gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~ 142 (296) .-++||+.||.++.+.. ..++..+|++.-+ |.|.++ ...-+....-.++|+++|+++++..|+.-=...++ T Consensus 77 g~~~~~~e~~~~~~~~~~f~---~i~l~~~kl~~~~~is~ell~-~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~ 152 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWD---TRSVSPIKLATIVTVSEEFAR-MNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG 152 (333) T ss_pred ccccccccccccccccccee---EEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC Confidence 14566788898888775 4678889988865 999985 44456778899999999999999999842211100 Q ss_pred c-------------e---ecchhhHHHHHHHHHHHHHHhhccc-cCcceEEEEcHHHHHHHhcCCcc--------cccee Q lcl|Aclame:pro 143 T-------------Q---DALGAGLQGALASAWGKLQVLFEDY-GSERAIVFANSLDVAEYIAKAGI--------TTQTA 197 (296) Q Consensus 143 t-------------~---~~t~~~lQ~Ala~~~~~~~~~Fede-d~~~~VlFvNP~Daa~~l~~a~i--------~~q~~ 197 (296) + + ......-. .....+.++....... +....+.++||.+.+.+++-... -.... T Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~ 231 (333) T protein:vir:78 153 SALQGIDTDNVIANTTNVDYLQETGD-PLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRIN 231 (333) T ss_pred cccccccccccccccccccccccccc-hhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCcc Confidence 0 0 00000000 0112222333333222 22334788899998876543221 11112 Q ss_pred echhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcc------hhhhhhhccccccccce---EEEec-ccccee Q lcl|Aclame:pro 198 FGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNN------SELAKEFNLYGDPTGYI---GMNHF-QENTTL 267 (296) Q Consensus 198 fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~------g~~~~~f~~~td~tGli---Gv~h~-~~~~~~ 267 (296) .+++-. .++|..|+.|+.||.+....... +..+++.|.+. +++.=... ++..+. |..|+ ..++.. T Consensus 232 ~~~~~~-~l~G~Pv~~~~~i~~~~~~~~~~-~~~~~~gD~~~~~~g~~~~~~i~~~---~~~~~~~~~~~~~~~~~~~~v 306 (333) T protein:vir:78 232 LAAQTG-DVLGLPAQFGRAVGGDLGAAVDS-KTRIIGGDFSQLKFGFADEIRIKMS---DTATLTDSGSATVSMWQTNQI 306 (333) T ss_pred ccCCCc-eeeceeeEEccccCCCccccCCC-ccEEEEEecccEEEEEeeccEEEEe---ccccccccccceeehhhcCcE Confidence 222222 37899999999999764322221 22233333210 11100000 111110 00110 011111 Q ss_pred ehhhhhhHHHHhhhhccceEEEEEe-cCC Q lcl|Aclame:pro 268 TIQTLLVSGMLMYPERIDGIVKVTL-TPG 295 (296) Q Consensus 268 t~et~~~~~~~lfpE~~dgvv~~tI-~~~ 295 (296) .+-...-.+. =+...+++++++- ++| T Consensus 307 ~~r~~~r~d~--~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 307 AILIEVTFGW--LLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEEEEEEcc--EEecccceEEEeccCCC Confidence 1111111111 1245567777763 666 No 101 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.55 E-value=1.4e-08 Score=63.57 Aligned_cols=278 Identities=14% Similarity=0.048 Sum_probs=136.2 Q ss_pred Ccc-------ccccccc-cce-ehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-------- Q lcl|Aclame:pro 1 MVT-------SRTYPEE-NLI-KSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-------- 63 (296) Q Consensus 1 ~~~-------~~~~ae~-nl~-~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-------- 63 (296) |.+ +.....+ +++ ...+| .--.|++++-+.+.+-.-+++.-+.+||. |..+++|++... T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~l---iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDL---LPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHHhhhhhcccccccceecccccc---cchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccceeecc Confidence 222 1111111 211 22222 22345666666666666667777888985 668999986432 Q ss_pred ecccCcccCCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc Q lcl|Aclame:pro 64 TLAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT 141 (296) Q Consensus 64 g~A~gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat 141 (296) +.+ ..++||+++|.++.+.. ..+++.+|++.-+ |+|.++.+. -+..+.-.++|+.+++++++..|+.--.+++ T Consensus 77 ~~~-~~~~Eg~~~~~~~~~f~---~v~l~~~k~~~~~~is~ell~ds~-~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~ 151 (338) T protein:vir:78 77 GTS-NEQREGGTKPLSGTAWD---TRSVAPIKLATIVTVSEEFARMNP-SGLYTKLQADLAYAIGRGIDLAVFHGKSPLT 151 (338) T ss_pred ccc-cccccccccccccccee---EEEEEEEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHhhcccCCCc Confidence 333 36889999999998874 5788889998865 999985444 4566888899999999999999985322111 Q ss_pred c--------------cee--cchhhHHHHHHHHHHHHHHhhcc-ccCcceEEEEcHHHHHHHhcCCcc--------ccce Q lcl|Aclame:pro 142 G--------------TQD--ALGAGLQGALASAWGKLQVLFED-YGSERAIVFANSLDVAEYIAKAGI--------TTQT 196 (296) Q Consensus 142 ~--------------t~~--~t~~~lQ~Ala~~~~~~~~~Fed-ed~~~~VlFvNP~Daa~~l~~a~i--------~~q~ 196 (296) . .++ ....+.. ..+..+.++...+.. .+....+.++||.+.+.+++-..+ -... T Consensus 152 ~~~~~gi~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~ 230 (338) T protein:vir:78 152 GSALQGIDTNNVIVNTTNVDYLQTGTT-PLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRI 230 (338) T ss_pred cccccccccccccccccccccccccch-hhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeeccc Confidence 0 000 0111111 122333333333322 222345799999998876542221 1112 Q ss_pred eechhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcc------hhhhhhhccccccccceE------EEecccc Q lcl|Aclame:pro 197 AFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNN------SELAKEFNLYGDPTGYIG------MNHFQEN 264 (296) Q Consensus 197 ~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~------g~~~~~f~~~td~tGliG------v~h~~~~ 264 (296) .+++.-. .++|..|+.++.||....-. ...+-.+++.|.+. +++.= ...|+.++.. -.|+.-. T Consensus 231 ~~~~~~~-~l~G~PV~~~~~ip~~~~~~-~~~~~~~~~gdfs~~~~~~~~~~~i---~~~~~~~~~~~~~~~~~~~~~~~ 305 (338) T protein:vir:78 231 NLAASAG-DLLGLPVQFGKAVGGDLGAA-TDSKVRVVGGDFSQLKYGFADEIRV---KMSDTATLTDNTSPTPQTVSMWQ 305 (338) T ss_pred ccCCCCc-eeeeeeEEEccccCcccccc-CCcccEEEEEecceEEEEeecccEE---EEeecccccccccccccchhhhh Confidence 2222222 37899999999998542211 11122233333210 11100 0001111100 0010000 Q ss_pred ceee-hh-hhhhHHHHhhhhccceEEEEEe-cCCC Q lcl|Aclame:pro 265 TTLT-IQ-TLLVSGMLMYPERIDGIVKVTL-TPGV 296 (296) Q Consensus 265 ~~~t-~e-t~~~~~~~lfpE~~dgvv~~tI-~~~v 296 (296) ++.+ +- ..-+.+..+-| +++++++- ++|- T Consensus 306 ~~~~~~r~~~r~d~~v~~~---~a~~~l~~~~~~~ 337 (338) T protein:vir:78 306 TNQIAILIEVTFGWLLGDK---QAFVKFVDDEDPD 337 (338) T ss_pred cCcEEEEEEEEeccEeecc---cceEEEecccCCC Confidence 0000 00 01122333333 33444332 2233 No 102 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.54 E-value=7.8e-09 Score=65.03 Aligned_cols=268 Identities=11% Similarity=-0.013 Sum_probs=133.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |.|..+... .++ .+++ .+++-+.+.+-.-++..-+.+|+..| .+++|+++-...|. -|+||++||.++ T Consensus 1 Mat~tt~~g-~~v-P~~~--------~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~-wv~Eg~~~~~~~ 68 (311) T protein:vir:99 1 MATFGTGNL-KNL-PRNI--------ADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAE-FVGEGQQKSSTT 68 (311) T ss_pred CceecCCCc-eec-cHHH--------HHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeE-EeecCccccccc Confidence 987754432 232 2222 22222222222224445567787754 57999997777775 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCC--chhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-------------- Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSN--EAVTNTDNALVRQLQKKIRTDFVTALKTGTG-------------- 142 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGyg--dav~etd~QL~~~iq~kIdnD~~~aLktat~-------------- 142 (296) .+.. ..+++.||++..+ |.|-++.+... +=.++-.++|+.+|+++++.-|+.--.++++ T Consensus 69 ~~f~---~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~ 145 (311) T protein:vir:99 69 GEFD---FVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAAS 145 (311) T ss_pred ceee---EEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccccc Confidence 8864 5778888888765 99987533332 2367788999999999999999965432211 Q ss_pred -ceecchhhHHHHHHHHHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEe Q lcl|Aclame:pro 143 -TQDALGAGLQGALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIIST 214 (296) Q Consensus 143 -t~~~t~~~lQ~Ala~~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S 214 (296) ..+.+..+. ..+...+.++...+... +......++||.+...+++-.+-.-+..| ++.-. .++|.-++.| T Consensus 146 ~~~~~~~~~~-~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~-~l~G~Pv~~s 223 (311) T protein:vir:99 146 KRVELTADTI-ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVS-SFEGIDASVS 223 (311) T ss_pred ceeecccccc-chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCc-eecceeeEee Confidence 001111111 01112222233333322 11223489999999987653221112222 22111 3779999999 Q ss_pred ccCCCceEEEEcccceEEEEec---CcchhhhhhhccccccccceEEEeccccceeeh---h---hhhhHHHHh--hhhc Q lcl|Aclame:pro 215 NDVTKGEIWATVPENIIFAYIN---PNNSELAKEFNLYGDPTGYIGMNHFQENTTLTI---Q---TLLVSGMLM--YPER 283 (296) Q Consensus 215 ~kV~~G~~~~t~~~Nl~~ay~~---~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~---e---t~~~~~~~l--fpE~ 283 (296) +.++.+..+.... ..++..+ .--||+++... +|+..+..-+.+++ + .++-..++. +=+| T Consensus 224 ~~i~~~~~~~~~~--~~~~~~~~~~~~~Gdf~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r 293 (311) T protein:vir:99 224 DTVNGGDEADPDD--EDLDAARAVRGIVGDFANGIH--------WGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIV 293 (311) T ss_pred ccccccccccccc--chhhccCcceEEEeeccccEE--------EEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEe Confidence 9998766653221 1111111 00134433221 22222211111110 0 000011111 1223 Q ss_pred cce------EEEEEecCC Q lcl|Aclame:pro 284 IDG------IVKVTLTPG 295 (296) Q Consensus 284 ~dg------vv~~tI~~~ 295 (296) .|+ .|+++-..+ T Consensus 294 ~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 294 YGWYVFTDRFVVIENAVA 311 (311) T ss_pred ecceecChhHeeeecccC Confidence 333 223333333 No 103 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.54 E-value=2e-08 Score=62.84 Aligned_cols=271 Identities=13% Similarity=0.098 Sum_probs=133.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCc-ccccccCCCCeeeeeeeeeeecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGV-TRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgV-tr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ipls 79 (296) .+++.+...-..+..+.+ .++|-+.+....-++.+ .|.+|+..| .+++|++.....+. -|+||+.+|.+ T Consensus 131 ~~~~~~~~~gg~lvP~~~--------~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~p~~~~~~~a~-~v~E~~~~~~~ 200 (435) T protein:vir:80 131 SLNTLSPGAGGVLVPENL--------SSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLKGGAIVG-YIGADTDIPTT 200 (435) T ss_pred hhcccCCCCCccccchhH--------HHHHHHHHhhhchhhhccceeeecCCC-ceEEEEEeCCccee-eeccCcccccc Confidence 122222222222222222 22222222222222333 567888877 48999987666664 79999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCC-chhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-------------c Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSN-EAVTNTDNALVRQLQKKIRTDFVTALKTGTG-------------T 143 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGyg-dav~etd~QL~~~iq~kIdnD~~~aLktat~-------------t 143 (296) +.+.. ..++..+|++..+ |.|.++.++++ +..+.-.++|+.+++.+++..|+.--.++.. . T Consensus 201 ~~~f~---~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~ 277 (435) T protein:vir:80 201 QQQFD---DLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVI 277 (435) T ss_pred cccee---eEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeeccccccee Confidence 98865 4778889998865 99999888886 4567788999999999999999853211110 0 Q ss_pred eecchhhHHHHHHHHHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCccccceeec-hhhhhhhheeEEEEeccCCCc Q lcl|Aclame:pro 144 QDALGAGLQGALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITTQTAFG-LTYLVDFTGTVIISTNDVTKG 220 (296) Q Consensus 144 ~~~t~~~lQ~Ala~~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg-~tyl~nfLG~~II~S~kV~~G 220 (296) ...++.++.. +...+.++...+..- .....+.++||.+...+.+-.+-.-+..|. .+-. .++|..|+.++.+|.. T Consensus 278 ~~~~~~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~-~l~G~pv~~~~~~p~~ 355 (435) T protein:vir:80 278 TASDGSTLQK-IETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANG-MLKGYPVGKTTQVPIN 355 (435) T ss_pred ecccccchhh-HHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCCCCC-eEeeeeeEEecccccc Confidence 0112222222 222344444444332 223567889999997654422211122331 1111 2779999999999874 Q ss_pred eEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHH------HHhhhhccceEEE----- Q lcl|Aclame:pro 221 EIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSG------MLMYPERIDGIVK----- 289 (296) Q Consensus 221 ~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~------~~lfpE~~dgvv~----- 289 (296) ...+ .+.-.++|.| +++.+ + .|. |=|-+.......-........+. ....=++.|+-+. T Consensus 356 ~~~~--~~~~~i~~gd-----~s~~~-i-~~~-~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~ 425 (435) T protein:vir:80 356 LGEA--GKESEIYFTD-----FGDVF-I-GEE-ETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESI 425 (435) T ss_pred ccCC--CCcceEEEEE-----cccEE-E-Eee-cceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccce Confidence 3221 1122344443 22321 1 011 11112111111100111111000 0112233333221 Q ss_pred EEecCCC Q lcl|Aclame:pro 290 VTLTPGV 296 (296) Q Consensus 290 ~tI~~~v 296 (296) +.|+..- T Consensus 426 ~~l~~~~ 432 (435) T protein:vir:80 426 AVLSGVA 432 (435) T ss_pred EEEeccC Confidence 1111111 No 104 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.49 E-value=6.1e-09 Score=65.60 Aligned_cols=263 Identities=10% Similarity=0.047 Sum_probs=132.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCC-eeeeeeeeeeecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~-tIt~pk~~yig~A~gdVaEGe~Ipls 79 (296) ....+....-+..+..+=+...--++.+++-+.+..-.-++...+..|++.++ ++.+|+..-...+. .|+||+.+|.+ T Consensus 101 ~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~-~v~e~~~~~~~ 179 (404) T protein:vir:10 101 NLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMK-PLSENQQIPTN 179 (404) T ss_pred cchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCccee-ecccccccccc Confidence 00100000001111111122222345566655555555566667778876543 67788775555564 89999999997 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-----------ceec Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG-----------TQDA 146 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~-----------t~~~ 146 (296) ...-+. ...+++.+|++.-+ |.|.++.+.+ +-.+.-.++|+.+++.+++.-|+.-..++.. +... T Consensus 180 ~~~~~f-~~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~ 257 (404) T protein:vir:10 180 GDNGKL-ERFNFKLKDLADFMSIPNDLLKFADK-SLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITL 257 (404) T ss_pred ccccce-eeeEeeheeeEeeehhhHHHHhhcHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeec Confidence 533211 24778889999864 9999864443 4556678899999999999988854332211 0111 Q ss_pred -chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechh----hhhhhheeEEEE-eccCCCc Q lcl|Aclame:pro 147 -LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLT----YLVDFTGTVIIS-TNDVTKG 220 (296) Q Consensus 147 -t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~t----yl~nfLG~~II~-S~kV~~G 220 (296) +...+ ..+..++.. .+-.. .....+.++||.+.+.+++-.+-.-+..|.-. .-..++|..|+. +...+.+ T Consensus 258 ~~~~~~-~~~~~~~~~--~l~~~-~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~ 333 (404) T protein:vir:10 258 PKSPAL-KDFKKCKNV--ELLNV-FKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLS 333 (404) T ss_pred cccccH-HHHHHHHHh--hhhcc-ccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCC Confidence 11111 112111111 11122 22456789999998876653222222222111 111267876654 3334333 Q ss_pred eEEEEcccceEEEEecCcchhhhhhhcccc------------------ccccceEEEeccccceeehhhhhhHHHHhhhh Q lcl|Aclame:pro 221 EIWATVPENIIFAYINPNNSELAKEFNLYG------------------DPTGYIGMNHFQENTTLTIQTLLVSGMLMYPE 282 (296) Q Consensus 221 ~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~t------------------d~tGliGv~h~~~~~~~t~et~~~~~~~lfpE 282 (296) . .++..++|.|. ++.+.+.. |.+.+.+..+ +.+. +- T Consensus 334 ~-----~~~~~~~~gd~-----s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r-------------~d~~---v~ 387 (404) T protein:vir:10 334 T-----ESAIPVLLGDT-----KEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMR-------------IDGN---VK 387 (404) T ss_pred C-----CCccEEEEEec-----cccEEEEEecceEEEEeccccchhhcCceEEEEEEe-------------eccE---Ee Confidence 2 23334444443 22222111 2222221111 1222 34 Q ss_pred ccceEEEEEecCCC Q lcl|Aclame:pro 283 RIDGIVKVTLTPGV 296 (296) Q Consensus 283 ~~dgvv~~tI~~~v 296 (296) +.+++++++++.+. T Consensus 388 ~~~a~~~~~~~~aa 401 (404) T protein:vir:10 388 DSEALLIAEIPVES 401 (404) T ss_pred cccceEEEEeeccc Confidence 55788999998777 No 105 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.48 E-value=1.9e-07 Score=57.47 Aligned_cols=274 Identities=11% Similarity=0.062 Sum_probs=151.6 Q ss_pred Cccccccccccceehhhh----------hhhhhhhhHHHHhhhHHHHHH----HhCcccccccCCCCeeeeeeeeeeecc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDL----------KYPITIDVTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLA 66 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl----------~~a~siDf~~~f~~~i~~L~~----~LgVtr~~~~~pG~tIt~pk~~yig~A 66 (296) |- |++....| +.+.++ |++.|+..+.+-++ .++..|..++..|+++++|.-.-.... T Consensus 1 ~a--------~~~~~~~~~~~~g~~~~~~d~~al-~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~ 71 (347) T protein:vir:88 1 MA--------NATGGQQIGANQGKGQSAADKLAL-FLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGY 71 (347) T ss_pred CC--------CcccchhhhccCCCCccccchHHH-HHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeee Confidence 32 22222221 112334 88999988865554 466667778889999999864433332 Q ss_pred cCcccCCceechh--heeeeecceeEEEEeeccc---ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc Q lcl|Aclame:pro 67 EGNVPEGEVIPLS--KVERKIHSEKKIELKKYRK---ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT 139 (296) Q Consensus 67 ~gdVaEGe~Ipls--kv~~~~~~t~~~tikK~~K---~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt 139 (296) ...+|+.++-+ .+.. ...+++|.++.- .|. ||+ | --.|...|..++...++++.+|.-++..|.. T Consensus 72 --~~~~g~~l~~~~~~~~~---~~~~i~ID~~~y~~~~Vdd~D~~-q--~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~ 143 (347) T protein:vir:88 72 --YLAPGENLDDKRKDIKH---SEKVIQIDGLLTSDVLIYDIEDA-M--NHYDVRAEYSAQLGEALAIAADGAVLAEMAK 143 (347) T ss_pred --eeccccCCCCCCCCCcc---ceEEEEEechhhhhhhhhhHHHH-h--hcCCchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 46789887654 2332 347788877533 343 444 3 3346999999999999999999999876643 Q ss_pred Cccce-----------------------ecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCcccc Q lcl|Aclame:pro 140 GTGTQ-----------------------DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITT 194 (296) Q Consensus 140 at~t~-----------------------~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~ 194 (296) +.... .........+++.++-++...+.+-+ ....+++|+|...+.+|...+... T Consensus 144 ~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~ 223 (347) T protein:vir:88 144 LCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA 223 (347) T ss_pred hhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhh Confidence 21100 00112233445566666666665532 235789999999999987654321 Q ss_pred -----c-eeechhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhh---hhhhcccc-ccccceEEE-e--- Q lcl|Aclame:pro 195 -----Q-TAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL---AKEFNLYG-DPTGYIGMN-H--- 260 (296) Q Consensus 195 -----q-~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~---~~~f~~~t-d~tGliGv~-h--- 260 (296) + ....+ -+.+++|.+|+.|+.+|.|..-..+.. .+.++.+... +..+.-|. |.++-+|+. | T Consensus 224 ~~~~~~~~~~~G-~vg~i~G~~V~~s~nlp~~~~~~~~~~----~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a 298 (347) T protein:vir:88 224 ANYAALIDPETG-NIRNVMGFEVIEVPHLTVGGAGDNNPA----DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSA 298 (347) T ss_pred hhhccccchhcc-eeeeeccceEEEeeccccccccccccc----ccccccccccccccccccccccccCcEEEEEechhh Confidence 1 11111 123588999999999996543322211 1222211100 11222122 444444432 1 Q ss_pred -----ccc-----cceeehhhhhhHHHHhh---hhccceEEEEEecCCC Q lcl|Aclame:pro 261 -----FQE-----NTTLTIQTLLVSGMLMY---PERIDGIVKVTLTPGV 296 (296) Q Consensus 261 -----~~~-----~~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~v 296 (296) .++ .+.-.++.-++=|...| +=|+|+.+.+..+++- T Consensus 299 ~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 299 VGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhheecccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 111 12222333333333333 4577888888887777 No 106 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.48 E-value=1.4e-08 Score=63.69 Aligned_cols=254 Identities=8% Similarity=0.014 Sum_probs=133.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) +.+..++.--.|+-. ++ .+=+|++..... ..+.-||+. .+|+..|+ +++|+.+-...+ +-|+||+++|.++ T Consensus 357 ~~~~t~~~gg~lvp~-~~---~~~~iie~lr~~--s~i~~l~~~-~~~~~~g~-~~ip~~~~~~~a-~wv~E~~~~~~s~ 427 (632) T protein:vir:96 357 LEKKTAGKGGELVAT-EL---LSEEFIDILRNK--AIIGQMGAR-MLPGLVGD-VDIPKKTSGANF-YWIGEDEDVQDSD 427 (632) T ss_pred hhccccccccccccc-cc---chHHHHHHHhhc--chhhhhcce-EeecCCcc-eEEEEEeCCcee-EeecCCccccccc Confidence 222211111122222 11 122444443321 122334544 57888875 889998765555 4799999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc-----------eecc Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT-----------QDAL 147 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t-----------~~~t 147 (296) ++.. ..+++.||++.-+ |.|.+..+++ +....-.+.|+.+++.++|..|+.--.++.+. ...+ T Consensus 428 ~~f~---~i~l~~~k~~~~v~iS~ell~ds~~-~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~ 503 (632) T protein:vir:96 428 FDFT---TLSFSPKTIAGAVPVTRKLRKQSSI-HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP 503 (632) T ss_pred ccee---eEEeeeeEEEEehhhHHHHHhccch-HHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecc Confidence 8764 5788889998865 9999865655 46677788999999999999998543221111 0001 Q ss_pred hhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccc---cceeechhhhhhhheeEEEEeccCCCceE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGIT---TQTAFGLTYLVDFTGTVIISTNDVTKGEI 222 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~---~q~~fg~tyl~nfLG~~II~S~kV~~G~~ 222 (296) ..++ -+..+.++...+.... ....+.++||...+.+.. +.+. -+..+... -++|.-++.|+.+|.|++ T Consensus 504 ~~~~---~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~-~~l~d~~G~~i~~~~---~l~G~pv~~s~~ip~~~~ 576 (632) T protein:vir:96 504 AGGV---DWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKK-AQVFDNTGERIWQNN---EVNGYRAEASNQIPADTW 576 (632) T ss_pred cccC---CHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHH-HhccCCCCceeecCC---eecccceEeccccccCcE Confidence 1111 0122333333343321 234577899987654433 2221 12223211 256889999999999998 Q ss_pred EEEcccceEEEEecCcchhhhhhhcc----ccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecC Q lcl|Aclame:pro 223 WATVPENIIFAYINPNNSELAKEFNL----YGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTP 294 (296) Q Consensus 223 ~~t~~~Nl~~ay~~~~~g~~~~~f~~----~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~ 294 (296) ++..-..+.+ .+. |++.=..+- ..|.+.|....+. ......| ++++.....+ T Consensus 577 ~~gd~s~~~i--~~~--~~~~i~~~~~~~~~~~~v~~~~~~~~-------------d~~v~~~---~af~~~k~~A 632 (632) T protein:vir:96 577 IFGDWSQIVI--AMW--GVLDLKVDPYTKAASDGLVLRVFQDV-------------DAGVRRK---EAFCIAKKGA 632 (632) T ss_pred EEeecceEEE--EEe--cceEEEEccccccccCceEEEEEeec-------------Cceeech---hhhhheeecC Confidence 8655444321 121 222211111 1233333333321 1122222 3344444444 No 107 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.48 E-value=1.8e-07 Score=57.55 Aligned_cols=280 Identities=10% Similarity=0.036 Sum_probs=149.6 Q ss_pred Cccccccccccceehh--hhhhhhhhhhHHHHhhhHHHH----HHHhCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKST--DLKYPITIDVTNKFQENISKL----LEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~--dl~~a~siDf~~~f~~~i~~L----~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |--+ +-.--|+..-. +=+.+.+ =|++.|...+.+- ...++..|..++.-|+++++|.-.-.... +...|+ T Consensus 1 m~~~-~~~~~~t~~g~~~~~~d~~a-l~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~--~~t~G~ 76 (347) T protein:vir:94 1 MANV-PGQKIGTDQGKGKSSSDALA-LFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGV--YLAPGE 76 (347) T ss_pred CCCC-CccccccccccCCccccHHH-HHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeee--eecCCC Confidence 2111 11111110000 0000111 1567777666554 34567788888899999999886555433 677899 Q ss_pred eechhheeeeecceeEEEEeecc--c-cc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC---cc---- Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKKYR--K-AT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG---TG---- 142 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK~~--K-~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta---t~---- 142 (296) .|+-+.-..+ ....+++|.++. . .| =||+ |. -.|...|..+|...+|++.+|..++..+... +. T Consensus 77 ~l~~~~~~~~-~~e~~itID~~~~~~~~VddiD~~-q~--~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~ 152 (347) T protein:vir:94 77 RLSDKRKGIK-HTEKVITIDGLLTADVMIFDIEDA-MN--HYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNE 152 (347) T ss_pred CcCCCCCCCC-cceEEEEecchhhhhHHhhhHHHH-hc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 9977643222 234568887764 2 34 2455 43 3459999999999999999999998766320 00 Q ss_pred --------c--------eecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccce------ee Q lcl|Aclame:pro 143 --------T--------QDALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQT------AF 198 (296) Q Consensus 143 --------t--------~~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~------~f 198 (296) . ...+......++..++-++...+++-+ ....+++|+|...+.+|.+..+.... .- T Consensus 153 ~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 153 NIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPE 232 (347) T ss_pred ccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccccccc Confidence 0 001112233455566767777776532 23468999999999888776543211 11 Q ss_pred chhhhhhhheeEEEEeccCCCce--------EEEEcccceEEEEecCcchhhhhhhccccccccceEEE-ecc------- Q lcl|Aclame:pro 199 GLTYLVDFTGTVIISTNDVTKGE--------IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMN-HFQ------- 262 (296) Q Consensus 199 g~tyl~nfLG~~II~S~kV~~G~--------~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~-h~~------- 262 (296) .| -+.+++|++|+.|+.+|.+. .+.+.+++-+.+--+. +++ ...|-+..+|+. |-. T Consensus 233 ~G-~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~-~~~------~~~~~~~~~~l~~h~~A~~~v~~ 304 (347) T protein:vir:94 233 TG-NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATA-SSD------VKVTMDNVVGLFSHRSAVGTVKL 304 (347) T ss_pred cc-ceEEEeceEEEecCcccccccccccccCcceecCcccccccccc-hhh------hcccccceeEEEeehhhhhhhhc Confidence 11 12358899999999998642 3333333322111111 011 222444445543 211 Q ss_pred ------ccceeehhhhhhHHHHhh---hhccceEEEEEecCCC Q lcl|Aclame:pro 263 ------ENTTLTIQTLLVSGMLMY---PERIDGIVKVTLTPGV 296 (296) Q Consensus 263 ------~~~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~v 296 (296) ..++--++.-++=|...| +=|+|+.++++.+++- T Consensus 305 ~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 305 RDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred ccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 112222223333333322 3356677777666555 No 108 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.46 E-value=1.4e-07 Score=58.20 Aligned_cols=282 Identities=11% Similarity=0.035 Sum_probs=161.2 Q ss_pred Cccccccccccceehhhhhhhhhhh--hHHHHhhhHHHHHHH----hCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLLEM----LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD--f~~~f~~~i~~L~~~----LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |-...|.+..|+.+-..-..+-+-+ |.+.|+..+.+=++. ++..|..++.-|+++++|.-.-+..+ ....|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~--~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAA--YLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEE--eeecCC Confidence 7777777777777655433222211 889998888665553 55566667888999999865444433 566899 Q ss_pred eechhh--eeeeecceeEEEEee--ccc-ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc---- Q lcl|Aclame:pro 75 VIPLSK--VERKIHSEKKIELKK--YRK-ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT---- 143 (296) Q Consensus 75 ~Iplsk--v~~~~~~t~~~tikK--~~K-~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t---- 143 (296) +++-+. +.. ...+++|.+ |.. .|- ||+ | .-.|...|..+|...++++.+|.-++..|..+... T Consensus 79 ~l~~t~~~~~~---~e~~l~ID~~~y~~~~VdDiD~~-q--~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~ 152 (344) T protein:vir:10 79 NLDDIRKDIKH---TEKVITIDGLLTADVLIYDIEDA-M--NHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQY 152 (344) T ss_pred CCCCCCCCccc---ceEEEEEcchhhhhhhhhhHHHH-h--cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 987653 333 246788866 433 343 444 3 34569999999999999999999998777421110 Q ss_pred --------------ee------cchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccceeech- Q lcl|Aclame:pro 144 --------------QD------ALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGL- 200 (296) Q Consensus 144 --------------~~------~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~- 200 (296) .+ ........+++.++-++...+.+-+ ....+++|+|...+-+|.+..+.... +++ T Consensus 153 ~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~-~~~~ 231 (344) T protein:vir:10 153 NENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAAN-YAAL 231 (344) T ss_pred ccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccc-cccc Confidence 00 0112233455666666666665532 23468899999999888776553321 222 Q ss_pred -----hhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEE-ec------------- Q lcl|Aclame:pro 201 -----TYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMN-HF------------- 261 (296) Q Consensus 201 -----tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~-h~------------- 261 (296) ..+.++.|.+|+.|+.+|.|..-...+.+----|+-+ .....+...|.+-.+|+. |. T Consensus 232 ~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~----~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~ 307 (344) T protein:vir:10 232 IDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFP----ATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLAL 307 (344) T ss_pred cceeeeEEEEEeceEEEeccccccccCCcccccccCcccccc----CCcccceeeecceeEEEeechhhhhhhhhcccee Confidence 2233478999999999997633211111111111111 111122222333333322 11 Q ss_pred cccceeehhhhhhHHHHhh---hhccceEEEEEecCC Q lcl|Aclame:pro 262 QENTTLTIQTLLVSGMLMY---PERIDGIVKVTLTPG 295 (296) Q Consensus 262 ~~~~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~ 295 (296) ...+.--++.-++=|...| +=|+++...+.++.- T Consensus 308 e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 308 ERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred ecccchhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 1112333444444444444 446676766666555 No 109 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.45 E-value=1.2e-08 Score=63.97 Aligned_cols=259 Identities=13% Similarity=0.085 Sum_probs=130.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) ..++.+.+.-+.+.. .+|.+.+-+.+....-++..-+.+|+. |..++.|.......+ +-|+||+.+|-+. T Consensus 106 a~~~~~~~~GG~~iP--------~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~a-~wv~E~~~~~~~~ 175 (401) T protein:vir:44 106 ALQVGTDEDGGYAVP--------EELDRSILSLLKDEVVMRQEATVITVG-GSDYKKLVNLGGTAS-GWVGETDTRSQTA 175 (401) T ss_pred HhhcCCCCCCceecc--------HhHHHHHHHHHHhhhhhhhhceeeecC-CCceEEEEecCCccc-eeeccccccCccc Confidence 111112122222222 334444444444444455556667775 456778876554445 4799999999644 Q ss_pred -eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc-------------- Q lcl|Aclame:pro 81 -VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT-------------- 143 (296) Q Consensus 81 -v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t-------------- 143 (296) .+.. ..+++++|++.-+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.- ++++. T Consensus 176 ~~~~~---~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~ai~~~~~~~~l~G--~G~~~p~Gil~~~~~~~~~ 249 (401) T protein:vir:44 176 TSRLG---LIEPFMGEIYGNPQATQKMLDDAFF-NVEAWINSELATEFAEQEEIAFTTG--DGTKKPKGFLAYESTEESD 249 (401) T ss_pred cccce---eeeeehhheeeehhhhHHHHhcchH-HHHHHHHHHHHHHHHHHHHhhhhcc--CCCCccceeeccccccccc Confidence 3432 4678889998865 9999864433 5567888999999999999998842 11100 Q ss_pred -----------eecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeech--hh--hhhhhe Q lcl|Aclame:pro 144 -----------QDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGL--TY--LVDFTG 208 (296) Q Consensus 144 -----------~~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~--ty--l~nfLG 208 (296) .+.....+ -++.+.++............+.++||.+...+++-.+-.-+..|-- +. ...++| T Consensus 250 ~~~~~~~~~~~~t~~~~~~---~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G 326 (401) T protein:vir:44 250 KARAFGKLQHIVSGEATAV---TADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAG 326 (401) T ss_pred ccccccccccccccccccc---CHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecc Confidence 00000000 0222223333333333345689999999887643222111222311 11 113889 Q ss_pred eEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccc----cceeehhh-hhhHHHHhhhhc Q lcl|Aclame:pro 209 TVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQE----NTTLTIQT-LLVSGMLMYPER 283 (296) Q Consensus 209 ~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~----~~~~t~et-~~~~~~~lfpE~ 283 (296) ..|+.++.+|.. ..++-.++|.| +++++.+. |..| +....+ .+...+-. .-+.+..+. T Consensus 327 ~PVv~~~~~p~~-----~~~~~~i~~Gd-----~~~~~~i~-~~~~---~~~~~~~~~~~~~v~~~a~~r~d~~~~~--- 389 (401) T protein:vir:44 327 YGIAENEQMPDI-----AADAKAIAFGN-----FKRGYTIV-DRIG---TRILRDPYTNKPFVGFYTTKRTGGMLVD--- 389 (401) T ss_pred eeeEEecCcCCc-----cCCccEEEEee-----hhccEEEE-Eecc---eEEeeeccccCCcEEEEEEEEeccEEec--- Confidence 999999999852 22333344433 33333221 1111 111110 00000000 012222222 Q ss_pred cceEEEEEecCC Q lcl|Aclame:pro 284 IDGIVKVTLTPG 295 (296) Q Consensus 284 ~dgvv~~tI~~~ 295 (296) .++++..++.++ T Consensus 390 ~~a~~~l~~~aa 401 (401) T protein:vir:44 390 SQAIKLLKIAAA 401 (401) T ss_pred ccceEEEEeecC Confidence 346677888777 No 110 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.45 E-value=7.4e-08 Score=59.68 Aligned_cols=283 Identities=11% Similarity=0.039 Sum_probs=147.7 Q ss_pred Cccccccccccceehhhhh-hhhhhh--hHHHHhhhHHHHHHH----hCcccccccCCCCeeeeeeeeeeecccCcccCC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLK-YPITID--VTNKFQENISKLLEM----LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~-~a~siD--f~~~f~~~i~~L~~~----LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEG 73 (296) |-...+=..-| +....+ ..-+.| |+++|+..+.+-++. ++..|..+...|+++++|+-.-... . +...| T Consensus 1 ma~~~~~~~~~--t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~-~-~~~~g 76 (347) T protein:vir:15 1 MANIQGGQQIG--TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKA-A-YLKPG 76 (347) T ss_pred CCccccCCccc--cccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceee-e-eeccC Confidence 54443322112 222222 122334 889999888877765 4455555678899999988654433 2 66789 Q ss_pred ceechhheeeeecceeEEEEeeccc---ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc------ Q lcl|Aclame:pro 74 EVIPLSKVERKIHSEKKIELKKYRK---ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------ 142 (296) Q Consensus 74 e~Iplskv~~~~~~t~~~tikK~~K---~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~------ 142 (296) +.|+.+--..+ ....+++|.++.- .|- ||+ | --.|...+..++...++++++|..++..|..+.. T Consensus 77 ~~l~~~~~~~~-~~e~~ltID~~~~~~~~VddlD~~-q--~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~ 152 (347) T protein:vir:15 77 ENLDDKRKDIK-HTEKVIHIDGLLTADVLIYDIEDA-M--NHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASN 152 (347) T ss_pred CCCCCCCCCCc-cceEEEEechhhhhhHHhhhHHHH-h--cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 99987642222 2346677765433 353 333 3 4456999999999999999999999987743200 Q ss_pred ---------c-----e-----ecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccc-----e Q lcl|Aclame:pro 143 ---------T-----Q-----DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQ-----T 196 (296) Q Consensus 143 ---------t-----~-----~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q-----~ 196 (296) . . ..+......++..++-++...+++-+ ....+++|+|.-.+.+|++.++... . T Consensus 153 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~ 232 (347) T protein:vir:15 153 ENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALI 232 (347) T ss_pred ccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccc Confidence 0 0 00111223344555555555555432 2457899999999999988765421 1 Q ss_pred eechhhhhhhheeEEEEeccCCCceEEEEcccceE-EEEecCcchhhhhhhccccccccce------EEEecc-----cc Q lcl|Aclame:pro 197 AFGLTYLVDFTGTVIISTNDVTKGEIWATVPENII-FAYINPNNSELAKEFNLYGDPTGYI------GMNHFQ-----EN 264 (296) Q Consensus 197 ~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~-~ay~~~~~g~~~~~f~~~td~tGli------Gv~h~~-----~~ 264 (296) ...-..+.+++|++|+.|+.+|.+..--....|.- -.|+....+.....+. +++.-||+ |....+ .. T Consensus 233 ~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~-f~~~~~l~~h~~A~g~v~~~~~~~e~~ 311 (347) T protein:vir:15 233 DHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVA-LDNVVGLFQHRSAVGTVKLKDLALERA 311 (347) T ss_pred cccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeec-cccceeeeeccceeeeeEeeceeeeec Confidence 11112333588999999999997643222222211 0111111111111111 11112222 111100 11 Q ss_pred ceeehhhh------hhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 265 TTLTIQTL------LVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 265 ~~~t~et~------~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +...++.- ++.+-.+ |+|+++... -+.| T Consensus 312 ~~~~~~~d~i~~~~~~G~~vl---rP~~av~~~-~~~~ 345 (347) T protein:vir:15 312 RRANYQADQIIAKYAMGHGGL---RPEAAGAIV-LPKV 345 (347) T ss_pred ccchhhhhhhehhhhcCCcee---ccccEEEEe-cCCC Confidence 22233332 3333333 344555552 2333 No 111 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.45 E-value=6e-08 Score=60.18 Aligned_cols=277 Identities=11% Similarity=0.047 Sum_probs=150.3 Q ss_pred cccccccceehhhhhhhhhhh--hHHHHhhhHHHHH----HHhCcccccccCCCCeeeeeeeeeeecccCcccCCceech Q lcl|Aclame:pro 5 RTYPEENLIKSTDLKYPITID--VTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPL 78 (296) Q Consensus 5 ~~~ae~nl~~~~dl~~a~siD--f~~~f~~~i~~L~----~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ipl 78 (296) -+++-.|..+-...+-+-+.+ |.+.|+..+.+=+ ..++..|...+.-|+++++|.=.-... . ....|+.|+. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~-~-~~~~g~~l~~ 78 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTI-A-GRKAGEELVV 78 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceee-e-eecCCCCCCC Confidence 233444444443343333322 6688888775444 456777777888999999986433333 2 5678999998 Q ss_pred hheeeeecceeEEEEee--ccc-ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce--------- Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKK--YRK-ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ--------- 144 (296) Q Consensus 79 skv~~~~~~t~~~tikK--~~K-~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~--------- 144 (296) +.+... ..+++|.. |.. .|- ||+ | .-.|.-.|..+|+..+++++.|..++..|..+.... T Consensus 79 ~~~~~~---~~~l~ID~~l~~~~~VddiD~~-q--~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~ 152 (334) T protein:vir:80 79 QKNVSD---KLNLTVDTVLYARHFFDKFDEW-T--SNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAF 152 (334) T ss_pred CCcccC---ceEEEEeeeeehhhhHhhHHHH-h--cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 887764 46777754 433 343 555 4 334599999999999999999999986664332110 Q ss_pred --------------ecchhhHHHHHHHHHHHHHHhhccc--c---CcceEEEEcHHHHHHHhcCCccccc--------ee Q lcl|Aclame:pro 145 --------------DALGAGLQGALASAWGKLQVLFEDY--G---SERAIVFANSLDVAEYIAKAGITTQ--------TA 197 (296) Q Consensus 145 --------------~~t~~~lQ~Ala~~~~~~~~~Fede--d---~~~~VlFvNP~Daa~~l~~a~i~~q--------~~ 197 (296) ......-+ ++..+|-.+...+.+- . ....+++|+|...+.+|.+.++... .. T Consensus 153 ~~G~~~~~~~~g~~~~~~~~~~-~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~ 231 (334) T protein:vir:80 153 HDGILLPSTISGLAADAAADAD-VLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNS 231 (334) T ss_pred cCCcceeecccccccchhhhHH-HHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccccc Confidence 00001111 2223333333333321 1 2357999999999999998765321 11 Q ss_pred echhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccc-eEEEeccc---cceeehhhhh Q lcl|Aclame:pro 198 FGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGY-IGMNHFQE---NTTLTIQTLL 273 (296) Q Consensus 198 fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGl-iGv~h~~~---~~~~t~et~~ 273 (296) +...-+.+++|++|+.|+.+|.+-........- | ++--||+++...+..-..=+ .+-.++.. .++--..+.+ T Consensus 232 ~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~---~-~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~ 307 (334) T protein:vir:80 232 FVGGRIAMLNGVRVVETPRFPQSAITANALGAD---F-NVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHY 307 (334) T ss_pred ccceeEEEEeceEEEeecCCCCccccccccccc---c-ccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHH Confidence 222223458899999999999774221111100 0 01123333332222110000 11111110 1111233333 Q ss_pred hH------HHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 274 VS------GMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 274 ~~------~~~lfpE~~dgvv~~tI~~~ 295 (296) +- +-.+=||-. ++++.|++-| T Consensus 308 i~~~~a~G~g~lRPeaa-~vv~~~~~~~ 334 (334) T protein:vir:80 308 LDTFQSYNIGQRRPDAV-AVHDITVTNP 334 (334) T ss_pred HHHHHHcCCceeccceE-EEEEEeeecC Confidence 33 334446543 6788888888 No 112 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.42 E-value=3.3e-08 Score=61.58 Aligned_cols=270 Identities=14% Similarity=0.096 Sum_probs=122.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHH--HHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceech Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISK--LLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~--L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ipl 78 (296) +..+.++..-..+..+. +.++|-+-+.. .+..|| .|.+|+..|+ +++|+++-...+ +-|+||+.+|. T Consensus 64 ~a~~~~~~~Gg~lvP~~--------~~~~ii~~l~~~s~l~~lg-~~~v~~~~g~-~~~p~~t~~~~a-~wv~E~~~~~~ 132 (366) T protein:vir:57 64 MAISTAAGSGGALIPQN--------MQNEVIELLRDRTVVRILG-ARSIPLPNGN-LSMPRLSGGATA-GYVGEGKDVVA 132 (366) T ss_pred hhccccccCCccccchh--------HHHHHHHHHhhhcchhhhc-eeeeecCCCc-eEEEEEeCCcce-eeeccCccccc Confidence 22222221112111222 22333222221 122333 5567887774 999999765566 47999999999 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc-------------c Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG-------------T 143 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~-------------t 143 (296) ++.+.. ..+++.+|++.-+ |.|.++.+.. +-...-.++|+.+++.++|..|+.--.++.+ . T Consensus 133 s~~~f~---~i~~~~~k~~~~~~iS~ell~ds~~-~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~ 208 (366) T protein:vir:57 133 TGATFD---DVKLSAKTMIALVPVSNQLIGRAGF-NVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRL 208 (366) T ss_pred ccccee---EEEEeeEEEEEeehhhHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccce Confidence 998764 5778889988764 9999866654 3456778999999999999988853211100 0 Q ss_pred eec--chhhHHHHHHHHHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCccccceeec-hhhhhhhheeEEEEeccCC Q lcl|Aclame:pro 144 QDA--LGAGLQGALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITTQTAFG-LTYLVDFTGTVIISTNDVT 218 (296) Q Consensus 144 ~~~--t~~~lQ~Ala~~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg-~tyl~nfLG~~II~S~kV~ 218 (296) ... +..++.... .....+...+.+. .......++||.....+++-.+-.-+..|. +.=. -++|..|+.|+.+| T Consensus 209 ~~~~~t~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g-~l~G~Pvv~s~~ip 286 (366) T protein:vir:57 209 VAWTGTAINLTTID-EYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQG-ILKGYPIQRTSAIP 286 (366) T ss_pred eeccccccchhhHH-HHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCCCCC-eecceeeEEccccc Confidence 011 111111111 1111222223322 123457789999988765432211223331 1111 26799999999998 Q ss_pred CceEEEEcccceEEEEecCcc------hhhhhhhc---cccccccceEEEecc-ccceeehhhhhhHHHHhhhhccceEE Q lcl|Aclame:pro 219 KGEIWATVPENIIFAYINPNN------SELAKEFN---LYGDPTGYIGMNHFQ-ENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) Q Consensus 219 ~G~~~~t~~~Nl~~ay~~~~~------g~~~~~f~---~~td~tGliGv~h~~-~~~~~t~et~~~~~~~lfpE~~dgvv 288 (296) ... .+..+.-.++|.|.+. +++.=... -+.|..|-+ |+. ..+..-+-.....++. +-+..+++ T Consensus 287 ~~~--~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~---~~~f~~~~~~iR~~~~~d~~--v~~~~a~~ 359 (366) T protein:vir:57 287 ANL--GDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQL---VSAFARNQSLIRVVTEHDIG--FRHPEGLV 359 (366) T ss_pred ccc--ccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccc---hhhhhcCceeEEeeeeeCcE--eeccccEE Confidence 742 1122233344444320 11110000 011111110 000 0000000000000000 00111111 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) .. +... T Consensus 360 ~l--t~~~ 365 (366) T protein:vir:57 360 LG--TGVI 365 (366) T ss_pred EE--eccc Confidence 11 1111 No 113 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.41 E-value=6.8e-08 Score=59.88 Aligned_cols=266 Identities=8% Similarity=-0.022 Sum_probs=129.3 Q ss_pred Cccccccccccceehhhhhhhhhhhh-HHHHhhhHHHHHHHhCcccccccCCC-CeeeeeeeeeeecccCcccCCce--- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDV-TNKFQENISKLLEMLGVTRKISVSEG-MTLKTYAGYDVTLAEGNVPEGEV--- 75 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf-~~~f~~~i~~L~~~LgVtr~~~~~pG-~tIt~pk~~yig~A~gdVaEGe~--- 75 (296) ..++.+.+--.++-. +| .++|-+-+..-.-++...+.+||+.+ ..+++|+-......-.-++||.. T Consensus 156 ~~~~~~~~gg~lv~~---------~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~ 226 (477) T protein:vir:84 156 DLDRNGGTGGYAVPP---------LWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTA 226 (477) T ss_pred cccccCCCcceeecc---------chhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCccccc Confidence 111111111112211 11 12232222222223344555666543 36888875332222225788864 Q ss_pred --echhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------- Q lcl|Aclame:pro 76 --IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------- 141 (296) Q Consensus 76 --Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------- 141 (296) +|.++++.. ..+++.||++.-+ |.|.++.+.+ +-.+.-.++|+.+|+.+++..|+.-=.++. T Consensus 227 ~~~~~s~~~f~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~ 302 (477) T protein:vir:84 227 PSAHEVDLTDG---FVQANVKTIAGQQGIAIQLLDQAAV-SVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAG 302 (477) T ss_pred cccccccccee---eEEEeeeeEEeeeHHHHHHHhccch-hHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccc Confidence 567776653 4778889988765 9999977776 467888999999999999998884211110 Q ss_pred -cceecc--hhhHHHHHHHHHHHHHHhhccc----cCcceEEEEcHHHHHHHhcCCccccceee---------------- Q lcl|Aclame:pro 142 -GTQDAL--GAGLQGALASAWGKLQVLFEDY----GSERAIVFANSLDVAEYIAKAGITTQTAF---------------- 198 (296) Q Consensus 142 -~t~~~t--~~~lQ~Ala~~~~~~~~~Fede----d~~~~VlFvNP~Daa~~l~~a~i~~q~~f---------------- 198 (296) +..+.+ +.++.. +.....++.+....- .....+.++||.+.+.+++-.+-.-+..| T Consensus 303 ~~~~~~~~~~~t~~~-~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~ 381 (477) T protein:vir:84 303 ITQVTATSAGSALEK-HQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEV 381 (477) T ss_pred cccccccccccchhh-HHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCccccccccccccc Confidence 000111 111111 111122222222221 12234789999998866542221111111 Q ss_pred --chhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccce------eehh Q lcl|Aclame:pro 199 --GLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTT------LTIQ 270 (296) Q Consensus 199 --g~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~------~t~e 270 (296) .+... .++|..|+.|+.+|.+...... --.+++.| ++..+ ..+-| +.......+ .-+. T Consensus 382 ~~~~~~~-~l~G~pVv~s~~~p~~~~~~~d--~~~i~~gd-----~~~~~---i~~~~---~~~~~~~~~~~~~~~~~~~ 447 (477) T protein:vir:84 382 ASQRVVG-QMHGLPVVTDPTLPTTLGTGTD--QDVIHVLR-----ASDLA---LFESS---VRMRALQETRAENLSVLLQ 447 (477) T ss_pred ccccccc-hhcccceEecCcccccccccCC--cceEEEEE-----eceEE---EEeec---eeEEeccccccccceeeee Confidence 01111 2679999999999987544322 22334434 22221 11222 111111111 1111 Q ss_pred hhhhHHHHhhhhc-cceEEEEEecCCC Q lcl|Aclame:pro 271 TLLVSGMLMYPER-IDGIVKVTLTPGV 296 (296) Q Consensus 271 t~~~~~~~lfpE~-~dgvv~~tI~~~v 296 (296) ...+.+. -|+| ...+|.+|+++.- T Consensus 448 v~~~~~~--~~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 448 VYGYLAF--TAARFPQSVVEIGGTALT 472 (477) T ss_pred ehhhhhh--hhhccccceEEeeccccc Confidence 1112221 2666 7889999996555 No 114 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.41 E-value=1.4e-08 Score=63.56 Aligned_cols=255 Identities=11% Similarity=0.046 Sum_probs=130.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) -+++.+++.-+.... -+|.++|-+-+....-++..-+.+|+..| .+++|.......+. -|+||+.+|-++ T Consensus 129 al~~~t~~~gG~lvP--------~~~~~~ii~~~~~~s~l~~l~~~~~~~~~-~~~~~~~~~~~~a~-wv~E~~~~~~~~ 198 (425) T protein:vir:10 129 ALNKGEDSEGGYLTP--------IEWDRTITNKLVLISPMRQLCRVQPVSKA-GFSKLFNMGGTTSG-WVGEASQRPQTN 198 (425) T ss_pred HhhcCcCCCCceecc--------HhHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEcCCccee-eecccccccccc Confidence 233333333333222 23444444444444445566677777654 57788776666664 899999999765 Q ss_pred e-eeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------HhcCccceec-- Q lcl|Aclame:pro 81 V-ERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA---------LKTGTGTQDA-- 146 (296) Q Consensus 81 v-~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~a---------Lktat~t~~~-- 146 (296) . +.. ..+++.+|++--+ |.|.++.+.+ +-.+.-.++|+..|+.+++..|+.= |...+....+ T Consensus 199 ~~~f~---~v~~~~~k~~~~i~iS~ell~ds~~-~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~ 274 (425) T protein:vir:10 199 AATFQ---PLSFASGEIYANPAATQQILDDAEI-DLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAK 274 (425) T ss_pred ccccc---eeeeeheeeEeehHhHHHHHhcchh-HHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccccccc Confidence 3 332 4678888988865 9999865543 4678889999999999999988851 1110000000 Q ss_pred ------chhhHHHHHHHHHHHHHHhhcc---ccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEE Q lcl|Aclame:pro 147 ------LGAGLQGALASAWGKLQVLFED---YGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVII 212 (296) Q Consensus 147 ------t~~~lQ~Ala~~~~~~~~~Fed---ed~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II 212 (296) ....-+.+-...+.++.+.+.. ......+.++||.+...+++-.+-.-+..| .++.. -++|..|+ T Consensus 275 ~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~-~l~G~PV~ 353 (425) T protein:vir:10 275 HPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPA-TLAGYPVT 353 (425) T ss_pred ccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCc-eecceeeE Confidence 0000011111223333444432 222456889999998866532221112222 22222 27899999 Q ss_pred EeccCCCceEEEEcccceEEEEecCcchhhhhhhcc------------cc--ccccceEEEeccccceeehhhhhhHHHH Q lcl|Aclame:pro 213 STNDVTKGEIWATVPENIIFAYINPNNSELAKEFNL------------YG--DPTGYIGMNHFQENTTLTIQTLLVSGML 278 (296) Q Consensus 213 ~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~------------~t--d~tGliGv~h~~~~~~~t~et~~~~~~~ 278 (296) .++.+|... .++-.++|.| +++++-+ |. |.+++.+..+ +.+.. T Consensus 354 ~~~~~p~~~-----~~~~~i~~Gd-----~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r-------------~d~~v 410 (425) T protein:vir:10 354 EVPDMPDVA-----ANSTPILFGD-----FQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKR-------------VGGGL 410 (425) T ss_pred EecCcCCcc-----CCccEEEEEe-----hhccEEEEEecceEEEecccccCCcEEEEEEEE-------------eccEe Confidence 999988521 1222233322 2222211 11 2222222211 11111 Q ss_pred hhhhccceEEEEEecCCC Q lcl|Aclame:pro 279 MYPERIDGIVKVTLTPGV 296 (296) Q Consensus 279 lfpE~~dgvv~~tI~~~v 296 (296) .- .+++++.++.++= T Consensus 411 ~~---~~A~~~l~~~as~ 425 (425) T protein:vir:10 411 LN---PEPMRAMKVAASE 425 (425) T ss_pred ec---ccceEEEEeeccC Confidence 11 2244444444444 No 115 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.38 E-value=1.1e-07 Score=58.74 Aligned_cols=273 Identities=9% Similarity=0.021 Sum_probs=135.4 Q ss_pred Cccccccc---------cccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCccc Q lcl|Aclame:pro 1 MVTSRTYP---------EENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVP 71 (296) Q Consensus 1 ~~~~~~~a---------e~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVa 71 (296) +....+-. -.+.+...+=+...--+|.+++-+.+....-++++-+.+|+..|..+.+|.-...+..-..|+ T Consensus 99 ~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 178 (409) T protein:vir:45 99 GASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLG 178 (409) T ss_pred hhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcccccccc Confidence 00000000 001111111111112234455544555555566667888998888887777554433324899 Q ss_pred CCceechhheeeeecceeEEEEeec-ccc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC-------- Q lcl|Aclame:pro 72 EGEVIPLSKVERKIHSEKKIELKKY-RKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG-------- 140 (296) Q Consensus 72 EGe~Iplskv~~~~~~t~~~tikK~-~K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta-------- 140 (296) ||+.+|-+.+... ..+++-+|. ++- +|.|.++.+.+ +-.+.-.++|..++..+++..|+.-=.++ T Consensus 179 E~~~~~~~~~~f~---~~~l~~~k~~~~~i~is~ell~ds~~-~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gi 254 (409) T protein:vir:45 179 ENEEAGEEDTDFG---MGSLGALKMTSKIIRVSNELLQDSAI-DMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGL 254 (409) T ss_pred ccccccccccccc---eeeeeeeeeeeeehhhhHHHHhccHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccccee Confidence 9999999998764 466666665 453 59999865533 56788889999999999999988421111 Q ss_pred ----ccce-ecchh--hHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHh--cCCc---cccceeechhhhhhhhe Q lcl|Aclame:pro 141 ----TGTQ-DALGA--GLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYI--AKAG---ITTQTAFGLTYLVDFTG 208 (296) Q Consensus 141 ----t~t~-~~t~~--~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l--~~a~---i~~q~~fg~tyl~nfLG 208 (296) +... ..... ++. .+.+.+..+...+. .....++++||.+...++ ++++ +-.....++.-- .++| T Consensus 255 l~~~~~~~~~~~~~~~~~d-~i~~l~~~l~~~~~--~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-~l~G 330 (409) T protein:vir:45 255 AASVTGTTQTAAANAVKWQ-EILALKHSIDPAYR--RGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPA-SVLN 330 (409) T ss_pred eeccccccccccccccchH-HHHHHHHhhhhhhc--cCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCc-eecc Confidence 1000 01111 111 11122222222221 224457788999987653 3332 111111111111 3789 Q ss_pred eEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccc-eEEEeccc--cceeehhhh-hhHHHHhhhhcc Q lcl|Aclame:pro 209 TVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGY-IGMNHFQE--NTTLTIQTL-LVSGMLMYPERI 284 (296) Q Consensus 209 ~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGl-iGv~h~~~--~~~~t~et~-~~~~~~lfpE~~ 284 (296) ..|+.+..+|... .++-.++|-|. ++.+ + -+..++ +-..|+.. .+...+-.. -+.+. |-.. T Consensus 331 ~PV~~~~~~p~~~-----~~~~~i~~Gd~-----~~~~-i-~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~---~~~~ 395 (409) T protein:vir:45 331 VPYVIDQEIDDIG-----AGKKFMFCGDF-----DRFI-I-RRVRYMILKRLVERYAEYDQTGFLAFHRFDCI---LEDT 395 (409) T ss_pred eeeEEecCcCCcc-----CCccEEEEeeh-----hhhh-e-eeccceEEEEeecccccCCcEEEEEEEEeccE---eech Confidence 9999999998521 23334445443 3322 1 111111 11112111 111111110 12222 2344 Q ss_pred ceEEEEEecCCC Q lcl|Aclame:pro 285 DGIVKVTLTPGV 296 (296) Q Consensus 285 dgvv~~tI~~~v 296 (296) ++++..++++++ T Consensus 396 ~A~~~l~~k~s~ 407 (409) T protein:vir:45 396 SAIKALVGKGSV 407 (409) T ss_pred hheEEEEeccCC Confidence 588999999999 No 116 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.38 E-value=2.5e-08 Score=62.25 Aligned_cols=261 Identities=12% Similarity=0.089 Sum_probs=143.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) +..+++..-.+++.. -++.+++-..+.....+++..+..|+. | +.++|.-.....+. -|+||+.||-++ T Consensus 148 ~~~~~~~~g~~~~vP--------~~~~~~i~~~l~~~~~l~~~~~v~~~~-g-~~~~~~~~~~~~a~-wv~E~~~~~~~~ 216 (466) T protein:vir:80 148 AQQKRAVSGAELTIP--------DVMLELLRDNMHRYSKLISKVRLRPLK-G-TARQNIAGAIPEGV-WTEAVANLNELS 216 (466) T ss_pred hhhhhhhcccccccc--------HHHHHHHHHhhhhhhhhhhheeeeecC-c-eeEeeeecCCccee-eccccccccccc Confidence 222332222222222 245666666666666677777877764 3 45666655555553 689999999988 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC------------ccc--e Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG------------TGT--Q 144 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta------------t~t--~ 144 (296) .+.. ..++.++|++.-+ |.|.++.+++ +-.+.-.++|+.+++.+++..|+.--.++ +.. . T Consensus 217 ~~f~---~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~ 292 (466) T protein:vir:80 217 LSFS---QIEVDGYKVGGFIPIPNSTLEDSDL-NLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNW 292 (466) T ss_pred cccc---ceeecceeeeeehhhhHHHHhcchH-HHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeeccccccccccc Confidence 7754 4778889988864 9999865554 46677888999999999999887511000 000 0 Q ss_pred --------ecchhhHHHH----------HHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc-ccceeec--hhhh Q lcl|Aclame:pro 145 --------DALGAGLQGA----------LASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI-TTQTAFG--LTYL 203 (296) Q Consensus 145 --------~~t~~~lQ~A----------la~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i-~~q~~fg--~tyl 203 (296) +.+...+..+ +...+..+...-........+..+||.....+++.... ..++.+. ..-. T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~ 372 (466) T protein:vir:80 293 GTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNT 372 (466) T ss_pred ccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCCc Confidence 0000011000 00111111111111223456678898887777665532 2232221 1111 Q ss_pred hhhheeEEEEeccCCCceEEEEcccceEEEEecCcchh----hhhhhccccccccceEEEeccccceeehhhhhhHHHHh Q lcl|Aclame:pro 204 VDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSE----LAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLM 279 (296) Q Consensus 204 ~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~----~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~l 279 (296) ..++|..|+.|..+|+|++++--... |.+-.+ ++ .+....|..|+++|.+..+. .|... T Consensus 373 ~~i~G~pvv~s~~~~~~~~~~g~~~~---y~i~~r-~~~~i~~~~~~~f~~d~~~~r~~~r~-------------dg~~~ 435 (466) T protein:vir:80 373 MPIVGGDIVILDFIPDNDIIGGYGSL---YLLAER-ADIKLAQSEHVRFIEDQTVFKGTARY-------------DGKPV 435 (466) T ss_pred ccccccceeecCccCccceeeecccc---EEEEee-cceEEEechhhhhhcCcEEEEEEEEE-------------ccEEe Confidence 13789999999999999987654443 333222 11 22333355688887776652 12222 Q ss_pred hhhccceEEEEEec---CCC Q lcl|Aclame:pro 280 YPERIDGIVKVTLT---PGV 296 (296) Q Consensus 280 fpE~~dgvv~~tI~---~~v 296 (296) ..++++.++|+ ++| T Consensus 436 ---~~~afv~~~~~~~~~~~ 452 (466) T protein:vir:80 436 ---FGEGFVAVNIANANPTT 452 (466) T ss_pred ---ccCceEEEEecCCCccc Confidence 33566666663 333 No 117 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.38 E-value=3.7e-08 Score=61.33 Aligned_cols=273 Identities=11% Similarity=0.068 Sum_probs=145.5 Q ss_pred Cccccccccccceehhhhhhhhhhh---hHHHHhhhHHHHHH----HhCcccccccCCCCeeeeeeeeeeecccCcccCC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID---VTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD---f~~~f~~~i~~L~~----~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEG 73 (296) |-+=+-..-+|.-.-.--+..-+.| |.+.|+..+.+=++ .++..+.....-|+++++|.-.-+... +...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~--~~~~g 78 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAG--YHTPG 78 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEe--eecCC Confidence 5443333333433211111122233 78888888765554 345555556678999999876444333 55678 Q ss_pred ceechhh-eeeeecceeEEEEee--ccc-ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc---- Q lcl|Aclame:pro 74 EVIPLSK-VERKIHSEKKIELKK--YRK-ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT---- 143 (296) Q Consensus 74 e~Iplsk-v~~~~~~t~~~tikK--~~K-~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t---- 143 (296) +.|...+ +..+ .++++|.+ |-. .+- ||+ |.+ .|...|..+|.+.++++++|..++..|..+... T Consensus 79 ~~l~~~~~~~~~---~~~l~ID~~ky~~~~VddiD~~-q~~--~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~ 152 (332) T protein:vir:78 79 TPIVGDAGIKAN---EKTLVMDDLLVSSQFVYSLDEI-FSQ--YSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) T ss_pred CCCCCCCCCCCc---eEEEEEehhhhhHHHHHhHHHH-hcC--cchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcc Confidence 8886654 5543 47788875 322 342 444 433 568999999999999999999999877543210 Q ss_pred ----------eecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCC--cc------cc-ceeechhh Q lcl|Aclame:pro 144 ----------QDALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKA--GI------TT-QTAFGLTY 202 (296) Q Consensus 144 ----------~~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a--~i------~~-q~~fg~ty 202 (296) .+++..+=-.++++++-++...+.+-+ ....+++|+|.-.+.+|+.. .+ +. ....++.. T Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~ 232 (332) T protein:vir:78 153 TGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG 232 (332) T ss_pred cccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceeccee Confidence 011101101224445555555565432 23468889999999999843 22 11 23445555 Q ss_pred hhhhheeEEEEeccCCCce--EEEEc--ccceEEEEecCcchhhhhhhccccccccceE------EE--ecc------cc Q lcl|Aclame:pro 203 LVDFTGTVIISTNDVTKGE--IWATV--PENIIFAYINPNNSELAKEFNLYGDPTGYIG------MN--HFQ------EN 264 (296) Q Consensus 203 l~nfLG~~II~S~kV~~G~--~~~t~--~~Nl~~ay~~~~~g~~~~~f~~~td~tGliG------v~--h~~------~~ 264 (296) +..+.|++|+.|+.+|.+. .+... +.+-|.| .+++++ .+|++. .. +++ .. T Consensus 233 i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~-----~~~~~~-------~~~~~~h~~a~~~v~~~~~~~~~t~~~ 300 (332) T protein:vir:78 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDY-----QVDASA-------LAGLIFHREAAGCIQSVAPTIQTTSGD 300 (332) T ss_pred eeEEeeeEEEecCccccCccccccccccccccccc-----cccccc-------ceEEeecccceeeeeeeccchhhhhcc Confidence 6668899999999998543 22111 1122211 122222 333332 11 110 01 Q ss_pred ceeehhhhhhHHHHhh---hhccceEEEEEecCC Q lcl|Aclame:pro 265 TTLTIQTLLVSGMLMY---PERIDGIVKVTLTPG 295 (296) Q Consensus 265 ~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~ 295 (296) +.--++.-.+-|...| +=|+|++++.+ ++ T Consensus 301 ~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~--~a 332 (332) T protein:vir:78 301 FNVQYQGDLIVGKLAMGCGSLRTSVAGSFQ--AA 332 (332) T ss_pred cchhhhHhhhhhhhhhcCceecccceEEEe--eC Confidence 1122222233333222 33455555443 33 No 118 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.33 E-value=8.3e-08 Score=59.42 Aligned_cols=272 Identities=11% Similarity=0.072 Sum_probs=125.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) +..+.++..-+.+..+++. =+|++..... +-|++ || .|.+|+..|+ +++|++.....+. -|+||+.+|.++ T Consensus 125 ~~~~~~~~~gg~liP~~~~----~~ii~~l~~~-~~l~~-~~-~~~~~~~~g~-~~~p~~~~~~~a~-~v~Eg~~~~~~~ 195 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIH----SEVIELLRDR-TIVRK-LG-ARSIPLPNGN-MSLPRLAGGATAS-YTGENQDAKVSE 195 (428) T ss_pred hhhcccccCCccccchhHH----HHHHHHHhhh-chhhh-hc-ceeeecCCcc-eEEEEEeCCccee-eeccCccccccc Confidence 1112111122222233331 1233332211 12222 23 3557777666 8899987666664 899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc-------------e- Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT-------------Q- 144 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t-------------~- 144 (296) .+.. ..+++.+|++..+ |.|.++.+.. +-.+.-.++|+.+|+.+++..|+.-=.++... + T Consensus 196 ~~f~---~i~~~~~k~~~~v~is~ell~ds~~-~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~ 271 (428) T protein:vir:10 196 ARFD---DVKLTAKTMIAMVPISNALIGRAGF-NVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLP 271 (428) T ss_pred ccee---eEEeeeEEEEEeehhhHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc Confidence 8764 4778899999875 9999865554 35677889999999999999887421111000 0 Q ss_pred --ecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCc Q lcl|Aclame:pro 145 --DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKG 220 (296) Q Consensus 145 --~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G 220 (296) .....++. .+...+..+...+.... ....+.++||.+...+.+-.+-.-+..|....-..++|..|+.++.+|.+ T Consensus 272 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~ 350 (428) T protein:vir:10 272 WAADAAVNLD-TIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPAN 350 (428) T ss_pred ccccccccHH-HHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEecccccc Confidence 00111111 11111111112222211 12457799999998665432211123331111113789999999999986 Q ss_pred eEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehh----hhhhH--HHHhhhhccceEEE----E Q lcl|Aclame:pro 221 EIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQ----TLLVS--GMLMYPERIDGIVK----V 290 (296) Q Consensus 221 ~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~e----t~~~~--~~~lfpE~~dgvv~----~ 290 (296) ... ..+.-.++|.|.+ +.+ +. + -|=|-+..+......... ..+.. ..+..=+|.|+-+. + T Consensus 351 ~~~--~~~~~~i~~gd~s-----~~~-i~-~-~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~ 420 (428) T protein:vir:10 351 LGE--GGKESEIYFADFN-----DVV-IG-E-DGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGL 420 (428) T ss_pred ccC--CCccceEEEEecc-----eEE-EE-E-ecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceE Confidence 432 2233445555532 111 00 0 011111111110000000 00000 00111223332221 0 Q ss_pred EecCCC Q lcl|Aclame:pro 291 TLTPGV 296 (296) Q Consensus 291 tI~~~v 296 (296) .+-..| T Consensus 421 ~~~t~~ 426 (428) T protein:vir:10 421 VLGTGV 426 (428) T ss_pred EEEecc Confidence 000111 No 119 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.33 E-value=2.7e-08 Score=62.09 Aligned_cols=250 Identities=11% Similarity=0.045 Sum_probs=124.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeee-cccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVT-LAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig-~A~gdVaEGe~Ipls 79 (296) -.++.+.+.-+.+..+ +|.+++-+.+..---++++-+..++. ..++|+..+.+ .+ .-|+||+.+|-+ T Consensus 132 a~~~~t~~~GG~lIP~--------~~~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~p~~~~~~~~a-~~v~Eg~~~~~~ 199 (402) T protein:vir:93 132 ALPTGNDSGGDKLLPK--------TLSKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSYTLDDD-DFITDVETAKEL 199 (402) T ss_pred hhccCCCcCCccccch--------hHHHHHHHhHHhhhhhhhhceeeecC---CceeeeeeccCCcc-cccccccccccc Confidence 1222222222333333 33343333333323333333444443 24567765543 45 489999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------ceecch Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------TQDALG 148 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t~~~t~ 148 (296) +.+.. ..++..+|++.-+ |.|.++.+.++ -.+.-.++|+.++..+.++.+|....+..+ ....++ T Consensus 200 ~~~f~---~i~~~~~k~~~~i~iS~ell~Ds~~~-l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~ 275 (402) T protein:vir:93 200 KAKGD---TVKFTTNKFKVFAAISDTVIHGSDVD-LVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG 275 (402) T ss_pred ccccc---eeeecceeeeeechhhHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc Confidence 88764 5788889998854 99998655544 566778888888888777766643221100 001122 Q ss_pred hhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEEEccc Q lcl|Aclame:pro 149 AGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPE 228 (296) Q Consensus 149 ~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~ 228 (296) ... +..+.++............+.+||+.+...+++--.=+....+.+.- ..+||..|+.+...+. ++ .. T Consensus 276 ~~~----~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~~~~-~~llG~PV~~t~~~~~--i~---~G 345 (402) T protein:vir:93 276 ADM----YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPA-EKVFGKPVVFTDAAVK--PI---VG 345 (402) T ss_pred cch----HHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCC-ccccccceEEecCCCc--ee---ee Confidence 222 12222222222221123557899999987665432111112222211 1377988888876542 22 24 Q ss_pred ceEEEEecCcchhhhhhhcccc----ccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYG----DPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~t----d~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +...||...... .+..++ +.++|.+.... -|.+. +.++|+..+|+++. T Consensus 346 Df~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~r~-------------Dg~v~---~~~A~~~l~ik~~~ 397 (402) T protein:vir:93 346 DFNYFGINYDGT----TYDTDKDVKKGEYLFVLTAWY-------------DQQRT---LDSAFRIAKAKENT 397 (402) T ss_pred chhhhhhhhhhh----hhhhhhcccCCceEEEEEEEe-------------CcEEe---chhheEEEEeecCC Confidence 455555554311 122222 23333322211 11111 34567778887776 No 120 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.27 E-value=3.5e-07 Score=55.96 Aligned_cols=283 Identities=11% Similarity=0.057 Sum_probs=146.0 Q ss_pred Cccccccccccceehhhhhhhhhhh--hHHHHhhhHHHHHH----HhCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD--f~~~f~~~i~~L~~----~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |--..+-..-|..--.- +..-+-| |++.|+..+.+-++ .++..|..+...|+++++|.-.-.... +...|+ T Consensus 1 ~~~~~~~~~~~t~~g~~-~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~--~~~~g~ 77 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKG-QSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAA--YLKPGE 77 (347) T ss_pred CCCCccCcccccccccC-CcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeee--eecCCC Confidence 32111111111100000 1122334 99999988866554 345555567788999999876444332 667899 Q ss_pred eechhheeeeecceeEEEEeeccc---ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC--------- Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKKYRK---ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG--------- 140 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK~~K---~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta--------- 140 (296) .|+.+--..+ ....+++|.++.- .|- ||+ | .-.|...+..+|...++++++|..++..|... T Consensus 78 ~l~~~~~~~~-~~e~~ltiD~~~y~~~~VddiD~~-q--~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~ 153 (347) T protein:vir:33 78 NLDDKRKDIK-HTEKVIHIDGLLTADVLIYDIEDA-M--NHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNE 153 (347) T ss_pred CCCCCCCCCc-cceEEEEechhhhhhHHHhhHHHH-h--cCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 9987642222 2345677765443 453 555 4 45679999999999999999999998654211 Q ss_pred -----------------ccceecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCcccccee---- Q lcl|Aclame:pro 141 -----------------TGTQDALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTA---- 197 (296) Q Consensus 141 -----------------t~t~~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~~---- 197 (296) +++.. +....-.+++.++-++...+++-+ ....+++|+|...+.+|+..++..... T Consensus 154 ~~~~~~~~~~~~~~~~~tg~~~-d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~ 232 (347) T protein:vir:33 154 NIEGLGKPTVLTLVKPTTGSLT-DPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALL 232 (347) T ss_pred cccccccccccccccccccccc-chhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccc Confidence 11000 001112244555556666665532 235789999999999998776542211 Q ss_pred -echhhhhhhheeEEEEeccCCCceEEEE----cccceEEEEecCcchhhhhhhccccccccc------eEEEeccc--- Q lcl|Aclame:pro 198 -FGLTYLVDFTGTVIISTNDVTKGEIWAT----VPENIIFAYINPNNSELAKEFNLYGDPTGY------IGMNHFQE--- 263 (296) Q Consensus 198 -fg~tyl~nfLG~~II~S~kV~~G~~~~t----~~~Nl~~ay~~~~~g~~~~~f~~~td~tGl------iGv~h~~~--- 263 (296) ..-..+.+++|++|+.|+.+|.+.+-.. .+++-+.+-.+.. .-...+| ++.-|| +|....++ T Consensus 233 ~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~-~~~~~a~---~~~~gl~~h~~A~g~v~~~~~~~ 308 (347) T protein:vir:33 233 DPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSS-TTVKVAL---DNVVGLFQHRSAVGTVKLKDLAL 308 (347) T ss_pred ccccceeEEEeceeEEEecccccCccccccccccccccccccCCcc-cceeccc---cceeeeeecchhheeeeeeceee Confidence 1112233588999999999998743321 1222222222211 0011111 111232 12111111 Q ss_pred --cceeehhhhhhHHHHhh---hhccceEEEEEecCCC Q lcl|Aclame:pro 264 --NTTLTIQTLLVSGMLMY---PERIDGIVKVTLTPGV 296 (296) Q Consensus 264 --~~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~v 296 (296) .+...++.-++-|...| .=|+|+++.... +.| T Consensus 309 e~~r~~~~~~d~i~~~~~~G~~vlrP~~av~i~~-~~~ 345 (347) T protein:vir:33 309 ERARRANYQADQIIAKYAMGHGGLRPEAAGAIVL-PKV 345 (347) T ss_pred eeccchhhhhHhhhhhhhcCCceecccceEEEec-CCC Confidence 13334444333333332 123455555532 333 No 121 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.24 E-value=9.2e-08 Score=59.15 Aligned_cols=263 Identities=14% Similarity=0.015 Sum_probs=126.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) .+..+.....+.....+-+...--++.+.... +.....++...+..++++|. +++|.+...+....-|+||+.+| .+ T Consensus 147 ~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e~~~~~e~~ 224 (437) T protein:vir:10 147 YLKTGEVRDVTGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESVTTTT-GKLPIFNNSTDLLTAHTEYGQTTKNA 224 (437) T ss_pred HHHhhhhhhhhhcccccccccchHHHHHHHHH-hhhhhhhhhcceeEeeccCc-eeeEEeeccccccccccccccccccc Confidence 11110000011111122111111122222111 11111233345556777664 77887765555446899999998 45 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcccee--cchhhHHHHH Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD--ALGAGLQGAL 155 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~--~t~~~lQ~Al 155 (296) ..+.. ..++..+|++.-+ |.|.++.+.+ +-.+.-.+.|+..|..+++..+++-+.+++.... .+.+.+..++ T Consensus 225 ~~~~~---~v~~~~~k~~~~~~is~ell~ds~~-~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~ 300 (437) T protein:vir:10 225 TPVIT---PILWDLKTYTGGYVFSQELISDSSY-DWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVL 300 (437) T ss_pred cccce---eeeeehhheeeehhhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHH Confidence 44442 4678889988865 9999876655 3566788899999999999999988866654322 2333444333 Q ss_pred HHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEEEec--cCCCceEEEEccc Q lcl|Aclame:pro 156 ASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVIISTN--DVTKGEIWATVPE 228 (296) Q Consensus 156 a~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II~S~--kV~~G~~~~t~~~ 228 (296) .. .+ ........+.++||.+...+++-.+-.-+..| ++.- ..++|-.|+.+. -+|.+. ++ T Consensus 301 ~~---~l----~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~-~~l~G~pv~~~~~~~~~~~~-----~~ 367 (437) T protein:vir:10 301 NV---TL----KPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATG-YTLLGKTVVIVDDKLFPSAS-----AG 367 (437) T ss_pred Hh---hh----hhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCC-cccccceeEEecccccCCcC-----CC Confidence 21 11 11122456889999998865542211112222 1111 127897776653 334332 34 Q ss_pred ceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhc-------cceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPER-------IDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~-------~dgvv~~tI~~~v 296 (296) +..++|.|. ++..-++ |..| +.+.+... .-...+... .-+| .++++.++.+.|. T Consensus 368 ~~~~~~gd~-----~~~~~~~-~r~~-~~~~~~~~--~~~~~~~~~-----~~~r~d~~~~~~~a~~~l~~~~~~ 428 (437) T protein:vir:10 368 DVNIVVAPL-----KKAVINF-KLTE-ITGQFQDT--YDIWYKQLG-----IFLRQNVVQASKDLIVNLTGKLKA 428 (437) T ss_pred ceEEEEeec-----cccEEEE-eeec-eEEEEecc--cccccceee-----EEEEEccEEecccceEEEEeeccc Confidence 444444443 2222211 1111 11111111 111111111 1123 4456666665433 No 122 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.23 E-value=4.6e-08 Score=60.82 Aligned_cols=251 Identities=12% Similarity=0.051 Sum_probs=122.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) -.++.+.+.-+.+..+++ .+++-+.+..-.-++++-+..|+. ..++|+..+.+..-.-|+||+.+|-++ T Consensus 117 a~~~~~~~~gG~lIP~~~--------~~~Ii~~~~~~~~l~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~~~ 185 (387) T protein:vir:96 117 ALPTGNDSGGDKLLPKTL--------SKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSYTLDDDDFITDVETAKELK 185 (387) T ss_pred hhccCCCCCCceeechhH--------HHHHHHHHHhhchhhhhceeeecC---CceeeeeeccCCccccccccccccccc Confidence 112222222233333333 333322222222222333334443 356777665543334799999999998 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------ceecchh Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------TQDALGA 149 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t~~~t~~ 149 (296) .+.. ..++..+|++.-+ |.|.++.+++ +-.+.-.++|+.++..+.++.+|..-....+ ..+.+++ T Consensus 186 ~~f~---~v~l~~~k~~~~i~iS~ell~ds~~-~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~ 261 (387) T protein:vir:96 186 AKGD---TVKFTTNKFKVFAAISDTVIHGSDV-DLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGA 261 (387) T ss_pred cccc---eeeechheeeeechhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccccccc Confidence 8764 5778889998854 9999865544 3456777888888888777777643321100 0011222 Q ss_pred hHHHHHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEEEc Q lcl|Aclame:pro 150 GLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATV 226 (296) Q Consensus 150 ~lQ~Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~ 226 (296) .. +..+.+.+.+- .....+.+||+.+.+.+++--.=+....+.+.. ..+||..|+.+...++ ++ T Consensus 262 ~~-------~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~-~~llG~PV~~~~~~~~--~~--- 328 (387) T protein:vir:96 262 DM-------YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPA-EKVFGKPVVFTDAAVK--PI--- 328 (387) T ss_pred ch-------HHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCC-ccccccceEEecCCCc--ee--- Confidence 21 22233333321 123457789999987665432212222232222 2478988888876542 22 Q ss_pred ccceEEEEecCcchhhhhhhc-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 227 PENIIFAYINPNNSELAKEFN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 227 ~~Nl~~ay~~~~~g~~~~~f~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ..+...||++.. +-.-..+. ..+|.++|.+... +-|... +.++++...|+++- T Consensus 329 ~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r-------------~Dg~v~---~~~A~~~l~~ka~~ 382 (387) T protein:vir:96 329 VGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAW-------------YDQQRT---LDSAFRIAKAKENT 382 (387) T ss_pred eechhhhhhhhh-hhhheecccccCCceEEEEEEE-------------eCcEee---chhheEEEEeecCC Confidence 234444444432 11111111 1113333333221 111111 24466778886666 No 123 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.23 E-value=4.6e-08 Score=60.82 Aligned_cols=251 Identities=12% Similarity=0.051 Sum_probs=122.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) -.++.+.+.-+.+..+++ .+++-+.+..-.-++++-+..|+. ..++|+..+.+..-.-|+||+.+|-++ T Consensus 117 a~~~~~~~~gG~lIP~~~--------~~~Ii~~~~~~~~l~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~~~ 185 (387) T protein:vir:94 117 ALPTGNDSGGDKLLPKTL--------SKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSYTLDDDDFITDVETAKELK 185 (387) T ss_pred hhccCCCCCCceeechhH--------HHHHHHHHHhhchhhhhceeeecC---CceeeeeeccCCccccccccccccccc Confidence 112222222233333333 333322222222222333334443 356777665543334799999999998 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------ceecchh Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------TQDALGA 149 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t~~~t~~ 149 (296) .+.. ..++..+|++.-+ |.|.++.+++ +-.+.-.++|+.++..+.++.+|..-....+ ..+.+++ T Consensus 186 ~~f~---~v~l~~~k~~~~i~iS~ell~ds~~-~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~ 261 (387) T protein:vir:94 186 AKGD---TVKFTTNKFKVFAAISDTVIHGSDV-DLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGA 261 (387) T ss_pred cccc---eeeechheeeeechhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccccccc Confidence 8764 5778889998854 9999865544 3456777888888888777777643321100 0011222 Q ss_pred hHHHHHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEEEc Q lcl|Aclame:pro 150 GLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATV 226 (296) Q Consensus 150 ~lQ~Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~ 226 (296) .. +..+.+.+.+- .....+.+||+.+.+.+++--.=+....+.+.. ..+||..|+.+...++ ++ T Consensus 262 ~~-------~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~-~~llG~PV~~~~~~~~--~~--- 328 (387) T protein:vir:94 262 DM-------YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPA-EKVFGKPVVFTDAAVK--PI--- 328 (387) T ss_pred ch-------HHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCC-ccccccceEEecCCCc--ee--- Confidence 21 22233333321 123457789999987665432212222232222 2478988888876542 22 Q ss_pred ccceEEEEecCcchhhhhhhc-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 227 PENIIFAYINPNNSELAKEFN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 227 ~~Nl~~ay~~~~~g~~~~~f~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ..+...||++.. +-.-..+. ..+|.++|.+... +-|... +.++++...|+++- T Consensus 329 ~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r-------------~Dg~v~---~~~A~~~l~~ka~~ 382 (387) T protein:vir:94 329 VGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAW-------------YDQQRT---LDSAFRIAKAKENT 382 (387) T ss_pred eechhhhhhhhh-hhhheecccccCCceEEEEEEE-------------eCcEee---chhheEEEEeecCC Confidence 234444444432 11111111 1113333333221 111111 24466778886666 No 124 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.23 E-value=4.6e-08 Score=60.82 Aligned_cols=251 Identities=12% Similarity=0.051 Sum_probs=122.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) -.++.+.+.-+.+..+++ .+++-+.+..-.-++++-+..|+. ..++|+..+.+..-.-|+||+.+|-++ T Consensus 117 a~~~~~~~~gG~lIP~~~--------~~~Ii~~~~~~~~l~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~~~ 185 (387) T protein:vir:26 117 ALPTGNDSGGDKLLPKTL--------SKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSYTLDDDDFITDVETAKELK 185 (387) T ss_pred hhccCCCCCCceeechhH--------HHHHHHHHHhhchhhhhceeeecC---CceeeeeeccCCccccccccccccccc Confidence 112222222233333333 333322222222222333334443 356777665543334799999999998 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------ceecchh Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------TQDALGA 149 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t~~~t~~ 149 (296) .+.. ..++..+|++.-+ |.|.++.+++ +-.+.-.++|+.++..+.++.+|..-....+ ..+.+++ T Consensus 186 ~~f~---~v~l~~~k~~~~i~iS~ell~ds~~-~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~ 261 (387) T protein:vir:26 186 AKGD---TVKFTTNKFKVFAAISDTVIHGSDV-DLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGA 261 (387) T ss_pred cccc---eeeechheeeeechhhHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccccccc Confidence 8764 5778889998854 9999865544 3456777888888888777777643321100 0011222 Q ss_pred hHHHHHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEEEc Q lcl|Aclame:pro 150 GLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATV 226 (296) Q Consensus 150 ~lQ~Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~ 226 (296) .. +..+.+.+.+- .....+.+||+.+.+.+++--.=+....+.+.. ..+||..|+.+...++ ++ T Consensus 262 ~~-------~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~-~~llG~PV~~~~~~~~--~~--- 328 (387) T protein:vir:26 262 DM-------YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPA-EKVFGKPVVFTDAAVK--PI--- 328 (387) T ss_pred ch-------HHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCC-ccccccceEEecCCCc--ee--- Confidence 21 22233333321 123457789999987665432212222232222 2478988888876542 22 Q ss_pred ccceEEEEecCcchhhhhhhc-cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 227 PENIIFAYINPNNSELAKEFN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 227 ~~Nl~~ay~~~~~g~~~~~f~-~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ..+...||++.. +-.-..+. ..+|.++|.+... +-|... +.++++...|+++- T Consensus 329 ~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r-------------~Dg~v~---~~~A~~~l~~ka~~ 382 (387) T protein:vir:26 329 VGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAW-------------YDQQRT---LDSAFRIAKAKENT 382 (387) T ss_pred eechhhhhhhhh-hhhheecccccCCceEEEEEEE-------------eCcEee---chhheEEEEeecCC Confidence 234444444432 11111111 1113333333221 111111 24466778886666 No 125 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.21 E-value=1.2e-06 Score=53.01 Aligned_cols=282 Identities=11% Similarity=0.040 Sum_probs=157.8 Q ss_pred Cccccccccccceehhhhhhhhh--hhhHHHHhhhHHHHHHH----hCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPIT--IDVTNKFQENISKLLEM----LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~s--iDf~~~f~~~i~~L~~~----LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |-.-.+.-..|+....--+.+-+ ==|.+.|+..+.+=++. ++..|...+.-|.++++|.-.-+..+ ....|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~--~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAA--YLAPGE 78 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEE--eeecCC Confidence 54444445555555433322111 11788888888776654 45556667788999999865444433 567899 Q ss_pred eechhheeeeecceeEEEEeecccc---cC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------ Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKKYRKA---TT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------ 143 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK~~K~---vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------ 143 (296) +++.+....+. ...+++|.+..-. |- ||+ | .-.|...|..+|...++++.+|.-++..|..+... T Consensus 79 ~l~~~~~~~~~-~e~~ltID~~~y~~~~VddiD~~-q--~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~ 154 (345) T protein:vir:22 79 NLDDKRKDIKH-TEKVITIDGLLTADVLIYDIEDA-M--NHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNE 154 (345) T ss_pred CCCCCCCCccc-ceEEEEecchhhhhhhHhhHHHH-h--cCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 98776432221 2356787654332 32 555 4 34569999999999999999999999766432110 Q ss_pred ------------eec------chhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCccccceeech--- Q lcl|Aclame:pro 144 ------------QDA------LGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGL--- 200 (296) Q Consensus 144 ------------~~~------t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~--- 200 (296) .+. .....+.+++.++-++...+++-+ ....+++|+|...+-+|.+..+.... +++ T Consensus 155 ~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~-~~~~~~ 233 (345) T protein:vir:22 155 NIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAAN-YAALID 233 (345) T ss_pred cccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccc-cccccc Confidence 000 011224455666666666665532 23478999999999888776553221 222 Q ss_pred ---hhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhh---hhhhccccccccceEEEecccc---------- Q lcl|Aclame:pro 201 ---TYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL---AKEFNLYGDPTGYIGMNHFQEN---------- 264 (296) Q Consensus 201 ---tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~---~~~f~~~td~tGliGv~h~~~~---------- 264 (296) ..+.++.|.+|+.|+.+|.| ...++++. -+++ +..+ +..++.+.+.+..+|+...+.- T Consensus 234 ~~~G~V~~i~G~~V~~sn~lp~~-~~~~~~~~----~~~~-~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~ 307 (345) T protein:vir:22 234 PEKGSIRNVMGFEVVEVPHLTAG-GAGTAREG----TTGQ-KHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLA 307 (345) T ss_pred cccceEEEEeceEEEeccccccc-ccCccccC----cccc-cccccccccceeeeeccCceEEEEEehhheeeeeeecce Confidence 12335789999999999964 22222211 1111 1111 2234455566667776553331 Q ss_pred ----ceeehhhhhhHHHHhh---hhccceEEEEEecCC Q lcl|Aclame:pro 265 ----TTLTIQTLLVSGMLMY---PERIDGIVKVTLTPG 295 (296) Q Consensus 265 ----~~~t~et~~~~~~~lf---pE~~dgvv~~tI~~~ 295 (296) +.--+++-++=|...| +=|+++.+++..+-- T Consensus 308 ~e~~r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 308 LERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eeeeechhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 1122333333333333 234455555544433 No 126 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.17 E-value=8e-08 Score=59.48 Aligned_cols=249 Identities=12% Similarity=0.085 Sum_probs=125.4 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee-ecccCcccCCceechh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Ipls 79 (296) -.++++.+.-+.+..+ +|.+++-+.+....-+.++-+..++.. .++|+-.+. +.|. -|+||+.+|-+ T Consensus 82 al~~~~~~~gG~lIP~--------~~~~~Ii~~l~~~s~l~~~~~v~~~~~---~~~p~~~~~~~~a~-~v~E~~~~~~~ 149 (352) T protein:vir:78 82 ALPTGNDSGGDKLLPK--------TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDD-FITDVETAKEL 149 (352) T ss_pred HhccCCCCCCceeccH--------hHHHHHHHHHHhhcchhhheeeEecCC---ceEEEEecCCCccc-ccccccccccc Confidence 1122222333333332 444555444444444455555555542 345654444 4564 89999999999 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc----------eecc Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT----------QDAL 147 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t----------~~~t 147 (296) +.+.. ..++..||++.-+ |.|.++.+. -+-.+.-.++|+..+.++.+..+|..-. +++. .+.+ T Consensus 150 ~~~f~---~v~~~~~k~~~~i~is~ell~Ds~-~~l~~~i~~~la~~~~~~e~~~~~~~g~-g~~~~~g~l~~~~~~~~t 224 (352) T protein:vir:78 150 KLKGD---TVKFTTNKFKVFAAISDTVIHGSD-VDLVNWVENALQSGLAAKERKDALAVSP-KSGLEHMSFYNGSVKEVE 224 (352) T ss_pred cccce---eeeecceeEEeechhhHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHhhhhcCC-CCcccccceecccccccc Confidence 88764 4778889998854 999986444 3466777888888887764555553211 1110 0112 Q ss_pred hhhHHHHHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEE Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWA 224 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~ 224 (296) +.+. +..+.+.+.+- -....+.++||.+...+++-..=+....+.+..- .+||-.|+.+...+ +++ T Consensus 225 ~~~~-------~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~~~~-~llG~PV~~~~~~~--~~~- 293 (352) T protein:vir:78 225 GANM-------YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAE-KVFGKPVVFTDAAV--KPI- 293 (352) T ss_pred ccch-------HHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccccCCc-cccccceEEecCCC--cee- Confidence 2221 22233333321 1235678999999877665322222233333322 37798888877543 232 Q ss_pred EcccceEEEEecCcchhhhhhhccccc-cccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 225 TVPENIIFAYINPNNSELAKEFNLYGD-PTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 225 t~~~Nl~~ay~~~~~g~~~~~f~~~td-~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) -.+...||++... ..|..+++ .+|++|+.-.. -+-|...-|| +++.++++++- T Consensus 294 --~Gdf~~~~~~~~~----~~~~~~~~~~~g~~~f~~~~----------r~Dg~~~~~e---A~~~l~~~a~~ 347 (352) T protein:vir:78 294 --VGDFNYFGINYDG----TTYDTDKDVKKGEYLFVLTA----------WYDQQRTLDS---AFRIAKAKEST 347 (352) T ss_pred --Eeehhhhhhhhhh----heeeeeccccCCeeEEEEEe----------eeCceeechh---heEEEEeeccc Confidence 2445555544321 12222222 23333332111 1112222233 45666665555 No 127 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.17 E-value=2.4e-07 Score=56.85 Aligned_cols=254 Identities=15% Similarity=0.029 Sum_probs=145.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) +++.++.+.... ..--++.+++-+.+.+.--++.+-+.+|+. ..+++|+-.-.+.|. -|+|+++|+ -+ T Consensus 78 ~~~~~~~~~gg~--------lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~~~i~~~~~~~~a~-wv~e~~~~~~~~ 146 (377) T protein:vir:96 78 IDKNVGGKDKFK--------LLPEETMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAV-WGDIFGEIKGQL 146 (377) T ss_pred HHhcCCCCCCce--------ecCHHHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEecCCccee-Eeeccccccccc Confidence 333333333222 233456777777777777777777778873 456788765666664 788999886 33 Q ss_pred heeeeecceeEEEEeecccc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHH---------HHh---cCc---- Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVT---------ALK---TGT---- 141 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~---------aLk---tat---- 141 (296) ..+- ...++..+|+..- +|-|-++.+++ +-.+.-.++|+.+++..++..|+. .|+ .++ T Consensus 147 ~~~f---~~i~l~~~kl~~~~~is~~ll~ds~~-~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~ 222 (377) T protein:vir:96 147 KQAF---KEQDFSQFKLTAFVVIPKDALKFGPK-WLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQS 222 (377) T ss_pred Cccc---eeEeeeeeeEEeechhhHHHhhcchh-hHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccc Confidence 4333 3577888888875 48888854444 466788899999999999988875 111 000 Q ss_pred ---------------cc-eecchhhHHHHHHHHHHHHHHhhccc-------cCcceEEEEcHHHHHHHhcCCccccceee Q lcl|Aclame:pro 142 ---------------GT-QDALGAGLQGALASAWGKLQVLFEDY-------GSERAIVFANSLDVAEYIAKAGITTQTAF 198 (296) Q Consensus 142 ---------------~t-~~~t~~~lQ~Ala~~~~~~~~~Fede-------d~~~~VlFvNP~Daa~~l~~a~i~~q~~f 198 (296) ++ +..+.+.+-.-+ .++...+... .....+.++||.+.++.+++-....+ T Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~--- 295 (377) T protein:vir:96 223 TGRDITTYKTDKEAIADLSDLDPDTAVELL----VPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ--- 295 (377) T ss_pred ccccccceeeccccccccccCChhHHHHHH----HHHHHhhccccccccccccCceEEEEchhhHHhccccccccCC--- Confidence 00 011222222222 2222222211 12356899999998876654432221 Q ss_pred chhhhhhhh--eeEEEEeccCCCceEEEEcccceEEEEecCcchhh--hhhhccccccccceEEEeccccceeehhhhhh Q lcl|Aclame:pro 199 GLTYLVDFT--GTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL--AKEFNLYGDPTGYIGMNHFQENTTLTIQTLLV 274 (296) Q Consensus 199 g~tyl~nfL--G~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~--~~~f~~~td~tGliGv~h~~~~~~~t~et~~~ 274 (296) .|.|. +.| |..++.|..+|+|++++-...+ +..++-.+=.+ +...-+..|+++|.+..+.- .+-. T Consensus 296 ~G~~~-~~l~~p~~v~~s~~~p~~~i~fgdf~~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~d-G~~~------- 364 (377) T protein:vir:96 296 FGEYV-TVLPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFY-GKAK------- 364 (377) T ss_pred CCCce-eccCCCceEEecCCCCcccEEEEEcCc--EEEEEecccEEEeehhhhhhcCCeEEEEEEEEc-CEEe------- Confidence 12333 244 4789999999999988776665 23333221111 22334556888888876532 1111 Q ss_pred HHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 275 SGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 275 ~~~~lfpE~~dgvv~~tI~~~ 295 (296) ..+++++.+|+.+ T Consensus 365 --------d~~a~~vl~l~~~ 377 (377) T protein:vir:96 365 --------DNHTAALLTLAGG 377 (377) T ss_pred --------cCCcEEEEEEecC Confidence 2234666666666 No 128 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.12 E-value=1.4e-07 Score=58.18 Aligned_cols=254 Identities=12% Similarity=0.042 Sum_probs=121.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) -++..+...-+.+..+++ .+++-+.+...--+.++-+..++. ..++|+-.+.+...+-|+||+.+|-++ T Consensus 117 al~~~t~s~gG~~IP~~~--------~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~p~~~~~~~~a~~v~E~~~~~~~~ 185 (387) T protein:vir:93 117 ALPTGNDSGGDKLLPKTL--------SKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSYTLDDDDFITDVETAKELK 185 (387) T ss_pred hhccCcCCCCceeechhH--------HHHHHHHHHhhchhhhheeeeecC---CceEEEEeecCCccccccCcccccccc Confidence 112222222333333333 333333332222233333444443 245666444433234899999999999 Q ss_pred eeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---------ceecchh Q lcl|Aclame:pro 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---------TQDALGA 149 (296) Q Consensus 81 v~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---------t~~~t~~ 149 (296) ++.. ..+++.+|++.-+ |.|.++.++++ -.+.-.++|+..+..+.++++|..-....+ ....++. T Consensus 186 ~~f~---~v~~~~~k~~~~~~iS~ell~Ds~~~-l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~ 261 (387) T protein:vir:93 186 LKGD---TVKFTTNKFKVFAAISDTVIHGSDVD-LVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGA 261 (387) T ss_pred cccc---eeeeeheeeeeechhhHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccccccc Confidence 8764 4778889998864 99998766653 566777888888888877777643211100 0011222 Q ss_pred hHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhhhhhheeEEEEeccCCCceEEEEcccc Q lcl|Aclame:pro 150 GLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPEN 229 (296) Q Consensus 150 ~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~N 229 (296) .+ ++.+.++............+.++||.+...+++--.=+....+.+.- ..+||..|+.+...+. ++ ..+ T Consensus 262 ~~----~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~~~~-~~llG~PV~~~~~~~~--~~---~GD 331 (387) T protein:vir:93 262 DM----YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPA-EKVFGKPVVFTDAAVK--PI---VGD 331 (387) T ss_pred ch----HHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCC-ccccccceEEecCCCc--ee---eee Confidence 21 12222222222221123457789999987665422212222222221 2478988888775432 22 233 Q ss_pred eEEEEecCcchhhhhhhccccc-cccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 230 IIFAYINPNNSELAKEFNLYGD-PTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 230 l~~ay~~~~~g~~~~~f~~~td-~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ...||++... ..|..+++ .+|++|+.-.. + +-|.+. +.++++.++|+++- T Consensus 332 f~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~---r-------~d~~v~---~~eA~~~l~~k~~~ 382 (387) T protein:vir:93 332 FNYFGINYDG----TTYDTDKDVKKGEYLFVLTA---W-------YDQQRT---LDSAFRIAKAKENT 382 (387) T ss_pred hhhhheehhh----heeeecccccCCceeEEEEe---e-------eCceee---chhheEEEEeecCC Confidence 4444443321 11222221 23333332110 0 111111 23355667776665 No 129 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.03 E-value=4.7e-07 Score=55.25 Aligned_cols=267 Identities=11% Similarity=-0.025 Sum_probs=135.0 Q ss_pred Cccccc---cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRT---YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~---~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) ..|+.- +.+-+-.+..+-+...--+|.+++-+.+.+.--++.+-+.+++. | .+++|+-.-.+.|. -++|+++|+ T Consensus 64 ~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a~-w~~e~~~~~ 140 (381) T protein:vir:95 64 SLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSGVAV-WGKIYGEIK 140 (381) T ss_pred cccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-c-ceEEEEecCCccee-eeccccccc Confidence 111100 00000111122233334466777777777777778888878874 4 46788876666674 788988886 Q ss_pred h-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC-------------- Q lcl|Aclame:pro 78 L-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG-------------- 140 (296) Q Consensus 78 l-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta-------------- 140 (296) = +..+. ...+++.+|+..-+ |.|-++.+.+ +--+.-.++|+.+++..++.-|+.=-.++ T Consensus 141 ~~~~~~f---~~i~l~~~kl~~~~~is~elL~Ds~~-~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~ 216 (381) T protein:vir:95 141 GQLDAAF---SEETAIQNKLTAFVVLPKDLNDFGPA-WIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) T ss_pred ccccccc---eeeeecceeEEeechhhHHHhhcCHH-HHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccc Confidence 3 23332 24778889998764 8888754443 55677788999999999988776411000 Q ss_pred -----------cccee-cchhhHHHHHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCccccceeechhhhhh Q lcl|Aclame:pro 141 -----------TGTQD-ALGAGLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVD 205 (296) Q Consensus 141 -----------t~t~~-~t~~~lQ~Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~n 205 (296) .++.+ .+...+-..|......+...+... -....+..+||.+.+++++......+ +|.|+. T Consensus 217 ~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~---~G~~v~- 292 (381) T protein:vir:95 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA---NGVYVT- 292 (381) T ss_pred cccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCC---CCceee- Confidence 00000 000111111111111111111110 01235678999999988765544332 223332 Q ss_pred h--heeEEEEeccCCCceEEEEcccceEEEEecCcch--hhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhh Q lcl|Aclame:pro 206 F--TGTVIISTNDVTKGEIWATVPENIIFAYINPNNS--ELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) Q Consensus 206 f--LG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g--~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfp 281 (296) . +|..|+.|..+|+|++++-.-.+ +...+..+- +.+....+..|+++|.+..+--- . | T Consensus 293 ~l~~g~~vv~s~~~p~~~iifgDfs~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg-~---------------~ 354 (381) T protein:vir:95 293 ALPFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYG-K---------------A 354 (381) T ss_pred cCCCCceEEecCCCCcCcEEEEeccc--EEEEEecccEEEeechhHhhcCCeEEEEEEEEcC-E---------------E Confidence 2 36789999999999977644433 111221111 11122224446666665543211 0 0 Q ss_pred hccceEEE--EEe--cCCC Q lcl|Aclame:pro 282 ERIDGIVK--VTL--TPGV 296 (296) Q Consensus 282 E~~dgvv~--~tI--~~~v 296 (296) =..++++. ++| .+|+ T Consensus 355 ~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:95 355 KDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred ecCceEEEEEEEecCCCcC Confidence 11223334 444 2233 No 130 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.03 E-value=4.7e-07 Score=55.25 Aligned_cols=267 Identities=11% Similarity=-0.025 Sum_probs=135.0 Q ss_pred Cccccc---cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRT---YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~---~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) ..|+.- +.+-+-.+..+-+...--+|.+++-+.+.+.--++.+-+.+++. | .+++|+-.-.+.|. -++|+++|+ T Consensus 64 ~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a~-w~~e~~~~~ 140 (381) T protein:vir:10 64 SLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSGVAV-WGKIYGEIK 140 (381) T ss_pred cccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-c-ceEEEEecCCccee-eeccccccc Confidence 111100 00000111122233334466777777777777778888878874 4 46788876666674 788988886 Q ss_pred h-hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC-------------- Q lcl|Aclame:pro 78 L-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG-------------- 140 (296) Q Consensus 78 l-skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta-------------- 140 (296) = +..+. ...+++.+|+..-+ |.|-++.+.+ +--+.-.++|+.+++..++.-|+.=-.++ T Consensus 141 ~~~~~~f---~~i~l~~~kl~~~~~is~elL~Ds~~-~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~ 216 (381) T protein:vir:10 141 GQLDAAF---SEETAIQNKLTAFVVLPKDLNDFGPA-WIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) T ss_pred ccccccc---eeeeecceeEEeechhhHHHhhcCHH-HHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccc Confidence 3 23332 24778889998764 8888754443 55677788999999999988776411000 Q ss_pred -----------cccee-cchhhHHHHHHHHHHHHHHhhccc---cCcceEEEEcHHHHHHHhcCCccccceeechhhhhh Q lcl|Aclame:pro 141 -----------TGTQD-ALGAGLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVD 205 (296) Q Consensus 141 -----------t~t~~-~t~~~lQ~Ala~~~~~~~~~Fede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~n 205 (296) .++.+ .+...+-..|......+...+... -....+..+||.+.+++++......+ +|.|+. T Consensus 217 ~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~---~G~~v~- 292 (381) T protein:vir:10 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA---NGVYVT- 292 (381) T ss_pred cccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCC---CCceee- Confidence 00000 000111111111111111111110 01235678999999988765544332 223332 Q ss_pred h--heeEEEEeccCCCceEEEEcccceEEEEecCcch--hhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhh Q lcl|Aclame:pro 206 F--TGTVIISTNDVTKGEIWATVPENIIFAYINPNNS--ELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) Q Consensus 206 f--LG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g--~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfp 281 (296) . +|..|+.|..+|+|++++-.-.+ +...+..+- +.+....+..|+++|.+..+--- . | T Consensus 293 ~l~~g~~vv~s~~~p~~~iifgDfs~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg-~---------------~ 354 (381) T protein:vir:10 293 ALPFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYG-K---------------A 354 (381) T ss_pred cCCCCceEEecCCCCcCcEEEEeccc--EEEEEecccEEEeechhHhhcCCeEEEEEEEEcC-E---------------E Confidence 2 36789999999999977644433 111221111 11122224446666665543211 0 0 Q ss_pred hccceEEE--EEe--cCCC Q lcl|Aclame:pro 282 ERIDGIVK--VTL--TPGV 296 (296) Q Consensus 282 E~~dgvv~--~tI--~~~v 296 (296) =..++++. ++| .+|+ T Consensus 355 ~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:10 355 KDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred ecCceEEEEEEEecCCCcC Confidence 11223334 444 2233 No 131 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.00 E-value=3.3e-06 Score=50.61 Aligned_cols=241 Identities=10% Similarity=0.069 Sum_probs=124.1 Q ss_pred cccccccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecceeEEEEeeccc---ccC--HHHHHhhcCCchhHHH Q lcl|Aclame:pro 43 VTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRK---ATT--GEDIQMYGSNEAVTNT 117 (296) Q Consensus 43 Vtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~t~~~tikK~~K---~vT--dEAIqlsGygdav~et 117 (296) ..| ++.-|++.++|.=.-.... ....|++|..+--..+ ....+++|.++-- -|- ||+ |. -.|...|. T Consensus 1 ~vr--~i~~g~s~~~~~iG~~~~~--~~~~G~~l~~~~~~~~-~~e~~itID~~l~~~~~VdDiD~~-qa--~~Dlr~e~ 72 (324) T protein:vir:99 1 MTR--TITSGKSAQFPVMGRTKAR--YLKQGQSLDDGREDIK-HTEKVITIDGLLTTDVLIYDIEDA-MN--HYDVRSEY 72 (324) T ss_pred Cee--eeecCceEEEeeeeeeEec--cccCCCCcCCCcCCcC-cccEEEEecchhhhhhhhhhHHHH-hc--CccchhHH Confidence 344 4566999999875344333 4557888865421111 2235677765433 242 555 43 35699999 Q ss_pred HHHHHHHHHhhhhHHHHHHHhcC----ccc------------------eecchhhHHHHHHHHHHHHHHhhcccc--Ccc Q lcl|Aclame:pro 118 DNALVRQLQKKIRTDFVTALKTG----TGT------------------QDALGAGLQGALASAWGKLQVLFEDYG--SER 173 (296) Q Consensus 118 d~QL~~~iq~kIdnD~~~aLkta----t~t------------------~~~t~~~lQ~Ala~~~~~~~~~Feded--~~~ 173 (296) .+|...++++.+|.-++..+... +.. ...+....-.++..++-++...+++-+ +.. T Consensus 73 s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~g 152 (324) T protein:vir:99 73 STQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGD 152 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCC Confidence 99999999999999988664210 000 000111112244555555556665432 245 Q ss_pred eEEEEcHHHHHHHhcCCccccceeech------hhhhhhheeEEEEeccCCCceEEE--EcccceEEEEecCcchhhhhh Q lcl|Aclame:pro 174 AIVFANSLDVAEYIAKAGITTQTAFGL------TYLVDFTGTVIISTNDVTKGEIWA--TVPENIIFAYINPNNSELAKE 245 (296) Q Consensus 174 ~VlFvNP~Daa~~l~~a~i~~q~~fg~------tyl~nfLG~~II~S~kV~~G~~~~--t~~~Nl~~ay~~~~~g~~~~~ 245 (296) .+++|+|...+-++.+..+... .+++ .-+..++|.+|+.|+.+|.+.... ...++-.. ..++.||.... T Consensus 153 R~~vv~P~~y~~Ll~~~~~~~~-~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~--~~~~~~~~~~~ 229 (324) T protein:vir:99 153 RTFYTDPDTYSAILAALMPNAA-NYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGH--IFPATGDSTTT 229 (324) T ss_pred CEEEeChHHHHHHhhccccccc-ccccccceecceEEEEeceEEEecCCcccccccccccccccccc--ccccccccccc Confidence 7899999998866655433321 1111 122347899999999999753321 11111111 11112222211 Q ss_pred hccccccccceEEEeccc--------------cceeehhhhh-----hHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 246 FNLYGDPTGYIGMNHFQE--------------NTTLTIQTLL-----VSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 246 f~~~td~tGliGv~h~~~--------------~~~~t~et~~-----~~~~~lfpE~~dgvv~~tI~~~v 296 (296) -..-.|.++.+|+.-.++ .+.--++.-+ .+|...+ |+|+++.++..+.+ T Consensus 230 ~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~l--RPe~a~~v~l~~~~ 297 (324) T protein:vir:99 230 GKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGL--RPEAVGAIIFEDGE 297 (324) T ss_pred cccccccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCccc--ccceEEEEEEccCc Confidence 111234555555532111 0011112211 1222222 77888877765543 No 132 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=97.94 E-value=5.2e-06 Score=49.55 Aligned_cols=273 Identities=12% Similarity=0.117 Sum_probs=158.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHH----HHhCcccccccCCCCeeeeeeeeeeecccCcccCCcee Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVI 76 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~----~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~I 76 (296) |-+....--.+---+ ...-=-|.+.|+.-..+=+ ..|+..+..++.-|++.++|.=.-...+ ....|++| T Consensus 1 Ms~~n~~t~p~~~gs----g~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~--y~~pG~~l 74 (400) T protein:vir:10 1 MSTPNNLTNVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQ--VLAPGQSP 74 (400) T ss_pred CCCCccccccccccc----cchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEe--eecCCCCc Confidence 433211111111111 1112247888988776655 4678889999999999999875444333 45567776 Q ss_pred chhheeeeecceeEEEEee--ccc-cc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc---------- Q lcl|Aclame:pro 77 PLSKVERKIHSEKKIELKK--YRK-AT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------- 141 (296) Q Consensus 77 plskv~~~~~~t~~~tikK--~~K-~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat---------- 141 (296) .=+.+.. +-..++|.- |.. -+ =||+ ++=|+..=.|--+|+..++++..|.-++..++-+. T Consensus 75 dg~~~~~---dk~~ItIDtLL~a~~~V~dlDd~--q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~ 149 (400) T protein:vir:10 75 AATSTQA---DKNQLVIDATVIARNTVAHLHDV--QGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTN 149 (400) T ss_pred CCCCccc---CcEEEEeCceeeecchhhhHHHH--hhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 4443322 224456642 222 23 2566 37787556899999999999999998886553220 Q ss_pred --c-------ce---e----cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec----hh Q lcl|Aclame:pro 142 --G-------TQ---D----ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG----LT 201 (296) Q Consensus 142 --~-------t~---~----~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg----~t 201 (296) + .. + .++..|..|+.++..++..+.-.+ .+ +++++|-+.+..|..++--.+..|| +. T Consensus 150 ~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~--~d-~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~ 226 (400) T protein:vir:10 150 PRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDI--SD-VAILMPWRYFNVLRDADRIVDKSYTISQSGA 226 (400) T ss_pred CCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCc--cc-eEEEcCHHHHHHHHhCCcccchhccccCCCc Confidence 0 00 0 133345556666666655444432 34 5777787877666655432344443 12 Q ss_pred hh----hhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHH Q lcl|Aclame:pro 202 YL----VDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGM 277 (296) Q Consensus 202 yl----~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~ 277 (296) |. -.+.|+.|++|+.+|.+ +.|+.-.-+- ..+.+.+|+...|++..+|+.-.+. --++.|+.-+++= T Consensus 227 ~~~g~v~~v~Gv~Iv~Sn~lP~~------a~~~~~~~lS--~a~~G~~y~~t~d~s~~~av~F~~s-Av~tvk~~~lt~~ 297 (400) T protein:vir:10 227 TIQGFVLSSYNCPVIPSNRFPKY------SQGQKHHLLS--NEDNGYRYDPIAEMNGAIAVLFTAD-ALLVGRSIDVIGD 297 (400) T ss_pred cccceEEEEeceEEEeeCcCCcc------cCcccccccc--cCCCCccCCccccccceeEEEEehh-heEEEEeeccccc Confidence 21 13678999999999964 2222222222 2446888999999999999875443 3344554433333 Q ss_pred Hhh------------------hhccceEEEEEe----cCCC Q lcl|Aclame:pro 278 LMY------------------PERIDGIVKVTL----TPGV 296 (296) Q Consensus 278 ~lf------------------pE~~dgvv~~tI----~~~v 296 (296) .|+ |-|.|.+..++- +++| T Consensus 298 ~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~~ 338 (400) T protein:vir:10 298 IFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGAV 338 (400) T ss_pred cccchhhHHHHHHHHHHhCCcccchhheEEEEecCCccccc Confidence 322 556788887776 4455 No 133 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=97.94 E-value=4.5e-06 Score=49.91 Aligned_cols=277 Identities=12% Similarity=0.054 Sum_probs=152.8 Q ss_pred Cccccccccccceehhhhhhhhhhh--hHHHHhhhHHHHH----HHhCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD--f~~~f~~~i~~L~----~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |-.. |..+-.-.+.+-+.+ |.+.|+.-..+=+ ..++..+.-++.-|++.++|.=.-.... ....|+ T Consensus 1 ms~~------n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~--~~~~G~ 72 (364) T protein:vir:10 1 MSNP------NVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQ--VLSPGK 72 (364) T ss_pred CCCc------ccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEe--eeccCc Confidence 3221 222222222211222 6788887776655 4577788888999999999875333331 344566 Q ss_pred eechhheeeeecceeEEEEee--cccc-c--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc--c----- Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKK--YRKA-T--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT--G----- 142 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK--~~K~-v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat--~----- 142 (296) .+.-+.+.. .-.+++|.. |... | =||+ | +=|+..=.|-.+|+..++++..|.-++..++.+- + T Consensus 73 ~ld~~~~~~---~k~~itID~ll~a~~~V~diDe~-q-~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~ 147 (364) T protein:vir:10 73 SPDASPTEF---DKNRLVVDTTVIARNTVAHFHDV-Q-NDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIR 147 (364) T ss_pred ccCCCCccc---CcEEEEecceeeechhhhhHHHH-h-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 655444443 234677754 3333 3 2566 3 6787666888899999999999998876554220 0 Q ss_pred ----------c----ee-----cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc-------ce Q lcl|Aclame:pro 143 ----------T----QD-----ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT-------QT 196 (296) Q Consensus 143 ----------t----~~-----~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~-------q~ 196 (296) . .+ .++..|-.|+.++..+|..+.-.+ ...+.+|+|..-+.+|.+.++-. +. T Consensus 148 ~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~--~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~ 225 (364) T protein:vir:10 148 KNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDT--SELCGLMPWTAFNCLRDADRIVDKSYTIAASD 225 (364) T ss_pred cCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCc--cccEEEeChHHHHHHhcCCccccccccccCCC Confidence 0 00 011222333333333333332222 34899999999999998765321 11 Q ss_pred -eechhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhh--- Q lcl|Aclame:pro 197 -AFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTL--- 272 (296) Q Consensus 197 -~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~--- 272 (296) ..++.-+ .+.|+.|+.|+.+|.+--..++-.++--.-+.+ ..-++.|+...|++...|+.-.+ .-=+|.|.. T Consensus 226 ~~~~G~v~-~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~--~~~g~~y~v~~d~~~~~~~~f~~-~Al~tv~~~~~t 301 (364) T protein:vir:10 226 NTVDGFVL-KSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSN--AGNGNRYDVTAGQTSAQAVLFTQ-DALLVGRTISIT 301 (364) T ss_pred ccccceeE-EEeceEEEecccccccccccccccccccccccc--ccCCcccccccccceeEEEEEec-ceEEEEEEecce Confidence 1122222 367899999999998655554544444433332 23477888888888777776433 111122221 Q ss_pred ---------------hhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 273 ---------------LVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 273 ---------------~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +..++=-=+=|+|++++++-..+- T Consensus 302 ~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 302 GDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred eeeeeccceeeeeeeeehcccCcccCccceEEEEecCCC Confidence 111111123355777776654443 No 134 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.90 E-value=2.6e-06 Score=51.21 Aligned_cols=269 Identities=13% Similarity=0.117 Sum_probs=134.3 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccc---cccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRK---ISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~---~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) |- +.+.|+.+. .+ .+=.....|++++- +.. -|.|. ..-..|+||++|..... .+.+|..|. T Consensus 1 m~----~~~N~~ltp-~i---ia~~~l~~l~~~lV-~~~--lv~r~y~~e~~~~GDTV~I~vp~~~-----~v~dg~~~~ 64 (418) T protein:vir:10 1 MA----VQDNNLLTD-DV---IAKEALRLLKNNLV-MAK--CVYRNYEKTFGKVGDTIRLKLPYRV-----KSASGRTLV 64 (418) T ss_pred CC----ccccccccH-HH---HHHHHHHHHHHhcc-chh--hhcCCCchHHhhCCCEEEEeeCCce-----eecccCCcc Confidence 43 334455443 22 23334445555443 111 13442 22356999999986332 455677888 Q ss_pred hhheeeeecceeEEEEeeccc---ccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecchhhHHHH Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRK---ATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGA 154 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K---~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t~~~lQ~A 154 (296) .+.+... ..+++|.|+.- .++|+.- .....+-..+.-++-..+|+++||.|++..++.+.......+... . T Consensus 65 ~~~~te~---~v~l~id~~k~~~~~itD~e~-a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~--~ 138 (418) T protein:vir:10 65 KQPMVDQ---TIPFKIAYQEHVGLEYTVKDK-TLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRP--G 138 (418) T ss_pred ccccccc---eEEEEEecccccceeechHHH-hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCc--c Confidence 7777643 46788876543 2477663 455667778888999999999999999988877654322111111 1 Q ss_pred HHHHHHHHHHhhccccC--c-ceEEEEcHHHHHHHhcCCccccce-----ee-chhhhhhhheeEEEEeccCCCceEEEE Q lcl|Aclame:pro 155 LASAWGKLQVLFEDYGS--E-RAIVFANSLDVAEYIAKAGITTQT-----AF-GLTYLVDFTGTVIISTNDVTKGEIWAT 225 (296) Q Consensus 155 la~~~~~~~~~Feded~--~-~~VlFvNP~Daa~~l~~a~i~~q~-----~f-g~tyl~nfLG~~II~S~kV~~G~~~~t 225 (296) -+..+.++..++++.+= . +-+++++|...+.++++.....+. ++ .+ .+.++.|.+|++|+.||..+... T Consensus 139 ~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G-~IG~i~GF~V~~S~nip~~tag~- 216 (418) T protein:vir:10 139 AFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMG-YRGNVAAYEVYESQNLPKHTVGD- 216 (418) T ss_pred hHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhhee-eeeeeeceEEEEecCCCcccccc- Confidence 14456666677776531 2 368999999888777654322111 11 11 23468899999999999644332 Q ss_pred cccceEEEEecCcchhhhhhhcc-------c-cccccceEEEeccccceeehhhh-----------------hhHHHHhh Q lcl|Aclame:pro 226 VPENIIFAYINPNNSELAKEFNL-------Y-GDPTGYIGMNHFQENTTLTIQTL-----------------LVSGMLMY 280 (296) Q Consensus 226 ~~~Nl~~ay~~~~~g~~~~~f~~-------~-td~tGliGv~h~~~~~~~t~et~-----------------~~~~~~lf 280 (296) ...+...+-+...+.-++-..+. - -|--.|=||. .-+..|+|.. .-..++++ T Consensus 217 ~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~---~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~ 293 (418) T protein:vir:10 217 HGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVF---GVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKIS 293 (418) T ss_pred cccceeeecccccceeEEEeecceeeccceeeccEEEECcee---ecccccccccccceEEEEEeeccccccCcceeEec Confidence 22211222222211112111110 0 0111222211 1111111111 11124444 Q ss_pred hhccceEEEEEecCC--C Q lcl|Aclame:pro 281 PERIDGIVKVTLTPG--V 296 (296) Q Consensus 281 pE~~dgvv~~tI~~~--v 296 (296) |=+.++.+-..=..+ | T Consensus 294 p~~~~~~~~~~~~~~~~~ 311 (418) T protein:vir:10 294 PSLNDGTATINNENGDPV 311 (418) T ss_pred cccccccccccccccccc Confidence 444333221100000 0 No 135 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.89 E-value=1.5e-06 Score=52.46 Aligned_cols=255 Identities=12% Similarity=-0.040 Sum_probs=145.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) .++.++.+ +-++..--+|.+++-+.+.+.--++.+-+.+++. | .+++|+-.-.+.|. -+.|+++++ -+ T Consensus 78 ~~~~~~~~--------~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~~~~~~~~~~a~-w~~e~~~~~~~~ 146 (377) T protein:vir:98 78 IDKNVGGK--------DKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAV-WGDIFGEIKGQL 146 (377) T ss_pred HHhccCCC--------CCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-c-ceEEEEecCCccee-EeecccccCccc Confidence 22333333 3333344567788888887777788888888874 4 46888876666664 688988886 23 Q ss_pred heeeeecceeEEEEeecccc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------Hhc---Ccc--- Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA---------LKT---GTG--- 142 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~a---------Lkt---at~--- 142 (296) +.+. ...++..+|+..- +|-|-++.+++ +-...-.++|+.+++..++..|+.= |+. ++. T Consensus 147 ~~~f---~~i~l~~~kl~a~~~is~elL~ds~~-~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~ 222 (377) T protein:vir:98 147 KQAF---KEQDFSQFKLTAFVVIPKDALKFGPK-WIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQS 222 (377) T ss_pred Cccc---eeEeecceeEEeeecccHHhhhccHh-HHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccc Confidence 3322 2467778887764 48888864554 4667788999999999999888751 110 000 Q ss_pred --ceecch----hhH---HHHHHH------HH------HHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc-ceeech Q lcl|Aclame:pro 143 --TQDALG----AGL---QGALAS------AW------GKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT-QTAFGL 200 (296) Q Consensus 143 --t~~~t~----~~l---Q~Ala~------~~------~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~-q~~fg~ 200 (296) ....+. +.+ ..++.. +| .....++.| ..-++++++||.|.+.+........ ++.+. T Consensus 223 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd-~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~- 300 (377) T protein:vir:98 223 TGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLK-IAGQVKLILNPEDRWALEAQFTSRNQFGEYV- 300 (377) T ss_pred cccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhc-cCCceEEEecccchhhccccccccCCCCccc- Confidence 000000 011 100000 11 111233444 2357899999999988776554332 33333 Q ss_pred hhhhhhhe--eEEEEeccCCCceEEEEcccceEEEEecCcch-hh--hhhhccccccccceEEEeccccceeehhhhhhH Q lcl|Aclame:pro 201 TYLVDFTG--TVIISTNDVTKGEIWATVPENIIFAYINPNNS-EL--AKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVS 275 (296) Q Consensus 201 tyl~nfLG--~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g-~~--~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~ 275 (296) ..|| ..|+.|..+|+|++++-...+ |.+-.++| ++ +...-+..|+++|.+..+.-- T Consensus 301 ----t~lg~p~~vv~s~~~p~~~i~fgdf~~---Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg------------ 361 (377) T protein:vir:98 301 ----TVLPHGITILESLAVETGKAIAFVANR---YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYG------------ 361 (377) T ss_pred ----cccCCCceEEecCCCCcccEEEEEecc---eeEEeecceEEEeechhhhhcCceEEEEEEEEcC------------ Confidence 2454 678999999999987655444 22222211 11 122234457788777644211 Q ss_pred HHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 276 GMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 276 ~~~lfpE~~dgvv~~tI~~~ 295 (296) - |=..+++++.+|+.+ T Consensus 362 -~---~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 362 -K---AKDNHTAALLTLAGG 377 (377) T ss_pred -E---EeccCcEEEEEEecC Confidence 1 112235777778777 No 136 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=97.84 E-value=1.4e-05 Score=47.26 Aligned_cols=282 Identities=15% Similarity=0.052 Sum_probs=140.3 Q ss_pred Cc---cccccccccceehhhhhhhhh--hhhHHHHhhhHHHHHH----HhCcccccccCCCCeeeeeeeeeeecccCccc Q lcl|Aclame:pro 1 MV---TSRTYPEENLIKSTDLKYPIT--IDVTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEGNVP 71 (296) Q Consensus 1 ~~---~~~~~ae~nl~~~~dl~~a~s--iDf~~~f~~~i~~L~~----~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVa 71 (296) |- -||. .-.|+-+-+-.+.+-+ ==|.+.|+..+.+=++ .++..|..++.-|+++++|.=.-.... +.. T Consensus 1 ~~~~~~~~~-~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~--~~t 77 (375) T protein:vir:10 1 MANANQVAL-GRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSS--FHT 77 (375) T ss_pred Ccccccccc-CccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEe--eec Confidence 21 1332 2233333222211111 1177888887766555 467777778889999999775444333 456 Q ss_pred CCceechhheeeeecceeEEEEeeccc---ccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc---- Q lcl|Aclame:pro 72 EGEVIPLSKVERKIHSEKKIELKKYRK---ATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG---- 142 (296) Q Consensus 72 EGe~Iplskv~~~~~~t~~~tikK~~K---~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~---- 142 (296) .|++|.-+...-.+...++++|.+..- .|- ||+ | .-.|...|..+|...++++.+|.-++..|..+.. T Consensus 78 ~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~a-q--a~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p 154 (375) T protein:vir:10 78 PGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDET-L--AHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASP 154 (375) T ss_pred CCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHH-h--cCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 788886665443333446788866433 343 455 4 3446999999999999999999999977743210 Q ss_pred -------------------cee---cchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCC--ccccce Q lcl|Aclame:pro 143 -------------------TQD---ALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKA--GITTQT 196 (296) Q Consensus 143 -------------------t~~---~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a--~i~~q~ 196 (296) +.. .++..+ +.++-++...+.+.+ +...+++|+|..-+-+|.+. +...+. T Consensus 155 ~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~----~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~ 230 (375) T protein:vir:10 155 VSATNFVEPGGTQIRVGSGTNESDAFTASAL----VNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNR 230 (375) T ss_pred cccccccccCcceeeeccccccccccCHHHH----HHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeee Confidence 000 123333 444444555554432 23568889999988887652 212222 Q ss_pred eech------hhhhhhheeEEEEeccCCCceE---------EEEcccceEEEEecCcchhhhhhh--c-------ccccc Q lcl|Aclame:pro 197 AFGL------TYLVDFTGTVIISTNDVTKGEI---------WATVPENIIFAYINPNNSELAKEF--N-------LYGDP 252 (296) Q Consensus 197 ~fg~------tyl~nfLG~~II~S~kV~~G~~---------~~t~~~Nl~~ay~~~~~g~~~~~f--~-------~~td~ 252 (296) .|++ .-+..+.|++|+.|+.+|.... -.+.|+++.-.+.++-++.++.+. + .-+.- T Consensus 231 d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~ 310 (375) T protein:vir:10 231 DVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKS 310 (375) T ss_pred cccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCce Confidence 2322 1122377899999999996432 234455555555544333222221 1 11222 Q ss_pred ccceEEEeccccceeehhhhhhHHHH-----hhhhccceEEE--------------EEecCCC Q lcl|Aclame:pro 253 TGYIGMNHFQENTTLTIQTLLVSGML-----MYPERIDGIVK--------------VTLTPGV 296 (296) Q Consensus 253 tGliGv~h~~~~~~~t~et~~~~~~~-----lfpE~~dgvv~--------------~tI~~~v 296 (296) .|++. +.+. -.+.|++-+..=. -==+--|.|+- +.|..+. T Consensus 311 ~~~~~-~~~A---~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~ 369 (375) T protein:vir:10 311 CGLIF-QKEA---AGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGA 369 (375) T ss_pred EEEEE-chhh---eeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCc Confidence 23332 0000 0111111110000 00000011110 0001111 No 137 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.82 E-value=4.3e-06 Score=49.98 Aligned_cols=269 Identities=16% Similarity=0.150 Sum_probs=118.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCccccccc--CCCCeeeeeeeeeeecccCcccCCceech Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISV--SEGMTLKTYAGYDVTLAEGNVPEGEVIPL 78 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~--~pG~tIt~pk~~yig~A~gdVaEGe~Ipl 78 (296) ..|+.+...-+++..+.+. -+|++..... ..+..||......+ .|| .+++|+.+--..+ .=|+||+.+|. T Consensus 337 ~~~~~~~~~Gg~~vp~~~~----~~ii~~l~~~--svv~~l~~~~~~~~~~~~~-~~~ip~~t~~~~a-~wv~Eg~~~~~ 408 (645) T protein:vir:93 337 GTTTDPQWAGSLSEYQEYA----QDFIDYLRPQ--TIIGRFGQGGIPALRQVPF-NIRVHAQVSGGAA-GWVGEGKTKPL 408 (645) T ss_pred cccccccccCCccCchhhH----HHHHHhhhhh--hhHHhhccccccccccccC-ceeeeeeecCcce-EEeccCccccc Confidence 1111112223444444331 2233222111 11223332211111 122 5788987544444 36999999999 Q ss_pred hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc-c---c-eecchhhH Q lcl|Aclame:pro 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT-G---T-QDALGAGL 151 (296) Q Consensus 79 skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat-~---t-~~~t~~~l 151 (296) ++.+.. ..+++.||.+--+ |.|-++.++.+ .-..-.++|+.+|++++|..|+.--.++. . + ......+. T Consensus 409 s~~~f~---~v~l~~~kla~~~~iS~ell~ds~~~-~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~ 484 (645) T protein:vir:93 409 TKFDFE---SITFSHAKVSAIAVLTEELIRFSSPA-ADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGT 484 (645) T ss_pred ccccee---EEEEeeEEEEEeehhHHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceecccccc Confidence 998874 4788888888765 99998766544 34566799999999999999985332220 0 0 00000000 Q ss_pred H--HHHHHHHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCccccceeech-hh-hhhhheeEEEEeccCCCceEEEE Q lcl|Aclame:pro 152 Q--GALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITTQTAFGL-TY-LVDFTGTVIISTNDVTKGEIWAT 225 (296) Q Consensus 152 Q--~Ala~~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~-ty-l~nfLG~~II~S~kV~~G~~~~t 225 (296) + +-....+.++...+... .....+.++||.....+++-.+=.-+..|.. .. ...++|..|+.|+.||.+-++.. T Consensus 485 ~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~~~~~gd 564 (645) T protein:vir:93 485 ASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMTLLGGSFQGLPVIVSQYVGDQLVLVN 564 (645) T ss_pred ccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCCCCCceeeceeeEEeccCCcceeEec Confidence 0 00111122333334332 1224578899999887765332111233311 00 01377999999999986533322 Q ss_pred cccceEEEEecCcchhhhhhhccccccccceEEEeccccc----------------eeehhhhh-hHHHHhhhhccceEE Q lcl|Aclame:pro 226 VPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENT----------------TLTIQTLL-VSGMLMYPERIDGIV 288 (296) Q Consensus 226 ~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~----------------~~t~et~~-~~~~~lfpE~~dgvv 288 (296) .. . +++.+. +++. +.+..+.+ +=+.+.+... -..+-... +.....-|| +|+ T Consensus 565 ~s-~--~~ig~~--~~v~--i~~s~~a~--~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~---a~~ 632 (645) T protein:vir:93 565 AP-D--IYLADD--GGVA--VDMSREAS--LEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTA---AVA 632 (645) T ss_pred cc-c--EEEEEe--cceE--EEeeccee--EEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCcc---ceE Confidence 22 1 112111 1111 00000000 0000000000 00000000 011112222 233 Q ss_pred EEEecCCC Q lcl|Aclame:pro 289 KVTLTPGV 296 (296) Q Consensus 289 ~~tI~~~v 296 (296) ++| +| T Consensus 633 ~lt---~~ 637 (645) T protein:vir:93 633 VIT---GV 637 (645) T ss_pred EEe---cc Confidence 222 23 No 138 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=97.81 E-value=4.4e-06 Score=49.96 Aligned_cols=266 Identities=16% Similarity=0.132 Sum_probs=139.0 Q ss_pred cccccc-------ccccceehhhhh--hhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccC Q lcl|Aclame:pro 2 VTSRTY-------PEENLIKSTDLK--YPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) Q Consensus 2 ~~~~~~-------ae~nl~~~~dl~--~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaE 72 (296) +..|++ ++.|.+...|.. ....-++..+|-+.+.+--.+|+.-|.+|+... +-++|+|.+-+.+.....| T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~-~~~i~~~~~~~~~~~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAK-KTRIPTLNIGERHRRPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCc-ceeeeeeccCCcccccccc Confidence 222222 333444434332 233447788888888888888999999998654 4678888765444322224 Q ss_pred Cc-eechhheeeeecceeEEEEeecccc--cCHHHHHhhcCC-chhHHHHHHHHHHHHhhhhHHHHHHHhcCccc----- Q lcl|Aclame:pro 73 GE-VIPLSKVERKIHSEKKIELKKYRKA--TTGEDIQMYGSN-EAVTNTDNALVRQLQKKIRTDFVTALKTGTGT----- 143 (296) Q Consensus 73 Ge-~Iplskv~~~~~~t~~~tikK~~K~--vTdEAIqlsGyg-dav~etd~QL~~~iq~kIdnD~~~aLktat~t----- 143 (296) |. ..+-++.+. ...++..||..-- +|.|-++.+-.+ +--+.-.++++.+++..+..-+| .|++. T Consensus 80 ~~~~~~~~~~~~---~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~----nGd~~~~~~~ 152 (321) T protein:vir:31 80 GEWNENESDVST---GTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAA----NGDEDAEDSF 152 (321) T ss_pred ccccccccccee---eeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhhee----eccccCCCcc Confidence 42 444555443 2356777887765 489988655332 33333444444444444444333 22211 Q ss_pred -----------------eecchhhH-HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhc---CC--ccccceeech Q lcl|Aclame:pro 144 -----------------QDALGAGL-QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIA---KA--GITTQTAFGL 200 (296) Q Consensus 144 -----------------~~~t~~~l-Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~---~a--~i~~q~~fg~ 200 (296) .+..+..+ -..+.+.+..+...|.+ ....|.+||+....+|+. +. .+......++ T Consensus 153 ~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~--~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~ 230 (321) T protein:vir:31 153 ENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRA--RMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGE 230 (321) T ss_pred cccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhc--CCCeEEEechHHHHHHHHHHhcCCCccccchhhcc Confidence 11111111 11233333333333432 346899999998876653 22 2322333333 Q ss_pred hhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhh----hhHH Q lcl|Aclame:pro 201 TYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTL----LVSG 276 (296) Q Consensus 201 tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~----~~~~ 276 (296) ... .++|..|+.+..+|++.+++|...||.+++-- + . .+.+..+....+.+.. .+++ T Consensus 231 ~~~-tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~-~-~----------------~~~~~~~~~~~~~~~~~~~~~~~~ 291 (321) T protein:vir:31 231 ADV-NPFSFPIIGSGLWPDDKAMFTDPQNLIYALYR-D-L----------------EIDVLTESDKVSERDLHARYFMRG 291 (321) T ss_pred ccc-cccceeEEEcCCCCCCcEEEeccccEEEEEee-c-c----------------EEEEeecCccccccceeeEeeeee Confidence 322 37899999999999999999999999754322 1 1 1111111111111100 0011 Q ss_pred H-HhhhhccceEEEEE-ecCCC Q lcl|Aclame:pro 277 M-LMYPERIDGIVKVT-LTPGV 296 (296) Q Consensus 277 ~-~lfpE~~dgvv~~t-I~~~v 296 (296) . =+--|..+.++-++ |..|+ T Consensus 292 ~~~~~ve~~~a~a~~~~i~~~~ 313 (321) T protein:vir:31 292 DDDFAIENTEAVVLAEGLGDPL 313 (321) T ss_pred ecceeEeccccEEEEecCCcch Confidence 0 11125566666666 66666 No 139 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=97.72 E-value=1.6e-05 Score=46.87 Aligned_cols=271 Identities=12% Similarity=0.084 Sum_probs=151.2 Q ss_pred Cccccccccccceehhhhhhhhhhh--hHHHHhhhHHHHH----HHhCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD--f~~~f~~~i~~L~----~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |-.. |..+-.-.+.+-+.+ |.+.|+.-..+=+ ..++..+.-++.-|++.++|.=.-.... ....|+ T Consensus 1 Ms~~------n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~--y~~~G~ 72 (402) T protein:vir:97 1 MSTP------NTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQ--VLAPGQ 72 (402) T ss_pred CCCc------ccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEe--eecccc Confidence 3221 222222222211222 6788887776655 4677788888999999999875333332 333566 Q ss_pred eechhheeeeecceeEEEEee--cccc-c--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc-------- Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKK--YRKA-T--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT-------- 141 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK--~~K~-v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat-------- 141 (296) .+.-+.+... -.+++|.. |... | =||+ | +=|+..=.|-.+|+..++++..|.-++..++.+. T Consensus 73 ~ldg~~~~~~---k~~ItID~lL~a~~~V~diDea-q-~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~ 147 (402) T protein:vir:97 73 SPNATPTQAD---KNQLVIDTTVIARNTVAHIHDV-Q-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) T ss_pred ccCCCCcccc---cEEEEeCceeechhhhhhHHHH-H-hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 5544443322 24566653 3333 3 2566 3 7788677889999999999999998876553210 Q ss_pred -----------ccee-------cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc-------ce Q lcl|Aclame:pro 142 -----------GTQD-------ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT-------QT 196 (296) Q Consensus 142 -----------~t~~-------~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~-------q~ 196 (296) ..+. .++..|-.|+.++..+|..+.-.+ ...+++++|...+-+|.+.++-. +. T Consensus 148 ~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~--~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g 225 (402) T protein:vir:97 148 NKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI--SDVAIMMPWKFFNALRDADRIVDKTYTISQSG 225 (402) T ss_pred ccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCc--cccEEEeChHHHHHHhhcccccchhhccccCC Confidence 0111 122333344444444444433333 33799999999999998766421 11 Q ss_pred ee-chhhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhh--- Q lcl|Aclame:pro 197 AF-GLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTL--- 272 (296) Q Consensus 197 ~f-g~tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~--- 272 (296) .+ ++.. ..+.|++|+.|+.+|.+- .|+--. ..+..+.+.+|+.+.|++..+|+.-.+ .-=.|.|+. T Consensus 226 ~~~~G~v-~~v~Gv~Vv~SnnlP~~a------~~it~~--~ls~a~~G~~y~~t~d~t~~~~~~f~~-~Av~tvk~~~vT 295 (402) T protein:vir:97 226 ATINGFV-LSSYNCPVIPSNRFPTFA------QDQAHH--LLSNEDNGYRYDPIAEMNGAVAVLFTS-DALLVGRTIEVT 295 (402) T ss_pred cccccee-EEEeceEEEecCcccccc------cccccc--ccccCCCCccCCcCcccceeEEEEEec-ceEEEEEeeccc Confidence 12 2222 237789999999999642 111100 111233477888778888888877433 222233332 Q ss_pred ------------hhHHHHhh---hhccceEEEEEecC----CC Q lcl|Aclame:pro 273 ------------LVSGMLMY---PERIDGIVKVTLTP----GV 296 (296) Q Consensus 273 ------------~~~~~~lf---pE~~dgvv~~tI~~----~v 296 (296) +|-+...| |=|+|.+..++... ++ T Consensus 296 ~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~ 338 (402) T protein:vir:97 296 GDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) T ss_pred cchhhchhHHHHHHHHHHHhCCcccCccceEEEEEeccccccc Confidence 22222222 55677777776533 33 No 140 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=97.67 E-value=4.4e-06 Score=49.94 Aligned_cols=257 Identities=12% Similarity=0.007 Sum_probs=135.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec-hh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP-LS 79 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip-ls 79 (296) +.++ +.+.- +...--.|.+++-+.+.+.--++.+-+.+++. ...++|+-.-.+.|. -++|+++++ -+ T Consensus 76 ~~~~-t~~~G--------g~lvP~~~~~~I~~~l~~~spir~~a~v~~~~--~~~~i~~~~~~~~a~-W~~e~~~~~~~~ 143 (381) T protein:vir:10 76 INKS-VGYKE--------EKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAV-WGKIYGEIKGQL 143 (381) T ss_pred Hhhc-CCCCC--------ceecCHHHHHHHHHHHHhhcceeeeeeeEecC--cceEEEeecCCcceE-Eeeccccccccc Confidence 2222 22222 22223346677766676666677777888773 355778766556664 577877776 33 Q ss_pred heeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----e-------- Q lcl|Aclame:pro 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----D-------- 145 (296) Q Consensus 80 kv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----~-------- 145 (296) ..+. ...+++.+|+..-+ |.|-++.+.+ +--+.-.++|+.+++.+++.-|+.= ++++.- + T Consensus 144 ~~~f---~~i~l~~~kl~a~i~is~elL~Ds~~-~le~~i~~~la~~~a~~~~~afi~G--dG~~qP~Gil~~~~~~~~~ 217 (381) T protein:vir:10 144 DAAF---SEETAIQNKLTAFVVLPKDLNDFGPA-WIERFVRVQIEEAFAVALETAFLKG--TGKDQPIGLNRQVQKGVSV 217 (381) T ss_pred Cccc---eeEeecceeEEeeccccHHHHhccHH-HHHHHHHHHHHHHHHHHhhceeEec--ccCCCceeeeecCCccccc Confidence 3332 24678889988754 8888864444 3556788899999999999877632 111100 0 Q ss_pred ----------------cchhhHHHHHHHHHHHHHHhhcc---ccCcceEEEEcHHHHHHHhcCCccccceeechhhhhh- Q lcl|Aclame:pro 146 ----------------ALGAGLQGALASAWGKLQVLFED---YGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVD- 205 (296) Q Consensus 146 ----------------~t~~~lQ~Ala~~~~~~~~~Fed---ed~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~n- 205 (296) .+...+-..++.-+..+.....- ......+..+||.+.+++++......+ .|.|+.. T Consensus 218 ~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~---~G~~v~~l 294 (381) T protein:vir:10 218 TDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA---NGVYVTAL 294 (381) T ss_pred cccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCC---CCceeecC Confidence 00001111111111111110000 011345778999999998876654432 1233321 Q ss_pred hheeEEEEeccCCCceEEEEcccceEEEEecCcchh--hhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhc Q lcl|Aclame:pro 206 FTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSE--LAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPER 283 (296) Q Consensus 206 fLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~--~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~ 283 (296) .+|..|+.|..+|+|++++-.-.+ +...+..+-. .+....+..|+++|.+..+.- |. |=. T Consensus 295 p~g~~vv~~~~~p~~~i~fGDfs~--Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~d-------------G~---~~~ 356 (381) T protein:vir:10 295 PFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKETLALDDMDLYTAKQFAY-------------GK---AKD 356 (381) T ss_pred CCCceeEEcCCCCcCcEEEEEccc--EEEEEecccEEEeechhhhhcCceEEEEEEEEc-------------CE---Eec Confidence 247789999999999987644433 2222221111 112223444666666654321 11 112 Q ss_pred cceEEEEEec-----CCC Q lcl|Aclame:pro 284 IDGIVKVTLT-----PGV 296 (296) Q Consensus 284 ~dgvv~~tI~-----~~v 296 (296) .++++..+|+ |+| T Consensus 357 ~~A~~v~~l~~~~~~~~~ 374 (381) T protein:vir:10 357 NKVAAVWKLDLKGHKPAL 374 (381) T ss_pred CCcEEEEEEeecCCcccc Confidence 2345555554 445 No 141 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.67 E-value=5.4e-06 Score=49.45 Aligned_cols=265 Identities=11% Similarity=0.008 Sum_probs=134.7 Q ss_pred Cccccc----cccc-------cceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCc Q lcl|Aclame:pro 1 MVTSRT----YPEE-------NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGN 69 (296) Q Consensus 1 ~~~~~~----~ae~-------nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gd 69 (296) +..+|- +.|+ +-.+..+=++..--++.+++-+.+.+.--++..-+..|+. | +..+|+....+.+. - T Consensus 66 ~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a~-w 142 (395) T protein:vir:95 66 ILAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAG-I-KTRVIKADPAGQAV-W 142 (395) T ss_pred HHhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCCcceE-E Confidence 000000 0000 0001111122233456777777777777778888888873 4 56888887777775 6 Q ss_pred ccCCceec-hhheeeeecceeEEEEeecccc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC------ Q lcl|Aclame:pro 70 VPEGEVIP-LSKVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG------ 140 (296) Q Consensus 70 VaEGe~Ip-lskv~~~~~~t~~~tikK~~K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta------ 140 (296) +.|+++++ .+..+.. ..++..||+..- +|.|-++.++++ -...-.++|+.+++.+++..|+.==.++ T Consensus 143 ~~e~~~~~~~~~~~f~---~i~l~~~kl~~~~~iS~ell~ds~~~-ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~G 218 (395) T protein:vir:95 143 GKVFGEIKGQLDAAFR---EENFTQYKLTCFVVLPDDLSTFGPAW-IERFVRTQIQEAISVALESAIINGGGAAKTQPVG 218 (395) T ss_pred eecccccCccccccce---eeeeceeeEEEeecccHHHHhcchhH-HHHHHHHHHHHHHHHHHhhheeeccCCCCcCcee Confidence 66766774 4454432 477888998874 489888645443 4567888999999999998777321110 Q ss_pred --------cccee--cchhhH-HHHHHHHHHHHHHhh-------ccc---cCcceEEEEcHHHHHHHhcCCccccceeec Q lcl|Aclame:pro 141 --------TGTQD--ALGAGL-QGALASAWGKLQVLF-------EDY---GSERAIVFANSLDVAEYIAKAGITTQTAFG 199 (296) Q Consensus 141 --------t~t~~--~t~~~l-Q~Ala~~~~~~~~~F-------ede---d~~~~VlFvNP~Daa~~l~~a~i~~q~~fg 199 (296) +.... .....+ -..+...+..+.+.+ +.. -......++||.+.++..+......+ . T Consensus 219 il~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~~---~ 295 (395) T protein:vir:95 219 LMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLTA---N 295 (395) T ss_pred eeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceeccC---C Confidence 00000 000000 000111122222211 111 11245678999998887765433222 2 Q ss_pred hhhhhhhh--eeEEEEeccCCCceEEEEcccceEEEEecCcchhh----hhhhccccccccceEEEeccccceeehhhhh Q lcl|Aclame:pro 200 LTYLVDFT--GTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL----AKEFNLYGDPTGYIGMNHFQENTTLTIQTLL 273 (296) Q Consensus 200 ~tyl~nfL--G~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~----~~~f~~~td~tGliGv~h~~~~~~~t~et~~ 273 (296) |.+.. .| |..|+.|..+|+|++++-.-.+ |.+-.+ +++ +...-+..|+++|.+..+- + T Consensus 296 G~~~~-~lg~g~~v~~~~~~p~~~i~fgdfs~---y~i~~r-~~~~i~~~~~~~~~~d~~~f~~~~r~-d---------- 359 (395) T protein:vir:95 296 GGFVT-VLPYNVTIITSEFVPEGKLVAFVTDR---YNAVRG-GGLTVKKFDQTLALEDAVLFTAKTFA-Y---------- 359 (395) T ss_pred Cccee-ccCCcceEEEcCCCCCCcEEEEeccc---EEEEEe-cceEEEeccchhhhCCcEEEEEEEEE-C---------- Confidence 23332 44 5679999999999987543332 222222 111 2222233466666665432 1 Q ss_pred hHHHHhhhhccceEEEEEec---CCC Q lcl|Aclame:pro 274 VSGMLMYPERIDGIVKVTLT---PGV 296 (296) Q Consensus 274 ~~~~~lfpE~~dgvv~~tI~---~~v 296 (296) |..+-| ++++..+|+ +|+ T Consensus 360 --g~~~~~---~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 360 --GQPDDN---KASAVYDLKVASAPR 380 (395) T ss_pred --CEEecc---ccEEEEEeeccCCCC Confidence 122222 223333443 333 No 142 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.65 E-value=1.4e-05 Score=47.17 Aligned_cols=275 Identities=14% Similarity=0.059 Sum_probs=145.3 Q ss_pred Ccccccc----ccccceehhhhh-hhhhhhhHHHHhhhHHHHHHHh------CcccccccCCCCeeeeeeeeeeecccCc Q lcl|Aclame:pro 1 MVTSRTY----PEENLIKSTDLK-YPITIDVTNKFQENISKLLEML------GVTRKISVSEGMTLKTYAGYDVTLAEGN 69 (296) Q Consensus 1 ~~~~~~~----ae~nl~~~~dl~-~a~siDf~~~f~~~i~~L~~~L------gVtr~~~~~pG~tIt~pk~~yig~A~gd 69 (296) |--.--- -.-||..-..-. +.-+|-..++|+.-|++++..+ .+.|......|.+|+||++...|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~D-- 78 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD-- 78 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeeccccccc-- Confidence 2111000 011222111111 1345667889999999887754 45667888899999999998877664 Q ss_pred ccCCceechhheeeeecceeEEEEeeccc-ccCHHHHHhhcCC-chhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 70 VPEGEVIPLSKVERKIHSEKKIELKKYRK-ATTGEDIQMYGSN-EAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 70 VaEGe~Iplskv~~~~~~t~~~tikK~~K-~vTdEAIqlsGyg-dav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) -.-+.-+....++.+. .+++++-+||+. .+-+=.+..+.+. .+-+-..++....++-.||...|+.|.+...+.... T Consensus 79 Y~R~~g~~~g~vt~~~-~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~ 157 (319) T protein:vir:94 79 YKRNATNEFDHPKIEE-TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTV 157 (319) T ss_pred ccCCCCcccCCcccce-eEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccccccc Confidence 3333345555665542 234444444443 2322222223333 344456678888899999999999886543322111 Q ss_pred hhhHHHHHHHHHHHHHHhhcccc-CcceEEEEcHHHHHHHhcCCccc------cceeechhhhhhhheeEEEEec--cCC Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYG-SERAIVFANSLDVAEYIAKAGIT------TQTAFGLTYLVDFTGTVIISTN--DVT 218 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded-~~~~VlFvNP~Daa~~l~~a~i~------~q~~fg~tyl~nfLG~~II~S~--kV~ 218 (296) +.+ ...++..+.++..++.+.+ ....|+||+|.=..-++.+..+. .+..+.+.- ..+.|++|+.++ ..+ T Consensus 158 ~~t-~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~V-g~idG~~Vi~vps~~~k 235 (319) T protein:vir:94 158 GTG-SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ-GELDGFVIVKVPTKLLQ 235 (319) T ss_pred ccC-HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeec-eeecCeEEEEecccccc Confidence 111 2334556666667776532 23579999997666554443332 222222222 247788988753 333 Q ss_pred CceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 219 KGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 219 ~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .-..++..+..+... .-.. -.++|...-+.-| =-++-....+.+.+....+||+...=++|. T Consensus 236 ~in~i~~h~~A~~~~-~k~~---~~~~~~p~~~~~a------------~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:94 236 GLQAIAVVGEVLASP-IQAD---LAKTNSNIPGMFG------------TLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) T ss_pred cceEEEEcCCeeeee-eeee---eeeccCCCccccc------------eeeeeeeeeeeEEeccccceEEEeecCCcc Confidence 444555555554321 0000 0111211111111 123345567777787788999865445555 No 143 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.65 E-value=1.4e-05 Score=47.17 Aligned_cols=275 Identities=14% Similarity=0.059 Sum_probs=145.3 Q ss_pred Ccccccc----ccccceehhhhh-hhhhhhhHHHHhhhHHHHHHHh------CcccccccCCCCeeeeeeeeeeecccCc Q lcl|Aclame:pro 1 MVTSRTY----PEENLIKSTDLK-YPITIDVTNKFQENISKLLEML------GVTRKISVSEGMTLKTYAGYDVTLAEGN 69 (296) Q Consensus 1 ~~~~~~~----ae~nl~~~~dl~-~a~siDf~~~f~~~i~~L~~~L------gVtr~~~~~pG~tIt~pk~~yig~A~gd 69 (296) |--.--- -.-||..-..-. +.-+|-..++|+.-|++++..+ .+.|......|.+|+||++...|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~D-- 78 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD-- 78 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeeccccccc-- Confidence 2111000 011222111111 1345667889999999887754 45667888899999999998877664 Q ss_pred ccCCceechhheeeeecceeEEEEeeccc-ccCHHHHHhhcCC-chhHHHHHHHHHHHHhhhhHHHHHHHhcCccceecc Q lcl|Aclame:pro 70 VPEGEVIPLSKVERKIHSEKKIELKKYRK-ATTGEDIQMYGSN-EAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDAL 147 (296) Q Consensus 70 VaEGe~Iplskv~~~~~~t~~~tikK~~K-~vTdEAIqlsGyg-dav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~t 147 (296) -.-+.-+....++.+. .+++++-+||+. .+-+=.+..+.+. .+-+-..++....++-.||...|+.|.+...+.... T Consensus 79 Y~R~~g~~~g~vt~~~-~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~ 157 (319) T protein:vir:97 79 YKRNATNEFDHPKIEE-TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTV 157 (319) T ss_pred ccCCCCcccCCcccce-eEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccccccc Confidence 3333345555665542 234444444443 2322222223333 344456678888899999999999886543322111 Q ss_pred hhhHHHHHHHHHHHHHHhhcccc-CcceEEEEcHHHHHHHhcCCccc------cceeechhhhhhhheeEEEEec--cCC Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYG-SERAIVFANSLDVAEYIAKAGIT------TQTAFGLTYLVDFTGTVIISTN--DVT 218 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded-~~~~VlFvNP~Daa~~l~~a~i~------~q~~fg~tyl~nfLG~~II~S~--kV~ 218 (296) +.+ ...++..+.++..++.+.+ ....|+||+|.=..-++.+..+. .+..+.+.- ..+.|++|+.++ ..+ T Consensus 158 ~~t-~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~V-g~idG~~Vi~vps~~~k 235 (319) T protein:vir:97 158 GTG-SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ-GELDGFVIVKVPTKLLQ 235 (319) T ss_pred ccC-HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeec-eeecCeEEEEecccccc Confidence 111 2334556666667776532 23579999997666554443332 222222222 247788988753 333 Q ss_pred CceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 219 KGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 219 ~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .-..++..+..+... .-.. -.++|...-+.-| =-++-....+.+.+....+||+...=++|. T Consensus 236 ~in~i~~h~~A~~~~-~k~~---~~~~~~p~~~~~a------------~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:97 236 GLQAIAVVGEVLASP-IQAD---LAKTNSNIPGMFG------------TLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) T ss_pred cceEEEEcCCeeeee-eeee---eeeccCCCccccc------------eeeeeeeeeeeEEeccccceEEEeecCCcc Confidence 444555555554321 0000 0111211111111 123345567777787788999865445555 No 144 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=97.65 E-value=3e-05 Score=45.35 Aligned_cols=271 Identities=14% Similarity=0.123 Sum_probs=138.9 Q ss_pred Cccccc-cccccceehhhhhh-hhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeec-ccC--cccCCce Q lcl|Aclame:pro 1 MVTSRT-YPEENLIKSTDLKY-PITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTL-AEG--NVPEGEV 75 (296) Q Consensus 1 ~~~~~~-~ae~nl~~~~dl~~-a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~-A~g--dVaEGe~ 75 (296) |=-=|. +-...-++..|++- .-.-+...+|-+.+.+--.+|+.-|.++...-.++++|+..+... ..+ -.+|+.+ T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~ 80 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKVA 80 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCcc Confidence 100000 00000011112110 001133467777777777788888877533335578888754311 111 2245556 Q ss_pred echhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhH-HHHHHHHHHHHhhhhHHHHHH---------------- Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVT-NTDNALVRQLQKKIRTDFVTA---------------- 136 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~-etd~QL~~~iq~kIdnD~~~a---------------- 136 (296) .|-+..+. ++.++..||+.-.+ |+|.++.+-++..+. .-..+++..+.++...=|+.- T Consensus 81 ~~~~~~tf---~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G 157 (314) T protein:vir:41 81 PTADEVTV---STNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDG 157 (314) T ss_pred CCcccccc---cceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchh Confidence 66666654 35667778877654 999998777654343 333455666666554433321 Q ss_pred -HhcCccc-ee---cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcC-----Cccccceeechhhhhhh Q lcl|Aclame:pro 137 -LKTGTGT-QD---ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAK-----AGITTQTAFGLTYLVDF 206 (296) Q Consensus 137 -Lktat~t-~~---~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~-----a~i~~q~~fg~tyl~nf 206 (296) |+.++.. ++ .........+-+.+..+...|-. +..+.+.+||+.....|++- ..+..+...++.-.. + T Consensus 158 ~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~-~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~-l 235 (314) T protein:vir:41 158 WMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQ-LKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQ-Y 235 (314) T ss_pred hhhhcccceeecCccccccHHHHHHHHHHhcCchhhc-CCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCce-e Confidence 1111111 11 11122233333334444443322 23578999999999887642 234455556665543 8 Q ss_pred heeEEEEeccC-----CCceEEEEcccceEEEEecCcchhhhhhhcccc--ccccceEEEeccccceeehhhhhhHHHHh Q lcl|Aclame:pro 207 TGTVIISTNDV-----TKGEIWATVPENIIFAYINPNNSELAKEFNLYG--DPTGYIGMNHFQENTTLTIQTLLVSGMLM 279 (296) Q Consensus 207 LG~~II~S~kV-----~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~t--d~tGliGv~h~~~~~~~t~et~~~~~~~l 279 (296) +|..|+.+..+ +++.++++.+.||.+ +.. -++......+. +.+.++-- |..+.. T Consensus 236 ~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~--~~~--~~ir~~~~~~a~~~~~~~~~~-~r~d~~-------------- 296 (314) T protein:vir:41 236 DGIPIQYVPALDALGDDKARALLTVPTNLVY--GFW--RNIRIEPKRDAAMRRTEYIAS-LRADCN-------------- 296 (314) T ss_pred cceeeEecccccccCCCCceEEEechhheEE--Eee--ceeEEeecccCcCCeEEEEEE-EEeceE-------------- Confidence 89988888776 579999999999854 332 23333222222 23333211 111100 Q ss_pred hhhccceEEEEEecCCC Q lcl|Aclame:pro 280 YPERIDGIVKVTLTPGV 296 (296) Q Consensus 280 fpE~~dgvv~~tI~~~v 296 (296) | |..|+++++.|..+= T Consensus 297 ~-~~~~aa~~~~~~~~~ 312 (314) T protein:vir:41 297 Y-EDENAAVAAVIDMSS 312 (314) T ss_pred E-EEcCcEEEEEeeccC Confidence 1 236788888887777 No 145 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.64 E-value=1e-05 Score=47.91 Aligned_cols=271 Identities=12% Similarity=0.074 Sum_probs=136.0 Q ss_pred ccccc-ceehhhhhhh-hhhhhHHHHhhhHHHHHHHhCccc-----ccccCCCCeeeeeeeeee-ecccCcccCCc---e Q lcl|Aclame:pro 7 YPEEN-LIKSTDLKYP-ITIDVTNKFQENISKLLEMLGVTR-----KISVSEGMTLKTYAGYDV-TLAEGNVPEGE---V 75 (296) Q Consensus 7 ~ae~n-l~~~~dl~~a-~siDf~~~f~~~i~~L~~~LgVtr-----~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe---~ 75 (296) -|+-| .|+-.|+=.. .==+++.+-....++|++-==|.+ .+-..+|+++++|-|+.+ |+++ ++.+.. + T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~-n~~~d~~~~~ 79 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEP-NYGSDNPNVE 79 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCcc-ccCCCCCccc Confidence 11111 1111111000 001122222222222222100111 111479999999999999 5554 787775 5 Q ss_pred echhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc----------- Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG----------- 142 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~----------- 142 (296) ++..|+++. .....+...+|+. +|-+--++| +||+....+|++....+.-...+++.|+.--. T Consensus 80 ~t~~kittg---~~~a~v~~r~kaw~~~Dla~~lsG-~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~ 155 (367) T protein:vir:80 80 APIDGLGSG---EMKTTKTWLNKAYGAMDLTAELAG-SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIK 155 (367) T ss_pred ccccccccc---hheeeeehhcccchhhhHHHHhhC-chHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhh Confidence 888998874 3445666677875 788866677 79999999999999999999999988873211 Q ss_pred ----------------cee---cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccc----cceeec Q lcl|Aclame:pro 143 ----------------TQD---ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGIT----TQTAFG 199 (296) Q Consensus 143 ----------------t~~---~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~----~q~~fg 199 (296) +.+ .+++.=..--+.++.+++-+|.|..+.=.+++|||.=++++.+.-.|. ++...+ T Consensus 156 ~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~~~~ 235 (367) T protein:vir:80 156 TRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQLT 235 (367) T ss_pred hhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCCccc Confidence 011 011000001134455566788887666779999999988877765432 222222 Q ss_pred hhhhhhhheeEEEEeccCCC-----ce---EEEEcccceEEEEecCcchhhhhhhccccccc-c-ceEEEeccccceeeh Q lcl|Aclame:pro 200 LTYLVDFTGTVIISTNDVTK-----GE---IWATVPENIIFAYINPNNSELAKEFNLYGDPT-G-YIGMNHFQENTTLTI 269 (296) Q Consensus 200 ~tyl~nfLG~~II~S~kV~~-----G~---~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~t-G-liGv~h~~~~~~~t~ 269 (296) +.-++|..||.+..+|. ++ -|+-.+..+.+.-..|.- .-...-|+. | -=|+.--+.-++. T Consensus 236 ---i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~-----~~E~~Rd~~~~~~gG~d~L~~Rr~~-- 305 (367) T protein:vir:80 236 ---IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQV-----PVAVGRRELRGNGSGLEYILERKEW-- 305 (367) T ss_pred ---cceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCcc-----ceecccchhhhcCCceEEEEeeeeE-- Confidence 22378999999999993 22 344555555543333320 011111221 1 1122111111111 Q ss_pred hhhhhHHHHhhhhccceEEEEEe-----------cCCC Q lcl|Aclame:pro 270 QTLLVSGMLMYPERIDGIVKVTL-----------TPGV 296 (296) Q Consensus 270 et~~~~~~~lfpE~~dgvv~~tI-----------~~~v 296 (296) .+-..|+.|-...+ +.-+. ..|- T Consensus 306 -~~hP~G~s~~~~~v---~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 306 -IVHPGGFNWLDADV---TIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred -Eeecceeeeccccc---ccccccccccccccccCCCC Confidence 11122322211100 00000 0111 No 146 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=97.61 E-value=3.8e-05 Score=44.84 Aligned_cols=277 Identities=10% Similarity=0.066 Sum_probs=151.4 Q ss_pred Cccccccccccceehhhhhhhhhhh-hHHHHhhhHHHH----HHHhCcccccccCCCCeeeeeeeeeeecccCcccCCce Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITID-VTNKFQENISKL----LEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEV 75 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siD-f~~~f~~~i~~L----~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~ 75 (296) |- ++ .||+...-=+.+-..+ |.+.|+..+.+= ...++..+..++.-|+++++|.=.-...+ ....|++ T Consensus 1 ms----~~-~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~--~~~pG~~ 73 (335) T protein:vir:63 1 MS----FL-NDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAK--GRRAGEE 73 (335) T ss_pred CC----Cc-ccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeee--cccCCcC Confidence 32 22 4444432222111122 668888777554 34678888888999999999875444333 4567888 Q ss_pred echhheeeeecceeEEEEee--ccc-cc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------- Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKK--YRK-AT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------- 143 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK--~~K-~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------- 143 (296) |--+.+.. +-.+++|.. |.+ -+ =||+ | +=| |--.|..+|+..++++..|.-++-.|-.+... T Consensus 74 l~~~~~~~---~k~~itVD~ll~a~~~I~dlDe~-~-~~y-DvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~ 147 (335) T protein:vir:63 74 LERSRVVN---DKWNLTVDTLLYLRHQFDHQDEW-T-QSF-DMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLE 147 (335) T ss_pred cCCCCccc---cceEEEecceeechhhhhhHHHH-h-cCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccC Confidence 87766543 335677765 333 34 2555 3 334 48899999999999999999998555332211 Q ss_pred ----------eecch----hhHHHHHHHHHHHHHHhhcccc-----CcceEEEEcHHHHHHHhcCCccccceeec---h- Q lcl|Aclame:pro 144 ----------QDALG----AGLQGALASAWGKLQVLFEDYG-----SERAIVFANSLDVAEYIAKAGITTQTAFG---L- 200 (296) Q Consensus 144 ----------~~~t~----~~lQ~Ala~~~~~~~~~Feded-----~~~~VlFvNP~Daa~~l~~a~i~~q~~fg---~- 200 (296) ...++ .+.| +|..++-.+...|.+.+ -...+++|+|..-+.+|.+.++- +..|+ + T Consensus 148 ~~~~~G~~~~~~~tg~~~~~~~~-~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~-n~~~~~s~~~ 225 (335) T protein:vir:63 148 DAFSPGVLEKLDLTGLTAKQAAD-KIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLM-NVEYQATGAT 225 (335) T ss_pred CCcCCCcceeeeeccCcccccHH-HHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccc-cccccccccc Confidence 00011 1222 34444444555555332 13479999999999999877653 22332 1 Q ss_pred -----hhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhcccc-ccccceEEEecc------ccceee Q lcl|Aclame:pro 201 -----TYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYG-DPTGYIGMNHFQ------ENTTLT 268 (296) Q Consensus 201 -----tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~t-d~tGliGv~h~~------~~~~~t 268 (296) .-+.++.|+.|+.|+.+|.+.....+..|=. |...+|..+...+.. -+.=..+-.++. +.++.+ T Consensus 226 ~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~----n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~ 301 (335) T protein:vir:63 226 NDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHF----NVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFS 301 (335) T ss_pred ccccCceeEEeeceEEEeeccCCCCCcccccccccC----CccccccceeEEEEEecceEEEEEEeecccceeeccchhh Confidence 1233577899999999998865544321110 122344544443332 111112222321 223322 Q ss_pred hhhhhhHHHHhhhhccceEEEEEecC--CC Q lcl|Aclame:pro 269 IQTLLVSGMLMYPERIDGIVKVTLTP--GV 296 (296) Q Consensus 269 ~et~~~~~~~lfpE~~dgvv~~tI~~--~v 296 (296) .--....++=-=|=|+|+++.++.+. ++ T Consensus 302 ~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:63 302 WVLDTFQMYNIGARRPDTAGAIELKGIGAF 331 (335) T ss_pred HHhHHHHHcCCcccccceEEEEEEcCCCce Confidence 22222222222255778888887643 22 No 147 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.56 E-value=2.1e-05 Score=46.27 Aligned_cols=274 Identities=14% Similarity=0.048 Sum_probs=143.2 Q ss_pred Cccccccc-----------cccceehhhh-hhhhhhhhHHHHhhhHHHHHHH--h----CcccccccCCCCeeeeeeeee Q lcl|Aclame:pro 1 MVTSRTYP-----------EENLIKSTDL-KYPITIDVTNKFQENISKLLEM--L----GVTRKISVSEGMTLKTYAGYD 62 (296) Q Consensus 1 ~~~~~~~a-----------e~nl~~~~dl-~~a~siDf~~~f~~~i~~L~~~--L----gVtr~~~~~pG~tIt~pk~~y 62 (296) .+|.-.|- .-||..-.+- -++-+|...++|..-|++.+.. + -+.|......|++|++|++.. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~ 84 (329) T protein:vir:10 5 FITGVKTMNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDV 84 (329) T ss_pred EEechhhhhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecc Confidence 12211110 1133222111 1356888999999999998754 2 356677888999999999988 Q ss_pred eecccCcccCCceechhheeeeecceeEEEEeecccc-cCHHHHHhhcCCc-hhHHHHHHHHHHHHhhhhHHHHHHHhcC Q lcl|Aclame:pro 63 VTLAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRKA-TTGEDIQMYGSNE-AVTNTDNALVRQLQKKIRTDFVTALKTG 140 (296) Q Consensus 63 ig~A~gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~-vTdEAIqlsGygd-av~etd~QL~~~iq~kIdnD~~~aLkta 140 (296) .|..+ -..+.-+....++.+. .+++++-.||+.= +-+=.+..+.+.- +-+-..++....++-.||...|+.|.+. T Consensus 85 ~gl~D--Y~R~~g~~~g~vt~~~-~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~ 161 (329) T protein:vir:10 85 TELKD--YKRNATNEFDHPQIQE-TTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARN 161 (329) T ss_pred ccccc--ccCCCCccccccccce-eEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhh Confidence 77764 4334445555666542 2333333443331 2211122233332 3344567788889999999999988554 Q ss_pred ccceecchhhHHHHHHHHHHHHHHhhcccc-CcceEEEEcHHHHHHHhcCCccc------cceeechhhhhhhheeEEEE Q lcl|Aclame:pro 141 TGTQDALGAGLQGALASAWGKLQVLFEDYG-SERAIVFANSLDVAEYIAKAGIT------TQTAFGLTYLVDFTGTVIIS 213 (296) Q Consensus 141 t~t~~~t~~~lQ~Ala~~~~~~~~~Feded-~~~~VlFvNP~Daa~~l~~a~i~------~q~~fg~tyl~nfLG~~II~ 213 (296) ..+...++.+ ...+++.+.++..++.+.+ ....|+||+|.=..-+....... .+..+.+.- ..+.|++|+. T Consensus 162 a~~~~~~~~t-~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~V-g~idG~~Ii~ 239 (329) T protein:vir:10 162 KAKHLTVGSG-ADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQ-GELDGFTIVK 239 (329) T ss_pred cccccccccC-HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeee-eeecCeEEEE Confidence 3322211122 3334456666667776542 24569999997766555443332 222232222 2377889887 Q ss_pred ecc--CCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEE Q lcl|Aclame:pro 214 TND--VTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVT 291 (296) Q Consensus 214 S~k--V~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~t 291 (296) +++ .+.-..++..++.+... .-.. --++|...-+. .-=-++-....+.+.+....+||... T Consensus 240 vps~~~k~in~ii~~~~A~~~~-~K~~---~~~~~~p~~~~------------~a~~v~gr~yyd~~V~~~k~~~I~~~- 302 (329) T protein:vir:10 240 VPSKMLQGVEAMAVIGEVMASP-IQAN---EAKLNSNVPGM------------FGTLAEQMLYTGAFVPEHLQKYIFTI- 302 (329) T ss_pred ecCCcccceeEEEEcCCceeee-eeee---eeeeeCCCCcc------------chheeeeeeeeeeEEEccccCEEEEe- Confidence 543 33334555555443321 0000 00111111111 11123345566777777777888653 Q ss_pred ecCCC Q lcl|Aclame:pro 292 LTPGV 296 (296) Q Consensus 292 I~~~v 296 (296) ++.+. T Consensus 303 ~~~a~ 307 (329) T protein:vir:10 303 GGKEV 307 (329) T ss_pred cccCc Confidence 33333 No 148 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.53 E-value=1.4e-05 Score=47.16 Aligned_cols=267 Identities=13% Similarity=0.095 Sum_probs=117.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccC-----CCCeeeeeeeeeeecccCcccCCce Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVS-----EGMTLKTYAGYDVTLAEGNVPEGEV 75 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~-----pG~tIt~pk~~yig~A~gdVaEGe~ 75 (296) |- |-.-+ .+-...+=.....|++++- +.. -|.|..+.+ .|+||++|.=.-....+..-+-|.. T Consensus 1 MA--------N~llT-~iP~iia~~al~~l~~~lV-~~~--lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~ 68 (423) T protein:vir:35 1 MA--------NNLES-NISQIVLKKFLPGFMSDIV-LCK--TVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITG 68 (423) T ss_pred Cc--------cchhh-hhHHHHHHHHHHHHHhhcc-cch--hcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCC Confidence 21 11111 0111122233444444443 222 166666554 4999999875433333333334555 Q ss_pred echhheeeeecceeEEEEeeccc---ccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC-ccceecchhhH Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKKYRK---ATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG-TGTQDALGAGL 151 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK~~K---~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta-t~t~~~t~~~l 151 (296) |..+.+... ..+++|.|... .++||.-.+ ...+=-.....| ..+++++|+.++...+... .......+... T Consensus 69 ~~~~~~~e~---~v~l~id~~k~~a~~v~d~e~~l-~i~~~~~~l~~a-~~ala~~vd~~l~~~l~~~a~~~vgt~~t~~ 143 (423) T protein:vir:35 69 KDKNGLFSA---KATGKVGKYITVAVEWTQIEEAL-KLNQLDQILSPI-HERMVTDLETELAHFMMNNGALSLGSPNTAI 143 (423) T ss_pred ccccccccc---eeeEEeccceeccceeCHHHHHh-hHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccccCCc Confidence 666666643 35678866444 247666322 222222233333 4678888999999877542 22211111111 Q ss_pred HHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCC-ccccceeechhh-----h-hhhheeEEEEeccCCCceE Q lcl|Aclame:pro 152 QGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKA-GITTQTAFGLTY-----L-VDFTGTVIISTNDVTKGEI 222 (296) Q Consensus 152 Q~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a-~i~~q~~fg~ty-----l-~nfLG~~II~S~kV~~G~~ 222 (296) . + .+.+.++..++++.. ...-+++++|...+.+|+.- .+......+.+- + .++.|..|++|+.||..+. T Consensus 144 ~-~-~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~ 221 (423) T protein:vir:35 144 K-K-WADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQ 221 (423) T ss_pred c-h-HHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCccccc Confidence 1 1 234555556665542 13468999999988888654 344333333321 1 2467889999999997544 Q ss_pred EEEcc------c-ceE--------EEEecCcchhh-hhh-hccccccccceEE--Eecccccee-------------ehh Q lcl|Aclame:pro 223 WATVP------E-NII--------FAYINPNNSEL-AKE-FNLYGDPTGYIGM--NHFQENTTL-------------TIQ 270 (296) Q Consensus 223 ~~t~~------~-Nl~--------~ay~~~~~g~~-~~~-f~~~td~tGliGv--~h~~~~~~~-------------t~e 270 (296) ..-.. . +.. ..|++.. |.- +.. |....|.-=+=|| .|.+.+..+ +.+ T Consensus 222 gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~-~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~ 300 (423) T protein:vir:35 222 GDFDGAITVKTAPNVDYLSVKDSYQFTVALT-GATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEE 300 (423) T ss_pred cccccceeeccccccccccccccccceeeee-eeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEecc Confidence 33111 1 100 0011110 000 000 0111121111122 122222221 111 Q ss_pred hhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 271 TLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 271 t~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +...+| |-..++|.++. T Consensus 301 ~~~~a~---------g~~~v~i~p~~ 317 (423) T protein:vir:35 301 TNSTAS---------GDVTVKLSGVP 317 (423) T ss_pred cccccc---------CceeEEccccc Confidence 111111 11223333332 No 149 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=97.43 E-value=6.9e-05 Score=43.39 Aligned_cols=262 Identities=10% Similarity=0.078 Sum_probs=143.0 Q ss_pred Cccccccccccceehhhhhhhhhh--hhHHHHhhhHHHHH----HHhCcccccccCCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITI--DVTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~si--Df~~~f~~~i~~L~----~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |-+ + .||+...- +.+-+. =|.+.|+..+.+=+ ..++..+..++.-|+++++|.=.-... + ....|+ T Consensus 1 ms~----~-~~~t~~~~-~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~-~-~~~pG~ 72 (335) T protein:vir:78 1 MSF----L-NDLTRPNY-AGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEA-K-GRRAGE 72 (335) T ss_pred CCc----c-cccccccc-ccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeee-c-ccccCc Confidence 322 2 33333211 111111 26788887775444 457778888899999999985333322 2 455788 Q ss_pred eechhheeeeecceeEEEEee--ccc-cc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce----- Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKK--YRK-AT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ----- 144 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK--~~K-~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~----- 144 (296) ++.-+.+.. .-.+++|.. |.. .| =||+ | +=| |--.|..+|+..++++..|.-++..|-.+.... T Consensus 73 ~l~~~~~~~---~k~~itID~ll~a~~~VddlDe~-~-~~y-DvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~ 146 (335) T protein:vir:78 73 ELERSRVVN---DKWNLTVDTLLYLRHQFDHQDEW-T-QSF-DMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDL 146 (335) T ss_pred ccCCCCccc---CCeEEEecceeechhhHhhHHHh-h-cCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 886666543 235677755 233 34 2555 3 444 488999999999999999999986664443210 Q ss_pred -------------------ecchhhHHHHHHHHHHHHHHhhcccc-----CcceEEEEcHHHHHHHhcCCccccceeec- Q lcl|Aclame:pro 145 -------------------DALGAGLQGALASAWGKLQVLFEDYG-----SERAIVFANSLDVAEYIAKAGITTQTAFG- 199 (296) Q Consensus 145 -------------------~~t~~~lQ~Ala~~~~~~~~~Feded-----~~~~VlFvNP~Daa~~l~~a~i~~q~~fg- 199 (296) +..+..|.. ++-++...|.+-| ....|++|+|..-+.+|.+.++-. ..|+ T Consensus 147 ~~~~~~G~~~~~~~tg~~~~~~~~~l~~----a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n-~~~~~ 221 (335) T protein:vir:78 147 EDAFSPGVLEKLDLTGLTAKEAAEKIVR----MHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMS-VEYQA 221 (335) T ss_pred CCCcCCCcceeeeeccccccccHHHHHH----HHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccc-ccccc Confidence 012223333 3334444444221 234799999999999998876532 2222 Q ss_pred --h------hhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhcccc-ccccceEE-----------E Q lcl|Aclame:pro 200 --L------TYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYG-DPTGYIGM-----------N 259 (296) Q Consensus 200 --~------tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~t-d~tGliGv-----------~ 259 (296) + .-+.++.|++|+.|+.+|.+..... .++.+|+.|- |-+-.+|+ . T Consensus 222 s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~---------------~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~ 286 (335) T protein:vir:78 222 TGATNDYVKSRVAILNGVKVLETPRFATKAISAH---------------PLGRHFNVSAEEAERQIALFLPSKTLITAQV 286 (335) T ss_pred cccccccccceeEEeeceEEEeeccCCCCCCccc---------------cccccCCcccccccceEEEEEecceEEEEEE Confidence 1 1233578999999999998753332 2333343333 22222222 1 Q ss_pred ecccccee---ehhhhhhHHHHhh---hhccceEEEEEecCC--C Q lcl|Aclame:pro 260 HFQENTTL---TIQTLLVSGMLMY---PERIDGIVKVTLTPG--V 296 (296) Q Consensus 260 h~~~~~~~---t~et~~~~~~~lf---pE~~dgvv~~tI~~~--v 296 (296) ++...+.- -..+.++-|...| |=|+|+++.++.+.. + T Consensus 287 ~~~~~e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:78 287 APVQAKLWEDHDQFSWVLDTFQMYNIGARRPDTAGAIELKGIEAF 331 (335) T ss_pred EecccceeeccchhhHhhhHHHHcCCcccCcceEEEEEecCCCcc Confidence 11111110 1122233332222 457778887776543 3 No 150 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.37 E-value=5e-06 Score=49.66 Aligned_cols=267 Identities=10% Similarity=-0.010 Sum_probs=137.1 Q ss_pred Cccccc---cccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceec Q lcl|Aclame:pro 1 MVTSRT---YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIP 77 (296) Q Consensus 1 ~~~~~~---~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Ip 77 (296) ..|+.- +-+=+.....+-++..--+|.+++-+.+.+.--++.+.+.+|+. | .+++|+-.-.+.|. -++|+++|+ T Consensus 71 ~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~-~-~~~i~~~~~~~~a~-w~~e~~~~~ 147 (383) T protein:vir:78 71 NITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTG-L-RTKFLKSETSGVAV-WGKIFGEIK 147 (383) T ss_pred hhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecC-C-ceEEEEEcCCcceE-Eeecccccc Confidence 111000 00000111223333334467777777777777778888888863 4 46898876666664 788888875 Q ss_pred -hhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc------------Cc- Q lcl|Aclame:pro 78 -LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT------------GT- 141 (296) Q Consensus 78 -lskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt------------at- 141 (296) -+..+. ...++..+|+..-+ |.|-++.+++ +--+.-.++|+.+++.+++.-|+.==.+ .+ T Consensus 148 ~~~~~~f---~~i~l~~~kl~~~i~is~ell~Ds~~-~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~ 223 (383) T protein:vir:78 148 GQLDATF---SDEESIQNKLTAFVVVPKDLEKFGPA-WVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGST 223 (383) T ss_pred cccCcce---eeEeecceeeEeeccchHHHhhccHH-HHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCccc Confidence 333332 34678889988754 8888865554 4457788899999999999887731100 00 Q ss_pred ------------cce-ecchhhHHHHHHHHHHHHHHhhcccc---CcceEEEEcHHHHHHHhcCCccccceeechhhhhh Q lcl|Aclame:pro 142 ------------GTQ-DALGAGLQGALASAWGKLQVLFEDYG---SERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVD 205 (296) Q Consensus 142 ------------~t~-~~t~~~lQ~Ala~~~~~~~~~Feded---~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl~n 205 (296) .+. ......+...+......+.-..+... ....+.++||.|.+.|........+. |.|. . T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~---G~~~-t 299 (383) T protein:vir:78 224 VVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNAN---GVYV-T 299 (383) T ss_pred ccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCC---Ccee-e Confidence 000 00011111112111111111111111 12346899999998887654432221 1222 1 Q ss_pred h--heeEEEEeccCCCceEEEEcccceEEEEecCcchh--hhhhhccccccccceEEEeccccceeehhhhhhHHHHhhh Q lcl|Aclame:pro 206 F--TGTVIISTNDVTKGEIWATVPENIIFAYINPNNSE--LAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) Q Consensus 206 f--LG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~--~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfp 281 (296) . +|..||.|..+|+|++++-.... +...+-.+=. .+...-+..|+++|++..+.-- ..+ T Consensus 300 ~l~~~~~iv~s~~~p~~~iifgdfs~--Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG-------------~~~-- 362 (383) T protein:vir:78 300 ALPFNLNIIESLFVPEKKAISYVAER--YDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYG-------------KAK-- 362 (383) T ss_pred ecCCCceEEecCCCCcccEEEeeccc--eEEEecccceEEecchhhhhcCceEEEEEEEEcC-------------EEe-- Confidence 2 36789999999999987654443 2222321111 1122334558888887754211 111 Q ss_pred hccceEEE--EEecC--CC Q lcl|Aclame:pro 282 ERIDGIVK--VTLTP--GV 296 (296) Q Consensus 282 E~~dgvv~--~tI~~--~v 296 (296) ..++++. ++|.+ ++ T Consensus 363 -~~~A~~vl~~~~~~~~~~ 380 (383) T protein:vir:78 363 -DDKAAAVWTLNINPAEQT 380 (383) T ss_pred -cCCeEEEEEEEecCCCCC Confidence 1223333 33422 22 No 151 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=97.29 E-value=1.7e-05 Score=46.79 Aligned_cols=273 Identities=12% Similarity=0.032 Sum_probs=142.6 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCceechhh Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplsk 80 (296) |-|-. |...+..+ .++-=+..+...++.+=|=...+.|+..-..|+||+||.=.-..- .|=.++..|.++. T Consensus 1 ~~~~n-----~ts~~qaf--i~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV--~dY~~~~~i~~d~ 71 (322) T protein:vir:31 1 MSTGN-----NTSNTQAL--IVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVV--RSRPEQGDFTFDN 71 (322) T ss_pred CCCCC-----CcccceEE--eehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccccccc--ccccCCCCccccc Confidence 55433 22222222 112223444444554444456678877766799999987543322 2556888899998 Q ss_pred eeeeecceeEEEEee---cccccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce------------- Q lcl|Aclame:pro 81 VERKIHSEKKIELKK---YRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------------- 144 (296) Q Consensus 81 v~~~~~~t~~~tikK---~~K~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~------------- 144 (296) +++. ..+++|.+ ++..|+|..+|.+ ++-.+...++.+.+++..+|.=.-..|+++.++. T Consensus 72 ltt~---~~~l~IDq~KYfaf~VdDD~~Qa~--~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~ 146 (322) T protein:vir:31 72 LDTG---EISIILRDEVYAGNAISKKLRQDS--RWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVP 146 (322) T ss_pred CCCc---eEEEEEehhhhhccccchhHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCc Confidence 8875 46788866 3446888777744 4677788888888888888877666666543210 Q ss_pred ---ecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHH---------HhcCCcc---ccc-eeechhhhhhh Q lcl|Aclame:pro 145 ---DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAE---------YIAKAGI---TTQ-TAFGLTYLVDF 206 (296) Q Consensus 145 ---~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~---------~l~~a~i---~~q-~~fg~tyl~nf 206 (296) ..+++ =|...+..+.++..++++.+ ....+++|+|.-.+. .+++.+. ..+ .+=|+-...+. T Consensus 147 ~~iv~~gt-~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~ 225 (322) T protein:vir:31 147 HRFVGTGT-DQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSV 225 (322) T ss_pred cceeccCC-CchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHH Confidence 00122 23334466777777787753 235699999997553 3445432 212 11233345578 Q ss_pred heeEEEEeccCCCceE--EEEcccceEEEEecCcchhhhhhhccccccccceEEEecc---------ccceeehhh---h Q lcl|Aclame:pro 207 TGTVIISTNDVTKGEI--WATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQ---------ENTTLTIQT---L 272 (296) Q Consensus 207 LG~~II~S~kV~~G~~--~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~---------~~~~~t~et---~ 272 (296) +|++|+.||.+++++. ++-...+...+ |.. +-|-...|.--+=||+|-+ +.+|.--++ . T Consensus 226 ~GF~V~~SN~l~~~~~~i~aG~d~~~t~a------g~~-n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~ 298 (322) T protein:vir:31 226 YGIDLFVSNLLADANETINAGGDARSTTA------GKC-NMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTA 298 (322) T ss_pred hceeeeeeccccccccccccCcccccccc------eee-cccccccchhhhhhhhHhhhhhhhhcccCccccccceeeee Confidence 9999999999975441 11111111110 111 1111212222222222222 112222111 1 Q ss_pred hhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 273 LVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 273 ~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .-.+-++.||.+ +.+..++.+-- T Consensus 299 ~~g~g~~r~e~l-~~~~a~~~~~~ 321 (322) T protein:vir:31 299 RWGNGLVRDENL-VCVLANADKVT 321 (322) T ss_pred eecceeecccce-EEEEecccccc Confidence 223445566655 23333332222 No 152 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.28 E-value=0.00011 Score=42.38 Aligned_cols=272 Identities=10% Similarity=0.083 Sum_probs=135.9 Q ss_pred cccccceeh-hhhhhhhhhhhHHHHhhhHHHHHHHhC-----cccccc-cCCCCeeeeeeeee---e--ecccCcccCCc Q lcl|Aclame:pro 7 YPEENLIKS-TDLKYPITIDVTNKFQENISKLLEMLG-----VTRKIS-VSEGMTLKTYAGYD---V--TLAEGNVPEGE 74 (296) Q Consensus 7 ~ae~nl~~~-~dl~~a~siDf~~~f~~~i~~L~~~Lg-----Vtr~~~-~~pG~tIt~pk~~y---i--g~A~gdVaEGe 74 (296) -+-.|+... .-|..-++-=|+++|..++..+++-=| --|... ..-++++..|.-.. + +.....++.+. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 80 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGT 80 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCcc Confidence 111121111 111112355699999999998865443 222111 11222333322100 0 11111222222 Q ss_pred -eechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------e- Q lcl|Aclame:pro 75 -VIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------Q- 144 (296) Q Consensus 75 -~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------~- 144 (296) .+|.....+ .++.+.+..|.-+. .+.+- +---.||.+...++.+.+++++.|.-+++++...... + T Consensus 81 ~dtp~~~~~~---~~r~~~~~d~~~~~~VDd~D~-~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~ 156 (322) T protein:vir:10 81 YPTPVNNKPF---AKRRTNVDTYDTGHVVEQEDI-SQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVE 156 (322) T ss_pred cCCCcccccc---ceEEEeecccccceecchHHH-HHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccc Confidence 355544433 34556665555454 45553 4456799999999999999999999888766432100 0 Q ss_pred -------ecchhhHHHHHHHHHHHHHHhhcccc---CcceEEEEcHHHHHHHhcCCccccc------eee-chhhhhhhh Q lcl|Aclame:pro 145 -------DALGAGLQGALASAWGKLQVLFEDYG---SERAIVFANSLDVAEYIAKAGITTQ------TAF-GLTYLVDFT 207 (296) Q Consensus 145 -------~~t~~~lQ~Ala~~~~~~~~~Feded---~~~~VlFvNP~Daa~~l~~a~i~~q------~~f-g~tyl~nfL 207 (296) ...+.++- ..-+-.+..+|+.-+ +..-++.|+|...+++|...+++.. -++ +|. ..++| T Consensus 157 ~~ss~~i~~g~~g~t---~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~-ig~~l 232 (322) T protein:vir:10 157 FLATQEIGDGTKPIS---FDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGI-ITNWM 232 (322) T ss_pred cCCCcccccCccchh---HHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCe-eeeee Confidence 00011111 111223334454421 2235899999999999998876531 111 233 34699 Q ss_pred eeEEEEeccCCCceE--E-----EEcccc--eEEEEecCcchhhhhhhccccccccceEEEeccccceee--hhhhhhHH Q lcl|Aclame:pro 208 GTVIISTNDVTKGEI--W-----ATVPEN--IIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLT--IQTLLVSG 276 (296) Q Consensus 208 G~~II~S~kV~~G~~--~-----~t~~~N--l~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t--~et~~~~~ 276 (296) |..+|.|++||.... + .++.+. -+++|-... =.|..-.|-+.-| .+.++..++. +=...+.| T Consensus 233 Gf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~A-----v~~a~~~dv~~~i--~~~~~~~~a~~I~~~~~~Ga 305 (322) T protein:vir:10 233 GYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMA-----LGYHSCKDIWTKV--AEDPSASFAWRIYSAFTADC 305 (322) T ss_pred eEEEEEeccCCccccccccccccCCCCccceeEEEEecCc-----eeEEEeeeeeEEe--eccCCcchhhhhhhhhhhCc Confidence 999999999995221 1 111111 122333321 0111111323333 3444444433 22233344 Q ss_pred HHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 277 MLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 277 ~~lfpE~~dgvv~~tI~~~v 296 (296) ..+ ..+|||+..+.... T Consensus 306 ~ri---~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 306 VRV---EDEHIFKLRLKNSL 322 (322) T ss_pred eEe---ccCcEEEEEEeccC Confidence 444 34899999998888 No 153 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=97.25 E-value=7.9e-06 Score=48.54 Aligned_cols=253 Identities=14% Similarity=0.176 Sum_probs=131.9 Q ss_pred Cccc------------cccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccC Q lcl|Aclame:pro 1 MVTS------------RTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEG 68 (296) Q Consensus 1 ~~~~------------~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~g 68 (296) |.-- -||+..|++. +++|-=+= -+.-.-|-=.-....++.+.-|.---+|.+..+ -|- T Consensus 58 mm~G~~p~~eV~~~e~mtt~~a~Ili------P~vis~v~--~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~-Ra~- 127 (393) T protein:vir:79 58 MMEGETPTNEVNLREFMATPSAQILI------PRVIVGTM--REAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIM-RAY- 127 (393) T ss_pred HhcCCCchhheehhhhhcCCCcceec------hhhhhhhh--hhcccchhHHHHHHHHHhhhcCcceeccchhee-eec- Confidence 1111 1233333333 33321110 000000000011123566777877778888644 564 Q ss_pred cccCCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccce-- Q lcl|Aclame:pro 69 NVPEGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ-- 144 (296) Q Consensus 69 dVaEGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~-- 144 (296) +||||.++|-..+.-...+..++..+|++-.+ |+|+|-.||+ |-|+..-++..+.++++-+.-||.-+++..++. T Consensus 128 ~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~-Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfD 206 (393) T protein:vir:79 128 DVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQW-DLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFD 206 (393) T ss_pred cccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchH-HHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeee Confidence 89999999999988322334556667776665 9999988998 579999999999999999999999887654421 Q ss_pred --------ecchhhHHHHHHH--HHHHHH----HhhccccCcceEEEEcHHHHHHHhcCCccc--cceeechhhhh---- Q lcl|Aclame:pro 145 --------DALGAGLQGALAS--AWGKLQ----VLFEDYGSERAIVFANSLDVAEYIAKAGIT--TQTAFGLTYLV---- 204 (296) Q Consensus 145 --------~~t~~~lQ~Ala~--~~~~~~----~~Feded~~~~VlFvNP~Daa~~l~~a~i~--~q~~fg~tyl~---- 204 (296) ..++-++|+-.+- ..+++. .+-.+ .....|+|++|.--.-+-+++... .+++|| +|.. T Consensus 207 a~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~-hyt~svi~MHPLAWnv~AKna~me~~~~na~g-N~~~~~~~ 284 (393) T protein:vir:79 207 NYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMAN-EYTPSDLMMHPLAWTVFAKNELMGSLQANPYG-NYPAKGAP 284 (393) T ss_pred ccccCccceeecCCccccccccccHHHHHHHHHHHhcc-cCCcceEEEcCchhhhhhhhhhhcceeecccc-ccCccccc Confidence 1344334332110 011111 12233 346779999999887777777553 356666 3321 Q ss_pred --hhhe-----------eEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhh Q lcl|Aclame:pro 205 --DFTG-----------TVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQT 271 (296) Q Consensus 205 --nfLG-----------~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et 271 (296) +.|| ..|+.|.-||=-+ -++-|.+|.=..+-+||.--.+ .++.|- T Consensus 285 ts~algp~~i~~~~~~nlnv~~sPfvp~d~--------------------k~~rFd~~~Vd~NnvgvlLV~D--~i~tdq 342 (393) T protein:vir:79 285 SSMALGPDSIQGRLPFNFNVNLSPFIPLDK--------------------KSRRFDVYAVDRNNVGVLLVRD--DLKTDQ 342 (393) T ss_pred hhhhhchhhhccccccceeEEEeccccccc--------------------ccceeeEEEeecCCceEEEEec--Ccceec Confidence 2344 3455555554211 2455555552223334332111 222221 Q ss_pred hhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 272 LLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 272 ~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) |-|.+-+|-+.....-- T Consensus 343 --------~ddk~rdiq~iKl~ERY 359 (393) T protein:vir:79 343 --------WDEKARGLQNIKMIERY 359 (393) T ss_pred --------cccccccceeeeeeeee Confidence 22333333333322211 No 154 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=97.18 E-value=7.7e-05 Score=43.14 Aligned_cols=250 Identities=14% Similarity=0.147 Sum_probs=134.3 Q ss_pred Ccccc--------------------ccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeee Q lcl|Aclame:pro 1 MVTSR--------------------TYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAG 60 (296) Q Consensus 1 ~~~~~--------------------~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~ 60 (296) |+.++ .|...+-.-+.++ ..--|+|+++ .+.+..||- .+|.. |+|+.- T Consensus 110 l~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~-v~d~i~li~q-~r~i~slf~------tLP~~-g~T~eY--- 177 (410) T protein:vir:83 110 MWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPI-VGPVIDFIDS-ARPLVSTLG------TLPLN-NATFYR--- 177 (410) T ss_pred HhccCCchHHHHHHHHHHHHhhccCcccccccccchhH-hhhHHHHHhh-ccchhhhhh------hCCCC-CCeeEE--- Confidence 22221 1222211111222 3445667765 445555552 27765 998754 Q ss_pred eeeecccCccc----------CCceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHH----HHH Q lcl|Aclame:pro 61 YDVTLAEGNVP----------EGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNAL----VRQ 124 (296) Q Consensus 61 ~yig~A~gdVa----------EGe~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL----~~~ 124 (296) .+...+ ..|+ ||+..|.-||+.. +.++.||-|+-.+ |-.+|. .+--..++-.-|-| +.+ T Consensus 178 ~v~t~~-~tV~~q~~~~kqa~EGd~L~~gKl~~~---t~tA~ikTyGGyt~LSRQ~IE-Rs~v~~L~~~lraL~~AYA~a 252 (410) T protein:vir:83 178 PIVSQR-PAVGLQGVAGGASDEKTELDSQKMVID---RLTVNAKTLGGYVNVSRQAID-FSSPSALDLVVNGLGQQYAIE 252 (410) T ss_pred eeeccc-ccccccccccccccccccccccceeee---eccceeehhcCcccccceeee-cCChhhHHHHHHHHHHHHHHH Confidence 233222 2343 9999999999985 7899999999876 788884 44445555555555 555 Q ss_pred HHhhhhHHHHHHHhcCccceecchhhHHHHHHHHHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCc--c----ccce Q lcl|Aclame:pro 125 LQKKIRTDFVTALKTGTGTQDALGAGLQGALASAWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAG--I----TTQT 196 (296) Q Consensus 125 iq~kIdnD~~~aLktat~t~~~t~~~lQ~Ala~~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~--i----~~q~ 196 (296) -.+.++.-|....-+...+.+.+++.+-.++ | ++.+.+.|- +-...++-|+| |+-+.++.-- + ..-. T Consensus 253 tea~vra~L~~t~t~~~a~~~~Tad~~~~~i---~-da~~~v~da~~~~~~~~i~vS~-DVl~~~~~~f~~~~~~~~dt~ 327 (410) T protein:vir:83 253 TEALVGAALASTSTGAVGYGNATADNVASAI---W-QAAGAVYTAVKGMGRLVIAIAP-DVLGDFGPLFAPVNPTNAHST 327 (410) T ss_pred HHHHHHHHHHHhhhhhhhhhhccHHHHHHHH---H-HHHHHHhhhhccceeeeEEech-hhhhhccceeeccCCCCcccc Confidence 5556666665555444444455666664433 4 566777663 22233556665 5544443321 1 1111 Q ss_pred eech-----hhhhhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhh--hhhhccccccccceEEEeccccceeeh Q lcl|Aclame:pro 197 AFGL-----TYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL--AKEFNLYGDPTGYIGMNHFQENTTLTI 269 (296) Q Consensus 197 ~fg~-----tyl~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~--~~~f~~~td~tGliGv~h~~~~~~~t~ 269 (296) .||. ..-..++|+-|+++.+.+.|++++..+..|..+=-....=+| ...++|+-|=.|+.++. T Consensus 328 Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~ySgY~a~a---------- 397 (410) T protein:vir:83 328 GFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTAAIECFEQRVGTLQVVEPSVFGLQVAYAGYFSTL---------- 397 (410) T ss_pred cccccccccchhhhhcccceEEecCCCcCeeeEeccceeeeeecCCceeEeeCCchhhhhhhheeeeeec---------- Confidence 2443 222237778899999999999999999999877544200001 12233333222332221 Q ss_pred hhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 270 QTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 270 et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) |+..||||=+.=+ T Consensus 398 -----------~~~~~gliPv~g~ 410 (410) T protein:vir:83 398 -----------VVNEDAIVPLVGS 410 (410) T ss_pred -----------cccccceeeeccC Confidence 2233444422222 No 155 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.12 E-value=0.00016 Score=41.34 Aligned_cols=273 Identities=12% Similarity=0.093 Sum_probs=149.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHH----HHhCcccccccCCCCeeeeeeeeeeecccCcccCCcee Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVI 76 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~----~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~I 76 (296) |-+....--.+---+ +...++ |.+.|+.-..+=+ ..|+..+..++.-|++.++|.=.-...+ ....|+.+ T Consensus 1 Ms~~n~~t~~~~~~s---g~~~al-~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~--~~~pG~~l 74 (401) T protein:vir:70 1 MSTPNNLTNVAVSAS---GEVDSL-LIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQ--VLAPGQSP 74 (401) T ss_pred CCCCccccccccccc---cchhHh-HHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEee--eecCCCCc Confidence 433221111111111 111222 7888888776655 4678889999999999999875444333 34567766 Q ss_pred chhheeeeecceeEEEEeec--ccc-c-C-HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc--------- Q lcl|Aclame:pro 77 PLSKVERKIHSEKKIELKKY--RKA-T-T-GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG--------- 142 (296) Q Consensus 77 plskv~~~~~~t~~~tikK~--~K~-v-T-dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~--------- 142 (296) .-+.+.. .-..++|... ... | - ||+ | +=|+..=.|--+|+..++++..|.-++..++-+.. T Consensus 75 d~~~~~~---dK~~ItID~lL~a~~~V~dlDe~-q-~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~ 149 (401) T protein:vir:70 75 AATSTQA---DKNQLVIDATVIARNTVAHLHDV-Q-GDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTN 149 (401) T ss_pred CCCCccc---ccEEEEeCceeehhhhhhhHHHH-H-hcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 5444433 2245677543 222 3 2 455 3 66775557888999999999999988666632110 Q ss_pred c----------e-------ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeec----hh Q lcl|Aclame:pro 143 T----------Q-------DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFG----LT 201 (296) Q Consensus 143 t----------~-------~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg----~t 201 (296) . . ..++..|-.|+.++...+..+.-. ..+ +++++|-+.+..|-..+--.+..|+ +. T Consensus 150 p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP--~~r-~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~ 226 (401) T protein:vir:70 150 PRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVD--ISD-VAILMPWRYFNVLRDADRIVDKTYTISQSGA 226 (401) T ss_pred CCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCC--ccc-eEEEcCHHHHHHHHhcCcccchhhccccCCc Confidence 0 0 012334555555555554444433 234 6666777777555444322233332 11 Q ss_pred hh----hhhheeEEEEeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHH Q lcl|Aclame:pro 202 YL----VDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGM 277 (296) Q Consensus 202 yl----~nfLG~~II~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~ 277 (296) |. -.+.|+.|++|+.+|.|-...+ ..++ +..+.+.+|+...|++..+|+.-.+. --++.|+.-+++= T Consensus 227 ~~~G~v~~vaGv~Vv~SnnlP~~a~~it-~~~l-------s~a~~G~~y~~~~d~s~~~~v~f~~~-Av~tvk~~~lt~~ 297 (401) T protein:vir:70 227 TIQGFTLSSYNCPVIPSNRFPKYSQGQT-HHLL-------SNEDNGYRYDPLPAMNGAIAVLFTAD-ALLVGRSIDVTGD 297 (401) T ss_pred cccceEEEEeceEEEeeccccccccccc-cccc-------cccCCCccCCCCccccceeEEEEehh-heEEEEeeccccc Confidence 21 1366899999999997532111 1111 12345788888889999999875443 3333444333322 Q ss_pred Hhh------------------hhccceEEEEEec------CCC Q lcl|Aclame:pro 278 LMY------------------PERIDGIVKVTLT------PGV 296 (296) Q Consensus 278 ~lf------------------pE~~dgvv~~tI~------~~v 296 (296) .|+ |-|.|.+..++-+ .++ T Consensus 298 ~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~ 340 (401) T protein:vir:70 298 IFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVE 340 (401) T ss_pred hhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCcccccccc Confidence 222 3345555554332 222 No 156 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.10 E-value=9.9e-05 Score=42.54 Aligned_cols=272 Identities=11% Similarity=0.065 Sum_probs=115.3 Q ss_pred Cccccccccccce-ehhhhhhhhhhhhHHHHhhhHHHHHHHhCccccccc-----CCCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLI-KSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISV-----SEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~-~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~-----~pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |- .+|. ...++ .+=.....|++++- +... |.|..+. +.|+||+||.=.-....+..-..|. T Consensus 1 Ma-------N~llT~ip~i---ia~~al~~l~~~lV-~~~l--Vnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~ 67 (423) T protein:vir:17 1 MP-------NNLDSNVSQI---VLKKFLPGFMSDLV-LAKT--VDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDIS 67 (423) T ss_pred Cc-------cchhhhhHHH---HHHHHHHHHHhhcc-cchh--hcccCCcchhhcccCCEEEEeeCCcceeecccCcccC Confidence 21 1111 11122 22233444444443 1121 5665544 3699999976322222222223444 Q ss_pred eechhheeeeecceeEEEEeeccc---ccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcC-ccceecchhh Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKKYRK---ATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG-TGTQDALGAG 150 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK~~K---~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkta-t~t~~~t~~~ 150 (296) -|..+.+... ..+++|.|..- .++|+.- .....+ ..+.-++=.++|+++||.|++..+... .......+.. T Consensus 68 ~~~~~~l~e~---~v~l~id~~k~va~~v~d~E~-~~~i~~-~~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~ 142 (423) T protein:vir:17 68 GQNKNNLISG---KATGRVGNYITVAVEYQQLEE-AIKLNQ-LEEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTP 142 (423) T ss_pred CcccCccccc---eeEEEeeceeeeeeeecHHHH-hcChhH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcc Confidence 4555666543 35688866444 3477763 234333 233333345778999999998776443 2211111111 Q ss_pred HHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCc-cccceeechhh-----h-hhhheeEEEEeccCCCce Q lcl|Aclame:pro 151 LQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAG-ITTQTAFGLTY-----L-VDFTGTVIISTNDVTKGE 221 (296) Q Consensus 151 lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~-i~~q~~fg~ty-----l-~nfLG~~II~S~kV~~G~ 221 (296) ..+ .+.|.++..++++.. ...-.++++|...+.+|++-. +......+.+- + .++.|.+|+.|+.||..+ T Consensus 143 -~~a-~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T 220 (423) T protein:vir:17 143 -ITK-WSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRT 220 (423) T ss_pred -ccc-HHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCcccc Confidence 112 234555666666542 124688999999999887653 33322222221 1 357789999999999754 Q ss_pred EEEEcccceE-EEEecCcc--hhhhhhh---c-cccccccce---------EE--Eeccccceee-------hhhhhhH- Q lcl|Aclame:pro 222 IWATVPENII-FAYINPNN--SELAKEF---N-LYGDPTGYI---------GM--NHFQENTTLT-------IQTLLVS- 275 (296) Q Consensus 222 ~~~t~~~Nl~-~ay~~~~~--g~~~~~f---~-~~td~tGli---------Gv--~h~~~~~~~t-------~et~~~~- 275 (296) ..+-..--.. .+..-+.+ .+.++.. . .+...+|.+ || .|.+.+..++ .|-.+.. T Consensus 221 ~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~ 300 (423) T protein:vir:17 221 QGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTAD 300 (423) T ss_pred ccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEec Confidence 4432110000 00000000 0000000 0 000111211 11 1222221110 0000000 Q ss_pred HHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 276 GMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 276 ~~~lfpE~~dgvv~~tI~~~v 296 (296) +..+.+ |-..++|.++. T Consensus 301 ~~~~a~----~~~tv~i~p~~ 317 (423) T protein:vir:17 301 ANSDSS----GDVTVTLSGVP 317 (423) T ss_pred cccccc----CceEEEecCcc Confidence 000000 11234444322 No 157 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=97.09 E-value=0.00013 Score=41.81 Aligned_cols=257 Identities=14% Similarity=0.136 Sum_probs=146.1 Q ss_pred hhhhhHHHHhhhHHHHHHHhCc-----ccccccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecceeEEEEeec Q lcl|Aclame:pro 22 ITIDVTNKFQENISKLLEMLGV-----TRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHSEKKIELKKY 96 (296) Q Consensus 22 ~siDf~~~f~~~i~~L~~~LgV-----tr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~t~~~tikK~ 96 (296) ++|.++.+|+..|++-+..--+ .+.....-|.+|++|+=...|..+-+ -+.-++...++.+. .+++++-.|+ T Consensus 1 Main~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~--R~~g~~~g~v~~~~-et~tl~qdR~ 77 (290) T protein:vir:78 1 MAINYVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHT--RNKGYNEGSASNTN-KSYTIDFDRD 77 (290) T ss_pred CchhHHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccc--cCCCcccCccccce-eeEEeecccc Confidence 7899999999999887743222 33445667999999997776666422 23334444444432 3566666666 Q ss_pred ccc-cC----HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------eecchhhHHHHHHHHHHHHHHh Q lcl|Aclame:pro 97 RKA-TT----GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------QDALGAGLQGALASAWGKLQVL 165 (296) Q Consensus 97 ~K~-vT----dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------~~~t~~~lQ~Ala~~~~~~~~~ 165 (296) +.= +- ||+ | +-..+-+-..++....++-.+|.-.|+.|.+...+ .+.++++.=.++-.+..+ T Consensus 78 ~~F~vD~~DvDEt-~--~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~---- 150 (290) T protein:vir:78 78 VEFFVDVMDVDET-G--QALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRK---- 150 (290) T ss_pred ceeeccccchhHH-h--hhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHHH---- Confidence 552 32 443 2 22344556667888889999999989877533221 122445444444444444 Q ss_pred hccccCcceEEEEcHHHHHHHhcCC-cccc--c-eeechh----hhhhhheeEEEEecc---------CCCceEEEEccc Q lcl|Aclame:pro 166 FEDYGSERAIVFANSLDVAEYIAKA-GITT--Q-TAFGLT----YLVDFTGTVIISTND---------VTKGEIWATVPE 228 (296) Q Consensus 166 Feded~~~~VlFvNP~Daa~~l~~a-~i~~--q-~~fg~t----yl~nfLG~~II~S~k---------V~~G~~~~t~~~ 228 (296) +.+......|+||+|.=.. +|.++ .++- + ..++-+ ....+.|++|+.... .-.|-.-.+.+. T Consensus 151 ldevp~~~rvl~vtp~~~~-lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak 229 (290) T protein:vir:78 151 VKKYGTQNLVMYVSPDVMA-ALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAK 229 (290) T ss_pred HHhcCCCCeEEEECHHHHH-HHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCCcc Confidence 4334456789999997555 55443 3321 1 112211 123477888876331 334655667777 Q ss_pred ceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 229 NIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 229 Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) +||+-=++|.. -.--.=.|...++-=..+...+.--++=-..+.+..+....+||.... .| T Consensus 230 ~in~ii~~~~a----~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~---~~ 290 (290) T protein:vir:78 230 KLNFLLVNKGS----VVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIAST---EV 290 (290) T ss_pred ceeEEEEcCCc----eeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEe---eC Confidence 78877777641 111111122222221222233334555566677788899999988653 23 No 158 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=96.95 E-value=0.00018 Score=41.07 Aligned_cols=276 Identities=10% Similarity=0.050 Sum_probs=120.4 Q ss_pred Cccccccccccc-eehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccC-----CCCeeeeeeeeeeecccCcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENL-IKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVS-----EGMTLKTYAGYDVTLAEGNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl-~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~-----pG~tIt~pk~~yig~A~gdVaEGe 74 (296) |- .+| +...++ .+=.....|++.+- +.. -|.|..+.+ .|+||++|.=.-....+.....|. T Consensus 1 Ma-------N~llT~~p~i---ia~~aL~~l~~~lV-~~~--lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~ 67 (423) T protein:vir:10 1 MP-------NNLDSNVSQI---VLKKFLPGFMSDLV-LAK--TVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDIS 67 (423) T ss_pred Cc-------cchhhhhHHH---HHHHHHHHHHhhcc-cch--hhcccCCCcccccccCCEEEEeeCCceeeeccCCcccc Confidence 21 111 111111 22233444444443 111 256655433 699998876443333333333455 Q ss_pred eechhheeeeecceeEEEEeeccc---ccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccceec-chhh Q lcl|Aclame:pro 75 VIPLSKVERKIHSEKKIELKKYRK---ATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDA-LGAG 150 (296) Q Consensus 75 ~Iplskv~~~~~~t~~~tikK~~K---~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t~~~-t~~~ 150 (296) .|..+.+... ..+++|.|..- .++|+.- ....++ ..+.-++=.++|+++||.|++..+......... .+.. T Consensus 68 ~~~~~dl~e~---~v~l~id~~k~va~~v~d~E~-~~~i~~-~~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~ 142 (423) T protein:vir:10 68 GQNKNNLISG---KATGRVGNYITVAVEYQQLEE-AIKLNQ-LEEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTP 142 (423) T ss_pred ccccCccccc---eeEEEeeceeeeeeeechHHH-hcChhh-HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcc Confidence 5655555543 35678866444 3476663 233333 233334446779999999998766543221111 1111 Q ss_pred HHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCc-cccceeechhhh------hhhheeEEEEeccCCCce Q lcl|Aclame:pro 151 LQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAG-ITTQTAFGLTYL------VDFTGTVIISTNDVTKGE 221 (296) Q Consensus 151 lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~-i~~q~~fg~tyl------~nfLG~~II~S~kV~~G~ 221 (296) . .+ .+.|.++..++++.. ...-.++++|...+.+|++-. +......+.+-+ .++.|.+|+.|+.||..+ T Consensus 143 ~-~a-~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T 220 (423) T protein:vir:10 143 I-TK-WSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRT 220 (423) T ss_pred c-ch-HHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCcccc Confidence 1 12 234555666666542 124688999999998887543 443333322211 357789999999999855 Q ss_pred EEEEcc-cceEEEEecCcch--hhhhhhc----cccccccce---------E--EEecccccee-------------ehh Q lcl|Aclame:pro 222 IWATVP-ENIIFAYINPNNS--ELAKEFN----LYGDPTGYI---------G--MNHFQENTTL-------------TIQ 270 (296) Q Consensus 222 ~~~t~~-~Nl~~ay~~~~~g--~~~~~f~----~~td~tGli---------G--v~h~~~~~~~-------------t~e 270 (296) ..+-.. --...++.-+.+. +-++... .+...+|.+ | ..|.+.+..+ +.+ T Consensus 221 ~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~ 300 (423) T protein:vir:10 221 QGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTAD 300 (423) T ss_pred ccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEee Confidence 443211 1111122211110 0000000 000111211 1 1133332211 111 Q ss_pred hhhh----HHHHhhhhccceEEE---EEecCCC Q lcl|Aclame:pro 271 TLLV----SGMLMYPERIDGIVK---VTLTPGV 296 (296) Q Consensus 271 t~~~----~~~~lfpE~~dgvv~---~tI~~~v 296 (296) .... ..++++|=.+....- .++++.+ T Consensus 301 ~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~ 333 (423) T protein:vir:10 301 ANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQV 333 (423) T ss_pred eeeccCCceeeeccCccccccCCcccccccccc Confidence 1111 124444433221100 0111111 No 159 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=96.77 E-value=0.00011 Score=42.33 Aligned_cols=260 Identities=15% Similarity=0.065 Sum_probs=122.6 Q ss_pred Cccc-----cccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeeecccCcccCCce Q lcl|Aclame:pro 1 MVTS-----RTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEV 75 (296) Q Consensus 1 ~~~~-----~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~ 75 (296) ...+ +..++.++....--+...--++..+.... +....++-+..+.++.....+|.-.--..+ .-+.||.. T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~---~~~~~~i~~~~~~~~i~~~~~~~~~~~~~a-~~~~eG~~ 300 (517) T protein:vir:97 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDA---VNDEGSLLPFIRHENLPTLVVGGDNALTQG-TGHTTGTD 300 (517) T ss_pred HHhcccccccceeeeecccccccccccchHHHHHHHHh---hhhhccceeeeeeccccceeeeccccccee-eeeecCCc Confidence 0000 00111111100000001111223332222 222333444455555555555543222234 36889999 Q ss_pred echhheeeeecceeEEEEeeccccc--CHHHHHhhcCCch---hHHHHHHHHHHHHhhhhHHHHHHHhcCccc-----e- Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEA---VTNTDNALVRQLQKKIRTDFVTALKTGTGT-----Q- 144 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygda---v~etd~QL~~~iq~kIdnD~~~aLktat~t-----~- 144 (296) .|.+.++.. ..++++|+++.-+ |.|-|+.+.+++. .+--.++|+..++.+.+..|+.==.+++.. . T Consensus 301 kp~s~~tf~---~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a 377 (517) T protein:vir:97 301 KTESNITLQ---TRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVV 377 (517) T ss_pred cccccccee---eEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccc Confidence 999998875 5778888888864 8898888888764 356789999999999999998432222110 0 Q ss_pred ------e-cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee-----chhhhhhhheeEEE Q lcl|Aclame:pro 145 ------D-ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-----GLTYLVDFTGTVII 212 (296) Q Consensus 145 ------~-~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f-----g~tyl~nfLG~~II 212 (296) + ...+.+. +.+..+....... ...+.+|||.+.+.+++-.+-.-+..| +..-. ..+|+.-+ T Consensus 378 ~~~~~~~~~~~~~~~----d~i~~l~~a~~~a--~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~-~l~G~~~~ 450 (517) T protein:vir:97 378 GDAWATNVTGTTNIQ----ELLEKLSVATPKA--ADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIA-THFGFNRL 450 (517) T ss_pred cccccccccccchHH----HHHHHHHHHhhhc--cCCEEEECHHHHHHHHHhhcCCCCeeccCcCCccccc-ccCCcccc Confidence 0 1112222 1222233322221 235688999999966543322222222 11111 13453222 Q ss_pred EeccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEE-e--ccccceeehhhhhhHHHHhhhhccceEEE Q lcl|Aclame:pro 213 STNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMN-H--FQENTTLTIQTLLVSGMLMYPERIDGIVK 289 (296) Q Consensus 213 ~S~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~-h--~~~~~~~t~et~~~~~~~lfpE~~dgvv~ 289 (296) .+ .+..|+.... |++-|++ .|..|...+- . ..+......|+ .++|-++-||+. +. T Consensus 451 ~~-~~~~~~~~~~---~~~~y~i--------------~~~~g~~~~~~fd~~~n~~~f~~~~-~~~g~i~~~~r~---a~ 508 (517) T protein:vir:97 451 VQ-SVAVDEKTAV---SLSGYVT--------------NGSRGMEFEQGTILVENNKEYLFEM-PISGSLEYKGTT---AY 508 (517) T ss_pred cc-ccccCceeEe---eccccEE--------------EeecceeeeeeeecccCceeEeeee-eeccccccccce---EE Confidence 22 2333332211 2222211 1222222111 1 12223333443 455666777775 57 Q ss_pred EEecCCC Q lcl|Aclame:pro 290 VTLTPGV 296 (296) Q Consensus 290 ~tI~~~v 296 (296) .+.+||| T Consensus 509 ~~~~p~~ 515 (517) T protein:vir:97 509 GTYTPPV 515 (517) T ss_pred EEEcCCC Confidence 8999999 No 160 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=96.65 E-value=0.00044 Score=39.01 Aligned_cols=260 Identities=9% Similarity=0.021 Sum_probs=127.0 Q ss_pred hhhhhhhHHHHhhhHHHHHHHhCcccccccC---------CCCeeeeeeeeeeecccCcccC-Cc---eechhheeeeec Q lcl|Aclame:pro 20 YPITIDVTNKFQENISKLLEMLGVTRKISVS---------EGMTLKTYAGYDVTLAEGNVPE-GE---VIPLSKVERKIH 86 (296) Q Consensus 20 ~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~---------pG~tIt~pk~~yig~A~gdVaE-Ge---~Iplskv~~~~~ 86 (296) =| .|.|+++|+..|++-+..--++..+.+. -|.+|++|+=...|..+-+=.- |. .+.++. T Consensus 1 MA-~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~------ 73 (299) T protein:vir:79 1 MA-ALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAW------ 73 (299) T ss_pred Cc-cchhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcce------ Confidence 12 6889999999999888766665544332 2899999997666655422211 11 222222 Q ss_pred ceeEEEEeecccc-cCHHHHHhhcCCchhH-HHHHHHHHHHHhhhhHHHHHHHhcCc---cc----eecchhhHHHHHHH Q lcl|Aclame:pro 87 SEKKIELKKYRKA-TTGEDIQMYGSNEAVT-NTDNALVRQLQKKIRTDFVTALKTGT---GT----QDALGAGLQGALAS 157 (296) Q Consensus 87 ~t~~~tikK~~K~-vTdEAIqlsGygdav~-etd~QL~~~iq~kIdnD~~~aLktat---~t----~~~t~~~lQ~Ala~ 157 (296) .+++++-+|++.= +-+=.+..+.+...++ -..++....++-.||.-.|+.|.++. ++ +..++++. .+ T Consensus 74 ~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~----y~ 149 (299) T protein:vir:79 74 EPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNV----LE 149 (299) T ss_pred eEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHH----HH Confidence 2344444444442 3311111222222222 23444556677778888887663221 11 11233433 34 Q ss_pred HHHHHHHhhccc--cCcceEEEEcHHHHHHHhcCCcccc--ceeechhh----hhhhheeEEEE--eccCC------Cce Q lcl|Aclame:pro 158 AWGKLQVLFEDY--GSERAIVFANSLDVAEYIAKAGITT--QTAFGLTY----LVDFTGTVIIS--TNDVT------KGE 221 (296) Q Consensus 158 ~~~~~~~~Fede--d~~~~VlFvNP~Daa~~l~~a~i~~--q~~fg~ty----l~nfLG~~II~--S~kV~------~G~ 221 (296) .+.++...+.+- .....|+||+|.=..-+..+.+++- +..++... ...+.|++|+. |+..+ .|. T Consensus 150 ~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~ 229 (299) T protein:vir:79 150 VFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGW 229 (299) T ss_pred HHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCc Confidence 455555556543 2346899999976664444444431 11111111 12367888876 44343 344 Q ss_pred EEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 222 IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 222 ~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .-.+-+-.||+-=+++.. -.--.-.|...+.-=.-+...+ .-++=-..+.+..+....+|| -+.++.+= T Consensus 230 ~~~~~ak~in~ii~~~~a----~~~~~K~~~~~~~~P~~~~~~~-~~~~~r~y~d~~v~~nk~~~i-~~~~~~a~ 298 (299) T protein:vir:79 230 KVGAGAKQIFMSLVHPSA----IITPVSYQFSKLDEPTAVTEGK-YFYFEESFEDVFILNKKADAI-QFVVEGAG 298 (299) T ss_pred cccCcccccceEEEcCCe----eeeeEeeeeEEeecCCCCCccc-eeeeeeeeeeeeeeccccCeE-EEEeeecC Confidence 433334445554444320 0000111111111000111112 224445566777888889998 44554444 No 161 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.54 E-value=0.00053 Score=38.56 Aligned_cols=271 Identities=11% Similarity=0.039 Sum_probs=134.9 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCccc------ccccCCCCeeeeeeeeee-ecccCcccC- Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTR------KISVSEGMTLKTYAGYDV-TLAEGNVPE- 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr------~~~~~pG~tIt~pk~~yi-g~A~gdVaE- 72 (296) |-++|-+ +++...+- -==.++.+-....+.|++- ||-. .+-.++|+.+++|-|.-+ |+++.+|-. T Consensus 1 Ma~T~l~---D~iipe~~---vf~~Yv~~~~~e~~~l~qS-Gii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D 73 (349) T protein:vir:78 1 MAITTIG---DIVTGNIP---VLASYMTEDPVEKTAFFDS-GILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSND 73 (349) T ss_pred CCceEEe---eeeccCHH---HHHHHHHHhhHHhhhhhhc-cceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCC Confidence 6554422 11111100 0112444444455555552 4433 122478999999999876 566544522 Q ss_pred --CceechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCcc------ Q lcl|Aclame:pro 73 --GEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------ 142 (296) Q Consensus 73 --Ge~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~------ 142 (296) -+.++..|++.. .....+...+|+- +|=+--++| +||+.+..+|++....+.-.+.+++.|+.--+ T Consensus 74 ~~~~~~t~~kitt~---~~~a~~~~r~kaw~~~Dla~~lsG-~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~ 149 (349) T protein:vir:78 74 VYQDIATPRAIQTG---EMMARVAYLNEGFGQADLTVELTS-QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSAT 149 (349) T ss_pred Cccccccccccccc---ceeeeeeeeccccchhHHHHHhhC-chHHHHHHHHHHHHHhhHHHHHHHHHHHHhhccccccc Confidence 235677888864 3444555556765 665555789 79999999999999999999999998873211 Q ss_pred ---------cee------cchhhHHHHHHHHHHHHHH-hhccccCcceEEEEcHHHHHHHhcCCccc-cceeechhhhhh Q lcl|Aclame:pro 143 ---------TQD------ALGAGLQGALASAWGKLQV-LFEDYGSERAIVFANSLDVAEYIAKAGIT-TQTAFGLTYLVD 205 (296) Q Consensus 143 ---------t~~------~t~~~lQ~Ala~~~~~~~~-~Feded~~~~VlFvNP~Daa~~l~~a~i~-~q~~fg~tyl~n 205 (296) +.. -+++.+..| ..++-| .+.|..+.=..++||+.=.+++.++..|. .+..-|..-+.. T Consensus 150 ~~~~~~~~~t~d~s~~a~~~~~~~~dA----~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~t 225 (349) T protein:vir:78 150 DAYHEQNDMVVDVSATLGFDAGAFIDA----TQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFAT 225 (349) T ss_pred chhhhcccceeeeccccCCChhhhhhh----HHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcccCcccce Confidence 001 122333333 333333 34443444568999998888766655443 122222222345 Q ss_pred hheeEEEEeccCCCc--------eEEEEcccceEEEEecCcchhhhhhhccccccc----cceEEEeccccceeehhhhh Q lcl|Aclame:pro 206 FTGTVIISTNDVTKG--------EIWATVPENIIFAYINPNNSELAKEFNLYGDPT----GYIGMNHFQENTTLTIQTLL 273 (296) Q Consensus 206 fLG~~II~S~kV~~G--------~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~t----GliGv~h~~~~~~~t~et~~ 273 (296) ++|.+||.+..+|.- +.|+-.++.+.+....|. .....--|.. |-.=..+.+... .+- T Consensus 226 y~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~-----~~~et~rd~~~g~~~G~d~l~~R~~~-----~~h 295 (349) T protein:vir:78 226 YQGYRVIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPV-----MPLEYEREASRANGGGVETLWTRKTW-----LLH 295 (349) T ss_pred ecCeEEEEeCCCccccCCCCceEEEEEeecceEEEccCCCc-----cceeeecccccCCcceeEEEEEeeEE-----Eee Confidence 889999999999841 345666666666554432 0011111111 111111111000 011 Q ss_pred hHHHHh---------------hhhccce-----EEEEEecCCC Q lcl|Aclame:pro 274 VSGMLM---------------YPERIDG-----IVKVTLTPGV 296 (296) Q Consensus 274 ~~~~~l---------------fpE~~dg-----vv~~tI~~~v 296 (296) .-|+.| +|..-|= --+|-=.+.+ T Consensus 296 p~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I 338 (349) T protein:vir:78 296 PFGYRFTSAVITGNGTETIARSASWQDLANATNWNRVVDRKHV 338 (349) T ss_pred eeeeeeccccccCCccccccCCCChHHhcCCcCcccccChhhc Confidence 112211 1111000 0000000000 No 162 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=254 Identities=11% Similarity=0.078 Sum_probs=130.9 Q ss_pred hhhhhHHHHhhhHHHHHHH------hCccc----ccccCCCCeeeeeeeee-eecccCc-----ccCCceechhheeeee Q lcl|Aclame:pro 22 ITIDVTNKFQENISKLLEM------LGVTR----KISVSEGMTLKTYAGYD-VTLAEGN-----VPEGEVIPLSKVERKI 85 (296) Q Consensus 22 ~siDf~~~f~~~i~~L~~~------LgVtr----~~~~~pG~tIt~pk~~y-ig~A~gd-----VaEGe~Iplskv~~~~ 85 (296) ++|.++.+|+..|++-+.. ++..+ ......|.+|++|+-.. .|.++-+ .+.| .+-++. T Consensus 1 Mainya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g-~v~~~~----- 74 (346) T protein:vir:10 1 MTINYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVA-NYSNDW----- 74 (346) T ss_pred CcchhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccc-ccccce----- Confidence 8999999999999985432 22222 22345789999999853 4554322 1122 233333 Q ss_pred cceeEEEEeecccc-c----CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCc------c--ceecchhhHH Q lcl|Aclame:pro 86 HSEKKIELKKYRKA-T----TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT------G--TQDALGAGLQ 152 (296) Q Consensus 86 ~~t~~~tikK~~K~-v----TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat------~--t~~~t~~~lQ 152 (296) .+++++-.|++.= + -||+=+..-.++.++ ++....++--+|.-.|+.|-+.. . +...+.++.- T Consensus 75 -et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~---ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~ 150 (346) T protein:vir:10 75 -DSYELKNERYWSTLVDPSDIDETNMVVSLANITK---QFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNIL 150 (346) T ss_pred -eEEEeeccccceecccccchHHHHHHhHHHHHHH---HHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHH Confidence 3455666665552 2 244411122233333 34555566677888777764221 1 1112444444 Q ss_pred HHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc-cc------cceeechhhhhhhheeEEEE--eccCC----- Q lcl|Aclame:pro 153 GALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG-IT------TQTAFGLTYLVDFTGTVIIS--TNDVT----- 218 (296) Q Consensus 153 ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~-i~------~q~~fg~tyl~nfLG~~II~--S~kV~----- 218 (296) .++-.+..++.+.-- .....|+||+|.=.. .|.+++ ++ .++..-+ ....+.|++|+. |+..+ T Consensus 151 ~~i~~~~~~lde~~v--p~~~rvl~vTp~~~~-lLk~s~~f~k~~~v~~~~~i~~-~V~siDGv~Ii~VPs~r~~t~~~f 226 (346) T protein:vir:10 151 PAFDNMMLDFDEARI--PSTNRILYVTPKTNA-ILKRAEAMNRALTLKDPNNIQR-TVYSLDDVTIRVVPSDLMQTAYDF 226 (346) T ss_pred HHHHHHHHHHHHccC--CCCCeEEEECHHHHH-HHhhchhheeccccccccccce-eeeeecCeEEEEcchhhcccchhh Confidence 444444433333211 235689999996554 665553 32 2121111 122466888865 44443 Q ss_pred -CceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEec--cccceeehhhhhhHHHHhhhhccceEEEEEecCC Q lcl|Aclame:pro 219 -KGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHF--QENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) Q Consensus 219 -~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~--~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~ 295 (296) .|-...+.+-.||+-=++|. -=+ .-...-.+.+.-. .....--++=-..+.+..+....+||...-=++| T Consensus 227 ~~G~~~~t~ak~INfiiv~~~-A~i------a~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~ 299 (346) T protein:vir:10 227 SDGSKIIDTAKQIEMFLIYNG-VQI------APEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKP 299 (346) T ss_pred ccCccccCCccceeEEEECCc-eee------eeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeeccc Confidence 46666666667887777663 111 0011111111111 0111123444556677788889999976655666 Q ss_pred C Q lcl|Aclame:pro 296 V 296 (296) Q Consensus 296 v 296 (296) . T Consensus 300 ~ 300 (346) T protein:vir:10 300 K 300 (346) T ss_pred c Confidence 5 No 163 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=95.30 E-value=0.0023 Score=35.01 Aligned_cols=273 Identities=12% Similarity=0.110 Sum_probs=111.7 Q ss_pred Cccccccccccce-ehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccC-----CCCeeeeeee---eeeecccCccc Q lcl|Aclame:pro 1 MVTSRTYPEENLI-KSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVS-----EGMTLKTYAG---YDVTLAEGNVP 71 (296) Q Consensus 1 ~~~~~~~ae~nl~-~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~-----pG~tIt~pk~---~yig~A~gdVa 71 (296) |- .+|+ -..+ ..+=.....|++++- +... |.|..+.+ -||||++|.= .+-..+.++. T Consensus 1 MA-------Nsl~~l~p~---iia~~al~~l~~~lV-~~~l--V~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~- 66 (423) T protein:vir:10 1 MA-------NNLDANVSQ---IVLKKFLPGFMSDLV-LCKT--VDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDI- 66 (423) T ss_pred Cc-------cccccccHH---HHHHHHHHHHHhhcc-cchh--hccCCCccccccccCCEEEEeeCCceeeecccCccc- Confidence 21 1110 0111 112223333444333 2222 55545433 6999998762 2221111111 Q ss_pred CCceechhheeeeecceeEEEEeeccc---ccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhc-Cccceecc Q lcl|Aclame:pro 72 EGEVIPLSKVERKIHSEKKIELKKYRK---ATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKT-GTGTQDAL 147 (296) Q Consensus 72 EGe~Iplskv~~~~~~t~~~tikK~~K---~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLkt-at~t~~~t 147 (296) .|+ +.+.+.. ....++|.+... .++|+.-+ ...++= .+.-++=.++|+++|+.|+...+.. +....... T Consensus 67 t~~--~~~~l~e---~~v~l~id~~k~~a~~v~d~E~~-l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~ 139 (423) T protein:vir:10 67 TGK--SKNSLIS---AKATGEVGNYITVAVEYRQIEEA-LKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSP 139 (423) T ss_pred Ccc--ccccccc---ceEEEEecceeeeeeeeChHHHh-cChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 221 2223321 124567765444 34777643 233332 3344444678899999999755533 22222111 Q ss_pred hhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCCc-cccceeech------hhhhhhheeEEEEeccCC Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAG-ITTQTAFGL------TYLVDFTGTVIISTNDVT 218 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a~-i~~q~~fg~------tyl~nfLG~~II~S~kV~ 218 (296) +... .+ .+.|.++..++++.. ...-+++++|...+.++++-. +.....-+. ..+.+++|..|+.|+.|| T Consensus 140 ~t~~-~a-~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp 217 (423) T protein:vir:10 140 NTPI-KK-WSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLA 217 (423) T ss_pred cccc-cc-HHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCc Confidence 1111 12 134555556666542 124688999999998887542 322121121 122357788999999998 Q ss_pred ---CceE--EEEcccceEEEEecCcc-hh-hhhhhccccccccc---------eEE--Eecccccee-----------eh Q lcl|Aclame:pro 219 ---KGEI--WATVPENIIFAYINPNN-SE-LAKEFNLYGDPTGY---------IGM--NHFQENTTL-----------TI 269 (296) Q Consensus 219 ---~G~~--~~t~~~Nl~~ay~~~~~-g~-~~~~f~~~td~tGl---------iGv--~h~~~~~~~-----------t~ 269 (296) .|+. ..++....-+.++.... .+ -+..++.+..-+|. =|| .|.+.+.+| +. T Consensus 218 ~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V 297 (423) T protein:vir:10 218 SRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATV 297 (423) T ss_pred ccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEE Confidence 3542 12222222222322200 00 01122222222332 232 233333221 11 Q ss_pred --hhhhhHH----HHhhhhccceEEE---EEecCCC Q lcl|Aclame:pro 270 --QTLLVSG----MLMYPERIDGIVK---VTLTPGV 296 (296) Q Consensus 270 --et~~~~~----~~lfpE~~dgvv~---~tI~~~v 296 (296) .+...++ ++++|=.++-+-- .++++.+ T Consensus 298 ~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~ 333 (423) T protein:vir:10 298 MEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLL 333 (423) T ss_pred EecccccccCceEEEeccccccccCcccccceeccc Confidence 1111111 3444433321100 0111100 No 164 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=95.26 E-value=0.0024 Score=34.93 Aligned_cols=270 Identities=12% Similarity=0.049 Sum_probs=136.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccc------cccCCCCeeeeeeeeee-ecccCcccCC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRK------ISVSEGMTLKTYAGYDV-TLAEGNVPEG 73 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~------~~~~pG~tIt~pk~~yi-g~A~gdVaEG 73 (296) |-++|-+ +++...+- -==.|+.+-....+.|++- ||-.. +-.++|+.+++|-|+-+ |+++-++ .| T Consensus 1 Ma~T~l~---D~iipe~~---vf~~Yv~~~~~e~~~l~qS-Gii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~-~~ 72 (349) T protein:vir:94 1 MAITTIG---NIVTGNIP---VLASYMTEDPVEKTAFFNS-GILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNY-SN 72 (349) T ss_pred CCceEEe---eeeccChH---HHHHHHHHhHHHhhhhhhc-cceeccHHHHHHHhcCCCEEEeeeeecCCCCccccc-CC Confidence 6554422 11111110 0112444444555666662 54431 12478999999999875 6655344 34 Q ss_pred c----eechhheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc---- Q lcl|Aclame:pro 74 E----VIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT---- 143 (296) Q Consensus 74 e----~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t---- 143 (296) + .++..|++.. .....+.-.+|+- +|=+--++| +||.....+|++....+.-.+.+++.|+.--+. T Consensus 73 dt~~~~~t~~kit~~---~~~a~~~~r~kaw~~~Dla~~lsG-~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~ 148 (349) T protein:vir:94 73 DVYQDIATPRAIQTG---EMMARVAYLNEGFGQADLTVELTS-QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSA 148 (349) T ss_pred CCccccccccccccc---ceeeeeeeeccccchhHHHHHhhC-chHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccc Confidence 3 5778888864 2333334445654 666755689 699999999999999999999999988732110 Q ss_pred -----------ee------cchhhHHHHHHHHHHHHHH-hhccccCcceEEEEcHHHHHHHhcCCccc-cceeechhhhh Q lcl|Aclame:pro 144 -----------QD------ALGAGLQGALASAWGKLQV-LFEDYGSERAIVFANSLDVAEYIAKAGIT-TQTAFGLTYLV 204 (296) Q Consensus 144 -----------~~------~t~~~lQ~Ala~~~~~~~~-~Feded~~~~VlFvNP~Daa~~l~~a~i~-~q~~fg~tyl~ 204 (296) .. -+++.|..|+ .++-| .+.|..+.=..++||+.=.+++.+...|. .+..-|..-+. T Consensus 149 ~~~~~~~~~~~~d~~~~a~~~~~~~~~A~----~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ 224 (349) T protein:vir:94 149 TDAYHEQNDMVVDVSATSGFDAGAFIDAT----QTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFA 224 (349) T ss_pred cccccccCceeEEecccCCCChhhHHHHH----HHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccCcccc Confidence 00 1334444443 33333 34444444468899998777766655443 12122222233 Q ss_pred hhheeEEEEeccCCC--------ceEEEEcccceEEEEecCcchhhhhhhccccccc----cceEEEeccccceeehhhh Q lcl|Aclame:pro 205 DFTGTVIISTNDVTK--------GEIWATVPENIIFAYINPNNSELAKEFNLYGDPT----GYIGMNHFQENTTLTIQTL 272 (296) Q Consensus 205 nfLG~~II~S~kV~~--------G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~t----GliGv~h~~~~~~~t~et~ 272 (296) .++|..||.+..+|- -+.|.-.++.+.+....|. .......|.. |..=..+++. +. .+ T Consensus 225 ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~-----~~~E~~rd~~~g~~~G~d~L~~R~--~~---~~ 294 (349) T protein:vir:94 225 TYQGYRVIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPE-----MPLEYEREASRANGGGVETLWTRK--TW---LL 294 (349) T ss_pred eecCcEEEEeCCCccccCCCCceEEEEEeecceEEeecCCCC-----cceeeecccccCCcceeEEEEEee--EE---Ee Confidence 588999999999983 2445667777666655542 0111111111 1111111110 00 01 Q ss_pred hhHHHHh---------------hhhccceE-----EEEEecCCC Q lcl|Aclame:pro 273 LVSGMLM---------------YPERIDGI-----VKVTLTPGV 296 (296) Q Consensus 273 ~~~~~~l---------------fpE~~dgv-----v~~tI~~~v 296 (296) -.-|+.| +|..-|== -++-=.+.+ T Consensus 295 hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I 338 (349) T protein:vir:94 295 HPFGYSFTSAVITGNGTETIARSASWQDLANAANWNRVVDRKHV 338 (349) T ss_pred eeeeeeecccccCCCccccccCCCChHHhcCCcCcccccChhhc Confidence 1112211 11111000 000000000 No 165 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=94.70 E-value=0.0037 Score=33.90 Aligned_cols=267 Identities=12% Similarity=0.084 Sum_probs=115.3 Q ss_pred Ccccc------ccccccceehhhhhh-hhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeee----eeeeecccCc Q lcl|Aclame:pro 1 MVTSR------TYPEENLIKSTDLKY-PITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYA----GYDVTLAEGN 69 (296) Q Consensus 1 ~~~~~------~~ae~nl~~~~dl~~-a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk----~~yig~A~gd 69 (296) |.|-+ .+-....++.+|++- .---+..++|-+.+.+--.+|..-|.++..-+.+.++++ |.....+. . T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~-~ 79 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRD-E 79 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccc-c Confidence 33311 111111112223211 111234455655555544566666654333344555544 21111121 3 Q ss_pred ccCCceechhheeeeecceeEEEEeecccc--cCHHHHHhhcCCch-hHHHHHHHHHHHHhhhhHHHHHHHhcC------ Q lcl|Aclame:pro 70 VPEGEVIPLSKVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEA-VTNTDNALVRQLQKKIRTDFVTALKTG------ 140 (296) Q Consensus 70 VaEGe~Iplskv~~~~~~t~~~tikK~~K~--vTdEAIqlsGygda-v~etd~QL~~~iq~kIdnD~~~aLkta------ 140 (296) .+|+++.|-+..+.. ..++..||+.-- +|+|.++.+-++.. -+.-..+++.+++.+...-|+.-=.++ T Consensus 80 ~~~~~~~~~~~~~f~---~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~ 156 (315) T protein:vir:41 80 TGQKLAPPESTAEVK---TNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLR 156 (315) T ss_pred ccCcCCCCCCccccc---eeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcccc Confidence 456666666665542 456666776653 59999976665533 334444555555555555555320000 Q ss_pred --ccce-----ecchhhHH--------HHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCC-----ccccceeech Q lcl|Aclame:pro 141 --TGTQ-----DALGAGLQ--------GALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKA-----GITTQTAFGL 200 (296) Q Consensus 141 --t~t~-----~~t~~~lQ--------~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a-----~i~~q~~fg~ 200 (296) .+=. .+.....+ ..|-+.+..+...|-. +..+.+.+||+...++|++-. .+......++ T Consensus 157 ~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~-~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g 235 (315) T protein:vir:41 157 MSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRN-NLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGA 235 (315) T ss_pred ccccceecccccccccccccccccccHHHHHHHHHhcChHHhh-cCCceEEEEcHHHHHHHHHHhccCCCccccchhhcC Confidence 0100 00111111 1111122222222211 224679999999999887532 1322333333 Q ss_pred hhhhhhheeEEEEeccC-----CCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhh-- Q lcl|Aclame:pro 201 TYLVDFTGTVIISTNDV-----TKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLL-- 273 (296) Q Consensus 201 tyl~nfLG~~II~S~kV-----~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~-- 273 (296) .-.. ++|..|+....+ +++.++++...|+. |.+-. + |.+.+.++...-.+.-++ T Consensus 236 ~~~t-l~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~--~~~~~--~--------------i~i~~~~~a~~~~~~~~~~~ 296 (315) T protein:vir:41 236 NSIL-YDGRPVQYVPALEALNDGKSRALFVVPTQLV--YGFWR--N--------------IKVVPDYDAEMRLTKYVASL 296 (315) T ss_pred CCce-ecccceEecccccccCCCCccEEEecccceE--EEecc--c--------------cEEEeeecCCCCceEEEEEE Confidence 3232 778777766665 57889999998863 33321 1 222222211110000000 Q ss_pred -hHHHHhhhhccceEEEEEecC Q lcl|Aclame:pro 274 -VSGMLMYPERIDGIVKVTLTP 294 (296) Q Consensus 274 -~~~~~lfpE~~dgvv~~tI~~ 294 (296) +.+-... -++.+...|+- T Consensus 297 r~d~~~~~---~~~~a~~~~~v 315 (315) T protein:vir:41 297 RTDNHYED---EEGAVSATITV 315 (315) T ss_pred EeceeEEe---ccceeEeeeeC Confidence 0111111 22333333333 No 166 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=94.68 E-value=0.0021 Score=35.32 Aligned_cols=182 Identities=12% Similarity=0.065 Sum_probs=91.1 Q ss_pred hhheeeeecceeEEEEeecccccC--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc------------ Q lcl|Aclame:pro 78 LSKVERKIHSEKKIELKKYRKATT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT------------ 143 (296) Q Consensus 78 lskv~~~~~~t~~~tikK~~K~vT--dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t------------ 143 (296) .+- +--.+=.|- ||+ |. -.|-..|..+|+..++++.+|.-++..+..+..+ T Consensus 1 iD~------------lL~a~~~VdDiD~a-qa--~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~ 65 (221) T protein:vir:17 1 MDD------------LLVASQFVYDLDEI-LA--QWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFS 65 (221) T ss_pred CCc------------chhHHHHHHhHHHH-Hh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcc Confidence 000 000111221 555 43 4588999999999999999999999887644211 Q ss_pred --eecchhhHHHHHHHHHHHHHHhhcccc--CcceEEEEcHHHHHHHhcCC-ccccceee--------chhhhhhhheeE Q lcl|Aclame:pro 144 --QDALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKA-GITTQTAF--------GLTYLVDFTGTV 210 (296) Q Consensus 144 --~~~t~~~lQ~Ala~~~~~~~~~Feded--~~~~VlFvNP~Daa~~l~~a-~i~~q~~f--------g~tyl~nfLG~~ 210 (296) ..+...+--.+++.++-++..++++-+ ....+++|+|...+.+|+.. ..-.+..| .+.-+.++.|.+ T Consensus 66 ~~~~a~~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~ 145 (221) T protein:vir:17 66 VNIGAGNTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIR 145 (221) T ss_pred eeccccccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcE Confidence 111111212345666666767666543 24568999999999999742 11122222 222244588999 Q ss_pred EEEeccCCC--ceEE-------EEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhh Q lcl|Aclame:pro 211 IISTNDVTK--GEIW-------ATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) Q Consensus 211 II~S~kV~~--G~~~-------~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfp 281 (296) |+.|+.+|. |+-+ .+.+++.+-| + +|+++.. |+.-.+.- -.|++ ++ .=| T Consensus 146 V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~y----r-~~fs~~~----------glv~~~~A-vgtvk---l~---~~~ 203 (221) T protein:vir:17 146 IYKSNVLASLYGTNLVTDPGDATTSGENNGSY----R-PAITDRA----------GLVFHKEA-ADTVE---VL---LPP 203 (221) T ss_pred EEEeccCCcccccccccCCccccccccccccc----c-ccccceE----------EEEEcchh-eeeee---ee---cCC Confidence 999999997 4422 2222222222 1 2233333 32221110 00111 11 112 Q ss_pred hccceEEE-EEecCCC Q lcl|Aclame:pro 282 ERIDGIVK-VTLTPGV 296 (296) Q Consensus 282 E~~dgvv~-~tI~~~v 296 (296) .+.--|+. -.|..|- T Consensus 204 ~~~~~~~~~~~~~~~~ 219 (221) T protein:vir:17 204 SRPPLVISMFSIRRPD 219 (221) T ss_pred CCCceeeeeeeccCCC Confidence 22222221 2344443 No 167 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=94.44 E-value=0.0044 Score=33.49 Aligned_cols=253 Identities=9% Similarity=-0.002 Sum_probs=138.4 Q ss_pred hhhhhhhHHHHhhhHHHHHHHhCccccccc-------CCCCeeeeeeeeeeecccCcccCC-----ceechhheeeeecc Q lcl|Aclame:pro 20 YPITIDVTNKFQENISKLLEMLGVTRKISV-------SEGMTLKTYAGYDVTLAEGNVPEG-----EVIPLSKVERKIHS 87 (296) Q Consensus 20 ~a~siDf~~~f~~~i~~L~~~LgVtr~~~~-------~pG~tIt~pk~~yig~A~gdVaEG-----e~Iplskv~~~~~~ 87 (296) =|-+|.++.+|...|++.+..--++..+.. .-|.+|++|+-...|.++-+=..| ..|-+++ . T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~------e 74 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEY------E 74 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccc------e Confidence 134599999999999998886556665543 448999999987777765332122 1344433 3 Q ss_pred eeEEEEeecccc-cC----HHHHHhhcCCchhHH-HHHHHHHHHHhhhhHHHHHHHhcCcc----------ceecchhhH Q lcl|Aclame:pro 88 EKKIELKKYRKA-TT----GEDIQMYGSNEAVTN-TDNALVRQLQKKIRTDFVTALKTGTG----------TQDALGAGL 151 (296) Q Consensus 88 t~~~tikK~~K~-vT----dEAIqlsGygdav~e-td~QL~~~iq~kIdnD~~~aLktat~----------t~~~t~~~l 151 (296) +++|+-.|++.= +- ||+ .+--.++. ..++....++--||.-.|+-|-+... +.+.+.+++ T Consensus 75 t~tl~qDR~~~F~vD~mDvDET----n~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni 150 (312) T protein:vir:10 75 TKTMTQDRGRKFTLDAMDVDET----NFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTI 150 (312) T ss_pred eEEeeecccceeeccccchhhH----hhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHH Confidence 566777776652 32 443 22222222 34456667778888888887742211 112345555 Q ss_pred HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc--ceeechhh----hhhhheeEEEEeccCCCce---- Q lcl|Aclame:pro 152 QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT--QTAFGLTY----LVDFTGTVIISTNDVTKGE---- 221 (296) Q Consensus 152 Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~--q~~fg~ty----l~nfLG~~II~S~kV~~G~---- 221 (296) =.+|..+..++.+. +-....|+||.|. +...|+++.... ...++.+- ...+.|++||. ||+.- T Consensus 151 ~~~i~~~~~~lde~---~vp~~rvl~vTp~-~~~lLk~~~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~---VPs~r~~t~ 223 (312) T protein:vir:10 151 INKIKTGIKIIREN---GYNGPLVCHLTYD-SMFAIEEKVLEKLTAVTFAQGGIQTQVPSIDGCALIK---TPQNRMYSS 223 (312) T ss_pred HHHHHHHHHHHHHc---cCCCceEEEeChH-HHHHHhhhhhceecccccccceeeeeeeeecccEEEE---chhhhccce Confidence 44454444444442 1124679999994 778888764221 11222221 22366778763 55432 Q ss_pred ----------------EEEEcccceEEEEecCcchhhhhhhccccccccceEEEe---ccccceeehhhhhhHHHHhhhh Q lcl|Aclame:pro 222 ----------------IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNH---FQENTTLTIQTLLVSGMLMYPE 282 (296) Q Consensus 222 ----------------~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h---~~~~~~~t~et~~~~~~~lfpE 282 (296) .-.+.+-+||+-=++|+. =+ .-...-.+.+.- ++..+.--++=-..+.+..+.. T Consensus 224 ~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~a-~i------~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~n 296 (312) T protein:vir:10 224 ILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVDV-PL------AITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDN 296 (312) T ss_pred eeeccCcccccccCceeecCcccccceEEeCCce-ee------ceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeecc Confidence 233455567776666641 00 011111222210 1111223444455566777888 Q ss_pred ccceE-EEEEecCCC Q lcl|Aclame:pro 283 RIDGI-VKVTLTPGV 296 (296) Q Consensus 283 ~~dgv-v~~tI~~~v 296 (296) ..+|| |-+.=.+|| T Consensus 297 k~~~Iyv~~k~a~~~ 311 (312) T protein:vir:10 297 KANSVYANFKDAKPV 311 (312) T ss_pred ccCeEEEEeecccCC Confidence 99999 555556677 No 168 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=94.19 E-value=0.0051 Score=33.14 Aligned_cols=251 Identities=14% Similarity=0.119 Sum_probs=134.9 Q ss_pred hhhhhhhHHHHhhhHHHHHHHhCccccc-------ccCCCCeeeeeeeee-----eecccCcc----cCCceechhheee Q lcl|Aclame:pro 20 YPITIDVTNKFQENISKLLEMLGVTRKI-------SVSEGMTLKTYAGYD-----VTLAEGNV----PEGEVIPLSKVER 83 (296) Q Consensus 20 ~a~siDf~~~f~~~i~~L~~~LgVtr~~-------~~~pG~tIt~pk~~y-----ig~A~gdV----aEGe~Iplskv~~ 83 (296) =|-+|.++.+|...|++-+..--++..+ ...-|.+|++|+-.. .|.++-+= ..|. |-+ +. T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~-v~~---~~ 76 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGS-VTL---AW 76 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccc-eee---ee Confidence 1345999999999999988764455544 456678999999864 24443221 1232 322 22 Q ss_pred eecceeEEEEeecccc-cC----HHHHHhhcCCchhHH-HHHHHHHHHHhhhhHHHHHHHhcCc---c-ceecch--hhH Q lcl|Aclame:pro 84 KIHSEKKIELKKYRKA-TT----GEDIQMYGSNEAVTN-TDNALVRQLQKKIRTDFVTALKTGT---G-TQDALG--AGL 151 (296) Q Consensus 84 ~~~~t~~~tikK~~K~-vT----dEAIqlsGygdav~e-td~QL~~~iq~kIdnD~~~aLktat---~-t~~~t~--~~l 151 (296) ++++|+-.+++.= +- ||. ++--.++. ..++....++=-+|.=.|+.|-+.. . ..+.+. .+- T Consensus 77 ---et~tlt~DR~~~f~vD~mDvdET----n~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~ 149 (302) T protein:vir:78 77 ---SDYTLDYDLAQSFQIDAMDVDET----KNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASA 149 (302) T ss_pred ---eeEEeeeccceeeeccccchhhh----hhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhH Confidence 4677887776663 21 343 22222333 3344566677788888887774221 1 111111 123 Q ss_pred HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc-ccc---ceeech----hhhhhhheeEEEEeccCCCceEE Q lcl|Aclame:pro 152 QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG-ITT---QTAFGL----TYLVDFTGTVIISTNDVTKGEIW 223 (296) Q Consensus 152 Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~-i~~---q~~fg~----tyl~nfLG~~II~S~kV~~G~~~ 223 (296) +.+| ..+..+...+.+. ...|+||.|. +...|+++. ++- ...|+- +....+-|+.||. ||+.-.+ T Consensus 150 ~nvl-~~i~~~~~~~~e~--~~~vl~vtp~-~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~---VPs~r~~ 222 (302) T protein:vir:78 150 QALM-GDIATAMELVDDS--NQLILVTSPT-TLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQ---VPSEYLY 222 (302) T ss_pred HHHH-HHHHHHHHHhhcc--CCeEEEEChH-HHHHHhcchhhccceeccccccccccceeeeecccEEEE---chhhhcc Confidence 3333 3455666666653 4799999995 556777653 321 112221 1233466777763 4543222 Q ss_pred -----------EEcccceEEEEecCcchhhhhhhccccccccceEEEec---cccceeehhhhhhHHHHhhhhccceEEE Q lcl|Aclame:pro 224 -----------ATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHF---QENTTLTIQTLLVSGMLMYPERIDGIVK 289 (296) Q Consensus 224 -----------~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~---~~~~~~t~et~~~~~~~lfpE~~dgvv~ 289 (296) .+.+-+||+-=++++. - ..-...-.+.+.-. +..+.--++=-..+.+..+....+||. T Consensus 223 t~~~f~~G~~~~~~ak~INfiiv~~~a----~---ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~- 294 (302) T protein:vir:78 223 DKVAPKVGVPDYTGAKKIPYMIFKRDA----P---TGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGII- 294 (302) T ss_pred cceeccCCccccCCccceeEEEECCCe----e---eeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEE- Confidence 3444556666665531 0 00111222222111 111222455556677788899999999 Q ss_pred EEecCCC Q lcl|Aclame:pro 290 VTLTPGV 296 (296) Q Consensus 290 ~tI~~~v 296 (296) +.++++| T Consensus 295 ~~~~~~~ 301 (302) T protein:vir:78 295 KASFGTI 301 (302) T ss_pred Eeecccc Confidence 7788888 No 169 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=91.42 E-value=0.0036 Score=34.01 Aligned_cols=268 Identities=12% Similarity=0.074 Sum_probs=110.2 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHH---hCcccccccCCCCeeeeee---------------eee Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEM---LGVTRKISVSEGMTLKTYA---------------GYD 62 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~---LgVtr~~~~~pG~tIt~pk---------------~~y 62 (296) .-++.++++.- -+.++|.++.+-= |.++- |+.+ .-|.|-..+ .|++|++=+ |.. T Consensus 63 ~~~~~~ta~~~-a~~T~i~V~~~~~----f~~~~--l~~~~~~~EvirVtsV-ng~~lTV~RG~~~t~aa~iaag~~~~~ 134 (418) T protein:vir:96 63 FASAVVTAEAL-ADATVLTVENSDG----LTKGM--IFYNEATGENMRLELV-NGLNLTVKRQTGRIAAAIIAANTKLIV 134 (418) T ss_pred eeeEEEEEEEe-cCceEEEecCCcc----ccccc--EEEEecCCeEEEEEEE-eCCEEEEEEccCCeeeeeeecCceEEE Confidence 11223333332 3334443332222 33332 1100 122233333 488888854 334 Q ss_pred eecccCcccCCceechhheeee-ecceeEEEEeeccccc--CHHH-HHhhcCCchhHHHHHHHH---------HHHHhhh Q lcl|Aclame:pro 63 VTLAEGNVPEGEVIPLSKVERK-IHSEKKIELKKYRKAT--TGED-IQMYGSNEAVTNTDNALV---------RQLQKKI 129 (296) Q Consensus 63 ig~A~gdVaEGe~Iplskv~~~-~~~t~~~tikK~~K~v--TdEA-IqlsGygdav~etd~QL~---------~~iq~kI 129 (296) ||++ ..||..-|-+.-... ..+.. ..|.+-.-++ |++| +...|+++---.=.+.|. +..-++. T Consensus 135 ig~~---~eEGsd~~ta~~~k~~~vsN~-tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~kv~iE~ali~g~~~ 210 (418) T protein:vir:96 135 IGTA---FEEGSQRPTARSIQPVYVPNF-TQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFHATEQETAIFFGQAF 210 (418) T ss_pred eecC---cccccccCCcceecceeccch-hheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHHHHHHHHhhhccccc Confidence 4443 357877776642211 11111 1232222233 5555 333587665311111111 1111111 Q ss_pred h---------------HHHHHHHhcC----ccceecchhhHHHHHHHHHHHHHHhhccc---cC-cc---eEEEEcHHHH Q lcl|Aclame:pro 130 R---------------TDFVTALKTG----TGTQDALGAGLQGALASAWGKLQVLFEDY---GS-ER---AIVFANSLDV 183 (296) Q Consensus 130 d---------------nD~~~aLkta----t~t~~~t~~~lQ~Ala~~~~~~~~~Fede---d~-~~---~VlFvNP~Da 183 (296) . .-+..++++- ..+...+-+-|..++. ++|... +. .+ ++.+||++.- T Consensus 211 ~~~~ng~p~~~t~R~m~gI~~f~~~Nvi~ag~~~~~t~d~L~~~~~-------~a~~~g~n~G~~~~~~~y~~~V~a~~k 283 (418) T protein:vir:96 211 MGTYNGQPLHTTQGIVDAIRQYAPDNVNAMPNPTAVTYDDVVDATI-------DAFKWSVNVGDNTQRVMFCDTVGMRTM 283 (418) T ss_pred cCCCCCcccccccchhHHHHhhccccccccCCCCcCCHHHHHHHHH-------HHHhhcCCCCCcccceEEEEEeChHHH Confidence 1 1122222211 1111123334444443 444311 21 22 6788888764 Q ss_pred H---HHhcCCccc---cceeechh---hhhhhhe-eEEEEec-----cCCCceEEEEcccceEEEEec---Ccchhhhhh Q lcl|Aclame:pro 184 A---EYIAKAGIT---TQTAFGLT---YLVDFTG-TVIISTN-----DVTKGEIWATVPENIIFAYIN---PNNSELAKE 245 (296) Q Consensus 184 a---~~l~~a~i~---~q~~fg~t---yl~nfLG-~~II~S~-----kV~~G~~~~t~~~Nl~~ay~~---~~~g~~~~~ 245 (296) - ++.+ +|. ..+.+|+. |..|| | +.||..+ +||+|+++..-++++.++|++ +.-..+++. T Consensus 284 ~~I~k~~~--~I~~~~~en~~G~vv~~~~Td~-G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~ 360 (418) T protein:vir:96 284 QDIGRFFG--EVTVTQRETSYGMVFTEWKFFK-GRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQG 360 (418) T ss_pred HHHhhhhc--eeEeccccceeceEEEEEEeec-cEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccC Confidence 3 3333 453 34678874 44444 8 5687777 889999999999999999994 444444444 Q ss_pred hc--cccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEe----cCCC Q lcl|Aclame:pro 246 FN--LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTL----TPGV 296 (296) Q Consensus 246 f~--~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI----~~~v 296 (296) .. +.+-..---|--.+.....+..|- ++.+-- ..+..++|= ++.| T Consensus 361 G~~~~~~~~~~~~~~~~D~~~G~l~~El----tle~~N--~~a~a~itgl~~~~~~~ 411 (418) T protein:vir:96 361 GGENKSGATDYSYGHGVDAQGGSLTSEW----ALELLN--PQGCAVITGLQKAKERV 411 (418) T ss_pred CCcccccccccccccccccccCEEEEEE----EEEeec--ccccEEeeccccccccc Confidence 31 111000000111122223333331 112222 222222220 1111 No 170 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=90.97 E-value=0.018 Score=30.13 Aligned_cols=254 Identities=12% Similarity=0.034 Sum_probs=126.1 Q ss_pred hhhhhHHHHhhhHHHHHHHhCcccc---------cccCCCCeeeeeee-eeeecccCcccCCceechhheeeeecceeEE Q lcl|Aclame:pro 22 ITIDVTNKFQENISKLLEMLGVTRK---------ISVSEGMTLKTYAG-YDVTLAEGNVPEGEVIPLSKVERKIHSEKKI 91 (296) Q Consensus 22 ~siDf~~~f~~~i~~L~~~LgVtr~---------~~~~pG~tIt~pk~-~yig~A~gdVaEGe~Iplskv~~~~~~t~~~ 91 (296) ++|..+++|..-|.+-+..--.+.. ....-|.+|++|+- +..|.++-+=.-| .+-..++.+. .++++ T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g--~~~g~v~~~~-et~tl 77 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQD--NARKTISVGK-ETVKL 77 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccC--ccccccceee-eEEEe Confidence 7788888888777665532222221 23445899999996 3555554332233 3333333331 45666 Q ss_pred EEeecccc-c----CHHHHHhhcCCchhHHHHHH-HHHHHHhhhhHHHHHHHhcCccc---eecchhhHHHHHHHHHHHH Q lcl|Aclame:pro 92 ELKKYRKA-T----TGEDIQMYGSNEAVTNTDNA-LVRQLQKKIRTDFVTALKTGTGT---QDALGAGLQGALASAWGKL 162 (296) Q Consensus 92 tikK~~K~-v----TdEAIqlsGygdav~etd~Q-L~~~iq~kIdnD~~~aLktat~t---~~~t~~~lQ~Ala~~~~~~ 162 (296) +-.|++.= + =||. +.. .++..-++ ....+.--||.-.|+.|.+...+ .+.+.++.=.++-.+..++ T Consensus 78 ~~DR~~~f~iD~mDvdEn-~~~----~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~i~~~~~~l 152 (285) T protein:vir:79 78 THEDWFGYDLDQFDMDEN-GAY----TVENVVREHNKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDAYDTAEAYM 152 (285) T ss_pred eccccceecccccchhhh-hhh----hHHHHHHHHHhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHH Confidence 66666553 2 1343 222 23333333 44556678888888888544332 2234445444444444433 Q ss_pred HHhhccccCcceEEEEcHHHHHHHhcCCc-cc----c-ceeechhh---hhhhhe-eEEEE--eccCCCceEEEEcccce Q lcl|Aclame:pro 163 QVLFEDYGSERAIVFANSLDVAEYIAKAG-IT----T-QTAFGLTY---LVDFTG-TVIIS--TNDVTKGEIWATVPENI 230 (296) Q Consensus 163 ~~~Feded~~~~VlFvNP~Daa~~l~~a~-i~----~-q~~fg~ty---l~nfLG-~~II~--S~kV~~G~~~~t~~~Nl 230 (296) .+. +-....|+||.| ++...|+++. +. . |....+.+ ...+-| +.|+. |+..+. ..-+.+| T Consensus 153 de~---~vp~~rvl~vTp-~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt----~~~~k~I 224 (285) T protein:vir:79 153 FDN---EVPGGFVMFVSS-AYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKG----LGITNHV 224 (285) T ss_pred HHc---CCCCceEEEECh-HHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccC----cCcchhc Confidence 332 112468999988 5666776663 22 2 22211111 233556 56653 333321 1222456 Q ss_pred EEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 231 IFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 231 ~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) |+-=++|+. -+ +. .=.|...++-=-.++..+.--++=-..+.+..+....+||.. ..+++| T Consensus 225 nfiiv~~~a-~i--~~-~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~-~~~a~~ 285 (285) T protein:vir:79 225 NFILTPLSA-IA--PI-VKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYV-AATAGV 285 (285) T ss_pred cEEEecCce-ec--cc-eeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeee-eecccC Confidence 666665531 00 00 001211111111122233334444556677788889999955 478888 No 171 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=90.63 E-value=0.02 Score=29.91 Aligned_cols=268 Identities=14% Similarity=0.184 Sum_probs=144.4 Q ss_pred ehhhhhhhhhhhhHHHHhhhHHHHHHH------hCcccccccCCCCeeeeeeeeeeecccCcccCCceechhheeeeecc Q lcl|Aclame:pro 14 KSTDLKYPITIDVTNKFQENISKLLEM------LGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSKVERKIHS 87 (296) Q Consensus 14 ~~~dl~~a~siDf~~~f~~~i~~L~~~------LgVtr~~~~~pG~tIt~pk~~yig~A~gdVaEGe~Iplskv~~~~~~ 87 (296) ..+|- --+.|.++.+|...|++-+.. |...+-.-..-|.+|++|+-...|.++-+=..| -.--.++-+ -+ T Consensus 1 ~~~~a-n~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g--~~~g~v~~~-~e 76 (311) T protein:vir:99 1 MPTDA-ETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKG--FNSGTISDE-KT 76 (311) T ss_pred CCCcc-hhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccC--ccccceeee-ee Confidence 33332 234588999999999887654 444444334569999999988877765443333 122333332 24 Q ss_pred eeEEEEeecccc-cCHHHHHhhcCCchhHH-HHHHHHHHHHhhhhHHHHHHHhcCcc------------------ceecc Q lcl|Aclame:pro 88 EKKIELKKYRKA-TTGEDIQMYGSNEAVTN-TDNALVRQLQKKIRTDFVTALKTGTG------------------TQDAL 147 (296) Q Consensus 88 t~~~tikK~~K~-vTdEAIqlsGygdav~e-td~QL~~~iq~kIdnD~~~aLktat~------------------t~~~t 147 (296) +++|+-.|++.= +-.-.+.-+++--.++. ..++....+.--+|.-.|+.|-+... ..+.+ T Consensus 77 t~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt 156 (311) T protein:vir:99 77 IYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLD 156 (311) T ss_pred EEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccC Confidence 667776666653 22111112233233322 34555566677788888877732210 00123 Q ss_pred hhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc-cc--c-ceeechhh----hhhhheeEEEEe---cc Q lcl|Aclame:pro 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG-IT--T-QTAFGLTY----LVDFTGTVIIST---ND 216 (296) Q Consensus 148 ~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~-i~--~-q~~fg~ty----l~nfLG~~II~S---~k 216 (296) .++.=..|-.++.++.+ -.....|+||.| ++...|++++ ++ . ...||-+. ...+.|+.||+. +. T Consensus 157 ~~nvl~~l~~~~~~~~~----v~~~~rvl~vTp-~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r 231 (311) T protein:vir:99 157 ETNAYSQLKTGIGKVRK----YGTQNLVGYVSS-EVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNR 231 (311) T ss_pred HHHHHHHHHHHHHHHHh----cCCCCeEEEECh-HHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchh Confidence 34444445555555544 234569999999 5666777763 32 1 12222221 234668877654 32 Q ss_pred ------CCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEE Q lcl|Aclame:pro 217 ------VTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKV 290 (296) Q Consensus 217 ------V~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~ 290 (296) ...|..-.+.+-+||+-=++|+. =++ . .=.|...++-=..++..+.--++=-..+.+..+....+|| -+ T Consensus 232 ~~t~~~ft~G~~~~~~ak~INfiiv~~~a-~i~--~-~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~I-yv 306 (311) T protein:vir:99 232 FMTKYDFTDGAKPTEDAKAINFLVVAKPA-VIS--I-VKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGI-FV 306 (311) T ss_pred hcchhhhcCCccccCcccccceEEeCCCe-eee--e-eeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeE-EE Confidence 33677767777788888777741 110 0 0012222221112223334445555667778888899998 56 Q ss_pred EecCC Q lcl|Aclame:pro 291 TLTPG 295 (296) Q Consensus 291 tI~~~ 295 (296) .++.+ T Consensus 307 ~~k~A 311 (311) T protein:vir:99 307 SVKKA 311 (311) T ss_pred eeecC Confidence 66666 No 172 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=83.35 E-value=0.031 Score=28.84 Aligned_cols=273 Identities=13% Similarity=0.059 Sum_probs=116.5 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHH---hCcccccccCCCCeeeeee---------------eee Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEM---LGVTRKISVSEGMTLKTYA---------------GYD 62 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~---LgVtr~~~~~pG~tIt~pk---------------~~y 62 (296) .-++.++++.- -..++|.++.+-=| -++. |+.+ .-|.|-..+ .|++|++-+ |.. T Consensus 63 ~~~~~~ta~a~-a~~T~l~ve~~~~f----~~~~--l~~~~~~~Evirv~sV-ng~~lTV~Rg~~~t~aaaia~n~~~~~ 134 (418) T protein:vir:10 63 FASAVVTAEAA-ADATVLTVENSDGL----TKGM--IFYNEATGENMRLELV-NGLNLTVKRQTGRISAAIIAANTKLIV 134 (418) T ss_pred eeeEEEEEEEe-cCceEEEEcCccee----cccc--EEEEccCCeEEEEEEE-eCCEEEEEEecCCeeEEEEecCceEEE Confidence 23344444332 33344544333223 2221 1111 124444455 488888854 445 Q ss_pred eecccCcccCCceechhhee-eeecceeEEEEeeccccc--CHHHH-HhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHh Q lcl|Aclame:pro 63 VTLAEGNVPEGEVIPLSKVE-RKIHSEKKIELKKYRKAT--TGEDI-QMYGSNEAVTNTDNALVRQLQKKIRTDFVTALK 138 (296) Q Consensus 63 ig~A~gdVaEGe~Iplskv~-~~~~~t~~~tikK~~K~v--TdEAI-qlsGygdav~etd~QL~~~iq~kIdnD~~~aLk 138 (296) ||++ ..||..-|-+.-. +..-+.. ..|.+-.-.+ |.+|. ...|.+|.. +...+..+..+-+|.+-++.=-+ T Consensus 135 Ig~~---~eEGsd~~ta~~~k~~~vsNv-tQIF~~avsvSgTaqAs~~q~Gvsn~~-ese~drk~~~av~iEkalI~G~~ 209 (418) T protein:vir:10 135 IGTA---FEEGSQRPTARSIQPVYVPNF-TQIFRNAWALTDTARASYAEAGYSNIT-ESRRDCMDFHATEQETAIFFGQA 209 (418) T ss_pred eccc---cccccccCCcceecceeccch-hhhhhhhhhhhhhhhhccccccCchHH-HHHHHHHHHHHHHHHHHHhcccc Confidence 5554 3688888776422 2111111 1232211223 66662 236888664 44444444444455554443321 Q ss_pred cCccceec---chhhHHHHH-------------------HHHHHHHHHhhccc---cCcc----eEEEEcHHHHH---HH Q lcl|Aclame:pro 139 TGTGTQDA---LGAGLQGAL-------------------ASAWGKLQVLFEDY---GSER----AIVFANSLDVA---EY 186 (296) Q Consensus 139 tat~t~~~---t~~~lQ~Al-------------------a~~~~~~~~~Fede---d~~~----~VlFvNP~Daa---~~ 186 (296) ....+... .-.|+..++ .+....+.+.|+-. +..+ ++.+|||+.-- ++ T Consensus 210 ~~~~~~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~ 289 (418) T protein:vir:10 210 FMGTYNGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRF 289 (418) T ss_pred cCCCcCCcchhhHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhh Confidence 11110000 001111111 01111223444311 2222 78899887533 45 Q ss_pred hcCCccc---cceeechhhhhhhhe-eEEEEec-------cCCCceEEEEcccceEEEEecCcchhhhhhhccccccccc Q lcl|Aclame:pro 187 IAKAGIT---TQTAFGLTYLVDFTG-TVIISTN-------DVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGY 255 (296) Q Consensus 187 l~~a~i~---~q~~fg~tyl~nfLG-~~II~S~-------kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGl 255 (296) .+ +|. ..+.||+.+.+=-.| .+|++-+ .+|+|+++..-++++.++|++- -|.+-.|.=-++.-.+ T Consensus 290 ~~--~I~~~~~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~--R~~~~E~l~k~G~~~~ 365 (418) T protein:vir:10 290 FG--EVTVTQRETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDG--RNAKVENYGQGGGENK 365 (418) T ss_pred hh--heeecccceeeeEEEEEEEcceEEEEeecccccccccCCCceEEEEccccceEEEecc--ccccchhcccCCCccc Confidence 44 343 236688876543333 3443333 4999999999999999999963 2333333211110001 Q ss_pred eEE-----Ee--ccccceeehhhhhhHHHHhhhhccceEEEEEe----cCCC Q lcl|Aclame:pro 256 IGM-----NH--FQENTTLTIQTLLVSGMLMYPERIDGIVKVTL----TPGV 296 (296) Q Consensus 256 iGv-----~h--~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI----~~~v 296 (296) -|- -| +...-.++.|- ++.+ =+..|..++|= ++.| T Consensus 366 ~~~~~~~~~~~~D~~kG~iv~E~----tLe~--~N~~a~avitgl~~~~~~~ 411 (418) T protein:vir:10 366 SGATDYSYGHGVDAQGGSLTSEW----ALEL--LNPQGCAVITGLQKAKERV 411 (418) T ss_pred ccccccccccccccccceEEEEe----eeee--ecccceEEeeccceecccc Confidence 110 11 11222233331 1122 22333333321 1111 No 173 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=80.97 E-value=0.09 Score=26.32 Aligned_cols=259 Identities=11% Similarity=0.047 Sum_probs=112.8 Q ss_pred CccccccccccceehhhhhhhhhhhhHHH-HhhhHHHHHHHhCcccccc-----cC----CCCeeeeeeeee-eecc--c Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNK-FQENISKLLEMLGVTRKIS-----VS----EGMTLKTYAGYD-VTLA--E 67 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~-f~~~i~~L~~~LgVtr~~~-----~~----pG~tIt~pk~~y-ig~A--~ 67 (296) |- --||.. |-.. |..-+++|.+.|-..|... ++ -|+.+.+|-|.- +|+. . T Consensus 1 m~------------lsD~~v-----fN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~ 63 (325) T protein:vir:95 1 MA------------LSDLAV-----YSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRR 63 (325) T ss_pred Cc------------hhhhhh-----hhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeecccccccccccccc Confidence 11 112211 1111 2223333444333333311 11 399999999975 4432 1 Q ss_pred CcccCCceechhheeeeecceeEEEEeecccc--cCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHH----hcC- Q lcl|Aclame:pro 68 GNVPEGEVIPLSKVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTAL----KTG- 140 (296) Q Consensus 68 gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~--vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aL----kta- 140 (296) .++.+...++-.|+++-+ ...+++. .+|+ .++++--+.| .+|..+.-++++.++++-...+.+..+ ..+ T Consensus 64 ~~~~~~~~vt~~kitt~~--~~av~~~-r~~g~~~~d~~~~~~g-~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~ 139 (325) T protein:vir:95 64 RNAYGSGTVAEKVLKHLV--DTSVKVA-AGTPPVRLDPGQFRWI-QQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSAL 139 (325) T ss_pred ccCCCCceeccceecccc--ceeeEEe-cccCcccccHHHHhhc-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 256677778889988643 2222221 1233 3666643566 467776666666666555555543333 211 Q ss_pred ---cc-cee--cchhhH-HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceee---chhhhhhhheeE Q lcl|Aclame:pro 141 ---TG-TQD--ALGAGL-QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF---GLTYLVDFTGTV 210 (296) Q Consensus 141 ---t~-t~~--~t~~~l-Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~f---g~tyl~nfLG~~ 210 (296) +. ..+ ...++- +.--+..+-++.-+|.|..+.=..++||+.=.+++.+++.+.....| |.+-+..++|.. T Consensus 140 ~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~i~t~~G~~ 219 (325) T protein:vir:95 140 SQVSDVVYDATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNVVRDPFGKL 219 (325) T ss_pred cccccceeeeecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcccccccCCcE Confidence 10 001 001100 00011233334456776544445677888777777777666433322 222234588999 Q ss_pred EEEeccCCCce--------EEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhh Q lcl|Aclame:pro 211 IISTNDVTKGE--------IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPE 282 (296) Q Consensus 211 II~S~kV~~G~--------~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE 282 (296) ||.+..+|... .|.-.++.+-+-.-+| +....+|++.---.|.+-..+-+ ..+-.-|+.| -| T Consensus 220 VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~t-f~lhp~G~sw-~~ 289 (325) T protein:vir:95 220 LVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNND--------FDANEETKNGDENIIRTYQAEWS-YNIGVKGFAW-DK 289 (325) T ss_pred EEEeCCCCCCCccCceeEEEEEEecCeEEecCCCC--------ccccccccCcccceeeeeeeeee-EEeecceeee-ec Confidence 99999888533 2444444433222211 12222222211111111000000 1222333333 01 Q ss_pred ccceEEEEEecCCC Q lcl|Aclame:pro 283 RIDGIVKVTLTPGV 296 (296) Q Consensus 283 ~~dgvv~~tI~~~v 296 (296) - .... .|- T Consensus 290 s-----~~g~-sPt 297 (325) T protein:vir:95 290 A-----NGGK-SPT 297 (325) T ss_pred c-----cccC-CcC Confidence 0 0011 122 No 174 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=80.33 E-value=0.096 Score=26.17 Aligned_cols=269 Identities=12% Similarity=0.047 Sum_probs=120.4 Q ss_pred ccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeee--eeeeeeecccCccc---CCceechhheeee Q lcl|Aclame:pro 10 ENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKT--YAGYDVTLAEGNVP---EGEVIPLSKVERK 84 (296) Q Consensus 10 ~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~--pk~~yig~A~gdVa---EGe~Iplskv~~~ 84 (296) =|+-+...|.. ++++--.--..|++. ...+.. ..+-.+|.+ -++ ....|. -|. +|..+.-...++ T Consensus 1 ~d~f~~~~L~~-----~i~~~~~~~~~l~d~-~fp~~~-~~~t~~v~~~~~~~-~~~lap-~v~~~~~~~~~~~~~~~~- 70 (341) T protein:vir:34 1 MSMYTTAQLLA-----ANEQKFKFDPLFLRL-FFRESY-PFTTEKVYLSQIPG-LVNMAL-YVSPIVSGEVIRSRGGST- 70 (341) T ss_pred CCCcCHHHHHH-----HHHhccCccchhHHh-cCCccc-ccccceEEEEEeeC-CeeEEE-eecCCCCcceeccCceee- Confidence 23333333322 222222212223333 333321 112223332 111 111221 223 333333333221 Q ss_pred ecceeEEEEeecccccCHHHHHhhcCC-------chhHHHHHHH-------HHHHHhhhhHHHHHHHhcCcccee----- Q lcl|Aclame:pro 85 IHSEKKIELKKYRKATTGEDIQMYGSN-------EAVTNTDNAL-------VRQLQKKIRTDFVTALKTGTGTQD----- 145 (296) Q Consensus 85 ~~~t~~~tikK~~K~vTdEAIqlsGyg-------dav~etd~QL-------~~~iq~kIdnD~~~aLktat~t~~----- 145 (296) .+.+...=|....++.+.++.-.+| .|.....+++ ...|+..+.--|..+|.++.-... T Consensus 71 --~~~~~p~i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~ 148 (341) T protein:vir:34 71 --SEFTPGYVKPKHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFD 148 (341) T ss_pred --eEEecCccCccceeCHHHHHHHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCcc Confidence 1223332333445677777654444 2332222332 234555555567788866532110 Q ss_pred ---------------cchhhH----HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc------------ Q lcl|Aclame:pro 146 ---------------ALGAGL----QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT------------ 194 (296) Q Consensus 146 ---------------~t~~~l----Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~------------ 194 (296) .++... ....++-+.++.+..+..+....+++++++-...++.++.+-. T Consensus 149 ~~~vDfg~~~~~~~~~t~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~ 228 (341) T protein:vir:34 149 PVEVDMGRSEENNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSEL 228 (341) T ss_pred EEEEEeCCCCccceEecCCccCCcCCCchHHHHHHHHHHHHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhcccccccc Confidence 111110 0112334445555555555566778888887776666654310 Q ss_pred -----ceeechhhhhhhheeEEEEec-----------cCCCceEEEEcccce-EEEEecCcchhhhhhhccccccccceE Q lcl|Aclame:pro 195 -----QTAFGLTYLVDFTGTVIISTN-----------DVTKGEIWATVPENI-IFAYINPNNSELAKEFNLYGDPTGYIG 257 (296) Q Consensus 195 -----q~~fg~tyl~nfLG~~II~S~-----------kV~~G~~~~t~~~Nl-~~ay~~~~~g~~~~~f~~~td~tGliG 257 (296) ....+.+|+..+.|.+|+.-+ -+|+|++++.+++++ ..+|-.+...+.... ..... .-+.+ T Consensus 229 ~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~ddG~~~~~ip~~~v~l~p~g~~g~~~yg~~~d~~~~~~-~~~~~-~~~~~ 306 (341) T protein:vir:34 229 ETAVKDLGKAVSYKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQRE-GINAS-ARYPK 306 (341) T ss_pred cccccccccceeeeeecCCceEEEEcCEEEECCcEEeeecCCeEEEeeCCCcceEEEeeccccccccc-ceeee-eEeee Confidence 111244555445566553222 389999999999864 777755432221110 01000 01112 Q ss_pred EEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 258 MNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 258 v~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) ..+.. .+.-..++.+-|..+--|+++|+++++++- T Consensus 307 ~~~~~-~dp~~~~~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 307 NWVTT-GDPAREFTMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred eeeec-CCCcEEEEEEcccceeeeeCCCcEEEEEeC Confidence 11111 112234445556667788999999999998 No 175 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=73.19 E-value=0.17 Score=24.76 Aligned_cols=258 Identities=11% Similarity=0.009 Sum_probs=119.1 Q ss_pred ccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee----ecccCcccCCceechhheeeee Q lcl|Aclame:pro 10 ENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV----TLAEGNVPEGEVIPLSKVERKI 85 (296) Q Consensus 10 ~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi----g~A~gdVaEGe~Iplskv~~~~ 85 (296) =++-++..|.. ++++.- .-.-|++.+ ..+. +..+..+|.+ .++ ..|. .|.+|+.= ..+.+.. T Consensus 1 ~d~f~~~~l~~-----~i~~~p-~~~~l~~~~-fp~~-~~~~t~~i~i---~~~~g~~~la~-~v~~~~~~--~~~~~~g 66 (346) T protein:vir:63 1 MEIFDTLTLAG-----VIQSGP-ALSMYWQGF-YPNE-ITFDTDEILF---DLVFKDKKLAP-FVAPNVQG--RVIAARG 66 (346) T ss_pred CCccCHHHHHH-----HHHhcC-Cccchhhhc-Cccc-cccccceEEE---EEecCceeeee-eecCCCCc--ceecccc Confidence 33333444432 222221 112233333 3221 2223344432 122 1232 44443321 1122221 Q ss_pred cceeEEEEe--ecccccCHHHHHhh--------cCCchhH-------HHHHHHHHHHHhhhhHHHHHHHhcCcccee--- Q lcl|Aclame:pro 86 HSEKKIELK--KYRKATTGEDIQMY--------GSNEAVT-------NTDNALVRQLQKKIRTDFVTALKTGTGTQD--- 145 (296) Q Consensus 86 ~~t~~~tik--K~~K~vTdEAIqls--------Gygdav~-------etd~QL~~~iq~kIdnD~~~aLktat~t~~--- 145 (296) .++.+.+.- |..+.++.+.++.. |=..+.. +-...|.+.|...+.--+..+|.++.-..+ T Consensus 67 ~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~ 146 (346) T protein:vir:63 67 YTTKTFRPAYVKPKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEA 146 (346) T ss_pred eeeeEeecCccCccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCc Confidence 122232222 44445676666431 2222232 223455667777777778888876532211 Q ss_pred --------------------------cchhhHHHHHHHHHHHHHHhhcccc-CcceEEEEcHHHHHHHhcCCcccc---- Q lcl|Aclame:pro 146 --------------------------ALGAGLQGALASAWGKLQVLFEDYG-SERAIVFANSLDVAEYIAKAGITT---- 194 (296) Q Consensus 146 --------------------------~t~~~lQ~Ala~~~~~~~~~Feded-~~~~VlFvNP~Daa~~l~~a~i~~---- 194 (296) ++.+-+ ..+.+..+...+.. ....+++++|+=...++.++.+-. T Consensus 147 ~~~~~vdfg~~~~~~~~lt~~~~W~~~~adp~-----~di~~~~~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~ 221 (346) T protein:vir:63 147 FPMQRVDFGRDPALTVQLTGGAAWDQATSDPL-----GNIQTMRTTAWKKSNSTITRLTMGLDAWSLFSQKPAVVELLNL 221 (346) T ss_pred eeEEEEeeCCCccceeeecccccCCCCCCCHH-----HHHHHHHHHHHHccCCceEEEEECHHHHHHHhcCHHHHHHHhh Confidence 011111 22333334444432 345578888877766655543211 Q ss_pred -c-----------------eeechhh--hhhhheeEEEE------------eccCCCceEEEEcccce-EEEEecCcchh Q lcl|Aclame:pro 195 -Q-----------------TAFGLTY--LVDFTGTVIIS------------TNDVTKGEIWATVPENI-IFAYINPNNSE 241 (296) Q Consensus 195 -q-----------------~~fg~ty--l~nfLG~~II~------------S~kV~~G~~~~t~~~Nl-~~ay~~~~~g~ 241 (296) + ..+.+++ ..++.|.+|+. .+-+|+|++++.+++++ .++|-++...+ T Consensus 222 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d~~G~~~~~ip~~~v~~~p~~~~g~~~yg~~~d~~ 301 (346) T protein:vir:63 222 FYKGSTSDFNRSRLDDGSPVQYQGTIGGYNGMGTLELYTYHDTYTGDDNTEQEILGSYDVVGTGPGLQGTQCFGAIMDFK 301 (346) T ss_pred hccccccccchhhcccchhhhhhhhHhhhhccCCeEEEEeccEEEcCCCceeccccCCeEEEEecCCcceEEEeeccccc Confidence 0 0011111 11234555543 34588899999988764 77776653222 Q ss_pred hhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 242 LAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 242 ~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) . +.. ..- +-..+....+....++.+-|..+--|.++|++++++++ T Consensus 302 ~----~~~--~~~-~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~~~V~ 346 (346) T protein:vir:63 302 N----GLV--PTR-MFPKMWEEEDPSVAMLMTQSAPLMVPAQPNASFRMTVK 346 (346) T ss_pred c----Ccc--cce-eeeEEEEecCCCEEEEEEeeeccceecCCCcEEEEEeC Confidence 1 100 000 12333344444555666667778888999999999999 No 176 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=71.55 E-value=0.19 Score=24.49 Aligned_cols=267 Identities=12% Similarity=0.053 Sum_probs=118.1 Q ss_pred cccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccc-------------cCCCCeeeeeeeeee--ecccCcccCC Q lcl|Aclame:pro 9 EENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKIS-------------VSEGMTLKTYAGYDV--TLAEGNVPEG 73 (296) Q Consensus 9 e~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~-------------~~pG~tIt~pk~~yi--g~A~gdVaEG 73 (296) -.|--..-|| .||+++...-++ -..+.++.+.+| .+.+..+.+.+...- -.|. -|+.+ T Consensus 1 ~~~~~~~~~~-----~~~~~~~~d~~~-~~~l~~~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~-~v~~~ 73 (349) T protein:vir:10 1 MKNQKLQLDL-----QRFATPILDMFS-QNTVLDYTRNRQYPEMLGDTLFPAVKVPTLEVDILKAGSRVPTIAS-VSAFD 73 (349) T ss_pred CCcchhhHHH-----HHHHHHhhcccC-HHHHHHHHHhcCcchhhHhhcCCccccccceeEEEeeccCcceeee-eecCC Confidence 1121112122 233333211000 012233333222 233333333332110 0122 45566 Q ss_pred ceechhheeeeecceeEEEEeec--ccccCHHHHH-hhcCCch-h-HHH-------HHHHHHHHHhhhhHHHHHHHhcCc Q lcl|Aclame:pro 74 EVIPLSKVERKIHSEKKIELKKY--RKATTGEDIQ-MYGSNEA-V-TNT-------DNALVRQLQKKIRTDFVTALKTGT 141 (296) Q Consensus 74 e~Iplskv~~~~~~t~~~tikK~--~K~vTdEAIq-lsGygda-v-~et-------d~QL~~~iq~kIdnD~~~aLktat 141 (296) .+-|+.+-+. ...+.++-++ .+.++.+.+. +..++.+ . ... ..+|...|...+.-=|..+|.+|. T Consensus 74 ~~~~~~~r~~---~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gk 150 (349) T protein:vir:10 74 AEAEIGTREA---SKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGK 150 (349) T ss_pred CCcceecccc---eeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCe Confidence 6666544221 1223333333 3345666553 2222222 1 111 233444455555445677776652 Q ss_pred -----------------ccee---------cchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc- Q lcl|Aclame:pro 142 -----------------GTQD---------ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT- 194 (296) Q Consensus 142 -----------------~t~~---------~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~- 194 (296) .+.. ++.+-+. -|. .|.+ .. +...-+++++|+-...++.++++-. T Consensus 151 i~~~~~g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~-Di~-~~~~---~~---g~~p~~~vm~~~~~~~l~~~~~i~~~ 222 (349) T protein:vir:10 151 ITDKKNGIAIDYGVPKKHQETLSGTKTWDKSDASIID-NLQ-DWSD---SL---DVTPTRALTSKKVLRILMRSTEIKEA 222 (349) T ss_pred eEEcCCcEEEecccCccceeEecCcccCCCCCCCHHH-HHH-HHHH---Hh---CCCccEEEeCHHHHHHHhcCHHHHHH Confidence 1111 1122221 121 2222 22 3345678899987777666664421 Q ss_pred ----ce--ee----chhhhhhhheeEEEEe----------------ccCCCceEEEEcccce-EEEEecC-cchhhhhhh Q lcl|Aclame:pro 195 ----QT--AF----GLTYLVDFTGTVIIST----------------NDVTKGEIWATVPENI-IFAYINP-NNSELAKEF 246 (296) Q Consensus 195 ----q~--~f----g~tyl~nfLG~~II~S----------------~kV~~G~~~~t~~~Nl-~~ay~~~-~~g~~~~~f 246 (296) +. .. .-.|++.+.|.+|+.- +-+|+|.+++.+++++ ..+|-.. ...++...- T Consensus 223 ~~~~~~~~~~~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~~e~~~~~~g~ 302 (349) T protein:vir:10 223 IFGKDTGRVVGQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPTPEENRLISSN 302 (349) T ss_pred hcccccccccCHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeeccchhhhhcccc Confidence 11 01 1134444444433221 2478899999988875 6666432 223332111 Q ss_pred ccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 247 NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 247 ~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) .-+...-+.+-+.+..+.+....++.+-|..+-=|+++|+++.+++= T Consensus 303 ~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 303 AQVSNVGNIMAKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred cceeeccceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 11112223344444445556677777777777788999999999887 No 177 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=69.32 E-value=0.22 Score=24.14 Aligned_cols=258 Identities=15% Similarity=0.187 Sum_probs=108.1 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHh---hhHHHHHHHhCcccccccCCCCeeeeeeeeeeeccc---CcccCCc Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQ---ENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAE---GNVPEGE 74 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~---~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~---gdVaEGe 74 (296) |-+-..+-. +.+- ....+|+.+-.. .-=.=++-++| + ....+ +.+.|-.-.+++ ..-.||. T Consensus 1 ma~~~~~~~-----t~~~-~g~~~dl~~~I~~isp~dTPf~S~i~---~--~~a~~--~~~~W~~d~l~~~~~~~~~EG~ 67 (317) T protein:vir:88 1 MATPTNAVS-----TVEI-NGKREDLIDIIYNIAPYDTPFMSAIG---K--GVATA--ITHEWQTDELRQPGKNTRVEGE 67 (317) T ss_pred CCccccceE-----eeee-eeeeechhhhheecCCccCcceeeec---C--ceecc--cEEEEEeeecCCccccccccCc Confidence 433222111 1111 122333322111 10011111121 1 12222 355676554332 2234998 Q ss_pred eechhhee-eeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHHHHH---HHHhhhhHHHHHHHhc---Ccccee Q lcl|Aclame:pro 75 VIPLSKVE-RKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVR---QLQKKIRTDFVTALKT---GTGTQD 145 (296) Q Consensus 75 ~Iplskv~-~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~QL~~---~iq~kIdnD~~~aLkt---at~t~~ 145 (296) .-|-.... |+..+.+ ..|..-.=.| |.+|+...|.++ |-.+|+.. +|.+.+.+-|+.--+. ...+.. T Consensus 68 da~~~~~~~r~~~~N~-tQIf~k~v~VSgTa~av~~~G~~~---ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~ 143 (317) T protein:vir:88 68 DATIKAGSFTTMLNNY-CQISDETLQVTGTADRVKKAGRKN---ELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTP 143 (317) T ss_pred ccccccccCCEEeccE-EEEEEeEEEEeehhhhhhhcCccc---hhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccc Confidence 76655433 2222222 2343222233 889987788775 44444433 3444444443331111 000000 Q ss_pred cchhhHHH-----------------------------HHH--HHHHHHHHhhccccCcceEEEEcHHHHH---HHhcC-C Q lcl|Aclame:pro 146 ALGAGLQG-----------------------------ALA--SAWGKLQVLFEDYGSERAIVFANSLDVA---EYIAK-A 190 (296) Q Consensus 146 ~t~~~lQ~-----------------------------Ala--~~~~~~~~~Feded~~~~VlFvNP~Daa---~~l~~-a 190 (296) -.-.|+.. +|. .-+.-++.+|+. +.....+||||...- ++.++ + T Consensus 144 r~~~Gl~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~-Gg~~~~i~v~a~~k~~i~~~~~~~~ 222 (317) T protein:vir:88 144 GQMANIFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRN-GGQANSIQTSSSIKKAISKNMKGRA 222 (317) T ss_pred hhhhhHHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhc-CCCCCEEEeChHHHHHHHHHhcCCc Confidence 00011111 011 112223445554 334446799997643 34322 1 Q ss_pred -cc---ccceeechh---hhhhhhe-eEEEEeccCCCceEEEEcccceEEEEecC-cchhhhhhhccccccccceEEEec Q lcl|Aclame:pro 191 -GI---TTQTAFGLT---YLVDFTG-TVIISTNDVTKGEIWATVPENIIFAYINP-NNSELAKEFNLYGDPTGYIGMNHF 261 (296) Q Consensus 191 -~i---~~q~~fg~t---yl~nfLG-~~II~S~kV~~G~~~~t~~~Nl~~ay~~~-~~g~~~~~f~~~td~tGliGv~h~ 261 (296) .+ ..++.||.+ |..+| | +.||.++.+|.+++++.-++.+.++|+-| ..-+|++..+ T Consensus 223 ~~i~~~~~~~~~g~~v~~~~tdf-G~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laKtGd-------------- 287 (317) T protein:vir:88 223 TEITLDASDNRIAQTVDVYESDF-GKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAKTGD-------------- 287 (317) T ss_pred eeEEEcccCeEEEEEEEEEEeCC-eEEEEEeCCCCCCCeEEEEcccccceeecccceeeccCCCcc-------------- Confidence 22 334566663 44444 7 68999999999999999999999999843 2222333221 Q ss_pred cccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 262 QENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 262 ~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) .+...+..| .++.+-+|.-=|++. -+++++ T Consensus 288 ~~k~~i~~E----~tLe~~N~~a~a~i~-~l~~~~ 317 (317) T protein:vir:88 288 SEKRQLLVE----YTFRVNNEKSGALIR-DVVAQL 317 (317) T ss_pred cceeEEEEE----EEEEEcCccceeEEE-EecccC Confidence 111111111 111222222222222 223333 No 178 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=58.92 E-value=0.4 Score=22.75 Aligned_cols=264 Identities=14% Similarity=0.127 Sum_probs=108.7 Q ss_pred cccccc------------cccc-----ceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeee- Q lcl|Aclame:pro 2 VTSRTY------------PEEN-----LIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV- 63 (296) Q Consensus 2 ~~~~~~------------ae~n-----l~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yi- 63 (296) ..|..+ .+.+ | .+.-|.+++.=-|++ -+++=-..|+--|+++|..= +..++|-.+- T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l-~~g~L~p~~a~~Fl~----~v~~~t~iL~~~r~~~~~s~-~~ei~kig~G~ 74 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAEL-DGFQLPVDVTEEFLE----RMQKGVQILGMADTMTLARL-EMEVPQFGVPR 74 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhcccccc-CceeecHHHHHHHHH----HHhhccchhhhcceeecccc-cccccccccce Confidence 000000 0111 1 123344444444444 34433344454566654321 1122222110 Q ss_pred --eccc---CcccCCceechhheeeeecceeEEEEeecccc--cCHHHHHh--hcCCchhHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 64 --TLAE---GNVPEGEVIPLSKVERKIHSEKKIELKKYRKA--TTGEDIQM--YGSNEAVTNTDNALVRQLQKKIRTDFV 134 (296) Q Consensus 64 --g~A~---gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~--vTdEAIql--sGygdav~etd~QL~~~iq~kIdnD~~ 134 (296) +-+. ++=.+.-.++-+.+.. .+.+|++-. ++.+.+.. +-.+.+-+++ |...+++.+-+|+- T Consensus 75 r~~r~~~e~~~~~~~~~~~~~~v~~-------~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~---i~~~~ae~~~~Dle 144 (360) T protein:vir:99 75 LSGHTRDEEGSRTENSEAESGSVKF-------NATDKSYYILVEPKRDALKNTHYGPDQFGDY---IVDQFIERYGNDLG 144 (360) T ss_pred eeccccccCCCCCcCCcCccccCcc-------ccccceeeEeechHHHHHhhhhcccchhHHH---HHHHHHHHHHHHHH Confidence 0010 0000111122222221 112232222 35666543 3334443343 23333333434322 Q ss_pred HHHhcCccce-----ecchhhHH--------------------------------------------------------H Q lcl|Aclame:pro 135 TALKTGTGTQ-----DALGAGLQ--------------------------------------------------------G 153 (296) Q Consensus 135 ~aLktat~t~-----~~t~~~lQ--------------------------------------------------------~ 153 (296) -....+.... ..+.+.|. . T Consensus 145 ~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 224 (360) T protein:vir:99 145 LMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTS 224 (360) T ss_pred HHHhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHH Confidence 2221111000 01112222 1 Q ss_pred HHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcC-----CccccceeechhhhhhhheeEEEEeccCCCceEEEEccc Q lcl|Aclame:pro 154 ALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAK-----AGITTQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPE 228 (296) Q Consensus 154 Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~-----a~i~~q~~fg~tyl~nfLG~~II~S~kV~~G~~~~t~~~ 228 (296) .+...+..+.++|-..+.-..+.|++|.+...|+.. ..++.+..+|...+ +++|..|+.-+..|+|.+++|+|. T Consensus 225 lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~-~~~Gipi~~v~~~pd~~~mlT~p~ 303 (360) T protein:vir:99 225 LFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDI-TPFSYDLVGVNGFPDEYMMFTDPN 303 (360) T ss_pred HHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCcccchhheeccccc-ccceeeeEEcCCCCCCceEEeccC Confidence 122333444445543333356899999999988853 36788888988776 488999999999999999999999 Q ss_pred ceEEE-EecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEE-ecCCC Q lcl|Aclame:pro 229 NIIFA-YINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVT-LTPGV 296 (296) Q Consensus 229 Nl~~a-y~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~t-I~~~v 296 (296) ||.+. |-|++ + ...++.. . ...+++-+...+-+-.-+-=|..|+|+.+| |..|- T Consensus 304 NLi~g~~~~ir---i----~~~~e~~------~-~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~ 359 (360) T protein:vir:99 304 NLAFGLYEEME---L----DQSTDTD------K-VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPT 359 (360) T ss_pred ceeEEeeeeeE---E----eecccch------h-hhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCC Confidence 99541 22332 1 1111100 0 000010000000000001114455555554 33333 No 179 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=56.63 E-value=0.34 Score=23.18 Aligned_cols=241 Identities=13% Similarity=0.120 Sum_probs=108.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCC-Cee----eeeeeeeeecccCcccCCce Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEG-MTL----KTYAGYDVTLAEGNVPEGEV 75 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG-~tI----t~pk~~yig~A~gdVaEGe~ 75 (296) |.|...+.-+-++....|. ..|+| +... +|.-.-+-|---++++.+|= +|+ ....|.-+|++- -+.||+. T Consensus 25 ~~agy~~~p~tq~~~~AlR-~EsL~--~~i~-~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~-f~~E~g~ 99 (467) T protein:vir:80 25 FTTGYGITPDTQTDAGALR-REFLD--DQIS-MLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTR-FTREIGV 99 (467) T ss_pred HHcccccCCccccCcchhh-hhhhh--hhhh-eeeccccchhhhhhcccchhhhhhhhheeeeccCcccccc-ccccccc Confidence 4333332222244333332 11111 0000 00000011111233333331 111 122455557775 7899999 Q ss_pred echhheeeeecceeEEEEeecccccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc-eecchhhHHHH Q lcl|Aclame:pro 76 IPLSKVERKIHSEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT-QDALGAGLQGA 154 (296) Q Consensus 76 Iplskv~~~~~~t~~~tikK~~K~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t-~~~t~~~lQ~A 154 (296) ++.+...-.. ++..+++--.+|-+|.-+=-..+.+||..+-.+-=...+++.|.--+|=.= +... ...+..+|| T Consensus 100 ~~~~~~~~~r-~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGd--s~l~~s~~~~~glq-- 174 (467) T protein:vir:80 100 APVSDPNIRQ-KTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGD--SDLSDSPEPQAGLE-- 174 (467) T ss_pred cccCCCceEE-EEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcc--cccccCCCcccccc-- Confidence 9998766542 344555555566677666323678888877777666667776665555210 0000 112344554 Q ss_pred HHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc---------cccceeechh---hhh-----hhheeEEEEeccC Q lcl|Aclame:pro 155 LASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG---------ITTQTAFGLT---YLV-----DFTGTVIISTNDV 217 (296) Q Consensus 155 la~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~---------i~~q~~fg~t---yl~-----nfLG~~II~S~kV 217 (296) |+ + ++-.+||+.+.+.+|... ......||.- |+- +|+-.+..+++.+ T Consensus 175 -----------fD--G---i~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v 238 (467) T protein:vir:80 175 -----------FD--G---LAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQL 238 (467) T ss_pred -----------cc--c---eeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEE Confidence 22 1 445568888877666541 1112235541 111 1111112222222 Q ss_pred CCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 218 TKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 218 ~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ...-.+...+-.| --||++..|++.... +.+|.--..++|..+++-. +|+ T Consensus 239 ------~~~n~~~~~~G~~---------------v~g~~sa~G~I~l~g----s~il~~~~~l~~~~~~~~~----Aps 288 (467) T protein:vir:80 239 ------VRDNGNNVSVGFN---------------IQGFHSARGFIKLHG----STVMENEQILDERILALPT----APQ 288 (467) T ss_pred ------EcCCCCceeeeec---------------ccceecceeeeeecC----ceeeccccCCCcccccccc----ccc Confidence 1111222222233 246777777666544 2345555556666666553 233 No 180 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=54.82 E-value=0.39 Score=22.85 Aligned_cols=238 Identities=15% Similarity=0.136 Sum_probs=108.0 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHHHH---HHHhCcccccccCCC-Cee----eeeeeeeeecccCcccC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKL---LEMLGVTRKISVSEG-MTL----KTYAGYDVTLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~~L---~~~LgVtr~~~~~pG-~tI----t~pk~~yig~A~gdVaE 72 (296) +.|...+.-+.++....|. ..|+| ..|.-| -+-|---++++.+|= +|+ ....|.-+|++- -+.| T Consensus 26 ~~agy~~~p~~q~~~~AlR-~EsL~------~~i~~L~~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~-f~~E 97 (468) T protein:vir:63 26 FTTGYGITPDTQTDAGALR-REFLD------DQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTR-FTRE 97 (468) T ss_pred HHcCcccCCccccCcchhh-hhhhh------hhhheeeecccchhhhhhcccchhhhhhhhheeeeccCcccccc-cccc Confidence 4343333222244333332 11111 111100 011111233333321 111 122455557775 7899 Q ss_pred CceechhheeeeecceeEEEEeecccccCHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc-eecchhhH Q lcl|Aclame:pro 73 GEVIPLSKVERKIHSEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT-QDALGAGL 151 (296) Q Consensus 73 Ge~Iplskv~~~~~~t~~~tikK~~K~vTdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t-~~~t~~~l 151 (296) |+.++.+...-.. ++..+++--.+|-+|.-+=-..+.+||..+-.+-=...+++.|.--+|=.= +... ...+..+| T Consensus 98 ~g~~~~~~~~~~r-~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGd--s~l~~s~~~~~gl 174 (468) T protein:vir:63 98 IGVAPVSDPNIRQ-KTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGD--SDLSDSPEPQAGL 174 (468) T ss_pred ccccccCCCceEE-EEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcc--cccccCCCccccc Confidence 9999998766542 344555555566677666323678888877777666667776665555210 0000 11234455 Q ss_pred HHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCc---------cccceeechh---hhh-----hhheeEEEEe Q lcl|Aclame:pro 152 QGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG---------ITTQTAFGLT---YLV-----DFTGTVIIST 214 (296) Q Consensus 152 Q~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~---------i~~q~~fg~t---yl~-----nfLG~~II~S 214 (296) | |+ + ++-.+||+.+.+.+|... ......||.- |+- +|+-.+..++ T Consensus 175 q-------------fD--G---i~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q 236 (468) T protein:vir:63 175 E-------------FD--G---LAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ 236 (468) T ss_pred c-------------cc--c---eeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCce Confidence 4 22 1 445568888877666541 1112235541 111 1111112222 Q ss_pred ccCCCceEEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecC Q lcl|Aclame:pro 215 NDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTP 294 (296) Q Consensus 215 ~kV~~G~~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~ 294 (296) +.+ ...-.+...+-.| --||++..|++.... +.+|.--..++|..+++-. + T Consensus 237 ~~v------~~~n~~~~~~G~~---------------v~g~~sa~G~I~l~g----s~il~~~~~l~~~~~~~~~----A 287 (468) T protein:vir:63 237 TQL------VRDNGNNVSVGFN---------------IQGFHSARGFIKLHG----STVMENEQILDERILALPT----A 287 (468) T ss_pred EEE------EcCCCCceeeeec---------------ccceecceeeeeecC----ceeeccccCCCcccccccc----c Confidence 222 1111222222233 246777777766544 2345555556666666553 2 Q ss_pred CC Q lcl|Aclame:pro 295 GV 296 (296) Q Consensus 295 ~v 296 (296) |+ T Consensus 288 ps 289 (468) T protein:vir:63 288 PQ 289 (468) T ss_pred cc Confidence 33 No 181 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=37.40 E-value=1.1 Score=20.31 Aligned_cols=266 Identities=13% Similarity=0.041 Sum_probs=115.9 Q ss_pred ccceehhhhhhhhhhhhHHHHhhhHHHHHHHhCcccccccCCCCeeeeeeeeeee----cccCccc---CCceechhhee Q lcl|Aclame:pro 10 ENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVT----LAEGNVP---EGEVIPLSKVE 82 (296) Q Consensus 10 ~nl~~~~dl~~a~siDf~~~f~~~i~~L~~~LgVtr~~~~~pG~tIt~pk~~yig----~A~gdVa---EGe~Iplskv~ 82 (296) =++-+..-|.. ++++--..=..|++.+ ..+ .+..+-.+|.+ .++. .|- -|. +|..+.-...+ T Consensus 1 ~d~f~~~~L~~-----~i~~~~~~~~~l~~~~-Fp~-~~~~~t~~v~~---~~~~~~~~lap-~v~~~~~~~~~~~~~~~ 69 (341) T protein:vir:39 1 MSVYTTAQLLA-----VNEKKFKFDPLFLRIF-FRE-TYPFSTEKVYL---SQIPGLVNMAL-YVSPIVSGKVIRSRGGS 69 (341) T ss_pred CCccCHHHHHH-----HHHhhcCccchhHhhc-CCc-ccccCcceEEE---EEecCCceeeE-EecCCCCcceeccccee Confidence 22222222211 1121111112233332 111 11112223322 1221 111 122 34444333322 Q ss_pred eeecceeEEEEeecccccCHHHHHhh-------cCCchhHHHHHHH-------HHHHHhhhhHHHHHHHhcCcccee--- Q lcl|Aclame:pro 83 RKIHSEKKIELKKYRKATTGEDIQMY-------GSNEAVTNTDNAL-------VRQLQKKIRTDFVTALKTGTGTQD--- 145 (296) Q Consensus 83 ~~~~~t~~~tikK~~K~vTdEAIqls-------Gygdav~etd~QL-------~~~iq~kIdnD~~~aLktat~t~~--- 145 (296) + .+.+...=|....++.+.++.. |..++.....++| ...|+..+.--|..+|.++.-... T Consensus 70 ~---~~~~~p~i~~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g 146 (341) T protein:vir:39 70 T---SEFTPGYVKPKHEVNPLMTLRRLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEA 146 (341) T ss_pred e---eeEeccccCcccccCHHHHHHHhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCC Confidence 1 1223332333445677766432 2224444444443 333344444445666765532111 Q ss_pred -----------------cchhh----HHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcccc---------- Q lcl|Aclame:pro 146 -----------------ALGAG----LQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITT---------- 194 (296) Q Consensus 146 -----------------~t~~~----lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~---------- 194 (296) .++.. -....++-+.++.+..++.+....+++++|+=...++.++.+-. T Consensus 147 ~~~~~vDfg~~~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~ 226 (341) T protein:vir:39 147 FEPVEVDMGRSAGNNIVQAGAAAWSSRDKETYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNS 226 (341) T ss_pred CcEEEEeccCCccceeEecCCccCCCCCCchHHHHHHHHHHHHhcCCceEEEEeChHHHHHHhcCHHHHHHHhhcccccc Confidence 01110 01112334445555555555556678888876665555543211 Q ss_pred -------ceeechhhhhhhheeEEEEe-----------ccCCCceEEEEcccc-eEEEEecCcchhhhhh-hcccccccc Q lcl|Aclame:pro 195 -------QTAFGLTYLVDFTGTVIIST-----------NDVTKGEIWATVPEN-IIFAYINPNNSELAKE-FNLYGDPTG 254 (296) Q Consensus 195 -------q~~fg~tyl~nfLG~~II~S-----------~kV~~G~~~~t~~~N-l~~ay~~~~~g~~~~~-f~~~td~tG 254 (296) +..-|..|+..+.|.+|+.- +-+|+|++++.+++. -..+|-.+. |+... ...+ +.- T Consensus 227 ~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~d~g~~~~~ip~~~~~l~p~~~~g~~~yg~~~--d~~~~~~~~~--~~~ 302 (341) T protein:vir:39 227 ELETALKDLGKAVSYKGMYGDVAIVVYSGQYIENDVKKNYLPDLTMVLGNTQARGLRTYGCIL--DADAQREGIN--AST 302 (341) T ss_pred cccchhhhhhhHhhhhhhhcCceEEEEccEEEecCcEEeeecCCeEEEeeCCCcceEEEeccc--chhhccccee--eee Confidence 11123456555556665542 338999999998875 466775542 22111 0111 111 Q ss_pred ceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEec Q lcl|Aclame:pro 255 YIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLT 293 (296) Q Consensus 255 liGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~ 293 (296) ++--......+....++.+-|..+--|+++|+++++++- T Consensus 303 ~~~~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 303 RYPKNWVQTGDPAREFTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred eeeeeeeecCCCcEEEEEEeccccceeeCCCcEEEEEeC Confidence 111111112233456666667788889999999999988 No 182 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=35.86 E-value=1.2 Score=20.14 Aligned_cols=251 Identities=12% Similarity=0.063 Sum_probs=115.2 Q ss_pred hhhHHHHhhhHHHHHHHhCcc-cccccCCCCeeeeeeeeee-ecccCcccCCceechhheeeeecceeEEEE--eecccc Q lcl|Aclame:pro 24 IDVTNKFQENISKLLEMLGVT-RKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPLSKVERKIHSEKKIEL--KKYRKA 99 (296) Q Consensus 24 iDf~~~f~~~i~~L~~~LgVt-r~~~~~pG~tIt~pk~~yi-g~A~gdVaEGe~Iplskv~~~~~~t~~~ti--kK~~K~ 99 (296) |+++=.+-. ..||+. +-.|. +=.+|.+=..... .++. -|+.|.+ -..+.+.+.+.+.++. -|.... T Consensus 1 i~~~P~~~g------~~~glff~~~~v-~T~~V~ie~~~~~l~lip-~v~rg~~--g~~~~~~~~~~~~f~~p~~~~~d~ 70 (320) T protein:vir:10 1 MNLLPVNYG------DSRALFAREKKV-RTRTILVEEKNGVLTLIQ-SREPGST--ENVAKRGKRKVRSFVIPHLPLEDV 70 (320) T ss_pred CCcCCchhh------hhhhhccCCCCc-ccceEEEEEecCceeeee-ccCCCCC--ceeecCCcceEEEEecceeccCCc Confidence 444432211 124443 22232 3344444222111 2222 3444442 1223333333344443 345567 Q ss_pred cCHHHHH-hhcCCchhHHHHH----HHHHHHHhhhhHH----HHHHHhc------C------------ccce------ec Q lcl|Aclame:pro 100 TTGEDIQ-MYGSNEAVTNTDN----ALVRQLQKKIRTD----FVTALKT------G------------TGTQ------DA 146 (296) Q Consensus 100 vTdEAIq-lsGygdav~etd~----QL~~~iq~kIdnD----~~~aLkt------a------------t~t~------~~ 146 (296) ++.+.|| +..||..--++.. +.+..+.++++.- +.-+|+. + +++. ++ T Consensus 71 i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~~l~~a 150 (320) T protein:vir:10 71 ILPDEYEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYFGLDNK 150 (320) T ss_pred cCHHHHcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEEecCCC Confidence 8999997 5556643233332 2333334444332 2223431 0 1110 12 Q ss_pred chhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCcc-------------ccceeechhhhhhhheeEEEE Q lcl|Aclame:pro 147 LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI-------------TTQTAFGLTYLVDFTGTVIIS 213 (296) Q Consensus 147 t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i-------------~~q~~fg~tyl~nfLG~~II~ 213 (296) + ..+...+.+.+..+++.+..+.....++++.|.=...+++.+.| ..+..+++- .|-|..+.+ T Consensus 151 ~-~dv~~~~~~~~~~i~~~l~g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~~~~~l~~~~~~~f---~~gGi~~~~ 226 (320) T protein:vir:10 151 D-ANVAESCRQVLRHVEDNLRGDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHEAAVNRLGGDTRKGF---KFGGLIFNE 226 (320) T ss_pred C-ccHHHHHHHHHHHHHHHhccCCCCceEEEEChHHHHHHhcCHHHHHHHHhhhhhhhhccccccceE---EecCEEEEE Confidence 2 23445566667777777766555567889988755544443321 111111110 122333332 Q ss_pred e------------ccCCCceEEEEcccce---EEEEecCcchhhhhhh--ccccccccceEEEeccccceeehhhhhhHH Q lcl|Aclame:pro 214 T------------NDVTKGEIWATVPENI---IFAYINPNNSELAKEF--NLYGDPTGYIGMNHFQENTTLTIQTLLVSG 276 (296) Q Consensus 214 S------------~kV~~G~~~~t~~~Nl---~~ay~~~~~g~~~~~f--~~~td~tGliGv~h~~~~~~~t~et~~~~~ 276 (296) = ..||.|++++.+.+.. +.||++.+.-|..+.- .+|.-+ =...+.....+-.|+.- T Consensus 227 Y~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apad~~e~vnt~g~p~y~k~----~~~~~~~g~~l~~qS~P--- 299 (320) T protein:vir:10 227 NRARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPADFNETAGTLGKRYYAKM----EPRRMGRGFDLHSQSNV--- 299 (320) T ss_pred cccEEEcCCCCeeEeecCCeeEEEEecCchhheeeecccCcHhhcCCccccccccc----ccccCCCeEEEEeeecc--- Confidence 1 2499999999987654 6678887533322211 122200 00012222334444433 Q ss_pred HHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 277 MLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 277 ~~lfpE~~dgvv~~tI~~~v 296 (296) +-.+=|++.+++++-.++= T Consensus 300 -Lpi~~rP~~lv~~~~~a~~ 318 (320) T protein:vir:10 300 -LPMCCRPGVLVELDAAAQP 318 (320) T ss_pred -cccccCcceEEEEEecCCC Confidence 3456788999999986655 No 183 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=28.01 E-value=1.8 Score=19.20 Aligned_cols=245 Identities=16% Similarity=0.178 Sum_probs=104.7 Q ss_pred CccccccccccceehhhhhhhhhhhhHHHHhhhHH-HHH------HHhCcccccccCCCCeeeeee-eeeeecccCcccC Q lcl|Aclame:pro 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENIS-KLL------EMLGVTRKISVSEGMTLKTYA-GYDVTLAEGNVPE 72 (296) Q Consensus 1 ~~~~~~~ae~nl~~~~dl~~a~siDf~~~f~~~i~-~L~------~~LgVtr~~~~~pG~tIt~pk-~~yig~A~gdVaE 72 (296) ||+-++- +.-+-+.++...-+- .+. ++++|.. ++.+|++...|. ....|-++ .++. T Consensus 1 ~~~~~~g-------------~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~--~~~~~~~~~~~~~~~~~G~~~-~~~~ 64 (301) T protein:vir:80 1 MQGKITA-------------TIEARDLQAIDNVIYEPKQEELTARSVFPQKF--DVNEGAESYSFDVMTRSGAAK-IIAN 64 (301) T ss_pred CCccccc-------------hhhHHHHHHHHHHHHHhhhhhhhhhhhccccc--CCCCceEEEEEeeeccceeEE-EecC Confidence 5543321 223333333333322 233 3444444 445666654443 34445555 4443 Q ss_pred -CceechhheeeeecceeEEEEeeccccc--CHHHHHhhc-CCchhHHHHHH-HHHHHHhhhhHHHHHH--------Hhc Q lcl|Aclame:pro 73 -GEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYG-SNEAVTNTDNA-LVRQLQKKIRTDFVTA--------LKT 139 (296) Q Consensus 73 -Ge~Iplskv~~~~~~t~~~tikK~~K~v--TdEAIqlsG-ygdav~etd~Q-L~~~iq~kIdnD~~~a--------Lkt 139 (296) ++.||+..+..+ .+..++..++++. +...++... .|.|+.....+ -+++++++.+.=.|-= |.+ T Consensus 65 ~~~dip~~~~~~~---~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN 141 (301) T protein:vir:80 65 GADDLPLVDVDMV---RKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFE 141 (301) T ss_pred cccccccccccce---eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeec Confidence 456899888764 4667889999976 444433322 34444433322 3344444444332211 001 Q ss_pred Ccc--ce--------------ecchhhHHHHHHHHHHHHHHhhccccCcceEEEEcHHHHHHHhcCCccccceeechhhh Q lcl|Aclame:pro 140 GTG--TQ--------------DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYL 203 (296) Q Consensus 140 at~--t~--------------~~t~~~lQ~Ala~~~~~~~~~Feded~~~~VlFvNP~Daa~~l~~a~i~~q~~fg~tyl 203 (296) .++ +. +.+++-+..-+-+++.++... ......+..+.++|.... +|-...++.+ .|.+.+ T Consensus 142 ~p~~~~~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~-s~g~~~p~~L~L~p~~~~-~L~~~~~~~~--~~~tvl 217 (301) T protein:vir:80 142 ATGIQIDVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVL-PGYGTASLKLCLPPKQFE-LINKKRYSNE--DSRSVL 217 (301) T ss_pred CCCcccccccCcccccccccccCCHHHHHHHHHHHHHHHHHh-cCceecccEEEecHHHHH-hhhhccccCC--CCeeHH Confidence 111 00 001111222222222222211 001124567888887655 4433333333 244443 Q ss_pred h----hhheeEEEEeccCCC----ce----EEEEcccceEEEEecCcchhhhhhhccccccccceEEEeccccceeehhh Q lcl|Aclame:pro 204 V----DFTGTVIISTNDVTK----GE----IWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQT 271 (296) Q Consensus 204 ~----nfLG~~II~S~kV~~----G~----~~~t~~~Nl~~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et 271 (296) + |+.+.+|+..+.++. |+ +|..-++|+.+..--| | ..|..+.+++.++. T Consensus 218 ~~l~~~~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~--------~-----------~~~~~e~~~~~~~~ 278 (301) T protein:vir:80 218 KVLQDNAWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMD--------I-----------TRHPEEYSFPRTKV 278 (301) T ss_pred HHHHHHcCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCc--------e-----------eeecceecCceeEe Confidence 3 455678888777763 11 1111234433332100 0 12444444444432 Q ss_pred ---hhhHH-HHhhhh---ccceE Q lcl|Aclame:pro 272 ---LLVSG-MLMYPE---RIDGI 287 (296) Q Consensus 272 ---~~~~~-~~lfpE---~~dgv 287 (296) .-+.| +..+|+ ++||| T Consensus 279 ~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 279 PFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeeeeeEEEEEEccceEEEEecC Confidence 11222 344565 45777 No 184 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=26.73 E-value=1.9 Score=19.04 Aligned_cols=285 Identities=11% Similarity=0.021 Sum_probs=122.6 Q ss_pred Cccccccccccce-ehhhhhh-hhhhhhHHHHhhhHH--HHHHHhCcccccccCCCCeeeeeeeeeeeccc-----Cccc Q lcl|Aclame:pro 1 MVTSRTYPEENLI-KSTDLKY-PITIDVTNKFQENIS--KLLEMLGVTRKISVSEGMTLKTYAGYDVTLAE-----GNVP 71 (296) Q Consensus 1 ~~~~~~~ae~nl~-~~~dl~~-a~siDf~~~f~~~i~--~L~~~LgVtr~~~~~pG~tIt~pk~~yig~A~-----gdVa 71 (296) |.-=.--+..-++ ...+.++ .+..=|..|-=.... -.+.-|+=+..+|-.-|.||++.+|.-.-++. |.-+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 2211000000000 0001111 001111111111111 12345667889999999999999986665543 3334 Q ss_pred CCcee---------------ch--------------hheeeeecceeEEEEeeccccc--CHHHHHhhcCCchhHHHHHH Q lcl|Aclame:pro 72 EGEVI---------------PL--------------SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNA 120 (296) Q Consensus 72 EGe~I---------------pl--------------skv~~~~~~t~~~tikK~~K~v--TdEAIqlsGygdav~etd~Q 120 (296) +|.+| -+ -+++|. ..+.+++||+-=+ ||+.. +.--++.+.+.-.+ T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~---d~~g~l~qyG~~~e~Td~~~-dt~~D~~l~~h~s~ 156 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRI---AREGSIHKFGFFYEFTQESI-DFDSDDGLMEHLSR 156 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceee---eeeeeeeeccCccchhhhhh-hhhcchHHHHHHHH Confidence 44433 11 144443 4677899999855 88885 67777777775333 Q ss_pred HHHHH-----HhhhhHHHHHHHh----cCccc------------eecchhhHHHHHHHHHH---H-------HHHhhccc Q lcl|Aclame:pro 121 LVRQL-----QKKIRTDFVTALK----TGTGT------------QDALGAGLQGALASAWG---K-------LQVLFEDY 169 (296) Q Consensus 121 L~~~i-----q~kIdnD~~~aLk----tat~t------------~~~t~~~lQ~Ala~~~~---~-------~~~~Fede 169 (296) .++.= +.-|.+|++++.. .++.+ +..+.+.+-.+-..... . ....+... T Consensus 157 ell~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk 236 (401) T protein:vir:95 157 ELMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTK 236 (401) T ss_pred HHhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcc Confidence 33333 4556677775541 01100 01122233221111000 0 01113332 Q ss_pred -cCcceEEEEcH------HHHHHHhcCCccc-------cceeechhhhhhhheeEEEEeccCC--------C-ce----- Q lcl|Aclame:pro 170 -GSERAIVFANS------LDVAEYIAKAGIT-------TQTAFGLTYLVDFTGTVIISTNDVT--------K-GE----- 221 (296) Q Consensus 170 -d~~~~VlFvNP------~Daa~~l~~a~i~-------~q~~fg~tyl~nfLG~~II~S~kV~--------~-G~----- 221 (296) .....|++|+| .++++.+++...- ....|.+-.++ +=+.++|.+.... . |. T Consensus 237 ~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~-i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~ 315 (401) T protein:vir:95 237 VIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGS-IDKFRIIQVPEMLHWAGAGAQATGANPGYR 315 (401) T ss_pred ccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccc-cCceeEEecccceeecCCcccccccccccc Confidence 34678999999 7888888876432 11223222222 3347888877754 1 10 Q ss_pred -EEEEcccceEEEEe-----------cCcchhhhhhhccccccccc-eEEEeccccceeehhhhhhHHHHhh--hhccce Q lcl|Aclame:pro 222 -IWATVPENIIFAYI-----------NPNNSELAKEFNLYGDPTGY-IGMNHFQENTTLTIQTLLVSGMLMY--PERIDG 286 (296) Q Consensus 222 -~~~t~~~Nl~~ay~-----------~~~~g~~~~~f~~~td~tGl-iGv~h~~~~~~~t~et~~~~~~~lf--pE~~dg 286 (296) +..+.-.|--+|+. +.++++-+--|...+..-|. .+=.-++ +-=..++|+++| .+++.. T Consensus 316 ~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DP------lgQ~g~vgwK~~~a~~vL~~ 389 (401) T protein:vir:95 316 TSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDP------YGETGFSSIKWYYGILVKRP 389 (401) T ss_pred cccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCc------ccceehhhhhhhhhhheecc Confidence 11111122223333 33322111112222222221 0000111 122345666665 333333 Q ss_pred EEEEEec--CCC Q lcl|Aclame:pro 287 IVKVTLT--PGV 296 (296) Q Consensus 287 vv~~tI~--~~v 296 (296) =.-+.|+ +|+ T Consensus 390 e~m~~ies~a~~ 401 (401) T protein:vir:95 390 ERLALIKTVAPL 401 (401) T ss_pred ceeEEEEeecCC Confidence 3334443 444 No 185 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=23.16 E-value=2.3 Score=18.56 Aligned_cols=272 Identities=11% Similarity=0.083 Sum_probs=129.5 Q ss_pred Cccccccc---cccceehhhhhh--hhhhhhHHHHhhhHHHHHHHhCcccc--cccCCCCeeeeeeeeeee---cccCcc Q lcl|Aclame:pro 1 MVTSRTYP---EENLIKSTDLKY--PITIDVTNKFQENISKLLEMLGVTRK--ISVSEGMTLKTYAGYDVT---LAEGNV 70 (296) Q Consensus 1 ~~~~~~~a---e~nl~~~~dl~~--a~siDf~~~f~~~i~~L~~~LgVtr~--~~~~pG~tIt~pk~~yig---~A~gdV 70 (296) |-.+.+-+ .....=+.+|.. .+...|.+||-..=+ --.|-|+ +.-.+|++|++.--.-.+ --.+.- T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~----~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd~~ 76 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSE----NAVIQRKTELESDAGDRITFDLSVHLRGKPTYGDAR 76 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCC----CCcEEEeeecCCCCCceEEeeeeeecccCCcccCce Confidence 66555443 333334445544 355668877743211 1134444 566899999976533221 111234 Q ss_pred cCCceechhheeeeecceeEEEEeecccccC----HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhcCccc--- Q lcl|Aclame:pro 71 PEGEVIPLSKVERKIHSEKKIELKKYRKATT----GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT--- 143 (296) Q Consensus 71 aEGe~Iplskv~~~~~~t~~~tikK~~K~vT----dEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~aLktat~t--- 143 (296) .||-+=+|+-.+ .+++|...|.+|. ... |++=+ |=..++.+.|...+++..|..+|-.|.++++- T Consensus 77 leGnee~L~~~~------~~i~idq~r~~V~~~g~ms~-qRt~~-dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~ 148 (364) T protein:vir:93 77 VEGKEESLRFYQ------DEVRIDQVRHSVSAGGRMSR-KRTVH-NIRRIARDRLGDYFYKFTDELLFIYLSGARGINLD 148 (364) T ss_pred eeccccceeEEe------eEEEEeeccccccccCchhh-hhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 588777777655 5689999999882 222 22222 23466889999999999999999888654321 Q ss_pred ------------------------------eec---chhhHH-HHHHHHHHHHHHh-------h-----ccccCcceEEE Q lcl|Aclame:pro 144 ------------------------------QDA---LGAGLQ-GALASAWGKLQVL-------F-----EDYGSERAIVF 177 (296) Q Consensus 144 ------------------------------~~~---t~~~lQ-~Ala~~~~~~~~~-------F-----eded~~~~VlF 177 (296) .++ +.+.+. ..+.++...+... + .-.++..+|+| T Consensus 149 ~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~ 228 (364) T protein:vir:93 149 FIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCV 228 (364) T ss_pred cccccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEE Confidence 000 011111 0112222222110 0 00123568999 Q ss_pred EcHHHHHHHhcCCc-----ccc---------ceeechhhhhhhheeEEEEeccCCCceEEE------E------cccceE Q lcl|Aclame:pro 178 ANSLDVAEYIAKAG-----ITT---------QTAFGLTYLVDFTGTVIISTNDVTKGEIWA------T------VPENII 231 (296) Q Consensus 178 vNP~Daa~~l~~a~-----i~~---------q~~fg~tyl~nfLG~~II~S~kV~~G~~~~------t------~~~Nl~ 231 (296) ++|..+.+++.++. +.. +-.|-|.++. .-|+-|....++..+..+. + =+..+. T Consensus 229 l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm-~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~ 307 (364) T protein:vir:93 229 MSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGM-INNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGV 307 (364) T ss_pred EcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeee-EcCeEEeccCCcccccccccCccccchhhheecceeeE Confidence 99999999886552 221 2356554432 3344444444554332222 1 144455 Q ss_pred EEEecCcchhhhhhhccccccccceEEEeccccceeehhhhhhHHHHhhhhccceEEEEEecCCC Q lcl|Aclame:pro 232 FAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) Q Consensus 232 ~ay~~~~~g~~~~~f~~~td~tGliGv~h~~~~~~~t~et~~~~~~~lfpE~~dgvv~~tI~~~v 296 (296) ++|...+ | .-|. +..|+- -| .+..-++..++.=..=.=|+..==||+.+. ++++ T Consensus 308 ~a~g~~~-g---~~~~-w~Ee~~----D~-gn~~~i~~~~i~G~kK~rF~~~DfGvi~id-taa~ 361 (364) T protein:vir:93 308 IAYGTAN-G---LRFD-WEETVK----DY-GNEPAIAAGFIAGMKKARFNNKDFGVISID-TAAK 361 (364) T ss_pred EEeecCC-C---CCce-eeeccc----CC-CCchhhhhhhHhhhhhcccCCccceEEEec-cccc Confidence 5554421 1 0110 111110 00 000112333333222333444333555433 2222 No 186 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=21.63 E-value=2.6 Score=18.33 Aligned_cols=213 Identities=11% Similarity=0.088 Sum_probs=100.7 Q ss_pred ccccccccccceehhhh----hhhhhhhhHHHHhhhHHH-HH-----HH-------hCcccc--cccCCCCeeeeeeeee Q lcl|Aclame:pro 2 VTSRTYPEENLIKSTDL----KYPITIDVTNKFQENISK-LL-----EM-------LGVTRK--ISVSEGMTLKTYAGYD 62 (296) Q Consensus 2 ~~~~~~ae~nl~~~~dl----~~a~siDf~~~f~~~i~~-L~-----~~-------LgVtr~--~~~~pG~tIt~pk~~y 62 (296) .|.-++..+|-.-...| +.-.+ +++++++.+.. -. +. .-|-|+ +.-.+|++|++.--.- T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~--~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~ 78 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRS--MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCCh--HHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeec Confidence 34444444542221111 00001 23333332110 00 00 113333 4457899998865322 Q ss_pred ee---cccCcccCCceechhheeeeecceeEEEEeeccccc----CHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHH Q lcl|Aclame:pro 63 VT---LAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRKAT----TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVT 135 (296) Q Consensus 63 ig---~A~gdVaEGe~Iplskv~~~~~~t~~~tikK~~K~v----TdEAIqlsGygdav~etd~QL~~~iq~kIdnD~~~ 135 (296) .+ --.+.-.||-+=+|+..+ .+++|+..|.+| +... |++=+ |=-.++...|...+++..|.-+|. T Consensus 79 L~g~gv~Gd~~lEGnee~L~~~~------d~l~IDq~r~~V~~gg~msq-qRt~~-dlR~~ar~~L~~w~~~~~Dq~~~v 150 (318) T protein:vir:27 79 LSKRPTMGDERVEGRGEDLSHAD------FSLKINQGRHLVDAGGRMSQ-QRTKF-NLASSARTLLGTYFNDLQDQCAIV 150 (318) T ss_pred cccCccccCceeeccccceEEEe------eEEEEeeeccccccccchhh-hhhhH-HHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 111234688877777655 568899999987 2222 11111 224567788999999999999999 Q ss_pred HHhcCccc-----------------------eec-------------------chhhHHHHHHHHHHHHHH--------- Q lcl|Aclame:pro 136 ALKTGTGT-----------------------QDA-------------------LGAGLQGALASAWGKLQV--------- 164 (296) Q Consensus 136 aLktat~t-----------------------~~~-------------------t~~~lQ~Ala~~~~~~~~--------- 164 (296) .|..+++. .++ +.+.+.-.+ +.++.. T Consensus 151 ~laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~l---id~~~~~~~~~a~pi 227 (318) T protein:vir:27 151 HLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGL---VDNLSLFIDEMAHPL 227 (318) T ss_pred HHhhcccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHH---HHHHHHHHHHhCCCC Confidence 99655531 011 011111111 112211 Q ss_pred ---hhccc----cCcceEEEEcHHHHHHHhcCCc------cc----------cceeechhhhhhhheeEEEEecc-CC-- Q lcl|Aclame:pro 165 ---LFEDY----GSERAIVFANSLDVAEYIAKAG------IT----------TQTAFGLTYLVDFTGTVIISTND-VT-- 218 (296) Q Consensus 165 ---~Fede----d~~~~VlFvNP~Daa~~l~~a~------i~----------~q~~fg~tyl~nfLG~~II~S~k-V~-- 218 (296) .++.+ +...+|+|++|-++.+++.++. +. .+-.|.|.++. .-|+ ||+... || T Consensus 228 ~PV~v~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm-~ngv-il~~~~~vpIr 305 (318) T protein:vir:27 228 QPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAM-WRNI-LVRKYAGMPIR 305 (318) T ss_pred cceeeccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceee-ecCE-EEeecCCccEE Confidence 22221 1125899999999999987752 11 12356554332 2232 333332 33 Q ss_pred --CceEEEEcccceE Q lcl|Aclame:pro 219 --KGEIWATVPENII 231 (296) Q Consensus 219 --~G~~~~t~~~Nl~ 231 (296) .|.-+ ..-||. T Consensus 306 f~~G~~v--~~~~~~ 318 (318) T protein:vir:27 306 FYQGQRF--WYQRIT 318 (318) T ss_pred EcCCCee--eeeecC Confidence 33322 122222 Done!