Query         lcl|Aclame:protein:vir:80446|NCBI_annot:BcepGomrgp07|genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555
Match_columns 367
No_of_seqs    137 out of 169
Neff          6.7
Searched_HMMs 1612
Date          Mon Dec 2 01:39:46 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_65 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_65_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:80446 Length: 367 100.0 3E-149 2E-152 834.7 28.0 367 1-367 1-367 (367)
2 protein:vir:94989 Length: 349 100.0 5E-129 3E-132 724.1 27.9 339 1-367 1-347 (349)
3 protein:vir:78387 Length: 349 100.0 6E-129 3E-132 723.7 27.7 339 1-367 1-347 (349)
4 protein:vir:1583 Length: 351 # 100.0 4E-121 3E-124 680.5 25.9 324 1-367 1-328 (351)
5 protein:vir:102944 Length: 330 100.0 3E-120 2E-123 675.9 25.1 326 1-367 1-328 (330)
6 protein:vir:5974 Length: 324 # 100.0 1E-118 6E-122 667.6 25.9 319 1-367 1-322 (324)
7 protein:vir:95131 Length: 325 100.0 9.6E-87 6E-90 492.2 22.6 316 1-366 1-325 (325)
8 protein:vir:95107 Length: 270 100.0 2.1E-68 1.3E-71 391.6 19.5 266 1-343 1-270 (270)
9 protein:vir:96792 Length: 315 100.0 1.8E-67 1.1E-70 386.5 20.5 293 1-367 1-313 (315)
10 protein:vir:105334 Length: 276 100.0 2.5E-62 1.5E-65 358.3 18.0 270 1-341 1-276 (276)
11 protein:vir:95898 Length: 274 100.0 9.9E-61 6.2E-64 349.5 16.5 268 1-353 1-274 (274)
12 protein:vir:96262 Length: 274 100.0 9.9E-61 6.2E-64 349.5 16.5 268 1-353 1-274 (274)
13 protein:vir:96833 Length: 275 100.0 6.7E-60 4.2E-63 345.0 18.5 269 1-338 1-275 (275)
14 protein:vir:1239 Length: 274 # 100.0 9E-60 5.6E-63 344.3 16.5 268 1-353 1-274 (274)
15 protein:vir:3613 Length: 272 # 100.0 8.6E-59 5.4E-62 338.9 18.8 263 1-319 1-272 (272)
16 protein:vir:97433 Length: 274 100.0 3.1E-57 1.9E-60 330.4 18.3 268 1-345 1-274 (274)
17 protein:vir:94494 Length: 274 100.0 3.1E-57 1.9E-60 330.4 18.3 268 1-345 1-274 (274)
18 protein:vir:96123 Length: 274 100.0 1.1E-55 7E-59 321.8 18.4 262 1-367 1-268 (274)
19 protein:vir:93742 Length: 274 100.0 2.9E-53 1.8E-56 308.6 18.3 268 1-345 1-274 (274)
20 protein:vir:80930 Length: 278 100.0 1.2E-52 7.2E-56 305.3 18.2 271 1-336 1-278 (278)
21 protein:vir:9820 Length: 272 # 100.0 2.1E-46 1.3E-49 271.0 18.9 262 1-337 1-272 (272)
22 protein:vir:3033 Length: 272 # 100.0 2.1E-46 1.3E-49 271.0 18.9 262 1-337 1-272 (272)
23 protein:vir:739 Length: 231 # 100.0 1.3E-39 7.9E-43 233.8 15.3 225 47-367 1-230 (231)
24 protein:vir:9927 Length: 295 # 99.8 7.1E-23 4.4E-26 142.0 8.0 278 1-334 1-295 (295)
25 protein:vir:7990 Length: 273 # 99.7 2.1E-18 1.3E-21 117.5 16.0 264 1-335 1-273 (273)
26 protein:vir:105822 Length: 273 99.6 1.8E-17 1.1E-20 112.4 15.0 264 1-335 1-273 (273)
27 protein:vir:102605 Length: 273 99.6 1.8E-17 1.1E-20 112.4 15.0 264 1-335 1-273 (273)
28 protein:vir:9875 Length: 296 # 99.5 3E-17 1.9E-20 111.2 10.0 275 1-340 7-296 (296)
29 protein:vir:80180 Length: 381 99.5 3.4E-16 2.1E-19 105.4 13.4 318 1-367 11-346 (381)
30 protein:vir:94622 Length: 341 99.5 8.6E-16 5.3E-19 103.2 14.5 315 1-367 1-338 (341)
31 protein:vir:106647 Length: 303 99.4 3E-15 1.8E-18 100.2 12.3 275 1-339 1-303 (303)
32 protein:vir:99075 Length: 392 99.3 4.3E-14 2.7E-17 93.9 12.0 302 1-367 1-313 (392)
33 protein:vir:108211 Length: 318 99.1 4.3E-12 2.6E-15 82.9 14.4 276 1-338 1-318 (318)
34 protein:vir:4856 Length: 293 # 98.9 2.9E-10 1.8E-13 72.9 16.5 283 1-345 1-293 (293)
35 protein:vir:9759 Length: 303 # 98.8 3.1E-10 1.9E-13 72.7 14.7 281 1-319 1-303 (303)
36 protein:vir:7771 Length: 330 # 98.8 1.8E-10 1.1E-13 74.0 13.5 294 1-367 1-324 (330)
37 protein:vir:9309 Length: 324 # 98.8 9.8E-10 6.1E-13 70.0 14.8 279 1-343 21-324 (324)
38 protein:vir:80684 Length: 315 98.7 2.1E-09 1.3E-12 68.2 15.9 292 1-342 1-315 (315)
39 protein:vir:2504 Length: 305 # 98.7 1.1E-09 7E-13 69.6 14.4 279 1-340 1-305 (305)
40 protein:vir:2344 Length: 397 # 98.7 2.6E-09 1.6E-12 67.6 16.2 304 1-367 10-341 (397)
41 protein:vir:99749 Length: 324 98.7 2.4E-09 1.5E-12 67.8 15.8 279 1-343 21-324 (324)
42 protein:vir:104256 Length: 458 98.7 2.7E-09 1.7E-12 67.6 15.8 280 1-335 161-458 (458)
43 protein:vir:100135 Length: 418 98.7 1.6E-09 1E-12 68.8 14.6 277 1-335 132-418 (418)
44 protein:vir:41 Length: 299 # N 98.7 1.3E-09 8.2E-13 69.3 13.4 287 1-367 1-297 (299)
45 protein:vir:4830 Length: 397 # 98.7 7.7E-09 4.8E-12 65.1 17.2 279 1-345 109-397 (397)
46 protein:vir:1328 Length: 392 # 98.6 3.3E-09 2.1E-12 67.1 14.8 277 1-334 107-392 (392)
47 protein:vir:78223 Length: 333 98.6 2.8E-09 1.7E-12 67.5 14.4 290 1-338 1-333 (333)
48 protein:vir:97053 Length: 390 98.6 3.4E-09 2.1E-12 67.0 14.8 271 1-337 113-390 (390)
49 protein:vir:9410 Length: 415 # 98.6 1.7E-08 1E-11 63.2 18.2 289 1-345 119-415 (415)
50 protein:vir:3991 Length: 404 # 98.6 1.1E-08 6.5E-12 64.3 16.8 277 1-336 116-404 (404)
51 protein:vir:10364 Length: 390 98.6 5E-09 3.1E-12 66.1 14.9 265 1-337 110-390 (390)
52 protein:vir:98339 Length: 415 98.6 1.2E-08 7.6E-12 64.0 16.9 290 1-345 119-415 (415)
53 protein:vir:79987 Length: 415 98.6 1.2E-08 7.6E-12 64.0 16.9 290 1-345 119-415 (415)
54 protein:vir:81100 Length: 415 98.6 1.2E-08 7.6E-12 64.0 16.9 290 1-345 119-415 (415)
55 protein:vir:6212 Length: 434 # 98.6 1.3E-08 8.2E-12 63.8 17.1 283 1-367 131-430 (434)
56 protein:vir:4953 Length: 397 # 98.6 1.3E-08 8.1E-12 63.8 16.9 273 1-342 109-397 (397)
57 protein:vir:1886 Length: 385 # 98.6 5.2E-09 3.3E-12 66.0 14.7 274 1-335 105-385 (385)
58 protein:vir:191 Length: 385 # 98.6 5.2E-09 3.3E-12 66.0 14.7 274 1-335 105-385 (385)
59 protein:vir:8102 Length: 543 # 98.6 1.6E-08 9.6E-12 63.4 17.3 277 1-334 247-543 (543)
60 protein:vir:1638 Length: 298 # 98.6 8.9E-09 5.5E-12 64.7 15.9 281 1-339 1-298 (298)
61 protein:vir:103955 Length: 324 98.6 1E-08 6.4E-12 64.4 16.0 279 1-343 21-324 (324)
62 protein:vir:4700 Length: 415 # 98.6 2.3E-08 1.4E-11 62.5 17.9 290 1-345 119-415 (415)
63 protein:vir:4600 Length: 415 # 98.6 2.3E-08 1.4E-11 62.5 17.9 290 1-345 119-415 (415)
64 protein:vir:96223 Length: 324 98.6 1.1E-08 6.6E-12 64.3 15.9 279 1-341 21-324 (324)
65 protein:vir:4339 Length: 395 # 98.6 6.2E-09 3.9E-12 65.6 14.4 276 1-339 109-395 (395)
66 protein:vir:94142 Length: 304 98.6 7.5E-09 4.7E-12 65.1 14.3 276 1-342 1-304 (304)
67 protein:vir:105905 Length: 304 98.6 7.5E-09 4.7E-12 65.1 14.3 276 1-342 1-304 (304)
68 protein:vir:4997 Length: 397 # 98.5 2.8E-08 1.8E-11 62.0 17.3 278 1-345 109-397 (397)
69 protein:vir:5739 Length: 366 # 98.5 6.8E-09 4.2E-12 65.4 13.9 287 1-349 64-366 (366)
70 protein:vir:80213 Length: 334 98.5 2.9E-09 1.8E-12 67.4 11.7 293 1-338 1-334 (334)
71 protein:vir:97148 Length: 324 98.5 2.3E-08 1.4E-11 62.5 16.5 279 1-343 23-324 (324)
72 protein:vir:81070 Length: 390 98.5 9.6E-09 5.9E-12 64.5 14.1 264 1-337 113-390 (390)
73 protein:vir:8187 Length: 311 # 98.5 1E-08 6.4E-12 64.4 14.2 287 1-341 1-311 (311)
74 protein:vir:9574 Length: 300 # 98.5 2.1E-08 1.3E-11 62.7 15.5 282 1-335 1-300 (300)
75 protein:vir:94711 Length: 347 98.5 6.7E-09 4.2E-12 65.4 12.1 288 1-342 1-347 (347)
76 protein:vir:3870 Length: 400 # 98.4 6.5E-08 4E-11 60.0 16.8 263 1-334 130-400 (400)
77 protein:vir:78523 Length: 338 98.4 2E-08 1.2E-11 62.8 13.8 283 1-340 1-338 (338)
78 protein:vir:96392 Length: 324 98.4 5.5E-08 3.4E-11 60.4 16.1 284 1-341 18-324 (324)
79 protein:vir:78830 Length: 324 98.4 5.5E-08 3.4E-11 60.4 16.1 284 1-341 18-324 (324)
80 protein:vir:8420 Length: 477 # 98.4 2E-08 1.2E-11 62.8 13.5 291 1-341 153-477 (477)
81 protein:vir:1383 Length: 421 # 98.4 6.5E-08 4.1E-11 60.0 16.3 298 1-367 109-419 (421)
82 protein:vir:94771 Length: 298 98.4 4.3E-08 2.7E-11 61.0 15.2 275 1-339 1-298 (298)
83 protein:vir:3845 Length: 395 # 98.4 6.2E-08 3.8E-11 60.1 16.0 274 1-336 105-395 (395)
84 protein:vir:100884 Length: 389 98.4 2.5E-07 1.6E-10 56.7 19.1 272 1-340 105-389 (389)
85 protein:vir:7409 Length: 408 # 98.3 1.6E-07 1E-10 57.8 16.8 278 1-345 116-408 (408)
86 protein:vir:78739 Length: 332 98.3 3.8E-08 2.4E-11 61.3 13.1 287 1-346 7-332 (332)
87 protein:vir:102655 Length: 322 98.3 6.6E-08 4.1E-11 60.0 14.1 286 1-318 10-322 (322)
88 protein:vir:108303 Length: 418 98.3 1.9E-07 1.2E-10 57.4 16.5 308 1-367 1-375 (418)
89 protein:vir:100172 Length: 394 98.3 4.5E-07 2.8E-10 55.4 18.4 274 1-347 106-394 (394)
90 protein:vir:10450 Length: 344 98.3 5.3E-08 3.3E-11 60.5 12.9 295 1-366 1-344 (344)
91 protein:vir:104085 Length: 320 98.3 1.7E-07 1.1E-10 57.6 15.4 281 1-341 14-320 (320)
92 protein:vir:95763 Length: 297 98.3 3.5E-07 2.1E-10 56.0 16.7 271 1-339 1-297 (297)
93 protein:vir:100247 Length: 425 98.2 2.4E-07 1.5E-10 56.9 15.4 282 1-336 127-425 (425)
94 protein:vir:1025 Length: 408 # 98.2 4.6E-07 2.8E-10 55.4 16.8 282 1-343 105-408 (408)
95 protein:vir:485 Length: 407 # 98.2 4.2E-07 2.6E-10 55.5 16.4 295 1-341 90-407 (407)
96 protein:vir:8885 Length: 347 # 98.2 1.4E-07 8.6E-11 58.2 13.2 296 1-334 1-347 (347)
97 protein:vir:2430 Length: 318 # 98.2 1.7E-07 1E-10 57.7 13.6 288 1-367 14-314 (318)
98 protein:vir:6242 Length: 390 # 98.2 1.4E-07 8.7E-11 58.2 13.1 272 1-335 106-390 (390)
99 protein:vir:9704 Length: 394 # 98.2 4.8E-07 3E-10 55.2 15.9 263 1-343 125-394 (394)
100 protein:vir:81160 Length: 371 98.2 5.1E-07 3.2E-10 55.1 16.0 266 1-334 91-371 (371)
101 protein:vir:101607 Length: 379 98.2 6.8E-07 4.2E-10 54.4 16.4 265 1-335 106-379 (379)
102 protein:vir:107593 Length: 392 98.2 8.6E-07 5.3E-10 53.8 16.9 277 1-328 103-392 (392)
103 protein:vir:102082 Length: 392 98.2 8.6E-07 5.3E-10 53.8 16.9 277 1-328 103-392 (392)
104 protein:vir:102873 Length: 392 98.2 8.6E-07 5.3E-10 53.8 16.9 277 1-328 103-392 (392)
105 protein:vir:105004 Length: 392 98.2 8.6E-07 5.3E-10 53.8 16.9 277 1-328 103-392 (392)
106 protein:vir:94673 Length: 419 98.1 4.5E-07 2.8E-10 55.4 14.9 275 1-339 121-419 (419)
107 protein:vir:94576 Length: 347 98.1 3.6E-07 2.2E-10 55.9 14.3 295 1-338 1-347 (347)
108 protein:vir:4226 Length: 326 # 98.1 4.1E-07 2.6E-10 55.6 13.6 284 1-337 17-326 (326)
109 protein:vir:1433 Length: 435 # 98.0 9.8E-07 6.1E-10 53.5 15.3 289 1-351 126-435 (435)
110 protein:vir:4456 Length: 401 # 98.0 2.1E-06 1.3E-09 51.7 16.9 280 1-334 107-401 (401)
111 protein:vir:3364 Length: 347 # 98.0 6.3E-07 3.9E-10 54.6 13.8 300 1-342 1-347 (347)
112 protein:vir:4092 Length: 390 # 98.0 4.9E-07 3.1E-10 55.2 13.1 294 1-342 84-390 (390)
113 protein:vir:102119 Length: 404 98.0 1.1E-06 7.1E-10 53.2 14.8 283 1-339 110-404 (404)
114 protein:vir:80376 Length: 435 98.0 1.3E-06 8.3E-10 52.8 14.9 286 1-351 130-435 (435)
115 protein:vir:4511 Length: 409 # 98.0 1E-06 6.3E-10 53.4 14.0 277 1-335 111-409 (409)
116 protein:vir:105038 Length: 428 98.0 8.6E-07 5.3E-10 53.9 13.5 286 1-349 118-428 (428)
117 protein:vir:96762 Length: 632 97.9 3.7E-07 2.3E-10 55.9 11.3 269 1-334 347-632 (632)
118 protein:vir:1541 Length: 347 # 97.9 2.3E-06 1.4E-09 51.5 15.3 305 1-342 1-347 (347)
119 protein:vir:81227 Length: 413 97.9 1.7E-06 1.1E-09 52.2 14.3 278 1-338 116-413 (413)
120 protein:vir:962 Length: 397 # 97.9 1.4E-05 8.8E-09 47.2 19.0 262 1-334 129-397 (397)
121 protein:vir:95376 Length: 425 97.8 2.4E-06 1.5E-09 51.4 14.0 275 1-335 136-425 (425)
122 protein:vir:3136 Length: 322 # 97.8 5.6E-06 3.5E-09 49.4 15.7 283 1-341 1-322 (322)
123 protein:vir:2201 Length: 345 # 97.8 1.9E-06 1.2E-09 52.0 13.0 290 1-356 1-345 (345)
124 protein:vir:1268 Length: 397 # 97.8 9.8E-06 6.1E-09 48.0 16.8 266 1-334 120-397 (397)
125 protein:vir:99920 Length: 311 97.7 2.2E-06 1.3E-09 51.6 12.4 286 1-334 1-311 (311)
126 protein:vir:4197 Length: 314 # 97.7 7.5E-06 4.7E-09 48.7 14.9 295 1-335 4-314 (314)
127 protein:vir:1084 Length: 437 # 97.7 1.3E-05 8.3E-09 47.3 16.2 272 1-341 156-437 (437)
128 protein:vir:6324 Length: 335 # 97.6 7.1E-06 4.4E-09 48.8 12.9 300 1-334 1-335 (335)
129 protein:vir:78935 Length: 335 97.5 1.1E-05 6.9E-09 47.7 13.8 295 1-334 1-335 (335)
130 protein:vir:101650 Length: 497 97.5 2E-05 1.2E-08 46.4 15.1 299 1-335 151-497 (497)
131 protein:vir:7855 Length: 497 # 97.5 2E-05 1.2E-08 46.4 15.1 299 1-335 151-497 (497)
132 protein:vir:99675 Length: 324 97.5 9.7E-06 6E-09 48.1 12.5 279 48-345 1-324 (324)
133 protein:vir:2685 Length: 387 # 97.4 1.3E-05 8.3E-09 47.3 12.6 266 1-339 115-387 (387)
134 protein:vir:96978 Length: 387 97.4 1.3E-05 8.3E-09 47.3 12.6 266 1-339 115-387 (387)
135 protein:vir:94424 Length: 387 97.4 1.3E-05 8.3E-09 47.3 12.6 266 1-339 115-387 (387)
136 protein:vir:93881 Length: 387 97.4 1.8E-05 1.1E-08 46.6 13.0 264 1-339 118-387 (387)
137 protein:vir:9361 Length: 402 # 97.3 1.5E-05 9E-09 47.1 12.0 266 1-339 130-402 (402)
138 protein:vir:4159 Length: 315 # 97.0 0.00019 1.2E-07 41.0 15.1 286 1-352 12-315 (315)
139 protein:vir:3525 Length: 423 # 96.9 0.00025 1.6E-07 40.3 15.7 312 1-367 1-382 (423)
140 protein:vir:100057 Length: 375 96.8 0.0003 1.9E-07 39.9 17.3 294 1-345 1-375 (375)
141 protein:vir:93696 Length: 364 96.7 5.2E-05 3.2E-08 44.1 10.1 314 1-345 1-364 (364)
142 protein:vir:78640 Length: 352 96.7 0.00015 9.4E-08 41.5 12.6 264 1-339 83-352 (352)
143 protein:vir:103323 Length: 364 96.6 0.00049 3E-07 38.7 17.1 316 1-367 1-364 (364)
144 protein:vir:174 Length: 423 # 96.5 0.0006 3.7E-07 38.3 16.3 314 1-367 1-382 (423)
145 protein:vir:80128 Length: 466 95.9 0.00092 5.7E-07 37.2 12.9 300 1-361 144-466 (466)
146 protein:vir:105374 Length: 423 95.7 0.0016 9.7E-07 36.0 15.7 316 1-367 1-382 (423)
147 protein:vir:93616 Length: 645 95.0 0.0029 1.8E-06 34.5 13.6 285 1-335 332-645 (645)
148 protein:vir:97331 Length: 319 94.7 0.0037 2.3E-06 33.9 16.8 296 1-351 1-319 (319)
149 protein:vir:94800 Length: 319 94.7 0.0037 2.3E-06 33.9 16.8 296 1-351 1-319 (319)
150 protein:vir:107120 Length: 329 94.1 0.0053 3.3E-06 33.1 17.3 287 1-353 12-329 (329)
151 protein:vir:105645 Length: 400 93.6 0.0039 2.4E-06 33.8 10.2 316 1-367 1-364 (400)
152 protein:vir:105522 Length: 423 93.6 0.0071 4.4E-06 32.3 17.1 313 1-367 1-382 (423)
153 protein:vir:104439 Length: 404 93.4 0.0015 9.5E-07 36.0 7.7 324 1-343 1-404 (404)
154 protein:vir:819 Length: 404 # 93.4 0.0015 9.5E-07 36.0 7.7 324 1-343 1-404 (404)
155 protein:vir:10123 Length: 404 93.4 0.0015 9.5E-07 36.0 7.7 324 1-343 1-404 (404)
156 protein:vir:3298 Length: 404 # 93.4 0.0015 9.5E-07 36.0 7.7 324 1-343 1-404 (404)
157 protein:vir:3158 Length: 321 # 93.3 0.008 5E-06 32.1 13.7 288 1-339 1-321 (321)
158 protein:vir:97031 Length: 402 93.0 0.009 5.6E-06 31.8 11.7 323 1-367 1-364 (402)
159 protein:vir:105610 Length: 430 92.6 0.0052 3.3E-06 33.1 9.5 328 1-356 1-430 (430)
160 protein:vir:2770 Length: 318 # 92.0 0.0029 1.8E-06 34.5 7.3 263 1-304 1-318 (318)
161 protein:vir:1781 Length: 221 # 88.2 0.034 2.1E-05 28.6 13.1 183 77-291 1-221 (221)
162 protein:vir:79928 Length: 393 88.2 0.034 2.1E-05 28.6 16.5 286 1-348 74-393 (393)
163 protein:vir:7019 Length: 401 # 86.0 0.049 3E-05 27.8 11.2 314 1-367 1-365 (401)
164 protein:vir:95963 Length: 395 79.9 0.1 6.2E-05 26.1 13.6 290 1-346 83-395 (395)
165 protein:vir:79008 Length: 299 79.0 0.11 6.7E-05 25.9 15.4 277 1-367 1-298 (299)
166 protein:vir:95875 Length: 401 72.3 0.18 0.00011 24.6 13.4 303 1-339 1-401 (401)
167 protein:vir:97255 Length: 310 69.7 0.22 0.00014 24.2 18.3 287 1-366 1-310 (310)
168 protein:vir:96490 Length: 348 62.9 0.33 0.0002 23.3 14.3 313 1-367 1-348 (348)
169 protein:vir:4902 Length: 348 # 61.0 0.36 0.00022 23.0 14.8 313 1-367 1-348 (348)
170 protein:vir:79712 Length: 285 43.7 0.83 0.00052 21.0 12.9 268 1-367 1-284 (285)
171 protein:vir:9509 Length: 381 # 41.1 0.94 0.00058 20.7 13.8 281 1-367 76-369 (381)
172 protein:vir:101291 Length: 381 41.1 0.94 0.00058 20.7 13.8 281 1-367 76-369 (381)
173 protein:vir:2736 Length: 348 # 39.5 1 0.00063 20.6 15.2 315 1-367 1-348 (348)
174 protein:vir:96666 Length: 462 38.4 1.1 0.00066 20.4 12.5 307 1-367 1-387 (462)
175 protein:vir:103759 Length: 330 21.9 2.5 0.0016 18.4 8.0 238 1-272 1-330 (330)
176 protein:vir:9643 Length: 377 # 21.4 2.6 0.0016 18.3 14.3 279 1-367 76-377 (377)
177 protein:vir:100632 Length: 381 20.6 2.7 0.0017 18.2 10.7 293 1-343 73-381 (381)

No 1
>protein:vir:80446
Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=100.00 E-value=3.2e-149 Score=834.68 Aligned_cols=367 Identities=100% Similarity=1.520 Sum_probs=362.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+||++|+|+|||+||||++|+.++.+|+++|+|||||+++++|+.++++||++++||||++|+|+++||.++++.+++ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +|+||+++++++++++|+|||+++||+.+++|+|||++|++||++||+|++|++|||+|+|||+++.++++.+++..+.. 
T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~ 160 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccccchhhc Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQLTIPTYM 240 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~~i~t~~ 240 (367) .++..+.+.+|++|||++++++...|++++|++|+++|||++++|++++|||+||++|+|++||+|++++++++.|+||+ T Consensus 161 ~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~~~~i~ty~ 240 (367) T protein:vir:80 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQLTIPTYM 240 (367) T ss_pred ccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCCccccceec Confidence 99999999999999999999888999999999999999999999999999999999999999999999999999999999 Q ss_pred CcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccc Q lcl|Aclame:pro 241 GKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVT 320 (367) Q Consensus 241 G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~ 320 (367) ||||||||+||+..+++.++|+||||++|||+|++++|.+|+|++||++++|++|+|+||+||||++||+|+||++++++ T Consensus 241 G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~~v~ 320 (367) T protein:vir:80 241 GKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVT 320 (367) T ss_pred ceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeeeEEeecceeeecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
cccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 321 IPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 321 ~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) +|+|+.++++++++..|||++||++++||+||||||+||||+||||| T Consensus 321 ~~~~~~~~~~~~~~~~sPt~~eLa~~~NW~~v~d~K~I~iv~~it~g 367 (367) T protein:vir:80 321 IPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) T ss_pred cccccccccccccccCCCChHHhcCCcccccccchhhcceEEEEecC Confidence 99999999999999999999999999999999999999999999999 No 2 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=100.00 E-value=4.8e-129 Score=724.08 Aligned_cols=339 Identities=38% Similarity=0.679 Sum_probs=318.5 Q ss_pred CCCccccccceeccchH--HHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCc-ccccCCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPE--VYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL-EPNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PE--Vf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~-~~~~~~~~~~ 77 (367) ||- |+|+|||+|| ||++|+.++.+|+++|+||||++++++|+.++++||++++||||++|+|+ +++|.++++. T Consensus 1 Ma~----T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~ 76 (349) T protein:vir:94 1 MAI----TTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQ 76 (349) T ss_pred CCc----eEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcc Confidence 994 9999999998 89999999999999999999999999999999999999999999999987 6799999998 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) ++++|.||+++++++++++|+|||+++||+++++|+|||++|++||++||.|++|++|||+|+|+|+++.+++.. 
T Consensus 77 ~~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~----- 151 (349) T protein:vir:94 77 DIATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDA----- 151 (349) T ss_pred cccccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhccccccccc----- Confidence 899999999999999999999999999999999999999999999999999999999999999999998765532 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccc-----cCceeEEEEccHHHHHHHhcchhhhcccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH-----VGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~-----~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g 232 (367) .....+|++|+++.++ ++++.|++|+++|||+ +++|++++|||.||++|+|++||+|++++++ T Consensus 152 -------~~~~~~~~~d~~~~a~-----~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~ 219 (349) T protein:vir:94 152 -------YHEQNDMVVDVSATSG-----FDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAEN 219 (349) T ss_pred -------ccccCceeEEecccCC-----CChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCccc Confidence 2345689999986654 8899999999998876 7899999999999999999999999999999 Q ss_pred cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeee Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGF 312 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~ 312 (367) ++.|++|+||||||||+||+.+++++++|+||||++|||+|++++|++++|++||+++++++|+|+||+||||++||+|| T Consensus 220 ~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~ 299 (349) T protein:vir:94 220 NTMFATYQGYRVIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTWLLHPFGY 299 (349) T ss_pred CcccceecCcEEEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEEEeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
eecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 313 NWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 313 s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) ||+++.++ .++..+.+.|||++||++++||+||||||+||||+||||= T Consensus 300 s~~~a~v~-------~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I~iv~~~~~~ 347 (349) T protein:vir:94 300 SFTSAVIT-------GNGTETIARSASWQDLANAANWNRVVDRKHVPIAFLVTGV 347 (349) T ss_pred eecccccC-------CCccccccCCCChHHhcCCcCcccccChhhcceEEEEecc Confidence 99998764 2334456789999999999999999999999999999998 No 3 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=100.00 E-value=5.5e-129 Score=723.74 Aligned_cols=339 Identities=39% Similarity=0.683 Sum_probs=317.9 Q ss_pred CCCccccccceeccchH--HHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCc-ccccCCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPE--VYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL-EPNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PE--Vf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~-~~~~~~~~~~ 77 (367) ||- |+|+|||+|| ||++||.++.+|+++|+||||++++++|+.++++||++++||||++|+|+ +++|.+|++. 
T Consensus 1 Ma~----T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~ 76 (349) T protein:vir:78 1 MAI----TTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQ 76 (349) T ss_pred CCc----eEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcc Confidence 994 9999999998 89999999999999999999999999999999999999999999999986 6789888877 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) ++++|.||+++++++++++|+|||+++||+++++|+|||++|++||++||.|++|++||++|+|+|+++.++... T Consensus 77 ~~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~----- 151 (349) T protein:vir:78 77 DIATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDA----- 151 (349) T ss_pred cccccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccch----- Confidence 889999999999999999999999999999999999999999999999999999999999999999988665432 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccc-----cCceeEEEEccHHHHHHHhcchhhhcccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH-----VGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~-----~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g 232 (367) .....+|++|+++.++ ++++.|++|+++|||. 
+++|++++|||+||++|++++||+|++++++ T Consensus 152 -------~~~~~~~t~d~s~~a~-----~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~ 219 (349) T protein:vir:78 152 -------YHEQNDMVVDVSATLG-----FDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAEN 219 (349) T ss_pred -------hhhcccceeeeccccC-----CChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCccc Confidence 2345789999987664 8999999999998886 7899999999999999999999999999999 Q ss_pred cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeee Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGF 312 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~ 312 (367) .+.|++|+||||||||+||+.+++++++|+||||++|||+|++++|++++|++||+++++++|+|+||+||||++||+|| T Consensus 220 ~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~ 299 (349) T protein:vir:78 220 NTMFATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTWLLHPFGY 299 (349) T ss_pred CcccceecCeEEEEeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEEEeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 313 NWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 313 s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) ||+++.++. 
+++.+.+.|||++||++++||+||||||+||||+||||= T Consensus 300 s~~~a~v~~-------~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I~iv~~~~~~ 347 (349) T protein:vir:78 300 RFTSAVITG-------NGTETIARSASWQDLANATNWNRVVDRKHVPIAFLVTGV 347 (349) T ss_pred eeccccccC-------CccccccCCCChHHhcCCcCcccccChhhcceEEEEecc Confidence 999987642 234556789999999999999999999999999999998 No 4 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=100.00 E-value=4.3e-121 Score=680.46 Aligned_cols=324 Identities=25% Similarity=0.377 Sum_probs=302.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) || .|+|+|||+||||++||+++++++++|+|||+++++++|+.++++||++++||||++|+|+++++.+++ ++ T Consensus 1 MA----~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~---~i 73 (351) T protein:vir:15 1 MA----ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSD---DI 73 (351) T ss_pred CC----ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCc---cc Confidence 99 499999999999999999999999999999999999999999999999999999999999999998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++|++++++++|++|+|||+++||+.+++|+|||++|++||++||+|++|++||++|+|+|++... 
T Consensus 74 ~~~kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~------------ 141 (351) T protein:vir:15 74 DVNNLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKI------------ 141 (351) T ss_pred chheecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhh------------ Confidence 99999999999999999999999999999999999999999999999999999999999999987533 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-CceeEEEEccHHHHHHHhcchhhhcccccccccchhh Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-GSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQLTIPTY 239 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~~i~t~ 239 (367) ...|++|++..++ +.+.|++++|++|+++|||.. +.|++++|||++|++|++++|++|++++++++.|++| T Consensus 142 -------~~~~~~d~t~~~~-~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~t~ 213 (351) T protein:vir:15 142 -------ANSKVYDQTKVSP-SEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGATPFEAY 213 (351) T ss_pred -------cccceeccccccc-cccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhccccccCccccee Confidence 2468899998875 456799999999999999975 5699999999999999999999999999999999999 Q ss_pred cCcEEEEeCCCcccCCC-CCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccc Q lcl|Aclame:pro 240 MGKVVIVDDGMPVFGTG-ADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDAD 318 (367) Q Consensus 240 ~G~~VivdD~~pv~~t~-~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~ 318 (367) +||||||||+||+..++ ..++|+||+|++|||+|+++++. +|++|++.++ +|+|+||+||||++||+||||+++. 
T Consensus 214 ~G~~VivdD~~p~~~~~~~~~~ytsyl~~~GAi~~~~~~~~--ve~~rd~~~~--~g~d~l~~r~~~~~hp~G~s~~~~~ 289 (351) T protein:vir:15 214 NGLRIVLDDDIEIDLTDKTKPVSTSYIFAPGAVRYSTNMRS--TETKYDPLIN--GGQDVIVQKRVGTIHVAGTSIKASF 289 (351) T ss_pred cceEEEEcCCCccccCCCCCceeEEEEEecceeeeecCCcC--cceeecccCC--CCceEEEEeeeeeeeeeeeeecccc Confidence 99999999999998765 45689999999999999998874 7999999986 7999999999999999999999775 Q ss_pred cccccccccccccccccCCCChHHhcCCccceee--ecccccceEEEEecC Q lcl|Aclame:pro 319 VTIPDNTGSPSGITSGPPAITLANLANPDNWERV--TYRKNVPMAFLVTKG 367 (367) Q Consensus 319 ~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v--~d~K~i~iv~~~t~g 367 (367) + .+++.|||++||++++||+|| ||||+||||+||||= T Consensus 290 ~------------~~~~~sPt~~~L~~~~NW~~v~~~d~k~I~iv~~~~~~ 328 (351) T protein:vir:15 290 S------------PSKASFPTIDELAKSSTWEVVDGIDVRSIGVVAYTAQL 328 (351) T ss_pred c------------ccCcCCcChHHhcCCcccccccCCCccccceEEEEEec Confidence 4 247889999999999999999 899999999999996 No 5 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=100.00 E-value=2.9e-120 Score=675.92 Aligned_cols=326 Identities=30% Similarity=0.535 Sum_probs=299.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+ +.|+|+|||+||||++||+++++++++|+|||+++++++|++++++||++++||||++|+|+++++.+++ +++ T Consensus 1 Ma~--~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~--~~i 76 (330) T protein:vir:10 1 MAN--ELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGD--KAL 76 (330) T ss_pred CCC--CceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCc--ccc Confidence 997 
7799999999999999999999999999999999999999999999999999999999999999997764 358 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +++||+++++++++++|+|||+++||+.+++|+|||++|++||++||.|++|++|||+|+|+|++..+++.... T Consensus 77 ~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~------ 150 (330) T protein:vir:10 77 ETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGAL------ 150 (330) T ss_pred chhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhh------ Confidence 99999999999999999999999999999999999999999999999999999999999999999877654321 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccccchhhc Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQLTIPTYM 240 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~~i~t~~ 240 (367) ...++.|.+. +.+.|++++|++|+++|||+.+.|++++|||++|++|++++||+|++++++++.|++|+ T Consensus 151 -------~~~~~~~~~~----~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~ 219 (330) T protein:vir:10 151 -------EETHVSDQSK----ASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIPTYL 219 (330) T ss_pred -------hhhheecccc----cccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhcccccCccccccc Confidence 1233444432 34579999999999999999999999999999999999999999999999999999999 Q ss_pred CcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCC--cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccc Q lcl|Aclame:pro 241 GKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQ--VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDAD 318 (367) Q Consensus 241 G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~--~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~ 318 (367) ||||||||+||+. 
.++|++|+|++|||+|.+++|+ +++|++|++++ |+|+|++|+||++||+||||+++. T Consensus 220 G~~VivdD~~p~~----~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~----g~~~l~~r~~~~~hp~G~s~~~~~ 291 (330) T protein:vir:10 220 GYRVIIDDGIAPT----GDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAK----GNDMIYTRRALVMHPYGVKWTGAE 291 (330) T ss_pred ceEEEEeCCCCCC----CCceeEEEEecCceeeecccCCccccccccCCccc----cceEEEEeeEEEeeeeeeeecccc Confidence 9999999999975 4799999999999999998864 68999999864 679999999999999999999875 Q ss_pred cccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 319 VTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 319 ~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) + ..++.|||++||++++||+||||||+||||+||||= T Consensus 292 ~------------~~~~~sPt~~~L~~~~NW~~v~~~k~i~iv~~~~~~ 328 (330) T protein:vir:10 292 V------------DAGNITPSNADLAKFKNWKRVYEPKNIGIIALKHKI 328 (330) T ss_pred c------------ccCcCCcChHHhcCCcCcccccChhhcceEEEEEec Confidence 4 247889999999999999999999999999999997 No 6 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=100.00 E-value=9.6e-119 Score=667.60 Aligned_cols=319 Identities=34% Similarity=0.621 Sum_probs=300.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhh--CCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLS--APGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~--~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) || .|+|+|||+||||++||+++++++++|+|||+++|+++++.+++ +||++++||||++|+|++++|.+++ T Consensus 1 MA----~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~--- 73 (324) T protein:vir:59 1 MA----YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTD--- 73 (324) T ss_pred 
CC----ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCc--- Confidence 99 49999999999999999999999999999999999999999885 4999999999999999999998875 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) ++++++|+++++++++++|+|||+++||+.+++|+|||++|++||++||.|++|++||++|+|+|+++.+ T Consensus 74 ~i~~~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~---------- 143 (324) T protein:vir:59 74 DLVPQKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDM---------- 143 (324) T ss_pred ccchhhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc---------- Confidence 5889999999999999999999999999999999999999999999999999999999999999988643 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccccchh Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQLTIPT 238 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~~i~t 238 (367) ++|++|+|+.+ ...|++++|++|+++|||+.+.|++++|||++|++|++++|++|++++++++.|++ T Consensus 144 ----------~~~~~dvsa~~---~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~ 210 (324) T protein:vir:59 144 ----------KDNKLDISGTA---DGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFPT 210 (324) T ss_pred ----------ccceeeeeccc---cceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhccccccCceeee Confidence 46789998755 35799999999999999999999999999999999999999999999999999999 Q ss_pred hcCcEEEEeCCCccc-CCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeeccc Q lcl|Aclame:pro 239 YMGKVVIVDDGMPVF-GTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDA 317 (367) Q Consensus 239 
~~G~~VivdD~~pv~-~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~ 317 (367) |+||+|||||+||+. .+++.++|+||+|++|||+|.++++.+++|++|++. +|+++|++||||++||+||||+++ T Consensus 211 ~~G~~VivdD~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~----~g~~~l~~r~~~~~~p~G~s~~~~ 286 (324) T protein:vir:59 211 YMNKRVIVDDSMPVETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARNAL----GSQDILINRKHFVLHPRGVKFTEN 286 (324) T ss_pred ecccEEEEeCCCCccccCCCCceEEEEEEecCeEEEeecCCCcceecccCcc----ccceEEEEeeEEEeEeeeEEeccc Confidence 999999999999986 456788999999999999999999999999999984 478999999999999999999876 Q ss_pred ccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 318 DVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) .+ ++.|||++||++++||+||||||+||||+||||= T Consensus 287 ~~--------------~~~sPt~~~L~~~~NW~~v~~~k~i~i~~~~~~~ 322 (324) T protein:vir:59 287 AM--------------AGTTPTDEELANGANWQRVYDPKKIRIVQFKHRL 322 (324) T ss_pred cc--------------CCCCCChhhhcCCcccccccCccccceEEEEeec Confidence 54 5789999999999999999999999999999999 No 7 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=100.00 E-value=9.6e-87 Score=492.16 Aligned_cols=316 Identities=15% Similarity=0.127 Sum_probs=248.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhh---cccccccHHHHHHhhCCCceEEeeeeccCCCcc---cccCCC Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFL---SGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE---PNYGSD 74 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~---SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~---~~~~~~ 74 (367) |+.++-+ +|||+++..|++. ++++..++. .|+|+-..+. ..|++++||||++|.|+. 
+++.++ T Consensus 1 m~lsD~~-----vfN~~~~~a~~e~-~~q~~~~fn~as~gai~l~~~~-----~~Gd~~~~pf~~~l~g~~~~~~~~~~~ 69 (325) T protein:vir:95 1 MALSDLA-----VYSEYAYSAFSET-LRQQVDLFNTATGGAIMLQSAA-----HQGDFSDVAFFAKVTGGLVRRRNAYGS 69 (325) T ss_pred Cchhhhh-----hhhhhhhhhhhhh-hhhhHhhhhhcccceeEecccc-----ccCceeeccccccccccccccccCCCC Confidence 8875533 4888888887754 333322222 3555433221 249999999999998854 556443 Q ss_pred CccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) .+++|.||+++++++++++|++||..+|++.++.+.|||.+++++|+++|++..++++|+++.|++... T Consensus 70 ---~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a-------- 138 (325) T protein:vir:95 70 ---GTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSA-------- 138 (325) T ss_pred ---ceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------- Confidence 469999999999999999999999999999999999999866665555444444444444443333211 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccc--ccc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPD--SKG 232 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~--~~g 232 (367) .+..++|++|+|+.++.+...+++++|++|+++|||++++|++++|||+||++|++++|+++++. 
.++ T Consensus 139 ----------~~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g 208 (325) T protein:vir:95 139 ----------LSQVSDVVYDATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGT 208 (325) T ss_pred ----------hcccccceeeeecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCC Confidence 12245889999999987778899999999999999999999999999999999999999999885 456 Q ss_pred cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeee Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGF 312 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~ 312 (367) ...|++|+||||||||+||+.+++.+++|+||+|++|||+|++++|...+..+.+..++.+.+ ...||+|++||+|| T Consensus 209 ~~~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~tf~lhp~G~ 285 (325) T protein:vir:95 209 VNVVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDENIIRT---YQAEWSYNIGVKGF 285 (325) T ss_pred cccccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCCCCccccccccCcccceeee---eeeeeeEEeeccee Confidence 667999999999999999999999999999999999999999998865444333333332233 34678899999999 Q ss_pred eecccccccccccccccccccccCCCChHHhcCCccceeee-cccccceEEEEec Q lcl|Aclame:pro 313 NWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVT-YRKNVPMAFLVTK 366 (367) Q Consensus 313 s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~-d~K~i~iv~~~t~ 366 (367) ||++++ ++.|||++||++++||+||| ++|.+++|.+||| T Consensus 286 sw~~s~---------------~g~sPt~aeL~~~~NW~rv~~~~K~tagv~~~~~ 325 (325) T protein:vir:95 286 AWDKAN---------------GGKSPTDAALFTSTNWDKYATSHKDLAGVVVKTN 325 (325) T ss_pred eeeccc---------------ccCCcChHhhcCCcCcceecCCCccccceeEeeC Confidence 997653 56799999999999999999 5699999999999 No 8 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=2.1e-68 Score=391.63 
Aligned_cols=266 Identities=11% Similarity=0.065 Sum_probs=227.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. |+|+|||+||||++||+++++++++|.+++.+ +. .|.++||++|+||+|+++ |+++++.+++ ++ T Consensus 1 Ma~----T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~--d~---~L~g~~G~ti~~P~~~~i-gdae~~~eg~---~i 67 (270) T protein:vir:95 1 MTQ----TKKANLINPEVLANVVSAQMQNAIRFTPYAVT--DD---TLVGQPGDTITRPKYAYI-GAAEDLQEGV---AM 67 (270) T ss_pred CCc----eehhhhcchHHHHHHHHHHHHhHHhhcccccc--cc---ccCCCCCCEEEeeeecCC-CccccccCCC---cc Confidence 995 99999999999999999999999999776655 32 244679999999999966 8999999886 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++|++++++++|++|+|+|+++||+.+++|+|||+++++|++.||+|+.|++|++.|+|++... 
T Consensus 68 ~~~~lt~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~-------------- 133 (270) T protein:vir:95 68 DTTQMSMTTTKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA-------------- 133 (270) T ss_pred chhhcccchheeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------------- Confidence 899999999999999999999999999999999999999999999999999999999999876431 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cch Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TIP 237 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i~ 237 (367) ...++++.|++|.++|||+.+.+++++|||+++++|+|++++++.++.++.+ .|+ T Consensus 134 ----------------------~~~~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig 191 (270) T protein:vir:95 134 ----------------------TVSADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLV 191 (270) T ss_pred ----------------------ccccCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccc Confidence 1236789999999999999999999999999999999999999988887643 699 Q ss_pred hhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeee-eecc Q lcl|Aclame:pro 238 TYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGF-NWLD 316 (367) Q Consensus 238 t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~-s~~~ 316 (367) +|+|+||||+|++| .+|++|+|++|||++....+ ..+|++||++++ .+.+++|+||.+|+..= ++.. 
T Consensus 192 ~~~G~~Viv~s~~~-------~~~~~~l~~~gAi~~~~~~~-~~vEtdRd~~~~----~d~i~~~~~y~v~~~~~skvv~ 259 (270) T protein:vir:95 192 EIVGVSDIVKSKRV-------SENTAFLQRYGAMEIVNKKK-PEAYTDFDILKR----THLLSTNYHYSVNLKDETGVVK 259 (270) T ss_pred eecceeEEEeCCCC-------CceeEEEEeccceeeeecCC-ceeeeccchhhc----ccEEEeeeEEEEEEEccceEEE Confidence 99999999999986 46899999999999999886 449999999874 69999999999998872 2222 Q ss_pred cccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 317 ADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) .+. +|+.+ + |+ T Consensus 260 ~t~-~~a~~------~---------~~ 270 (270) T protein:vir:95 260 VTF-KPSGS------L---------EM 270 (270) T ss_pred EEe-cCCCC------c---------CC Confidence 111 11111 1 11 No 9 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=100.00 E-value=1.8e-67 Score=386.48 Aligned_cols=293 Identities=17% Similarity=0.110 Sum_probs=223.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHh----hhHhh---cccccccHHHHHHhhCC--CceEEeeeeccCCC--ccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPEL----TAFFL---SGAVASNDFLSQFLSAP--GRLINIPFWRDLDS--LEP 69 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~----~~f~~---SGi~~~~~~l~~~~~~~--G~~i~~P~~~~l~g--~~~ 69 (367) ||- |.++||+ ||++|+.....|+ ...+. +|++ .|.+.| |++...|||+ +.| ... 
T Consensus 1 ~~~----t~~sdl~---vfn~~~~~a~~e~~~~~~~~Fnaas~Gai-------~l~~~~~~GDf~~~~ff~-i~~~~~~r 65 (315) T protein:vir:96 1 MAT----TVNSDLV---IYNDTAQTAYLERNMDNLAVFNENSRAAI-------GLNSELIEGDLKLRSFYK-VGGAIADR 65 (315) T ss_pred Cce----eeeccee---eehhhhhhhHHhhhHHHHHHhhhhcCCcc-------cccccccccccccccccc-cccchhhc Confidence 884 9999974 3555554333333 22222 2222 122333 9999999998 555 345 Q ss_pred ccCCCCccccccccccchhhhhhhhhHhhcc-cchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 70 NYGSDNPNVEAPIDGLGSGEMKTTKTWLNKA-YGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLA 148 (367) Q Consensus 70 ~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg-~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a 148 (367) ||.+++ ++++.||++++++++++.++.+ |..+..+....|.|||..++.....||.+..|..|..+++++|+... T Consensus 66 nv~~~~---~~t~~kit~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~- 141 (315) T protein:vir:96 66 DVNSTA---TVAGTKIAADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIG- 141 (315) T ss_pred ccCCCc---cccceecccccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhc- Confidence 765543 5999999999999998865544 33444444456899999999888899999999999988888886432 Q ss_pred hhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcc Q lcl|Aclame:pro 149 GNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIP 228 (367) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~ 228 (367) .|+.++.. 
++.+.++.++|++|+|+|||++++|++++|||+||++|+||+|+++++ T Consensus 142 ---------------------~~t~~~~~---~~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~ 197 (315) T protein:vir:96 142 ---------------------SNAGMNVS---GELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLY 197 (315) T ss_pred ---------------------cccccccc---ccccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhcc Confidence 23333332 245679999999999999999999999999999999999999998775 Q ss_pred cccccccc---hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE- Q lcl|Aclame:pro 229 DSKGQLTI---PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE- 304 (367) Q Consensus 229 ~~~g~~~i---~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~- 304 (367) ...+...+ ++|+||||||||+||+ |++|+|++|||+|++++|.. +.+.+. +|++.|++||+ T Consensus 198 ~~~~~~~~~~~~~~lGkrViVdD~~P~--------~~~~gl~~GAi~~~~~~~~~-----~~~~~~--~g~e~l~~~~r~ 262 (315) T protein:vir:96 198 EEAGVVVYGGTPGTLGKPVLVTDQCPA--------TKIFGLVAGAVMITESQAPG-----MRSYQI--DDQENLAIGFRA 262 (315) T ss_pred cccceeEecCcCcccccEEEEECCCCc--------ceeeeeecceeeecCCCccc-----cccccC--CCcceeEEEEee Confidence 44433332 6788999999999995 78999999999999988742 222222 36689999877 Q ss_pred ---EEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeec-ccccceEEEEecC Q lcl|Aclame:pro 305 ---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTY-RKNVPMAFLVTKG 367 (367) Q Consensus 305 ---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d-~K~i~iv~~~t~g 367 (367) |++||+||||+.+ ++.|||++||++++||+|||+ .|..+.|.++--| T Consensus 263 e~tf~l~p~G~sw~~~----------------~~~sPt~aeLat~~NWekV~~~~K~tagv~~~~~~ 313 (315) T protein:vir:96 263 EGTANVEVLGYKWKTK----------------TNVNPASATLATTTNWEKYATDDKATAGFIITLTT 313 (315) T ss_pred eeEeeeeeeeEEeecC----------------CCcCCChHHhcCCcCcccccCCCcccceEEEEecC Confidence 9999999999743 567999999999999999996 5999999999999 No 10 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # 
Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=2.5e-62 Score=358.33 Aligned_cols=270 Identities=11% Similarity=0.088 Sum_probs=227.3 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+ ..|+|+|||+||||++||.+++.++++|.+ ++..+.+ +.++||++|+||+|++| |+++++.+++ ++ T Consensus 1 Ma~--~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~--~~~~~~~---l~g~~G~ti~iP~~~~i-gda~~~~eg~---~i 69 (276) T protein:vir:10 1 MAQ--GTTTKSTQIVPEVLAPMMQAELDKKLRFAQ--FADIDST---LVGQPGDTLTFPAFVYS-GDATVVPEGQ---KI 69 (276) T ss_pred CCc--ceeehhhhhchHHHHHHHHHHHHhhhhhcc--cceeccc---ccCCCCCEEEeeeecCC-CccccccCCC---cc Confidence 996 679999999999999999999999999944 4444443 33579999999999999 8899998885 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++|+++++.+++++|+|+|+++|++.+.+++|||+++.+|++.||+++.|+.+++.|++.... 
T Consensus 70 ~~~~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~--------------- 134 (276) T protein:vir:10 70 PVDKIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT--------------- 134 (276) T ss_pred CccccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Confidence 99999999999999999999999999999999999999999999999999999999998763221 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) +++ ..++++.|.+|.++|||+...+.+++|||++|+.|+|+++++|++.++.. . T Consensus 135 --------------~~~------~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (276) T protein:vir:10 135 --------------VSA------DIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKG 194 (276) T ss_pred --------------ccc------cccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceecc Confidence 111 23789999999999999999999999999999999999999999988642 2 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++||+||.+| +|++|+|++|||++...++ ..+|++|++++. 
.+.++.|+|| |..+ T Consensus 195 ~ig~~~G~~Vi~s~~~p--------~~t~~l~~~gAi~~~~~~~-~~vE~dRd~~~~----~d~i~~~~~y-----~~~~ 256 (276) T protein:vir:10 195 AFGEALGAVIVRSKKLD--------EGEAILAKRGAVKLITKRD-FFLETDRDPSTK----TTALYSDKHY-----VAYL 256 (276) T ss_pred ccceecceeEEEcCCCC--------cceEEEEeccceeeeecCC-ceeecccchhhc----ccEEEEeeEE-----EEEE Confidence 48999999999999997 4789999999999988765 559999999874 6899999888 4455 Q ss_pred cccccccccccccccccccccCCCChH Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) .+.+.... .+..++..|+.| T Consensus 257 ~~~~~vv~-------~t~~~~~~~~~~ 276 (276) T protein:vir:10 257 YDESKAVK-------VTKGAGTTDSGA 276 (276) T ss_pred EcCcceEE-------EecCCcCCcCCC Confidence 44432211 122356677777 No 11 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=9.9e-61 Score=349.54 Aligned_cols=268 Identities=10% Similarity=0.108 Sum_probs=223.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+ +.|+|+|||+||||++||.+++.++++| |+++..+.. 
+.++||++|+||+|+++ |+++++.+++ .+ T Consensus 1 m~~--~~T~l~d~i~Pev~~~~v~~~~~~~l~~--~~~~~~~~~---l~g~~G~tv~iP~~~~i-g~a~~~~~g~---~i 69 (274) T protein:vir:95 1 MAQ--GMTKLTNQIVPEVLAPMMQAELEKKLRF--ASFAEIDNT---LVGQPGDTLTFPAFIYS-GDAKVVAEGE---KI 69 (274) T ss_pred CCc--ceeehhheechHHHHHHHHHHHHhhhhc--cccceeccc---ccCCCCCEEEeeeecCC-CccccccCCC---cc Confidence 997 6799999999999999999999887777 666666643 34579999999999988 8888998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++|+++++.++|++++|+|.++|++.+.+++|||+++.+|++.+|+++.|+.|++.+++.... T Consensus 70 ~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~--------------- 134 (274) T protein:vir:95 70 PTDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT--------------- 134 (274) T ss_pred chhhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Confidence 89999999999999999999999999999999999999999999999999999999988763211 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc------ Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL------ 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~------ 234 (367) +++ ..++++.|++|.++|||+...+++++|||.+|+.|+|+++++|++.+++.. 
T Consensus 135 --------------~~~------~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:95 135 --------------VEA------DITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKG 194 (274) T ss_pred --------------ccc------cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceecc Confidence 111 236799999999999999999999999999999999999999999887432 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++||+||++| +|++|+|++|||+|...++ ..+|++|++++ +.|.++.|+||.+|+ T Consensus 195 ~ig~~~G~~Vi~s~~~~--------~~t~~l~~~gA~~~~~~~~-~~vE~~Rd~~~----~~d~i~~~~~y~~~~----- 256 (274) T protein:vir:95 195 AFGEALGAVIVRSNKLE--------AGTAILAKKGAVKLITKRD-FFLETDRDPST----KTTALYSDKHYVAYL----- 256 (274) T ss_pred ccceecCeEEEEeCCCC--------CceEEEEeccceeeeecCC-ccccccccccc----ccCEEEEeEEEEEEE----- Confidence 48999999999999987 5789999999999988665 55999999986 569999999985544 Q ss_pred cccccccccccccccccccccCCCChHHhcCCccceeee Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVT 353 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~ 353 (367) .+ |+-.-..+..+|++-- T Consensus 257 ~~---------------------~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 257 YD---------------------ESKAVKITKGSGSLEM 274 (274) T ss_pred Ec---------------------CCcEEEEEcCCccccC Confidence 22 2222222333333322 No 12 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=9.9e-61 Score=349.54 Aligned_cols=268 Identities=10% Similarity=0.108 Sum_probs=223.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 
MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+ +.|+|+|||+||||++||.+++.++++| |+++..+.. +.++||++|+||+|+++ |+++++.+++ .+ T Consensus 1 m~~--~~T~l~d~i~Pev~~~~v~~~~~~~l~~--~~~~~~~~~---l~g~~G~tv~iP~~~~i-g~a~~~~~g~---~i 69 (274) T protein:vir:96 1 MAQ--GMTKLTNQIVPEVLAPMMQAELEKKLRF--ASFAEIDNT---LVGQPGDTLTFPAFIYS-GDAKVVAEGE---KI 69 (274) T ss_pred CCc--ceeehhheechHHHHHHHHHHHHhhhhc--cccceeccc---ccCCCCCEEEeeeecCC-CccccccCCC---cc Confidence 997 6799999999999999999999887777 666666643 34579999999999988 8888998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++|+++++.++|++++|+|.++|++.+.+++|||+++.+|++.+|+++.|+.|++.+++.... T Consensus 70 ~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~--------------- 134 (274) T protein:vir:96 70 PTDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT--------------- 134 (274) T ss_pred chhhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Confidence 89999999999999999999999999999999999999999999999999999999988763211 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc------ Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL------ 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~------ 234 (367) +++ ..++++.|++|.++|||+...+++++|||.+|+.|+|+++++|++.+++.. 
T Consensus 135 --------------~~~------~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:96 135 --------------VEA------DITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKG 194 (274) T ss_pred --------------ccc------cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceecc Confidence 111 236799999999999999999999999999999999999999999887432 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++||+||++| +|++|+|++|||+|...++ ..+|++|++++ +.|.++.|+||.+|+ T Consensus 195 ~ig~~~G~~Vi~s~~~~--------~~t~~l~~~gA~~~~~~~~-~~vE~~Rd~~~----~~d~i~~~~~y~~~~----- 256 (274) T protein:vir:96 195 AFGEALGAVIVRSNKLE--------AGTAILAKKGAVKLITKRD-FFLETDRDPST----KTTALYSDKHYVAYL----- 256 (274) T ss_pred ccceecCeEEEEeCCCC--------CceEEEEeccceeeeecCC-ccccccccccc----ccCEEEEeEEEEEEE----- Confidence 48999999999999987 5789999999999988665 55999999986 569999999985544 Q ss_pred cccccccccccccccccccccCCCChHHhcCCccceeee Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVT 353 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~ 353 (367) .+ |+-.-..+..+|++-- T Consensus 257 ~~---------------------~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 257 YD---------------------ESKAVKITKGSGSLEM 274 (274) T ss_pred Ec---------------------CCcEEEEEcCCccccC Confidence 22 2222222333333322 No 13 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=6.7e-60 Score=344.99 Aligned_cols=269 Identities=13% Similarity=0.121 Sum_probs=222.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 
MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+++. |+|+|||+||||++||.+++.++++| ++++..+.+ +.++||++|+||+|+++ |+++++.+++ ++ T Consensus 1 ~~~~~~-T~l~d~i~PEv~~~~v~~~~~~~~~~--~~~~~~~~~---l~g~~G~tv~iP~~~~i-g~a~~~~~g~---~i 70 (275) T protein:vir:96 1 MALENM-TKLANMVNPEVLAPMMQAELDKKLKF--AQFADIDNT---LVGQPGNTITFPAFVYS-GDAKVVPEGE---EI 70 (275) T ss_pred CCCccc-chhhhhhchHHHHHHHHHHHHHhhhh--cccceeccc---ccCCCCCEEEeeeeccC-CccccccCCC---Cc Confidence 999884 99999999999999999999999888 555554543 34679999999999988 8889998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.++|++++|+|.++|++.+.+++|||+++.+|++.+|+++.++.|++.|++.... T Consensus 71 ~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~--------------- 135 (275) T protein:vir:96 71 PIDLIETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK--------------- 135 (275) T ss_pred chhhcccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Confidence 89999999999999999999999999999999999999999999999999999999988763211 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccc------cc Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG------QL 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g------~~ 234 (367) ++ ...++++.|++|.++|||+.+.+++++|||.+|..|+|++.++|++.++. 
+- T Consensus 136 --------------~~------~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G 195 (275) T protein:vir:96 136 --------------VE------ADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKG 195 (275) T ss_pred --------------cc------ccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceecc Confidence 11 12378999999999999999999999999999999999999999988753 22 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++||+||.+| +|++|+|++|||++....+ ..+|++|++++. .|.++.|+||.+|+.- T Consensus 196 ~ig~~~G~~Vi~s~~~p--------~~t~~i~~~gA~~~~~~~~-~~vE~~Rd~~~~----~d~i~~~~~y~~~~~~--- 259 (275) T protein:vir:96 196 AFGEALGAIIVRSNKIK--------EGEAILAKRGAVKLITKRD-FFLETERHASHK----STALFSDKHYVAYLYD--- 259 (275) T ss_pred ccceecCeeEEEeCCCC--------cceEEEEeccceeeeecCC-cccccccchhhc----CcEEEEeEEEEEEEEc--- Confidence 58999999999999997 4689999999999988654 569999999874 6999999999775541 Q ss_pred cccccccccccccccccccccCCC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAI 338 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sP 338 (367) .+-++.-..+.++-| - T Consensus 260 -~~~vv~~t~~~~~~~-------~ 275 (275) T protein:vir:96 260 -ESKVVKITKSASGLG-------V 275 (275) T ss_pred -CccEEEEEecccccC-------C Confidence 111111111111111 1 No 14 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=9e-60 Score=344.30 Aligned_cols=268 Identities=11% Similarity=0.120 Sum_probs=220.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 
MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. ..|+|+|||+||||++||.+++.++++| |+++..+.++ .++||++|+||+|+++ |++++|.+++ .+ T Consensus 1 ma~--~~T~l~d~iiPev~~~~v~~~~~~~l~~--~~~~~~d~~l---~g~~G~tv~iP~~~~i-g~a~~~~~g~---~i 69 (274) T protein:vir:12 1 MAQ--GLTKTSNQIIPEVLAPMMQAQLEKKLRF--ASFAEVDSTL---QGQPGDTLTFPAFVYS-GDAQVVAEGE---KI 69 (274) T ss_pred CCc--ceeehhhhhchHHHHHHHHHHHHhhhhh--cccceecccc---cCCCCCEEEEeeecCC-CccccccCCC---cc Confidence 997 6799999999999999999999876665 7777666543 4579999999999988 8899998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.++|++++|+|+++|++.+.+++|||+++.+|++.+|+++.++.+++.+++.. T Consensus 70 ~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~----------------- 132 (274) T protein:vir:12 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------------- 132 (274) T ss_pred chhhcccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc----------------- Confidence 899999999999999999999999999999999999999999999999999999998876411 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc------ Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL------ 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~------ 234 (367) ++++ ...++++.|++|.++|||+...+++++|||.+|+.|+|+++++|++.+++.. 
T Consensus 133 ------------~~~~------~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G 194 (274) T protein:vir:12 133 ------------LTVN------ADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) T ss_pred ------------cccc------ccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecc Confidence 1111 1237899999999999999999999999999999999999999999987432 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++||+||++| +|++|+|++|||++...++ ..+|++|++++ +.|.++.|+||.++ . T Consensus 195 ~ig~~~G~~Vi~s~~~p--------~~t~~l~~~gA~~~~~~~~-~~vE~~Rd~~~----~~d~i~~~~~y~~~-----~ 256 (274) T protein:vir:12 195 AFGEALGAIIVRSNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDAST----KTTALYSDKHYVAY-----L 256 (274) T ss_pred cceeecCeeEEEeCCCC--------cceEEEEeccceeeeecCC-ceeccccchhh----cccEEEeeeEEEEE-----E Confidence 48999999999999998 4789999999999988665 55999999986 46899999998544 4 Q ss_pred cccccccccccccccccccccCCCChHHhcCCccceeee Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVT 353 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~ 353 (367) .+.+.. 
-.-+...|..-- T Consensus 257 ~~~~~v---------------------v~~t~~~~~~~~ 274 (274) T protein:vir:12 257 YDESKA---------------------VKITKGSGSLEM 274 (274) T ss_pred EcCCce---------------------EEEEcCCccccC Confidence 332211 111111111111 No 15 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=8.6e-59 Score=338.91 Aligned_cols=263 Identities=14% Similarity=0.155 Sum_probs=222.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. ..|+|+|+|+||||++||.+++.++++|.+++++ +.. +.++||++|+||+|+++ |+++++.+++ ++ T Consensus 1 ma~--~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~--~~~---l~g~~G~ti~iP~~~~~-gda~~~~eg~---~i 69 (272) T protein:vir:36 1 MSK--QKTTLADLVNPEVLAPIVSYELNKALRFAPLAQV--DTT---LQGQPGNTLKFPAFTYI-GDAADVAEGG---EI 69 (272) T ss_pred CCC--cceehhhhhchHHHHHHHHHHHHhhhhhcccccc--ccc---cccCCCCEEEEeeeccC-ccccccCCCC---cc Confidence 996 6799999999999999999999998888554444 433 34569999999999988 8889998885 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.+++++++|+|.++|++.+.+++|||+++.+|++.+|+++.|+.|++.|+|.... 
T Consensus 70 ~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~--------------- 134 (272) T protein:vir:36 70 SLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQT--------------- 134 (272) T ss_pred ChhhcCCcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------------- Confidence 99999999999999999999999999999999999999999999999999999999988763221 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc-----cc Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ-----LT 235 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~-----~~ 235 (367) ....++++.+++|.++|||....+++++|||++|..|+|+..+++...+.++ .. T Consensus 135 ---------------------~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ 193 (272) T protein:vir:36 135 ---------------------VSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 193 (272) T ss_pred ---------------------ccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeec Confidence 1123678999999999999999999999999999999999888887665543 25 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEee---eee- Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVH---PGG- 311 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~h---p~G- 311 (367) |++|+|+|||+||.||.. 
.+.|++|+|++||+++...++ ..+|++|+++++ .|.++.|+||.+| |.| T Consensus 194 ig~~~G~~Vv~s~~~p~~----~~~~~~~~~~~gA~~~~~~~~-~~vE~~R~~~~~----~d~i~~~~~y~~~v~~~~~v 264 (272) T protein:vir:36 194 YADVLGAQIVRSKKLAEG----SALMFKIVSNSPALKLVLKRG-VQVETDRDIVTK----TTVITADEHYAAYLYDLTKV 264 (272) T ss_pred cceecCeeEEEeCCCCCC----ceeEEEEEecccceeeeecCC-cccccccchhhc----CcEEEEEEEEEEEEEcCccE Confidence 899999999999999963 468999999999999987764 459999999864 5899999999654 555 Q ss_pred eeeccccc Q lcl|Aclame:pro 312 FNWLDADV 319 (367) Q Consensus 312 ~s~~~~~~ 319 (367) +..+-+.| T Consensus 265 v~~t~~g~ 272 (272) T protein:vir:36 265 VNITFTGV 272 (272) T ss_pred EEEeecCC Confidence 34333222 No 16 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=3.1e-57 Score=330.39 Aligned_cols=268 Identities=12% Similarity=0.137 Sum_probs=219.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. 
..|+|+|||+||||++||.+++.+++ ++++++..+.++ .++||++|+||+|+++ |++++|.+++ .+ T Consensus 1 ma~--~~T~~~d~iiPev~~~~v~~~~~~~l--~~~~~~~~d~~l---~g~~G~tv~iP~~~~~-g~a~~~~~g~---~i 69 (274) T protein:vir:97 1 MPQ--GLTKTSDQIIPEVLAPMMQAQLEKKL--RFASFAEVDSTL---QGQPGDTLTFPAFVYS-GDAQVVAEGE---KI 69 (274) T ss_pred CCc--cceehhheechHHHHHHHHHhhhhhh--hhcccceecccc---cCCCCCEEEEeeecCC-CccccccCCC---cc Confidence 997 67999999999999999999997765 457787777544 3579999999999987 8899998875 48 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.++|++++|+|+++|++.+.+++|||+++.+|++.+|+++.++.+++.|++.- T Consensus 70 ~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~----------------- 132 (274) T protein:vir:97 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------------- 132 (274) T ss_pred cccccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC----------------- Confidence 899999999999999999999999999999999999999999999999999999999886521 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) ..++ ...++++.|++|.++|||+...+++++|||.+|..|+|+++++|++.++.. 
- T Consensus 133 ------------~~~~------~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:97 133 ------------LTVN------ADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) T ss_pred ------------cccc------ccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceecc Confidence 0111 124789999999999999999999999999999999999999999988732 2 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++|++||++| +|++|+|++|||++...++ ..+|++|++++. .|.++.|+||.+ +. T Consensus 195 ~ig~~~G~~Vi~s~~~p--------~~t~~l~~~gA~~~~~~~~-~~vE~~Rd~~~~----~d~i~~~~~y~~-----~~ 256 (274) T protein:vir:97 195 AFGEALGAIIVRTNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDASTK----TTALYSDKHYVA-----YL 256 (274) T ss_pred ccceecCeeEEEcCCCC--------cceEEEEeCcceEeeecCC-ceeccccchhhc----ccEEEEEEEEEE-----EE Confidence 48999999999999998 4789999999999988765 459999999874 588999988855 33 Q ss_pred cccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .+.+.... 
...+-+-|+- T Consensus 257 ~~~~~vv~-------------~t~~~~~~~~ 274 (274) T protein:vir:97 257 YDESKAVK-------------ITKGSGSLEM 274 (274) T ss_pred EcCCceEE-------------EecCcccccC Confidence 33221100 0011111111 No 17 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=3.1e-57 Score=330.39 Aligned_cols=268 Identities=12% Similarity=0.137 Sum_probs=219.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. ..|+|+|||+||||++||.+++.+++ ++++++..+.++ .++||++|+||+|+++ |++++|.+++ .+ T Consensus 1 ma~--~~T~~~d~iiPev~~~~v~~~~~~~l--~~~~~~~~d~~l---~g~~G~tv~iP~~~~~-g~a~~~~~g~---~i 69 (274) T protein:vir:94 1 MPQ--GLTKTSDQIIPEVLAPMMQAQLEKKL--RFASFAEVDSTL---QGQPGDTLTFPAFVYS-GDAQVVAEGE---KI 69 (274) T ss_pred CCc--cceehhheechHHHHHHHHHhhhhhh--hhcccceecccc---cCCCCCEEEEeeecCC-CccccccCCC---cc Confidence 997 67999999999999999999997765 457787777544 3579999999999987 8899998875 48 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.++|++++|+|+++|++.+.+++|||+++.+|++.+|+++.++.+++.|++.- T Consensus 70 ~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~----------------- 132 (274) T protein:vir:94 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------------- 132 (274) T ss_pred cccccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC----------------- Confidence 
899999999999999999999999999999999999999999999999999999999886521 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) ..++ ...++++.|++|.++|||+...+++++|||.+|..|+|+++++|++.++.. - T Consensus 133 ------------~~~~------~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:94 133 ------------LTVN------ADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) T ss_pred ------------cccc------ccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceecc Confidence 0111 124789999999999999999999999999999999999999999988732 2 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++|++||++| +|++|+|++|||++...++ ..+|++|++++. .|.++.|+||.+ +. T Consensus 195 ~ig~~~G~~Vi~s~~~p--------~~t~~l~~~gA~~~~~~~~-~~vE~~Rd~~~~----~d~i~~~~~y~~-----~~ 256 (274) T protein:vir:94 195 AFGEALGAIIVRTNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDASTK----TTALYSDKHYVA-----YL 256 (274) T ss_pred ccceecCeeEEEcCCCC--------cceEEEEeCcceEeeecCC-ceeccccchhhc----ccEEEEEEEEEE-----EE Confidence 48999999999999998 4789999999999988765 459999999874 588999988855 33 Q ss_pred cccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .+.+.... 
...+-+-|+- T Consensus 257 ~~~~~vv~-------------~t~~~~~~~~ 274 (274) T protein:vir:94 257 YDESKAVK-------------ITKGSGSLEM 274 (274) T ss_pred EcCCceEE-------------EecCcccccC Confidence 33221100 0011111111 No 18 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=1.1e-55 Score=321.82 Aligned_cols=262 Identities=12% Similarity=0.157 Sum_probs=217.3 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. +.|+|+|||+||||++|+.+++.++++| ++++..+.++ .++||++++||+|+.+ |++++|.+++ .+ T Consensus 1 ma~--~~T~~~d~i~Pev~s~~v~~~~~~~~~~--~~~~~~~~~l---~g~~G~tv~ip~~~~~-g~~~~~~~g~---~i 69 (274) T protein:vir:96 1 MAQ--GTTKVSNLIVPEVLAPMMQAELDKKLRF--AQFADIDSTL---VGQPGDTLTFPAFTYS-GDAQVIAEGE---KI 69 (274) T ss_pred CCc--cccchhhhhhhHHHHHHHHHHHHhhhhh--cccccccccc---cCCCCCEEEEEeeccC-CCccccCCCC---cC Confidence 997 5699999999999999999999887766 6666666443 3579999999999966 8899998875 48 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.+++++++|+|.++|++.+.+++|||+++.+|++.+|+++.|+.+++.|++.- T Consensus 70 ~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~----------------- 132 (274) T protein:vir:96 70 PVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT----------------- 132 (274) T ss_pred chhhcccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----------------- Confidence 
899999999999999999999999999999999999999999999999999999999886521 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) . ..+ ...++++.|++|.++|||+...+++++|||.+|+.|+|+++++|++.++.. . T Consensus 133 ----------~--~~~------~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g 194 (274) T protein:vir:96 133 ----------L--TVE------ADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKG 194 (274) T ss_pred ----------C--CcC------cccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeec Confidence 0 011 123679999999999999999999999999999999999999999887632 3 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++||+||.+|. |++|+|++|||++..+.+ ..+|++|+++++ .|.++.|.|| |.+. T Consensus 195 ~ig~~~G~~Vi~s~~~p~--------~t~~l~~~gA~~~~~~~~-~~vE~~Rd~~~~----~d~i~~~~~y-----g~~~ 256 (274) T protein:vir:96 195 AFGEALGAVIVRSNKLNK--------GEALLAKKGAVKLITKRD-FFLEKDRDASRK----STALYSDKHY-----VAYL 256 (274) T ss_pred ccceecCeeEEEcCCCCc--------ceEEEEeCcceeeeecCC-cccccccchhhc----ccEEEEeeEE-----EEEE Confidence 599999999999999983 679999999999988765 458999999864 5889988776 4444 Q ss_pred cccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) ...+. 
+..+||| T Consensus 257 ~~~~~-----------------------------------------vv~~t~~ 268 (274) T protein:vir:96 257 YDESK-----------------------------------------VVKITKG 268 (274) T ss_pred EcCcc-----------------------------------------EEEEEcC Confidence 33211 1111111 No 19 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=2.9e-53 Score=308.63 Aligned_cols=268 Identities=11% Similarity=0.125 Sum_probs=216.6 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. +.|+++|+|+||||++|+.+++.+++.|.+ ++..+.+ +.++||++|+||+|+++ |++++|.+++ .+ T Consensus 1 ma~--~~T~~~~~iiPev~~~~v~~~~~~~~~~~~--~~~~~~~---l~g~~G~tv~ip~~~~~-g~~~~~~eg~---~i 69 (274) T protein:vir:93 1 MPQ--GITKTSNQIIPEVLAPMMQAQLEKKLRFAS--FAEVDST---LQGQPGDTLTFPAFVYS-GDAQVVAEGE---KI 69 (274) T ss_pred CCc--cceehhheechHHHHHHHHHHHHhhhhhcc--ccccccc---ccCCCCCEEEEEeeccC-CCcccccCCC---cc Confidence 997 779999999999999999999999887744 4444433 34579999999999988 7888998875 48 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.+++++++|+|+++|+..+.++.|||+++.+|++.+|+++.++.+++.+++... 
T Consensus 70 ~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~---------------- 133 (274) T protein:vir:93 70 PTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------------- 133 (274) T ss_pred cccccccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------------- Confidence 8999999999999999999999999999999999999999999999999999999988765210 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) .++ ...++++.|++|.++|||+...+.+++|||.++..|+|+++++|++.++.. . T Consensus 134 -------------~~~------~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:93 134 -------------TVN------ADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) T ss_pred -------------ccc------ccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeec Confidence 111 123689999999999999999999999999999999999999999887632 2 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~ 314 (367) .|++|+|++|++||.+| +|++|+|++|||++....+ ..+|++|++++. .+.++.|+||.+ +. 
T Consensus 195 ~ig~~~G~~Vi~s~~~p--------~~t~~l~~~gai~~~~~~~-~~vE~~Rd~~~~----~d~i~~~~~y~~-----~~ 256 (274) T protein:vir:93 195 AFGEALGAIIVRTNKLE--------AGTAILAKKGAVKLILKRD-FFLEVARDASTK----TTALYSDKHYVA-----YL 256 (274) T ss_pred ccceecCeeEEEcCCCC--------cceEEEEeCCeEEEEecCC-cccccccchhhc----ccEEEEEEEEEE-----EE Confidence 48999999999999998 4789999999999988664 559999998764 588888888754 33 Q ss_pred cccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) ...+..... ...-+-|+- T Consensus 257 ~~~~~~v~~-------------t~~~~s~~~ 274 (274) T protein:vir:93 257 YDESKAVKI-------------TKGSGSLEM 274 (274) T ss_pred EcCCceEEE-------------eeCccccCC Confidence 332211000 000000111 No 20 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=1.2e-52 Score=305.31 Aligned_cols=271 Identities=12% Similarity=0.094 Sum_probs=214.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+ ..|+++|+|+||||.+||.+++.++.+|.++..+ +.. 
+.++||++|+||+|+++ |++++|.+++ .+ T Consensus 1 Ma~--~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~--~~~---l~g~~G~tv~ip~~~~~-g~a~~~~~g~---~i 69 (278) T protein:vir:80 1 MAD--LTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPI--DNS---LEGQPGSEITVPKYKYI-GDAQDVAEGA---AI 69 (278) T ss_pred CCC--cceehhheecHHHHHHHHHHHHHHhhhhccccee--ccc---ccCCCCCEEEEeeeccC-CcceeecCCC---cC Confidence 997 6799999999999999999999998888555433 222 33579999999999988 8888998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++++++++++.+++++++++|+++|++.+.++.|||+++++|++.||+|+.|+.|++.++|.+...... T Consensus 70 ~~~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~----------- 138 (278) T protein:vir:80 70 DYSALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGA----------- 138 (278) T ss_pred cccccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----------- Confidence 899999999999999999999999999999999999999999999999999999999998864321110 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCc-eeEEEEccHHHHHHHhcchhhhcccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGS-IAAIAVHSMVYKRMTNNDEIEFIPDSKG------Q 233 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~-l~~~vmhS~v~~~L~k~~li~~~~~~~g------~ 233 (367) ++.. .....++.|++|..+|++.... -+.++|||.+|+.|+|+++++|++.++. 
+ T Consensus 139 --------------~t~~----~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~ 200 (278) T protein:vir:80 139 --------------INIG----LIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVK 200 (278) T ss_pred --------------cccc----hhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceee Confidence 0000 1113467899999998775443 5579999999999999999999987762 1 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeee Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFN 313 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s 313 (367) ..|++|+|++|++||.+| +|++|+|++|||++...++ ..+|++|++++ +.+.++.|+||.+|+. T Consensus 201 G~ig~~~G~~Vi~s~~~p--------~~t~~l~~~gAi~~~~~~~-~~vE~~Rd~~~----~~d~i~~~~~yg~~v~--- 264 (278) T protein:vir:80 201 GAFGELLGWEIVRTKKLA--------DGNALAVKAGALKTFLKRN-LLAESGRDMDH----KLTKFNADQHYAVALV--- 264 (278) T ss_pred ccceeecceeEEEcCCCC--------cceEEEEeccceeeeecCC-cccccccchhh----ccceeeeeeEEEEEEE--- Confidence 248999999999999998 3679999999999988875 45899999986 4689999999866542 Q ss_pred ecccccccccccccccccccccC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPP 336 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~ 336 (367) +.+.+-.- +..++. 
T Consensus 265 --~~~~~v~i-------t~~a~~ 278 (278) T protein:vir:80 265 --DETKAVKV-------VPVAGN 278 (278) T ss_pred --cCcceEEE-------eeccCC Confidence 21110000 001111 No 21 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=2.1e-46 Score=271.02 Aligned_cols=262 Identities=11% Similarity=0.108 Sum_probs=215.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. ..|+++++|+||+|.+|+.+++.+++.| ++++..+.. +.+++|++++||+|+.+ ++++++.|++ .+ T Consensus 1 MA~--~~T~~~~~~iPev~s~~v~~~~~~~~~~--~~~~~~~~~---~~g~~G~tv~iP~~~~~-~~a~~v~eg~---~i 69 (272) T protein:vir:98 1 MAV--GTTKMAQMLDPEVLADMIDAEVGKAIRF--APLAEVDTT---LEGQPGTTLTVPKWDYI-GDAEDVAEGE---AI 69 (272) T ss_pred CCC--ccccchheechHHHHHHHHHHHHHHhhh--hcccccccc---ccCCCCCEEEEEEecCC-CCcccccCCC---cc Confidence 997 4599999999999999999999998877 334433332 23468999999999876 7888998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++.+++.++..+++++++++|.++|+..+.+..||+.++.+|++.+|+++.++.+++.++|.... 
T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~--------------- 134 (272) T protein:vir:98 70 PMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT--------------- 134 (272) T ss_pred cccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------------- Confidence 89999999999999999999999999999999999999999999999999999999987663211 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) .....+++.+++|.++|||....+.+++|||.+|..|+++++.+|.+.++.. - T Consensus 135 ---------------------~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g 193 (272) T protein:vir:98 135 ---------------------VEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSG 193 (272) T ss_pred ---------------------cccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccc Confidence 0112568899999999999999999999999999999999999998876532 2 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeee--- Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGG--- 311 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G--- 311 (367) .+++|+|++|++|+.|| ++++|+|++|||++....+ ..+|++|++.+ +.+.++.|++|.+|+.- T Consensus 194 ~ig~i~G~~Vi~s~~~p--------~~t~~~~~~~a~~~~~~~~-~~ve~~r~~~~----~~~~i~~~~~~~~~v~~~~~ 260 (272) T protein:vir:98 194 VYGEVLGVQIVRSRKCP--------KGTAYMVRKGALRIMLKRN-TMVETDRDITK----AINQIVANKHYGVYLYKAEK 260 (272) T ss_pred cchhhcCeeEEEcCCCC--------cceEEEEcCCeEEEEecCC-ceeeecccccc----ceeEEEEEEEEEEEEEcCCc Confidence 47899999999999998 4679999999999988765 45899999864 57999999998876542 Q ss_pred -eeecccccccccccccccccccccCC Q lcl|Aclame:pro 312 -FNWLDADVTIPDNTGSPSGITSGPPA 337 (367) Q 
Consensus 312 -~s~~~~~~~~~~~~~~~~~~~~~~~s 337 (367) +.++-+ ++++. T Consensus 261 vv~~t~~---------------~a~~~ 272 (272) T protein:vir:98 261 AVKITLK---------------DAAKK 272 (272) T ss_pred eEEEEec---------------ccccC Confidence 222111 12222 No 22 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=2.1e-46 Score=271.02 Aligned_cols=262 Identities=11% Similarity=0.108 Sum_probs=215.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||. ..|+++++|+||+|.+|+.+++.+++.| ++++..+.. +.+++|++++||+|+.+ ++++++.|++ .+ T Consensus 1 MA~--~~T~~~~~~iPev~s~~v~~~~~~~~~~--~~~~~~~~~---~~g~~G~tv~iP~~~~~-~~a~~v~eg~---~i 69 (272) T protein:vir:30 1 MAV--GTTKMAQMLDPEVLADMIDAEVGKAIRF--APLAEVDTT---LEGQPGTTLTVPKWDYI-GDAEDVAEGE---AI 69 (272) T ss_pred CCC--ccccchheechHHHHHHHHHHHHHHhhh--hcccccccc---ccCCCCCEEEEEEecCC-CCcccccCCC---cc Confidence 997 4599999999999999999999998877 334433332 23468999999999876 7888998875 58 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ++.+++.++..+++++++++|.++|+..+.+..||+.++.+|++.+|+++.++.+++.++|.... 
T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~--------------- 134 (272) T protein:vir:30 70 PMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT--------------- 134 (272) T ss_pred cccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------------- Confidence 89999999999999999999999999999999999999999999999999999999987663211 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------c Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------L 234 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------~ 234 (367) .....+++.+++|.++|||....+.+++|||.+|..|+++++.+|.+.++.. - T Consensus 135 ---------------------~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g 193 (272) T protein:vir:30 135 ---------------------VEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSG 193 (272) T ss_pred ---------------------cccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccc Confidence 0112568899999999999999999999999999999999999998876532 2 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeee--- Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGG--- 311 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G--- 311 (367) .+++|+|++|++|+.|| ++++|+|++|||++....+ ..+|++|++.+ +.+.++.|++|.+|+.- T Consensus 194 ~ig~i~G~~Vi~s~~~p--------~~t~~~~~~~a~~~~~~~~-~~ve~~r~~~~----~~~~i~~~~~~~~~v~~~~~ 260 (272) T protein:vir:30 194 VYGEVLGVQIVRSRKCP--------KGTAYMVRKGALRIMLKRN-TMVETDRDITK----AINQIVANKHYGVYLYKAEK 260 (272) T ss_pred cchhhcCeeEEEcCCCC--------cceEEEEcCCeEEEEecCC-ceeeecccccc----ceeEEEEEEEEEEEEEcCCc Confidence 47899999999999998 4679999999999988765 45899999864 57999999998876542 Q ss_pred -eeecccccccccccccccccccccCC Q lcl|Aclame:pro 312 -FNWLDADVTIPDNTGSPSGITSGPPA 337 (367) Q 
Consensus 312 -~s~~~~~~~~~~~~~~~~~~~~~~~s 337 (367) +.++-+ ++++. T Consensus 261 vv~~t~~---------------~a~~~ 272 (272) T protein:vir:30 261 AVKITLK---------------DAAKK 272 (272) T ss_pred eEEEEec---------------ccccC Confidence 222111 12222 No 23 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=1.3e-39 Score=233.82 Aligned_cols=225 Identities=11% Similarity=0.064 Sum_probs=181.0 Q ss_pred HHhhCCCceEEeeeeccCCCcccccCCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHH Q lcl|Aclame:pro 47 QFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVY 126 (367) Q Consensus 47 ~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~y 126 (367) .-.-..|+||++|.| + |+++++.|++ ++++++|++.++.++|++++|||.++|++.+.+++||++++++|++.. T Consensus 1 ~~~~~~Gdtit~P~~--i-Gda~~v~eG~---~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINLANLCEYPND--I-GDAADVAEGG---EISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) T ss_pred CccccCCceEEeccc--c-cchhhhcCCC---cCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHH Confidence 112247999999988 6 8999999986 589999999999999999999999999999999999999999999999 Q ss_pred HhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCcee Q lcl|Aclame:pro 127 WTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIA 206 (367) Q Consensus 127 w~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~ 206 (367) .+++.++++++.|++.... ....++++.+++|.++|||..+... 
T Consensus 75 iA~kvD~di~~~~~~a~l~------------------------------------~~~~~t~d~i~~A~~~fgde~~~~~ 118 (231) T protein:vir:73 75 LANKVDDDLLKAAKTTSQT------------------------------------VSTKANVDGVQAALDIFNDEDAQAY 118 (231) T ss_pred HHHhhhHHHHHhhcccccc------------------------------------ccccccHHHHHHHHHHhccccccce Confidence 9999999999988753211 0123789999999999999999999 Q ss_pred EEEEccHHHHHHHhcchhhhcc--cccc---cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcc Q lcl|Aclame:pro 207 AIAVHSMVYKRMTNNDEIEFIP--DSKG---QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVP 281 (367) Q Consensus 207 ~~vmhS~v~~~L~k~~li~~~~--~~~g---~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~ 281 (367) +++|||+++++|||.--..... ..++ .-.|++++|++|++||.+|.. .+.+..|+..+||+++....... T Consensus 119 vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~----~~~~~~~i~~~gAl~~~~k~~~~- 193 (231) T protein:vir:73 119 VLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEG----SALMFKIVSNSPALKLVLKRGVQ- 193 (231) T ss_pred EEEEcchHHHhhhhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCC----ceeeeeEEeeccceeeeecccce- Confidence 9999999999999963222221 1121 235899999999999999953 35677899999999999887644 Q ss_pred eeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceE Q lcl|Aclame:pro 282 VAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMA 361 (367) Q Consensus 282 ~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv 361 (367) +|++||++.. 
.+.++.++||.+|..= ++ .+| T Consensus 194 vEtdRd~~~k----~~~i~~~~~y~v~l~~----~~-----------------------------------------~vv 224 (231) T protein:vir:73 194 VETDRDIVTK----TTVITADEHYAAYLYD----LT-----------------------------------------KVV 224 (231) T ss_pred eecccccccc----ccEEEEeEEEEEEEEc----Cc-----------------------------------------cEE Confidence 9999999875 5889999999776531 11 123 Q ss_pred EEEecC Q lcl|Aclame:pro 362 FLVTKG 367 (367) Q Consensus 362 ~~~t~g 367 (367) .+..+| T Consensus 225 ~~t~~g 230 (231) T protein:vir:73 225 NITFTG 230 (231) T ss_pred EEEeec Confidence 333344 No 24 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.79 E-value=7.1e-23 Score=142.02 Aligned_cols=278 Identities=12% Similarity=0.067 Sum_probs=175.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhh-cccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFL-SGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~-SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) ||+ .+.|++.|+.+|+++ +++..-...-++|.+ -||....| -.-|++|++|.|.++ |++++|.|+. . T Consensus 1 mAe-~nlt~~~dL~~~~si-dfv~~f~~~i~~L~~~Lgi~r~~p------~a~G~tIt~pK~~~t-gda~dVaEGe---~ 68 (295) T protein:vir:99 1 MAE-KNLNTMADLGDIKSI-DFVNKFSKNINDLLKLLGVTRRET------LTNDLKIQTYKWEVT-LDQTDPGEGE---T 68 (295) T ss_pred CCC-cccccHhhccCceee-hhhHHhhhhHHHHHHHhccccccc------cccCCeEEeeeeeee-cccccccCCc---c Confidence 999 568999999999988 444332222223322 23322211 124999999999988 9999999995 6 Q ss_pred ccccccchh---hhhhhhhHhhcccchhHHHH-HhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSG---EMKTTKTWLNKAYGAMDLTA-ELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIK 155 (367) Q Consensus 80 ~t~~kitt~---~~~a~i~~r~kg~~~tDla~-~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~ 155 (367) |+.+|+++. 
...+.+++.+|+. ||+|. +++++||+++..+|+....+++.++++++.|+.--.. T Consensus 69 Iplskvt~~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t---------- 136 (295) T protein:vir:99 69 IPLSKVTRTKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK---------- 136 (295) T ss_pred cchhhheeeeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee---------- Confidence 889999976 4677788888875 99995 8888999999999999999999999999998641110 Q ss_pred hhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccc-ccc Q lcl|Aclame:pro 156 TRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSK-GQL 234 (367) Q Consensus 156 ~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~-g~~ 234 (367) ++ ++. --.++....++.++|.|..+.-.+++|||+.+++|++..-+++.+.++ |.. T Consensus 137 -------------------~t---g~~-lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~ 193 (295) T protein:vir:99 137 -------------------VK---GVG-LQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMT 193 (295) T ss_pred -------------------ee---hhh-HHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhh Confidence 01 110 013456667778889888888899999999999999999888877754 555 Q ss_pred cchhhcCcE-EEEeCCCcccC---CCCCceEEEEEEec-----ceeeeeccCC-CcceeeeeehhhcCCceeEEEEEccE Q lcl|Aclame:pro 235 TIPTYMGKV-VIVDDGMPVFG---TGADKTYLSILFGG-----AAFGYADGAP-QVPVAVGRRELRGNGSGLEYILERKE 304 (367) Q Consensus 235 ~i~t~~G~~-VivdD~~pv~~---t~~~~~yttyl~~~-----GAi~~~~~~~-~~~~e~~rd~~~~~~~g~~~l~~r~~ 304 (367) -|..++|.. ||++.++|... +..++--..|+=.. ++|.+ -.+ ...+-+.+++...+-..++.+.+-.. 
T Consensus 194 ~L~nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~--~~D~tglIg~~h~~~~~~~t~et~~~~~~~ 271 (295) T protein:vir:99 194 LLKNFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGLFAD--FTDETGLIAAARNRQLSNLTYESVFFGANV 271 (295) T ss_pred hhhhhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhhhhh--ccCcccceEEEeccccceeeehhhhHhHHH Confidence 567899996 99999998532 33333322333322 12221 111 22344555555554444433322100 Q ss_pred -EEeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 305 -WIVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 305 -~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) |-=-+-|+= .+++..+.....+ + T Consensus 272 lfpE~~dgiv--~~tI~~~~~~~~~-----~ 295 (295) T protein:vir:99 272 LFAEIPEGVV--EATIEAAAVPGIG-----G 295 (295) T ss_pred hcccccceEE--EEEEecCcCCCCC-----C Confidence 000111211 0111111111111 2 No 25 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.67 E-value=2.1e-18 Score=117.49 Aligned_cols=264 Identities=11% Similarity=0.042 Sum_probs=174.6 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||-- .|.||+|.+++.+++.+.+.|. .++..+-+ .....|++|++|.|+... ..+...++. 
.+ T Consensus 1 MA~~--------~~~pei~~~~v~~~~~~~lv~~--~l~~~~~~---~~~~~GdTv~ip~~~~~~-~~d~~~~~~---~~ 63 (273) T protein:vir:79 1 MAFN--------NFIPELWSDMLLEEWTAQTVFA--NLVNREYE---GIASKGNVVHIAGVVAPT-VKDYKAAGR---QT 63 (273) T ss_pred Ccch--------hhhHHHHHHHHHHHHHhhccch--hhhhcccc---ccccCCcEEEEeecCccc-ccccccCCC---cc Confidence 9852 2789999999999998887662 22322222 224579999999999874 333233332 36 Q ss_pred cccccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +++.++.+....++. .+..++.++|+-......| ++++.+|.+...++..++.+++.+.+.-... T Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~vD~~i~~~~~~a~~~~------------- 129 (273) T protein:vir:79 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTAL------------- 129 (273) T ss_pred CccccccceEEEEEeeecccceeeccHHHHhhccc-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------- Confidence 677888888888884 4899999999888777777 5679999998889999999888775421100 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHhcc--hhhhccccc-c-- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNND--EIEFIPDSK-G-- 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k~~--li~~~~~~~-g-- 232 (367) ... .+.+. .-.++.|.+|..+|.+.. ..=..++|+|.++..|++.. 
+.+.....+ + T Consensus 130 --------------~~~-~~~~~--~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l 192 (273) T protein:vir:79 130 --------------TGS-APSDA--DDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) T ss_pred --------------ccc-cccch--hhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccce Confidence 000 00111 123578999999998764 23468999999999998863 333332221 1 Q ss_pred -cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeee Q lcl|Aclame:pro 233 -QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGG 311 (367) Q Consensus 233 -~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G 311 (367) +-.|+.+.|+.|+.+..+|.. +.|+++.+.++|+++...- .-+|..|++... +.-..-++.=-+++++|.| T Consensus 193 ~~G~ig~~~G~~i~~s~~lp~~-----~~~~~~a~~~~A~~~a~~~--~~~e~~r~~~~~-~~~v~~~~~yg~~v~~p~~ 264 (273) T protein:vir:79 193 RAGTIGNLLGARIVESNNLRDT-----DDEQFVAFHPSAAAYVSQI--DTVEALRDQDSF-SDRIRALHVYGGKVVRPTG 264 (273) T ss_pred eeeEeeEEeceEEEeccccccc-----CceEEEEEeccceeeeeeh--hhhhcccCcccc-eeeeeeeeeeeeEEecCce Confidence 235789999999999999963 2256788899999886533 357888998654 2222222222345667777 Q ss_pred eeeccccccccccccccccccccc Q lcl|Aclame:pro 312 FNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 312 ~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) +-=-.++ ++ T Consensus 265 vv~~~~~---------------g~ 273 (273) T protein:vir:79 265 VVVFNKT---------------GS 273 (273) T ss_pred EEEEecc---------------CC Confidence 6543332 11 No 26 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.61 E-value=1.8e-17 Score=112.36 Aligned_cols=264 Identities=11% Similarity=0.041 Sum_probs=170.9 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) 
Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||-- .|.||+|.+.+.+++.+.+.|. .++-.+-+.. ...|+++++|.|+.+. ..+-..++. .+ T Consensus 1 MA~~--------~~~pe~~~~~v~~~~~~~lv~~--~l~~~~~~~~---~~~Gdtv~ip~~~~~~-~~d~~~~~~---~~ 63 (273) T protein:vir:10 1 MAFN--------NFIPELWSDMLLEEWTAQTVFA--NLVNREYEGT---ASKGNVVHIAGVVAPT-VKDYKAAGR---QT 63 (273) T ss_pred Ccch--------hhhHHHHHHHHHHHHHhhhccc--hhhccccccc---cccCceEEEeeccccc-ccccccCCC---cc Confidence 8852 3789999999999988877662 2222222111 2469999999999874 332222222 35 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +++.++......++ +.+..++.++|+-......| +.++.+|.+...++..++.+++.+.+.-... T Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~------------- 129 (273) T protein:vir:10 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTAL------------- 129 (273) T ss_pred CccccccceEEEEEeeeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Confidence 67788888877877 45789999999877777666 5679999998888998888888765421100 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHhcc--hhhhccccc-c-- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNND--EIEFIPDSK-G-- 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k~~--li~~~~~~~-g-- 232 (367) ..+ .+.+. .-.++.|.+|..+|.+.. ..=..++++|.+|..|++.. 
+.+.....+ + T Consensus 130 --------------~~~-~~~~~--~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) T protein:vir:10 130 --------------TGS-APTDA--DDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) T ss_pred --------------ccc-cccch--hHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccce Confidence 000 01111 123678999999997764 23478999999999998863 223222211 1 Q ss_pred -cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeee Q lcl|Aclame:pro 233 -QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGG 311 (367) Q Consensus 233 -~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G 311 (367) +-.|+.+.|+.|+.+..+|.. ..++++.+.++|+++...- .-+|..|++... +....-++.=-+++++|.| T Consensus 193 ~~G~ig~i~G~~v~~s~~lp~~-----~~~~~~~~~~~A~~~a~q~--~~~e~~r~~~~~-~~~v~~~~~yg~~v~~~~~ 264 (273) T protein:vir:10 193 RAGTIGNLLGARIVESNNLRDT-----DDEQFVAFHPSAAAYVSQI--DTVEALRDQDSF-SDRIRALHVYGGKVVRPTG 264 (273) T ss_pred eeeeeeEEeceEEEEecccccC-----CccEEEEEeccceeeeeee--ehhhcccCCCcc-eeeeeeeeeeeeeEeccce Confidence 235788999999999999963 2356788899999886533 357888888654 2222222222344557776 Q ss_pred eeeccccccccccccccccccccc Q lcl|Aclame:pro 312 FNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 312 ~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) +-=-.++ ++ T Consensus 265 ~~~l~~~---------------g~ 273 (273) T protein:vir:10 265 VVVFNKT---------------GS 273 (273) T ss_pred EEEEecc---------------CC Confidence 6533222 11 No 27 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.61 E-value=1.8e-17 Score=112.36 Aligned_cols=264 Identities=11% Similarity=0.041 Sum_probs=170.9 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 
(367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||-- .|.||+|.+.+.+++.+.+.|. .++-.+-+.. ...|+++++|.|+.+. ..+-..++. .+ T Consensus 1 MA~~--------~~~pe~~~~~v~~~~~~~lv~~--~l~~~~~~~~---~~~Gdtv~ip~~~~~~-~~d~~~~~~---~~ 63 (273) T protein:vir:10 1 MAFN--------NFIPELWSDMLLEEWTAQTVFA--NLVNREYEGT---ASKGNVVHIAGVVAPT-VKDYKAAGR---QT 63 (273) T ss_pred Ccch--------hhhHHHHHHHHHHHHHhhhccc--hhhccccccc---cccCceEEEeeccccc-ccccccCCC---cc Confidence 8852 3789999999999988877662 2222222111 2469999999999874 332222222 35 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +++.++......++ +.+..++.++|+-......| +.++.+|.+...++..++.+++.+.+.-... T Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~------------- 129 (273) T protein:vir:10 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADMLVDNGTAL------------- 129 (273) T ss_pred CccccccceEEEEEeeeeecceEeecHHHhhhhcc-HHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Confidence 67788888877877 45789999999877777666 5679999998888998888888765421100 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHhcc--hhhhccccc-c-- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNND--EIEFIPDSK-G-- 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k~~--li~~~~~~~-g-- 232 (367) ..+ .+.+. .-.++.|.+|..+|.+.. ..=..++++|.+|..|++.. 
+.+.....+ + T Consensus 130 --------------~~~-~~~~~--~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) T protein:vir:10 130 --------------TGS-APTDA--DDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) T ss_pred --------------ccc-cccch--hHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccce Confidence 000 01111 123678999999997764 23478999999999998863 223222211 1 Q ss_pred -cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeee Q lcl|Aclame:pro 233 -QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGG 311 (367) Q Consensus 233 -~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G 311 (367) +-.|+.+.|+.|+.+..+|.. ..++++.+.++|+++...- .-+|..|++... +....-++.=-+++++|.| T Consensus 193 ~~G~ig~i~G~~v~~s~~lp~~-----~~~~~~~~~~~A~~~a~q~--~~~e~~r~~~~~-~~~v~~~~~yg~~v~~~~~ 264 (273) T protein:vir:10 193 RAGTIGNLLGARIVESNNLRDT-----DDEQFVAFHPSAAAYVSQI--DTVEALRDQDSF-SDRIRALHVYGGKVVRPTG 264 (273) T ss_pred eeeeeeEEeceEEEEecccccC-----CccEEEEEeccceeeeeee--ehhhcccCCCcc-eeeeeeeeeeeeeEeccce Confidence 235788999999999999963 2356788899999886533 357888888654 2222222222344557776 Q ss_pred eeeccccccccccccccccccccc Q lcl|Aclame:pro 312 FNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 312 ~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) +-=-.++ ++ T Consensus 265 ~~~l~~~---------------g~ 273 (273) T protein:vir:10 265 VVVFNKT---------------GS 273 (273) T ss_pred EEEEecc---------------CC Confidence 6533222 11 No 28 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.54 E-value=3e-17 Score=111.18 Aligned_cols=275 Identities=11% Similarity=0.037 Sum_probs=151.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhh-cccccccHHHHHHhhCCCceE-EeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 
MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFL-SGAVASNDFLSQFLSAPGRLI-NIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~-SGi~~~~~~l~~~~~~~G~~i-~~P~~~~l~g~~~~~~~~~~~~ 78 (367) -|+-| .|+..|+-.+. =-+|+.+-...-++|.+ -||...-| + .-|++| ++|.|.++ |+++++.|+. T Consensus 7 ~~e~n-lt~~~dl~~~~-siDf~~~f~~~i~~L~~~LGv~r~~p-l-----a~GstIkt~k~~~y~-gda~dVaEGe--- 74 (296) T protein:vir:98 7 YPEEN-LIKSTDLKYPI-TIDVTNKFQENISKLLEMLGVTRKIS-V-----SEGMTLKTYAGYDVT-LAEGNVPEGE--- 74 (296) T ss_pred cCcCC-Ccchhhhhhhh-hhhhHHHHhhhHHHHHHHhhhccccc-c-----cCCCEEeeccceeee-eccccccCCc--- Confidence 45522 34444442221 11333322222222222 12222111 1 238999 77889988 8999999995 Q ss_pred cccccccchhh---hhhhhhHhhcccchhHHHH-HhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGE---MKTTKTWLNKAYGAMDLTA-ELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 79 ~~t~~kitt~~---~~a~i~~r~kg~~~tDla~-~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) .|+.+|+++.+ ..+.+++.+|+. ||+|. +++++||+++.-+|+....+++..+++++.|++--.. T Consensus 75 ~Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t--------- 143 (296) T protein:vir:98 75 VIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT--------- 143 (296) T ss_pred ccchhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccce--------- Confidence 68899999864 777788999995 99995 8888999999999999999999999999998753211 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL 234 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~ 234 (367) .+.+ ++.-...-+..+.++..+|.|+.+.-.+++|||..++++++..-|.....+ |.. 
T Consensus 144 ------------------~~~t---~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~f-G~t 201 (296) T protein:vir:98 144 ------------------QDAL---GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF-GLT 201 (296) T ss_pred ------------------eeec---hhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccchhhee-chh Confidence 1101 000001113456666688999988899999999999999988755433233 333 Q ss_pred cchhhcCcEEEEeCCCcccC---CCCCceEEEEEEec-c----eeeeeccCCCcceeeeeehhhcCCceeEEEEEccE-E Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFG---TGADKTYLSILFGG-A----AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE-W 305 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~---t~~~~~yttyl~~~-G----Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~-~ 305 (367) -+..++|..||.+.++|... +..++--..|+=.. | +|.+.. -....+-+.+++...+-..++.+.+-.. | T Consensus 202 yl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~-d~tglIGv~h~~~~~~~t~eT~~~~~~~lf 280 (296) T protein:vir:98 202 YLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYG-DPTGYIGMNHFQENTTLTIQTLLVSGMLMY 280 (296) T ss_pred hhhhccccEEEEcCcCCCceEEEeeecceEEEeecccccchhhhhcccc-ccccceEEEeccccceeeehhHhHhHHHhc Confidence 34459999999999999432 22233222222211 1 111100 0111233333333333222222111000 0 Q ss_pred EeeeeeeeecccccccccccccccccccccCCCCh Q lcl|Aclame:pro 306 IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITL 340 (367) Q Consensus 306 ~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~ 340 (367) -=-+-|+= . 
+..+|.- T Consensus 281 pE~~dgiv--------------~-----~tI~~~~ 296 (296) T protein:vir:98 281 PERIDGIV--------------K-----VTLTPGV 296 (296) T ss_pred ccccceEE--------------E-----EEecCCC Confidence 00011111 0 0111111 No 29 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.51 E-value=3.4e-16 Score=105.40 Aligned_cols=318 Identities=15% Similarity=0.094 Sum_probs=174.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |....+.| ....|+||+|...+.+.+.+.+-|.. ++ .+. .+-..+|+++++|.++.. ...++.++. .+ T Consensus 11 ~~~~~~~t-~~~~fiPev~s~~v~~~l~~~lv~~~--l~-~~~---~~~~~~GdTV~ip~~g~~--~a~d~~~g~---~i 78 (381) T protein:vir:80 11 KGSAVDLS-NVQVFIPEVWSSEVRMFRDQKFAALE--AT-KKI---PFEGKKGDLIHIPNISRA--AVYDKQPQT---PV 78 (381) T ss_pred cCcccchh-hHHhhhhHHHHHHHHHHHHHhhhhhh--cc-ccc---cceeecCceEEeeccCcc--eeeeecCCC---cc Confidence 66655434 44567799999999998877766622 22 221 222357999999999865 355666554 47 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +++.++..+...++ +.+..++.++|+-......||+.++.+|.+...+++.++.+++.+..+-.......... 
T Consensus 79 ~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~------ 152 (381) T protein:vir:80 79 NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSY------ 152 (381) T ss_pred cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc------ Confidence 78888888887777 55778899999998888889999999999999999999999988765533211100000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHHHHhc-chhhhcccccc---c Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNN-DEIEFIPDSKG---Q 233 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~L~k~-~li~~~~~~~g---~ 233 (367) ........+. ...++ ......++.|.+|..+|.+..- .=..++++|.+|..|++. ++++..-..+. + T Consensus 153 ----~~~i~~~~~~--~~~t~-~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~ 225 (381) T protein:vir:80 153 ----DTTLGDGTVN--AHLTG-TPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTS 225 (381) T ss_pred ----cccccccccc--ccccc-chhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhc Confidence 0000000000 00011 1223568899999999987532 235899999999999886 34432211111 1 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeec----cCCCcc-eeeeeehhhcCCceeEEEEEccEEEee Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYAD----GAPQVP-VAVGRRELRGNGSGLEYILERKEWIVH 308 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~----~~~~~~-~e~~rd~~~~~~~g~~~l~~r~~~~~h 308 (367) -.|+.+.|++|+++..+|.... +.|.+..|+-.... +.+-.+ ....++..... 
...|.-..+.++.++ T Consensus 226 G~Ig~i~G~~Vv~Sn~lp~~~~------t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~-k~yd~~~~~~~~~~~ 298 (381) T protein:vir:80 226 GVVGTILGMEVIVTTQIGINSL------TGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTG-SASDLAVSLSYFGLP 298 (381) T ss_pred eeeeEEcceEEEeecccccccc------cceeeeccccccccccccccccccccccceeeeeee-eeeceeeeeeeccce Confidence 2478999999999999997432 12334444322111 000000 00001110000 111112222223332 Q ss_pred -eeeeeecccccccccccccccccccccCCCChHHhcCCcccee-eecc----cccceEEEEecC Q lcl|Aclame:pro 309 -PGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER-VTYR----KNVPMAFLVTKG 367 (367) Q Consensus 309 -p~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~-v~d~----K~i~iv~~~t~g 367 (367) ..|..|+-+. ..+|..-..-..-|-. |.+| -.-|=+++++-| T Consensus 299 ~~~g~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (381) T protein:vir:80 299 VFSGAGATAAD-----------------GGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSES 346 (381) T ss_pred eeecceeeecC-----------------CCceeeeehhhhhhhhhcccccccccccceeEeeccc Confidence 2445544332 1222221111222322 1111 111223344444 No 30 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.49 E-value=8.6e-16 Score=103.19 Aligned_cols=315 Identities=13% Similarity=-0.008 Sum_probs=177.2 Q ss_pred CCCcccccc------ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCC Q lcl|Aclame:pro 1 MPDFNNQVR------LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSD 74 (367) Q Consensus 1 Ma~~~~~T~------l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~ 74 (367) |+-.|..|. ...-|+||+|..++.+++.+++.|.+ . .++-+. ....|++|+||.++.. 
...++..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~--~-~~d~~~---~~~~Gdtv~ip~~g~~--~~~d~~~~ 72 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTS--V-VKTWGA---QVKKGDTFHVPRISEL--GVEDKATD 72 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhh--c-cccccc---cccCCceEEEeccCcc--eeeeecCC Confidence 888777765 34457899999999998888776632 1 222211 1245999999998865 35566554 Q ss_pred Cccccccccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFAT 153 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~ 153 (367) . .++++.++..+...++ +.+..++.++|+-......|++.++.+|.+...+++.++.+++.+.+.-...... T Consensus 73 ~---~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~---- 145 (341) T protein:vir:94 73 V---PVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQN---- 145 (341) T ss_pred C---ccccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCc---- Confidence 3 4778888888888888 6788999999999999999999999999999999999999887764321100000 Q ss_pred hhhhhhhhhhhhcchhhcceeecC-cccchhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHhc-chhhhccc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISG-QTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNN-DEIEFIPD 229 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa-~t~~a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k~-~li~~~~~ 229 (367) -....+. .++ ....+.++.|.+|..+|.+.. ..=..++++|.+|..|++. ++.+.... 
T Consensus 146 -----------------~~~~~~~~~t~-~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~ 207 (341) T protein:vir:94 146 -----------------VFSSSNGAITG-NGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFI 207 (341) T ss_pred -----------------cccCccccccC-chhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhcc Confidence 0000000 111 123467788999999997753 2336789999999999887 34433222 Q ss_pred ccc---cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCC-cceeeeeehhhcCCceeEEEEEccEE Q lcl|Aclame:pro 230 SKG---QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQ-VPVAVGRRELRGNGSGLEYILERKEW 305 (367) Q Consensus 230 ~~g---~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~-~~~e~~rd~~~~~~~g~~~l~~r~~~ 305 (367) .++ +-.|+.+.|..|+++..+|..... .|..+.|-.......+. ...+..|... +..+...-|...+.. T Consensus 208 g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~------~~~~~~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~gl~~~~~a 280 (341) T protein:vir:94 208 NNAPIAQGQIGSLMGVRVIRTSLIGNNSAT------GWRNGAPTIAPAEATPGFTGSRYLPKQD-SFTSLPATFTGNSRP 280 (341) T ss_pred ccchhheeeeeeEeceEEEEeccccccccc------cccccccceecccccccccccccccccc-cccccEEEEEEeccc Confidence 222 224788999999999999975311 22233332222221111 1222223211 111222222211111 Q ss_pred -----EeeeeeeeecccccccccccccccccccccCCC-ChHHhcCCcc--ceeeecccccceEEEEecC Q lcl|Aclame:pro 306 -----IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAI-TLANLANPDN--WERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 306 -----~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sP-t~a~L~~~~N--W~~v~d~K~i~iv~~~t~g 367 (367) ++||..+.-..... ....+.-.| .-+++-.+.+ =-+|..|+. 
.|.|++=| T Consensus 281 v~~~k~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~--~v~~~~~~ 338 (341) T protein:vir:94 281 VHTAVMCHMDWAAAVVSKA----------PRVTQSFENREQVWLMVGRQAYGARLYRPLH--AVNIHTTG 338 (341) T ss_pred ccceeeecchhhhcccccc----------ccccccchhhhhhhhhhhhhhhcccccCcce--eEEEecCc Confidence 11221111100000 000000000 1112111221 012344555 57788777 No 31 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.41 E-value=3e-15 Score=100.23 Aligned_cols=275 Identities=10% Similarity=-0.001 Sum_probs=157.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHh-----hhHhh-cccccccHHHHHHhhCCCceEEeee---eccCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPEL-----TAFFL-SGAVASNDFLSQFLSAPGRLINIPF---WRDLDSLEPNY 71 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~-----~~f~~-SGi~~~~~~l~~~~~~~G~~i~~P~---~~~l~g~~~~~ 71 (367) |+.-++ +++++.+++-++....++ ++|.+ -||...-| + .-|.+|+++. |.++ |++.++ T Consensus 1 M~~e~n------l~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~p----l--a~Gt~iktyK~~~~~y~-gda~dV 67 (303) T protein:vir:10 1 MSAENN------LINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIP----M--NVGSALKQYRFKVEDSE-KPNGDV 67 (303) T ss_pred CCCCcC------CcchhhcccceeehhhhhhhhhHHHHHHHhhhhcccc----c--cCCceeeeeeeeceeec-cccccc Confidence 887554 455555554333333331 12211 12221111 1 1377776554 5555 899999 Q ss_pred CCCCccccccccccchh---hhhhhhhHhhcccchhHHHH-HhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhh Q lcl|Aclame:pro 72 GSDNPNVEAPIDGLGSG---EMKTTKTWLNKAYGAMDLTA-ELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNL 147 (367) Q Consensus 72 ~~~~~~~~~t~~kitt~---~~~a~i~~r~kg~~~tDla~-~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~ 147 (367) .|+. .|+.+|+++. ...+.+++.+|+. ||+|. +++++||+++.-+|+....+++..+++++.|+.--... 
T Consensus 68 aEGe---~Iplskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~- 141 (303) T protein:vir:10 68 AEGD---VIPLTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENG- 141 (303) T ss_pred cCCc---ccchhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccc- Confidence 9995 5889999975 4677788889977 99995 88889999999999999999999999999987532110 Q ss_pred hhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhc------cccCceeEEEEccHHHHHHHhc Q lcl|Aclame:pro 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMG------DHVGSIAAIAVHSMVYKRMTNN 221 (367) Q Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~G------D~~~~l~~~vmhS~v~~~L~k~ 221 (367) + +......+++.|-.|+..+- +..+.-.+++|||+.++++++. T Consensus 142 --------------------------~-----~t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~ 190 (303) T protein:vir:10 142 --------------------------K-----RTNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLAN 190 (303) T ss_pred --------------------------c-----cccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhc Confidence 0 00112356778888877653 2233445999999999999988 Q ss_pred chhhhccccc-ccccchhhcCcEEEEeCCCcccC---CCCCceEEEEEEecc----eeeeeccCCCcceeeeeehhhcCC Q lcl|Aclame:pro 222 DEIEFIPDSK-GQLTIPTYMGKVVIVDDGMPVFG---TGADKTYLSILFGGA----AFGYADGAPQVPVAVGRRELRGNG 293 (367) Q Consensus 222 ~li~~~~~~~-g~~~i~t~~G~~VivdD~~pv~~---t~~~~~yttyl~~~G----Ai~~~~~~~~~~~e~~rd~~~~~~ 293 (367) .-+. .+.++ |..-|..++|..||++.++|... +..++--..|+=..| +|.+.. 
-....+-+.+++...+- T Consensus 191 A~i~-~~~t~fG~n~L~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~-D~tglIGv~h~~~~~~~ 268 (303) T protein:vir:10 191 GFIN-STGAQFGVNLLTPYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELSRAFAFAT-DATGFVGVLHDIQPQRL 268 (303) T ss_pred CCcc-hhhhhhhhhhhhhhhcceEEEeccCCCceEEEeeccceEEEEecCchhhhhhhhhcc-ccccceEEEecccccee Confidence 7655 34343 65567789999999999998542 333333223333323 222211 01223444444444433 Q ss_pred ceeEEEEEccE-EEeeeeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 294 SGLEYILERKE-WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 294 ~g~~~l~~r~~-~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) ..++.+.+-.. |-=-+-|+== ++ +.. ...++-|+ T Consensus 269 t~eT~~~~~~~lfpE~~dgiv~--~t-------i~~---~e~~~~~~ 303 (303) T protein:vir:10 269 TSDTIYASAISMFPENIDAVIK--VT-------IKK---DEAGELPS 303 (303) T ss_pred eehhHhHhHHHhcccccceEEE--EE-------Eec---cccCCCCC Confidence 33322221000 0001112110 01 000 11345566 No 32 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=99.30 E-value=4.3e-14 Score=93.85 Aligned_cols=302 Identities=13% Similarity=0.046 Sum_probs=161.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||- .+|+||++.+.+.+.+.+.+.|.+ ++-.+ .-..+.+..|++|+||.++......-+...+.....+ T Consensus 1 Ma~--------~~~~p~~~a~~~l~~l~~~lv~~~--lv~~~-~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:99 1 MAN--------AFSKPTAVVDTAIQMLQNELILTN--LVWLN-GIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNL 69 (392) T ss_pred Ccc--------ccccHHHHHHHHHHHHHhhccchh--hhccc-cccccccCCCCeEEEeecccccceeeeccccccCCcc Confidence 883 249999999999999888877722 22121 
1122233579999999988764322222222223457 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +++.++..+...++ +.+..++.++|+-..+...|++.++.+|.+...+++.+..+++.+.+.-..... T Consensus 70 ~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~----------- 138 (392) T protein:vir:99 70 TVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAG----------- 138 (392) T ss_pred cccccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc----------- Confidence 77788887777777 778999999999999999999999999999888998888888877653211100 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-CceeEEEEccHHHHHHHhc-chhhhcccccc----- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-GSIAAIAVHSMVYKRMTNN-DEIEFIPDSKG----- 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~~l~~~vmhS~v~~~L~k~-~li~~~~~~~g----- 232 (367) +.+. ......++.|++|.++|.+.. ..=+.+++.|..+..|.+. +++.+....+. T Consensus 139 -----------------~~~~-~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l 200 (392) T protein:vir:99 139 -----------------AVHE-VAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSAL 200 (392) T ss_pred -----------------cccc-cChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhh Confidence 0000 011245788999999987642 2237899999999999887 45544333221 Q ss_pred -cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeee Q lcl|Aclame:pro 233 -QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGG 311 (367) Q Consensus 233 -~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G 311 (367) +-.|+.+.|+.|+.+..+|... .+.+.+.++.+....|..+.-....... 
.+..-+-.| ++..-.+ T Consensus 201 ~~G~vg~i~G~~v~~s~~~~~~t--------~~a~~~~a~~~at~a~v~~~~~~~~~s~---s~~~~v~~~--~~~~~~~ 267 (392) T protein:vir:99 201 QEARLGRIYGYEIVESTLIPHGD--------AYLYHPTAFIMATRAPAPPMGAVRSTAI---SGDQRIAMR--WLVDYDS 267 (392) T ss_pred hcceeeeeeeeEEEeeccccccc--------ceeeeccccccccccccccccccceeEE---ecccceecc--eeecccc Confidence 2357889999999999988642 2444455554444443322111110000 000001111 1111111 Q ss_pred eeecccccccccccccccccccccCCCChHHhcCCccceeeeccccc--ceEEEEecC Q lcl|Aclame:pro 312 FNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNV--PMAFLVTKG 367 (367) Q Consensus 312 ~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i--~iv~~~t~g 367 (367) ....+..... +.. +...-+ -..+......+..+.. ++....-.. T Consensus 268 t~~s~~~~v~---~~~------g~~~v~---~~~~~~~~~~~~~~~~~~~v~v~~v~~ 313 (392) T protein:vir:99 268 TITSNRSLID---TYF------GLKVVE---DPNGVGFVRARKIHLIPGSIEVAPEAG 313 (392) T ss_pred eeeccccccc---eeE------EEEEEe---eccccceeeeeeeeeecceeeeeeeec Confidence 1111110000 000 000000 0001111111111110 000000001 No 33 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.12 E-value=4.3e-12 Score=82.92 Aligned_cols=276 Identities=13% Similarity=0.094 Sum_probs=155.6 Q ss_pred CCCcc--------ccccceeccc-hHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCc----eEE----eeeecc Q lcl|Aclame:pro 1 MPDFN--------NQVRLVDAVI-PEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGR----LIN----IPFWRD 63 (367) Q Consensus 1 Ma~~~--------~~T~l~d~i~-PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~----~i~----~P~~~~ 63 (367) |--.. .+=+++|+++ |+++-.++.+.+ ++ .|+. +.++.+.|. .+. 
.|+| T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~-~~-~~ia----------d~lf~~~~a~~~~~v~f~~~~p~~-- 66 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMM-VN-QFIS----------ESLFRNGGANPNGVVAYNEGNPSF-- 66 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHH-hc-cchh----------hhhhhcccccccceeEEEeccccc-- Confidence 33211 2234677776 998877775543 32 2321 223333222 222 2555 Q ss_pred CCCcccccCCCCccccccccccch-hhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHH- Q lcl|Aclame:pro 64 LDSLEPNYGSDNPNVEAPIDGLGS-GEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVG- 141 (367) Q Consensus 64 l~g~~~~~~~~~~~~~~t~~kitt-~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~G- 141 (367) +.++.+.+.|+ .+++....+. ...+++++|+|+++.++|++..-.+.|+++...+|++.-..|..++.++..|.- T Consensus 67 ~~~d~e~VaEg---gEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa 143 (318) T protein:vir:10 67 LEDDVADVAEF---GEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSP 143 (318) T ss_pred ccCcHhhccCc---ccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 35788888887 4677777777 556667789999999999999999999999999999999999999998876521 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC----ceeEEEEccHHHHH Q lcl|Aclame:pro 142 VYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG----SIAAIAVHSMVYKR 217 (367) Q Consensus 142 vf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~----~l~~~vmhS~v~~~ 217 (367) ..... +.. + .-...+....|+..+-....+ ....++. ..+|.... ....++||+..++. 
T Consensus 144 ~t~~~-~~s-----------~-~w~~~~~~~~d~~~A~e~v~~--a~~~~~~--a~~~~~~~~~GY~pdtIVlhP~~~~~ 206 (318) T protein:vir:10 144 IVPTL-AVP-----------T-AWDNGGKVRTDIAIAIEQIST--AAPTAYP--AGVGSSDEYFGFIPDTIVMHYALLPI 206 (318) T ss_pred ccccc-cCC-----------c-CCCCcccccccchhhhhhhhh--hhhhhhh--hhhhhhhhccCccceeeEECHHHHHH Confidence 11100 000 0 000011112232211000000 0001111 11232222 35699999999999 Q ss_pred HHhcchh-hhccccccc---------ccc-hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCccee--e Q lcl|Aclame:pro 218 MTNNDEI-EFIPDSKGQ---------LTI-PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVA--V 284 (367) Q Consensus 218 L~k~~li-~~~~~~~g~---------~~i-~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e--~ 284 (367) |.++..+ ++.. .+++ -.| +.++|++||+|..+|-. +.|++-.|.+|+-. +..|++ . T Consensus 207 l~~n~~~~~~y~-~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~--------~alvlq~g~vG~~~--d~~pl~~t~ 275 (318) T protein:vir:10 207 LMDNENFMKVYE-RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPID--------RVLIMERGTVGFYS--DTRPLQFTA 275 (318) T ss_pred Hhcchhhhhhhh-ccchhhhhcccccccccceeeceEEeecCccCCC--------eeEEEecCCcceee--ccccceeee Confidence 9888543 2211 1111 112 35689999999999953 26999999998754 334433 3 Q ss_pred eeehhhcCCcee-EEEEEc-----cEEEeeeeeeeecccccccccccccccccccccCCC Q lcl|Aclame:pro 285 GRRELRGNGSGL-EYILER-----KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAI 338 (367) Q Consensus 285 ~rd~~~~~~~g~-~~l~~r-----~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sP 338 (367) -|.....-.+|. +..++| --++..|+.+-|-. 
+-.+| T Consensus 276 ~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~it-----------------gi~~~ 318 (318) T protein:vir:10 276 LYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLT-----------------GIVTP 318 (318) T ss_pred cccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEe-----------------eccCC Confidence 343211111222 222222 33456899999963 34455 No 34 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.90 E-value=2.9e-10 Score=72.90 Aligned_cols=283 Identities=9% Similarity=0.001 Sum_probs=153.4 Q ss_pred CCCcccccccee--ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d--~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) |...-..+..++ ..+|+.+..-+.+...+.+.|.+-.-+.| .+.......+|.+..-++.+..+.|+.... T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~-------~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~ 73 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVEN-------VTTLTGSRVYEKWTDITGLANIDDEAGKIA 73 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeee-------ccCCcceEEEEeecCCCcceeeecCCcccc Confidence 333222233334 57788887777777777666633211111 112334667777766656677777765422 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) + .++.+-.+.....++.+....++++...-+.-|....+.+++++.+.+..++.++..+... 
T Consensus 74 ~--~~~~~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~---------------- 135 (293) T protein:vir:48 74 D--IDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKL---------------- 135 (293) T ss_pred c--ccccceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccc---------------- Confidence 1 2345555555667777888899998888777888899999999888887776555432110 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hccc-cccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPD-SKGQLT 235 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~-~~g~~~ 235 (367) .......+++.|.++..++......-..++||+..+..|++..--+ ++-. .-.... T Consensus 136 ---------------------~~~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~ 194 (293) T protein:vir:48 136 ---------------------PTKPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPT 194 (293) T ss_pred ---------------------cccccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCC Confidence 0112346788999998888666666678999999999998753111 1111 101123 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEcc---EEEeeee Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILERK---EWIVHPG 310 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~---~~~~hp~ 310 (367) -++++|++|++.++.++...+ .+++ +++||.= ++.+.... ...++..+.....-..++..+.... -.+.||. 
T Consensus 195 ~~~l~G~Pv~~~~~~~~~~~~-~~~~-~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~ 271 (293) T protein:vir:48 195 GYSIAGFAVKEISDRWLPNAS-SGVM-PLYFGDLKQAVTLFDRQ-QMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTE 271 (293) T ss_pred CceecceeeEEecccccCCcc-CCce-EEEEEeccceEEEEEec-ceEEEEecccchhhhcCeEEEEEEEeeCcEEeccc Confidence 358999999887766554322 3333 3455532 23222221 1223333322111112333333322 2345777 Q ss_pred eeeecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 311 GFNWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 311 G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .|..-+-+. ..+..+|..-.+- T Consensus 272 a~~~l~~~~-------------~~~~~~~~~~~~~ 293 (293) T protein:vir:48 272 AFVPASFKA-------------IADQKGNIGSTAV 293 (293) T ss_pred ceEEEEeec-------------cccCCccccccCC Confidence 775543110 0111111111111 No 35 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.84 E-value=3.1e-10 Score=72.71 Aligned_cols=281 Identities=8% Similarity=-0.027 Sum_probs=147.9 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+-. |. ...++|+.+..-+.+.+.+.+.+.+-+-. ...++..+++|.+..- +.+.-+.|+. 
.+ T Consensus 1 m~t~---t~-gg~liP~~~~~~ii~~l~~~s~i~~l~~~---------~~~~~~~~~ip~~~~~-~~a~wv~E~~---~~ 63 (303) T protein:vir:97 1 MGTE---TS-KASLFDKHLVSDLINKVKGHSSLAKLSSQ---------KPIPFNGSKEFTFTLD-SDIDVVAENG---KK 63 (303) T ss_pred Cccc---CC-CCeEcchhHHHHHHHHHHhhchhhhhcce---------eecCCCceEEEEEecC-cceEEeecCc---cc Confidence 9851 22 34566666655566666666555332211 1235677899998654 5667777764 35 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhh---cccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELA---GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~---g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) +..+.+-.+.....++.+.-..++++-...+ ..+.++++.+++++...+..+..+| .|.-+......... T Consensus 64 ~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l---~G~~~~~g~~~~~~---- 136 (303) T protein:vir:97 64 THGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAM---HGINPRTKKASDVI---- 136 (303) T ss_pred cccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhh---cccccCCccccccc---- Confidence 5556655555566667777778887755333 3356778999998777776665444 33211111100000 Q ss_pred hhhhhhhhcchhhcceeecCc-ccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccc--c Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQ-TNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK--G 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~-t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~--g 232 (367) ........... .........++.+.++..++-+.......++||+..+..|++..-- .++-..+ . 
T Consensus 137 ----------~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~ 206 (303) T protein:vir:97 137 ----------GTNHFDSKVTQVVKFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAW 206 (303) T ss_pred ----------cccccccccccccccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccC Confidence 00000000000 0111233567899999888866666778899999999999865211 1221111 1 Q ss_pred cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhc-CC-------ceeEEEEEc Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRG-NG-------SGLEYILER 302 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~-~~-------~g~~~l~~r 302 (367) .....+++|++|++++.||-......++. .++||. .++.++.-. .++.++.+... .+ ..+..+... T Consensus 207 ~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~-~~~~Gdf~~~~~~~~~~---~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~ 282 (303) T protein:vir:97 207 GANPDSINGLKSSVNTTVGAGADEAESKD-LVIIGDFESMFKWGYAK---QIPMEIIKYGDPDNSGKDLKGYNQIYLRAE 282 (303) T ss_pred CCCCceecceeeEEecccCCccccCCCcc-EEEEeeccccEEEEEec---CcEEEEeeccCCCCcchhhhhcCcEEEEEE Confidence 12345899999999999996543333333 355654 344444322 23333322110 00 111112111 Q ss_pred c---EEEeeeeeeeecc-ccc Q lcl|Aclame:pro 303 K---EWIVHPGGFNWLD-ADV 319 (367) Q Consensus 303 ~---~~~~hp~G~s~~~-~~~ 319 (367) . -.++||..|.-.. 
+.| T Consensus 283 ~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 283 AYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EEeccEeecccceEEeeCCCC Confidence 1 2355777775432 221 No 36 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.84 E-value=1.8e-10 Score=73.96 Aligned_cols=294 Identities=11% Similarity=0.051 Sum_probs=146.8 Q ss_pred CCCcccc----c---cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MPDFNNQ----V---RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma~~~~~----T---~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) |+..-.. + .-..++.|++...+ .+.+.+.+.+.+ . -....-++..+++|.+..- ..+.-+.| T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~i-i~~~~~~s~l~~------~---~~~~~~~~~~~~~p~~~~~-~~a~~v~E 69 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDY-FAEIEKTSIVQR------I---ARKVPMGPTGISIPHWTGA-VSASWTGE 69 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHH-HHHHHhccchhh------h---cceeeccCCceEEEEEcCC-cceeEecC Confidence 7763111 1 11345777766554 444444444322 1 1122234666889988743 34555666 Q ss_pred CCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH------HHHHHHhhhh Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA------MAVGVYKSNL 147 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla------~l~Gvf~~~~ 147 (367) +. .++..+.+-.+.....++.+.-+.++++...-+..|..+.+.+++++.+.+..++.+|. -..|+++... 
T Consensus 70 g~---~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~ 146 (330) T protein:vir:77 70 AE---RKPITKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETT 146 (330) T ss_pred CC---ccccccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccc Confidence 64 35555666666667778888888999987777777888999999998888888876651 1111111110 Q ss_pred hhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh-- Q lcl|Aclame:pro 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE-- 225 (367) Q Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~-- 225 (367) . ...+......+..+.....++.|.+++.++.........++||+..+..|++..--+ T Consensus 147 ~--------------------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~ 206 (330) T protein:vir:77 147 K--------------------VVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGR 206 (330) T ss_pred c--------------------cceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCc Confidence 0 001111111111122234567888888887777667778999999999998753111 Q ss_pred hc-ccc--c---ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCc---- Q lcl|Aclame:pro 226 FI-PDS--K---GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGS---- 294 (367) Q Consensus 226 ~~-~~~--~---g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~---- 294 (367) ++ +.. . +...-.+++|++|+++|.||-.+. .+++ ..+||.-. +.++... ...++..++..-..+. 
T Consensus 207 ~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~--~~~~-~~~~gd~s~~~i~~~~-~~~i~~~~e~~~~~~~~~~~ 282 (330) T protein:vir:77 207 PLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTV--GNRV-VGVMGDFSQVIWGQIG-GLSFDVTDQATLDFGEEQGG 282 (330) T ss_pred eeecCccccccccccCCceecceeeEEeccccCCCC--CCcc-EEEEEecceEEEEEec-CcEEEEeecceeeecccccc Confidence 11 110 0 111235789999999999985322 2333 24444322 1222221 1122222221100000 Q ss_pred ----eeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 295 ----GLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 295 ----g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) ..-.++.|.. +..+...+-+ | .+.+++++..+..++-| T Consensus 283 ~~~~~~~~~f~~~~--~~~r~~~r~d--------------------------------~-~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 283 VWVPKLISLWQHNM--VAVRCEAEFA--------------------------------F-MVNDKDAFVKLTDQVAG 324 (330) T ss_pred cccccccchhhcCc--EEEEEEEEec--------------------------------c-EEecccceEEEEeccCC Confidence 0000011000 0000100000 0 02334444444444444 No 37 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.75 E-value=9.8e-10 Score=69.98 Aligned_cols=279 Identities=10% Similarity=0.036 Sum_probs=151.8 Q ss_pred CCCcc--ccc--cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFN--NQV--RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~--~~T--~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) |..++ ..| .-+..++|+.+..-+.+...+.+-|.+ +......++..+++|.+..- ..+..+.|+.. 
T Consensus 21 ~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~---------l~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~~ 90 (324) T protein:vir:93 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWADK-PGAYWVGEGQK 90 (324) T ss_pred hhhcccccccccCCCcceechhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEecC-cceeeecCCcc Confidence 22222 112 123446677766666666555554422 11222345677899998744 45667777754 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) ++..+.+-++.....++.+.-..++++...-+..|....+.+++++.+.+..++.+|. |.- ++. ..... T Consensus 91 ---~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~---G~g----~~~-~~~~~ 159 (324) T protein:vir:93 91 ---IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQG----NNP-FGKSI 159 (324) T ss_pred ---ccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---CCC----CCC-cCccc Confidence 5566777777778888899999999988777777889999999999999988886652 211 000 00000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc-- Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL-- 234 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~-- 234 (367) . . ... .........++++.+.++...+.+.......++||+..+..|++. ++.+|.. 
T Consensus 160 ~-----------~--~~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l------~d~~G~~~~ 218 (324) T protein:vir:93 160 A-----------Q--SIE--KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI------VDPETKERI 218 (324) T ss_pred c-----------c--ccc--ccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh------hCCCCCeee Confidence 0 0 000 000011234678999999998877777778999999999999875 3333332 Q ss_pred ---cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcC------------CceeEE Q lcl|Aclame:pro 235 ---TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------------GSGLEY 298 (367) Q Consensus 235 ---~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~------------~~g~~~ 298 (367) .-++++|++|+++++.+... + . .+||. .-+.++...+ ..+++.|+..... ..++.. T Consensus 219 ~~~~~~~l~G~PVv~~~~~~~~~----~--~-i~~gdfs~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~ 290 (324) T protein:vir:93 219 YDRNSDSLDGLPVVNLKSSNLKR----G--E-LITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred cCCCCCcccceeeEeecCCCCCc----c--e-EEEEecceEEEEEecC-cEEEEeecccccccccccccchhhhhcCcEE Confidence 23578999999987765421 1 1 22222 1122333222 2344444432110 011222 Q ss_pred EEEccEE---EeeeeeeeecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 299 ILERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 299 l~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) +-...++ ++||..|.--..- .++..+|.-|. 
T Consensus 291 ~r~~~r~d~~v~~~~a~~~l~~a--------------~~~~~~~~~~~ 324 (324) T protein:vir:93 291 LRATMHVALHIADDKAFAKLVPA--------------DKRTDSVPGEV 324 (324) T ss_pred EEEEEEeccEEecccceEEEecc--------------cccCCCCCCCC Confidence 2222222 3345444322110 01112222222 No 38 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.73 E-value=2.1e-09 Score=68.15 Aligned_cols=292 Identities=9% Similarity=-0.000 Sum_probs=140.9 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+. ..+.-....+|+.+..=+.+.+.+.+.+.+-+- ....++..+++|.+..- ..+.-+.|+.. + T Consensus 1 Ma~--~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~---------~i~~~~~~~~ip~~~~~-~~a~wv~Eg~~---~ 65 (315) T protein:vir:80 1 MAD--DFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSP---------EQPTIFGPVKGAVFSGV-PRAKIVGEGEV---K 65 (315) T ss_pred CCC--CcCCcCceEcchHHHHHHHHHHHhhchhhhhcc---------eeecCCCceEEEEEeCC-cceEEeeCCcc---c Confidence 997 345567788999887777777777666544221 12235667899998753 45556666643 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHH----HHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPM----TRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm----~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) +..+.+-++.....++.+.-..++++...-+..|.. ..|.+++++...+..+..+| .|.-......... 
T Consensus 66 ~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~---~G~~~~~~~~~~~---- 138 (315) T protein:vir:80 66 PSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAF---HGIDPATGKAASA---- 138 (315) T ss_pred cccccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhhee---eccCCCCCccccc---- Confidence 444444444444445555556677776555555543 45666776666555554443 2211000000000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHh-ccccCceeEEEEccHHHHHHHhcchhh-------hcc Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTM-GDHVGSIAAIAVHSMVYKRMTNNDEIE-------FIP 228 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~-GD~~~~l~~~vmhS~v~~~L~k~~li~-------~~~ 228 (367) ... ......+..+ .....+..+.++..++ +.....-.+++||++++..|++...-+ ++. T Consensus 139 -----------~~~-~~~~~~~~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~ 205 (315) T protein:vir:80 139 -----------VHT-SLNKTKNIVD-ATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY 205 (315) T ss_pred -----------ccc-ccccccceee-ccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccc Confidence 000 0000111111 1112356677777665 444445567999999999998874221 111 Q ss_pred cccccccchhhcCcEEEEeCCCcccCCCC-CceEEEEEEecce-eeeeccCCCcceeeeeehhhcCC------ceeEEEE Q lcl|Aclame:pro 229 DSKGQLTIPTYMGKVVIVDDGMPVFGTGA-DKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNG------SGLEYIL 300 (367) Q Consensus 229 ~~~g~~~i~t~~G~~VivdD~~pv~~t~~-~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~------~g~~~l~ 300 (367) ..-....-.+++|++|++++.||...... ..+. -.+||.=. +.|+... ...+++.++...... .++-.+. 
T Consensus 206 ~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~-~~~~GDfs~~~~g~~~-~~~i~i~~~~~~~~~~~~~~~~~~v~~r 283 (315) T protein:vir:80 206 PAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGV-KAIVGDFSRVHWGFQR-NFPIELIEYGDPDQTGRDLKGHNEVMVR 283 (315) T ss_pred cccccCCCceecceeeEecCcCCccccccccccc-EEEEeecccEEEEEec-CeeEEEeccccccCcccchhhcCcEEEE Confidence 11111123689999999999999654222 2222 23333321 2222222 223444444322111 1111222 Q ss_pred EccEE---EeeeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 301 ERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 301 ~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) ...++ +.||..|.--.... +| ..+|-.+. T Consensus 284 ~~~r~~~~v~~~~a~~~l~~~~-a~------------~~~~~~~~ 315 (315) T protein:vir:80 284 AEAVLYVAIESLDSFAVVKEKA-AP------------KPNPPAEN 315 (315) T ss_pred EEEEecceeecccceEEEeecc-CC------------CCCCCCCC Confidence 11222 34666665432211 11 11111111 No 39 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.73 E-value=1.1e-09 Score=69.64 Aligned_cols=279 Identities=11% Similarity=0.047 Sum_probs=142.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc- Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE- 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~- 79 (367) ||.... +.-. .++|+.+.+-+.+...+.+-+.+ +......++..+++|.+..- ..+.-+.|+..... 
T Consensus 1 ma~~t~-~~gg-~liP~~~~~~Ii~~~~~~s~l~~---------l~~~~~~~~~~~~~p~~~~~-~~a~wv~E~~~~~~~ 68 (305) T protein:vir:25 1 MADISR-AEVA-SLIQEAYSDTLLAAAKQGSTVLS---------AFQNVNMGTKTTHLPVLATL-PEADWVGESATDPKG 68 (305) T ss_pred CCCccC-Cccc-eecCHHHHHHHHHHHHhhchhhh---------hcceeeccCCcEEEEEEeCC-cceEEeecccccccc Confidence 998442 2233 45577776666666655554422 12223345778999998843 44555555543221 Q ss_pred -ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHH------Hhhhhhhhhh Q lcl|Aclame:pro 80 -APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGV------YKSNLAGNFA 152 (367) Q Consensus 80 -~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gv------f~~~~a~~~~ 152 (367) ++..+.+-++.....++.+..+.++++...-+..|....+.+++++.+.+..++.+|. |. +........ T Consensus 69 ~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~---G~g~~~~~~~~~~~~~~- 144 (305) T protein:vir:25 69 VKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF---GTDKPASWVSPALIPAA- 144 (305) T ss_pred cccccccceeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhhee---ccCCCCCcccccccccc- Confidence 2333444444455667778888999988888888889999999999888888877773 21 110000000 Q ss_pred hhhhhhhhhhhhhcchhhcceeecCcccchhhcccH----HHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcc Q lcl|Aclame:pro 153 TIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNR----EAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIP 228 (367) Q Consensus 153 ~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~----~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~ 228 (367) ..... ...+. ...... ..+.++.....+..-....++||+..+..|++. 
+ T Consensus 145 --------------~~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l------k 198 (305) T protein:vir:25 145 --------------VTAGQ-AVEVV-----GGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------R 198 (305) T ss_pred --------------ccccc-ccccc-----ccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHh------h Confidence 00000 00011 111222 333334433333344455799999999998764 3 Q ss_pred cccccccc--hhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE Q lcl|Aclame:pro 229 DSKGQLTI--PTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW 305 (367) Q Consensus 229 ~~~g~~~i--~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~ 305 (367) +.+|...+ .+++|++|+++|.+|... ++. ..+||. -.+.++..+ ...+++.++..-..+...-.++.|... T Consensus 199 d~~G~~i~~~~~l~G~Pv~~~~~~~~~~----~~~-~~~~gd~s~~~i~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~ 272 (305) T protein:vir:25 199 DANGNPVFRDDSFAGFRTFFNRNGAWDA----DAA-IEVIADSSRVKIGVRQ-DITVKFLDQATLGTGENQINLAERDMV 272 (305) T ss_pred ccCCceeecCCcccccceEEcCccCCCC----Ccc-EEEEEecceEEEEEec-CeEEEEeeeeeeecCCceeeeeecCcE Confidence 44554433 378999999999998643 222 223332 112222222 223444444322222222223333222 Q ss_pred -----------EeeeeeeeecccccccccccccccccccccCCCCh Q lcl|Aclame:pro 306 -----------IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITL 340 (367) Q Consensus 306 -----------~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~ 340 (367) ++||..+-..... 
+.+...|+- T Consensus 273 ~~R~~~r~~~~v~~p~a~v~~~~~-------------~~~~~~pa~ 305 (305) T protein:vir:25 273 ALRLKARFAYVLGVSATAQGANKT-------------PVAVVAPAA 305 (305) T ss_pred EEEEEEeecceeeCcccEEEEccc-------------cccccCCCC Confidence 2344444432211 111112222 No 40 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.72 E-value=2.6e-09 Score=67.61 Aligned_cols=304 Identities=13% Similarity=0.074 Sum_probs=159.3 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+.. ..+.-..++.||+..+++. ...+.+.+.+ +......++..+++|.+..- ..+.-+.|+.. + T Consensus 10 ~~~~-~t~~~~g~l~~~~~~~ii~-~l~~~s~i~~---------l~~~~~~~~~~~~ip~~~~~-~~a~wv~Eg~~---~ 74 (397) T protein:vir:23 10 IAQT-KDTMFTGYLDPVQAKDYFA-EAEKTSIVQR---------VAQKIPMGATGIVIPHWTGD-VSAQWIGEGDM---K 74 (397) T ss_pred Hhhc-cCCCCccccchhHHHHHHH-HHHhccchhh---------hcceeeccCCceEEEEEcCC-cceEEecCCcc---c Confidence 4431 1233456788998776554 3334333322 12223345677899998753 45566666643 5 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+.+-++.....++.+....++++...-+..|....+.+++++.+.+..++.+|. |. ...... 
T Consensus 75 ~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~---G~---gt~~~~--------- 139 (397) T protein:vir:23 75 PITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALH---GT---NAPSAF--------- 139 (397) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh---cc---cCCccc--------- Confidence 556677777777888889999999998888888899999999999998888886662 21 110000 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hccccc--cc--- Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDSK--GQ--- 233 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~~--g~--- 233 (367) ....+....+........++.+.++...+-........++||++.+..|++..--+ ++-..+ +. T Consensus 140 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~ 210 (397) T protein:vir:23 140 ---------QGYLDQSNKTQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTT 210 (397) T ss_pred ---------ccccccccceeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccc Confidence 00001111111112235567778887777666667789999999999999863211 111111 11 Q ss_pred -ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcCC------------ceeEEE Q lcl|Aclame:pro 234 -LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGNG------------SGLEYI 299 (367) Q Consensus 234 -~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~~------------~g~~~l 299 (367) ..-++++|++|++++.||-. +. ..+|+. 
.-+.++..+ ...+++.|+.....+ .++..+ T Consensus 211 ~~~~~tl~G~Pv~~s~~~~~g------~~-~~~~gDfs~~~i~~~~-~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ 282 (397) T protein:vir:23 211 PFREGRILGRPTILSDHVAEG------DV-VGYAGDFSQIIWGQVG-GLSFDVTDQATLNLGSQESPNFVSLWQHNLVAV 282 (397) T ss_pred cccCceeeeeeEEEeCCCCCC------ce-EEEEeecceEEEEEEe-ceEEEEeeeeeeeeccccccceeeeeeccceeE Confidence 12357899999999999842 21 112221 111122222 233555555332110 011222 Q ss_pred EEccE---EEeeeeeeeecccccccccccccccccccccCCCChHHhc---CCccceeeecccccceEEEE-ecC Q lcl|Aclame:pro 300 LERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLA---NPDNWERVTYRKNVPMAFLV-TKG 367 (367) Q Consensus 300 ~~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~---~~~NW~~v~d~K~i~iv~~~-t~g 367 (367) ....+ -++||..|........ ..+...+. ++.+.++.++.+...-+.+- |.. T Consensus 283 ra~~r~d~~v~~~~a~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 341 (397) T protein:vir:23 283 RVEAEYGLLINDVNAFVKLTFDPV----------------LTTYALDLDGASAGNFTLSLDGKTSANIAYNASTA 341 (397) T ss_pred EEEeeeccceecccceEEEeeccc----------------cceeeecccccCcceEEEEecCccccCcccccchh Confidence 22122 2456666665433210 11111111 24445554443322221111 111 No 41 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.71 E-value=2.4e-09 Score=67.81 Aligned_cols=279 Identities=10% Similarity=0.057 Sum_probs=152.8 Q ss_pred CCCc--cccc--cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDF--NNQV--RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~--~~~T--~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) +..+ ...+ .-+...+|+.+..-+.+...+.+-|.+ +......++.++++|.+... ..+..+.|+.. 
T Consensus 21 ~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~ 90 (324) T protein:vir:99 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMR---------LGKYEPMEGTEKKFTFWADK-PGAYWVGEGQK 90 (324) T ss_pred hhhccccceeccCCCcceechhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEecC-cceeEeccCcc Confidence 1111 1111 112335677665555555555544422 12222345677999998753 56677777653 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) ++..+++-.+.....++.+....++++...-+..|..+.+.+++++.+.+..++.+|. | .. .+.. ... T Consensus 91 ---~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~---G---~g-~~~~-~~~- 158 (324) T protein:vir:99 91 ---IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N---QG-NNPF-GKS- 158 (324) T ss_pred ---ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh---c---CC-CCcc-Ccc- Confidence 5566677777777788889889999987777777888999999999888888776653 2 10 1000 000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc-- Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL-- 234 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~-- 234 (367) ....+..........++++.+.++...+.+....-..++||+..+..|++.. +.+|.. 
T Consensus 159 --------------~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~------d~~g~~~~ 218 (324) T protein:vir:99 159 --------------IAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV------DPETKERI 218 (324) T ss_pred --------------ccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh------cCCCceee Confidence 0000011111122357899999999998877767778999999999998753 333321 Q ss_pred ---cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc-eeeeeccCCCcceeeeeehhhcC------------CceeEE Q lcl|Aclame:pro 235 ---TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELRGN------------GSGLEY 298 (367) Q Consensus 235 ---~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G-Ai~~~~~~~~~~~e~~rd~~~~~------------~~g~~~ 298 (367) .-.+++|++|++++.++... + .++|+.= -+.++... ...+++.++..... ..++.. T Consensus 219 ~~~~~~~l~G~PVv~~~~~~~~~----~---~~i~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 290 (324) T protein:vir:99 219 YDRNSDTLDGLPVVNLKSSNLKR----G---ELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred cCCCCccccceeEEeecCCCCCc----c---eEEEEecccEEEEEec-CcEEEEeecccccccccccccchhhhhcCcEE Confidence 23578999999998887532 1 1223221 12233322 12344444432110 012222 Q ss_pred EEEccEE---EeeeeeeeecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 299 ILERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 299 l~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) +....++ ++||..|.--.. ...+..+|.+|. 
T Consensus 291 ~r~~~r~d~~v~~~~a~~~lt~--------------a~~~~~~~~~~~ 324 (324) T protein:vir:99 291 LRATMHVALHIADDKAFAKLVP--------------ADKKTDSVPGEV 324 (324) T ss_pred EEEEEEEccEEecccceEEEEe--------------ccCCCCCCCCCC Confidence 2222222 234444432211 112233444444 No 42 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.70 E-value=2.7e-09 Score=67.58 Aligned_cols=280 Identities=10% Similarity=0.045 Sum_probs=139.3 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ++.... +.-....+|+.+.+-+.....+.+.+.+ +.....-+|....+|..... +.+.-+.++...... T Consensus 161 ~~~~~~-~~~g~~~ip~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~-~~a~~v~e~~~~~~~ 229 (458) T protein:vir:10 161 VNQSSS-VEVSSESYETIFSQRIIRDLQKELVVGA---------LFEELPMSSKILTMLVEPDA-GKATWVAASTYGTDT 229 (458) T ss_pred hhhccc-CccccceehhhHhHHHHHHHHhhhhHHh---------hcceeecCCcceEEEEecCC-cceeecccccccccc Confidence 111111 1123345666666666655555444321 11111224555666654433 222222332211111 Q ss_pred c---ccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-----HHHHHhhhhhhhhh Q lcl|Aclame:pro 81 P---IDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-----AVGVYKSNLAGNFA 152 (367) Q Consensus 81 t---~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-----l~Gvf~~~~a~~~~ 152 (367) + ..+.+-++.....++.+.-..+++....-+..+....+.+++++.+.+..+..+|.= -.|+++..... 
T Consensus 230 ~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~--- 306 (458) T protein:vir:10 230 TTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASED--- 306 (458) T ss_pred cccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeeccccc--- Confidence 1 111122222233344555567777755555566788999999988888777655420 01111111000 Q ss_pred hhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcc-- Q lcl|Aclame:pro 153 TIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIP-- 228 (367) Q Consensus 153 ~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~-- 228 (367) ....+. ..++.....++++.|.++...+......-..++||+..+..|++..-.+ ++- T Consensus 307 ---------------~~~~~~---~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~ 368 (458) T protein:vir:10 307 ---------------SAKVVT---EAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQV 368 (458) T ss_pred ---------------ccceee---cccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeecc Confidence 001111 1122233457899999999888766666678999999999988754222 111 Q ss_pred ccc---ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE Q lcl|Aclame:pro 229 DSK---GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW 305 (367) Q Consensus 229 ~~~---g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~ 305 (367) ... ......+++|++|+++|.||..+. .+......|+.+.+. .+. ..+++.||+... 
.++..++...++ T Consensus 369 ~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~--~~~~~~~~f~~~~~~-~~~---~~~~v~~d~~~~--~~~~~~~~~~r~ 440 (458) T protein:vir:10 369 GNDSVKLQGQVGRIYGLPVVVSEYFPAKAN--SAEFAVIVYKDNFVM-PRQ---RAVTVERERQAG--KQRDAYYVTQRV 440 (458) T ss_pred ccccccccCcCceecceeeEEccccccccC--CcceEEEEecccEEE-EEe---eceEEEeecccC--CCceEEEEEEEe Confidence 011 112235799999999999996532 222223444554433 222 235566776644 344556655555 Q ss_pred E---eeeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 306 I---VHPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 306 ~---~hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) . ++|.||-- .+. +++ T Consensus 441 ~~~v~~~~a~v~--~~~-------------aa~ 458 (458) T protein:vir:10 441 NLQRYFANGVVS--GTY-------------AAS 458 (458) T ss_pred cceEecccceEE--Eee-------------ccC Confidence 3 57766632 221 111 No 43 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.70 E-value=1.6e-09 Score=68.78 Aligned_cols=277 Identities=11% Similarity=0.017 Sum_probs=137.5 Q ss_pred CCCccc-cccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNN-QVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~-~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +..... .+.-...++|+.+..-+.....+.+.|.+- -.....++..+++|.+...+..+.-+.|+.. 
T Consensus 132 ~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~--- 199 (418) T protein:vir:10 132 VPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDL---------LMPGQTSSSSIEYTVETGFTNNAAAVAEGAQ--- 199 (418) T ss_pred hhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhh---------cceeeccCCceeEEEEecCCCceeeeccCcc--- Confidence 111111 122234466776666566666665555331 1112235677889988766555555666643 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) ++..+.+-.+.....++.+....+++.....+ .+....+.+++++...+..++.+|. | +.....- ..+ T Consensus 200 ~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~---G---~g~~~~p--~Gi--- 267 (418) T protein:vir:10 200 KPTSDLKFNLKNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILK---G---DGTGANI--LGI--- 267 (418) T ss_pred ccccccceeeEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhc---c---CCCCccc--ccc--- Confidence 33344444444455556666667777765544 3666778888876666665554442 1 1100000 000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhcccccccccch Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSKGQLTIP 237 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~g~~~i~ 237 (367) .. .....+. 
+ .......+++.+.++...+-.....-.+++||+.++..|++..-- .++-.......-+ T Consensus 268 ~~-----~~~~~~~--~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~ 337 (418) T protein:vir:10 268 LP-----QASAFMP--S---ITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTP 337 (418) T ss_pred cc-----ccccccc--c---ccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCc Confidence 00 0000000 1 111223567889999888877777778899999999998865311 1111111112346 Q ss_pred hhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcC--CceeEEEEEcc---EEEeeee Q lcl|Aclame:pro 238 TYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGN--GSGLEYILERK---EWIVHPG 310 (367) Q Consensus 238 t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~--~~g~~~l~~r~---~~~~hp~ 310 (367) +++|++|++++.||-. +++||.- ++.+... ..+++++++.... ..++..+.... -.+.||. T Consensus 338 ~l~G~pV~~~~~~p~~---------~~~~gd~s~~~~~~~~---~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~ 405 (418) T protein:vir:10 338 RLWNLPVVETQAMTAN---------EFLVGAFSMAAQIFDR---MEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPE 405 (418) T ss_pred eecceeeEEcCCCCCC---------cEEEeeccceEEEEEe---cceEEEEecccchhhhcCceEEEEEEeeccEEeccc Confidence 8999999999999842 1334432 1222211 1233333322211 12222222222 2345777 Q ss_pred eeeeccccccccccccccccccccc Q lcl|Aclame:pro 311 GFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 311 G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) +|.+-.-. 
+..+| T Consensus 406 a~~~~~~~------------~~~~g 418 (418) T protein:vir:10 406 SFVTGALV------------EQAGG 418 (418) T ss_pred ceEEEEec------------cCCCC Confidence 77654322 11122 No 44 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.68 E-value=1.3e-09 Score=69.26 Aligned_cols=287 Identities=9% Similarity=-0.032 Sum_probs=149.1 Q ss_pred CCC-ccccccce--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MPD-FNNQVRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~-~~~~T~l~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) |.. +...|..+ ...+|+.+..=+.+++.+.+.+.+- .+....++...++|.+... .+.-+.|+.. T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~~~~~~~--~a~~v~E~~~- 68 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKL---------AKAVPMTKPEEEFTFMSGV--GAFWVDEAER- 68 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhh---------ceeeecCCCcEEEEEEcCC--ceeeeecCcc- Confidence 443 11122222 2466777766566666665544221 1223346778899998743 3455666543 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) ++..+.+-++.....++.+.-..++++...-+..|....+.+++++.+.+..++.+| .|. .......... 
T Consensus 69 --~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l---~G~---g~~~~~gil~-- 138 (299) T protein:vir:41 69 --IQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVF---TGV---ESPYNWNILK-- 138 (299) T ss_pred --ccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHh---hcc---cCcccccccc-- Confidence 445566666667778888888999999988888888999999999999998887665 232 1100000000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hccccccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDSKGQLT 235 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~~g~~~ 235 (367) . ... +.+.......+++.|+++..++-+....-.+++||+..+..|++..--+ ++-....... T Consensus 139 -~--------~~~------~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~ 203 (299) T protein:vir:41 139 -S--------ATD------ASNLVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNG 203 (299) T ss_pred -c--------ccc------cceeeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCC Confidence 0 000 0000112346789999999888766666778999999999999753111 1100111112 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeE-E---EEEccEEEeeee Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLE-Y---ILERKEWIVHPG 310 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~-~---l~~r~~~~~hp~ 310 (367) .++++|++|+++|.||... ... ..+||.=+ +.++... ...+++.|+.....+...+ . ++.+... 
..+ T Consensus 204 ~~~l~G~PV~~~~~~~~~~---~~~--~~~~gdfs~~~i~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~r 275 (299) T protein:vir:41 204 VDDVLGLPIAYTPKYTFGD---KDI--SELVGDWNQAYYGILR-GVEYEILTEATLTTVADETGKPLNLAERDMA--AIK 275 (299) T ss_pred CceecceeeEEecccCCCC---Cce--EEEEEecccEEEEEec-CcEEEEeecccccccccccccchhhhhcCcE--EEE Confidence 3689999999999999532 111 23333321 1122222 2345556655432211111 0 1111111 111 Q ss_pred eeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 311 GFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 311 G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) .+.+-+-. +.+++++.. ++.+. T Consensus 276 ~~~~~d~~---------------------------------v~~~~A~~~--l~~~a 297 (299) T protein:vir:41 276 ATFEVGFM---------------------------------VVKDEAFSA--VQPKA 297 (299) T ss_pred EEEEeccE---------------------------------EecccceEE--EEecc Confidence 11111100 111111111 11111 No 45 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.66 E-value=7.7e-09 Score=65.06 Aligned_cols=279 Identities=8% Similarity=-0.038 Sum_probs=145.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC--CcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD--SLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~--g~~~~~~~~~~~~ 78 (367) |... .+.-...++|+.+..-+.....+.+.|.+-. +....++...++|++...+ +.+..+.|+.... 
T Consensus 109 ~~~~--t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 177 (397) T protein:vir:48 109 KTDA--SGSDAGLTIPQDIQTAIHTLVRQYDSLQEYV---------NVENVTTLTGSRVYEKWADITGLAKLDDEAGSIG 177 (397) T ss_pred hhcc--CCccccccccHHHHHHHHHHHHHHHHHHhhh---------ceeeccCCcceEEEEeecCCCcceeeeccccccc Confidence 3321 1112345778877777777666666553311 1112345556666655333 3455566654321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) . .++.+-++.....++.+.-..+++....-+..|....+.++|++...+..++.++. |. +. T Consensus 178 ~--~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~---G~--------g~------ 238 (397) T protein:vir:48 178 T--NDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE---AI--------AT------ 238 (397) T ss_pred c--ccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh---cc--------cc------ Confidence 1 22334444445566667777888887766667788899999998887777666543 21 00 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccc-cccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK-GQLT 235 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~-g~~~ 235 (367) ........+++.+.++...+......-..++||+..+..|++..-- .++-..+ .... 
T Consensus 239 --------------------~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~ 298 (397) T protein:vir:48 239 --------------------LPTKPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPT 298 (397) T ss_pred --------------------cccccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCC Confidence 0011234678889998877766666678999999999999886311 1111111 1122 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEE--c-cEEEeeee Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILE--R-KEWIVHPG 310 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~--r-~~~~~hp~ 310 (367) -.+++|++|++.|+.++... ..++. +++||. .++.+.... ...+++.+.....-..++..+.. | .-.++||. T Consensus 299 ~~~l~G~PV~~~~~~~~~~~-~~~~~-~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~ 375 (397) T protein:vir:48 299 GYSIDGFAVKEVADRWLANA-SSGAM-PLYFGDLKQAVTLFDRQ-QMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTE 375 (397) T ss_pred CceeccceeEEecccccCCc-CCCce-EEEEEeccceEEEEeec-ceEEEEeccchhhhhcCceeEEEEeeeccEEeccc Confidence 35799999998776544321 22222 455653 233333222 23455555432211122333222 2 22356887 Q ss_pred eeeecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 311 GFNWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 311 G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) +|.+-.-. 
.+.+..|+..-++- T Consensus 376 a~~~~~~~-------------~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 376 SFVPASFK-------------AIADQKGNLGSTAV 397 (397) T ss_pred ceEEEEec-------------ccccCCCCccccCC Confidence 77654311 11122222222222 No 46 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.65 E-value=3.3e-09 Score=67.08 Aligned_cols=277 Identities=12% Similarity=0.065 Sum_probs=138.7 Q ss_pred CCCccccccc--eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRL--VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l--~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) -+.....|.- ..++.|++..+.+.....+.+-+ . ..+.+ .-..++..+.+|....- ..+.-+.|+.. T Consensus 107 ~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l-~-----~~~~~--~~~~~~~~~~~~~~~~~-~~a~~v~E~~~-- 175 (392) T protein:vir:13 107 APEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIM-R-----GGAST--FTTSDANPMDFTVITGR-ATAGIVGETAE-- 175 (392) T ss_pred hhhhhcccccCCCccccccchHHHHHHHHhhhhhh-h-----hccee--eecCCCceeEEEEEcCC-cceeeeccccc-- Confidence 0011111211 23677888877766543333222 1 11110 01246778899988754 34444666643 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) ++..+.+-++.....++.+.-..+++....-+.-|-.+.+.++|++.+.+..+..+|. | +......+.... 
T Consensus 176 -~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~---G---~Gt~~p~Gil~~-- 246 (392) T protein:vir:13 176 -IPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLT---G---TGTGQPRGILTD-- 246 (392) T ss_pred -ccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---c---cCCccccccccc-- Confidence 4445555555556666777777788887666666777889999998888877776552 2 100000000000 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhc-cccccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFI-PDSKGQLT 235 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~-~~~~g~~~ 235 (367) .. .......+. ....++++.+.++...+......-..++||+..+..|++..-- .++ ++...... T Consensus 247 ---------~~--~~~~~~~~~-~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~ 314 (392) T protein:vir:13 247 ---------AT--GANAAFGEA-DADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGA 314 (392) T ss_pred ---------cc--ccccccccc-ccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC Confidence 00 000000011 1234778899998877755544556799999999998864211 111 11101112 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEecc-eeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEeeeee Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHPGG 311 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G-Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~hp~G 311 (367) -.+++|++|+++|.||.. +.+||.= .+.++..+ .+++++.....=..++..+....+ -++||.. 
T Consensus 315 ~~~l~G~Pv~~~~~~~~~---------~i~~Gdf~~~~i~~~~---~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A 382 (392) T protein:vir:13 315 PDTFNGKVVETDDGMPAD---------KVLFADLSKYRVRFAG---SLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARG 382 (392) T ss_pred CceecceeeEEcCCCCCC---------cEEEeeccceeEEeec---ceEEEeeccccccCCcEEEEEEEEeccEEecccc Confidence 257899999999999842 2334331 12222222 234443322221223333333222 2345666 Q ss_pred eeecccccccccccccccccccc Q lcl|Aclame:pro 312 FNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 312 ~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) |.-..-+. ++ T Consensus 383 ~~~~~~~~-------------aa 392 (392) T protein:vir:13 383 AKVLTVTP-------------AA 392 (392) T ss_pred eEEEEeec-------------cC Confidence 55433221 11 No 47 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.65 E-value=2.8e-09 Score=67.46 Aligned_cols=290 Identities=10% Similarity=-0.000 Sum_probs=138.6 Q ss_pred CCCccccccc--------------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC Q lcl|Aclame:pro 1 MPDFNNQVRL--------------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS 66 (367) Q Consensus 1 Ma~~~~~T~l--------------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g 66 (367) ||-.++.... ++ ..|+.+..=+.+.+.+.+.+.+- .....-++..+.+|.+..- . 
T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~-liP~~~~~~ii~~l~~~s~l~~~---------~~~~~~~~~~~~~p~~~~~-~ 69 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSD-LLPKEIVGPIFDKAQESSLVLRM---------GEQIPISYGETIIPTTVKR-P 69 (333) T ss_pred CchhHHhhhhcccccccCceecCCcc-ccchhHHHHHHHHHHhhchhhhh---------cceeeccCCceEEEEEeCC-c Confidence 5544433211 12 34555544455555554444221 1112235677889988754 2 Q ss_pred cccccCCCCc-----cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHH Q lcl|Aclame:pro 67 LEPNYGSDNP-----NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVG 141 (367) Q Consensus 67 ~~~~~~~~~~-----~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~G 141 (367) .+.-+.|+.. .+.++..+.+-++.....++.+.-..++++...-+..|..+.+.+++++.+.+..++.+|. | T Consensus 70 ~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~---G 146 (333) T protein:vir:78 70 EVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFH---G 146 (333) T ss_pred eeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc---c Confidence 2222222211 1223334444444444556667777888887777777888999999998888877776662 1 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccc-cCceeEEEEccHHHHHHHh Q lcl|Aclame:pro 142 VYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH-VGSIAAIAVHSMVYKRMTN 220 (367) Q Consensus 142 vf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~-~~~l~~~vmhS~v~~~L~k 220 (367) +..........+. ............+ ......+.++.+.++..++... 
...-.+++||+..+..|++ T Consensus 147 ---~g~~~~~~~~g~~------~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~ 214 (333) T protein:vir:78 147 ---KSPLTGSALQGID------TDNVIANTTNVDY---LQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLR 214 (333) T ss_pred ---cCCCCCccccccc------ccccccccccccc---cccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHH Confidence 1000000000000 0000000000001 1122346688899988776443 4445689999999999987 Q ss_pred cchhhhccccccc---------ccchhhcCcEEEEeCCCcccCC-CCCceEEEEEEecce-eeeeccCCCcceeeeeehh Q lcl|Aclame:pro 221 NDEIEFIPDSKGQ---------LTIPTYMGKVVIVDDGMPVFGT-GADKTYLSILFGGAA-FGYADGAPQVPVAVGRREL 289 (367) Q Consensus 221 ~~li~~~~~~~g~---------~~i~t~~G~~VivdD~~pv~~t-~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~ 289 (367) ..+. ++.+|. ..-++++|++|++++.||.... ...+++. .+||.-. +.++..+ ...++..++.. T Consensus 215 ~~~~---~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~-~~~gD~~~~~~g~~~-~~~i~~~~~~~ 289 (333) T protein:vir:78 215 AQAY---RDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTR-IIGGDFSQLKFGFAD-EIRIKMSDTAT 289 (333) T ss_pred Hhhh---cCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccE-EEEEecccEEEEEee-ccEEEEecccc Confidence 6432 222221 2236899999999999996532 2333332 3333221 2233222 12233333322 Q ss_pred hcCCce---------eEEEEEcc---EEEeeeeeeeecccccccccccccccccccccCCC Q lcl|Aclame:pro 290 RGNGSG---------LEYILERK---EWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAI 338 (367) Q Consensus 290 ~~~~~g---------~~~l~~r~---~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sP 338 (367) ..+.++ +..+.... ..++||..|..-. 
....| T Consensus 290 ~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~-----------------~~~a~ 333 (333) T protein:vir:78 290 LTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFV-----------------DDEQP 333 (333) T ss_pred ccccccceeehhhcCcEEEEEEEEEccEEecccceEEEe-----------------ccCCC Confidence 111111 11111111 1234555555431 12234 No 48 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.65 E-value=3.4e-09 Score=67.04 Aligned_cols=271 Identities=11% Similarity=0.018 Sum_probs=138.8 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |.. ...+.-..++.|++... +.....+.+.+.+ . -+.+..++..+++|.+..-++.+..+.|+.. + T Consensus 113 ~~~-~~~~~~g~lip~~~~~~-ii~~~~~~~~i~~------~---~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~---~ 178 (390) T protein:vir:97 113 AST-DAAGSAGALTTPNRLPG-FITPPDARLTVRD------L---IGSGRTDSALIEYVQETGFVNNAAIVAEGAL---K 178 (390) T ss_pred hhc-ccccccccccchhhhHH-HHHHHhhhhhhHh------h---cceeeccCCceEEEEEecCCcceeeecCCcc---c Confidence 111 00011122455555544 4444444444422 1 1122235677899998766566777777653 5 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+.+-++.....++.+.-..++++...-+ .+....+.+++++...+..+..+|. | + .++.. ...+ . 
T Consensus 179 ~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~---G---~-g~~~~-p~Gi---~ 246 (390) T protein:vir:97 179 PESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILR---G---T-GANDG-LLGL---I 246 (390) T ss_pred cccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhh---c---C-CCCcc-ccce---e Confidence 5556666666677777787788888765444 4677788888988777777665542 2 1 11100 0000 0 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccccccccchh Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDSKGQLTIPT 238 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~~g~~~i~t 238 (367) .. .. .............++.+.++...+.+..-...+++||+..+..|++..--+ ++-.......-++ T Consensus 247 ~~-------~~---~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~ 316 (390) T protein:vir:97 247 PQ-------AT---TYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPT 316 (390) T ss_pred ec-------cc---cccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCce Confidence 00 00 000011112345678899998888777777789999999999998754111 1111111223468 Q ss_pred hcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 239 YMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 239 ~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) ++|++|+++|.||-. +++||. .++.+.... ...+++.++.... ..++..+....++ ++||..|. 
T Consensus 317 l~G~pV~~~~~~~~~---------~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~f-~~~~~~~r~~~r~d~~v~~~~a~v 385 (390) T protein:vir:97 317 LWGLPVVATQAMAPG---------EFLVGAFDLAAQIFDQW-DARVEIGYVNDDF-QRNMVTVLAEERLALVVYRPEALI 385 (390) T ss_pred ecceeeEEcCCCCCC---------cEEEEeccceEEEEEec-ceEEEEeeccccc-ccCcEEEEEEEeeccEEeccccEE Confidence 899999999999842 133332 222222211 1223443332111 1233333322222 33555554 Q ss_pred ecccccccccccccccccccccCC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPA 337 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~s 337 (367) .-. .+ T Consensus 386 ~~~-------------------~a 390 (390) T protein:vir:97 386 TGS-------------------FA 390 (390) T ss_pred EEE-------------------eC Confidence 321 11 No 49 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.63 E-value=1.7e-08 Score=63.21 Aligned_cols=289 Identities=10% Similarity=-0.021 Sum_probs=140.6 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) +......|.-.-..+|+.+.+-+.....+.+.|.+ ...... .+++...+.+|.+... .....+.|+..... 
T Consensus 119 ~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~------~~~~~~-~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~- 189 (415) T protein:vir:94 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK------YVTVKR-VTNGSGKYPVVRQSEV-AALEKVEELEENPE- 189 (415) T ss_pred hhhhccccccccccCcHHHHHHHHHHHHhhhhhhh------hcceee-ccCCceeEEEEeecCC-ccceeccccccccc- Confidence 11101111223456788776666666555555522 110000 1122334455555433 34445555543221 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) .+..+-.+.....++.+.-+.+++....-+..|....|.+++++.+.+..++.+|.-. -........ T Consensus 190 -~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~---g~g~~~~~~--------- 256 (415) T protein:vir:94 190 -LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI---TKGSTGSTS--------- 256 (415) T ss_pred -cccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCcccccc--------- Confidence 1223334444556667777788888766666677889999999888877776665421 100000000 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcc-cccccccch Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIP-DSKGQLTIP 237 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~-~~~g~~~i~ 237 (367) ..... 
...+.......+++.|.++...+.+..-.-.+++||+..+..|++..--+ ++- +.-....-+ T Consensus 257 ---------~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 326 (415) T protein:vir:94 257 ---------SGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred ---------ccccc-cccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCc Confidence 00000 00111123346789999999888776666778999999999998753111 111 110112346 Q ss_pred hhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEE--c-cEEEeeeeee Q lcl|Aclame:pro 238 TYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILE--R-KEWIVHPGGF 312 (367) Q Consensus 238 t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~--r-~~~~~hp~G~ 312 (367) +++|++|++++.+|.... +.+ .++||. -++.+.... . +++++.....+. +.+.. | .--++||..| T Consensus 327 ~l~G~pV~~~~~~~~~~~---~~~-~i~~gd~~~~~~~~~~~-~--~~v~~~~~~~~~---~~~r~~~r~d~~~~~~~a~ 396 (415) T protein:vir:94 327 RLLGAKIEILPDEVLGQK---GNN-TLIIGNLKDAIVLFDRS-Q--YQASWTDYMHFG---ECLMIAVRQDCRILDYKSA 396 (415) T ss_pred eecceeeEEecccccCCC---Ccc-EEEEEehhccEEEEeec-c--eEEEEeccccCc---eEEEEEEEeccEEeccccE Confidence 899999999999997543 222 345552 223222211 1 223333222211 11211 1 1224466666 Q ss_pred eecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 313 NWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 313 s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) ..-.-+- ....|-+--|+. 
T Consensus 397 ~~~~~~~--------------~~~~~~~~~~~~ 415 (415) T protein:vir:94 397 IVIEYDD--------------SERGEGDLGLEA 415 (415) T ss_pred EEEEEec--------------cCCCCCccccCC Confidence 6543221 111122222222 No 50 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.62 E-value=1.1e-08 Score=64.32 Aligned_cols=277 Identities=9% Similarity=-0.003 Sum_probs=141.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC--CcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD--SLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~--g~~~~~~~~~~~~ 78 (367) |.... +.-....+|+.+.+.+.....+.+.|.+- -+....++....+|+|...+ +.+..+.|+.. T Consensus 116 ~~~~t--~~~gg~~iP~~~~~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-- 182 (404) T protein:vir:39 116 ETSGS--DSAAGLTIPQDIRTMINTLVRQYDSLQQY---------VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGK-- 182 (404) T ss_pred hhccc--ccCCceeccHHHHHHHHHHHHhhhhHHhh---------cceeeccCCcceEEEEeecCCccceeeecCccc-- Confidence 21100 11123567888877777666665554221 11112345556677775433 33445666543 Q ss_pred ccc-ccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAP-IDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 79 ~~t-~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) ++ .++.+-.+.....++.+.-..++++...-+..|....+.+++++...+..++.+|.-. +.. 
T Consensus 183 -~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~-----------g~~---- 246 (404) T protein:vir:39 183 -IPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM-----------GTV---- 246 (404) T ss_pred -cccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------ccc---- Confidence 22 2445555556667777877888888777777778889999999888887777555311 000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchh--hhccc-cccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPD-SKGQ 233 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~-~~g~ 233 (367) .+ .....+++.+.+++.. +......-.+++||+..+..|++..-- .++-. .-.. T Consensus 247 ------------------~~----~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~ 304 (404) T protein:vir:39 247 ------------------PK----KPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTK 304 (404) T ss_pred ------------------cc----ccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCC Confidence 00 1123457777777653 333334456899999999999975211 11110 0011 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEee Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVH 308 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~h 308 (367) ..-.+++|++|++.|.+++... ..+.+ +++||.= ++.+..-. 
...++.++........++..+....+ .++| T Consensus 305 ~~~~~l~G~pV~~~~~~~~~~~-~~~~~-~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~ 381 (404) T protein:vir:39 305 PNSYLIKGKKVIVVADRWLPNS-GSTVY-PLYYGDMSQAITLFDRE-NMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTD 381 (404) T ss_pred CCcceecceeEEEecccccCcc-CCCcc-EEEEEeccccEEEEeec-ceEEEEeccchhhhhhceeeEEEEeeeccEEec Confidence 2235889999999887554332 22333 2445431 23222222 12234444322221123334433222 2446 Q ss_pred eeeeeecccccccccccccccccccccC Q lcl|Aclame:pro 309 PGGFNWLDADVTIPDNTGSPSGITSGPP 336 (367) Q Consensus 309 p~G~s~~~~~~~~~~~~~~~~~~~~~~~ 336 (367) |..|..-.-.-++ . ..++.++|+ T Consensus 382 ~~a~~~~~~~~~a----~-~~~~~~~~~ 404 (404) T protein:vir:39 382 SEALVAGSFTAIA----D-QVGNFTAGK 404 (404) T ss_pred ccceEEEEeeccc----c-CCCCCCCCC Confidence 6665543322111 1 112222333 No 51 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.62 E-value=5e-09 Score=66.10 Aligned_cols=265 Identities=11% Similarity=0.027 Sum_probs=135.6 Q ss_pred CCCcccc-cc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQ-VR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~-T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) +...... |. =.-++.|+++...+. ...+.+.|.+ +-.....++..+++|.|..-.+.+..+.|+.. 
T Consensus 110 ~~~~~~~~~~~~g~~~~~~~~~~ii~-~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-- 177 (390) T protein:vir:10 110 LNTASTDAAGSAGALTTPNRLPGFIT-QPDARLTVRD---------LIGSGRTDSALIEYVQETGFVNNAAIVAEGAL-- 177 (390) T ss_pred HHhhhcccccccccccchhHHHHHHH-HHHhhchhhh---------hcceeeccCCceEEEEEecCCcceeeecCCcc-- Confidence 0000000 11 122577777765443 3333333322 11122345668899999866566666666643 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH------HHHHHhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM------AVGVYKSNLAGNFA 152 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~------l~Gvf~~~~a~~~~ 152 (367) ++....+-.+.....++.+.-..+++....-+ .+-...+.+++++...+..++.+|.= ..|+++... T Consensus 178 -~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~----- 250 (390) T protein:vir:10 178 -KPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQAT----- 250 (390) T ss_pred -ccccccceeEEEEeeEEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccc----- Confidence 44455555555666666777777777654433 35667888888877777666655420 111111100 Q ss_pred hhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccc Q lcl|Aclame:pro 153 TIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDS 230 (367) Q Consensus 153 ~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~ 230 (367) ....+........++.++++...+.+....-.+++||+..+..|++..--+ |+-.. 
T Consensus 251 ----------------------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~ 308 (390) T protein:vir:10 251 ----------------------TYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGN 308 (390) T ss_pred ----------------------cccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 000011112234578888998888777777889999999999998754111 11111 Q ss_pred cccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEe---cceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE--- Q lcl|Aclame:pro 231 KGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFG---GAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE--- 304 (367) Q Consensus 231 ~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~---~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~--- 304 (367) .....-++++|++|++++.||.. + ++|| .+...+.... ..+++.++.... ..++..+....+ T Consensus 309 ~~~~~~~~l~G~pv~~~~~~p~~------~---~~~gdf~~~~~~~~~~~--~~i~~~~~~~~~-~~~~~~~r~~~r~d~ 376 (390) T protein:vir:10 309 ARGTLTPTLWGLPVVATQAMAPG------E---FLVGAFDLAAQIFDQWD--ARVEIGYVNDDF-QRNMVTVLAEERLAL 376 (390) T ss_pred CcCcCCceecceeeEEcCCCCCC------c---EEEEeccceEEEEEecc--eEEEEeeccccc-ccCcEEEEEEEeecc Confidence 11122357899999999999842 1 2333 2222221111 223443332111 123333322222 Q ss_pred EEeeeeeeeecccccccccccccccccccccCC Q lcl|Aclame:pro 305 WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) Q Consensus 305 ~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~s 337 (367) .+.+|..|..-. 
.+ T Consensus 377 ~v~~~~a~~~~~-------------------~a 390 (390) T protein:vir:10 377 VVYRPEALISGS-------------------FA 390 (390) T ss_pred EEeccccEEEEE-------------------eC Confidence 234565554321 01 No 52 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.61 E-value=1.2e-08 Score=63.97 Aligned_cols=290 Identities=11% Similarity=-0.028 Sum_probs=142.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC-CcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-SLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~-g~~~~~~~~~~~~~ 79 (367) +....-.|.-.-.++|+.+.+-+.....+.+.+.+ .. +....++...++|+...-+ .....+.|+..... T Consensus 119 ~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~ 189 (415) T protein:vir:98 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK------YV---TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhccccccccccccchHHHHHHHHHHHhhhhhhh------he---eeeeccCCceeEEEEeecCCccceeeccccccCc Confidence 11111111123457888776666655555444411 11 1111234444444443332 23344555433211 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+-++.....++.+.-..+++....-+..|....+.++|++.+.+..++.++.-+ -........ 
T Consensus 190 --~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~---g~g~~~~~~-------- 256 (415) T protein:vir:98 190 --LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI---TKGSTGSTS-------- 256 (415) T ss_pred --ccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCcccccc-------- Confidence 1222333444556666777788888776666777889999999888887776665422 110000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcc-cccccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIP-DSKGQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~-~~~g~~~i 236 (367) .+... ...+.......+++.|.++...+.+....-..++||+..+..|++..--+ |+- +.-....- T Consensus 257 ----------~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:98 257 ----------SGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred ----------ccccc-cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC Confidence 00000 01111223457899999999888777666778999999999998752111 111 11111234 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEe--cceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFG--GAAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~--~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) .+++|++|++.+.+|.... +.+ +++|| ..++.+.... 
...+++.+ ...+..+.- .+.| .--++||..|- T Consensus 326 ~~l~G~pV~~~~~~~~~~~---~~~-~~~~Gd~~~~~~~~~~~-~~~v~~~~--~~~~~~~~~-~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:98 326 QRLLGAKIEILPDEVLGQK---GNN-TLIIGNLKDAIVLFDRS-QYQASWTD--YMHFGECLM-IAVRQDCRILDYKSAI 397 (415) T ss_pred ceecceeeEEecccccCCC---Ccc-EEEEEehhccEEEEeec-ceEEEEec--cccCceEEE-EEEEeccEEeccccEE Confidence 5899999999999997542 332 35565 2333232222 12233332 222212111 1112 23345777775 Q ss_pred ecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) +-.-+- ....|-+--|+. T Consensus 398 ~~~~~~--------------~~~~~~~~~~~~ 415 (415) T protein:vir:98 398 VIEYDD--------------SERGEGDLGLEA 415 (415) T ss_pred EEEEec--------------cCCCCCccccCC Confidence 543221 111222222222 No 53 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.61 E-value=1.2e-08 Score=63.97 Aligned_cols=290 Identities=11% Similarity=-0.028 Sum_probs=142.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC-CcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-SLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~-g~~~~~~~~~~~~~ 79 (367) +....-.|.-.-.++|+.+.+-+.....+.+.+.+ .. +....++...++|+...-+ .....+.|+..... 
T Consensus 119 ~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~ 189 (415) T protein:vir:79 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK------YV---TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhccccccccccccchHHHHHHHHHHHhhhhhhh------he---eeeeccCCceeEEEEeecCCccceeeccccccCc Confidence 11111111123457888776666655555444411 11 1111234444444443332 23344555433211 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+-++.....++.+.-..+++....-+..|....+.++|++.+.+..++.++.-+ -........ T Consensus 190 --~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~---g~g~~~~~~-------- 256 (415) T protein:vir:79 190 --LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI---TKGSTGSTS-------- 256 (415) T ss_pred --ccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCcccccc-------- Confidence 1222333444556666777788888776666777889999999888887776665422 110000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcc-cccccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIP-DSKGQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~-~~~g~~~i 236 (367) .+... 
...+.......+++.|.++...+.+....-..++||+..+..|++..--+ |+- +.-....- T Consensus 257 ----------~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:79 257 ----------SGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred ----------ccccc-cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC Confidence 00000 01111223457899999999888777666778999999999998752111 111 11111234 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEe--cceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFG--GAAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~--~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) .+++|++|++.+.+|.... +.+ +++|| ..++.+.... ...+++.+ ...+..+.- .+.| .--++||..|- T Consensus 326 ~~l~G~pV~~~~~~~~~~~---~~~-~~~~Gd~~~~~~~~~~~-~~~v~~~~--~~~~~~~~~-~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:79 326 QRLLGAKIEILPDEVLGQK---GNN-TLIIGNLKDAIVLFDRS-QYQASWTD--YMHFGECLM-IAVRQDCRILDYKSAI 397 (415) T ss_pred ceecceeeEEecccccCCC---Ccc-EEEEEehhccEEEEeec-ceEEEEec--cccCceEEE-EEEEeccEEeccccEE Confidence 5899999999999997542 332 35565 2333232222 12233332 222212111 1112 23345777775 Q ss_pred ecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) +-.-+- ....|-+--|+. 
T Consensus 398 ~~~~~~--------------~~~~~~~~~~~~ 415 (415) T protein:vir:79 398 VIEYDD--------------SERGEGDLGLEA 415 (415) T ss_pred EEEEec--------------cCCCCCccccCC Confidence 543221 111222222222 No 54 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.61 E-value=1.2e-08 Score=63.97 Aligned_cols=290 Identities=11% Similarity=-0.028 Sum_probs=142.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC-CcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-SLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~-g~~~~~~~~~~~~~ 79 (367) +....-.|.-.-.++|+.+.+-+.....+.+.+.+ .. +....++...++|+...-+ .....+.|+..... T Consensus 119 ~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~ 189 (415) T protein:vir:81 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK------YV---TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhccccccccccccchHHHHHHHHHHHhhhhhhh------he---eeeeccCCceeEEEEeecCCccceeeccccccCc Confidence 11111111123457888776666655555444411 11 1111234444444443332 23344555433211 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+-++.....++.+.-..+++....-+..|....+.++|++.+.+..++.++.-+ -........ 
T Consensus 190 --~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~---g~g~~~~~~-------- 256 (415) T protein:vir:81 190 --LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI---TKGSTGSTS-------- 256 (415) T ss_pred --ccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCcccccc-------- Confidence 1222333444556666777788888776666777889999999888887776665422 110000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcc-cccccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIP-DSKGQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~-~~~g~~~i 236 (367) .+... ...+.......+++.|.++...+.+....-..++||+..+..|++..--+ |+- +.-....- T Consensus 257 ----------~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:81 257 ----------SGFEK-EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred ----------ccccc-cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC Confidence 00000 01111223457899999999888777666778999999999998752111 111 11111234 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEe--cceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFG--GAAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~--~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) .+++|++|++.+.+|.... +.+ +++|| ..++.+.... 
...+++.+ ...+..+.- .+.| .--++||..|- T Consensus 326 ~~l~G~pV~~~~~~~~~~~---~~~-~~~~Gd~~~~~~~~~~~-~~~v~~~~--~~~~~~~~~-~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:81 326 QRLLGAKIEILPDEVLGQK---GNN-TLIIGNLKDAIVLFDRS-QYQASWTD--YMHFGECLM-IAVRQDCRILDYKSAI 397 (415) T ss_pred ceecceeeEEecccccCCC---Ccc-EEEEEehhccEEEEeec-ceEEEEec--cccCceEEE-EEEEeccEEeccccEE Confidence 5899999999999997542 332 35565 2333232222 12233332 222212111 1112 23345777775 Q ss_pred ecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) +-.-+- ....|-+--|+. T Consensus 398 ~~~~~~--------------~~~~~~~~~~~~ 415 (415) T protein:vir:81 398 VIEYDD--------------SERGEGDLGLEA 415 (415) T ss_pred EEEEec--------------cCCCCCccccCC Confidence 543221 111222222222 No 55 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.61 E-value=1.3e-08 Score=63.79 Aligned_cols=283 Identities=12% Similarity=0.020 Sum_probs=134.3 Q ss_pred CCC--------c-cccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCccccc Q lcl|Aclame:pro 1 MPD--------F-NNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNY 71 (367) Q Consensus 1 Ma~--------~-~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~ 71 (367) |.. + +..|-=.-.++|+.|..-|...+.+.+-+.+-+-+ ...+| .+.+|.+..- +.+... 
T Consensus 131 l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~---------~~~~~-~~~~p~~~~~-~~a~~~ 199 (434) T protein:vir:62 131 IVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTG---------VKTKE-NIKYPVLVKK-AEAQGH 199 (434) T ss_pred hccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcce---------eccCC-ceEEEEEecC-Ccccce Confidence 110 0 00010011456777766565555554444221111 11123 4678876433 222222 Q ss_pred CCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 72 GSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNF 151 (367) Q Consensus 72 ~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~ 151 (367) .+......++....+-++.....++.+.-..+++....-+.-|-...|.++|++-..+..++.+|. | +...+.. T Consensus 200 ~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~---G---~G~~~~~ 273 (434) T protein:vir:62 200 KNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVN---G---DEANNIN 273 (434) T ss_pred ecccccccccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---c---CCCCccc Confidence 111111223333333344445556666666777776666666777889999987777776666552 1 1100000 Q ss_pred hhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcc- Q lcl|Aclame:pro 152 ATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIP- 228 (367) Q Consensus 152 ~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~- 228 (367) .. 
++.-++.+.......+++.|++....+-.....-..++||+..+..|++..--+ |+- T Consensus 274 ~g------------------~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~ 335 (434) T protein:vir:62 274 DG------------------ALAKKAVEFKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLR 335 (434) T ss_pred cc------------------eeecccccccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeec Confidence 00 000001111122346788888888777655555668899999999998763221 211 Q ss_pred cc-cc-cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEe---cceeeeeccCCCcceeeeeehhhcCCceeEEEEEcc Q lcl|Aclame:pro 229 DS-KG-QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFG---GAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERK 303 (367) Q Consensus 229 ~~-~g-~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~---~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~ 303 (367) ++ .. ...-.+++|++|++++.||....+. -..++|| ...|+ .- ..+++++|.....-..++..+. T Consensus 336 ~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~---~~~i~~Gdfs~~~i~--~~--~g~~~i~~~~~~~~~~~~v~~~--- 405 (434) T protein:vir:62 336 PFNQAEGGIGYTLLGFPVEEEDAIDIPDSPD---TPVFYFGDFSKFYIQ--DV--IGSLEVQKLVELFSRTNRVGFR--- 405 (434) T ss_pred cCCCccCCCCceecceeeEEecCccCccCCC---ceEEEEeeccceEEE--Ee--eceeEEEeehhhhcccCceEEE--- Confidence 11 11 1122479999999999999754322 2234443 11111 11 1123444433222111221111 Q ss_pred EEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 304 EWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 304 ~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) .+.+-+ +.+||.|.+++++.+.-|. 
T Consensus 406 -------~~~r~D--------------------------------gk~i~~~~~~~~~~~~~~~ 430 (434) T protein:vir:62 406 -------IWNLLD--------------------------------AQLIHSPFEVPVYKYVLKA 430 (434) T ss_pred -------EEeeec--------------------------------ceeecCcccceEEEEEecc Confidence 112111 3445666666666666444 No 56 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.61 E-value=1.3e-08 Score=63.80 Aligned_cols=273 Identities=9% Similarity=0.004 Sum_probs=144.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |.. ..+.=...++|+.+...+.....+.+.|.+ ...... .+.+.-.+.+|.+..-.+.+..+.|+..... T Consensus 109 ~~~--~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~------~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~- 178 (397) T protein:vir:49 109 KTD--ASGSDAGLTIPQDIQTAIHTLVSQYDSLQE------YVNVEN-VTTLTGSRVYEKWTDITGLANIDDEAGKIAD- 178 (397) T ss_pred hhc--cccccCcccccHhHHHHHHHHHHhhhhHHh------hhceee-cccCccceEEEeeccCCcceeeecCcccccc- Confidence 332 111123467788887777766666555522 111111 1112223455666655556677777653221 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) .++.+-.+.....++.+.-..++++...-+.-|....+.+++++...+..++.++. |. 
..+ T Consensus 179 -~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~---G~----g~~----------- 239 (397) T protein:vir:49 179 -VDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE---AI----AAL----------- 239 (397) T ss_pred -ccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh---hc----ccc----------- Confidence 23344455555667777777888887766667778899999998888777665543 21 000 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc------- Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ------- 233 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~------- 233 (367) .......+++.+.++...+-.....-.+++||+..+..|++.. +++|. T Consensus 240 -------------------~~~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lk------d~~G~~l~~~~~ 294 (397) T protein:vir:49 240 -------------------PTKPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVK------NALGDYLMERDV 294 (397) T ss_pred -------------------ccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh------cCCCceeeccCc Confidence 0112335688899988877666666789999999999998763 22222 Q ss_pred --ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhc--CCceeEEEEE--c-cE Q lcl|Aclame:pro 234 --LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRG--NGSGLEYILE--R-KE 304 (367) Q Consensus 234 --~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~--~~~g~~~l~~--r-~~ 304 (367) ..-.+++|++|++.++.++... +.+.. +++||. .++.+.... .+++++++... -..++..+.. 
| .- T Consensus 295 ~~~~~~~l~G~PV~~~~~~~~~~~-~~~~~-~i~~gd~~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~r~~~r~d~ 369 (397) T protein:vir:49 295 KSPTGYSIDGFAVKEVADRWLANG-TGGAM-PLYFGDLKQAVTLFDRQ---HMSLLSTNIGGGAFETDTTKVRVIDRFDV 369 (397) T ss_pred CCCCCceecceeeEEecccccccc-cCCce-eEEEeeccceEEEEeec---ceEEEEeccccchhhcCceeEEEEeeeCc Confidence 1235899999998776444322 22222 355552 223332221 23444433221 1122222222 2 12 Q ss_pred EEeeeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 305 WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 305 ~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) .++||.+|..-.-. + ...+.+..||-|- T Consensus 370 ~~~~~~a~~~~~~~--~--------~~~~~~~~~~~~~ 397 (397) T protein:vir:49 370 VATDTEAFVPASFK--A--------IADQKGNLGSTAV 397 (397) T ss_pred EEecccceEEEEee--c--------ccCCCCCcccccC Confidence 34577666543211 0 0112223333332 No 57 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.60 E-value=5.2e-09 Score=65.98 Aligned_cols=274 Identities=11% Similarity=0.003 Sum_probs=137.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |.. ..+.-..++.|++. +.+.....+.+.+.+- -.....++..+++|.+..-+..+..+.|+.. 
+ T Consensus 105 ~~~--~~~~~g~~i~~~~~-~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~---~ 169 (385) T protein:vir:18 105 LGS--DADSAGSLIQPMQI-PGIIMPGLRRLTIRDL---------LAQGRTSSNALEYVREEVFTNNADVVAEKAL---K 169 (385) T ss_pred hcc--ccccCCceecchhh-hHHHHHhhhccchhhh---------cceecccCcceEEEEEecCCcceeeeccCcc---c Confidence 221 00111124555544 4455555554444221 1111234667899998765555556666643 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+.+-.+.....++.+....++++...-+ .+....+.+++++...+..+..+|. | + .++.. ...+ . T Consensus 170 ~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~---G---~-g~~~~-~~Gi---~ 237 (385) T protein:vir:18 170 PESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLN---G---D-GTGDN-LEGL---N 237 (385) T ss_pred cccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHh---c---c-CCCCc-cccc---c Confidence 5556666666777777888888888765433 4566788888887777766665542 2 1 11100 0000 0 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccccccccchh Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDSKGQLTIPT 238 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~~g~~~i~t 238 (367) .. . 
.....+........++.|.++...+-.....-.+++||+..+..|++..--+ ++-.......-++ T Consensus 238 ~~-------~---~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 307 (385) T protein:vir:18 238 KV-------A---TAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNI 307 (385) T ss_pred cc-------c---ccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCce Confidence 00 0 0000111122335688999999888777777789999999999998753111 1111111123467 Q ss_pred hcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 239 YMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 239 ~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) ++|++|++++.||-. +.+|+.- ++.+.... ...+++.+.....-..++..+....++ +.+|..|. T Consensus 308 l~G~pV~~~~~~p~~---------~~~~gd~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~ 377 (385) T protein:vir:18 308 MWGLPVVPTKAQAAG---------TFTVGGFDMASQVWDRM-DATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAII 377 (385) T ss_pred ecceeeEEcCcCCCC---------cEEEeecccEEEEEEec-ceEEEEeccccchhhcCcEEEEEEEeeccEEecccceE Confidence 899999999999832 1223321 12221111 112333322211111233333333233 44666664 Q ss_pred eccccccccccccccccccccc Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~ 335 (367) .-.-. 
+++ T Consensus 378 ~~~~~--------------aa~ 385 (385) T protein:vir:18 378 KGTFS--------------SGS 385 (385) T ss_pred EEEec--------------cCC Confidence 43211 011 No 58 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.60 E-value=5.2e-09 Score=65.98 Aligned_cols=274 Identities=11% Similarity=0.003 Sum_probs=137.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |.. ..+.-..++.|++. +.+.....+.+.+.+- -.....++..+++|.+..-+..+..+.|+.. + T Consensus 105 ~~~--~~~~~g~~i~~~~~-~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~---~ 169 (385) T protein:vir:19 105 LGS--DADSAGSLIQPMQI-PGIIMPGLRRLTIRDL---------LAQGRTSSNALEYVREEVFTNNADVVAEKAL---K 169 (385) T ss_pred hcc--ccccCCceecchhh-hHHHHHhhhccchhhh---------cceecccCcceEEEEEecCCcceeeeccCcc---c Confidence 221 00111124555544 4455555554444221 1111234667899998765555556666643 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+.+-.+.....++.+....++++...-+ .+....+.+++++...+..+..+|. | + .++.. ...+ . 
T Consensus 170 ~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~---G---~-g~~~~-~~Gi---~ 237 (385) T protein:vir:19 170 PESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLN---G---D-GTGDN-LEGL---N 237 (385) T ss_pred cccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHh---c---c-CCCCc-cccc---c Confidence 5556666666777777888888888765433 4566788888887777766665542 2 1 11100 0000 0 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccccccccchh Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDSKGQLTIPT 238 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~~g~~~i~t 238 (367) .. . .....+........++.|.++...+-.....-.+++||+..+..|++..--+ ++-.......-++ T Consensus 238 ~~-------~---~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 307 (385) T protein:vir:19 238 KV-------A---TAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNI 307 (385) T ss_pred cc-------c---ccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCce Confidence 00 0 0000111122335688999999888777777789999999999998753111 1111111123467 Q ss_pred hcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 239 YMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 239 ~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) ++|++|++++.||-. +.+|+.- ++.+.... ...+++.+.....-..++..+....++ +.+|..|. 
T Consensus 308 l~G~pV~~~~~~p~~---------~~~~gd~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~ 377 (385) T protein:vir:19 308 MWGLPVVPTKAQAAG---------TFTVGGFDMASQVWDRM-DATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAII 377 (385) T ss_pred ecceeeEEcCcCCCC---------cEEEeecccEEEEEEec-ceEEEEeccccchhhcCcEEEEEEEeeccEEecccceE Confidence 899999999999832 1223321 12221111 112333322211111233333333233 44666664 Q ss_pred eccccccccccccccccccccc Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~ 335 (367) .-.-. +++ T Consensus 378 ~~~~~--------------aa~ 385 (385) T protein:vir:19 378 KGTFS--------------SGS 385 (385) T ss_pred EEEec--------------cCC Confidence 43211 011 No 59 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.60 E-value=1.6e-08 Score=63.39 Aligned_cols=277 Identities=10% Similarity=0.023 Sum_probs=138.4 Q ss_pred CCCccccccc--eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRL--VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l--~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) .......|.- ..+|.+++....+.....+.+.|.+ +.......| .+.+|.-.. ...+.-+.|+.. 
T Consensus 247 ~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~---------~~~~~~~~g-~~~~~~~~~-~~~a~~v~Eg~~-- 313 (543) T protein:vir:81 247 EVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRR---------FARQVVATG-DVWHGVSSA-AVQWSWDAEFEE-- 313 (543) T ss_pred hhhhcccccccCcccCchhhhhHHHHHHHhhhchhhh---------hcccccCCc-ceEEEEecC-CcceeecccCcc-- Confidence 1111122211 2234444444544333333333311 111112234 455665443 245556666643 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH------HHHHHHhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA------MAVGVYKSNLAGNFA 152 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla------~l~Gvf~~~~a~~~~ 152 (367) ++..+++-++.....++.+.-+.++.....-+ .|....|.++++..+.+..+..+|. ...|++.... T Consensus 314 -~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~----- 386 (543) T protein:vir:81 314 -VSDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALA----- 386 (543) T ss_pred -ccccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcc----- Confidence 55566676777777788888888888766544 5788899999998888877765542 1222221110 Q ss_pred hhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccc Q lcl|Aclame:pro 153 TIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDS 230 (367) Q Consensus 153 ~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~ 230 (367) ..+..+... ....++++.+.++...+-.....-.+++||+.++..|++..--+ |+-.. 
T Consensus 387 -----------------~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~ 446 (543) T protein:vir:81 387 -----------------GTAAEIAPV---TAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTT 446 (543) T ss_pred -----------------ccccccccc---ccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccC Confidence 001111111 12347788899888776544445568999999999998763111 11111 Q ss_pred cccccchhhcCcEEEEeCCCcccC--CCCCceEEEEEEecc-eeeeeccCCCcceeeeeehhhcCC----ceeEEEEEcc Q lcl|Aclame:pro 231 KGQLTIPTYMGKVVIVDDGMPVFG--TGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELRGNG----SGLEYILERK 303 (367) Q Consensus 231 ~g~~~i~t~~G~~VivdD~~pv~~--t~~~~~yttyl~~~G-Ai~~~~~~~~~~~e~~rd~~~~~~----~g~~~l~~r~ 303 (367) .....-++++|++|+++|.||... ..+.+.+. ++||.= -+.++.. ..+++.+++..... .|+..++... T Consensus 447 ~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~-i~~gd~~~~~i~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 522 (543) T protein:vir:81 447 IGNGEPSQLLGRPVGEAEAMDANWNTSASADNFV-LLYGNFQNYVIADR---IGMTVEFIPHLFGTNRRPNGSRGWFAYY 522 (543) T ss_pred cCCCCCccccceeeEEeccccccccccccCCcce-EEEeeccceeEEee---cccEEEEeccccccchhhcCceEEEEEE Confidence 111123579999999999999754 22344443 444431 1222221 23555555432210 1222222221 Q ss_pred EE---Eeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 304 EW---IVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 304 ~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) ++ +++|..|....-. 
+++ T Consensus 523 r~d~~v~~~~A~~~l~~~-------------~~a 543 (543) T protein:vir:81 523 RMGADVVNPNAFRLLNVE-------------TAS 543 (543) T ss_pred eeccEeecccceEEEEec-------------ccC Confidence 11 2345554433211 011 No 60 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.60 E-value=8.9e-09 Score=64.73 Aligned_cols=281 Identities=13% Similarity=0.045 Sum_probs=143.9 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||-.. ..++.||+..+.+ +.+.+.+.+.+- ......++..+++|.+..- +.+.-+.|+.. + T Consensus 1 ma~~g-----G~lvp~~~~~~ii-~~~~~~s~i~~l---------~~~~~~~~~~~~ip~~~~~-~~a~~v~E~~~---~ 61 (298) T protein:vir:16 1 MVLNK-----GTLFDPTLVTDLI-SKVAGKSSIARL---------SAQKPIPFNGEKVFTFTMD-SEIDVVAESGK---K 61 (298) T ss_pred CcccC-----cceechhHHHHHH-HHHHhhhhhhhh---------cceeeccCCceEEEEEecC-cceEEecCCcc---c Confidence 99633 3357777766654 444444433221 1111234566899998754 55666777643 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhc---ccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG---SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g---~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) +..+++-++.....++.+.-..++++....+. .+.++.+.+++++.+.+..+..++. |.-.-....... ... 
T Consensus 62 ~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~---G~~~~~g~~~~~-~~~- 136 (298) T protein:vir:16 62 THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFH---GVNPRLGTASAV-IGT- 136 (298) T ss_pred cccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhc---cccCCCCccccc-ccc- Confidence 44555555555566667777788887765443 4567789999998888877766653 321111110000 000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hc-ccccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FI-PDSKGQL 234 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~-~~~~g~~ 234 (367) .......... .+ +. ....-.+..+.++..++........+++||++.+..|++.+-.+ ++ +...... T Consensus 137 -----~~~~~~~~~~--~~--~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~ 206 (298) T protein:vir:16 137 -----NHFDSKVTQK--VE--AP-RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGA 206 (298) T ss_pred -----cccccccccc--cc--cc-cccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCC Confidence 0000000000 00 00 11112256788888887766667788999999999998863111 11 1100111 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcC------CceeEEEEEcc--- Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGN------GSGLEYILERK--- 303 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~------~~g~~~l~~r~--- 303 (367) .-.+++|++|++++.+|-... .+++ .++||. .++.++... ...+++.++....+ ..++..+.... 
T Consensus 207 ~~~~l~G~PV~~~~~v~~~~~--~~~~-~~~~GDfs~~~~~~~~~-~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d 282 (298) T protein:vir:16 207 TPDTINGLPVDVNKTVSDMSL--TQRD-RAIIGDFANGFKWGYAK-EVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLG 282 (298) T ss_pred CCceecceeeEEecccccccC--CCcc-EEEEeeccceEEEEEec-CceEEEeeccCCcCcchhhhhcCcEEEEEEEEEc Confidence 236899999999999985432 2333 344543 344443322 22344444322110 01222222211 Q ss_pred EEEeeeeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 304 EWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 304 ~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) -.++||..|..-+. .| T Consensus 283 ~~v~~~~a~~~l~~--------------------at 298 (298) T protein:vir:16 283 WGILDATKFARVTE--------------------AN 298 (298) T ss_pred cEeecccceEEEee--------------------cC Confidence 13557777665421 11 No 61 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.59 E-value=1e-08 Score=64.38 Aligned_cols=279 Identities=10% Similarity=0.035 Sum_probs=150.1 Q ss_pred CCCcc--ccc--cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFN--NQV--RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~--~~T--~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) +..++ ..+ .-+...+|+.+..-+.....+.+.|.+ +......++..+++|.+... +.+..+.|+.. 
T Consensus 21 ~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~ 90 (324) T protein:vir:10 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWADK-PGAYWVGEGQK 90 (324) T ss_pred cceecccceeccCCCcceechhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEeCC-cceeEeccCcc Confidence 11111 111 112345666665555555555544422 11222345677999998753 56777777754 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) ++..+.+-++.....++.+.-..++++...-+..|..+.+.+++++.+.+..++.+|. |. ..+. ....+ T Consensus 91 ---~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~---G~----g~~~-~~~~i 159 (324) T protein:vir:10 91 ---IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQ----GNNP-FGKSI 159 (324) T ss_pred ---ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh---cC----CCCc-cCccc Confidence 5556667777777788888888999987777777888999999999888887776653 21 0100 00000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc-- Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL-- 234 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~-- 234 (367) ...+..........++++.|.++..++.+....-.+++||+..+..|++.. +.+|.. 
T Consensus 160 ---------------~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~------d~~g~~~~ 218 (324) T protein:vir:10 160 ---------------AQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV------DPETKERI 218 (324) T ss_pred ---------------cccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh------ccCCceee Confidence 000000011122357899999999998777666778999999999998753 333322 Q ss_pred ---cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcC------------CceeEE Q lcl|Aclame:pro 235 ---TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------------GSGLEY 298 (367) Q Consensus 235 ---~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~------------~~g~~~ 298 (367) .-.+++|++|++++.++... + . ++|+. .-+.++... ...+++.++..... ..++.. T Consensus 219 ~~~~~~~l~G~PV~~~~~~~~~~----~--~-~~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (324) T protein:vir:10 219 YDRNSDTLDGLPVVNLKSSNLKR----G--E-LITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred cCCCCccccceeEEeecCCCCCc----c--e-EEEEecccEEEEEec-CcEEEEeecccccccccccccchhhhhcCcEE Confidence 23578999999988876431 1 1 22222 122233322 12344444432110 012222 Q ss_pred EEEccEE---EeeeeeeeecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 299 ILERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 299 l~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) +....++ ++||..|.--... .++..+|-+|. 
T Consensus 291 ~r~~~r~d~~v~~~~A~~~l~~a--------------~~~~~~~~~~~ 324 (324) T protein:vir:10 291 LRATMHVALHIADDKAFAKLVPA--------------DKKTDSVPGEV 324 (324) T ss_pred EEEEEEEccEEecccceEEEEec--------------cCCCCCCCCCC Confidence 2222222 3345544322110 11112233333 No 62 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.59 E-value=2.3e-08 Score=62.46 Aligned_cols=290 Identities=10% Similarity=-0.018 Sum_probs=144.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC-CcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-SLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~-g~~~~~~~~~~~~~ 79 (367) +....-.|.-.-..+|+.+.+.+.....+.+.+.+-. +....++...++|....-. ..+..+.|+..... T Consensus 119 ~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~ 189 (415) T protein:vir:47 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV---------TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc---------ceeeccCCceeEEEEEecCCcceeeccccccccc Confidence 2221112333456899988887877766666553311 1111233344445433222 23445555543221 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+-.+.....++.+....+++....-+..|....+.+++++.+.+..++.+|.-. -....... 
T Consensus 190 --~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~---g~g~~~~~--------- 255 (415) T protein:vir:47 190 --LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI---TKGSTGST--------- 255 (415) T ss_pred --ccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCCcccc--------- Confidence 1222333334455666777788887766666677889999999888887777665422 11000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccc-ccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK-GQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~-g~~~i 236 (367) ........ .........+++.+.++...+.+....-.+++||+..+..|++..-- .|+-..+ ....- T Consensus 256 ---------~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~ 325 (415) T protein:vir:47 256 ---------SSGFEKEG-KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred ---------cccccccc-ceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC Confidence 00000000 11112345778899999988877666677899999999999875211 1221111 11223 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) .+++|++|++.+.+|.... +.. +++||. -++.+.... .+.+++.....+..+. ..+.| ..-++||..|. 
T Consensus 326 ~~l~G~pV~~~~~~~~~~~---~~~-~~~~gd~~~~~~~~~~~---~~~v~~~~~~~~~~~~-~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:47 326 QRLLGAKIEILPDEVLGQK---GNN-TLIIGNLKDAIVLFDRS---QYQASWTDYMHFGECL-MIAVRQDCRILDYKSAI 397 (415) T ss_pred ccccceeeEEeccccccCC---Ccc-EEEEEehhccEEEEeec---ceEEEeeccccCceEE-EEEEEeccEEeccccEE Confidence 6899999999999997543 222 355552 122222211 2333333322221111 11112 22344666665 Q ss_pred ecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .-.-+ +....|-+--|+. T Consensus 398 ~~~~~--------------~~~~~~~~~~~~~ 415 (415) T protein:vir:47 398 VIEYD--------------DSERGEGDLGLEA 415 (415) T ss_pred EEEee--------------ccCCCCCCccCCC Confidence 43211 0111222222322 No 63 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.59 E-value=2.3e-08 Score=62.46 Aligned_cols=290 Identities=10% Similarity=-0.018 Sum_probs=144.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC-CcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-SLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~-g~~~~~~~~~~~~~ 79 (367) +....-.|.-.-..+|+.+.+.+.....+.+.+.+-. +....++...++|....-. ..+..+.|+..... 
T Consensus 119 ~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~ 189 (415) T protein:vir:46 119 IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV---------TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred hhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc---------ceeeccCCceeEEEEEecCCcceeeccccccccc Confidence 2221112333456899988887877766666553311 1111233344445433222 23445555543221 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+-.+.....++.+....+++....-+..|....+.+++++.+.+..++.+|.-. -....... T Consensus 190 --~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~---g~g~~~~~--------- 255 (415) T protein:vir:46 190 --LAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVI---TKGSTGST--------- 255 (415) T ss_pred --ccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---ccCCcccc--------- Confidence 1222333334455666777788887766666677889999999888887777665422 11000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccc-ccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK-GQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~-g~~~i 236 (367) ........ 
.........+++.+.++...+.+....-.+++||+..+..|++..-- .|+-..+ ....- T Consensus 256 ---------~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~ 325 (415) T protein:vir:46 256 ---------SSGFEKEG-KKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred ---------cccccccc-ceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC Confidence 00000000 11112345778899999988877666677899999999999875211 1221111 11223 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) .+++|++|++.+.+|.... +.. +++||. -++.+.... .+.+++.....+..+. ..+.| ..-++||..|. T Consensus 326 ~~l~G~pV~~~~~~~~~~~---~~~-~~~~gd~~~~~~~~~~~---~~~v~~~~~~~~~~~~-~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:46 326 QRLLGAKIEILPDEVLGQK---GNN-TLIIGNLKDAIVLFDRS---QYQASWTDYMHFGECL-MIAVRQDCRILDYKSAI 397 (415) T ss_pred ccccceeeEEeccccccCC---Ccc-EEEEEehhccEEEEeec---ceEEEeeccccCceEE-EEEEEeccEEeccccEE Confidence 6899999999999997543 222 355552 122222211 2333333322221111 11112 22344666665 Q ss_pred ecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .-.-+ +....|-+--|+. 
T Consensus 398 ~~~~~--------------~~~~~~~~~~~~~ 415 (415) T protein:vir:46 398 VIEYD--------------DSERGEGDLGLEA 415 (415) T ss_pred EEEee--------------ccCCCCCCccCCC Confidence 43211 0111222222322 No 64 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.58 E-value=1.1e-08 Score=64.30 Aligned_cols=279 Identities=10% Similarity=0.041 Sum_probs=148.9 Q ss_pred CCCccccc----cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFNNQV----RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T----~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) +..++..+ .-..-++|+.+..-+.+...+.+-+.+ +......+|..+++|.+... ..+..+.|+. T Consensus 21 ~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~---------l~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~- 89 (324) T protein:vir:96 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWADK-PGAYWVGEGQ- 89 (324) T ss_pred hhhcccccccccCCCcceechhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEecC-cceeeecCCc- Confidence 22222111 112335566665555555544444322 11222345777999998643 4566777765 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) .++..+++-++.....++.+.-..++++...-+..|..+.|.+++++.+.+..++.+|. |. . .+.. .... 
T Consensus 90 --~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~---G~---g-~~~~-~~~~ 159 (324) T protein:vir:96 90 --KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQ---G-NNPF-GKSI 159 (324) T ss_pred --cccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh---cC---C-CCCc-Cccc Confidence 35566777777777888888888999987777777888999999999888888776653 21 0 0000 0000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc-- Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL-- 234 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~-- 234 (367) ..............++++.+.++..++.+.......++||+..+..|++.. +.+|.. T Consensus 160 ---------------~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lk------d~~G~~~~ 218 (324) T protein:vir:96 160 ---------------AQSIKKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV------DPETKERI 218 (324) T ss_pred ---------------cccccccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh------CCCCCeee Confidence 000000011122346789999999988777667778999999999998763 233322 Q ss_pred ---cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcC------------CceeEE Q lcl|Aclame:pro 235 ---TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------------GSGLEY 298 (367) Q Consensus 235 ---~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~------------~~g~~~ 298 (367) .-.+++|++|+++.+.+... + ..+||. ..+.++... ...+++.|+..... ..++.. 
T Consensus 219 ~~~~~~~l~G~PV~~~~~~~~~~----~---~~~~gd~s~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~ 290 (324) T protein:vir:96 219 YDRNSDSLDGLPVVNLKSSNLKR----G---ELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred cCCCCCcccceeeEeecCCCCCc----c---eEEEEecceEEEEEec-CcEEEEeecccccccccccccchhhhhcCcEE Confidence 34579999999987766432 1 122221 112233322 22344444432110 012222 Q ss_pred EEEccEE---EeeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 299 ILERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 299 l~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) +-...++ +.+|..|..-.... .+. ...|.-. T Consensus 291 ~r~~~r~d~~v~~~~a~~~l~~a~-------~~~-----~~~~~~~ 324 (324) T protein:vir:96 291 LRATMHVALHIADDKAFAKLVPAD-------KRT-----DSVPGEV 324 (324) T ss_pred EEEEEEeccEEecccceEEEeccc-------ccC-----CCCCCCC Confidence 3332333 33444444322110 010 1112111 No 65 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.58 E-value=6.2e-09 Score=65.56 Aligned_cols=276 Identities=13% Similarity=0.017 Sum_probs=134.6 Q ss_pred CCC-ccccccc--eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MPD-FNNQVRL--VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~-~~~~T~l--~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) |.. +...|.- .-++.|++ ..-+...+.+.+.|.+ +-.....+|..+++|....-++.+..+.|+.. 
T Consensus 109 ~~~~~~~~~~~~~g~~vp~~~-~~~ii~~~~~~~~l~~---------l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~- 177 (395) T protein:vir:43 109 MPRSAITSIDGSGGALVAPDR-RPGVVAAPQRRLTIRD---------LVAPGTTESNSVEYVRETGFVNNAAPVSEGTQ- 177 (395) T ss_pred hhhhhhcccCCCCccccchhh-HHHHHHHHHhhhhHHh---------hccceecCCCceEEEEEecCCCceeeecCCcc- Confidence 211 1100111 12455554 4445555555544422 11112235677899988765556666777643 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .+..+.+-.+.....++.+....+++.....+ .+-...+.++|++...+..+..+|. | + .++.. ...+. T Consensus 178 --~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~v~~~la~a~~~~~d~~~l~---G---~-g~~~~-~~Gi~ 246 (395) T protein:vir:43 178 --KPYSDLTFELENAPVRTIAHLFKASRQILDDA-SALQSYIDARARYGLMLVEECQLLY---G---N-GTGAN-LHGII 246 (395) T ss_pred --ccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHh---c---c-CCCCc-ccccc Confidence 44455555666666777777778888765443 3556777888887777766665542 2 1 00000 00000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSKGQLT 235 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~g~~~ 235 (367) ......+...+. .......++.+.++...+......-.+++||+..+..|++..-- .|+-....... 
T Consensus 247 --------~~~~~~~~~~~~---~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~ 315 (395) T protein:vir:43 247 --------PQAQAYAPPSGV---VVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGT 315 (395) T ss_pred --------cccccccccccc---ccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCC Confidence 000011111111 11223467888888888766666677899999999998765311 11111111223 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEec---ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeee Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG---AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHP 309 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~---GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp 309 (367) -++++|++|+++|.||-. + .+||. +...+ ... ...++..+.....-..++..+....++ +.|| T Consensus 316 ~~~l~G~pVv~~~~~~~~------~---~~~gd~~~~~~~~-~~~-~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 384 (395) T protein:vir:43 316 TPTLWRLPVVETQAITQD------E---FLTGAFSLGAQIF-DRM-DIEVLVSTENDKDFENNMVTIRAEERLAFAVYRP 384 (395) T ss_pred CceecceeeEEcCCCCCC------c---EEEEeccceEEEE-Eec-ceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 467899999999999842 1 22332 12121 111 222444443221111233333332222 3355 Q ss_pred eeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 310 GGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 310 ~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) ..|..-. 
++ ++ T Consensus 385 ~a~~~~~--~t-----------------aa 395 (395) T protein:vir:43 385 EAFVTGS--LT-----------------AS 395 (395) T ss_pred cceEEEE--ec-----------------cC Confidence 5554321 10 11 No 66 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.55 E-value=7.5e-09 Score=65.11 Aligned_cols=276 Identities=10% Similarity=0.013 Sum_probs=143.3 Q ss_pred CCC--cccc----ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCC Q lcl|Aclame:pro 1 MPD--FNNQ----VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSD 74 (367) Q Consensus 1 Ma~--~~~~----T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~ 74 (367) ||. .+.. |.-.-..+|+.+..-+.+...+.+.|.+.. ....-++..+++|.+..- ..+..+.|+ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~ip~~~~~-~~a~~v~E~ 70 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLA---------KNEPMTAQKKKFTYLAKG-VGAYWVSET 70 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhc---------ceeeccCCceEEEEEeCC-cceEEeecC Confidence 765 1111 111234677777655656555555443321 112234667899999743 445556665 Q ss_pred CccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) .. ++..+.+-++.....++.+.-..++++...-+..|..+.+.++|++.+.+..++.++. | +......... 
T Consensus 71 ~~---~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~---G---~g~~~~~~~~ 141 (304) T protein:vir:94 71 ER---IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF---G---TKSPYNTSTS 141 (304) T ss_pred cc---cccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee---c---cCCCcccccc Confidence 43 4445556666666777788888899888777778888999999998887776665542 1 1100000000 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL 234 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~ 234 (367) .. .... ...... ...+.....++.|.++..++........+++||+..+..|++. ++.+|.. T Consensus 142 ~~---------~~~~--~~~~~~-~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l------kd~~G~~ 203 (304) T protein:vir:94 142 GK---------PLVE--GAEEKG-NVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNA------LDANDRP 203 (304) T ss_pred cc---------cccc--cccccc-cccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHh------hccCCcE Confidence 00 0000 000000 0111234678999999988877767777899999999999875 3344432 Q ss_pred ----cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehh--------hc------CCce Q lcl|Aclame:pro 235 ----TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRREL--------RG------NGSG 295 (367) Q Consensus 235 ----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~--------~~------~~~g 295 (367) ..++++|++|++++.||.... .+. ++||. --+.++.... ..++..++.. .. 
-..+ T Consensus 204 l~~~~~~~l~G~PV~~~~~~~~~~~--~~~---~~~gd~~~~~~~~~~~-~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~ 277 (304) T protein:vir:94 204 LFDANGNEIMGLPLSYTGADVYDKK--KSL---ALMGDWDYARYGILQG-IEYAISEDATLTTLQASDASGQPVSLFERD 277 (304) T ss_pred eecCCCccccceeeEEecccccCCC--CcE---EEEEehhhEEEEEecc-eEEEEeecceeeeecccccCccchhhhhcC Confidence 236899999999999996432 121 22221 0111222111 1122222211 00 0011 Q ss_pred eEEEEEccE---EEeeeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 296 LEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 296 ~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) +..+....+ -++||..|.--. .+| T Consensus 278 ~~~~r~~~r~~~~v~~~~a~~~l~-----------------------~a~ 304 (304) T protein:vir:94 278 MFALRATMHIAYMNVKPEAFATLK-----------------------PTE 304 (304) T ss_pred cEEEEEEEEeccEeecccceEEEE-----------------------ecC Confidence 111111111 234555554321 111 No 67 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.55 E-value=7.5e-09 Score=65.11 Aligned_cols=276 Identities=10% Similarity=0.013 Sum_probs=143.3 Q ss_pred CCC--cccc----ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCC Q lcl|Aclame:pro 1 MPD--FNNQ----VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSD 74 (367) Q Consensus 1 Ma~--~~~~----T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~ 74 (367) ||. .+.. |.-.-..+|+.+..-+.+...+.+.|.+.. 
....-++..+++|.+..- ..+..+.|+ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~ip~~~~~-~~a~~v~E~ 70 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLA---------KNEPMTAQKKKFTYLAKG-VGAYWVSET 70 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhc---------ceeeccCCceEEEEEeCC-cceEEeecC Confidence 765 1111 111234677777655656555555443321 112234667899999743 445556665 Q ss_pred CccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) .. ++..+.+-++.....++.+.-..++++...-+..|..+.+.++|++.+.+..++.++. | +......... T Consensus 71 ~~---~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~---G---~g~~~~~~~~ 141 (304) T protein:vir:10 71 ER---IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF---G---TKSPYNTSTS 141 (304) T ss_pred cc---cccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee---c---cCCCcccccc Confidence 43 4445556666666777788888899888777778888999999998887776665542 1 1100000000 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL 234 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~ 234 (367) .. .... ...... ...+.....++.|.++..++........+++||+..+..|++. ++.+|.. 
T Consensus 142 ~~---------~~~~--~~~~~~-~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l------kd~~G~~ 203 (304) T protein:vir:10 142 GK---------PLVE--GAEEKG-NVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNA------LDANDRP 203 (304) T ss_pred cc---------cccc--cccccc-cccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHh------hccCCcE Confidence 00 0000 000000 0111234678999999988877767777899999999999875 3344432 Q ss_pred ----cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehh--------hc------CCce Q lcl|Aclame:pro 235 ----TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRREL--------RG------NGSG 295 (367) Q Consensus 235 ----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~--------~~------~~~g 295 (367) ..++++|++|++++.||.... .+. ++||. --+.++.... ..++..++.. .. -..+ T Consensus 204 l~~~~~~~l~G~PV~~~~~~~~~~~--~~~---~~~gd~~~~~~~~~~~-~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~ 277 (304) T protein:vir:10 204 LFDANGNEIMGLPLSYTGADVYDKK--KSL---ALMGDWDYARYGILQG-IEYAISEDATLTTLQASDASGQPVSLFERD 277 (304) T ss_pred eecCCCccccceeeEEecccccCCC--CcE---EEEEehhhEEEEEecc-eEEEEeecceeeeecccccCccchhhhhcC Confidence 236899999999999996432 121 22221 0111222111 1122222211 00 0011 Q ss_pred eEEEEEccE---EEeeeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 296 LEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 296 ~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) +..+....+ -++||..|.--. 
.+| T Consensus 278 ~~~~r~~~r~~~~v~~~~a~~~l~-----------------------~a~ 304 (304) T protein:vir:10 278 MFALRATMHIAYMNVKPEAFATLK-----------------------PTE 304 (304) T ss_pred cEEEEEEEEeccEeecccceEEEE-----------------------ecC Confidence 111111111 234555554321 111 No 68 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.55 E-value=2.8e-08 Score=61.97 Aligned_cols=278 Identities=8% Similarity=-0.010 Sum_probs=138.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |..-. +.-.-..+|+.+..-+.....+.+.|.+-.=+.+ ...+...+.+|.+....+.+..+.|+.. + T Consensus 109 ~~~~t--~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~-------~~~~~~~~~~~~~~~~~~~a~~v~E~~~---~ 176 (397) T protein:vir:49 109 KTDGS--GSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVEN-------VTTLTGSRVYEKWADITGLAKLDDEGGQ---I 176 (397) T ss_pred hhccC--CccCcceecHHHHHHHHHHHHhhhhHhhhcceee-------ccCCcceEEEEeeccCCcceeeeccccc---c Confidence 32200 1112356688776666666666555532110100 1122223455656555455666666543 2 Q ss_pred cccc-cchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDG-LGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~k-itt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +... .+-.+.....++.+.-..++++...-+..|....+.+++++.+.+..++.+| .|. +. 
T Consensus 177 ~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail---~G~--------g~------- 238 (397) T protein:vir:49 177 GQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAIL---EAI--------GT------- 238 (397) T ss_pred ccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH---hcc--------cc------- Confidence 2111 2223334455566666777777666566677788999998777776665544 220 00 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhcc-cccccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~~i 236 (367) + ......++++.+.++...+-.....-..++||+..+..|++..=- .++- ..-....- T Consensus 239 -----------------~--~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~ 299 (397) T protein:vir:49 239 -----------------L--PNKPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTG 299 (397) T ss_pred -----------------c--cccccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCC Confidence 0 001234678899998888876666778999999999999886311 1111 00011122 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhc--CCceeEEEEEccE---EEeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRG--NGSGLEYILERKE---WIVHP 309 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~--~~~g~~~l~~r~~---~~~hp 309 (367) .+++|++|++.+++++.... .+.+ +++||. -++.+.... .+++++++... 
-..++..+....+ -++|| T Consensus 300 ~~l~G~pV~~~~~~~~~~~~-~~~~-~~~~gd~~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~ 374 (397) T protein:vir:49 300 YSIDGFVVKEISDRFLPNGT-GGAM-PLYFGDLKQAVTLFDRQ---HLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDT 374 (397) T ss_pred ceecceeeEEeccccccccc-CCce-eEEEeeccceEEEEeec---ccEEEEeccccchhhcCeeeEEEEEeeccEEecc Confidence 57999999987765543221 2222 355653 233333322 23344433221 1123333333222 24577 Q ss_pred eeeeecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 310 GGFNWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 310 ~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) ..|..-.- ++ .....|+..-.+. T Consensus 375 ~a~~~~~~--~~-----------~~~~~~~~~~~~~ 397 (397) T protein:vir:49 375 EAFVPASF--KA-----------IADQKAKLSTAGA 397 (397) T ss_pred cceEEEEe--cc-----------cccccCcccccCC Confidence 77765421 11 1111122211111 No 69 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.55 E-value=6.8e-09 Score=65.36 Aligned_cols=287 Identities=12% Similarity=0.067 Sum_probs=134.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+-... +.-.-..+|+.+..-+.+.+.+.+.+.+.|+ ..+..+...+++|.+..- ..+.-+.|+.. 
+ T Consensus 64 ~a~~~~-~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~--------~~v~~~~g~~~~p~~t~~-~~a~wv~E~~~---~ 130 (366) T protein:vir:57 64 MAISTA-AGSGGALIPQNMQNEVIELLRDRTVVRILGA--------RSIPLPNGNLSMPRLSGG-ATAGYVGEGKD---V 130 (366) T ss_pred hhcccc-ccCCccccchhHHHHHHHHHhhhcchhhhce--------eeeecCCCceEEEEEeCC-cceeeeccCcc---c Confidence 221100 1112345687776656555544444422221 011223335889987643 34444566543 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH------HHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA------MAVGVYKSNLAGNFATI 154 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla------~l~Gvf~~~~a~~~~~~ 154 (367) +..+.+-++.....++.+.-+.++++...-+.-+-...+.+++++-..+..++.+|. --+|+++.... T Consensus 131 ~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~------ 204 (366) T protein:vir:57 131 VATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATA------ 204 (366) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccc------ Confidence 445555555556666677777788777666666777889999998888777765552 11222221111 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccc--cCceeEEEEccHHHHHHHhcchhhhccccc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDH--VGSIAAIAVHSMVYKRMTNNDEIEFIPDSK 231 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~--~~~l~~~vmhS~v~~~L~k~~li~~~~~~~ 231 (367) ...+...++.. ......+.+.+.+.. +.+. ...-..++||+..+..|++.. 
+++ T Consensus 205 --------------~~~~~~~~~t~---~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk------d~~ 261 (366) T protein:vir:57 205 --------------ANRLVAWTGTA---INLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR------DGN 261 (366) T ss_pred --------------ccceeeccccc---cchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhh------ccC Confidence 11111111111 111223444454433 2322 223567899999999998763 333 Q ss_pred ccc-----cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc-eeeeeccCCCcceeeeeehhhcCCceeE-EEEEccE Q lcl|Aclame:pro 232 GQL-----TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELRGNGSGLE-YILERKE 304 (367) Q Consensus 232 g~~-----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G-Ai~~~~~~~~~~~e~~rd~~~~~~~g~~-~l~~r~~ 304 (367) |.. .-++++|++|++++.||.......+.. .++||.= -+.++.-. ...+++.|++.-.++.|.. -+|.|.. T Consensus 262 G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~-~i~~gdfs~~~i~~~~-~i~i~~~~ea~~~~~~g~~~~~f~~~~ 339 (366) T protein:vir:57 262 GNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNES-EIYFCDFNDVVIGEDG-MMKVDFSTEATYKDADGQLVSAFARNQ 339 (366) T ss_pred CceeccCCCCCeecceeeEEccccccccccCCCcc-EEEEEecceEEEEEec-ceEEEEeeccccccccccchhhhhcCc Confidence 322 225789999999999997543222322 2334332 22222222 2224444554322222220 1222111 Q ss_pred EEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccc Q lcl|Aclame:pro 305 WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNW 349 (367) Q Consensus 305 ~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW 349 (367) ..+ +-+..-+- +...|.---+-++.+| T Consensus 340 ~~i--R~~~~~d~----------------~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 340 SLI--RVVTEHDI----------------GFRHPEGLVLGTGVIW 366 (366) T ss_pred eeE--EeeeeeCc----------------EeeccccEEEEecccC Confidence 111 11100000 0011222223345556 No 70 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.54 E-value=2.9e-09 Score=67.36 Aligned_cols=293 Identities=13% Similarity=-0.010 Sum_probs=159.1 Q ss_pred 
CCCc--cccccc------e--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccc Q lcl|Aclame:pro 1 MPDF--NNQVRL------V--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPN 70 (367) Q Consensus 1 Ma~~--~~~T~l------~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~ 70 (367) |+.+ |..|+- + +++. |+|...|.....+++.| .+.-.++++ .+|+++.+|+.+... ... T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l-e~~~geV~~af~~~s~~------~~~~~~r~i--~~G~s~~~~~iG~~~--~~~ 69 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI-EEHLGLVDASFMYSSKF------ASWMNVRSL--RGTNQLRVDRVGAST--IAG 69 (334) T ss_pred CCCCcCCCccccccccccchheehh-hhhhhHHHHHHHHhhhh------hccceeeec--cccceEEEeeeccee--eee Confidence 7776 333331 1 3444 88888787766666666 333333322 579999999888663 233 Q ss_pred cCCCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHH-HHHHHHHhhhhh Q lcl|Aclame:pro 71 YGSDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRII-AMAVGVYKSNLA 148 (367) Q Consensus 71 ~~~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~ll-a~l~Gvf~~~~a 148 (367) +.-+ +.+....+.+.+.+-+|=. .--.+.+.|+-...+-.|...++++|.+..-++..++.++ .++++.-..... T Consensus 70 ~~~g---~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~ 146 (334) T protein:vir:80 70 RKAG---EELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPA 146 (334) T ss_pred ecCC---CCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 3333 3344455555554443332 3345678899888898999999999999888887666544 455554322111 Q ss_pred hhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHH----HHHHHHHhccccC-----ceeEEEEccHHHHHHH Q lcl|Aclame:pro 149 GNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREA----FVDAAFTMGDHVG-----SIAAIAVHSMVYKRMT 219 (367) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~----l~~A~~~~GD~~~-----~l~~~vmhS~v~~~L~ 219 (367) .... ...........+++.+. ...-++.. +.+|.+.|.+..- .-.+++|.|++|..|. 
T Consensus 147 ~~~~-----------~~~~G~~~~~~~~g~~~--~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll 213 (334) T protein:vir:80 147 HLKP-----------AFHDGILLPSTISGLAA--DAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLL 213 (334) T ss_pred cccc-----------cccCCcceeeccccccc--chhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHh Confidence 1000 00001111111222211 12233444 4455555654322 2489999999999998 Q ss_pred hc-chhhhcc-cccc-----cccchhhcCcEEEEeCCCcccCC------CCCceE-------EEEEEecceeeeeccCCC Q lcl|Aclame:pro 220 NN-DEIEFIP-DSKG-----QLTIPTYMGKVVIVDDGMPVFGT------GADKTY-------LSILFGGAAFGYADGAPQ 279 (367) Q Consensus 220 k~-~li~~~~-~~~g-----~~~i~t~~G~~VivdD~~pv~~t------~~~~~y-------ttyl~~~GAi~~~~~~~~ 279 (367) +. +|++-.- .+++ .-.|..++|.+|+.+..+|.... +..++| ..+++.+.|++.....+ T Consensus 214 ~~~r~~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~- 292 (334) T protein:vir:80 214 EHDRLMNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHP- 292 (334) T ss_pred cccccccceeccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEee- Confidence 87 5555321 1121 23588999999999999996531 122333 13566788998877664 Q ss_pred cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCC Q lcl|Aclame:pro 280 VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAI 338 (367) Q Consensus 280 ~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sP 338 (367) ...|..|++... 
+.-.+.++.=-+-++-|.+..-.+-+++ .| T Consensus 293 ~~~e~~~~~~~~-~d~i~~~~a~G~g~lRPeaa~vv~~~~~----------------~~ 334 (334) T protein:vir:80 293 VSAQFWEEKKDF-GHYLDTFQSYNIGQRRPDAVAVHDITVT----------------NP 334 (334) T ss_pred cceeeeechhhH-HHHHHHHHHcCCceeccceEEEEEEeee----------------cC Confidence 336666776542 1111112221223344443333222222 23 No 71 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.54 E-value=2.3e-08 Score=62.49 Aligned_cols=279 Identities=10% Similarity=0.038 Sum_probs=153.4 Q ss_pred CCCcccc--ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQ--VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~--T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) +-.+... +.-....+|+.+..-+.+...+.+.+.+ +......++..+++|.+... ..+.-+.|+. T Consensus 23 ~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~---------~~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~--- 89 (324) T protein:vir:97 23 VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWADK-PGAYWVGEGQ--- 89 (324) T ss_pred hhccccccccCCCcceechhHHHHHHHHHHhhcchhh---------hcceeeccCCceEEEEEecC-cceeEeccCc--- Confidence 1111111 1224457788776666666555554422 11222345778999999754 5566677764 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) .++..+++-++.....++.+.-..++++...-+..+..+.+.+++++.+.+..++.+|. |. .++.. 
...+ T Consensus 90 ~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~---G~----g~~~~-~~gi-- 159 (324) T protein:vir:97 90 KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQ----GNNPF-GKSI-- 159 (324) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---cC----CCCcc-Cccc-- Confidence 46667777777778888888888999987777777888999999998888888776653 21 11100 0000 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---- Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---- 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---- 234 (367) ...+..........++++.+.++..++.+....-.+++||+..+..|++.. +++|.. T Consensus 160 -------------~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk------d~~g~~~~~~ 220 (324) T protein:vir:97 160 -------------AQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV------DPETKERIYD 220 (324) T ss_pred -------------cccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh------cCCCceeecC Confidence 000000111122457899999999988777777778999999999988653 333321 Q ss_pred -cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc-eeeeeccCCCcceeeeeehhhcC------------CceeEEEE Q lcl|Aclame:pro 235 -TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELRGN------------GSGLEYIL 300 (367) Q Consensus 235 -~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G-Ai~~~~~~~~~~~e~~rd~~~~~------------~~g~~~l~ 300 (367) .-++++|++|++++..+... + .++||.- -+.++... ...++..++..... ..++..+. 
T Consensus 221 ~~~~tl~G~PV~~~~~~~~~~----~---~~~~gd~~~~~i~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:97 221 RNSDTLDGLPVVNLKSSNLKR----G---ELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred CCCccccceeeEeecCCCCCc----c---eEEEEecccEEEEEec-CcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 23678999999998876532 1 1333321 12232222 12344444332110 01122222 Q ss_pred EccEE---EeeeeeeeecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 301 ERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 301 ~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) ...++ +.||..|.--... . ++..-|-+|. T Consensus 293 ~~~r~d~~v~~~~a~~~l~~~-------------~-~~~~~~~~~~ 324 (324) T protein:vir:97 293 ATMHVALHIADDKAFAKLVPA-------------D-KKTDSVPGEV 324 (324) T ss_pred EEEEeccEEecccceEEEEec-------------c-CCCCCCCCCC Confidence 21222 3455555432111 0 1111223333 No 72 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.52 E-value=9.6e-09 Score=64.55 Aligned_cols=264 Identities=11% Similarity=0.005 Sum_probs=140.4 Q ss_pred CCCccccccc-eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRL-VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l-~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +.. ..|.- .-++.||.... +.+...+.+.|.+- -.....++..+++|.+..-.+.+..+.|+.. 
T Consensus 113 ~~~--~~~~~~g~~~~~~~~~~-ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~--- 177 (390) T protein:vir:81 113 AST--DAAGSAGALTTPNRLPG-FITPPDARLTVRDL---------IGSGRTDSALIEYVQETGFVNNAAIVAEGAL--- 177 (390) T ss_pred hcc--ccccCCcceechhhhHH-HHHHHhhhhhhhhh---------cceeeccCCceEEEEEecCCcceeeecCCcc--- Confidence 110 00111 12567776655 44444444444221 1112235677899998765556666777653 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH------HHHHHhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM------AVGVYKSNLAGNFAT 153 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~------l~Gvf~~~~a~~~~~ 153 (367) ++..+.+-++.....++.+....+++....-+ .+....+.++|++...+..++.+|.- ..|+++... T Consensus 178 ~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~------ 250 (390) T protein:vir:81 178 KPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQAT------ 250 (390) T ss_pred cccccceeeEEEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeeccc------ Confidence 55556666667777778888888888766555 46777888899888888777655420 111111100 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK 231 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~ 231 (367) ....+........++.+.++...+......-.+++||+.++..|++..-- .|+-... 
T Consensus 251 ---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~ 309 (390) T protein:vir:81 251 ---------------------TYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNA 309 (390) T ss_pred ---------------------ccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCc Confidence 00001111233567889999988877766777999999999999875311 1111111 Q ss_pred ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---E Q lcl|Aclame:pro 232 GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---I 306 (367) Q Consensus 232 g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~ 306 (367) ....-++++|++|++++.||.. +++||.- ++.+... -...++..+..... ..++..+....++ + T Consensus 310 ~~~~~~~l~G~pv~~~~~~p~~---------~~~~gd~~~~~~~~~~-~~~~v~~~~~~~~~-~~~~v~~r~~~r~d~~v 378 (390) T protein:vir:81 310 RGTLTPTLWGLPVVATQAMAPG---------EFLVGAFDLAAQIFDQ-WDARVEIGYVGEDF-QRNMITVLAEERLALVV 378 (390) T ss_pred ccccCceecceeeEEcCCCCCC---------cEEEEehhceEEEEEe-cceEEEEecccchh-hcCcEEEEEEEeeccEE Confidence 1122358899999999999842 1333332 2222211 12224444432211 1223333222222 4 Q ss_pred eeeeeeeecccccccccccccccccccccCC Q lcl|Aclame:pro 307 VHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) Q Consensus 307 ~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~s 337 (367) .||..|..-. 
.+ T Consensus 379 ~~~~a~v~~t-------------------~a 390 (390) T protein:vir:81 379 YRPEALISGS-------------------FA 390 (390) T ss_pred ecccceEEEE-------------------eC Confidence 4566554321 11 No 73 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.52 E-value=1e-08 Score=64.37 Aligned_cols=287 Identities=9% Similarity=-0.033 Sum_probs=137.3 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||- +.-..+++|+.|..=+.+.+.+.+.+.+-+-+ ...++..+++|.+..- ..+.-+.|+.. + T Consensus 1 mat----~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~---------i~~~~~~~~~p~~~~~-~~a~wv~Eg~~---~ 63 (311) T protein:vir:81 1 MVA----LATGTFQLPKHLVPGVWQKAQGQSVLARLSMA---------EPQEFGEQQYMTLTAP-PRGEVVGEGAQ---K 63 (311) T ss_pred Cce----ecCCceEcchhHHHHHHHHHHhcchhhhhcce---------eecCCCceEEEEEeCC-ceeEEeecCcc---c Confidence 885 22356789998877676666665555332211 1234557899998654 45555666643 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcc---cHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGS---NPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~---DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) +..+.+-++.....++.+.-..++++-...+.. +.++.+.+++++...+..+..++. |.-+-....... .. 
T Consensus 64 ~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~---G~~~~~~~~~~g---i~ 137 (311) T protein:vir:81 64 SESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIH---GINPLTGAALSG---SP 137 (311) T ss_pred ccccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhc---cccCCCCccccc---cc Confidence 444444444444555666666777775443443 356789999998888777666552 210000000000 00 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hc-ccccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FI-PDSKGQL 234 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~-~~~~g~~ 234 (367) ... .....+.... . .........+.++..++-+......+++||+..+..|++..--+ ++ +...... T Consensus 138 ~~~------~~~~~~~~~~--~--~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~ 207 (311) T protein:vir:81 138 AKI------LDTTNIVELT--T--GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGT 207 (311) T ss_pred ccc------cccceeeeec--c--cccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccC Confidence 000 0000111111 0 11112234455566666555556678999999999998753111 11 1111112 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEE---------EEEEecce-eeeeccCCCcceeeeeehhhcC-----CceeEEE Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYL---------SILFGGAA-FGYADGAPQVPVAVGRRELRGN-----GSGLEYI 299 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yt---------tyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~-----~~g~~~l 299 (367) .-++++|++|++++.||-.......... .++||.=+ +.++... 
...+++.++....+ ..++..+ T Consensus 208 ~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~ 286 (311) T protein:vir:81 208 DVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQV-SIPLELIEFGDPDGLGDLKRQNQIAI 286 (311) T ss_pred CCceecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEec-cceEEEeccCCCCcchhhhhcCcEEE Confidence 3478999999999999843211111111 13333311 2222211 12234444322110 1122222 Q ss_pred EEccE---EEeeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 300 LERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 300 ~~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) ....+ .++||..|.--..-. .+ T Consensus 287 r~~~r~d~~v~~~~a~~~l~~a~--------------------~~ 311 (311) T protein:vir:81 287 RAEVVYGIGIMSTDAFAVVRDAD--------------------ES 311 (311) T ss_pred EEEEEeccEeecccceEEEEeec--------------------cC Confidence 22222 345666655432111 11 No 74 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.51 E-value=2.1e-08 Score=62.73 Aligned_cols=282 Identities=14% Similarity=0.070 Sum_probs=143.8 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||+.. |.-..+|.||+..+ +.+.+.+.+.+.+ +......++..+++|.+.. ++.+.-+.|+. 
.+ T Consensus 1 ma~~t--~~~G~lip~~~~~~-ii~~l~~~s~i~~---------l~~~~~~~~~~~~~p~~~~-~~~a~wv~Eg~---~~ 64 (300) T protein:vir:95 1 MSEAQ--LSKGNLFNPELVTK-VINKVKGHSSIAK---------LSPQKPIPFNGQREFVFDF-DSDIDIVAENG---KK 64 (300) T ss_pred Ccccc--cCCcceechhhHHH-HHHHHHhhhhhhh---------hcceeeccCCceEEEEEec-CcceEEeeCCc---cc Confidence 99833 33356666665555 5555555555433 1111123556788998764 34566667764 35 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHh---hcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAEL---AGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~---~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) +..+.+-++.....++.+.-..++++-... +..|..+++.+++++...+..++.+| .|.-....... .... T Consensus 65 ~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l---~G~~~~~g~~~-~~~~-- 138 (300) T protein:vir:95 65 THGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSI---HGINPRTKQAS-TIIG-- 138 (300) T ss_pred ccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCc-cccc-- Confidence 555555555555666667777777775432 34567788999999877777776665 22110000000 0000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccc-cccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDS-KGQL 234 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~-~g~~ 234 (367) ... .....+ .+........++.+.++..++.+......+++||+..+..|++..--+ ++-.. .... 
T Consensus 139 -~~~-----~~~~~~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~ 207 (300) T protein:vir:95 139 -DNC-----FDKKVT-----QTVPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGG 207 (300) T ss_pred -ccc-----cccccc-----eeecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccC Confidence 000 000000 001112345678999999988777777888999999999998763111 11110 0011 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCcee-------EEEEEccE- Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGL-------EYILERKE- 304 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~-------~~l~~r~~- 304 (367) .-++++|++|++++.+|...++... ..+||. .++.++.-. ...+++....... +.++ ..+....+ T Consensus 208 ~~~~l~G~Pv~~s~~v~~~~~~~~~---~~~~GDf~~~~~~~~~~-~~~~~v~~~~~~d-~~~~~~f~~~~v~~r~~~r~ 282 (300) T protein:vir:95 208 VPDAINGLAVDKNRTVSYSQTDPKN---TAIVGDFETMFKWGYAK-EVPMEIIKYGDPD-NSGRDLKGYNQIYIRCEAYI 282 (300) T ss_pred CCceecceeeEEecCCCCCCCCCcc---EEEEeeccceEEEEEec-ccEEEEeeccCCC-CcchhhhhcCcEEEEEEEee Confidence 3478999999999999865432221 233443 233333211 1122222221111 1121 11111111 Q ss_pred --EEeeeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 305 --WIVHPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 305 --~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) .+.||.-|.--.. 
.+| T Consensus 283 d~~v~~~~a~~~l~~---------------~~g 300 (300) T protein:vir:95 283 GWGIMDAASFARIVK---------------TGG 300 (300) T ss_pred cceeecccceEEEec---------------CCC Confidence 2346665554321 122 No 75 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.48 E-value=6.7e-09 Score=65.38 Aligned_cols=288 Identities=13% Similarity=0.060 Sum_probs=147.9 Q ss_pred CCCccccccc----------e-------eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeecc Q lcl|Aclame:pro 1 MPDFNNQVRL----------V-------DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRD 63 (367) Q Consensus 1 Ma~~~~~T~l----------~-------d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~ 63 (367) ||+.+. -++ + ..|.|||+..|.. ++.| .+.-...+ + .+|+++.+|..+. T Consensus 1 m~~~~~-~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~-----~s~~------~~~~~~r~-i-~~G~sv~i~~iG~ 66 (347) T protein:vir:94 1 MANVPG-QKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTR-----RSVT------ADKHIVRT-I-QNGKSAQFPVMGR 66 (347) T ss_pred CCCCCc-cccccccccCCccccHHHHHHHHHhHHHHHHHHH-----HHhh------hccccccc-c-cccceEEEecccc Confidence 888653 333 1 2677777776542 2223 22222221 1 4799999999987 Q ss_pred CCCcccccCCCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHH Q lcl|Aclame:pro 64 LDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGV 142 (367) Q Consensus 64 l~g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gv 142 (367) .. ...+..+++... ++..+...+..-+|-. 
.--.+.+.|+=..-.-.|++.+++++.+...+++.+..++..+..+ T Consensus 67 ~t--v~~~t~G~~l~~-~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~ 143 (347) T protein:vir:94 67 TS--GVYLAPGERLSD-KRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAIL 143 (347) T ss_pred ee--eeeecCCCCcCC-CCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 64 233332222110 1122222222111111 1223467788888888899999999999999999999998877655 Q ss_pred HhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhccc----HHHHHHHHHHhcccc--CceeEEEEccHHHH Q lcl|Aclame:pro 143 YKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFN----REAFVDAAFTMGDHV--GSIAAIAVHSMVYK 216 (367) Q Consensus 143 f~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s----~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~ 216 (367) -+...+...... ......++.+...........+ ++.|.+|.+.|.+.. ..=..++|.|.+|. T Consensus 144 aa~~~~~~~~~~-----------g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~ 212 (347) T protein:vir:94 144 CNLPAASNENIA-----------GLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYS 212 (347) T ss_pred hccccccccccC-----------CCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHH Confidence 443332221111 0111222222111111111111 355666777775432 23478999999999 Q ss_pred HHHhcchhhh-ccccccc---ccchhhcCcEEEEeCCCcccCCCC----------C-----------ceE-------EEE Q lcl|Aclame:pro 217 RMTNNDEIEF-IPDSKGQ---LTIPTYMGKVVIVDDGMPVFGTGA----------D-----------KTY-------LSI 264 (367) Q Consensus 217 ~L~k~~li~~-~~~~~g~---~~i~t~~G~~VivdD~~pv~~t~~----------~-----------~~y-------tty 264 (367) .|.+.....- ....++. -.|+.++|.+|+.+..+|+.+.+. . .+| ... 
T Consensus 213 ~Ll~~~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l 292 (347) T protein:vir:94 213 AILAALMPNAANYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGL 292 (347) T ss_pred HHhccchhhhhhccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEE Confidence 9987643322 1122222 257899999999999999743221 0 112 235 Q ss_pred EEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeee---ecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 265 LFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFN---WLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 265 l~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s---~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) +|-+-|++.....+ ..+|..|+.... .|.+.. +|.+|.. |..+-+. .-+-| T Consensus 293 ~~h~~A~~~v~~~~-~~~e~~r~~~~~----~d~i~~-----~~~~G~~~~rP~~a~~~----------------~~~~A 346 (347) T protein:vir:94 293 FSHRSAVGTVKLRD-LALERDRDVDAQ----GDLIVG-----KYAMGHGGLRPEAAGAL----------------VFSPA 346 (347) T ss_pred Eeehhhhhhhhccc-ccccchhchhhH----HHHhhh-----hhhhcCcccccceeEEE----------------EecCC Confidence 55666666555443 235666776553 133222 2333332 2211111 00111 Q ss_pred H Q lcl|Aclame:pro 342 N 342 (367) Q Consensus 342 ~ 342 (367) | T Consensus 347 ~ 347 (347) T protein:vir:94 347 E 347 (347) T ss_pred C Confidence 1 No 76 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.45 E-value=6.5e-08 Score=60.00 Aligned_cols=263 Identities=11% Similarity=-0.008 Sum_probs=128.8 Q ss_pred CCCccccccce--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) -+..+..++.+ ..++|+.+..-+.....+.+.+.+ .. 
+....++...++|.+..-++....+.|+.... T Consensus 130 ~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~ 200 (400) T protein:vir:38 130 SDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKP------FT---NVFQASTQKGTYPTVANATTKMVTVAELEKNP 200 (400) T ss_pred HHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhh------cc---eeEeccCcceEEEEEecCCCcccccccccccc Confidence 00011111222 357787776666555555444421 11 11123566789998876656566665554322 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) .. ...+-.+.....++.+.-..++++...-+..|-.+.+.+++++......+..++. |.- T Consensus 201 ~~--~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~---~~~--------------- 260 (400) T protein:vir:38 201 AM--AKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVAT---LLK--------------- 260 (400) T ss_pred cc--ccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhh---ccc--------------- Confidence 11 2233334445566777777888876655555666777777765443332222211 100 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhcc-ccccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQLT 235 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~~ 235 (367) ++......+++.+.++....=+... -.+++||+..+..|++..-- .|+- +.-.... 
T Consensus 261 --------------------~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~ 319 (400) T protein:vir:38 261 --------------------GFTAKTISSVDDLKHINNVDLDPAY-SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPS 319 (400) T ss_pred --------------------cccccccccHHHHHHHHHhhhhhhh-CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCC Confidence 0111223456777777665433332 26899999999999875311 1221 1111223 Q ss_pred chhhcCcEEEEeCCCcccCCCCCceEEEEEEec-c-eeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeee Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-A-AFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGF 312 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-G-Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~ 312 (367) -++++|++|+++|.+|....+. ..++||. . ++.+.... ...+...++. ....+.-. +.| ..-+.||.+| T Consensus 320 ~~~l~G~pv~~~~~~~~~~~g~----~~~~~gd~s~~~~~~~~~-~~~~~~~~~~--~~~~~~~~-~~r~d~~~~~~~a~ 391 (400) T protein:vir:38 320 GKSVLGMPIAVVSDDTLGAAGE----AHAFLGDIKRAILFANRA-DFMVRWVDDQ--IYGQFLQA-GMRFGVSVADEKAG 391 (400) T ss_pred ccccccceeEEecccccCCCCc----eEEEEEeccccEEEEeec-ceEEEEeccc--ccceeEEE-EEEeccEEecccce Confidence 3689999999999999754332 1244443 1 12222111 1223333332 21122211 112 2335577777 Q ss_pred eecccccccccccccccccccc Q lcl|Aclame:pro 313 NWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 313 s~~~~~~~~~~~~~~~~~~~~~ 334 (367) .+-.-+- .+ T Consensus 392 ~~l~~~~-------------~a 400 (400) T protein:vir:38 392 YFLTYTP-------------KA 400 (400) T ss_pred EEEEeec-------------CC Confidence 7653221 11 No 77 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.44 E-value=2e-08 Score=62.78 Aligned_cols=283 Identities=11% Similarity=0.050 Sum_probs=139.1 Q ss_pred CCCccccccc-------------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccC--- Q lcl|Aclame:pro 1 MPDFNNQVRL-------------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDL--- 64 
(367) Q Consensus 1 Ma~~~~~T~l-------------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l--- 64 (367) ||-.|+.... .--++|+.|..-+.+.+.+.+-|.+-+ .....++..+++|.+..- T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~---------~~~~~~~~~~~ip~~~~~~~a 71 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLG---------ENIPISYGETIIPTTVKRPEV 71 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhc---------ceeeccCCceEEEEEecCccc Confidence 3322211110 112677777777777666666553211 122346788999987532 Q ss_pred ---C-CcccccCCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-- Q lcl|Aclame:pro 65 ---D-SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-- 138 (367) Q Consensus 65 ---~-g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-- 138 (367) + +.+..+.|+. .++..+.+-++.....++.+.-..++++...-+..|..+.+.+++++-+.+..+..+|.= T Consensus 72 ~~v~~~~~~~~~Eg~---~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g 148 (338) T protein:vir:78 72 GQVGVGTSNEQREGG---TKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKS 148 (338) T ss_pred eeecccccccccccc---cccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccC Confidence 1 1222233332 344455555555556667777788888877777788889999999988888777766531 Q ss_pred ------HHHHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhc-cccCceeEEEEc Q lcl|Aclame:pro 139 ------AVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMG-DHVGSIAAIAVH 211 (367) Q Consensus 139 ------l~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~G-D~~~~l~~~vmh 211 (367) ..|+...... ......+.. .......++.+.++..++. 
.......+++|| T Consensus 149 ~~~~~~~~gi~~~~~~-------------------~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~ 205 (338) T protein:vir:78 149 PLTGSALQGIDTNNVI-------------------VNTTNVDYL----QTGTTPLLDRFLDGYDLVSANTDVDFNGWAAD 205 (338) T ss_pred CCcccccccccccccc-------------------ccccccccc----cccchhhHHHHHHHHHHhhhhccccceEEEEc Confidence 1111110000 000001111 1112244678888877663 333356689999 Q ss_pred cHHHHHHHhcchhhhccccccc---------ccchhhcCcEEEEeCCCcccCC-CCCceEEEEEEecce-eeeeccCCCc Q lcl|Aclame:pro 212 SMVYKRMTNNDEIEFIPDSKGQ---------LTIPTYMGKVVIVDDGMPVFGT-GADKTYLSILFGGAA-FGYADGAPQV 280 (367) Q Consensus 212 S~v~~~L~k~~li~~~~~~~g~---------~~i~t~~G~~VivdD~~pv~~t-~~~~~yttyl~~~GA-i~~~~~~~~~ 280 (367) +..+..|.+... +++.+|. ..-.+++|++|+++|.||-... ....+. ..+||.=+ +.++... .. T Consensus 206 ~~~~~~L~~~~~---l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~-~~~~gdfs~~~~~~~~-~~ 280 (338) T protein:vir:78 206 PRYRARLLRSQA---YRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKV-RVVGGDFSQLKYGFAD-EI 280 (338) T ss_pred hHHHHHHHHHhh---hccCCCceeecccccCCCCceeeeeeEEEccccCccccccCCccc-EEEEEecceEEEEeec-cc Confidence 999999876532 1222222 1236899999999999985432 122222 23343322 2233222 12 Q ss_pred ceeeeeehhhcCCcee------------EEEEE--c-cEEEeeeeeeeecccccccccccccccccccccCCCCh Q lcl|Aclame:pro 281 PVAVGRRELRGNGSGL------------EYILE--R-KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITL 340 (367) Q Consensus 281 ~~e~~rd~~~~~~~g~------------~~l~~--r-~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~ 340 (367) .++..|+.....+..+ ..+-. | .--++||..|.--.. 
.++ |.- T Consensus 281 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~-~~~----------------~~~ 338 (338) T protein:vir:78 281 RVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVD-DED----------------PDA 338 (338) T ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEec-ccC----------------CCC Confidence 3444444322211111 11100 1 012345555432211 000 110 No 78 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.43 E-value=5.5e-08 Score=60.37 Aligned_cols=284 Identities=10% Similarity=0.018 Sum_probs=148.3 Q ss_pred CC-----Cccccc--cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MP-----DFNNQV--RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma-----~~~~~T--~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) |. ++...+ .-...++|+.+..=+.+...+.+.+.+ +-.....+|..+++|.+..- +.+..+.| T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~---------l~~~~~~~~~~~~~p~~~~~-~~a~~v~E 87 (324) T protein:vir:96 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWADK-PGAYWVGE 87 (324) T ss_pred hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEecC-cceeEecC Confidence 10 011111 123457777776555555555554422 11223345777899988643 56666777 Q ss_pred CCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFAT 153 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~ 153 (367) +. .++..+++-.+.....++.+.-..++++...-+..|..+.+.+++++.+.+..++.+|. |.- .+.. . 
T Consensus 88 g~---~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G~g----~~~~-~ 156 (324) T protein:vir:96 88 GQ---KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQG----NNPF-G 156 (324) T ss_pred Cc---cccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---cCC----CCCc-C Confidence 64 35566777777777778888888999987777777888999999999888887776552 210 0000 0 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ 233 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~ 233 (367) ..+ .+.. ...........+++.|.++..++........+++||++.+..|++..--+.-..-. . T Consensus 157 ~gi-------------~~~~--~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~-~ 220 (324) T protein:vir:96 157 KSI-------------AQSI--EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY-D 220 (324) T ss_pred ccc-------------cccc--cccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeec-C Confidence 000 0000 00001112346799999999988877777788999999999998763211111100 1 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcC------------CceeEEEE Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------------GSGLEYIL 300 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~------------~~g~~~l~ 300 (367) ..-++++|++|+++.+++... + . .+||. .-+.++... ...+++.++..... ...+..+. 
T Consensus 221 ~~~~~l~G~PV~~~~~~~~~~----~--~-~~~gd~~~~~~g~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:96 221 RNSDSLDGLPVVNLKSSNLKR----G--E-LITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred CCCCcccceeeEeeCCCCCCc----c--e-EEEEecceEEEEEec-CcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 234679999999988776431 1 1 22321 122233322 22344444432110 01122222 Q ss_pred EccEE---EeeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 301 ERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 301 ~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) ...++ +.||..|.--.. ...+ ....|... T Consensus 293 ~~~r~d~~v~~~~A~~~l~~---------a~~~---~~~~~~~~ 324 (324) T protein:vir:96 293 ATMHVALHIADDKAFAKLVP---------ADKR---TDSVPGEV 324 (324) T ss_pred EEEEEccEEecccceEEEec---------cccc---CCCCCCCC Confidence 22222 223333321110 0000 00122222 No 79 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.43 E-value=5.5e-08 Score=60.37 Aligned_cols=284 Identities=10% Similarity=0.018 Sum_probs=148.3 Q ss_pred CC-----Cccccc--cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MP-----DFNNQV--RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma-----~~~~~T--~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) |. 
++...+ .-...++|+.+..=+.+...+.+.+.+ +-.....+|..+++|.+..- +.+..+.| T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~---------l~~~~~~~~~~~~~p~~~~~-~~a~~v~E 87 (324) T protein:vir:78 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWADK-PGAYWVGE 87 (324) T ss_pred hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEecC-cceeEecC Confidence 10 011111 123457777776555555555554422 11223345777899988643 56666777 Q ss_pred CCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFAT 153 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~ 153 (367) +. .++..+++-.+.....++.+.-..++++...-+..|..+.+.+++++.+.+..++.+|. |.- .+.. . T Consensus 88 g~---~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G~g----~~~~-~ 156 (324) T protein:vir:78 88 GQ---KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQG----NNPF-G 156 (324) T ss_pred Cc---cccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---cCC----CCCc-C Confidence 64 35566777777777778888888999987777777888999999999888887776552 210 0000 0 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ 233 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~ 233 (367) ..+ .+.. ...........+++.|.++..++........+++||++.+..|++..--+.-..-. . 
T Consensus 157 ~gi-------------~~~~--~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~-~ 220 (324) T protein:vir:78 157 KSI-------------AQSI--EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY-D 220 (324) T ss_pred ccc-------------cccc--cccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeec-C Confidence 000 0000 00001112346799999999988877777788999999999998763211111100 1 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcC------------CceeEEEE Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------------GSGLEYIL 300 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~------------~~g~~~l~ 300 (367) ..-++++|++|+++.+++... + . .+||. .-+.++... ...+++.++..... ...+..+. T Consensus 221 ~~~~~l~G~PV~~~~~~~~~~----~--~-~~~gd~~~~~~g~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:78 221 RNSDSLDGLPVVNLKSSNLKR----G--E-LITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred CCCCcccceeeEeeCCCCCCc----c--e-EEEEecceEEEEEec-CcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 234679999999988776431 1 1 22321 122233322 22344444432110 01122222 Q ss_pred EccEE---EeeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 301 ERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 301 ~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) ...++ +.||..|.--.. ...+ ....|... 
T Consensus 293 ~~~r~d~~v~~~~A~~~l~~---------a~~~---~~~~~~~~ 324 (324) T protein:vir:78 293 ATMHVALHIADDKAFAKLVP---------ADKR---TDSVPGEV 324 (324) T ss_pred EEEEEccEEecccceEEEec---------cccc---CCCCCCCC Confidence 22222 223333321110 0000 00122222 No 80 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.42 E-value=2e-08 Score=62.79 Aligned_cols=291 Identities=12% Similarity=-0.016 Sum_probs=131.3 Q ss_pred CCCccccccc--eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc- Q lcl|Aclame:pro 1 MPDFNNQVRL--VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN- 77 (367) Q Consensus 1 Ma~~~~~T~l--~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~- 77 (367) .+.+...|.- ..++.||.+..-+.+.+.+.+.+.+ ..... .+..++..+++|....-...+.-+.|+... T Consensus 153 ~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~------~~~~~-~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~ 225 (477) T protein:vir:84 153 EYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYAN------LCPTE-PLPGGTSSINIPKILTGTSTAIQAADNAALT 225 (477) T ss_pred hhccccccCCCcceeeccchhHHHHHHHhhhcchHHH------hhcee-eecCCcceeEEEEEecCcceeeeeccCcccc Confidence 1111111111 1256777654434443333332211 00000 112345567888643111112223333211 Q ss_pred -ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH------HHHHHHhhhhhhh Q lcl|Aclame:pro 78 -VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA------MAVGVYKSNLAGN 150 (367) Q Consensus 78 -~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla------~l~Gvf~~~~a~~ 150 (367) +..+..+++-+......++.+.-..+++....-+.-|-...|.++++..+.+..+..+|. -..|+++..... 
T Consensus 226 ~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~- 304 (477) T protein:vir:84 226 APSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGIT- 304 (477) T ss_pred cccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccc- Confidence 112222333333344455555556677766666666777899999998888877765552 122232211110 Q ss_pred hhhhhhhhhhhhhhhcchhhcceeecCcccc-hhhcccHHHHHHHHHHhccc-cCceeEEEEccHHHHHHHhcchhh--h Q lcl|Aclame:pro 151 FATIKTRGRVPAEVLGTAGDMVIDISGQTNP-ADAVFNREAFVDAAFTMGDH-VGSIAAIAVHSMVYKRMTNNDEIE--F 226 (367) Q Consensus 151 ~~~~~~~~~~~a~~~~~~~~~v~disa~t~~-a~~~~s~~~l~~A~~~~GD~-~~~l~~~vmhS~v~~~L~k~~li~--~ 226 (367) .........+ +.....+..+.++...+... ...-.+++||+..+..|++..--+ + T Consensus 305 ---------------------~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~ 363 (477) T protein:vir:84 305 ---------------------QVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRP 363 (477) T ss_pred ---------------------cccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCe Confidence 0111110000 01112345566766654333 334568999999999888764221 1 Q ss_pred ccccc--------------ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhc Q lcl|Aclame:pro 227 IPDSK--------------GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRG 291 (367) Q Consensus 227 ~~~~~--------------g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~ 291 (367) +-.++ .....++++|++|++++.||... +..+.-..++||.-+ +.+... .+++++++... 
T Consensus 364 l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~-~~~~d~~~i~~gd~~~~~i~~~----~~~~~~~~~~~ 438 (477) T protein:vir:84 364 LIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTL-GTGTDQDVIHVLRASDLALFES----SVRMRALQETR 438 (477) T ss_pred eeecCcccccccccccccccccccchhcccceEecCcccccc-cccCCcceEEEEEeceEEEEee----ceeEEeccccc Confidence 11110 01124689999999999999632 222333345554332 222221 23444444443 Q ss_pred CCceeEEEEEccEEE-----eeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 292 NGSGLEYILERKEWI-----VHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 292 ~~~g~~~l~~r~~~~-----~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) .+.++..+ ..+.|. -||.-|. .- +.++...||.+ T Consensus 439 ~~~~~~~~-~v~~~~~~~~~r~~~afv--~~-------------t~~~~~~~~~~ 477 (477) T protein:vir:84 439 AENLSVLL-QVYGYLAFTAARFPQSVV--EI-------------GGTALTAPTFA 477 (477) T ss_pred cccceeee-eehhhhhhhhhccccceE--Ee-------------ecccccccccC Confidence 32333222 112221 1555444 11 12355668877 No 81 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.42 E-value=6.5e-08 Score=59.97 Aligned_cols=298 Identities=9% Similarity=-0.022 Sum_probs=137.0 Q ss_pred CCC-c-cccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCc-ccccCCCCc Q lcl|Aclame:pro 1 MPD-F-NNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL-EPNYGSDNP 76 (367) Q Consensus 1 Ma~-~-~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~-~~~~~~~~~ 76 (367) |.. . ...| .-.-..+|+-+..=+.....+.+.+. .. -+....++..+.+|.+..-... ...+.|+. 
T Consensus 109 ~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~------~l---~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~- 178 (421) T protein:vir:13 109 LSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLK------EH---CHVIPVNRNAGKMPVRAGASVDKLANLAKDT- 178 (421) T ss_pred hhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhh------hh---ceeeeccCCceEEEEeecCCccceeeccccc- Confidence 100 0 0001 11223445544333333322222221 11 1111234556788877654322 23344443 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) .++..+++-++.....++.+.-+.++++...-+..|....|.+++++...+..+..++..++|+++.. T Consensus 179 --~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~~~---------- 246 (421) T protein:vir:13 179 --ELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLAEE---------- 246 (421) T ss_pred --cccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhccccc---------- Confidence 34444555555555666677777888877665666677889999998777777766666666554211 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccccccc Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDSKGQL 234 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~~g~~ 234 (367) ...+++.+.++...+-.....-.+++||+..+..|++..--+ |+-...... 
T Consensus 247 ---------------------------~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~ 299 (421) T protein:vir:13 247 ---------------------------TINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDG 299 (421) T ss_pred ---------------------------cccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCC Confidence 123467788888777665556679999999999998753111 111111112 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHP 309 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp 309 (367) .-++++|++|+++|.+|.... +.+ .++||.= ++.+.... ...++..++. .-..++..+....+| ++|| T Consensus 300 ~~~tl~G~pV~~~~~~~~~~~---~~~-~~~~gd~~~~~~~~~~~-~~~v~~~~~~--~f~~~~~~~r~~~r~d~~~~~~ 372 (421) T protein:vir:13 300 GDLVFKGRPVIELEESIFDVG---DET-KFIVSDFKTLIKFMDRK-QYLIDQSKEA--GYTKNETIARIIERFDVNSPLD 372 (421) T ss_pred CCceecceeeEEeccccccCC---Cce-EEEEEeccccEEEEEec-ceEEEeeccc--ccccCeeEEEEEeeecceeecc Confidence 235799999999999986432 222 3445431 23232222 2223333332 212333344443333 2222 Q ss_pred eee-eecccccc-cccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 310 GGF-NWLDADVT-IPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 310 ~G~-s~~~~~~~-~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) .-+ .|...+.. 
....+....++.+.+.+|.-- -|.|..--=++.| T Consensus 373 ~a~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~ 419 (421) T protein:vir:13 373 KSSDAEKIRKFGVIVKLQEVLKSSPRSGKNKNES-------------KEEIKEEGEATQQ 419 (421) T ss_pred hhhheeeecccceeeccccccCCCCcCCCCcccc-------------chheeeccccccC Confidence 221 11111000 001111122222223332210 0000000001111 No 82 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.42 E-value=4.3e-08 Score=60.96 Aligned_cols=275 Identities=15% Similarity=0.054 Sum_probs=138.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||--. ..+|.||+..+ +.+.+.+.+.+.+ +......++..+++|.+..- ..+.-+.|+. .+ T Consensus 1 ma~~g-----G~lip~~~~~~-ii~~~~~~s~i~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~---~~ 61 (298) T protein:vir:94 1 MVLNK-----GTLFDPELVTD-LISKVAGKSSIAR---------LSAQKPIPFNGEKVFTFTMD-SEIDVVAESG---KK 61 (298) T ss_pred Ceecc-----ccccChhHHHH-HHHHHHhhchhhh---------hcceeeccCCceEEEEEecC-cceEEeeCCc---cc Confidence 88533 33455554444 4444444443321 11112235567889988643 3455566664 34 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhc---ccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG---SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g---~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) +..+.+-.+.....++.+.-+.++++....+. .+-++.+.+++++.+.+..+..+| .|.-.......... ... 
T Consensus 62 ~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l---~G~~~~~g~~~~~~-~~~ 137 (298) T protein:vir:94 62 THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAF---HGVNPRLGTASAVI-GTN 137 (298) T ss_pred cccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhh---cccccCCCcccccc-ccc Confidence 44555555555666677777888888754433 345677888898888877766555 33211111110000 000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc---- Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ---- 233 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~---- 233 (367) .....+..... + .......++.+.++..++-.......+++||++.+..|++.. +++|. T Consensus 138 ---------~~~~~~~~~~~-~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk------d~~G~~l~~ 200 (298) T protein:vir:94 138 ---------HFDSKVTQKVE-A-PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK------DLQGNALFP 200 (298) T ss_pred ---------ccccccccccc-c-ccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhh------ccCCCeeec Confidence 00000000000 0 011123356788888888766667788999999999998853 22222 Q ss_pred -----ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcC------CceeEEEE Q lcl|Aclame:pro 234 -----LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGN------GSGLEYIL 300 (367) Q Consensus 234 -----~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~------~~g~~~l~ 300 (367) ..-.+++|++|++++.+|-... ..+. ..+||. .++.|+... ...+++.+.....+ ..++-.+. 
T Consensus 201 ~~~~~~~~~tl~G~PV~~~~~v~~~~~--~~~~-~~~~Gdfs~~~~~~~~~-~~~~~~~~~~~~d~~~~~~f~~~~v~~r 276 (298) T protein:vir:94 201 ELKWGATPDTINGLPVDVNKTVSDMSL--TQRD-RAIIGDFANGFKWGYAK-EVPLEVIQYGDPDNSGLDLKGYNQVYIR 276 (298) T ss_pred CcccCCCCceecceeeEEecccccccC--CCcc-EEEEeeccceEEEEEec-CceEEEeecCCCcCcchhhhhcCcEEEE Confidence 1235799999999999985432 1222 355553 234444332 23344444322210 01111221 Q ss_pred EccE---EEeeeeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 301 ERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 301 ~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) ...+ .+.||..|..-+. .| T Consensus 277 ~~~r~~~~~~~~~a~~~l~~--------------------~t 298 (298) T protein:vir:94 277 AELFLGWGILDATKFARVTE--------------------AN 298 (298) T ss_pred EEEEeccEeecccceEEEEe--------------------cC Confidence 2122 2345555543211 11 No 83 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.41 E-value=6.2e-08 Score=60.10 Aligned_cols=274 Identities=10% Similarity=-0.025 Sum_probs=133.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccC--CCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDL--DSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l--~g~~~~~~~~~~~~ 78 (367) |......+.-.-.++|+.+..-+.....+.+.|.+- ......++....+|+|..- .+.+..+.|+.... 
T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 175 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESL---------ANVENVTTSHGSRVYEKLADITPLKDLDDESALIG 175 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhh---------cceeeccCCcceEEEEeeccCCccccccccccccc Confidence 332111111123567777766666655554444221 1111123445566666533 23444555554321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) . .++.+-.+.....++.+.-..++++...-+..|-...+.++|++.+.+..++.++.-. .. . T Consensus 176 ~--~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~-------g~--~------- 237 (395) T protein:vir:38 176 D--NDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVM-------GK--A------- 237 (395) T ss_pred c--ccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------cc--c------- Confidence 1 1223333333445555666667776665555667889999999888877666554310 00 0 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHH-HhccccCceeEEEEccHHHHHHHhcchhhhccccccc---- Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF-TMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ---- 233 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~-~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~---- 233 (367) .......+++.+.++.. .+......-.+++||+..+..|++.. +++|. 
T Consensus 238 ---------------------~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lk------d~~G~~l~~ 290 (395) T protein:vir:38 238 ---------------------PKKPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVK------DADGRYLMQ 290 (395) T ss_pred ---------------------ccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh------ccCCceeec Confidence 00011234667777664 33333444568999999999998753 33332 Q ss_pred -----ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEE--c-c Q lcl|Aclame:pro 234 -----LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILE--R-K 303 (367) Q Consensus 234 -----~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~--r-~ 303 (367) ..-.+++|++|+++|.++....+ +.+ +++||. .++.+.... ...+++.+.....-..++..+.. | . T Consensus 291 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~--~~~-~i~~gd~~~~~~i~~~~-~~~i~~~~~~~~~~~~~~~~~r~~~r~d 366 (395) T protein:vir:38 291 PDVTSPDKYLIDGKPVIRIADKWLPDVS--GSH-PLYFGDLKQGITLFDRQ-QMQIDTTNVGAGSFEHDTTKLRFIDRFD 366 (395) T ss_pred cCcCCCCcceeccceeEEecccccCcCC--Ccc-eEEEEeccccEEEEEec-ceEEEEeccccchhhcCceEEEEEEeec Confidence 22357899999999988765432 222 355653 233333222 12344444332211122223322 2 2 Q ss_pred EEEeeeeeeeecccccccccccccccccccccC Q lcl|Aclame:pro 304 EWIVHPGGFNWLDADVTIPDNTGSPSGITSGPP 336 (367) Q Consensus 304 ~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~ 336 (367) ..++||..|.--.-. +..+.+...+. 
.|+ T Consensus 367 ~~~~~~~a~~~~~~~---~~~~~~~~~~~-~~~ 395 (395) T protein:vir:38 367 VQLIDDGAFAAASFK---TVANQAQGTAG-TGK 395 (395) T ss_pred cEEecccceEEEEee---cccCCCCCccC-CCC Confidence 234567776654321 12222222212 222 No 84 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=98.40 E-value=2.5e-07 Score=56.74 Aligned_cols=272 Identities=9% Similarity=0.007 Sum_probs=130.4 Q ss_pred CCCcccccccee--ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d--~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) ....-..+..++ .++|+.+..-+.....+.+.+.+. .+....++...++|....-++....+.|+.... T Consensus 105 ~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~ 175 (389) T protein:vir:10 105 VIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTL---------VTKTPVTTPKGTYPILKRATDRFSSVAELAENP 175 (389) T ss_pred hhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhh---------cceeeccCCeeEEEEEecCCCcccccccccccc Confidence 111111122233 566777666555555554444221 111123466788998876655555666654221 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) ..++.+-++.....++.+.-+.++++...-+..|-...+.++|++...+..+..++..+.+. 
T Consensus 176 --~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~---------------- 237 (389) T protein:vir:10 176 --KLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSF---------------- 237 (389) T ss_pred --ccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccc---------------- Confidence 12344444555566677777788887766666666778888888766666555544332110 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchh--hhccccc---- Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK---- 231 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~---- 231 (367) .+.......+++.+.++... +-... -.+++||+..+..|++..-- .|+-... T Consensus 238 -------------------~~~~~~~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~ 296 (389) T protein:vir:10 238 -------------------TAKKTTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSI 296 (389) T ss_pred -------------------ccccccccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccc Confidence 00011223456677766543 21111 25799999999999976421 1211110 Q ss_pred -ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEe Q lcl|Aclame:pro 232 -GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIV 307 (367) Q Consensus 232 -g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~ 307 (367) ....-.+++|++|++.+++.....+ +. .+++||. 
-++.+...+ ...+++.++ .....+.-.+ .| ..-++ T Consensus 297 ~~~~~~~~l~G~pV~~~~~~~~~~~~--~~-~~~~~gd~~~~~~~~~~~-~~~i~~~~~--~~~~~~~~~~-~r~d~~~~ 369 (389) T protein:vir:10 297 TDGTAKGTILGVPVYVVGDTLLGSLA--GD-QKAFVGDLKRGVLFTDRQ-QVTLAWEDS--KIYGKYLGAA-FRFGVQKA 369 (389) T ss_pred cccccccccccceeEEecccccCCCC--Cc-eEEEEeeccccEEEEeec-ceEEEeecc--ccccceEEEE-EEeccEEe Confidence 0112257999999876554322211 22 2456653 123333222 122333332 2211111111 12 22345 Q ss_pred eeeeeeecccccccccccccccccccccCCCCh Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSPSGITSGPPAITL 340 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~ 340 (367) ||..|.+-.-+ ...+..|+. T Consensus 370 ~~~a~~~~~~~-------------~~~~~~~~~ 389 (389) T protein:vir:10 370 DSKAGYFVTNT-------------DVPGSALGK 389 (389) T ss_pred cccceEEEEee-------------ccCCCCCCC Confidence 67766654211 112233333 No 85 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.35 E-value=1.6e-07 Score=57.77 Aligned_cols=278 Identities=11% Similarity=0.033 Sum_probs=135.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCc--eEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGR--LINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~--~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) |.. ..+.-.-.++|+.+...+.....+.+.|.+ . -+....++. .+.+|.+..-...+..+.|+.... 
T Consensus 116 ~~~--~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~ 184 (408) T protein:vir:74 116 ETS--GSDSAAGLTIPQDIRTMINTLVRQYDSLQQ------Y---VRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIP 184 (408) T ss_pred hcc--cccCCCceeechhHhhHHHHHHhhhcchhh------h---cceeeccCCcceEEEEeecCCcccccccccccccc Confidence 211 001112346788777777766666555422 1 111112233 344555544433444555554321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) . .++.+-++.....++.+....++++...-+.-|....+.++|++...+..+..+|. | ++. T Consensus 185 ~--~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~---G--------~G~------ 245 (408) T protein:vir:74 185 D--LDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIA---A--------MGT------ 245 (408) T ss_pred c--ccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh---c--------ccc------ Confidence 1 23345555556667777778888887776777788899999998887777665442 2 000 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchh--hhccccc-ccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK-GQL 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~-g~~ 234 (367) . .......+++.+.++... +-.....-.+++||+..+..|++..-- .++-..+ ... 
T Consensus 246 ------------------~--~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~ 305 (408) T protein:vir:74 246 ------------------V--PKKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKP 305 (408) T ss_pred ------------------c--ccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCC Confidence 0 001123567788877643 322222345799999999999875311 1111110 111 Q ss_pred cchhhcCcEEEEeCC--CcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhc--CCceeEEEEEc---cEE Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDG--MPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRG--NGSGLEYILER---KEW 305 (367) Q Consensus 235 ~i~t~~G~~VivdD~--~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~--~~~g~~~l~~r---~~~ 305 (367) .-.+++|++|++.+. ||..+ .+++ +++||. .++.+.... .+++++++... ...++.++... ... T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~---~~~~-~i~~gd~~~~~~~~~~~---~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) T protein:vir:74 306 NSYLIKGKQVIVVADRWLPNSG---STVY-PLYYGDMSQAITLFDRE---NMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) T ss_pred CCceecceeeEEecCccccccc---CCcc-eEEEEehhccEEEEEec---ceEEEEeccccchhhcceeeEEEEEeeCcE Confidence 225799999998764 55432 2332 345553 223332222 23344443221 11223333222 223 Q ss_pred EeeeeeeeecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 306 IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 306 ~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) +++|..|.+-.-.-+ ....+.+|+.+-=+- T Consensus 379 ~~~~~a~~~~~~~~~----------~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 379 ATDSEALVAGSFTAI----------ADQVGNFKTTTSTAV 408 (408) T ss_pred EecccceEEEEeecc----------cCCCCCCCCCccccC Confidence 557777766432111 111122222221111 No 86 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.34 E-value=3.8e-08 Score=61.25 Aligned_cols=287 Identities=11% Similarity=0.075 Sum_probs=149.8 Q ss_pred 
CCCcccc-------cccee--ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCccccc Q lcl|Aclame:pro 1 MPDFNNQ-------VRLVD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNY 71 (367) Q Consensus 1 Ma~~~~~-------T~l~d--~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~ 71 (367) |-.+|.- .-=.| +|. |+|...|.....+++.| .+.-...+ + .+|+++.+|..+... ...+ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~------~~~~~~r~-i-~~G~tv~i~~ig~~~--~~~~ 75 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIF------KGLVRSYD-L-RGGKSKQFMFTGKLS--AGYH 75 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhh------hhcccccc-c-cccceEEEEecccee--Eeee Confidence 3322221 00022 444 66666666665555555 22222221 1 379999999998662 2333 Q ss_pred CCCCcccccccc-ccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 72 GSDNPNVEAPID-GLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAG 149 (367) Q Consensus 72 ~~~~~~~~~t~~-kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~ 149 (367) ..+++ +.+. .+.+.+.+-+|= ..--++.+.|+-..-+-.|.+.+++++.+..-+++.+..++..+...-...... T Consensus 76 ~~g~~---l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~ 152 (332) T protein:vir:78 76 TPGTP---IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPV 152 (332) T ss_pred cCCCC---CCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcc Confidence 33332 2222 233333222222 234456788999888889999999999999999999998888775432111000 Q ss_pred hhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHhc---chh Q lcl|Aclame:pro 150 NFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNN---DEI 224 (367) Q Consensus 150 ~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k~---~li 224 (367) . ....+.+ +-+.+.. ...+..-++.|.+|..+|-+.. ..=..+++.|.+|..|.+. 
+++ T Consensus 153 ~--------------~~~g~~~-~~~~~~~-~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~ 216 (332) T protein:vir:78 153 T--------------GEPGGFH-VNIGAGN-TNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNIL 216 (332) T ss_pred c--------------ccccccc-cccCCcc-ccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceee Confidence 0 0001111 1112111 1111123566778877775432 2236788999999999873 344 Q ss_pred hhcc-ccccc----ccchhhcCcEEEEeCCCcccCC---------CCC-------ceEEEEEEecceeeeeccCCCcc-- Q lcl|Aclame:pro 225 EFIP-DSKGQ----LTIPTYMGKVVIVDDGMPVFGT---------GAD-------KTYLSILFGGAAFGYADGAPQVP-- 281 (367) Q Consensus 225 ~~~~-~~~g~----~~i~t~~G~~VivdD~~pv~~t---------~~~-------~~yttyl~~~GAi~~~~~~~~~~-- 281 (367) +... .+++. ..|+.++|.+|+.+..+|..+. +.. .+..+++|-+-|++.....+..- T Consensus 217 n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~ 296 (332) T protein:vir:78 217 NREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQT 296 (332) T ss_pred eeeccccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhh Confidence 3211 12221 2378999999999999996531 111 13457888888888766554221 Q ss_pred eeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCC Q lcl|Aclame:pro 282 VAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANP 346 (367) Q Consensus 282 ~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~ 346 (367) .|-+|++... 
+.....++.=-+.++.|.+.-=- .++ T Consensus 297 t~~~~~~~~~-~d~i~~~~~~G~~v~rPe~~v~l----------------------------~~a 332 (332) T protein:vir:78 297 TSGDFNVQYQ-GDLIVGKLAMGCGSLRTSVAGSF----------------------------QAA 332 (332) T ss_pred hhcccchhhh-HhhhhhhhhhcCceecccceEEE----------------------------eeC Confidence 2334554442 11111122212223333333211 111 No 87 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.32 E-value=6.6e-08 Score=59.97 Aligned_cols=286 Identities=10% Similarity=0.020 Sum_probs=155.7 Q ss_pred CCCccccccceeccchHHHHHHHhh-hhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC--C--ccccc-CCC Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAI-DRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD--S--LEPNY-GSD 74 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~-~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~--g--~~~~~-~~~ 74 (367) .|... |++...|+ +.|.+-+.. -..+.++|... +-.+ ....++.+++.|--.... + ..... 
.++ T Consensus 10 ~~~Ms--~~i~~~fv-~qy~~~v~~~~qq~~s~L~~t-V~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 79 (322) T protein:vir:10 10 LPLIA--GDIDQAFV-QTYETTLRILSQQKSAKLKQY-CQHK------NESSESHNWETLASMDPDAVKRKRSRQQSADG 79 (322) T ss_pred eeeee--chhhhHHH-HHHHHHHHHHHHHhhhhhhcc-cccc------cccccccceeecccccccccccccccccccCc Confidence 03322 45555666 555444432 23334455221 1101 112355565555433220 1 11111 111 Q ss_pred CccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) + ..+++..+..+...+...-..-++-+.|+-.+-...||....+++.+..++|+.++.+++.+.|.-.....+ T Consensus 80 ~--~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~g----- 152 (322) T protein:vir:10 80 T--YPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTG----- 152 (322) T ss_pred c--cCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccc----- Confidence 1 012333344444444444444566777888888889999999999999999999998887776542211000 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccc--cCc-eeEEEEccHHHHHHHhcc-hhhhcccc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH--VGS-IAAIAVHSMVYKRMTNND-EIEFIPDS 230 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~--~~~-l~~~vmhS~v~~~L~k~~-li~~~~~~ 230 (367) ....+--+...+++...++...|.+|.++|..+ .+. -..+++.|..+..|.+.. +.+.-... 
T Consensus 153 --------------t~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~ 218 (322) T protein:vir:10 153 --------------QPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTS 218 (322) T ss_pred --------------cccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhccc Confidence 000111111122233457889999999998754 322 468999999999988763 33221111 Q ss_pred ----cccccchhhcCcEEEEeCCCcccCC----------CCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCcee Q lcl|Aclame:pro 231 ----KGQLTIPTYMGKVVIVDDGMPVFGT----------GADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGL 296 (367) Q Consensus 231 ----~g~~~i~t~~G~~VivdD~~pv~~t----------~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~ 296 (367) ..+-.+++|+|..|+++..+|..++ ....+..||.+-..||+++.+.. ...++..++.+....-. T Consensus 219 ~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~d-v~~~i~~~~~~~~a~~I 297 (322) T protein:vir:10 219 AMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKD-IWTKVAEDPSASFAWRI 297 (322) T ss_pred chhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeee-eeEEeeccCCcchhhhh Confidence 0122488999999999999996543 23456789999999999998775 33455555555421111 Q ss_pred EEEEEccEEEeeeee---eeecccc Q lcl|Aclame:pro 297 EYILERKEWIVHPGG---FNWLDAD 318 (367) Q Consensus 297 ~~l~~r~~~~~hp~G---~s~~~~~ 318 (367) ..+.+.-.-.+-|.| +..+++- T Consensus 298 ~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 298 YSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hhhhhhCceEeccCcEEEEEEeccC Confidence 222223334444444 3333221 No 88 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.32 E-value=1.9e-07 Score=57.45 Aligned_cols=308 Identities=12% Similarity=-0.002 Sum_probs=153.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 
MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||--++ .++-||++..-+.+.+.+.+-|. .++-.+.+ .. +..-|++|+||-..... .. ++. .+ T Consensus 1 m~~~~N-----~~ltp~iia~~~l~~l~~~lV~~--~lv~r~y~-~e-~~~~GDTV~I~vp~~~~--v~---dg~---~~ 63 (418) T protein:vir:10 1 MAVQDN-----NLLTDDVIAKEALRLLKNNLVMA--KCVYRNYE-KT-FGKVGDTIRLKLPYRVK--SA---SGR---TL 63 (418) T ss_pred CCcccc-----ccccHHHHHHHHHHHHHHhccch--hhhcCCCc-hH-HhhCCCEEEEeeCCcee--ec---ccC---Cc Confidence 995322 34669999888888777766552 23333221 12 23459999999866442 11 222 25 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +++.++..+...++ +....++.++|+-..+.-.|.+.++.++-+...+++.+..|++.+++.-.. T Consensus 64 ~~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~-------------- 129 (418) T protein:vir:10 64 VKQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHS-------------- 129 (418) T ss_pred cccccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------- Confidence 56666666555555 556778899999988888999999988888888888888888877653211 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc---CceeEEEEccHHHHHHHhcchhhhcccccc---- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV---GSIAAIAVHSMVYKRMTNNDEIEFIPDSKG---- 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~---~~l~~~vmhS~v~~~L~k~~li~~~~~~~g---- 232 (367) .+.++.. .-.++.+.+|..+|.++. +.-+.+++.|..|..|.+.....+.+.... 
T Consensus 130 ----------------~gt~gt~--~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr 191 (418) T protein:vir:10 130 ----------------SGTPGVR--PGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYK 191 (418) T ss_pred ----------------cccCCcC--cchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhh Confidence 0001111 123788999999987653 235789999999999987754433332111 Q ss_pred cccchhhcCcEEEEeCCCcccCCCC-CceEEEEEEecceeeeeccCCCcceeeeeehhh--cCC-ceeEEEEEccEEEee Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGTGA-DKTYLSILFGGAAFGYADGAPQVPVAVGRRELR--GNG-SGLEYILERKEWIVH 308 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t~~-~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~--~~~-~g~~~l~~r~~~~~h 308 (367) +-.|+.+.|+.|+.+..+|....+. .+.. +..+.+.. ...+.++.+... +.. -| |.+-=--.+.+| T Consensus 192 ~G~IG~i~GF~V~~S~nip~~tag~~~~t~--~v~ga~~~-------~~~~~~~~~t~s~~g~l~~G-d~~ti~gv~~v~ 261 (418) T protein:vir:10 192 MGYRGNVAAYEVYESQNLPKHTVGDHGGTP--LVNGTVVN-------GDTVGFDGGTASTTGFLKAG-DVITFGGVFGVN 261 (418) T ss_pred eeeeeeeeceEEEEecCCCcccccccccce--eeeccccc-------ceeEEEeecceeeccceeec-cEEEECceeecc Confidence 2357899999999999999654332 2222 22222211 111112211110 000 01 110000111111 Q ss_pred ee-------eeeeccccccc------cccccccccc-ccccCCCChHHhcCCccce------------------------ Q lcl|Aclame:pro 309 PG-------GFNWLDADVTI------PDNTGSPSGI-TSGPPAITLANLANPDNWE------------------------ 350 (367) Q Consensus 309 p~-------G~s~~~~~~~~------~~~~~~~~~~-~~~~~sPt~a~L~~~~NW~------------------------ 350 (367) +. .-.|.-..... .+.++++... ...+..|+..+.-...|.. 
T Consensus 262 ~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~ 341 (418) T protein:vir:10 262 PQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQ 341 (418) T ss_pred cccccccccceEEEEEeeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceee Confidence 11 11121100000 0000010000 0001111112211111111 Q ss_pred -eeecccccceEEEEe----------------cC Q lcl|Aclame:pro 351 -RVTYRKNVPMAFLVT----------------KG 367 (367) Q Consensus 351 -~v~d~K~i~iv~~~t----------------~g 367 (367) .+|.+..+.++-..= +| T Consensus 342 nl~f~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G 375 (418) T protein:vir:10 342 NYLFHRDAIALAMIDLELPQSAVIKSRAADPETG 375 (418) T ss_pred eeeeecceEEEEEeeccCCCCCCcceEEEeccCC Confidence 222222332222110 11 No 89 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.31 E-value=4.5e-07 Score=55.36 Aligned_cols=274 Identities=10% Similarity=0.031 Sum_probs=123.8 Q ss_pred CCC-cccccccee--ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MPD-FNNQVRLVD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~-~~~~T~l~d--~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) ..+ ....+..++ +.+|+.+..-+...+.+.+.|.+ +-+....++...++|....-++....+.|+... 
T Consensus 106 ~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 176 (394) T protein:vir:10 106 VIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLST---------LVTKTPVTTPKGTYPILKRATDRFSSVAELAEN 176 (394) T ss_pred hhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhh---------hceeeeccCCceEEEEEecCCCccccccccccc Confidence 000 000011121 44555543333333333222211 111112456778888877655555666665432 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .. .+..+-++.....++.+.-..++++...-+.-|-...+.+++++-..+..++.++..+. . T Consensus 177 ~~--~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g----~------------ 238 (394) T protein:vir:10 177 PA--LAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQ----S------------ 238 (394) T ss_pred cc--cccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc----c------------ Confidence 11 22333344444555666666777776555555666788888886666665554443221 0 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchhh--hcccc--cc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDS--KG 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~--~g 232 (367) +.+....+..+++.+.++... +.... -.+++||+..+..|++..--+ |+-.. .. 
T Consensus 239 -------------------~~~~~~~~~~~~d~l~~~~~~~~~~~~--~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~ 297 (394) T protein:vir:10 239 -------------------FTAKATTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDS 297 (394) T ss_pred -------------------cccccccccccHHHHHHHHHhhhhhhc--cCEEEecHHHHHHHHHhhccCCCeeeeccccc Confidence 000011123456677776653 33222 258999999999999763111 11000 00 Q ss_pred ---cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec---ceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEE Q lcl|Aclame:pro 233 ---QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG---AAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEW 305 (367) Q Consensus 233 ---~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~---GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~ 305 (367) ...-.+++|++|++.|++..... .+.. .++||. +.+ +.... ...+++.++.. ...+. ..+.| ..- T Consensus 298 ~~~~~~~~~L~G~PV~~~~~~~~~~~--~~~~-~i~~gd~s~~~~-~~~~~-~~~v~~~~~~~--~~~~~-~~~~r~d~~ 369 (394) T protein:vir:10 298 ITDGTAKGTVLGVPVYVVGDALLGSA--AGDQ-KAFVGDLKRGVL-FADRQ-QVTLAWEDSKI--YGRYL-GAAFRFGVK 369 (394) T ss_pred cccCCcccccccceeEEecccccCCC--CCce-EEEEeeccccEE-EEeec-ceEEEEecccc--cceeE-EEEEEeccE Confidence 01124789999988766543321 1221 344442 222 22211 12233333322 11222 12222 234 Q ss_pred EeeeeeeeecccccccccccccccccccccCCCChHHhcCCc Q lcl|Aclame:pro 306 IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPD 347 (367) Q Consensus 306 ~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~ 347 (367) ++||..|.|-.-+-++. .+..+ +|. 
T Consensus 370 ~~~~~ai~~~~~~~~~~-~~~~~----------------~~~ 394 (394) T protein:vir:10 370 QADSNAGYFVTNTDAAS-GSTSG----------------TGK 394 (394) T ss_pred EeccccEEEEEeecccC-CCCCC----------------CCC Confidence 66888888753321110 01011 111 No 90 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.29 E-value=5.3e-08 Score=60.47 Aligned_cols=295 Identities=12% Similarity=0.100 Sum_probs=145.5 Q ss_pred CCCc------cccccc--------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC Q lcl|Aclame:pro 1 MPDF------NNQVRL--------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS 66 (367) Q Consensus 1 Ma~~------~~~T~l--------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g 66 (367) ||.. |..|.- -+++. |+|...|.....+++.| .+.-.+.+ + .+|+++.+|+.+... T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~------~~~~~~r~-i-~~g~s~~~~~iG~~~- 70 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVT------TSRHMVRS-I-SSGKSAQFPVLGRTQ- 70 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhh------cccceeee-e-cccceEEEEeeceeE- Confidence 6632 222211 13455 77777777766666655 33322222 2 479999999887553 Q ss_pred cccccCCCCccccccccccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhh Q lcl|Aclame:pro 67 LEPNYGSDNPNVEAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKS 145 (367) Q Consensus 67 ~~~~~~~~~~~~~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~ 145 (367) ...+..+++.+. ++..+.+.+.+-+|= ..--.+.+.|+-..-+-.|.+.+++++.+..-++..++.++..|.+.-+. 
T Consensus 71 -~~~~~~G~~l~~-t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~ 148 (344) T protein:vir:10 71 -AAYLAPGENLDD-IRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNV 148 (344) T ss_pred -EEeeecCCCCCC-CCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 222222222110 111122222111111 12234578899999999999999999999999998888887766543222 Q ss_pred hhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccc-hhhccc----HHHHHHHHHHhccc--cCceeEEEEccHHHHHH Q lcl|Aclame:pro 146 NLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNP-ADAVFN----REAFVDAAFTMGDH--VGSIAAIAVHSMVYKRM 218 (367) Q Consensus 146 ~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~-a~~~~s----~~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L 218 (367) ....+.... ......+....+.... .....+ ++.|.+|.+.|-+. ...=.+++|.|.+|..| T Consensus 149 ~~~~~~~~~-----------g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~L 217 (344) T protein:vir:10 149 ESQYNENIT-----------GLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAI 217 (344) T ss_pred ccccccccc-----------cccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHH Confidence 111111000 0001111211111100 011122 45566677776432 22337899999999999 Q ss_pred Hhcchhhhccccc-c---cccchhhcCcEEEEeCCCcccCCC--------CCceE---------------EEEEEeccee Q lcl|Aclame:pro 219 TNNDEIEFIPDSK-G---QLTIPTYMGKVVIVDDGMPVFGTG--------ADKTY---------------LSILFGGAAF 271 (367) Q Consensus 219 ~k~~li~~~~~~~-g---~~~i~t~~G~~VivdD~~pv~~t~--------~~~~y---------------ttyl~~~GAi 271 (367) .+...+....+.. 
+ +-.|+.++|.+|+.+..+|....+ .+..| ...+|-+-|+ T Consensus 218 l~~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~ 297 (344) T protein:vir:10 218 LAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAV 297 (344) T ss_pred hhcccccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhh Confidence 8886554332221 1 235888999999999999853211 11000 0122223333 Q ss_pred eeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCcccee Q lcl|Aclame:pro 272 GYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER 351 (367) Q Consensus 272 ~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~ 351 (367) +.....+ ..+|..|++... + +.+. .+|.+|.. T Consensus 298 ~~v~~~~-~~~e~~r~~~~~---~-d~i~-----g~~~~G~~-------------------------------------- 329 (344) T protein:vir:10 298 GTVKLRD-LALERARRANFQ---A-DQII-----AKYAMGHG-------------------------------------- 329 (344) T ss_pred hhhhhcc-ceeecccchhHH---H-HHHH-----HHhhcccc-------------------------------------- Confidence 2222221 112333332211 0 1111 12222222 Q ss_pred eecccccceEEEEec Q lcl|Aclame:pro 352 VTYRKNVPMAFLVTK 366 (367) Q Consensus 352 v~d~K~i~iv~~~t~ 366 (367) +..|+...-|.|+|| T Consensus 330 vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 330 GLRPEAAGAVVFKTK 344 (344) T ss_pred eecccceEEEEeecC Confidence 345667777788888 No 91 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.27 E-value=1.7e-07 Score=57.64 Aligned_cols=281 Identities=10% Similarity=0.062 Sum_probs=135.9 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 
Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+... .+..+.+|.||+.... .+...+.+.+.+ +-.....++..+++|.+..- ..+.-+.|+. .+ T Consensus 14 ~~~t~-~~~~~~~ip~~~~~~i-i~~~~~~s~l~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~---~~ 78 (320) T protein:vir:10 14 IAQTG-DTMFKGYLEPEQAKDY-FAEAEKTSIVQQ---------FAQKVPMGTTGQKIPHWIGD-VSAQWIGEGD---MK 78 (320) T ss_pred hhccc-cccccccccHHHHHHH-HHHHHhccchhh---------hcceeeccCCceEEEEEeCC-cceEEecCCc---cc Confidence 33211 1223456666655544 444444443322 11222345778999998743 3445566654 35 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+++-.+.....++.+....++++...-+..|-.+.+.+++++.+.+..++.+|. |.-+ ........ T Consensus 79 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~---G~g~---~~~~~~~~----- 147 (320) T protein:vir:10 79 PITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALN---GTDS---PFPTYLAQ----- 147 (320) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhc---ccCC---CCCccccc----- Confidence 556666677777888889999999998887778888999999999988887777642 2100 00000000 Q ss_pred hhhhhcchhhcceeecCccc-chhhcccH-HHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hcccc---c-- Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTN-PADAVFNR-EAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPDS---K-- 231 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~-~a~~~~s~-~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~~---~-- 231 (367) ...........+ .+...... ..+.++..+.-.....-.+++||+..+..|++..--+ ++-.. . 
T Consensus 148 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~ 219 (320) T protein:vir:10 148 --------TTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDE 219 (320) T ss_pred --------ccccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCc Confidence 000000011101 01111112 2355666666555666779999999999998753211 11000 1 Q ss_pred -ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcCCcee----EEEEEc--- Q lcl|Aclame:pro 232 -GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGNGSGL----EYILER--- 302 (367) Q Consensus 232 -g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~~~g~----~~l~~r--- 302 (367) ....-.+++|++|++++.+|-. ++. .+||. .-+.++.-. ...++..|+...-.+... -.++.| T Consensus 220 ~~~~~~~~i~g~pv~~~~~~~~~------~~~-~~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~ 291 (320) T protein:vir:10 220 NSPFRAGRIVSRPTILSDHVADG------TTV-GYMGDFRNVIWGQVG-GLSFDVTDQATLNLGTPTEPNFVSLWQHNLV 291 (320) T ss_pred cccccCceeeeeeeEecCCCCCC------ceE-EEEeecceEEEEEec-CeEEEEeecceeeeccccccccchhhhcCcE Confidence 1122357889999999998742 211 11111 011122211 122444444321111100 011111 Q ss_pred --------cEEEeeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 303 --------KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 303 --------~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) ..-++||.-|.-... +++ . 
+| T Consensus 292 ~~r~~~~~d~~v~~~~a~~~l~~-~~a-------------p----~~ 320 (320) T protein:vir:10 292 AVRVEAEYAFHNNDKDAFVKLTN-VVT-------------P----DA 320 (320) T ss_pred EEEEEEeeccEEecccceEEEEe-ccC-------------C----CC Confidence 112234444332210 000 0 11 No 92 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.26 E-value=3.5e-07 Score=56.00 Aligned_cols=271 Identities=9% Similarity=-0.001 Sum_probs=142.0 Q ss_pred CC----Cccccc--cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCC Q lcl|Aclame:pro 1 MP----DFNNQV--RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSD 74 (367) Q Consensus 1 Ma----~~~~~T--~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~ 74 (367) |- ++.+.| .-....+|+.+..=+.+...+.+.+.+-.-+.+ ..++..+++|....- ..+.-+.|+ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~--------~~~~~~~~~~~~~~~-~~a~~v~Eg 71 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQE--------MEGEQEKTVYVQTDG-ISAYWVNET 71 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee--------cCCCccEEEEEEcCC-ceeEEeecC Confidence 21 111222 223446777776555555555554433221111 123345667755432 345556665 Q ss_pred CccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) . .++..+.+-++.....++.+....++++...-+..|....|.+++++.+.+..++.+|. | +....... 
T Consensus 72 ~---~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G---~g~~~~~g-- 140 (297) T protein:vir:95 72 E---KIKTDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLL---G---HDTPFANS-- 140 (297) T ss_pred c---cccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc---c---cCCccccc-- Confidence 4 34555666666677788888889999988887778889999999999999998888772 2 11111100 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhccccccc- Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ- 233 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~- 233 (367) +. ........ .....++++.+.++..++.+......+++||++.+..|++.. +.+|. T Consensus 141 -i~------------~~~~~~~~---~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~~G~~ 198 (297) T protein:vir:95 141 -VA------------KAAKDANK---VIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREAR------DGNKVS 198 (297) T ss_pred -cc------------ccccccce---ecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhh------ccCCce Confidence 00 00001111 112347899999999998887777789999999999998752 22222 Q ss_pred ---ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc-eeeeeccCCCcceeeeeehhhc-----C-------CceeE Q lcl|Aclame:pro 234 ---LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELRG-----N-------GSGLE 297 (367) Q Consensus 234 ---~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G-Ai~~~~~~~~~~~e~~rd~~~~-----~-------~~g~~ 297 (367) ....+++|++|++..+++... + ..+|+.= .+.++... ...++..++.... . ..++. 
T Consensus 199 i~~~~~~~l~G~Pv~~~~~~~~~~----~---~~~~gd~s~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (297) T protein:vir:95 199 IYDKAANTIDGITTVDLKSARFEK----G---DLLAGDFDNLIYGVPY-NITYKISEEGQISTITNADGTPINLFEQEMI 270 (297) T ss_pred eecCCCCcccceeeEeecCCCCCC----c---eEEEEecccEEEEEec-CeEEEEeeccccccccccCccchhhhhcCcE Confidence 234678999999887766532 2 1233321 11122222 1223333332110 0 00111 Q ss_pred EEEEccE---EEeeeeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 298 YILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 298 ~l~~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) .+-...+ -+++|..|.-.. ...|- T Consensus 271 ~~r~~~~~d~~v~~~~a~~~l~------------------~at~~ 297 (297) T protein:vir:95 271 AIRATMDIAVMITKTDAFAKLT------------------PAERV 297 (297) T ss_pred EEEEEEEeccEeecccceEEEe------------------ecCCC Confidence 1111111 123444443211 00111 No 93 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.23 E-value=2.4e-07 Score=56.87 Aligned_cols=282 Identities=12% Similarity=0.077 Sum_probs=135.1 Q ss_pred CCCcccccc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) ....+.-|. =.-.++|+-+..-+.+.+.+.+.|.+..-+ ...++...++|....- ..+.-+.|+.. 
T Consensus 127 ~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~---------~~~~~~~~~~~~~~~~-~~a~wv~E~~~--- 193 (425) T protein:vir:10 127 QAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRV---------QPVSKAGFSKLFNMGG-TTSGWVGEASQ--- 193 (425) T ss_pred HHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhcee---------eeccCCceEEEEEcCC-cceeeeccccc--- Confidence 000010110 011345665655555554444444321111 1223455677765432 33444455432 Q ss_pred ccccc-cchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-----HHHHHhhhhhhhhhh Q lcl|Aclame:pro 80 APIDG-LGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-----AVGVYKSNLAGNFAT 153 (367) Q Consensus 80 ~t~~k-itt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-----l~Gvf~~~~a~~~~~ 153 (367) .+..+ .+-.+.....++.+--..++++...-+.-|-.+.+.+++++-+.+..+..+|.= -.|+++...... T Consensus 194 ~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~--- 270 (425) T protein:vir:10 194 RPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGA--- 270 (425) T ss_pred cccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccc--- Confidence 22222 222333344455555567777766555567778999999988877776655420 011111110000 Q ss_pred hhhhhhhhhhhhcchhhcceee--cCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hc-c Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDI--SGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FI-P 228 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~di--sa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~-~ 228 (367) ....+..+. ...+. 
....++++.+++....+......-..|+||+..+..|++..--+ ++ + T Consensus 271 -------------~~~~~~~~~~~~~~~~-~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~ 336 (425) T protein:vir:10 271 -------------NAAKHPFGAIEVVNSG-AAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQ 336 (425) T ss_pred -------------cccccccccccccccc-ccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeec Confidence 000000000 00111 22457888999888777655555568999999999988753111 11 1 Q ss_pred cccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE- Q lcl|Aclame:pro 229 DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW- 305 (367) Q Consensus 229 ~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~- 305 (367) +.-....-.+++|++|+++|.||..+++.. +.+||. .++.+... ..+++.+++... .++..+....++ T Consensus 337 ~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~----~i~~Gd~~~~~~i~~~---~~~~v~~d~~~~--~~~~~~~~~~r~d 407 (425) T protein:vir:10 337 PSYVAGQPATLAGYPVTEVPDMPDVAANST----PILFGDFQQTYLIIDR---IGVRVLRDPYTA--KPYVLFYTTKRVG 407 (425) T ss_pred cCccCCCCceecceeeEEecCcCCccCCcc----EEEEEehhccEEEEEe---cceEEEeccccc--CCcEEEEEEEEec Confidence 111111225799999999999996554322 344542 23333221 235666665543 344444443333 Q ss_pred --EeeeeeeeecccccccccccccccccccccC Q lcl|Aclame:pro 306 --IVHPGGFNWLDADVTIPDNTGSPSGITSGPP 336 (367) Q Consensus 306 --~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~ 336 (367) ++||..|.--.-. 
+++ T Consensus 408 ~~v~~~~A~~~l~~~---------------as~ 425 (425) T protein:vir:10 408 GGLLNPEPMRAMKVA---------------ASE 425 (425) T ss_pred cEeecccceEEEEee---------------ccC Confidence 3456555432211 111 No 94 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.23 E-value=4.6e-07 Score=55.35 Aligned_cols=282 Identities=10% Similarity=0.010 Sum_probs=131.6 Q ss_pred CCC-------ccccccce--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeec--cCCCccc Q lcl|Aclame:pro 1 MPD-------FNNQVRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWR--DLDSLEP 69 (367) Q Consensus 1 Ma~-------~~~~T~l~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~--~l~g~~~ 69 (367) |.. +...+..+ ...+|+.+...+.....+.+.+.+ . -+....++....+|++. .-.+.+. T Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~------~---~~~~~~~~~~~~~~~~~~~~~~~~a~ 175 (408) T protein:vir:10 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ------Y---VRVESVSTSNGSRVYEKWTDVTPLTV 175 (408) T ss_pred hhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhh------h---cceeeccCCcceEEEeecccccccee Confidence 000 00012222 256788888777776666555422 1 11112234445555543 3323444 Q ss_pred ccCCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 70 NYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAG 149 (367) Q Consensus 70 ~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~ 149 (367) -+.|+..... .+..+-++.....++.+....++++...-+.-|....+.+++++-..+..++.++.-.. 
.+ T Consensus 176 ~v~E~~~~~~--~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-------~~ 246 (408) T protein:vir:10 176 MDAEDGKIPD--LDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK-------AA 246 (408) T ss_pred eecCcccccc--ccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc-------cc Confidence 5555543211 12233333344555566666777776666666778889999987777666554432110 00 Q ss_pred hhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchhh--h Q lcl|Aclame:pro 150 NFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEIE--F 226 (367) Q Consensus 150 ~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li~--~ 226 (367) . ......+++.+.++... +-.....=..++||+..+..|++..--+ | T Consensus 247 --------------------------~----~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~ 296 (408) T protein:vir:10 247 --------------------------P----KKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKY 296 (408) T ss_pred --------------------------c----cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCce Confidence 0 01123457778887643 3222223357999999999998864221 2 Q ss_pred cc-cccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhc--CCceeEEEEE Q lcl|Aclame:pro 227 IP-DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRG--NGSGLEYILE 301 (367) Q Consensus 227 ~~-~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~--~~~g~~~l~~ 301 (367) +- ..-....-.+++|++|++.+..++... ..+.+ .++||.= ++.+..-. .+++++++... ...++..+.. 
T Consensus 297 i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~~-~i~~gd~~~~~~~~~~~---~~~v~~~~~~~~~f~~~~~~~r~ 371 (408) T protein:vir:10 297 LLEPDPTKPNSYLIKGKQVIVVADRWLPNT-GSTVY-PLYYGDMSQAITLFDRE---NMSLLPTNIGAGAFETDTTKIRV 371 (408) T ss_pred EeccCcCCCCCceecceeeEEecccccCcc-CCCce-EEEEEehhccEEEEEec---ceEEEEcccccchhhcCceEEEE Confidence 11 111112235899999999765433222 22333 3556542 23332221 23333333221 1123344443 Q ss_pred ccE---EEeeeeeeeecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 302 RKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 302 r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) ..+ .++||.+|..-.-.-++|. .+.. ..||-... T Consensus 372 ~~r~d~~v~~~~a~~~~~~~~~~~~---~~~~-----~~~~~~~~ 408 (408) T protein:vir:10 372 IDRFDVKATDSEALVAGSFSAIADQ---VGNF-----KTTTSTAV 408 (408) T ss_pred EEeeccEEeccccEEEEEeeccccC---CCCC-----CCCCcccC Confidence 333 3457777765432111111 1111 11111111 No 95 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.22 E-value=4.2e-07 Score=55.52 Aligned_cols=295 Identities=12% Similarity=0.050 Sum_probs=137.0 Q ss_pred CCC------------cccccccee--ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC Q lcl|Aclame:pro 1 MPD------------FNNQVRLVD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS 66 (367) Q Consensus 1 Ma~------------~~~~T~l~d--~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g 66 (367) |.. +-..+.-++ .++|+.+.+-+...+.+.+-|.+ . ......++..+.+|....- . 
T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~------~---~~~~~~~~~~~~~~~~~~~-~ 159 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQ------E---ATVITLGGSDYKKLVNLGG-T 159 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhh------h---ceeeecCCCceEEEEecCC-c Confidence 100 000011111 35677776666655554443322 1 1112234456666654322 2 Q ss_pred cccccCCCCcccccccccc-chhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhh Q lcl|Aclame:pro 67 LEPNYGSDNPNVEAPIDGL-GSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKS 145 (367) Q Consensus 67 ~~~~~~~~~~~~~~t~~ki-tt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~ 145 (367) .+.-+.|+.. .+..+. +-.+.....++.+.-..++++...-+..|-.+.|.+++++-+.+..+..+| .| + T Consensus 160 ~a~~v~E~~~---~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l---~G---~ 230 (407) T protein:vir:48 160 TSGWVGETDA---RPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFT---SG---D 230 (407) T ss_pred ceeeeccccc---ccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhh---cc---C Confidence 3333445432 221121 223333444555555677777766666777889999999877776665443 22 1 Q ss_pred hhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh Q lcl|Aclame:pro 146 NLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE 225 (367) Q Consensus 146 ~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~ 225 (367) ......+..... ................+.. 
.....++++.+.+....+......-..++||+..+..|++..--+ T Consensus 231 G~~~p~Gil~~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~ 306 (407) T protein:vir:48 231 GSKKPKGFLAYE-STDEDDKTRAFGKLQHIAS---GAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDND 306 (407) T ss_pred CCCccceeeecc-ccccccccccccccccccc---ccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccC Confidence 100000000000 0000000000000011111 122357889999988777555555567999999999988753211 Q ss_pred --hc-ccccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEE Q lcl|Aclame:pro 226 --FI-PDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYIL 300 (367) Q Consensus 226 --~~-~~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~ 300 (367) ++ ++.-......+++|++|+++|.||..++++. +.+||. .++.+.+ ...+++.|++... .++..+. T Consensus 307 Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~----~i~~Gd~~~~~~i~~---~~~~~i~~d~~~~--~~~~~~~ 377 (407) T protein:vir:48 307 GNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAK----AIAFGNFKRGYTIVD---RIGTRILRDPYTN--KPFVGFY 377 (407) T ss_pred CceeeccCcCCCCCceecceeeEEecCcCCccCCcc----EEEEEeccccEEEEE---eeceEEEeecccc--CCcEEEE Confidence 22 1111111235799999999999997543322 234442 1222222 1235566665543 3444444 Q ss_pred EccE---EEeeeeeeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 301 ERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 301 ~r~~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) ...+ -+++|..|..-.-.- +..+-.-+ T Consensus 378 ~~~r~d~~v~~~~a~~~l~~~a--------------a~~~~~~~ 407 (407) T protein:vir:48 378 TTKRTGGMLVDSQAIKLMKIGA--------------ATRQKAAA 407 (407) T ss_pred EEEEeccEEecccceEEEEeec--------------cCCCCCCC Confidence 4322 355777776432211 00000000 No 96 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.19 E-value=1.4e-07 Score=58.18 Aligned_cols=296 Identities=11% 
Similarity=0.038 Sum_probs=150.7 Q ss_pred CCCccccccc-------------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCc Q lcl|Aclame:pro 1 MPDFNNQVRL-------------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL 67 (367) Q Consensus 1 Ma~~~~~T~l-------------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~ 67 (367) ||....--++ -++|. |+|...|.....+.+.| .+.-...+ + .+|+++.+|..+...- T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~------~~~~~~r~-i-~~G~sv~~~~iG~~~~- 70 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVT------MDKHMVRT-I-QNGKSASFPVMGRTKG- 70 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhh------hhcccccc-c-cCcceEEEeeecceee- Confidence 7743222222 24566 88888887766666655 22222221 1 4799999999987643 Q ss_pred ccccCCCCccccccccccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhh Q lcl|Aclame:pro 68 EPNYGSDNPNVEAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSN 146 (367) Q Consensus 68 ~~~~~~~~~~~~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~ 146 (367) ..+..+++.+ .+...+.+.+..-+|= ..--.+.+.|+-..-.-.|++.+++++.+..-+|..++.++..|....... T Consensus 71 -~~~~~g~~l~-~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~ 148 (347) T protein:vir:88 71 -YYLAPGENLD-DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) T ss_pred -eeeccccCCC-CCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 2222222211 1111233232222221 123456788888888888999999999998888988887776665443322 Q ss_pred hhhhhhhhhhhhhhhhhhhcchhhcceeec-Cccc-c--hhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHh Q lcl|Aclame:pro 147 LAGNFATIKTRGRVPAEVLGTAGDMVIDIS-GQTN-P--ADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTN 220 (367) Q Consensus 147 ~a~~~~~~~~~~~~~a~~~~~~~~~v~dis-a~t~-~--a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k 220 (367) ...+...... .....+.+. +... + .....-++.|.+|...|.+.. 
..=..+++.|.+|..|.+ T Consensus 149 ~~~~~~~~g~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~ 217 (347) T protein:vir:88 149 AASNENIAGL-----------GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILS 217 (347) T ss_pred cccccccCCc-----------cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhc Confidence 2221111110 001111111 1100 0 001112577888888885542 335899999999999987 Q ss_pred cchhh-hccccccc---ccchhhcCcEEEEeCCCcccCCCCC--------------------ceE------E-EEEEecc Q lcl|Aclame:pro 221 NDEIE-FIPDSKGQ---LTIPTYMGKVVIVDDGMPVFGTGAD--------------------KTY------L-SILFGGA 269 (367) Q Consensus 221 ~~li~-~~~~~~g~---~~i~t~~G~~VivdD~~pv~~t~~~--------------------~~y------t-tyl~~~G 269 (367) ..... ....+++. -.++.++|.+|+.+..+|+...+.. ++| + .++|-+- T Consensus 218 ~~~~~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~ 297 (347) T protein:vir:88 218 ALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRS 297 (347) T ss_pred chhhhhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechh Confidence 54222 11112222 2478899999999999997533210 011 1 1334445 Q ss_pred eeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 270 AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 270 Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) |++.....+. .+|..|++... .+.+..+. .+....+.+.-+-+. 
..+.++ T Consensus 298 a~g~v~~~d~-~~e~~r~~~~~----~d~i~~~~--~~G~~~~rPe~a~~~--------~~~~a~ 347 (347) T protein:vir:88 298 AVGTVKLKDM-ALERARRPEFQ----ADQIIGKY--AMGHGGLRPEAAGAL--------VFTPAA 347 (347) T ss_pred hhhheecccc-eeeeeechhhH----HHHhhhhh--hhcCceeccceEEEE--------EeCCCC Confidence 5555444432 27777877653 13333222 222222333222111 011111 No 97 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.18 E-value=1.7e-07 Score=57.71 Aligned_cols=288 Identities=9% Similarity=0.011 Sum_probs=137.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+... .|.-+-+| |+.+..-+.+...+.+.+.+ +-+....++..+.+|.+..- +.+.-+.|+.. + T Consensus 14 ~~~~~-~~~~~~~i-p~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~~---~ 78 (318) T protein:vir:24 14 IAQTG-DTMFKGYL-EPEQAKDYFAEAEKTSIVQQ---------FAQKVPMGTTGQKIPHWVGD-VSAQWIGEGDM---K 78 (318) T ss_pred hhccc-Ccccceee-chhHHHHHHHHHHhhchhhh---------hcceeeccCCceEEEEEeCC-cceEEecCCcc---c Confidence 44311 12233444 55454545555444444322 12222345778999998754 55666777654 4 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+.+-.+.....++.+..+.++++...-+..|..+.+.+++++.+.+..++.+| .|.-. ...... 
T Consensus 79 ~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l---~G~g~---~~~~~~------- 145 (318) T protein:vir:24 79 PITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAM---HGTDS---PFPTYI------- 145 (318) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhh---cccCC---CCCccc------- Confidence 45556555666667788888899998877777788999999999999988887665 22110 000000 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hc-ccc--cc--- Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FI-PDS--KG--- 232 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~-~~~--~g--- 232 (367) ...+..++.....+........+.++....-.....-.+++||+..+..|++..--+ ++ +.. .+ T Consensus 146 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~ 217 (318) T protein:vir:24 146 --------GQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAAS 217 (318) T ss_pred --------ccccccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccc Confidence 000001111111111122334455555555444445568999999999998753111 11 111 01 Q ss_pred cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcCCceeE----EEEEccEEEe Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGNGSGLE----YILERKEWIV 307 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~~~g~~----~l~~r~~~~~ 307 (367) ...-..+.|++|++++.+|... . ..+|+. .-+.++..+ ...+++.|+.....+...+ .++.|.. 
+ T Consensus 218 ~~~~~~i~g~pv~~~~~~~~~~------~-~~~~gdfs~~~~~~~~-~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~--~ 287 (318) T protein:vir:24 218 PFRSGRIVARPTILSDHVVEGT------T-VGFMGDFSQLIWGQIG-GLSFDVTDQATLNLGTVESPNFVSLWQHNL--V 287 (318) T ss_pred cccCceEEEEeeEEeCCCCCCc------c-EEEEeecceEEEEEec-CeEEEEeeccceeccccccccchhhhhcCc--E Confidence 1122467899999999987421 1 112221 112233322 2235555543321111000 1121111 1 Q ss_pred eeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) ..+...+-+- .|.+++++..+..++=| T Consensus 288 ~~r~~~r~d~---------------------------------~v~~~~a~~~i~~~~a~ 314 (318) T protein:vir:24 288 AVRVEAEYAF---------------------------------HCNDAEAFVALTNVVSG 314 (318) T ss_pred EEEEEEEEcc---------------------------------EEecccceEEEEeeccC Confidence 1122222110 01222222222222222 No 98 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.18 E-value=1.4e-07 Score=58.16 Aligned_cols=272 Identities=13% Similarity=0.063 Sum_probs=133.6 Q ss_pred CC-Ccccccc--ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MP-DFNNQVR--LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma-~~~~~T~--l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) ++ .....|. -..++.|+++...+..-..+.. ++.....+-. ...+..+.+|.+..- ..+.-+.|+.. 
T Consensus 106 ~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~------~l~~~~~~~~--~~~~~~~~~p~~~~~-~~a~wv~E~~~- 175 (390) T protein:vir:62 106 FAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSA------IMRGGATTFT--TSDANPLDFTVITGR-SSASIVGETAE- 175 (390) T ss_pred hhhhhhcccccCCCccccccchHHHHHHHHhhhh------hhhhcceeee--cCCCceeEEEEEcCC-cceeeeccccc- Confidence 00 0001111 1235667766665543322222 2211111100 134567889988654 34444556543 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH---HHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA---MAVGVYKSNLAGNFATI 154 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla---~l~Gvf~~~~a~~~~~~ 154 (367) ++..+.+-++..-..++.+.-..++++...-+.-|-...+.+++++.+.+..+..+|. .-+|+++.... T Consensus 176 --~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~------ 247 (390) T protein:vir:62 176 --IPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASP------ 247 (390) T ss_pred --ccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccc------ Confidence 4444455555555666667667777777766666777889999998888777775552 00112111000 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcc--hhhhccccc- Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNND--EIEFIPDSK- 231 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~--li~~~~~~~- 231 (367) ..... .+ + ....++++.+++....+......-..++||+..+..|++.. 
.-.|+-.++ T Consensus 248 --------------~~~~~-~~---~-~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~ 308 (390) T protein:vir:62 248 --------------ATATF-LA---T-DTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGL 308 (390) T ss_pred --------------cccce-ec---c-cccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCc Confidence 00011 01 1 12347788888877666544444557999999999987652 112221111 Q ss_pred ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEe Q lcl|Aclame:pro 232 GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIV 307 (367) Q Consensus 232 g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~ 307 (367) ....-.+++|++|+++|.+|... ++||.=. +.+.. ...+++++.....-..++..+....+ -++ T Consensus 309 ~~g~~~~l~G~Pv~~~~~~p~~~---------i~~gd~s~~~i~~---~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~ 376 (390) T protein:vir:62 309 TVGAPSLFNGKVVETDDGMPADK---------ILFADLSKYRVRF---AGSLRVDRSVDAKFSTDQIVYRFLQRADGLLV 376 (390) T ss_pred CCCccceecccceEEecCCCCcc---------EEEeeccceeEEe---ecceEEEeeccccccCCcEEEEEEEEeCcEee Confidence 11122579999999999998531 3333211 11111 12233333322221123333333222 345 Q ss_pred eeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) ||..+....-. . +. 
T Consensus 377 ~~~A~~~l~~~-------------~-~a 390 (390) T protein:vir:62 377 DARGAKVLTVT-------------P-GA 390 (390) T ss_pred chhheEEEEee-------------c-CC Confidence 66666654311 0 11 No 99 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.17 E-value=4.8e-07 Score=55.22 Aligned_cols=263 Identities=11% Similarity=0.002 Sum_probs=121.3 Q ss_pred CCCcccccc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) .......|. =.-..+|+.+..-+.....+.+.|.+ . -.....++...++|.+..-++...-+.|+..... T Consensus 125 ~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~------~---~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~ 195 (394) T protein:vir:97 125 EPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP------F---TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPA 195 (394) T ss_pred hhhccccccccccccChHHHHHHHHHHhhhhhhhhh------h---ceeeeccCcceEEEEEecCCCccceecccccccc Confidence 111111111 12246777665555544444333311 1 1112234556889988755445555666543221 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+-+......++.+.-..+++....-+..|-...+.+++++...+.....+|..+. 
T Consensus 196 --~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~------------------- 254 (394) T protein:vir:97 196 --LAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK------------------- 254 (394) T ss_pred --cccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc------------------- Confidence 12233333344555666666777765555555666778888876555544433322110 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccccc-ccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK-GQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~-g~~~i 236 (367) ++......+++.++++....-+... -..++||+..+..|++..-- .|+-..+ .+..- T Consensus 255 -------------------~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 314 (394) T protein:vir:97 255 -------------------SFTTKTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSG 314 (394) T ss_pred -------------------cccccccccHHHHHHHHHhhhhhhh-CCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCC Confidence 0011223567778777765433322 25799999999998875311 1111000 11123 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) ++++|++|++.+++++.. + +++||. -.+.+..-. ...++..++. ....+.- .+.| ...+.||..|. 
T Consensus 315 ~~l~G~pv~~~~~~~~~~----~---~~~~gd~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~-~~~r~d~~v~~~~a~~ 383 (394) T protein:vir:97 315 KVLLGKPVFVLSDEVLGA----N---KAFIGDFKRGVLFADRK-DLGLRWADNE--IYGQYLQ-AVLRFGVSKVDDKAGY 383 (394) T ss_pred ceeccceeEEecccccCC----c---cEEEeeccccEEEEEec-ceEEEEeccc--ccceeEE-EEEEEccEEecccceE Confidence 589999999987765432 1 133442 112222111 1123332222 1111111 1111 22344666666 Q ss_pred ecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) .-.- +|+-+-| T Consensus 384 ~~~~-------------------~~~~~p~ 394 (394) T protein:vir:97 384 YVTF-------------------TPEPLPL 394 (394) T ss_pred EEEe-------------------cccccCC Confidence 5322 2222222 No 100 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.17 E-value=5.1e-07 Score=55.09 Aligned_cols=266 Identities=8% Similarity=-0.003 Sum_probs=128.2 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC-CcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-SLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~-g~~~~~~~~~~~~~ 79 (367) |.. ..+.-...++|+.+.+.+.....+.+.+.+-. +...-++...++|+...-+ +.+.-+.|++.... 
T Consensus 91 ~~~--~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~---------~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 159 (371) T protein:vir:81 91 MSE--GSNQDGGYTVPQDIQTRINELRESKDALQNLI---------TVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGE 159 (371) T ss_pred hcc--CCCccCceeecHhHHHHHHHHHHhhhhhhhhc---------eeeeccCCceeEEEEeecCCcceeeecccccccc Confidence 322 11112345677777666666655555542210 0011234444444443332 23445566543211 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) ..+.+-++.....++.+.-..++++...-+..|-...+.+++++...+..+..++.-. . T Consensus 160 --~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~-----------g-------- 218 (371) T protein:vir:81 160 --KATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVL-----------N-------- 218 (371) T ss_pred --ccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc-----------c-------- Confidence 2233334444455555666677777655555566678888888766666555444311 0 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchhh--hccc-cccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEIE--FIPD-SKGQLT 235 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~~~-~~g~~~ 235 (367) ++......+++.+.++... +-.....-..++||+..+..|++..--+ ++-. .-.... 
T Consensus 219 -------------------~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~ 279 (371) T protein:vir:81 219 -------------------TKAKTAIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPT 279 (371) T ss_pred -------------------cccccccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCC Confidence 0011223566777776643 3233334568999999999998763221 1111 101122 Q ss_pred chhhcCcEEEEeCCCcccCCC---CCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCC--ceeEEEEEccE---E Q lcl|Aclame:pro 236 IPTYMGKVVIVDDGMPVFGTG---ADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNG--SGLEYILERKE---W 305 (367) Q Consensus 236 i~t~~G~~VivdD~~pv~~t~---~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~--~g~~~l~~r~~---~ 305 (367) -++++|++|+++|.||..... .......++||. -.+.+.. ...++++++....+. .++..+....+ - T Consensus 280 ~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~---~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~ 356 (371) T protein:vir:81 280 GRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFD---RQRTEIMSSNVAMDAFETDATLWRAIERMDVK 356 (371) T ss_pred CceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEe---ecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 368999999999999864311 111223455553 1122211 112334444332211 23333333222 2 Q ss_pred Eeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 306 IVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 306 ~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) +.||..|..-.-.. 
+ T Consensus 357 ~~~~~a~~~~~~~~--------------A 371 (371) T protein:vir:81 357 MRDDEAFVFGEVQL--------------A 371 (371) T ss_pred EecccceEEEEEec--------------C Confidence 45777776543211 1 No 101 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=98.16 E-value=6.8e-07 Score=54.38 Aligned_cols=265 Identities=8% Similarity=-0.040 Sum_probs=124.8 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC-cccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS-LEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g-~~~~~~~~~~~~~ 79 (367) .......|....+ +|+.+..-+.....+.+.+.+ . -+.....+..+++|.....++ ....+.|+.. T Consensus 106 ~~~~~~~~~~~~~-ip~~~~~~ii~~~~~~~~i~~------~---~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~--- 172 (379) T protein:vir:10 106 VGDMTLPVNLTGA-QPKDYNFDVVLNPSQMLNVSD------I---VGAVSISGGTYTFVRENGAGEGAIGAQVEGAT--- 172 (379) T ss_pred hcccccCCCCccc-cchhhhhHHHHhHHhhhhHHh------h---ceeeeccCCceEEEEeecCCCcccccccCCcc--- Confidence 2222222333434 466565555555444443311 1 111223466788888765543 2233455432 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+..+++-++.....++.+.-..++++...-+ .+....+.++++....+..+..++..+. 
+ T Consensus 173 ~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~---~--------------- 233 (379) T protein:vir:10 173 KGQKDYDISMIDVNTDFIAGFTRYSKKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLA---A--------------- 233 (379) T ss_pred ccccccceeeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhcccc---c--------------- Confidence 33344555555555666666667777653322 2344556666654443333333222110 0 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhccc-c-c-ccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPD-S-K-GQL 234 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~-~-~-g~~ 234 (367) ..+....+. ....+.+.+.++...+.+..-.-.+++||+..+..|++..-- .|+-. . . ... T Consensus 234 ----------~~~~~~~~~----~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 299 (379) T protein:vir:10 234 ----------NATASTEII----TNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDN 299 (379) T ss_pred ----------ccccccccc----cCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCC Confidence 000000111 122346788998888777766777899999999998876311 12111 0 0 011 Q ss_pred cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeee Q lcl|Aclame:pro 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGG 311 (367) Q Consensus 235 ~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G 311 (367) .-.+++|++|++++.||- |++..-=|..+++.+.. ...+++.++....-..++..+....|+ +.||.. 
T Consensus 300 ~~~~l~G~pvv~s~~~~a------g~~~~gdf~~~~~~~~~---~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a 370 (379) T protein:vir:10 300 GVLRINGIPLFRATWLAA------NKYYVGDWTRVTKVTTE---GLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAA 370 (379) T ss_pred CcceecceeeEecCCCCC------CceEEeecccEEEEEEe---ceEEEEeecccccccCCcEEEEEEEEeccEEecCcc Confidence 224789999999999973 22111112222333322 123444444322111233344333333 445555 Q ss_pred eeeccccccccccccccccccccc Q lcl|Aclame:pro 312 FNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 312 ~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) |-+- +.++ - T Consensus 371 ~v~~--~~~~-------------~ 379 (379) T protein:vir:10 371 LIFG--DFTA-------------V 379 (379) T ss_pred EEEE--EecC-------------C Confidence 5432 1111 0 No 102 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.15 E-value=8.6e-07 Score=53.84 Aligned_cols=277 Identities=9% Similarity=-0.022 Sum_probs=127.2 Q ss_pred CCCcccccc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC-cccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS-LEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g-~~~~~~~~~~~~ 78 (367) +...+..|. =.-.++|+.+.+.+.....+.+.+.+- -.....++...++|++..-++ .+.-+.|+.... 
T Consensus 103 ~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 173 (392) T protein:vir:10 103 QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY---------VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIP 173 (392) T ss_pred hhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh---------ceeeeccCCceeEEEEeecCCccceeeccccccc Confidence 111111121 133467888877777766666655321 111112334434443332222 334455543321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) .. +..+-.+..-..++.+.-..++++...-+.-|-...+.+++++...+..+..++... . T Consensus 174 ~~--~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~-----------g------- 233 (392) T protein:vir:10 174 ET--DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI-----------E------- 233 (392) T ss_pred cc--ccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------c------- Confidence 11 112222323334455555666666544444456677888887665555444433210 0 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHH-HhccccCceeEEEEccHHHHHHHhcchh--hhcc-cccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF-TMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQL 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~-~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~ 234 (367) ++......+++.+.++.. .+-.....-..++||+..+..|++..-- .++- ..-... 
T Consensus 234 --------------------~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 234 --------------------KLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred --------------------cccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 011123467788888874 4444444457799999999999875311 1111 111112 Q ss_pred cchhhcCcEEEE-eCCCcccC-CCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEe Q lcl|Aclame:pro 235 TIPTYMGKVVIV-DDGMPVFG-TGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIV 307 (367) Q Consensus 235 ~i~t~~G~~Viv-dD~~pv~~-t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~ 307 (367) .-++++|+++|+ +|.+++.. ....+.+ +++||. -++.+..-. ...+++++.....-..++..+....+ -++ T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~-~~~~gdfs~~~~i~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKA-PLIIGDLKEAIVLFKRE-DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCce-EEEEEehhceEEEEeec-ceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 236789976555 44443322 2223332 455553 122222211 12233333211111122333333222 355 Q ss_pred eeeeeeecccccccccccccc Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSP 328 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~ 328 (367) ||.+|..-.-...+|+.+..| T Consensus 372 ~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cccceEEEEecccccccCCCC Confidence 888887755554555444333 No 103 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.15 E-value=8.6e-07 Score=53.84 Aligned_cols=277 Identities=9% Similarity=-0.022 Sum_probs=127.2 Q ss_pred CCCcccccc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC-cccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS-LEPNYGSDNPNV 
78 (367) Q Consensus 1 Ma~~~~~T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g-~~~~~~~~~~~~ 78 (367) +...+..|. =.-.++|+.+.+.+.....+.+.+.+- -.....++...++|++..-++ .+.-+.|+.... T Consensus 103 ~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 173 (392) T protein:vir:10 103 QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY---------VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIP 173 (392) T ss_pred hhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh---------ceeeeccCCceeEEEEeecCCccceeeccccccc Confidence 111111121 133467888877777766666655321 111112334434443332222 334455543321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) .. +..+-.+..-..++.+.-..++++...-+.-|-...+.+++++...+..+..++... . T Consensus 174 ~~--~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~-----------g------- 233 (392) T protein:vir:10 174 ET--DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI-----------E------- 233 (392) T ss_pred cc--ccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------c------- Confidence 11 112222323334455555666666544444456677888887665555444433210 0 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHH-HhccccCceeEEEEccHHHHHHHhcchh--hhcc-cccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF-TMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQL 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~-~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~ 234 (367) ++......+++.+.++.. .+-.....-..++||+..+..|++..-- .++- ..-... 
T Consensus 234 --------------------~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 234 --------------------KLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred --------------------cccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 011123467788888874 4444444457799999999999875311 1111 111112 Q ss_pred cchhhcCcEEEE-eCCCcccC-CCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEe Q lcl|Aclame:pro 235 TIPTYMGKVVIV-DDGMPVFG-TGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIV 307 (367) Q Consensus 235 ~i~t~~G~~Viv-dD~~pv~~-t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~ 307 (367) .-++++|+++|+ +|.+++.. ....+.+ +++||. -++.+..-. ...+++++.....-..++..+....+ -++ T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~-~~~~gdfs~~~~i~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKA-PLIIGDLKEAIVLFKRE-DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCce-EEEEEehhceEEEEeec-ceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 236789976555 44443322 2223332 455553 122222211 12233333211111122333333222 355 Q ss_pred eeeeeeecccccccccccccc Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSP 328 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~ 328 (367) ||.+|..-.-...+|+.+..| T Consensus 372 ~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cccceEEEEecccccccCCCC Confidence 888887755554555444333 No 104 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.15 E-value=8.6e-07 Score=53.84 Aligned_cols=277 Identities=9% Similarity=-0.022 Sum_probs=127.2 Q ss_pred CCCcccccc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC-cccccCCCCccc Q lcl|Aclame:pro 1 
MPDFNNQVR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS-LEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g-~~~~~~~~~~~~ 78 (367) +...+..|. =.-.++|+.+.+.+.....+.+.+.+- -.....++...++|++..-++ .+.-+.|+.... T Consensus 103 ~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 173 (392) T protein:vir:10 103 QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY---------VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIP 173 (392) T ss_pred hhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh---------ceeeeccCCceeEEEEeecCCccceeeccccccc Confidence 111111121 133467888877777766666655321 111112334434443332222 334455543321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) .. +..+-.+..-..++.+.-..++++...-+.-|-...+.+++++...+..+..++... . T Consensus 174 ~~--~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~-----------g------- 233 (392) T protein:vir:10 174 ET--DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI-----------E------- 233 (392) T ss_pred cc--ccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------c------- Confidence 11 112222323334455555666666544444456677888887665555444433210 0 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHH-HhccccCceeEEEEccHHHHHHHhcchh--hhcc-cccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF-TMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQL 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~-~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~ 234 (367) ++......+++.+.++.. .+-.....-..++||+..+..|++..-- .++- ..-... 
T Consensus 234 --------------------~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 234 --------------------KLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred --------------------cccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 011123467788888874 4444444457799999999999875311 1111 111112 Q ss_pred cchhhcCcEEEE-eCCCcccC-CCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEe Q lcl|Aclame:pro 235 TIPTYMGKVVIV-DDGMPVFG-TGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIV 307 (367) Q Consensus 235 ~i~t~~G~~Viv-dD~~pv~~-t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~ 307 (367) .-++++|+++|+ +|.+++.. ....+.+ +++||. -++.+..-. ...+++++.....-..++..+....+ -++ T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~-~~~~gdfs~~~~i~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKA-PLIIGDLKEAIVLFKRE-DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCce-EEEEEehhceEEEEeec-ceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 236789976555 44443322 2223332 455553 122222211 12233333211111122333333222 355 Q ss_pred eeeeeeecccccccccccccc Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSP 328 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~ 328 (367) ||.+|..-.-...+|+.+..| T Consensus 372 ~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cccceEEEEecccccccCCCC Confidence 888887755554555444333 No 105 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.15 E-value=8.6e-07 Score=53.84 Aligned_cols=277 Identities=9% Similarity=-0.022 Sum_probs=127.2 Q ss_pred CCCcccccc-ceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC-cccccCCCCccc Q lcl|Aclame:pro 1 
MPDFNNQVR-LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS-LEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~-l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g-~~~~~~~~~~~~ 78 (367) +...+..|. =.-.++|+.+.+.+.....+.+.+.+- -.....++...++|++..-++ .+.-+.|+.... T Consensus 103 ~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 173 (392) T protein:vir:10 103 QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY---------VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIP 173 (392) T ss_pred hhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh---------ceeeeccCCceeEEEEeecCCccceeeccccccc Confidence 111111121 133467888877777766666655321 111112334434443332222 334455543321 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) .. +..+-.+..-..++.+.-..++++...-+.-|-...+.+++++...+..+..++... . T Consensus 174 ~~--~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~-----------g------- 233 (392) T protein:vir:10 174 ET--DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI-----------E------- 233 (392) T ss_pred cc--ccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------c------- Confidence 11 112222323334455555666666544444456677888887665555444433210 0 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHH-HhccccCceeEEEEccHHHHHHHhcchh--hhcc-cccccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF-TMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQL 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~-~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~ 234 (367) ++......+++.+.++.. .+-.....-..++||+..+..|++..-- .++- ..-... 
T Consensus 234 --------------------~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~ 293 (392) T protein:vir:10 234 --------------------KLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK 293 (392) T ss_pred --------------------cccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCC Confidence 011123467788888874 4444444457799999999999875311 1111 111112 Q ss_pred cchhhcCcEEEE-eCCCcccC-CCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEe Q lcl|Aclame:pro 235 TIPTYMGKVVIV-DDGMPVFG-TGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIV 307 (367) Q Consensus 235 ~i~t~~G~~Viv-dD~~pv~~-t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~ 307 (367) .-++++|+++|+ +|.+++.. ....+.+ +++||. -++.+..-. ...+++++.....-..++..+....+ -++ T Consensus 294 ~~~tllG~~~v~~~~~~~~~~~~~~~~~~-~~~~gdfs~~~~i~~~~-~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 294 NKKLFAGTNPVVVVSNRFLKSKGTTAKKA-PLIIGDLKEAIVLFKRE-DMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred ccccccCcccEEEecccccCCCcccCCce-EEEEEehhceEEEEeec-ceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 236789976555 44443322 2223332 455553 122222211 12233333211111122333333222 355 Q ss_pred eeeeeeecccccccccccccc Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSP 328 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~ 328 (367) ||.+|..-.-...+|+.+..| T Consensus 372 ~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cccceEEEEecccccccCCCC Confidence 888887755554555444333 No 106 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.13 E-value=4.5e-07 Score=55.36 Aligned_cols=275 Identities=12% Similarity=0.008 Sum_probs=129.2 Q ss_pred CCCcccc-ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeee--------eccCCCccccc Q lcl|Aclame:pro 1 
MPDFNNQ-VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPF--------WRDLDSLEPNY 71 (367) Q Consensus 1 Ma~~~~~-T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~--------~~~l~g~~~~~ 71 (367) +...... +.-...+.|+.+...+.........+.+ .+ ......+..+++|. |... +.+.-+ T Consensus 121 ~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~--~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v 190 (419) T protein:vir:94 121 RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVAD--LL-------DQQNADYNVLEYIRDTSGTAGAGSTW-NKAAVV 190 (419) T ss_pred cccccccccCCcccccchhhhHHHHHHHhhhhhhhh--cc-------eeeeccCCceeeeeeccccccccccC-ccccee Confidence 2211111 1223356777777766543322221100 11 11122344555543 3322 233444 Q ss_pred CCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH-----HHHHHHhhh Q lcl|Aclame:pro 72 GSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA-----MAVGVYKSN 146 (367) Q Consensus 72 ~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla-----~l~Gvf~~~ 146 (367) .|+.. ++..+++-.+.....++.+.-..++.+...-+ .+....|.+++++.+.+..++.+|. -.+|++... T Consensus 191 ~Eg~~---~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~ 266 (419) T protein:vir:94 191 PEGTA---KPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTP 266 (419) T ss_pred cCCcc---ccccccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccc Confidence 55542 33344554555555666676677777655533 3566778888887777777766652 111222111 Q ss_pred hhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh- Q lcl|Aclame:pro 147 LAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE- 225 (367) Q Consensus 147 ~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~- 225 (367) .. ........+........++.+.++...+-.....-.+++||+..+..|++..--. 
T Consensus 267 ~~----------------------~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~ 324 (419) T protein:vir:94 267 GI----------------------GTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS 324 (419) T ss_pred cc----------------------ccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCC Confidence 00 0000001111122335578899998877555556668999999999988664221 Q ss_pred --h-cccccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEe---cceeeeeccCCCcceeeeeehhhcCCceeEEE Q lcl|Aclame:pro 226 --F-IPDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFG---GAAFGYADGAPQVPVAVGRRELRGNGSGLEYI 299 (367) Q Consensus 226 --~-~~~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~---~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l 299 (367) + ++..-....-.+++|++|++++.||-. + ++|| .+...+.... ..+++.+.....-..++..+ T Consensus 325 ~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~------~---~~~gd~~~~~~~~~~~~--~~v~~~~~~~~~~~~~~~~~ 393 (419) T protein:vir:94 325 GVFRVIANVQGEATPRIWGLNVVSTVAIAQG------T---ALVGGFRQGATLWSRQG--ITVLMTDSHADFFTANTLVI 393 (419) T ss_pred CceeecCCcccCCCccccceeeEEcCCCCCc------c---EEEeeccceEEEEEecc--eEEEEeccccchhhcCcEEE Confidence 1 121111233568999999999999842 1 2222 2222221111 11222222211111233333 Q ss_pred EEccEE---EeeeeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 300 LERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 300 ~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) ....++ ++||.+|..-. 
-...|| T Consensus 394 r~~~r~d~~v~~~~a~~~~~-----------------~~aa~~ 419 (419) T protein:vir:94 394 LAEFRANLAVYQPKAFVRVT-----------------FAAATT 419 (419) T ss_pred EEEEeeccEEeccccEEEEE-----------------eccCCC Confidence 332222 34566655421 122344 No 107 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.13 E-value=3.6e-07 Score=55.92 Aligned_cols=295 Identities=11% Similarity=0.030 Sum_probs=152.4 Q ss_pred CCCcccc----ccc--e------e-ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCc Q lcl|Aclame:pro 1 MPDFNNQ----VRL--V------D-AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL 67 (367) Q Consensus 1 Ma~~~~~----T~l--~------d-~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~ 67 (367) ||....- |+. + | ++. |+|..-|.....+++.| .+.-...+ + .+|+++.+|+.+...- T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~------~~~~~~rt-i-~~G~sv~~~~iG~~~~- 70 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVT------MNKHLVRS-I-QSGKSAQFPVLGRTKA- 70 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhh------hhhhhhee-c-cccceEEeeeccceeE- Confidence 6531110 221 1 2 555 88888887777777666 33332222 2 4799999999987742 Q ss_pred ccccCCCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhh Q lcl|Aclame:pro 68 EPNYGSDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSN 146 (367) Q Consensus 68 ~~~~~~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~ 146 (367) ..+..+++... +...+...+.+-+|=. .--.+.+.|+-..-+-.|++.+++++.+...++..++.++..|...-+.. 
T Consensus 71 -~~~~~G~~l~~-~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~ 148 (347) T protein:vir:94 71 -AYLQPGENLDD-KRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLP 148 (347) T ss_pred -eeeecCcCCCC-CcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 33333332111 1112333322222211 13345678998888889999999999999999988877765443322221 Q ss_pred hhhhhhhhhhhhhhhhhhhcchhhcceee---cCcccch--hhcccHHHHHHHHHHhccc--cCceeEEEEccHHHHHHH Q lcl|Aclame:pro 147 LAGNFATIKTRGRVPAEVLGTAGDMVIDI---SGQTNPA--DAVFNREAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMT 219 (367) Q Consensus 147 ~a~~~~~~~~~~~~~a~~~~~~~~~v~di---sa~t~~a--~~~~s~~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L~ 219 (367) .+...... .......+.+ ...+++. .+.--++.|.+|.++|-+. .+.=..+++.+++|..|. T Consensus 149 ~~~~~~~~-----------g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LL 217 (347) T protein:vir:94 149 TANNENIA-----------GLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAIL 217 (347) T ss_pred cccccccc-----------cCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHH Confidence 11111100 0001111111 1111110 0111245567777776543 223478888999999998 Q ss_pred hcchhhhcc-ccc---ccccchhhcCcEEEEeCCCcccCC--------------------CCCceEE-------EEEEec Q lcl|Aclame:pro 220 NNDEIEFIP-DSK---GQLTIPTYMGKVVIVDDGMPVFGT--------------------GADKTYL-------SILFGG 268 (367) Q Consensus 220 k~~li~~~~-~~~---g~~~i~t~~G~~VivdD~~pv~~t--------------------~~~~~yt-------tyl~~~ 268 (367) +.....+.. .+. ..-.|++++|++|+.+..+|.... +..++|. 
..+|-+ T Consensus 218 k~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~ 297 (347) T protein:vir:94 218 AALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHR 297 (347) T ss_pred HhhcccccccccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEech Confidence 752111111 111 123588999999999999996431 1123452 366666 Q ss_pred ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCC Q lcl|Aclame:pro 269 AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAI 338 (367) Q Consensus 269 GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sP 338 (367) -|++.....+. ..|..|++... .+.+..++.|. .....|.-+-++. -++. T Consensus 298 ~A~~tv~~~~~-~~e~~~~~~~~----~~~i~~~~a~G--~g~~rPe~a~~i~-------------~~~a 347 (347) T protein:vir:94 298 SAVGTVKLKDM-ALERARRANFQ----ADQIIAKYAMG--HGGLRPEACGALV-------------FKKA 347 (347) T ss_pred hhhhhhhhccc-ceeeeechhhh----hhhhhhhhhhc--CcccccceeEEEE-------------ecCC Confidence 67665544432 36777876653 24555554443 3334443332110 0011 No 108 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.07 E-value=4.1e-07 Score=55.59 Aligned_cols=284 Identities=11% Similarity=0.055 Sum_probs=132.7 Q ss_pred CCCcccc-ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQ-VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~-T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) -...+.. +.-..++.||+..+ +.+...+.+.+.+ +......++..+++|.+..- ..+.-+.|+. . 
T Consensus 17 ~~a~~~~~~~~g~~ip~~~~~~-ii~~~~~~s~i~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~---~ 82 (326) T protein:vir:42 17 PKVAQTGDSMFEGYLEPEQAQD-YFAEAEKISIVQQ---------FAQKIPMGTTGQKIPHWTGD-VSASWIGEGD---M 82 (326) T ss_pred hhheeccccCCcceechhhHHH-HHHHHHhcchhhh---------hcceeeccCCceEEEEEeCC-cceEEecCCc---c Confidence 0000000 11234666665544 4444444443322 22223346778999988754 4455566664 4 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) ++..+++-.+.....++.+..+.++++...-+..|..+.+.+++++...+..++.+| .| +........... T Consensus 83 ~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l---~G---~gs~~p~gi~~~--- 153 (326) T protein:vir:42 83 KPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAI---NG---TDSPFPTFLAQT--- 153 (326) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhh---cc---cCCCcccccccc--- Confidence 555667777777788889999999998888788888999999999888887777665 22 110000000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHH-HHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hc--ccc-c-- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREA-FVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FI--PDS-K-- 231 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~-l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~--~~~-~-- 231 (367) . ........++ ++........+. +.++..........-..++||++.+..|++..--+ ++ ... . 
T Consensus 154 ------~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~ 225 (326) T protein:vir:42 154 ------T-KEVSLVDPDG-TGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEE 225 (326) T ss_pred ------c-cccceeeccc-ccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCc Confidence 0 0000011111 110011111222 33444444444445667999999999998753111 11 110 1 Q ss_pred -ccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCC------------ceeE Q lcl|Aclame:pro 232 -GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNG------------SGLE 297 (367) Q Consensus 232 -g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~------------~g~~ 297 (367) ......+++|++|++++.+|-.. . ..+||.=+ +.++... ...+++.++.....+ .++. T Consensus 226 ~~~~~~~~l~G~pv~~~~~~~~~~------~-~~~~Gd~s~~~~~~~~-~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~ 297 (326) T protein:vir:42 226 NSPFRLGRIVARPTILSDHVASGT------V-VGYQGDFRQLVWGQVG-GLSFDVTDQATLNLGTPQAPNFVSLWQHNLV 297 (326) T ss_pred cccccCceeeeeeEEEcCCCCCCc------e-EEEEeecceEEEEEec-ceEEEEeecceeeecccccccchhhhhcCcE Confidence 11234579999999999998421 1 11122210 1111111 112333333211100 0111 Q ss_pred EEEEc---cEEEeeeeeeeecccccccccccccccccccccCC Q lcl|Aclame:pro 298 YILER---KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) Q Consensus 298 ~l~~r---~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~s 337 (367) .+-.. 
..-++||..|.--..- +.+++ T Consensus 298 ~~r~~~~~d~~v~~~~a~~~l~~~--------------~~~~~ 326 (326) T protein:vir:42 298 AVRVEAEYAFHCNDKDAFVKLTNV--------------DATEA 326 (326) T ss_pred EEEEEEEeccEEecccceEEEeec--------------cccCC Confidence 11111 1123344444211100 01111 No 109 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.05 E-value=9.8e-07 Score=53.52 Aligned_cols=289 Identities=12% Similarity=0.054 Sum_probs=122.9 Q ss_pred CCCccccccce----eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFNNQVRLV----DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T~l~----d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) +...+..+..+ -.++|+.+..-+.+.+.+.+-+.+-+. .........+++|.+..- ..+.-+.|+.. T Consensus 126 ~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~--------~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~ 196 (435) T protein:vir:14 126 EEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGA--------RTLPLSNGNITIPRLKGG-AIVGYIGADTD 196 (435) T ss_pred hhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcc--------eeeecCCCceEEEEEeCC-cceeeeccCcc Confidence 00011111111 135677766555555444443322111 011122336888988643 34555666543 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccH--HHHHHHHHHHHHhhhhhHHHHH------HHHHHHhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNP--MTRIRNRFGVYWTRQWQRRIIA------MAVGVYKSNLA 148 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DP--m~~i~~qia~yw~~~~q~~lla------~l~Gvf~~~~a 148 (367) ++..+.+-.+.....++.+....++++...-++.|| ...+.+++++.+.+..++.++. ..+|++.... 
T Consensus 197 ---~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~- 272 (435) T protein:vir:14 197 ---IPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWAL- 272 (435) T ss_pred ---ccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccc- Confidence 333444444444556667777788887766666554 3678999998888887776651 1111111100 Q ss_pred hhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHHHHhcchhhh Q lcl|Aclame:pro 149 GNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNNDEIEF 226 (367) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~L~k~~li~~ 226 (367) ...+...+. + .........+.+....+-.... .-.+++||+..+..|++.. T Consensus 273 --------------------~~~~~~~~~--~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk---- 325 (435) T protein:vir:14 273 --------------------PSNVITASD--A-STLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLR---- 325 (435) T ss_pred --------------------ccceecccc--c-cchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhh---- Confidence 000110000 0 0111112233444333322111 1246899999999988763 Q ss_pred cccccccc-----cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeE-EE Q lcl|Aclame:pro 227 IPDSKGQL-----TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLE-YI 299 (367) Q Consensus 227 ~~~~~g~~-----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~-~l 299 (367) +.+|.. .=++++|++|++++.||..... .+.-...+||.=+ +.++.-. 
...+++.++..-.++.|.- .+ T Consensus 326 --d~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~-~~~~~~i~~gd~s~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~ 401 (435) T protein:vir:14 326 --DGNGNKVYPELANGMLKGYPVGKTTQVPINLGE-TGKESEIYFTDFGDVFIGEEE-TLEIDYSKEATYKDADGHMVSA 401 (435) T ss_pred --ccCCceeccCCCCCeeecceeEeeccccccccC-CCccceEEEeecccEEEEEec-ccEEEEeccccccccccchhhh Confidence 233322 2257899999999999975322 2222234444211 1122211 1223333332211111110 00 Q ss_pred EEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCcccee Q lcl|Aclame:pro 300 LERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER 351 (367) Q Consensus 300 ~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~ 351 (367) +.+. .+..+.+.+-+-.+ ..|.---.-++.+|-- T Consensus 402 f~~~--~~~~r~~~r~d~~~----------------~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 402 FQRD--QTLIRVIAKNDFGP----------------RHVESIAVLAGVAWGA 435 (435) T ss_pred hhcC--hhheeeeeeeCcee----------------ecccceEEEecCCCCC Confidence 1100 01111111111000 0111111222222222 No 110 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.03 E-value=2.1e-06 Score=51.69 Aligned_cols=280 Identities=10% Similarity=0.027 Sum_probs=129.9 Q ss_pred CCCccccccce--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) |.. +.-+ -..+|+.+.+-+...+.+.+.|.+ . ......+|..+.+|....-. .+.-+.|+.... 
T Consensus 107 ~~~----~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~------~---~~~~~~~~~~~~~~~~~~~~-~a~wv~E~~~~~ 172 (401) T protein:vir:44 107 LQV----GTDEDGGYAVPEELDRSILSLLKDEVVMRQ------E---ATVITVGGSDYKKLVNLGGT-ASGWVGETDTRS 172 (401) T ss_pred hhc----CCCCCCceeccHhHHHHHHHHHHhhhhhhh------h---ceeeecCCCceEEEEecCCc-cceeeccccccC Confidence 211 1011 134555554444443333332211 1 11112235556666543221 222234443211 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH-----HHHHHHhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA-----MAVGVYKSNLAGNFAT 153 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla-----~l~Gvf~~~~a~~~~~ 153 (367) . .++.+-++.....++.+.-..++++...-+..|-.+.+.++|++.+.+..+..+|. --.|+++......... T Consensus 173 ~--~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~ 250 (401) T protein:vir:44 173 Q--TATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDK 250 (401) T ss_pred c--cccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccc Confidence 1 11122233334445555556677766555556777899999988777776665552 0112221111100000 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhh--hc-ccc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--FI-PDS 230 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~--~~-~~~ 230 (367) . ........+ .++ ....++++.++++...+......-..++||+..+..|++..--+ ++ +.. 
T Consensus 251 ~---------~~~~~~~~~-----~t~-~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~ 315 (401) T protein:vir:44 251 A---------RAFGKLQHI-----VSG-EATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPG 315 (401) T ss_pred c---------ccccccccc-----ccc-cccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCC Confidence 0 000000000 111 22357899999998877555445567999999999998753111 11 111 Q ss_pred cccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---E Q lcl|Aclame:pro 231 KGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---W 305 (367) Q Consensus 231 ~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~ 305 (367) -..-.-.+++|++|+++|.||..++++. +.+||. -++.+.+. ..+++.|++... .++..+....+ - T Consensus 316 ~~~g~~~~l~G~PVv~~~~~p~~~~~~~----~i~~Gd~~~~~~i~~~---~~~~~~~~~~~~--~~~v~~~a~~r~d~~ 386 (401) T protein:vir:44 316 LELGQPSSLAGYGIAENEQMPDIAADAK----AIAFGNFKRGYTIVDR---IGTRILRDPYTN--KPFVGFYTTKRTGGM 386 (401) T ss_pred cCCCCCceecceeeEEecCcCCccCCcc----EEEEeehhccEEEEEe---cceEEeeecccc--CCcEEEEEEEEeccE Confidence 1111235789999999999997654332 233432 23333221 235556665543 34433333222 2 Q ss_pred Eeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 306 IVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 306 ~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) ++||..|..-.-. 
++ T Consensus 387 ~~~~~a~~~l~~~--------------aa 401 (401) T protein:vir:44 387 LVDSQAIKLLKIA--------------AA 401 (401) T ss_pred EecccceEEEEee--------------cC Confidence 4456666543210 01 No 111 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.02 E-value=6.3e-07 Score=54.57 Aligned_cols=300 Identities=11% Similarity=0.024 Sum_probs=150.9 Q ss_pred CCC---cccc-ccc--e------e-ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCc Q lcl|Aclame:pro 1 MPD---FNNQ-VRL--V------D-AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL 67 (367) Q Consensus 1 Ma~---~~~~-T~l--~------d-~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~ 67 (367) ||. .+.. |+. + + +|. |+|...|.....+++.| .+.-...+ + .+|+++.+|..+... T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~------~~~v~~r~-~-~~G~sv~i~~iG~~t-- 69 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVT------MPRHMLRS-I-ASGKSAQFPVIGRTK-- 69 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhh------hhhhcccc-c-cccceeEeeecccee-- Confidence 663 1111 322 1 2 577 99988887776666665 22211111 1 479999999998774 Q ss_pred ccccCCCCccccccccccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhh Q lcl|Aclame:pro 68 EPNYGSDNPNVEAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSN 146 (367) Q Consensus 68 ~~~~~~~~~~~~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~ 146 (367) ...+..+++.+ -++..+...+..-+|= ..--.+.+.|+-..-+-.|++.++.++.+...+++.++.++..+.+..+.. 
T Consensus 70 ~~~~~~g~~l~-~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~ 148 (347) T protein:vir:33 70 AAYLKPGENLD-DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLP 148 (347) T ss_pred eeeecCCCCCC-CCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 23443333211 1122222222211111 112245678888888889999999999999999999988887665443221 Q ss_pred hhhhhhhhhhhhhhhhhhhcchhhcceeecC-cccch--hhcccHHHHHHHHHHhccc--cCceeEEEEccHHHHHHHhc Q lcl|Aclame:pro 147 LAGNFATIKTRGRVPAEVLGTAGDMVIDISG-QTNPA--DAVFNREAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMTNN 221 (367) Q Consensus 147 ~a~~~~~~~~~~~~~a~~~~~~~~~v~disa-~t~~a--~~~~s~~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L~k~ 221 (367) ......... ........+ ...++ ...+. .+..-++.|.+|..+|.+. ...=..++|.|.+|..|.+. T Consensus 149 ~~~~~~~~~-------~~~~~~~~~-~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~ 220 (347) T protein:vir:33 149 DGSNENIEG-------LGKPTVLTL-VKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAA 220 (347) T ss_pred ccccccccc-------ccccccccc-cccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcc Confidence 111000000 000000000 11111 11110 0112256677787887643 22347899999999999887 Q ss_pred c-hhhhcccccc---cccchhhcCcEEEEeCCCcccCCC---------CCce---------------EEEEEEecceeee Q lcl|Aclame:pro 222 D-EIEFIPDSKG---QLTIPTYMGKVVIVDDGMPVFGTG---------ADKT---------------YLSILFGGAAFGY 273 (367) Q Consensus 222 ~-li~~~~~~~g---~~~i~t~~G~~VivdD~~pv~~t~---------~~~~---------------yttyl~~~GAi~~ 273 (367) . +++......+ .-.|+.++|.+|+.+..+|..... .... ...++|-+.|++. 
T Consensus 221 ~~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~ 300 (347) T protein:vir:33 221 LMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGT 300 (347) T ss_pred ccccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhhee Confidence 3 3322111111 235788999999999999974211 0000 1235667777766 Q ss_pred eccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 274 ADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 274 ~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) -...+ ..+|..|++... + |.+. .+|.+|..--. |..-..- +-|--+| T Consensus 301 v~~~~-~~~e~~r~~~~~-~---d~i~-----~~~~~G~~vlr-----P~~av~i-------~~~~~~~ 347 (347) T protein:vir:33 301 VKLKD-LALERARRANYQ-A---DQII-----AKYAMGHGGLR-----PEAAGAI-------VLPKVSE 347 (347) T ss_pred eeeec-eeeeeccchhhh-h---Hhhh-----hhhhcCCceec-----ccceEEE-------ecCCCCC Confidence 55443 246777776543 1 2211 22333333211 1110000 0111111 No 112 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.01 E-value=4.9e-07 Score=55.16 Aligned_cols=294 Identities=11% Similarity=-0.020 Sum_probs=139.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) +.. ..+.=...++|+.+..-+.+.+.+.+.+.+ . -.....++....+|.+... +.+.-+.|+.. 
+ T Consensus 84 ~~~--~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~------~---~~~~~~~~~~~~i~~~~~~-~~a~~~~E~~~---~ 148 (390) T protein:vir:40 84 IAG--NGFAGVTALLPPTVFERVFEDLTVEHPLLS------K---INFVNTTATTEWIISVGDV-ATAWWGPLCAE---I 148 (390) T ss_pred Hhc--cCcccCcccccHHHHHHHHHHHHhhhhhhh------h---ceeeecCCceeEEEEEcCC-cceeeeccccc---c Confidence 221 112223456777665555555444444422 1 1112235667788877643 44444444322 2 Q ss_pred c-ccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 P-IDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t-~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) + ..+.+-++..-..++.+.-+.++++...-+..|-.+.+.+++++.+.+..++.+|. |- ......+....... T Consensus 149 ~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~---G~---G~~~P~Gil~~~~~ 222 (390) T protein:vir:40 149 KEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVN---GS---GKDQPIGMMRDLNN 222 (390) T ss_pred CccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc---cc---CCCccceeeecccc Confidence 1 12223233333444555556777777777777778899999998888877765552 21 00000000000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHH----HHHHhccc---cCceeEEEEccHHHHHHHhcchhhhcccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVD----AAFTMGDH---VGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~----A~~~~GD~---~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g 232 (367) . ......... ...++.....+ -...+++. ...-.+++||+..+..+.+. 
+...++.+| T Consensus 223 ~-------~~~~~~~~~------~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~--~~~~~d~~G 287 (390) T protein:vir:40 223 V-------TAGEHPVKT------ATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYA--ATSYMTPQG 287 (390) T ss_pred c-------ccccccccc------ccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHH--HhhccCCCC Confidence 0 000000000 01122211111 11223332 23346789999886543322 233455555 Q ss_pred ccc-chhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Ee Q lcl|Aclame:pro 233 QLT-IPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IV 307 (367) Q Consensus 233 ~~~-i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~ 307 (367) ... -....|++||+++.||-.. .+||.-. +.+.. ...++++++...+-..++..+....++ ++ T Consensus 288 ~~v~~~~~~g~pvv~~~~~p~~~---------i~~Gd~s~~~i~~---~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~ 355 (390) T protein:vir:40 288 VWVTGILPVPLEIVQSVAVPVGK---------AVAGRAKDYFMGI---GSEQVIRTSTEYRLLDDETLYYAKQYANGRPK 355 (390) T ss_pred ccccccCCCceeEEEcCCCCCCc---------EEEEeeceEEEEe---ecceEEEecchhhhhcCcEEEEEEEEeCCEEe Confidence 422 1234799999999998421 2333221 11111 123445555443322455555555444 44 Q ss_pred eeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) ||..|.--+-.-.++......+.++..+.+||.+| T Consensus 356 ~~~A~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:40 356 DNSSFLVFDITGLEGSPAIDVNVVNNATPSETPAE 390 (390) T ss_pred cccceEEEEeeccCCCCCCCcceeeCCCCCCCCCC Confidence 55555433322223344555677777888888888 No 113 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.00 E-value=1.1e-06 Score=53.17 Aligned_cols=283 Identities=8% Similarity=-0.018 Sum_probs=123.0 Q ss_pred 
CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCc--eEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGR--LINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~--~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) |.. ..+.-.-..+|+.+..-+.....+.+.|.+ . -.....++. .+.+|..... .....+.|+.... T Consensus 110 ~~~--~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~------l---~~~~~~~~~~g~~~~~~~~~~-~~~~~v~e~~~~~ 177 (404) T protein:vir:10 110 ISE--NIDEDGGYAVPEDIQTKINTRLKDTTDLYN------M---VDYEPVFTRSGSRTYEKRSKQ-KPMKPLSENQQIP 177 (404) T ss_pred hcc--ccCCCCceeechhHHHHHHHHHhhhhhHhh------h---hceeeccCCccceEEEEecCC-cceeecccccccc Confidence 211 001111235676665555554444443311 1 111111233 3444443333 2333444443211 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) . +...++-++.....++.+.-..+++....-+..+-...+.+++++...+..++.+| .| +....... .+. T Consensus 178 ~-~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il---~G---~g~~~~~~--gi~- 247 (404) T protein:vir:10 178 T-NGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEIL---YG---AGGDEHAT--GIM- 247 (404) T ss_pred c-cccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHh---hc---CCCCCccc--cee- Confidence 1 00112223333445555666677776655555566778889998888887777554 22 11000000 000 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchh--hhccccc-ccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIPDSK-GQL 234 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~-g~~ 234 (367) +...+. +........++.+.+++.. +-.....-.+++||+..+..|++..-- .|+-..+ ... 
T Consensus 248 ------------~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~ 313 (404) T protein:vir:10 248 ------------TANKFK--KITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDP 313 (404) T ss_pred ------------eccccc--eeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC Confidence 000000 1111223457777777653 222222235689999999999986311 1111111 112 Q ss_pred cchhhcCcEEEE-eCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEee Q lcl|Aclame:pro 235 TIPTYMGKVVIV-DDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVH 308 (367) Q Consensus 235 ~i~t~~G~~Viv-dD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~h 308 (367) .-.+++|++|++ ++.+|-.+ .+. .+++||. .++.+.... ...+++.+++......++..+....+ -+.| T Consensus 314 ~~~~l~G~PV~~~~~~~~~~~---~~~-~~~~~gd~s~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~ 388 (404) T protein:vir:10 314 TQYRFLGLPVIELPNDLLLST---ESA-IPVLLGDTKEAYKYVSDG-AYELATTNIGAGAFETNTTKARIIMRIDGNVKD 388 (404) T ss_pred CCccccceeeEEecccccCCC---CCc-cEEEEEeccccEEEEEec-ceEEEEeccccchhhcCceEEEEEEeeccEEec Confidence 335799999885 45454322 122 2456652 233333222 22344444432221123333333222 3457 Q ss_pred eeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 309 PGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 309 p~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) |.+|..-.-. ...+|. 
T Consensus 389 ~~a~~~~~~~---------------~aa~~~ 404 (404) T protein:vir:10 389 SEALLIAEIP---------------VESVQA 404 (404) T ss_pred ccceEEEEee---------------cccCCC Confidence 7777654322 122333 No 114 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.98 E-value=1.3e-06 Score=52.78 Aligned_cols=286 Identities=13% Similarity=0.035 Sum_probs=126.5 Q ss_pred CCCccccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) ++- +.-| .-.-.++|+.+...+.+.+.+.+-+.+-+ + .........+.+|.+..- ..+.-+.|++. T Consensus 130 ~~~-~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~-----~---~~v~~~~~~~~~p~~~~~-~~a~~v~E~~~--- 196 (435) T protein:vir:80 130 MSL-NTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLG-----A---RTLPLSNGNITIPRLKGG-AIVGYIGADTD--- 196 (435) T ss_pred hhh-cccCCCCCccccchhHHHHHHHHHhhhchhhhcc-----c---eeeecCCCceEEEEEeCC-cceeeeccCcc--- Confidence 111 0001 11224667777666655554444432211 0 112222335888988643 34445666643 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcc--cHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGS--NPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~--DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) ++..+.+-.+.....++.+.-..+++....-++. +-.+.|.+++++...+..+..+|. | + ..++. -..+. 
T Consensus 197 ~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~---G---~-G~~~~-p~Gi~ 268 (435) T protein:vir:80 197 IPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR---D---D-GTANT-PKGLR 268 (435) T ss_pred ccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhc---c---C-CCCCc-cccee Confidence 3334445445555666777777888877665654 445789999988887777765552 2 1 11000 00000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccc--cCceeEEEEccHHHHHHHhcchh--hhccccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMTNNDEI--EFIPDSKGQ 233 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L~k~~li--~~~~~~~g~ 233 (367) . .....++...+. +. ........+.++...+-.. ...-..++||+..+..|++..-- .++-+.. T Consensus 269 ~-------~~~~~~~~~~~~--~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~-- 336 (435) T protein:vir:80 269 F-------WALPGNVITASD--GS-TLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL-- 336 (435) T ss_pred e-------cccccceeeccc--cc-chhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCC-- Confidence 0 000011111111 10 0111123455554443221 11235689999999998775311 1221111 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCcee---------EEEEEcc Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGL---------EYILERK 303 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~---------~~l~~r~ 303 (367) .=++++|++|+++|.||.......++ ...+||.=+ +.++... ...+++.++..-..+.+. ..+-... 
T Consensus 337 -~~~~l~G~pv~~~~~~p~~~~~~~~~-~~i~~gd~s~~~i~~~~-~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~ 413 (435) T protein:vir:80 337 -ANGMLKGYPVGKTTQVPINLGEAGKE-SEIYFTDFGDVFIGEEE-TLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIA 413 (435) T ss_pred -CCCeEeeeeeEEeccccccccCCCCc-ceEEEEEcccEEEEeec-ceEEEEeccccccccccchhhhhhcCcceeeeee Confidence 12478999999999999754322222 223343211 2222221 223444444321111111 1111111 Q ss_pred E---EEeeeeeeeecccccccccccccccccccccCCCChHHhcCCcccee Q lcl|Aclame:pro 304 E---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER 351 (367) Q Consensus 304 ~---~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~ 351 (367) + -+.||..|..... .+|-- T Consensus 414 r~d~~~~~~~a~~~l~~-----------------------------~~~~~ 435 (435) T protein:vir:80 414 KNDFGPRHVESIAVLSG-----------------------------VAWGA 435 (435) T ss_pred eeCcEeecccceEEEec-----------------------------cCCCC Confidence 1 2334554443221 11111 No 115 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.97 E-value=1e-06 Score=53.43 Aligned_cols=277 Identities=9% Similarity=0.017 Sum_probs=126.0 Q ss_pred CCC--cccccccee--ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPD--FNNQVRLVD--AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~--~~~~T~l~d--~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) +.. ....+.-.+ .++|+.+..-+...+.+.+.|.+ ...+-. ...+..+.+|.-......+..+.|+.. 
T Consensus 111 ~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~------~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~E~~~ 182 (409) T protein:vir:45 111 LRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIAS------VAQILT--TSDGRTMEWATADGTSEVGVLLGENEE 182 (409) T ss_pred HHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhh------hceeee--cCCCceEEEEeeccCcccccccccccc Confidence 100 011111121 35677665544444444433321 111100 124556666665544334445555432 Q ss_pred cccccccccchhhhhhhhhHhhcc-cchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH--------HHHHHHhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKA-YGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA--------MAVGVYKSNL 147 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg-~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla--------~l~Gvf~~~~ 147 (367) .+...++-.+..-..++...+ +.+++....-+.-|-.+.+.++|++-+.+..+..+|. -.+|+++... T Consensus 183 ---~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~ 259 (409) T protein:vir:45 183 ---AGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVT 259 (409) T ss_pred ---ccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccc Confidence 222223222222222333323 3577776555555777889999988777777666552 0112211100 Q ss_pred hhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCce--eEEEEccHHHHHHHhcc--h Q lcl|Aclame:pro 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSI--AAIAVHSMVYKRMTNND--E 223 (367) Q Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l--~~~vmhS~v~~~L~k~~--l 223 (367) . ... ......++++.+.++...+......- -+++||+..+..|++.. . 
T Consensus 260 ~---------------------------~~~-~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~ 311 (409) T protein:vir:45 260 G---------------------------TTQ-TAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQ 311 (409) T ss_pred c---------------------------ccc-cccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCC Confidence 0 000 11223578889999888775543333 35688999999988753 1 Q ss_pred hhhcc-cccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec-ceeeeeccCCCcceeeeeehhhcCCceeEEEEE Q lcl|Aclame:pro 224 IEFIP-DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGNGSGLEYILE 301 (367) Q Consensus 224 i~~~~-~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~-GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~ 301 (367) =.|+- ..-....-.+++|++|+++|.||..++ +.+ +.+||. .-+.+...++ ..++..+|+... .++..+.. T Consensus 312 G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~---~~~-~i~~Gd~~~~~i~~~~~-~~~~~~~d~~~~--~~~~~~~~ 384 (409) T protein:vir:45 312 GRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGA---GKK-FMFCGDFDRFIIRRVRY-MILKRLVERYAE--YDQTGFLA 384 (409) T ss_pred CceeeccCcCCCCCceecceeeEEecCcCCccC---Ccc-EEEEeehhhhheeeccc-eEEEEeeccccc--CCcEEEEE Confidence 11211 110111235799999999999996543 333 244443 1112222111 123333444322 24444444 Q ss_pred ccEEEe---eeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 302 RKEWIV---HPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 302 r~~~~~---hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) ..+|-. +|..|.-.. + .+. 
+++ T Consensus 385 ~~r~d~~~~~~~A~~~l~--~---------k~s-~~~ 409 (409) T protein:vir:45 385 FHRFDCILEDTSAIKALV--G---------KGS-VGG 409 (409) T ss_pred EEEeccEeechhheEEEE--e---------ccC-CCC Confidence 444432 344333211 1 111 111 No 116 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.96 E-value=8.6e-07 Score=53.85 Aligned_cols=286 Identities=13% Similarity=0.060 Sum_probs=128.2 Q ss_pred CCCc---cccccc---eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCC Q lcl|Aclame:pro 1 MPDF---NNQVRL---VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSD 74 (367) Q Consensus 1 Ma~~---~~~T~l---~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~ 74 (367) +... ...+.. .-.++|+-+.+-+.+.+.+.+-+.+-|. ..+..+...+++|.+..- ..+.-+.|+ T Consensus 118 ~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~--------~~~~~~~g~~~~p~~~~~-~~a~~v~Eg 188 (428) T protein:vir:10 118 LNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGA--------RSIPLPNGNMSLPRLAGG-ATASYTGEN 188 (428) T ss_pred hhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcc--------eeeecCCcceEEEEEeCC-cceeeeccC Confidence 1110 000111 1246677665555555544444422211 011122334788877542 344555555 Q ss_pred CccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH------HHHHHHhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA------MAVGVYKSNLA 148 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla------~l~Gvf~~~~a 148 (367) .. ++..+.+-++.....++.+.-..++++...-+..+....|.+++++-..+..++.+|. .-+|+++.... 
T Consensus 189 ~~---~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~ 265 (428) T protein:vir:10 189 QD---AKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQ 265 (428) T ss_pred cc---ccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccc Confidence 43 3333443333344455566667888876665666777888999998888777765542 01112111100 Q ss_pred hhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHH---HHHHHHHHh---ccccCceeEEEEccHHHHHHHhcc Q lcl|Aclame:pro 149 GNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNRE---AFVDAAFTM---GDHVGSIAAIAVHSMVYKRMTNND 222 (367) Q Consensus 149 ~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~---~l~~A~~~~---GD~~~~l~~~vmhS~v~~~L~k~~ 222 (367) ...+.... ..+..+.+ .+.++...+ +.....-..++||+..+..|++.. T Consensus 266 --------------------~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk 320 (428) T protein:vir:10 266 --------------------WNRLLPWA-----ADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLR 320 (428) T ss_pred --------------------cccccccc-----ccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhh Confidence 00111111 11122333 334443322 222223457899999999988753 Q ss_pred hhhhcccccccc-----cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCcee Q lcl|Aclame:pro 223 EIEFIPDSKGQL-----TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGL 296 (367) Q Consensus 223 li~~~~~~~g~~-----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~ 296 (367) +.+|.. .=++++|++|+++|.||.......++ ..++||.=+ +.++.-+ ...+++.|+..--...+. 
T Consensus 321 ------d~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~-~~i~~gd~s~~~i~~~~-~i~i~~~~~~~~~~~~~~ 392 (428) T protein:vir:10 321 ------DGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKE-SEIYFADFNDVVIGEDG-NMKVDFSKEASYIDTDGK 392 (428) T ss_pred ------ccCCceeccCCCCCeeeceeeEEeccccccccCCCcc-ceEEEEecceEEEEEec-ceEEEeeccccccccccc Confidence 222222 12478999999999999753222222 234444322 2222211 122333333221111111 Q ss_pred E-EEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccc Q lcl|Aclame:pro 297 E-YILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNW 349 (367) Q Consensus 297 ~-~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW 349 (367) - .++.+ -.+..+.+.+-+-. -..|.---+-++.|| T Consensus 393 ~~~~f~~--~~~~~R~~~r~d~~----------------v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 393 LVSAFSR--NQSLIRVVTEHDIG----------------FRHPEGLVLGTGVLF 428 (428) T ss_pred ccchhhc--chhheeeeeeeCce----------------eeccceEEEEeccCC Confidence 0 01111 01111222222111 123444445567777 No 117 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.95 E-value=3.7e-07 Score=55.86 Aligned_cols=269 Identities=11% Similarity=0.003 Sum_probs=127.8 Q ss_pred CCC------ccccc---cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCccccc Q lcl|Aclame:pro 1 MPD------FNNQV---RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNY 71 (367) Q Consensus 1 Ma~------~~~~T---~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~ 71 (367) |+. 
+...+ .-..+|.||++.+-+.+.+.+.+.+.+-|+ ..+......+++|....- ..+.-+ T Consensus 347 ~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~--------~~~~~~~g~~~ip~~~~~-~~a~wv 417 (632) T protein:vir:96 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGA--------RMLPGLVGDVDIPKKTSG-ANFYWI 417 (632) T ss_pred hhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcc--------eEeecCCcceEEEEEeCC-ceeEee Confidence 211 00111 112356667665544444433333322111 111222335778876521 223334 Q ss_pred CCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 72 GSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNF 151 (367) Q Consensus 72 ~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~ 151 (367) .|+.. ++..+++-++.....++.+.-..++.....-+.-|-...|.+.++....+..++.+|. | + .+++. T Consensus 418 ~E~~~---~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~---G---~-G~~~~ 487 (632) T protein:vir:96 418 GEDED---VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT---G---T-GLAND 487 (632) T ss_pred cCCcc---ccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhc---c---c-CCCCc Confidence 45443 3333333333333444445555666665554555556777788876666655554431 1 1 11000 Q ss_pred hhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-C-ceeEEEEccHHHHHHHhcchhhhccc Q lcl|Aclame:pro 152 ATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-G-SIAAIAVHSMVYKRMTNNDEIEFIPD 229 (367) Q Consensus 152 ~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~-~l~~~vmhS~v~~~L~k~~li~~~~~ 229 (367) . ..+ . .... +.-++. +.+.+++..+.++..++.... 
+ .-.+++||+..+..|++..| ++ T Consensus 488 p-~Gi---~-----~~~~--~~~~~~----~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l----~d 548 (632) T protein:vir:96 488 P-VGL---L-----NMTG--VPALTY----PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQV----FD 548 (632) T ss_pred c-cee---e-----eccc--ccceec----ccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhc----cC Confidence 0 000 0 0000 001111 123467888888877764332 2 23478999999998887654 33 Q ss_pred ccccccc--hhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeEEE--EEc-c Q lcl|Aclame:pro 230 SKGQLTI--PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLEYI--LER-K 303 (367) Q Consensus 230 ~~g~~~i--~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~~l--~~r-~ 303 (367) ..|...+ ++++|++|++++.||-.. .+||.-+ +.++.-. .+++.+++...-..|...+ +.| . T Consensus 549 ~~G~~i~~~~~l~G~pv~~s~~ip~~~---------~~~gd~s~~~i~~~~---~~~i~~~~~~~~~~~~v~~~~~~~~d 616 (632) T protein:vir:96 549 NTGERIWQNNEVNGYRAEASNQIPADT---------WIFGDWSQIVIAMWG---VLDLKVDPYTKAASDGLVLRVFQDVD 616 (632) T ss_pred CCCceeecCCeecccceEeccccccCc---------EEEeecceEEEEEec---ceEEEEccccccccCceEEEEEeecC Confidence 3443322 468999999999998431 3343332 1122211 2444555444333343333 333 2 Q ss_pred EEEeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 304 EWIVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 304 ~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) --+.||..|.|.+.. 
+ T Consensus 617 ~~v~~~~af~~~k~~---------------A 632 (632) T protein:vir:96 617 AGVRRKEAFCIAKKG---------------A 632 (632) T ss_pred ceeechhhhhheeec---------------C Confidence 335699999997542 1 No 118 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=97.93 E-value=2.3e-06 Score=51.55 Aligned_cols=305 Identities=12% Similarity=0.060 Sum_probs=146.8 Q ss_pred CCCccc---c-ccceec-cchHHHHHHHhhhhHHhh-hHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCC Q lcl|Aclame:pro 1 MPDFNN---Q-VRLVDA-VIPEVYTSYTAIDRPELT-AFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSD 74 (367) Q Consensus 1 Ma~~~~---~-T~l~d~-i~PEVf~~yv~~~~~~~~-~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~ 74 (367) ||.-.. + |+..-- ..+|+.+-++..-.-+.. +|-.+-++.+.-...+ + .+|+++.+|..+... ...+..+ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~-~-~~G~sv~i~~ig~~t--~~~~~~g 76 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRS-I-ASGKSAQFPVIGRTK--AAYLKPG 76 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhcccccc-c-cccceeEeeecccee--eeeeccC Confidence 776221 1 332211 455555444433222221 2211222222221111 1 479999999998764 3344333 Q ss_pred Cccccccccccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 75 NPNVEAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFAT 153 (367) Q Consensus 75 ~~~~~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~ 153 (367) ++.+ .++..++..+..-+|= ..--++.+.|+-..-+-.|++.++.++.+...+++.++.++..|.+......+..... 
T Consensus 77 ~~l~-~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:15 77 ENLD-DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENI 155 (347) T ss_pred CCCC-CCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 3211 1122233333222221 2233567789988888899999999999999999999999988766543221111100 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchh-hccc----HHHHHHHHHHhccc--cCceeEEEEccHHHHHHHhcc-hhh Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPAD-AVFN----REAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMTNND-EIE 225 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~-~~~s----~~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L~k~~-li~ 225 (367) .. ........... ..+++.. ..-. ++.+.+|.++|-+. ...=..++|.|.+|..|.+.. ++. T Consensus 156 ~~--------~g~~~~~~~~~--~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~ 225 (347) T protein:vir:15 156 EG--------LGKPTVLTLVK--PTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNA 225 (347) T ss_pred cc--------cCccccccccc--cccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccc Confidence 00 00000001111 1111111 1112 44555666677543 223478999999999998873 332 Q ss_pred hcccccc---cccchhhcCcEEEEeCCCcccCCCC-------Cce-----------------EEEEEEecceeeeeccCC Q lcl|Aclame:pro 226 FIPDSKG---QLTIPTYMGKVVIVDDGMPVFGTGA-------DKT-----------------YLSILFGGAAFGYADGAP 278 (367) Q Consensus 226 ~~~~~~g---~~~i~t~~G~~VivdD~~pv~~t~~-------~~~-----------------yttyl~~~GAi~~~~~~~ 278 (367) ....+.+ +-.|+.++|++|+.+..+|....+. ... 
.-.+++-+-|++.....+ T Consensus 226 ~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~ 305 (347) T protein:vir:15 226 ANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKD 305 (347) T ss_pred ccccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeec Confidence 2222211 2357889999999999999643210 000 113455566666655443 Q ss_pred CcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHH Q lcl|Aclame:pro 279 QVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) Q Consensus 279 ~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~ 342 (367) ..+|..|++... + |.+. .+|.+|..--. |..-... +-|--+| T Consensus 306 -~~~e~~~~~~~~-~---d~i~-----~~~~~G~~vlr-----P~~av~~-------~~~~~~~ 347 (347) T protein:vir:15 306 -LALERARRANYQ-A---DQII-----AKYAMGHGGLR-----PEAAGAI-------VLPKVSE 347 (347) T ss_pred -eeeeecccchhh-h---hhhe-----hhhhcCCceec-----cccEEEE-------ecCCCCC Confidence 246666765442 1 2222 22333433221 1110000 0111111 No 119 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.90 E-value=1.7e-06 Score=52.17 Aligned_cols=278 Identities=11% Similarity=0.045 Sum_probs=118.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC---CcccccCCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD---SLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~---g~~~~~~~~~~~ 77 (367) ..-....+.-..-.+|+.+.+-+.+...+.+.+.+ +-....-++..+.+|...... +.+..+.|+... 
T Consensus 116 ~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 186 (413) T protein:vir:81 116 PASTATLTDEFQGGYGTTWNRNIIYRRREKLVVAD---------LMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKK 186 (413) T ss_pred hhhhcccccccccccchhhHHHHHHHHhhhhhHHh---------hcceeeccCCceeEEEeccccccccccceecCcccc Confidence 01111222233445677776666666555554421 111122356677778765432 234455655432 Q ss_pred ccccccccch-hhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGS-GEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 78 ~~~t~~kitt-~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) +-..+.. .......++.+.-..++++...-+ ..-...+.+++++-+.+..++.+|. | + .++.. ...+ T Consensus 187 ---~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~~~l~---G---~-G~~~~-~~Gi 254 (413) T protein:vir:81 187 ---PYMRFADFDIVTESLSKIAGLTKITDEMIEDY-DFLVSYINARLLEELAIEEERQLLL---G---D-GTGNN-LTGL 254 (413) T ss_pred ---cccCcccceeeEeeeeeEEEeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhc---c---C-CCCCc-cccc Confidence 2122211 222333444555567777654433 2344566777776666666554442 1 1 11100 0000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhc-cccCceeEEEEccHHHHHHHhcchhh--hcc----- Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMG-DHVGSIAAIAVHSMVYKRMTNNDEIE--FIP----- 228 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~G-D~~~~l~~~vmhS~v~~~L~k~~li~--~~~----- 228 (367) +.. ..+..+... 
.....++.+.++...+- ...-.-.+++||+..+..|++..--+ |+- T Consensus 255 ---~~~-------~~~~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~ 320 (413) T protein:vir:81 255 ---LKR-------DGIQTLAVS----NKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQ 320 (413) T ss_pred ---ccc-------ccccccccc----ccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceecccccc Confidence 000 000000000 01123455666665432 12222346999999999988764111 111 Q ss_pred cccc---cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEE-- Q lcl|Aclame:pro 229 DSKG---QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILE-- 301 (367) Q Consensus 229 ~~~g---~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~-- 301 (367) ...+ ...-++++|++|+++|.||-. ..+||.= ++.+.... ...++..+.....-..++..+.. T Consensus 321 ~~~~~~~~~~~~~l~G~pv~~s~~~~~~---------~~~~gd~~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~r~~~ 390 (413) T protein:vir:81 321 GQYGSGGIMLDPAPWGLRTVQSQVVPVG---------KPVVGAFRSAASVLRKG-GVRIDSTNTNVDDFENNLITVRAEE 390 (413) T ss_pred ccccccccccCceecceeeEEcCCCCcc---------cEEEEecccEEEEEEec-ceEEEEeccccchhhcCcEEEEEEE Confidence 1111 112357899999999999842 1233321 11111111 12244444321110112222222 Q ss_pred c-cEEEeeeeeeeecccccccccccccccccccccCCC Q lcl|Aclame:pro 302 R-KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAI 338 (367) Q Consensus 302 r-~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sP 338 (367) | .-.+.||..|..-.- +...+| T Consensus 391 r~d~~~~~~~a~~~l~~---------------~~~~~p 413 (413) T protein:vir:81 391 RVGLMVTFPEAIVQLDV---------------AEVVTP 413 (413) T ss_pred eeccEEecccceEEEEe---------------cCCCCC Confidence 2 123446666643211 123345 No 120 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.86 E-value=1.4e-05 Score=47.17 Aligned_cols=262 Identities=13% Similarity=0.032 Sum_probs=111.1 Q ss_pred 
CCCccc-cccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNN-QVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~-~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +..... .+.=....+|+-+...+.... +...+ .+ .-.....++....+|....-++....+.|+..... T Consensus 129 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~-~~~~l------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~ 198 (397) T protein:vir:96 129 AEKRDGFTSVEGGALIPQELLQPQLEPK-DIVDL------SK---YVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQ 198 (397) T ss_pred hhhhhcccccccccchhHHHHHHHHHhh-hhhhH------HH---hhhhccccccceeEEEEeccCCccccccccccccc Confidence 111010 011123455555544443321 11111 11 11111234556667765533333344444432211 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) ....+-.+.....++.+.-..+++....-+..|-...+.+++++-..+..+..++... . T Consensus 199 --~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~-------g------------ 257 (397) T protein:vir:96 199 --LANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVL-------K------------ 257 (397) T ss_pred --cccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------c------------ Confidence 1223333333444555554455554433333344556666666444443333322110 0 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchh--hhcc-cccccccc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQLTI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~~~i 236 (367) .+.+....+++.+.++....=+... 
=.+++||+..+..|++..-- .|+- +.-....- T Consensus 258 -------------------~~~~~~~~~~d~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~ 317 (397) T protein:vir:96 258 -------------------TATAKSVVGVDGLKDLINKEIKKVY-DVKLFISASMYSELDKLKDKNGRYLLQDSITAASG 317 (397) T ss_pred -------------------ccccccccchHHHHHHHHHhhhhhc-CcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCc Confidence 0112234667788887765333222 25799999999999886311 1221 11112233 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEecc--eeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~G--Ai~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~G~s 313 (367) .+++|++|++.+.++...+ .+++ +++||.= ++.+.... .+++.+........+.- .+.| ...+.||..|. T Consensus 318 ~~l~G~pv~~~~~~~~~~~--~~~~-~~~~gd~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~~~r~d~~~~~~~a~~ 390 (397) T protein:vir:96 318 KQLLGKEVVVLDDDVIGKS--VGNV-VGFIGDAKAFASFFDRK---QVSVSWVDNNIYGQLLA-GIIRYDVKATDKKAGF 390 (397) T ss_pred ccccccceEEecccccCCC--CCce-EEEEeehhcceEeEeec---ceEEEEecccccceeEE-EEEEEccEEecccceE Confidence 6899999988766544322 2332 3555531 12222221 23333332222111111 1112 22345777777 Q ss_pred ecccccccccccccccccccc Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~ 334 (367) .-.-++ + T Consensus 391 ~~~~~~--------------a 397 (397) T protein:vir:96 391 YVTFTI--------------G 397 (397) T ss_pred EEEeec--------------C Confidence 643221 1 No 121 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.84 E-value=2.4e-06 Score=51.41 Aligned_cols=275 Identities=13% Similarity=0.076 Sum_probs=130.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 
MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) .......|.=...++|+.+.+.+.+.+.+.+.+++..- .....|+ ..+|..... +.+.-+.|+.. + T Consensus 136 ~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~---------~~~~~g~-~~ip~~~~~-~~a~~v~E~~~---~ 201 (425) T protein:vir:95 136 KFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVD---------KIRVKGT-TRILVDTDT-SPATWIEQSGA---L 201 (425) T ss_pred HHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhc---------eeecCce-eEEEEecCC-ccccccccccc---c Confidence 00000111123357899888888777777766644211 1122344 478876543 44444555543 2 Q ss_pred cccccc-hhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLG-SGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 81 t~~kit-t~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) +..... -++..-..++.+.-+.+++....-+..+-...+.++++....+..++.+|. | +..+.....++... T Consensus 202 ~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~---G----~G~~~~~p~Gil~~ 274 (425) T protein:vir:95 202 PTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVK---G----TGAANKQPLGIIPS 274 (425) T ss_pred ccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhc---c----CCCCccccceeecc Confidence 222221 222234445566666788877666666777888888887777766664443 2 11100000000000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHH-HHhcchhhhcccccc---- Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKR-MTNNDEIEFIPDSKG---- 232 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~-L~k~~li~~~~~~~g---- 232 (367) .. .... + + ......+++.+.++..++.-... .-.+++||...+.. |... 
...++.+| T Consensus 275 ~~-------~~~~--~---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l---~~~kd~~g~~i~ 338 (425) T protein:vir:95 275 LP-------PENQ--V---T-VEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEF---SIQVDSNGNVVG 338 (425) T ss_pred cc-------cccc--c---c-cccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHH---HhhcCCCCceee Confidence 00 0000 0 0 11224567888888776543332 23357899887543 3221 11222222 Q ss_pred ---cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeEEEEEcc---EE Q lcl|Aclame:pro 233 ---QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLEYILERK---EW 305 (367) Q Consensus 233 ---~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~---~~ 305 (367) ....++++|++|+++|.||... .+||.-. ..++. ...+++++.....-..++..+.... .- T Consensus 339 ~~~~~~~~~l~G~pvv~~~~~~~~~---------i~~Gd~~~~~~~~---~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~ 406 (425) T protein:vir:95 339 KLPNLRTPDLLGLRVVFNNFLDDDT---------VLFGEFEQYTLVE---RENITIDSSTHVKFTEDQTAFRGKGRFDGK 406 (425) T ss_pred ccCCCCCccccceeeEEcCcCCCcc---------EEEEecccEEEEe---ecceEEEeecccccccCceEEEEEEeeCcE Confidence 2345789999999999998531 3333211 11111 1123333332222123333333322 23 Q ss_pred Eeeeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 306 IVHPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 306 ~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) +.||..|..-+ ++. 
+..++ T Consensus 407 ~~~~~a~~~~~--i~~---------~~~g~ 425 (425) T protein:vir:95 407 PVKPEAFVLVT--ITD---------PVQGA 425 (425) T ss_pred eecccceEEEE--ecC---------cCCCC Confidence 55777777642 221 11122 No 122 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=97.81 E-value=5.6e-06 Score=49.38 Aligned_cols=283 Identities=11% Similarity=-0.035 Sum_probs=137.3 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+--|+...-...|.||+|.+.+..-+-++.-+.. ++...+. +-|++|.||-.+... -.+|..+++ + T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~---~~~~~d~-----g~GDtV~InsIg~~t--V~dY~~~~~---i 67 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVN---IARVVDF-----PDGDKLTIPSVGTPV--VRSRPEQGD---F 67 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhh---hhccccc-----CCCCeEEeccccccc--cccccCCCC---c Confidence 99877644556668899999999876666543211 1111111 249999999988773 345544443 4 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHH-HHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAV-GVYKSNLAGNFATIKTRG 158 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~-Gvf~~~~a~~~~~~~~~~ 158 (367) +.+.+++.+..-+| ...-.+|.+.|.... ...|.+..+.++.+..-++..+..+...|+ |.-..+..++-.. 
T Consensus 68 ~~d~ltt~~~~l~IDq~KYfaf~VdDD~~Q-a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~v----- 141 (322) T protein:vir:31 68 TFDNLDTGEISIILRDEVYAGNAISKKLRQ-DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNV----- 141 (322) T ss_pred ccccCCCceEEEEEehhhhhccccchhHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcce----- Confidence 55666655443222 223445577773333 445666666666665555555555544332 2211110000000 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccc--cCceeEEEEccHHHHHHHhcchhhh-------ccc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMTNNDEIEF-------IPD 229 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L~k~~li~~-------~~~ 229 (367) ..+. .+.+ ++..+ .....++.|.+...+|-+. -..=..+||.|.+++.|+....+.. .+. T Consensus 142 ------in~~-~~~i-v~~gt---~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i 210 (322) T protein:vir:31 142 ------INGV-PHRF-VGTGT---DQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGI 210 (322) T ss_pred ------ecCC-ccce-eccCC---CchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcccccccc Confidence 0000 0111 22222 2346789999999888653 2224788999999998866543321 111 Q ss_pred cc-c----cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeee----------c-----------cCCCccee Q lcl|Aclame:pro 230 SK-G----QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYA----------D-----------GAPQVPVA 283 (367) Q Consensus 230 ~~-g----~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~----------~-----------~~~~~~~e 283 (367) -+ | -..++..+|++|+++..+|. +.|+++-=++|+.... 
+ ..|+ .| T Consensus 211 ~~sG~a~g~~~Vg~~~GF~V~~SN~l~~------~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~--~e 282 (322) T protein:vir:31 211 VESGIAPDMQFVRSVYGIDLFVSNLLAD------ANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPT--TK 282 (322) T ss_pred ccccchhhHHHHHHHhceeeeeeccccc------cccccccCcccccccceeecccccccchhhhhhhhHhhhhhh--hh Confidence 11 1 12378999999999999863 2222222222211110 0 0011 34 Q ss_pred eeeehhhcCCceeEEEEEccEEEeeeeeee--ecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 284 VGRRELRGNGSGLEYILERKEWIVHPGGFN--WLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 284 ~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s--~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) -.|+..... |- +-++.++|.. ..++=++. .++.-|+-= T Consensus 283 ~~r~~~~~~----d~-----~~~~~~~g~g~~r~e~l~~~-----------~a~~~~~~~ 322 (322) T protein:vir:31 283 SFIDDYNDD----LN-----TATTARWGNGLVRDENLVCV-----------LANADKVTF 322 (322) T ss_pred cccCccccc----cc-----eeeeeeecceeecccceEEE-----------EeccccccC Confidence 456654431 11 1222333322 22111100 000011100 No 123 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=97.80 E-value=1.9e-06 Score=52.00 Aligned_cols=290 Identities=11% Similarity=0.086 Sum_probs=145.7 Q ss_pred CCCccc------cccc--------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCC Q lcl|Aclame:pro 1 MPDFNN------QVRL--------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDS 66 (367) Q Consensus 1 Ma~~~~------~T~l--------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g 66 (367) ||.... -|+- -+++. |+|...|.....+++.| .+.-.+.+ + .+|+++.+|+.+... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~------~~~~~~r~-i-~~gks~~~~~iG~~~- 70 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVT------TSRHMVRS-I-SSGKSAQFPVLGRTQ- 70 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhh------cccceeee-c-cccceEEEeeecceE- Confidence 554221 1111 13444 77777776666666555 33322221 2 479999999888552 Q ss_pred cccccCCCCccc----cccccc--cchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHH Q lcl|Aclame:pro 67 LEPNYGSDNPNV----EAPIDG--LGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAV 140 (367) Q Consensus 67 ~~~~~~~~~~~~----~~t~~k--itt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~ 140 (367) ...+..++..+ .++..+ |+-.+. .--.+.+.|+-..-+-.|.+.++++|.+..-++..++.++..|. T Consensus 71 -~~~~~~G~~l~~~~~~~~~~e~~ltID~~------~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~ 143 (345) T protein:vir:22 71 -AAYLAPGENLDDKRKDIKHTEKVITIDGL------LTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIA 143 (345) T ss_pred -EEeeecCCCCCCCCCCcccceEEEEecch------hhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222221110 111122 332221 22345778999889999999999999999999988888777654 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecC--cccchhh---cccHHHHHHHHHHhccc--cCceeEEEEccH Q lcl|Aclame:pro 141 GVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISG--QTNPADA---VFNREAFVDAAFTMGDH--VGSIAAIAVHSM 213 (367) Q Consensus 141 Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa--~t~~a~~---~~s~~~l~~A~~~~GD~--~~~l~~~vmhS~ 213 (367) ..-+.....+. .. .......+.++.+ ....+.. .--++.|.+|.++|-+. ...=..++|.|. 
T Consensus 144 k~a~~~~~~~~-~~----------~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~ 212 (345) T protein:vir:22 144 GLCNVESKYNE-NI----------EGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPD 212 (345) T ss_pred Hhhcccccccc-cc----------cccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChH Confidence 43221100000 00 0000011111111 1100000 11256777777777442 333478999999 Q ss_pred HHHHHHhcchhhhccccc----ccccchhhcCcEEEEeCCCcccCCC---------------CCceE---------EEEE Q lcl|Aclame:pro 214 VYKRMTNNDEIEFIPDSK----GQLTIPTYMGKVVIVDDGMPVFGTG---------------ADKTY---------LSIL 265 (367) Q Consensus 214 v~~~L~k~~li~~~~~~~----g~~~i~t~~G~~VivdD~~pv~~t~---------------~~~~y---------ttyl 265 (367) +|..|.+...+....+.. ++-.|++++|.+|+.+..+|....+ ..+.+ ...+ T Consensus 213 ~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~ 292 (345) T protein:vir:22 213 SYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLF 292 (345) T ss_pred HHHHHhccccccccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEE Confidence 999998886543322221 1235888999999999988843211 00111 2356 Q ss_pred EecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 266 FGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 266 ~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) |-+.|++.....+ ..+|..|++... .|.+..+..| .-..+.|..+-+. 
T Consensus 293 ~h~~A~~~v~~~~-~~~e~~r~~~~~----~d~I~~~~a~--G~~vlRPeaa~~i------------------------- 340 (345) T protein:vir:22 293 MHRSAVGTVKLRD-LALERARRANFQ----ADQIIAKYAM--GHGGLRPEAAGAV------------------------- 340 (345) T ss_pred Eehhheeeeeeec-ceeeeeechhHH----HHHHHHHHhc--CCcccccceeEEE------------------------- Confidence 7777877766553 236677766542 1222222111 1112222211110 Q ss_pred Cccceeeeccc Q lcl|Aclame:pro 346 PDNWERVTYRK 356 (367) Q Consensus 346 ~~NW~~v~d~K 356 (367) ++..| T Consensus 341 ------~~~~~ 345 (345) T protein:vir:22 341 ------VFKVE 345 (345) T ss_pred ------EEeeC Confidence 01111 No 124 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.79 E-value=9.8e-06 Score=48.04 Aligned_cols=266 Identities=9% Similarity=-0.041 Sum_probs=123.8 Q ss_pred CCCcccc-ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCc--eEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MPDFNNQ-VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGR--LINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~-T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~--~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) +...... +.-.-.++|+.+.+.+.....+.+.+.+-. .....++. .+.+|....- ..+.-+.|+... 
T Consensus 120 ~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~ 189 (397) T protein:vir:12 120 FRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYV---------TVEPVTTRSGTRLLEKNADM-VPFSPVEELGNL 189 (397) T ss_pred hhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhc---------ceeeccCCceeEEEEEecCC-cceeeecccccc Confidence 1111111 112235779888877777666655442211 11111222 3444444333 234455555321 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .. .+..+-++.....++.+....++++...-+.-|-...+.+++++...+..+..++. |. .. T Consensus 190 ~~--~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~---G~--------g~----- 251 (397) T protein:vir:12 190 PE--IDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILA---AI--------AS----- 251 (397) T ss_pred cc--cccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHh---cc--------cc----- Confidence 11 12233333344455566667777777666666777788899887777766655442 11 00 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchh--hhcc-ccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQ 233 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~ 233 (367) +......+++.+.++... +=.....-.+++||+..+..|++..-- .|+- +.-.. 
T Consensus 252 ----------------------~~~~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~ 309 (397) T protein:vir:12 252 ----------------------LKKVDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTN 309 (397) T ss_pred ----------------------ccccccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccC Confidence 011223556777777642 312222336799999999999875211 1111 11011 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE---EEee Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVH 308 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~---~~~h 308 (367) ..-.+++|++|++.+.+..... .++. .++||. .++.+.... ...++.++.....-..+...+....+ -++| T Consensus 310 g~~~~l~G~pv~~~~~~~~~~~--~~~~-~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~ 385 (397) T protein:vir:12 310 PTKKLLDGRPVVPFTNRVLKTQ--KGKA-PLIIGNLKEAIVLFDRE-QQSIASTDTGAGAFETNSTKVRGIEREDVRKWD 385 (397) T ss_pred CCCccccceeeEEecccccccC--CCcc-EEEEEehhceEEEEeec-ceEEEEeccccchhhcCceEEEEEEeeccEEec Confidence 1235899999987665433221 1222 255653 222222211 22244443332211122222222211 2346 Q ss_pred eeeeeecccccccccccccccccccc Q lcl|Aclame:pro 309 PGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 309 p~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) |..|..-.-++ . 
T Consensus 386 ~~a~~~~~~t~--------------~ 397 (397) T protein:vir:12 386 EDAVVFGQITV--------------E 397 (397) T ss_pred ccceEEEEEee--------------C Confidence 66665543221 0 No 125 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.74 E-value=2.2e-06 Score=51.64 Aligned_cols=286 Identities=8% Similarity=-0.048 Sum_probs=131.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||-.. |.-. ..+|+.+..=+.+.+.+.+.+.+-+-. ...++..+++|.+..- ..+.-+.|+. .+ T Consensus 1 Mat~t--t~~g-~~vP~~~~~~ii~~~~~~s~l~~~~~~---------i~~~~~~~~~p~~~~~-~~a~wv~Eg~---~~ 64 (311) T protein:vir:99 1 MATFG--TGNL-KNLPRNIADGMVKDVVQGSTVAVLSAR---------KPQRFGNEDIITFNGR-PKAEFVGEGQ---QK 64 (311) T ss_pred Cceec--CCCc-eeccHHHHHHHHHHHHhhchhhhhcce---------eeccCCceEEEEEeCC-ceeEEeecCc---cc Confidence 99532 3333 345777755555555555544332211 1234556799998754 4455666664 34 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhh---cccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELA---GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~---g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) +..+.+-++.....++.+--..++++-...+ ..|-.+.+.+++++.+.+..++.+|. |.-.-.......... 
T Consensus 65 ~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~---G~g~~~g~~~~g~~~-- 139 (311) T protein:vir:99 65 SSTTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYH---RINPLTGTVIPGWSN-- 139 (311) T ss_pred ccccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhc---ccCcccCcccccccc-- Confidence 4455555555555566666677777754333 34567899999999998888877763 210000000000000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCc--eeEEEEccHHHHHHHhcchhh--hc-ccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGS--IAAIAVHSMVYKRMTNNDEIE--FI-PDSKG 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~--l~~~vmhS~v~~~L~k~~li~--~~-~~~~g 232 (367) ..............+ .......+.++..++-..... -.+++||+..+..|++..--+ ++ +.... T Consensus 140 -------~~~~~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~ 208 (311) T protein:vir:99 140 -------YLGAASKRVELTADT----IANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGL 208 (311) T ss_pred -------ccccccceeeccccc----cchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCccc Confidence 000011111111111 112234455566655433322 235999999999998753111 11 11101 Q ss_pred cccchhhcCcEEEEeCCCcccCC---CC----CceEEEEEEec--ceeeeeccCCCcceeeeeehhhcC-----CceeEE Q lcl|Aclame:pro 233 QLTIPTYMGKVVIVDDGMPVFGT---GA----DKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGN-----GSGLEY 298 (367) Q Consensus 233 ~~~i~t~~G~~VivdD~~pv~~t---~~----~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~-----~~g~~~ 298 (367) ...-.+++|++|++++.+|-... +. .+.+.-+++|. ..+.|+... ...+++.+.....+ ...+.. 
T Consensus 209 ~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~ 287 (311) T protein:vir:99 209 GIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQR-DIPVELIKYGDPDGQGDLKRHNQIA 287 (311) T ss_pred CCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEec-CceEEEeecCCCCcchhhhhcCcEE Confidence 11235899999999999873211 00 00111123332 233333222 11122222211110 001111 Q ss_pred EE--EccE-EEeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 299 IL--ERKE-WIVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 299 l~--~r~~-~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) +- .|.- .+.||.-+..+++. + T Consensus 288 ~r~~~r~d~~v~~~~~v~~~~~~---------------A 311 (311) T protein:vir:99 288 LRLEIVYGWYVFTDRFVVIENAV---------------A 311 (311) T ss_pred EEEEEeecceecChhHeeeeccc---------------C Confidence 11 1111 24466555554432 1 No 126 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=97.71 E-value=7.5e-06 Score=48.68 Aligned_cols=295 Identities=11% Similarity=0.002 Sum_probs=134.3 Q ss_pred CCCcccccccee-------ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MPDFNNQVRLVD-------AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma~~~~~T~l~d-------~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) |=+..++|+--. ..+||.+..++ ..+.+.+.| ...+.+... 
.+....++|.++.-........+ T Consensus 4 ~~~~~~~~k~it~~d~~gG~L~P~~~~~~i-~~l~e~s~i------~~~a~vi~t--~~s~~~~i~~i~~g~~~~~~~~~ 74 (314) T protein:vir:41 4 LNKPFQITPKIDVPDLGKGILAVQRFGEFV-REVRENSAI------IKDARVLNA--LKSYEVDISRISLGVELEPGRNT 74 (314) T ss_pred hhhHHHhhcccccccCCCceeChHHHHHHH-HHHHhccch------hhheeeecc--cCccceeecccccCccccccccc Confidence 333333333212 36899887655 445554444 333322211 23456777776521001111111 Q ss_pred CCccccccccccchhhhhhhhhHhhcccchhHHHHHhh--cccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELA--GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNF 151 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~--g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~ 151 (367) ....+..+-...+-++..-..++..--|.+++....-. +.|.-..+.+++++-+.+..++.++ .|-=+. ...+ T Consensus 75 ~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~---nGdg~~-~s~~- 149 (314) T protein:vir:41 75 SGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFL---HADSSL-TTGR- 149 (314) T ss_pred ccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhh---ccccCC-cCcc- Confidence 11111122222222332233333344467776665544 4588899999999988888877655 231000 0000 Q ss_pred hhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-C--ceeEEEEccHHHHHHHhcc--hhhh Q lcl|Aclame:pro 152 ATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-G--SIAAIAVHSMVYKRMTNND--EIEF 226 (367) Q Consensus 152 ~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~--~l~~~vmhS~v~~~L~k~~--li~~ 226 (367) .+.+...... .....++.+.++.+ ..++.+.|.+....+-... . 
.--+++||..++..+++.- .-.+ T Consensus 150 ~~~~~p~G~l----~~a~~~~~~~~~~~----~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~ 221 (314) T protein:vir:41 150 ELYRINDGWM----KLAGNQYTDAEPED----ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETG 221 (314) T ss_pred cchhcchhhh----hhcccceeecCccc----cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCc Confidence 0000000000 00123344443332 2366778888888887643 2 2447999999998887641 1111 Q ss_pred c-ccccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEeccee-eeeccCCCcceeeeeehhhcCCceeEEEEEccE Q lcl|Aclame:pro 227 I-PDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAF-GYADGAPQVPVAVGRRELRGNGSGLEYILERKE 304 (367) Q Consensus 227 ~-~~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi-~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~ 304 (367) + +..-....-.+++|++|+....||..+.++ . +++|+.=.- .|... ....++.+|++. .++.-.+.+.+ T Consensus 222 l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~--~--~i~fgd~~nlv~~~~-~~ir~~~~~~a~----~~~~~~~~~~r 292 (314) T protein:vir:41 222 LGDSALIGATGLQYDGIPIQYVPALDALGDDK--A--RALLTVPTNLVYGFW-RNIRIEPKRDAA----MRRTEYIASLR 292 (314) T ss_pred ccchhhhCCCCceecceeeEecccccccCCCC--c--eEEEechhheEEEee-ceeEEeecccCc----CCeEEEEEEEE Confidence 1 111111123468899999999998654322 1 344544321 12221 123344445543 33444454444 Q ss_pred EEeeeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 305 WIVHPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 305 ~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) +-+ +|.|.++-+.+-.+ ..+++ T Consensus 293 ~d~---~~~~~~aa~~~~~~------~~~~~ 314 (314) T protein:vir:41 293 ADC---NYEDENAAVAAVID------MSSGG 314 (314) T ss_pred ece---EEEEcCcEEEEEee------ccCCC Confidence 433 23344443322110 01111 No 127 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.70 E-value=1.3e-05 Score=47.32 Aligned_cols=272 Identities=10% Similarity=-0.012 Sum_probs=120.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHh---hCCCceEEeeeeccCCCcccccCCCCcc Q 
lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFL---SAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~---~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) +.. ..+.-.-.++|+.+...+.. ..+...+ ..++ ..+...+++|.+....+...-+.++... T Consensus 156 ~~~--~~~~~~g~lvp~~~~~~i~~-~~~~~~l------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 220 (437) T protein:vir:10 156 VTG--IALKDGKVIIPETILTPEKE-VHQFPRL------------GSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQT 220 (437) T ss_pred hhh--cccccccccchHHHHHHHHH-hhhhhhh------------hhcceeEeeccCceeeEEeeccccccccccccccc Confidence 111 01111223566655444322 1121111 1122 1234567788887665555555554432 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .. .+..+-++..-..++.+.-..++.....-+.-|....+.+.+++.+.+..+..+|..+. .+ T Consensus 221 ~e--~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g-------~~-------- 283 (437) T protein:vir:10 221 TK--NATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALT-------DG-------- 283 (437) T ss_pred cc--cccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-------cc-------- Confidence 21 12222233333445556555666665555555666778888887666665554443210 00 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-hccccCceeEEEEccHHHHHHHhcchh--hhcc-ccccc Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDEI--EFIP-DSKGQ 233 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-~GD~~~~l~~~vmhS~v~~~L~k~~li--~~~~-~~~g~ 233 (367) ... ......++.+.++... +-.....-.+++||+..+..|++..=- .|+- +.-.. 
T Consensus 284 --------------------~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~ 342 (437) T protein:vir:10 284 --------------------IKK-TTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTA 342 (437) T ss_pred --------------------ccc-cccccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccC Confidence 000 0112234455555432 211222235799999999999886311 1221 11111 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec--ceeeeeccCCCcceeeeeehhhcCCceeEEEEEc-cEEEeeee Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNGSGLEYILER-KEWIVHPG 310 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~--GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r-~~~~~hp~ 310 (367) ..-++++|++|++.++|++...+ .+.+ +.+||. -++.+.... ...+++..+..... . ..-.+.| ..-++||. T Consensus 343 ~~~~~l~G~pv~~~~~~~~~~~~-~~~~-~~~~gd~~~~~~~~~r~-~~~~~~~~~~~~~~-~-~~~~~~r~d~~~~~~~ 417 (437) T protein:vir:10 343 ATGYTLLGKTVVIVDDKLFPSAS-AGDV-NIVVAPLKKAVINFKLT-EITGQFQDTYDIWY-K-QLGIFLRQNVVQASKD 417 (437) T ss_pred CCCcccccceeEEecccccCCcC-CCce-EEEEeeccccEEEEeee-ceEEEEeccccccc-c-eeeEEEEEccEEeccc Confidence 22368999999998887543322 2332 234443 123222211 11222221111110 0 1112223 33467888 Q ss_pred eeeecccccccccccccccccccccCCCChH Q lcl|Aclame:pro 311 GFNWLDADVTIPDNTGSPSGITSGPPAITLA 341 (367) Q Consensus 311 G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a 341 (367) .|.......++ .+ ...|+-+ T Consensus 418 a~~~l~~~~~~--~~---------~~~~~~~ 437 (437) T protein:vir:10 418 LIVNLTGKLKA--VT---------VVQSTAV 437 (437) T ss_pred ceEEEEeeccc--cc---------cCCCCCC Confidence 88864332211 11 1122222 No 128 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=97.56 E-value=7.1e-06 Score=48.83 Aligned_cols=300 Identities=10% Similarity=-0.012 Sum_probs=148.5 Q ss_pred CCCccccccc--------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccC Q lcl|Aclame:pro 1 
MPDFNNQVRL--------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYG 72 (367) Q Consensus 1 Ma~~~~~T~l--------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~ 72 (367) |--.|+.|+- .+++. |+|.-=|.....+++.| .+.-.++++ .+|+++.+|+.+... .+... T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~------~~~~~~rti--~~g~s~~~~~iG~~~--~~~~~ 69 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKF------APLMNIRDL--RGSNVVRLDRLGNVE--AKGRR 69 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhh------ccccceeee--ccceeEEEeeeeeee--eeccc Confidence 6555544432 13333 55544444443344444 333333332 579999999998663 22222 Q ss_pred CCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHH-HHHHHHHhhhhhhh Q lcl|Aclame:pro 73 SDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRII-AMAVGVYKSNLAGN 150 (367) Q Consensus 73 ~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~ll-a~l~Gvf~~~~a~~ 150 (367) -+.+.+ ...+...+.+-+|=. .--...+.|+-...+--|-..+++++.+..-++..+..++ .++++.-....... T Consensus 70 pG~~l~---~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~ 146 (335) T protein:vir:63 70 AGEELE---RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDL 146 (335) T ss_pred CCcCcC---CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 222111 111222221111100 0112247788888888889999999999888886665544 45554322111100 Q ss_pred hhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc--C---ceeEEEEccHHHHHHHhc-chh Q lcl|Aclame:pro 151 FATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--G---SIAAIAVHSMVYKRMTNN-DEI 224 (367) Q Consensus 151 ~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~--~---~l~~~vmhS~v~~~L~k~-~li 224 (367) .. ..+.......++++.+..+....-...+-+|.+.|-++. + .-.+++|.|++|..|.+. 
+|+ T Consensus 147 ~~-----------~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~ 215 (335) T protein:vir:63 147 ED-----------AFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLM 215 (335) T ss_pred CC-----------CcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccc Confidence 00 000111222233433221111111233446666665433 2 227899999999999887 566 Q ss_pred hhc-ccccc-----cccchhhcCcEEEEeCCCcccCCC------C-------CceEEEEEEecceeeeeccCCCcceeee Q lcl|Aclame:pro 225 EFI-PDSKG-----QLTIPTYMGKVVIVDDGMPVFGTG------A-------DKTYLSILFGGAAFGYADGAPQVPVAVG 285 (367) Q Consensus 225 ~~~-~~~~g-----~~~i~t~~G~~VivdD~~pv~~t~------~-------~~~yttyl~~~GAi~~~~~~~~~~~e~~ 285 (367) +-. ..+++ .-.+...+|.+|+.+..+|..... . ..+...+++-+-|++.....+ +..|+. T Consensus 216 n~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~-vt~e~~ 294 (335) T protein:vir:63 216 NVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAP-VQAKLW 294 (335) T ss_pred ccccccccccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEee-ccccee Confidence 632 22332 235888999999999999965311 1 113467888888988877665 334555 Q ss_pred eehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 286 RRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 286 rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) |+..+. 
.+.+..++.| ......|.-+-+..- +..|....++ T Consensus 295 ~~~~~~----~~~i~~~~a~--G~g~lRPe~a~~i~~--tg~~~~~~~~ 335 (335) T protein:vir:63 295 EDNEKF----SWVLDTFQMY--NIGARRPDTAGAIEL--KGIGAFDITA 335 (335) T ss_pred eccchh----hHHhHHHHHc--CCcccccceEEEEEE--cCCCceeecC Confidence 554432 1334444333 223334433322221 1112222221 No 129 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=97.55 E-value=1.1e-05 Score=47.75 Aligned_cols=295 Identities=9% Similarity=-0.006 Sum_probs=151.2 Q ss_pred CCCccccccc--------eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccC Q lcl|Aclame:pro 1 MPDFNNQVRL--------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYG 72 (367) Q Consensus 1 Ma~~~~~T~l--------~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~ 72 (367) |--+|+.|+- .+++. |+|.-.|.....+++.| .+.-.++++ .+|+++.+|+.+...- +... T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~------~~~~~~rti--~~g~s~~~~~iG~~~~--~~~~ 69 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKF------APLMNIRDL--RGSNVVRLDRLGNVEA--KGRR 69 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhh------ccccceeee--ccceeEEEeeeeeeee--cccc Confidence 6655555542 24555 77777777666666655 334333332 5799999999886631 1111 Q ss_pred CCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHH-HHHHHHHHhhhhhhh Q lcl|Aclame:pro 73 SDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRI-IAMAVGVYKSNLAGN 150 (367) Q Consensus 73 ~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~l-la~l~Gvf~~~~a~~ 150 (367) -+. .+....+...+.+-+|=. .--...+.|+-...+--|-..++++|.+..-++..++.. +.++++.-...... 
T Consensus 70 pG~---~l~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~- 145 (335) T protein:vir:78 70 AGE---ELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVD- 145 (335) T ss_pred cCc---ccCCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Confidence 111 111111222221111100 011234788888888889999999999988888766654 45555432111110 Q ss_pred hhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHH----HhccccCc------eeEEEEccHHHHHHHh Q lcl|Aclame:pro 151 FATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF----TMGDHVGS------IAAIAVHSMVYKRMTN 220 (367) Q Consensus 151 ~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~----~~GD~~~~------l~~~vmhS~v~~~L~k 220 (367) ... ..+.......++++.+. .-.+..+.+|+. .|= .++. =.+++|.|++|..|.+ T Consensus 146 ~~~----------~~~~G~~~~~~~tg~~~----~~~~~~l~~a~~~a~~~l~-ekdvP~~~~~~rv~vv~P~~y~~Ll~ 210 (335) T protein:vir:78 146 LED----------AFSPGVLEKLDLTGLTA----KEAAEKIVRMHRRVVETFI-ERDLGDAVYSEGLTPMSPRVFSLLLE 210 (335) T ss_pred cCC----------CcCCCcceeeeeccccc----cccHHHHHHHHHHHHHHHH-hccCCCCCCCccEEEeChHHHHHHhc Confidence 000 00111122233443322 223555555543 342 2222 2689999999999988 Q ss_pred c-chhhhc-ccccc-----cccchhhcCcEEEEeCCCcccCCC------CCce-------EEEEEEecceeeeeccCCCc Q lcl|Aclame:pro 221 N-DEIEFI-PDSKG-----QLTIPTYMGKVVIVDDGMPVFGTG------ADKT-------YLSILFGGAAFGYADGAPQV 280 (367) Q Consensus 221 ~-~li~~~-~~~~g-----~~~i~t~~G~~VivdD~~pv~~t~------~~~~-------yttyl~~~GAi~~~~~~~~~ 280 (367) . +|++-. ..+++ .-.+...+|.+|+.+..+|..... ..++ -.+++|-+-|++.....+. 
T Consensus 211 ~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~- 289 (335) T protein:vir:78 211 HDKLMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPV- 289 (335) T ss_pred ccccccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEec- Confidence 7 566632 22332 135888999999999999965311 1111 1457788888888776653 Q ss_pred ceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccc Q lcl|Aclame:pro 281 PVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSG 334 (367) Q Consensus 281 ~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~ 334 (367) ..|+.|+.... .+.+..++.| ......|.-+-++.-.+. +....++ T Consensus 290 ~~e~~~~~~~~----~~~i~~~~a~--G~g~lRPe~a~~i~~tg~--~~~~~~~ 335 (335) T protein:vir:78 290 QAKLWEDHDQF----SWVLDTFQMY--NIGARRPDTAGAIELKGI--EAFDITA 335 (335) T ss_pred ccceeeccchh----hHhhhHHHHc--CCcccCcceEEEEEecCC--CcccccC Confidence 34555554432 1334444333 333334433333221111 1111111 No 130 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.53 E-value=2e-05 Score=46.36 Aligned_cols=299 Identities=10% Similarity=0.018 Sum_probs=115.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+. ...+...- .+|+.+..-+.+.+.+.+.+.+ +-.....++..+++|....-++.+.-+.|+.. 
+ T Consensus 151 ~~~-~~~~~gg~-~vp~~~~~~ii~~~~~~~~i~~---------l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~---~ 216 (497) T protein:vir:10 151 NPF-GSTGTFAP-GILPTFLPGIVEQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAGT---Y 216 (497) T ss_pred hhc-ccCccccc-ccchhhhHHHHHHHHhhhhHHh---------hccccccCCCceEEEEEcCCCCcceeeccCcc---c Confidence 221 00111222 3444454445554444443311 11112234556888887543344556666643 3 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-----HHHHHhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-----AVGVYKSNLAGNFATIK 155 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-----l~Gvf~~~~a~~~~~~~ 155 (367) +..+.+-++.....++.+--..++++...-+ .+-...|.++++....+..+..+|.= ..|++............ T Consensus 217 ~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~ 295 (497) T protein:vir:10 217 PFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) T ss_pred ccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccc Confidence 3334443444444444444445555443322 23346777777766666665554430 11222211110000000 Q ss_pred hhhhhhhhhhcchhhcceeecCcc----------------------------cchhhcccHHHHHHHHHHhcc-ccCcee Q lcl|Aclame:pro 156 TRGRVPAEVLGTAGDMVIDISGQT----------------------------NPADAVFNREAFVDAAFTMGD-HVGSIA 206 (367) Q Consensus 156 ~~~~~~a~~~~~~~~~v~disa~t----------------------------~~a~~~~s~~~l~~A~~~~GD-~~~~l~ 206 (367) ... ...........+.++..... ...........+..+...+-. ....-. 
T Consensus 296 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) T protein:vir:10 296 SLF-GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) T ss_pred cch-hhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC Confidence 000 00000000000000000000 000000111122222222111 111224 Q ss_pred EEEEccHHHHHHHhcchhh--hcccc-----ccc--ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccC Q lcl|Aclame:pro 207 AIAVHSMVYKRMTNNDEIE--FIPDS-----KGQ--LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGA 277 (367) Q Consensus 207 ~~vmhS~v~~~L~k~~li~--~~~~~-----~g~--~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~ 277 (367) +++||+..+..|++.+=-+ |+-.+ .+. ..-++++|++|++++.||. +++..=-|..+++.+..-. T Consensus 375 ~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~------~~~~~Gd~~~~~~~i~~r~ 448 (497) T protein:vir:10 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL------GTILVGHFAPSVIQTARRE 448 (497) T ss_pred eEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCC------CceEEeecccceEEEEEec Confidence 7999999999998764221 22111 111 1224789999999999984 2221111334555443211 Q ss_pred CCcceeeeeehhhcC--CceeEEEEEccEE---Eeeeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 278 PQVPVAVGRRELRGN--GSGLEYILERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 278 ~~~~~e~~rd~~~~~--~~g~~~l~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) .+++++.+.... ..++..+....++ +.||..|..-.-. 
...+++ T Consensus 449 ---~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~-----------~~~~~~ 497 (497) T protein:vir:10 449 ---GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK-----------KGATGS 497 (497) T ss_pred ---ccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEec-----------CCccCC Confidence 122333221110 1123333333333 4477777654211 011111 No 131 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.53 E-value=2e-05 Score=46.36 Aligned_cols=299 Identities=10% Similarity=0.018 Sum_probs=115.1 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |+. ...+...- .+|+.+..-+.+.+.+.+.+.+ +-.....++..+++|....-++.+.-+.|+.. + T Consensus 151 ~~~-~~~~~gg~-~vp~~~~~~ii~~~~~~~~i~~---------l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~---~ 216 (497) T protein:vir:78 151 NPF-GSTGTFAP-GILPTFLPGIVEQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAGT---Y 216 (497) T ss_pred hhc-ccCccccc-ccchhhhHHHHHHHHhhhhHHh---------hccccccCCCceEEEEEcCCCCcceeeccCcc---c Confidence 221 00111222 3444454445554444443311 11112234556888887543344556666643 3 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-----HHHHHhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-----AVGVYKSNLAGNFATIK 155 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-----l~Gvf~~~~a~~~~~~~ 155 (367) +..+.+-++.....++.+--..++++...-+ .+-...|.++++....+..+..+|.= ..|++............ 
T Consensus 217 ~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~ 295 (497) T protein:vir:78 217 PFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) T ss_pred ccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccc Confidence 3334443444444444444445555443322 23346777777766666665554430 11222211110000000 Q ss_pred hhhhhhhhhhcchhhcceeecCcc----------------------------cchhhcccHHHHHHHHHHhcc-ccCcee Q lcl|Aclame:pro 156 TRGRVPAEVLGTAGDMVIDISGQT----------------------------NPADAVFNREAFVDAAFTMGD-HVGSIA 206 (367) Q Consensus 156 ~~~~~~a~~~~~~~~~v~disa~t----------------------------~~a~~~~s~~~l~~A~~~~GD-~~~~l~ 206 (367) ... ...........+.++..... ...........+..+...+-. ....-. T Consensus 296 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) T protein:vir:78 296 SLF-GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) T ss_pred cch-hhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC Confidence 000 00000000000000000000 000000111122222222111 111224 Q ss_pred EEEEccHHHHHHHhcchhh--hcccc-----ccc--ccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccC Q lcl|Aclame:pro 207 AIAVHSMVYKRMTNNDEIE--FIPDS-----KGQ--LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGA 277 (367) Q Consensus 207 ~~vmhS~v~~~L~k~~li~--~~~~~-----~g~--~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~ 277 (367) +++||+..+..|++.+=-+ |+-.+ .+. ..-++++|++|++++.||. +++..=-|..+++.+..-. 
T Consensus 375 ~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~------~~~~~Gd~~~~~~~i~~r~ 448 (497) T protein:vir:78 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL------GTILVGHFAPSVIQTARRE 448 (497) T ss_pred eEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCC------CceEEeecccceEEEEEec Confidence 7999999999998764221 22111 111 1224789999999999984 2221111334555443211 Q ss_pred CCcceeeeeehhhcC--CceeEEEEEccEE---Eeeeeeeeeccccccccccccccccccccc Q lcl|Aclame:pro 278 PQVPVAVGRRELRGN--GSGLEYILERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 278 ~~~~~e~~rd~~~~~--~~g~~~l~~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) .+++++.+.... ..++..+....++ +.||..|..-.-. ...+++ T Consensus 449 ---~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~-----------~~~~~~ 497 (497) T protein:vir:78 449 ---GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK-----------KGATGS 497 (497) T ss_pred ---ccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEec-----------CCccCC Confidence 122333221110 1123333333333 4477777654211 011111 No 132 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=97.46 E-value=9.7e-06 Score=48.07 Aligned_cols=279 Identities=10% Similarity=0.007 Sum_probs=132.1 Q ss_pred Hhh--CCCceEEeeeeccCCCcccccCCCCccccccccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHH Q lcl|Aclame:pro 48 FLS--APGRLINIPFWRDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFG 124 (367) Q Consensus 48 ~~~--~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia 124 (367) +.. .+|+++.+|+.+... ...+.-++++. 
-++..+...+.+-+| ...--.+.+.|+-...+-.|++.++.+|.+ T Consensus 1 ~vr~i~~g~s~~~~~iG~~~--~~~~~~G~~l~-~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTK--ARYLKQGQSLD-DGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMG 77 (324) T ss_pred CeeeeecCceEEEeeeeeeE--eccccCCCCcC-CCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHH Confidence 222 479999999987552 22222222110 012222222211111 111234578888888888999999999999 Q ss_pred HHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhccc----HHHHHHHHHHhcc Q lcl|Aclame:pro 125 VYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFN----REAFVDAAFTMGD 200 (367) Q Consensus 125 ~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s----~~~l~~A~~~~GD 200 (367) ..-++..++.++..+.++........... ........+..+++.+.+ ...+ ++.|.+|.++|=. T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~----------~~~~g~~~~~~~~~~~~~--~~~~~~~~~dai~~a~~~Lde 145 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNSRKETTNEN----------IEGLGAASLVKITGKKED--PAKYGTQVIQALTYARAAFAK 145 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccCC----------cccCCccceecccccccc--cccCHHHHHHHHHHHHHHHhh Confidence 99999888888776655543322111110 011112222334433322 2334 3455555566532 Q ss_pred --ccCceeEEEEccHHHHHHHhcchhhhccc-cccc---ccchhhcCcEEEEeCCCcccCCCC----------------- Q lcl|Aclame:pro 201 --HVGSIAAIAVHSMVYKRMTNNDEIEFIPD-SKGQ---LTIPTYMGKVVIVDDGMPVFGTGA----------------- 257 (367) Q Consensus 201 --~~~~l~~~vmhS~v~~~L~k~~li~~~~~-~~g~---~~i~t~~G~~VivdD~~pv~~t~~----------------- 257 (367) -...=.+++|.|.+|..|.+...+..... +.+. -.|+.++|++|+.+..+|...... 
T Consensus 146 ~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~ 225 (324) T protein:vir:99 146 KYIPAGDRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGD 225 (324) T ss_pred cCCCCCCCEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccc Confidence 22334789999999999987755543322 2222 258889999999999999642110 Q ss_pred ---CceEE-------EEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeee---eecccccccccc Q lcl|Aclame:pro 258 ---DKTYL-------SILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGF---NWLDADVTIPDN 324 (367) Q Consensus 258 ---~~~yt-------tyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~---s~~~~~~~~~~~ 324 (367) .++|. ..+|-+-|++.....+ .-.|..|++... +.....++.=-+-++.|.+. .+.... .|.. T Consensus 226 ~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~-~~~e~~~~~~~~-~d~i~~~~a~G~~~lRPe~a~~v~l~~~~--~~~~ 301 (324) T protein:vir:99 226 STTTGKMTVGADNVVGLFVHRSAVATLKLKD-MALERARRPEYQ-ADQIIAKYAMGHGGLRPEAVGAIIFEDGE--TPAV 301 (324) T ss_pred cccccccccccCceeEEEEehhheEEEeeec-ceecceechhhH-HHhhhhhhhhcCcccccceEEEEEEccCc--cccc Confidence 11232 1344444444433332 235666766542 22222222222333344322 221110 0000 Q ss_pred cccccccccccCC--CChHHhcC Q lcl|Aclame:pro 325 TGSPSGITSGPPA--ITLANLAN 345 (367) Q Consensus 325 ~~~~~~~~~~~~s--Pt~a~L~~ 345 (367) +..-.....++.. -|.+.... 
T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~ 324 (324) T protein:vir:99 302 APDVITGVASFAAPASTRAKSSA 324 (324) T ss_pred cchhhhhhccccCcccceeeecC Confidence 0000000000000 01111100 No 133 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=97.40 E-value=1.3e-05 Score=47.31 Aligned_cols=266 Identities=11% Similarity=0.048 Sum_probs=119.0 Q ss_pred CCCccccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +...+.-| .=...++|+-|..-+.+...+.+-+.+ ...+ ...++ ..+|....-.+++.-+.|+..... T Consensus 115 ~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~------~~~~---~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~ 183 (387) T protein:vir:26 115 LHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE------KARL---TNIKG--LEIPRVSYTLDDDDFITDVETAKE 183 (387) T ss_pred HhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh------hcee---eecCC--ceeeeeeccCCccccccccccccc Confidence 00000001 112356787776656655555444422 1111 11223 345655433344544555543322 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+.+-++..-..++.+--..++++...-+..|-.+.+.++|++-+.+...+.++....|. ... 
T Consensus 184 ---~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--------g~~------ 246 (387) T protein:vir:26 184 ---LKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--------GLE------ 246 (387) T ss_pred ---cccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--------ccc------ Confidence 233333333444444555677777655566677788999998766665444444222110 000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i 236 (367) ..++ .++..........++.++++...+-.....-..++||+..+..|++. .++..+.. .- T Consensus 247 ----------~g~~-~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~-----~~~~~~~~~~~~~ 310 (387) T protein:vir:26 247 ----------HMSF-YNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISV-----LSNGTTNFFDTPA 310 (387) T ss_pred ----------ceee-eccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHH-----HhcCCCcccccCC Confidence 0000 00000001122347888888777655444456799999998876554 11222211 12 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) .+++|++|+++|+++..-- |.|. ...+.+. ...+...|+.. .|+..+..+.+| ++.|..|. 
T Consensus 311 ~~llG~PV~~~~~~~~~~~---GDf~-----~~~~~~~----~~~~~~~~~~~----~~~~~~~~~~r~Dg~v~~~~A~~ 374 (387) T protein:vir:26 311 EKVFGKPVVFTDAAVKPIV---GDFN-----YFGINYD----GTTYDTDKDVK----KGEYLFVLTAWYDQQRTLDSAFR 374 (387) T ss_pred ccccccceEEecCCCceee---echh-----hhhhhhh----hhhheeccccc----CCceEEEEEEEeCcEeechhheE Confidence 4789999999998864211 1111 1111110 11122233332 233333333222 22344444 Q ss_pred ecccccccccccccccccccccCCCC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) .-+- ..+.+..|| T Consensus 375 ~l~~-------------ka~~~~~~~ 387 (387) T protein:vir:26 375 IAKA-------------KENTGPLPS 387 (387) T ss_pred EEEe-------------ecCCCCCCC Confidence 3221 112334444 No 134 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=97.40 E-value=1.3e-05 Score=47.31 Aligned_cols=266 Identities=11% Similarity=0.048 Sum_probs=119.0 Q ss_pred CCCccccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +...+.-| .=...++|+-|..-+.+...+.+-+.+ ...+ ...++ ..+|....-.+++.-+.|+..... 
T Consensus 115 ~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~------~~~~---~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~ 183 (387) T protein:vir:96 115 LHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE------KARL---TNIKG--LEIPRVSYTLDDDDFITDVETAKE 183 (387) T ss_pred HhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh------hcee---eecCC--ceeeeeeccCCccccccccccccc Confidence 00000001 112356787776656655555444422 1111 11223 345655433344544555543322 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+.+-++..-..++.+--..++++...-+..|-.+.+.++|++-+.+...+.++....|. ... T Consensus 184 ---~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--------g~~------ 246 (387) T protein:vir:96 184 ---LKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--------GLE------ 246 (387) T ss_pred ---cccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--------ccc------ Confidence 233333333444444555677777655566677788999998766665444444222110 000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i 236 (367) ..++ .++..........++.++++...+-.....-..++||+..+..|++. .++..+.. 
.- T Consensus 247 ----------~g~~-~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~-----~~~~~~~~~~~~~ 310 (387) T protein:vir:96 247 ----------HMSF-YNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISV-----LSNGTTNFFDTPA 310 (387) T ss_pred ----------ceee-eccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHH-----HhcCCCcccccCC Confidence 0000 00000001122347888888777655444456799999998876554 11222211 12 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) .+++|++|+++|+++..-- |.|. ...+.+. ...+...|+.. .|+..+..+.+| ++.|..|. T Consensus 311 ~~llG~PV~~~~~~~~~~~---GDf~-----~~~~~~~----~~~~~~~~~~~----~~~~~~~~~~r~Dg~v~~~~A~~ 374 (387) T protein:vir:96 311 EKVFGKPVVFTDAAVKPIV---GDFN-----YFGINYD----GTTYDTDKDVK----KGEYLFVLTAWYDQQRTLDSAFR 374 (387) T ss_pred ccccccceEEecCCCceee---echh-----hhhhhhh----hhhheeccccc----CCceEEEEEEEeCcEeechhheE Confidence 4789999999998864211 1111 1111110 11122233332 233333333222 22344444 Q ss_pred ecccccccccccccccccccccCCCC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) .-+- ..+.+..|| T Consensus 375 ~l~~-------------ka~~~~~~~ 387 (387) T protein:vir:96 375 IAKA-------------KENTGPLPS 387 (387) T ss_pred EEEe-------------ecCCCCCCC Confidence 3221 112334444 No 135 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=97.40 E-value=1.3e-05 Score=47.31 Aligned_cols=266 Identities=11% Similarity=0.048 Sum_probs=119.0 Q ss_pred CCCccccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q 
Consensus 1 Ma~~~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +...+.-| .=...++|+-|..-+.+...+.+-+.+ ...+ ...++ ..+|....-.+++.-+.|+..... T Consensus 115 ~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~------~~~~---~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~ 183 (387) T protein:vir:94 115 LHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE------KARL---TNIKG--LEIPRVSYTLDDDDFITDVETAKE 183 (387) T ss_pred HhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh------hcee---eecCC--ceeeeeeccCCccccccccccccc Confidence 00000001 112356787776656655555444422 1111 11223 345655433344544555543322 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) .+.+-++..-..++.+--..++++...-+..|-.+.+.++|++-+.+...+.++....|. ... T Consensus 184 ---~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--------g~~------ 246 (387) T protein:vir:94 184 ---LKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--------GLE------ 246 (387) T ss_pred ---cccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--------ccc------ Confidence 233333333444444555677777655566677788999998766665444444222110 000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i 236 (367) ..++ .++..........++.++++...+-.....-..++||+..+..|++. .++..+.. 
.- T Consensus 247 ----------~g~~-~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~-----~~~~~~~~~~~~~ 310 (387) T protein:vir:94 247 ----------HMSF-YNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISV-----LSNGTTNFFDTPA 310 (387) T ss_pred ----------ceee-eccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHH-----HhcCCCcccccCC Confidence 0000 00000001122347888888777655444456799999998876554 11222211 12 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) .+++|++|+++|+++..-- |.|. ...+.+. ...+...|+.. .|+..+..+.+| ++.|..|. T Consensus 311 ~~llG~PV~~~~~~~~~~~---GDf~-----~~~~~~~----~~~~~~~~~~~----~~~~~~~~~~r~Dg~v~~~~A~~ 374 (387) T protein:vir:94 311 EKVFGKPVVFTDAAVKPIV---GDFN-----YFGINYD----GTTYDTDKDVK----KGEYLFVLTAWYDQQRTLDSAFR 374 (387) T ss_pred ccccccceEEecCCCceee---echh-----hhhhhhh----hhhheeccccc----CCceEEEEEEEeCcEeechhheE Confidence 4789999999998864211 1111 1111110 11122233332 233333333222 22344444 Q ss_pred ecccccccccccccccccccccCCCC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) .-+- ..+.+..|| T Consensus 375 ~l~~-------------ka~~~~~~~ 387 (387) T protein:vir:94 375 IAKA-------------KENTGPLPS 387 (387) T ss_pred EEEe-------------ecCCCCCCC Confidence 3221 112334444 No 136 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=97.37 E-value=1.8e-05 Score=46.62 Aligned_cols=264 Identities=10% Similarity=0.032 Sum_probs=117.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q 
Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |..- .+.=.-.++|+-|..-+.....+.+.|.+-.-+ ...++ .++|....-.+.+.-+.|+.. . T Consensus 118 l~~~--t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v---------~~~~~--~~~p~~~~~~~~a~~v~E~~~---~ 181 (387) T protein:vir:93 118 LPTG--NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL---------TNIKG--LEIPRVSYTLDDDDFITDVET---A 181 (387) T ss_pred hccC--cCCCCceeechhHHHHHHHHHHhhchhhhheee---------eecCC--ceEEEEeecCCccccccCccc---c Confidence 2210 011123577877766666666555555321111 11223 234543222233444445433 2 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) +..+.+-++..-..++.+.-..++++...-+..|-...+.++|++-+.+...+.++....|. .... T Consensus 182 ~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--------g~p~------ 247 (387) T protein:vir:93 182 KELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--------GLDH------ 247 (387) T ss_pred cccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--------cccc------ Confidence 22233333333334444445567766555555667778888888766665444444222111 0000 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cch Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TIP 237 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i~ 237 (367) .++. ++..........++.++++...+......-..++||+..+.+|++. +++..+.. .-. 
T Consensus 248 ----------g~l~-~~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~-----~~d~~~~~~~~~~~ 311 (387) T protein:vir:93 248 ----------MSFY-NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISV-----LSNGTTNFFDTPAE 311 (387) T ss_pred ----------eeee-ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHH-----HhcCCCcccccCCc Confidence 0000 0000001112347888888877766555566899999988776543 12222211 114 Q ss_pred hhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEEEe---eeeeeee Q lcl|Aclame:pro 238 TYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIV---HPGGFNW 314 (367) Q Consensus 238 t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~---hp~G~s~ 314 (367) +++|++|+++|++|..- -|.|.-| .+.+. ....+|+.... .|...++.+.++-. .|.-|.. T Consensus 312 ~llG~PV~~~~~~~~~~---~GDf~~~-----~~~~~------~~~~~~~~~~~--~~~~~~~~~~r~d~~v~~~eA~~~ 375 (387) T protein:vir:93 312 KVFGKPVVFTDAAVKPI---VGDFNYF-----GINYD------GTTYDTDKDVK--KGEYLFVLTAWYDQQRTLDSAFRI 375 (387) T ss_pred cccccceEEecCCCcee---eeehhhh-----heehh------hheeeeccccc--CCceeEEEEeeeCceeechhheEE Confidence 78999999999886422 1222111 11110 12223332222 23344444433322 2333332 Q ss_pred cccccccccccccccccccccCCCC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) -. 
+ ..+.+..|+ T Consensus 376 l~--~-----------k~~~~~~~~ 387 (387) T protein:vir:93 376 AK--A-----------KENTGSLPS 387 (387) T ss_pred EE--e-----------ecCCCCCCC Confidence 21 1 122344455 No 137 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=97.32 E-value=1.5e-05 Score=47.11 Aligned_cols=266 Identities=9% Similarity=0.031 Sum_probs=118.6 Q ss_pred CCCccccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) ..-...-| .=.-.++|+-|..-+...+.+.+.|.+-.-+ ...++ ..+|....-.+++.-+.|+.... T Consensus 130 ~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v---------~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~- 197 (402) T protein:vir:93 130 LHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL---------TNIKG--LEIPRVSYTLDDDDFITDVETAK- 197 (402) T ss_pred HhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhcee---------eecCC--ceeeeeeccCCcccccccccccc- Confidence 00000000 0013467777766666655555555321111 11223 34565543333444455544322 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~ 159 (367) ..+.+-++.....++.+--..++.+...-+..|-...|.++|++-+.+...+.++....|+ + .... 
T Consensus 198 --~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~------g--~p~g---- 263 (402) T protein:vir:93 198 --ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS------G--LEHM---- 263 (402) T ss_pred --ccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc------c--ccce---- Confidence 2223323333344444444566766555556677788999998766665444444322211 0 0000 Q ss_pred hhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cc Q lcl|Aclame:pro 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TI 236 (367) Q Consensus 160 ~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i 236 (367) .... -.+...+ ....++.|+++...+-.....-..++||+.++..|++. .++..+.+ .- T Consensus 264 -------~~~~--~~~~~~~----~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~-----~~d~~~~~~~~~~ 325 (402) T protein:vir:93 264 -------SFYN--GSVKEVE----GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISV-----LSNGTTNFFDTPA 325 (402) T ss_pred -------eeec--ccccccc----ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHH-----HhcCCCcccccCC Confidence 0000 0011111 12346788888776655444556799999998876654 12222211 12 Q ss_pred hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeee Q lcl|Aclame:pro 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFN 313 (367) Q Consensus 237 ~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s 313 (367) .+++|++|+++|+++..-- |.|. ..-+.+ ....+...|++.+ |+..++...++ ++.|..|. 
T Consensus 326 ~~llG~PV~~t~~~~~i~~---GDf~-----~~~~~~----~~~~~~~~~~~~~----~~~~~~~~~r~Dg~v~~~~A~~ 389 (402) T protein:vir:93 326 EKVFGKPVVFTDAAVKPIV---GDFN-----YFGINY----DGTTYDTDKDVKK----GEYLFVLTAWYDQQRTLDSAFR 389 (402) T ss_pred ccccccceEEecCCCceee---echh-----hhhhhh----hhhhhhhhhcccC----CceEEEEEEEeCcEEechhheE Confidence 4789999999998864221 1111 001111 1112233344332 33333333332 22344444 Q ss_pred ecccccccccccccccccccccCCCC Q lcl|Aclame:pro 314 WLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) ...-. ...+..|| T Consensus 390 ~l~ik-------------~~~~~~~~ 402 (402) T protein:vir:93 390 IAKAK-------------ENTGPLPS 402 (402) T ss_pred EEEee-------------cCCCCCCC Confidence 32211 12334444 No 138 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=96.99 E-value=0.00019 Score=41.02 Aligned_cols=286 Identities=9% Similarity=-0.003 Sum_probs=123.6 Q ss_pred CCCcccccccee----ccchHHHHHHHhhhhHHhhhHhhcccc-cccHHHHHHhhCCCceEEeeeec----cCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVD----AVIPEVYTSYTAIDRPELTAFFLSGAV-ASNDFLSQFLSAPGRLINIPFWR----DLDSLEPNY 71 (367) Q Consensus 1 Ma~~~~~T~l~d----~i~PEVf~~yv~~~~~~~~~f~~SGi~-~~~~~l~~~~~~~G~~i~~P~~~----~l~g~~~~~ 71 (367) +-.....+..+| .+.||.+..++ ..+.+.+.|.+-.-+ .+ ..+.+..++.-+ -..|.. .. 
T Consensus 12 ~~~~~k~~t~~d~~Gg~l~P~~~~~~i-~~~~e~s~~l~~~~vi~~---------~~~~~~~i~~~g~~~~~~~g~~-~~ 80 (315) T protein:vir:41 12 PFEIVPKIDVPDLGRGVLSVDRFGEFV-KAVRDSAVIIPEARIDNA---------LKSYEKDISRLSLVLDVGPGRD-ET 80 (315) T ss_pred hhhhhhhcCCcCCCCceechHHHHHHH-HHHHhhhhhhhhceeeec---------cccccccccccccCcccccccc-cc Confidence 111111122344 37899988766 556666666542221 11 112233333211 111111 01 Q ss_pred CCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhh--cccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 72 GSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELA--GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAG 149 (367) Q Consensus 72 ~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~--g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~ 149 (367) .+.++ .+..+.+-++..-..++..--+.+++....-+ +.|....+.+++++-+.++.+..++ .| +..+. T Consensus 81 ~~~~~---~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~---nG---dg~s~ 151 (315) T protein:vir:41 81 GQKLA---PPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYL---HG---DTSSS 151 (315) T ss_pred cCcCC---CCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhh---cc---CCcCc Confidence 11111 11111122221122222222356766665544 4588889999999888888776555 23 11111 Q ss_pred hhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC---ceeEEEEccHHHHHHHhcchh-- Q lcl|Aclame:pro 150 NFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG---SIAAIAVHSMVYKRMTNNDEI-- 224 (367) Q Consensus 150 ~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~---~l~~~vmhS~v~~~L~k~~li-- 224 (367) +. +.+....... ....++.... .++ +...+..+.|++..+.+-.... 
.-.+++||..++..+++...- T Consensus 152 ~p-~~~~~~G~l~----~a~~~~~~~~-~~~-~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g 224 (315) T protein:vir:41 152 DP-LLRMSDGWLK----LASEKLTESD-VDP-EAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRE 224 (315) T ss_pred Cc-ccccccccee----cccccccccc-ccc-ccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCC Confidence 11 0000000000 0011111100 111 1223567778887776654322 234799999999998886321 Q ss_pred hhccc-ccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeEEEEEc Q lcl|Aclame:pro 225 EFIPD-SKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLEYILER 302 (367) Q Consensus 225 ~~~~~-~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r 302 (367) .++-+ .-..-.-.+++|++|+..+.||..+.+.. .++|+.-. +.|+... ...++.+|++..+ ..-++.| T Consensus 225 ~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~----~ilf~d~~nl~~~~~~-~i~i~~~~~a~~~----~~~~~~~ 295 (315) T protein:vir:41 225 TGLGDQALTGANSILYDGRPVQYVPALEALNDGKS----RALFVVPTQLVYGFWR-NIKVVPDYDAEMR----LTKYVAS 295 (315) T ss_pred CccccchhhcCCCceecccceEecccccccCCCCc----cEEEecccceEEEecc-ccEEEeeecCCCC----ceEEEEE Confidence 12211 11111235799999999999987653321 24554432 2232222 2334445554332 2223333 Q ss_pred cEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceee Q lcl|Aclame:pro 303 KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERV 352 (367) Q Consensus 303 ~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v 352 (367) .++- .+|-|.++.+++ +-+| T Consensus 296 ~r~d---~~~~~~~~~a~~---------------------------~~~v 315 (315) T protein:vir:41 296 LRTD---NHYEDEEGAVSA---------------------------TITV 315 (315) T ss_pred EEec---eeEEeccceeEe---------------------------eeeC Confidence 2221 122333332221 1111 No 139 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=96.93 E-value=0.00025 Score=40.29 Aligned_cols=312 Identities=10% Similarity=0.030 
Sum_probs=128.1 Q ss_pred CCCcccc-ccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHh-hCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQ-VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFL-SAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~-T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~-~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) || |.. | ++||++..-..+.+.+.+-| ...+-.+-+ .+.. ++-|++|+||.-... ...++.... .. T Consensus 1 MA--N~llT-----~iP~iia~~al~~l~~~lV~--~~lV~r~y~-ge~~~a~~GDTV~I~~p~~~--~v~d~~~~~-~~ 67 (423) T protein:vir:35 1 MA--NNLES-----NISQIVLKKFLPGFMSDIVL--CKTVDRQLL-SGEINSNTGDSVSFKRPHQF--KSERTETGD-IT 67 (423) T ss_pred Cc--cchhh-----hhHHHHHHHHHHHHHhhccc--chhcccCCC-cccccccCCCEEEEeeCCcc--eeecccCcC-CC Confidence 99 432 3 57888877666666555444 233333322 2221 245999999966543 223332111 12 Q ss_pred cccccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHH-HHhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVG-VYKSNLAGNFATIKT 156 (367) Q Consensus 79 ~~t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~G-vf~~~~a~~~~~~~~ 156 (367) .+.++.++..+...++ +...-++..+|+-..+.-.|..+.+..| +....++.+..|++.+.. .. T Consensus 68 ~~~~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a-~~ala~~vd~~l~~~l~~~a~------------- 133 (423) T protein:vir:35 68 GKDKNGLFSAKATGKVGKYITVAVEWTQIEEALKLNQLDQILSPI-HERMVTDLETELAHFMMNNGA------------- 133 (423) T ss_pred CccccccccceeeEEeccceeccceeCHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc------------- Confidence 2344445444322222 2234456777777666666654333333 344555566666654421 11 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHHHHhcchhhhcccccc-- Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNNDEIEFIPDSKG-- 232 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~L~k~~li~~~~~~~g-- 232 (367) +.+- + .+.. 
.-.++.+.+|..+|.+..= .=+.++|.|..+..|.+..- .+.....+ T Consensus 134 --------------~~vg-t--~~t~--~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~-~~~~~~~~~~ 193 (423) T protein:vir:35 134 --------------LSLG-S--PNTA--IKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQS-GLHAADQLVR 193 (423) T ss_pred --------------cccc-c--ccCC--cchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhcccc-ceeccccchh Confidence 1100 0 1111 1236889999988865421 23789999999999886521 11111111 Q ss_pred ----cccc-hhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcc--eeeeeehhhcCCceeEEEEEccEE Q lcl|Aclame:pro 233 ----QLTI-PTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVP--VAVGRRELRGNGSGLEYILERKEW 305 (367) Q Consensus 233 ----~~~i-~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~--~e~~rd~~~~~~~g~~~l~~r~~~ 305 (367) +-.| +.+.|+.|..|..+|....+.... .....++-......+... ....+....- .+.+++..=..+ T Consensus 194 ~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~---~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~--~~~g~l~~GD~~ 268 (423) T protein:vir:35 194 TAWENAQISGNFGGIRALMSNGLASRKQGDFDG---AITVKTAPNVDYLSVKDSYQFTVALTGATP--SKTGFLKAGDQL 268 (423) T ss_pred HHHhhccceeeecceEEEEcCCCcccccccccc---ceeeccccccccccccccccceeeeeeeee--ccCCcEEecceE Confidence 1124 789999999999999654332111 111111111000010000 0000000000 001111111111 Q ss_pred Eeeeeeeeeccc-----------------ccccccccccccccccccCCCC------------------hHHhc-----C Q lcl|Aclame:pro 306 IVHPGGFNWLDA-----------------DVTIPDNTGSPSGITSGPPAIT------------------LANLA-----N 345 (367) Q Consensus 306 ~~hp~G~s~~~~-----------------~~~~~~~~~~~~~~~~~~~sPt------------------~a~L~-----~ 345 (367) -..|+.|... .+++-.++.++.. .+-..+|. +..+- . 
T Consensus 269 --t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~-~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a 345 (423) T protein:vir:35 269 --KFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGD-VTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTA 345 (423) T ss_pred --EeeeeeeccccccceeecccCCceeEEEEeccccccccCc-eeEEccccccccCCCcccccccccccCCceeeeeecC Confidence 1123333100 0000000000000 00111121 11111 0 Q ss_pred Cccc--eeeecccccceEEEEe-------------cC Q lcl|Aclame:pro 346 PDNW--ERVTYRKNVPMAFLVT-------------KG 367 (367) Q Consensus 346 ~~NW--~~v~d~K~i~iv~~~t-------------~g 367 (367) +++. +++|.+.+++++..-= +| T Consensus 346 ~~~~~~nl~~~~~a~~l~~~~l~~~~~~~~~~~~~~g 382 (423) T protein:vir:35 346 KQQMKPNLFYNKFFCGLGTIPLPKLHSLDSAVATYEG 382 (423) T ss_pred CCceeEEEeecCceeEEEEEccccCCccceeeccccC Confidence 1111 1244444444433210 11 No 140 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=96.85 E-value=0.0003 Score=39.89 Aligned_cols=294 Identities=16% Similarity=0.125 Sum_probs=143.6 Q ss_pred CCCccccccce-----------------eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeecc Q lcl|Aclame:pro 1 MPDFNNQVRLV-----------------DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRD 63 (367) Q Consensus 1 Ma~~~~~T~l~-----------------d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~ 63 (367) |++.| +-++. +++. |+|..=|.....+.+.| .+.-...+ + .+|+++.+|+.+. 
T Consensus 1 ~~~~~-~~~~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~------~~~~~~rt-i-~~Gksv~f~~iG~ 70 (375) T protein:vir:10 1 MANAN-QVALGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIA------RDLVTKRT-L-KNGKSLQFIYTGR 70 (375) T ss_pred Ccccc-ccccCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhh------hccccccc-c-ccCceEEEEeeee Confidence 55544 22222 3444 55655554444444443 33322222 1 4799999998885 Q ss_pred CCCcccccCCCCcc-------ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHH Q lcl|Aclame:pro 64 LDSLEPNYGSDNPN-------VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRII 136 (367) Q Consensus 64 l~g~~~~~~~~~~~-------~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~ll 136 (367) .. ...+..++++ ...+...|+-.+. .--.+.+.|+-..-.-.|.+.++.++.+..-+++.++.++ T Consensus 71 ~t--~~~~t~G~~i~~~~~~d~~~te~~l~ID~~------~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~ 142 (375) T protein:vir:10 71 MT--SSFHTPGTPILGNADKAPPVAEKTIVMDDL------LISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIF 142 (375) T ss_pred eE--EeeecCCcCcCCccccCCCCCceEEEecch------hhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHH Confidence 42 2222211111 0111112332222 1235678899999999999999999999999998888777 Q ss_pred HHHH-HHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccch--hhcccHHHHHHHHHHhccc--cCceeEEEEc Q lcl|Aclame:pro 137 AMAV-GVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPA--DAVFNREAFVDAAFTMGDH--VGSIAAIAVH 211 (367) Q Consensus 137 a~l~-Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a--~~~~s~~~l~~A~~~~GD~--~~~l~~~vmh 211 (367) ..+. +.-........ .........+.+.+..+.++ .+.--++.|.+|.++|-+. ...=..++|. 
T Consensus 143 ~~l~kaa~~~~p~~~~-----------~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~ 211 (375) T protein:vir:10 143 RSITRGARSASPVSAT-----------NFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLN 211 (375) T ss_pred HHHHHhhhhccccccc-----------cccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeC Confidence 6553 33221110000 00111223333433333211 1112245566666666543 2234789999 Q ss_pred cHHHHHHHhc----chhhhcccccc---cccchhhcCcEEEEeCCCcccCC----------------------------- Q lcl|Aclame:pro 212 SMVYKRMTNN----DEIEFIPDSKG---QLTIPTYMGKVVIVDDGMPVFGT----------------------------- 255 (367) Q Consensus 212 S~v~~~L~k~----~li~~~~~~~g---~~~i~t~~G~~VivdD~~pv~~t----------------------------- 255 (367) |.+|..|.+. .+++..-..++ +..+..+.|.+|+.+..+|..+. T Consensus 212 P~~y~~Ll~~~d~~~~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~ 291 (375) T protein:vir:10 212 PRQYYALIQDIGSNGLVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENAN 291 (375) T ss_pred hHHHHHHHhcCCccceeeecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcce Confidence 9999999876 23332211111 22466789999999999995421 Q ss_pred ---CCCceEE----------EEEEecceeeeeccCCCcceee---eeehhhcCCceeEEEEEccEEEeeeeeeeeccccc Q lcl|Aclame:pro 256 ---GADKTYL----------SILFGGAAFGYADGAPQVPVAV---GRRELRGNGSGLEYILERKEWIVHPGGFNWLDADV 319 (367) Q Consensus 256 ---~~~~~yt----------tyl~~~GAi~~~~~~~~~~~e~---~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~ 319 (367) +..++|. ..+|-+-|.+....-+. .+|+ +|+... -.+.+..| +.+.-.+..|.-+-. 
T Consensus 292 ~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~-~~~~~~~~~~~~~----q~~~i~~~--~a~G~~~lrp~~av~ 364 (375) T protein:vir:10 292 ATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGP-QVQVTNGDVSVIY----QGDVILGR--MAMGADYLNPAAAVE 364 (375) T ss_pred eeccccccccccccccCceEEEEEchhheeeeeeecc-ccccccchhhhee----eeeeeeee--eeeccCccCceeEEE Confidence 0111221 24555555554332221 1222 123222 12444444 455555556654332 Q ss_pred ccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 320 TIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 320 ~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .+- |--++++- T Consensus 365 l~~---------------~~~~~~~~ 375 (375) T protein:vir:10 365 LYI---------------GATAPSAF 375 (375) T ss_pred Eec---------------CcCccccC Confidence 210 00111111 No 141 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=96.72 E-value=5.2e-05 Score=44.05 Aligned_cols=314 Identities=14% Similarity=0.035 Sum_probs=156.2 Q ss_pred CCCccccccceeccchHH---HHHHHhhhhHHhh----hHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEV---YTSYTAIDRPELT----AFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEV---f~~yv~~~~~~~~----~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) ||. |.. ..=+|.. +..-+.....+++ +|...|==.+-..+..|-.++|++|+++.-..|.|+. 
+.+ T Consensus 1 Ma~----T~~-~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~g--v~G 73 (364) T protein:vir:93 1 MSQ----TVI-PFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKP--TYG 73 (364) T ss_pred Cce----ecc-CcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCC--ccc Confidence 985 322 2245653 3333334444444 3443222222222333445789999999999997643 223 Q ss_pred CCccccccccccchhhhhhhhhHhhcccch-hHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhh-- Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGA-MDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGN-- 150 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~r~kg~~~-tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~-- 150 (367) +.. -+---+.|+-..+..+|-....+... ...+..-+--|...+..++++.||.+..+..++-.|.|.-+...... T Consensus 74 d~~-leGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~ 152 (364) T protein:vir:93 74 DAR-VEGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIET 152 (364) T ss_pred Cce-eeccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 222 12334567777777777666666643 23445555567778889999999999999988888888533221100 Q ss_pred hhhhhhhhhhhhhhhcchhhcceeecCccc----chhhcccHHHHHHHHHH---hccc-------------cCceeEEEE Q lcl|Aclame:pro 151 FATIKTRGRVPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFT---MGDH-------------VGSIAAIAV 210 (367) Q Consensus 151 ~~~~~~~~~~~a~~~~~~~~~v~disa~t~----~a~~~~s~~~l~~A~~~---~GD~-------------~~~l~~~vm 210 (367) ..+... +.. .......++++--..++. +++-.++.+.+-+|... +|-. 
++..-+++| T Consensus 153 ~~~~~~--~~N-~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l 229 (364) T protein:vir:93 153 PDFTGY--AGN-PLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVM 229 (364) T ss_pred cCcccc--ccc-ccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEE Confidence 000000 000 001111222222222221 12346888888888764 3311 124569999 Q ss_pred ccHHHHHHHhc---chhhhcccc---ccc------ccchhhcCcEEEEeCCCcccC--CCCCce--EEEEEEecceeeee Q lcl|Aclame:pro 211 HSMVYKRMTNN---DEIEFIPDS---KGQ------LTIPTYMGKVVIVDDGMPVFG--TGADKT--YLSILFGGAAFGYA 274 (367) Q Consensus 211 hS~v~~~L~k~---~li~~~~~~---~g~------~~i~t~~G~~VivdD~~pv~~--t~~~~~--yttyl~~~GAi~~~ 274 (367) |+-.++.|+.. +.+++.+.. .|. -.++.|+|+.|.---.++-.. .....+ -.++|+|.=|.+++ T Consensus 230 ~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a 309 (364) T protein:vir:93 230 SEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) T ss_pred cchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEE Confidence 99999999964 356666653 221 147889999776544443221 111222 34688887775554 Q ss_pred ccCC--Ccc--eeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcC Q lcl|Aclame:pro 275 DGAP--QVP--VAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLAN 345 (367) Q Consensus 275 ~~~~--~~~--~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~ 345 (367) .+.. ..+ .|-..|-... ..+.+ ..++...-..|.+... .. ..=||-+.+-. 
T Consensus 310 ~g~~~g~~~~w~Ee~~D~gn~----~~i~~---~~i~G~kK~rF~~~Df------Gv-------i~idtaa~~~~ 364 (364) T protein:vir:93 310 YGTANGLRFDWEETVKDYGNE----PAIAA---GFIAGMKKARFNNKDF------GV-------ISIDTAAKKHS 364 (364) T ss_pred eecCCCCCceeeecccCCCCc----hhhhh---hhHhhhhhcccCCccc------eE-------EEecccccccC Confidence 3332 122 2222232222 11111 1223333333322110 00 01122222222 No 142 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=96.71 E-value=0.00015 Score=41.52 Aligned_cols=264 Identities=10% Similarity=0.017 Sum_probs=118.6 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |-. ..+.=...++|+-+..-+...+.+.+.+.+ ...+ .+.+|. .+|....-.+.+.-+.|+.... T Consensus 83 l~~--~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~------~~~v---~~~~~~--~~p~~~~~~~~a~~v~E~~~~~-- 147 (352) T protein:vir:78 83 LPT--GNDSGGDKLLPKTLSKEIVSEPFAKNQLRE------KARL---TNIKGL--EIPRVSYTLDDDDFITDVETAK-- 147 (352) T ss_pred hcc--CCCCCCceeccHhHHHHHHHHHHhhcchhh------heee---EecCCc--eEEEEecCCCcccccccccccc-- Confidence 211 001112346777665555555554444422 1111 112332 4554433334455555554322 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ..+.+-++..-..++.+--..+++....-+..|-.+.+.++|++-+.+.....++..-.|. ...... 
+ T Consensus 148 -~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--------~~~~g~---l 215 (352) T protein:vir:78 148 -ELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--------GLEHMS---F 215 (352) T ss_pred -cccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCC--------cccccc---e Confidence 2333333333444444555667776555556677788999998766655444343211110 000000 0 Q ss_pred hhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcccccccc---cch Q lcl|Aclame:pro 161 PAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQL---TIP 237 (367) Q Consensus 161 ~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~~g~~---~i~ 237 (367) .... +...+ +.-.++.+.++...+-.....-.+++||+..+..|++.. + ...+.+ .-. T Consensus 216 -------~~~~---~~~~t----~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~--~---~~~~~~~~~~~~ 276 (352) T protein:vir:78 216 -------YNGS---VKEVE----GANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVL--S---NGTTNFFDTPAE 276 (352) T ss_pred -------eccc---ccccc----ccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHH--h---ccCCcccccCCc Confidence 0000 00111 112367888887766554444577999999998876641 1 111111 113 Q ss_pred hhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Eeeeeeeee Q lcl|Aclame:pro 238 TYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IVHPGGFNW 314 (367) Q Consensus 238 t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~hp~G~s~ 314 (367) +++|++|+++|+++..-- |.|..| .+.+ ....++..++.. 
.|+..+..+.++ +++|.-|.- T Consensus 277 ~llG~PV~~~~~~~~~~~---Gdf~~~-----~~~~----~~~~~~~~~~~~----~g~~~f~~~~r~Dg~~~~~eA~~~ 340 (352) T protein:vir:78 277 KVFGKPVVFTDAAVKPIV---GDFNYF-----GINY----DGTTYDTDKDVK----KGEYLFVLTAWYDQQRTLDSAFRI 340 (352) T ss_pred cccccceEEecCCCceeE---eehhhh-----hhhh----hhheeeeecccc----CCeeEEEEEeeeCceeechhheEE Confidence 789999999998864211 111111 1111 111233333332 234444433333 334444432 Q ss_pred cccccccccccccccccccccCCCC Q lcl|Aclame:pro 315 LDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) - + -..+++.-|+ T Consensus 341 l--~-----------~~a~~~~~~~ 352 (352) T protein:vir:78 341 A--K-----------AKESTGSLPS 352 (352) T ss_pred E--E-----------eecccCCCCC Confidence 2 1 1223444555 No 143 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=96.59 E-value=0.00049 Score=38.74 Aligned_cols=316 Identities=8% Similarity=-0.036 Sum_probs=147.4 Q ss_pred CCCccccccce------eccc-hHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MPDFNNQVRLV------DAVI-PEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma~~~~~T~l~------d~i~-PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) |..+|..|+-. +.-. =|+|.-=|.....+++.| .+.-.+++ + .+|+++.+|+.+... 
...+.- T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~------~~~~~~rt-i-~~gkS~q~~~iG~~~--~~~~~~ 70 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENL------LQWFDVQE-V-VGTNSVSNKYIGETE--LQVLSP 70 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhh------cCcceeee-e-cccceEEeeeeeeeE--Eeeecc Confidence 98888777531 1111 134433343333333344 22222221 1 489999999987542 111111 Q ss_pred CCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhccc-HHHHHHHHHHHHHhhhhhHHHHHHHHHHH-hhhhhhh Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSN-PMTRIRNRFGVYWTRQWQRRIIAMAVGVY-KSNLAGN 150 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~D-Pm~~i~~qia~yw~~~~q~~lla~l~Gvf-~~~~a~~ 150 (367) ++ .+-+..+...+.+-+|=. .--...+.|+-....--| +=.+++++.+..-++..+..++..++... +...... T Consensus 71 G~---~ld~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~ 147 (364) T protein:vir:10 71 GK---SPDASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIR 147 (364) T ss_pred Cc---ccCCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 11 111222222221111110 011234677776666666 55688888887777777776665443221 1100000 Q ss_pred hhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccH----HHHHHHHHHhccc--cCceeEEEEccHHHHHHHhc-ch Q lcl|Aclame:pro 151 FATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNR----EAFVDAAFTMGDH--VGSIAAIAVHSMVYKRMTNN-DE 223 (367) Q Consensus 151 ~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~----~~l~~A~~~~GD~--~~~l~~~vmhS~v~~~L~k~-~l 223 (367) .. .....+...+++...+. ....++ +.|.+|.+.|.+. ...=..++|.+.+|..|.+. 
+| T Consensus 148 ~~-----------~~~~~~g~~i~~~~~a~--~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~l 214 (364) T protein:vir:10 148 KN-----------PRVAGHGFSIHIVGLAS--SFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRI 214 (364) T ss_pred cC-----------CcccCCcceeeecccCc--chhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCcc Confidence 00 00000011112221111 122333 4455677777543 33448999999999998887 56 Q ss_pred hh--hcccccc---cccchhhcCcEEEEeCCCcccCC-------------------------CCCceEEEEEEecceeee Q lcl|Aclame:pro 224 IE--FIPDSKG---QLTIPTYMGKVVIVDDGMPVFGT-------------------------GADKTYLSILFGGAAFGY 273 (367) Q Consensus 224 i~--~~~~~~g---~~~i~t~~G~~VivdD~~pv~~t-------------------------~~~~~yttyl~~~GAi~~ 273 (367) ++ |.....+ .-.+...+|.+|+.+..+|.... +...+....+|-+-|++. T Consensus 215 vn~d~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~t 294 (364) T protein:vir:10 215 VDKSYTIAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLV 294 (364) T ss_pred ccccccccCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEE Confidence 64 4322222 23577899999999999995311 011244567888888887 Q ss_pred eccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCC-CChHHhcCCccceee Q lcl|Aclame:pro 274 ADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA-ITLANLANPDNWERV 352 (367) Q Consensus 274 ~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~s-Pt~a~L~~~~NW~~v 352 (367) ....+. ..|+.|+... -.+.+..++ .+......|.-+-+..-. .+++.. 
--++-|+-+ |-+ + T Consensus 295 v~~~~~-t~e~~~~~~~----~~~~ida~~--a~G~g~lRPeaa~~i~~~--------~~~~~~~~~~~~~~~~-~~~-~ 357 (364) T protein:vir:10 295 GRTISI-TGDIFYEKKE----KTWYIDTFL--AEGAIPDRWEAVAVVTAA--------DTAELATDHNAILARA-NRK-V 357 (364) T ss_pred EEEecc-eeeeeeccce----eeeeeeeeh--cccCcccCccceEEEEec--------CCCCCccchhhhhhhc-ccc-E Confidence 776643 3555555443 224444443 344444555444332100 011111 111223222 211 1 Q ss_pred ecccccceEEEEecC Q lcl|Aclame:pro 353 TYRKNVPMAFLVTKG 367 (367) Q Consensus 353 ~d~K~i~iv~~~t~g 367 (367) +..||-. T Consensus 358 --------~~~~~~~ 364 (364) T protein:vir:10 358 --------TLTKSVN 364 (364) T ss_pred --------EEEEecC Confidence 1222212 No 144 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=96.47 E-value=0.0006 Score=38.27 Aligned_cols=314 Identities=11% Similarity=0.016 Sum_probs=128.2 Q ss_pred CCCccc-cccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHh-hCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNN-QVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFL-SAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~-~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~-~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) || |. .| +.||++..-..+.+.+.+-|.+ .+-.+- ..+.. ++-|++|+||.-.+.. ..++...+. . 
T Consensus 1 Ma--N~llT-----~ip~iia~~al~~l~~~lV~~~--lVnr~y-~~e~~~~k~GDTV~I~~p~~~~--~~~~~~~~~-~ 67 (423) T protein:vir:17 1 MP--NNLDS-----NVSQIVLKKFLPGFMSDLVLAK--TVDRQL-LAGEINSSTGDSVSFKRPHQFS--SLRTPTGDI-S 67 (423) T ss_pred Cc--cchhh-----hhHHHHHHHHHHHHHhhcccch--hhcccC-CcchhhcccCCEEEEeeCCcce--eecccCccc-C Confidence 98 43 23 5788887767666666554422 232222 11111 2469999998655432 222221110 1 Q ss_pred cccccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 79 ~~t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .++++.++..+...++ +....++..+|+-..+.-.|. .++.++-..-.+++.+..|++.+.+. +. T Consensus 68 ~~~~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~-a~------------ 133 (423) T protein:vir:17 68 GQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNN-GA------------ 133 (423) T ss_pred CcccCccccceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhc-cc------------ Confidence 1233444443322222 112334555555554444442 33333323334455555666554321 00 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHHHHhcc-hhhh-cccccc- Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNND-EIEF-IPDSKG- 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~L~k~~-li~~-~~~~~g- 232 (367) +. .+..+... -.++.+.++..+|.+..= .=+.++|.|..++.|.+.. .+.. ....+. 
T Consensus 134 -------------~~---~gt~~t~~--~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~a 195 (423) T protein:vir:17 134 -------------LS---LGSPNTPI--TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTA 195 (423) T ss_pred -------------cc---cccCCccc--ccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHH Confidence 00 11111111 237889999888865422 2367899999999988653 2211 111111 Q ss_pred --cccc-hhhcCcEEEEeCCCcccCCCCCceEEEEEE-----ecceeeeeccCCCc--ceeeeeehhhcCCceeEEEEEc Q lcl|Aclame:pro 233 --QLTI-PTYMGKVVIVDDGMPVFGTGADKTYLSILF-----GGAAFGYADGAPQV--PVAVGRRELRGNGSGLEYILER 302 (367) Q Consensus 233 --~~~i-~t~~G~~VivdD~~pv~~t~~~~~yttyl~-----~~GAi~~~~~~~~~--~~e~~rd~~~~~~~g~~~l~~r 302 (367) +-.| +.+.|+.|..|..+|..+.+..+. +... .+++...+...... ...+.++...-+ -| |.+--- T Consensus 196 lr~g~i~G~i~GFdvy~Snnip~~T~gt~~~--t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~-~G-D~~t~a 271 (423) T protein:vir:17 196 WENAQIPTNFGGIRALMSNGLASRTQGAFGG--TLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLK-AG-DQVKFT 271 (423) T ss_pred HhhccceeeecceEEEEeCCCccccccceec--eeeecccccccccccccccceeeeeeeeeeeccCcee-ec-ceEEec Confidence 1235 689999999999999654322211 1111 11221111111110 011111111000 01 111111 Q ss_pred cEEEeeeeee------------eecccccccccccccccccccccCCCC-------------hHHhcCCcccee------ Q lcl|Aclame:pro 303 KEWIVHPGGF------------NWLDADVTIPDNTGSPSGITSGPPAIT-------------LANLANPDNWER------ 351 (367) Q Consensus 303 ~~~~~hp~G~------------s~~~~~~~~~~~~~~~~~~~~~~~sPt-------------~a~L~~~~NW~~------ 351 (367) -.+.+||.-. .|. +++..++..+.. .+-..+|. .+..++++.|+. 
T Consensus 272 Gv~~v~~~tk~v~~~~~t~~~~~~~---v~~~~~~~a~~~-~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~ 347 (423) T protein:vir:17 272 NTYWLQQQTKQALYNGATPISFTAT---VTADANSDSSGD-VTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQ 347 (423) T ss_pred ceeeecccccccccccccccceEEE---EEecccccccCc-eEEEecCccccccCCcccccceecccCCceeeccccccC Confidence 2233333322 221 111000000000 00111111 133444444443 Q ss_pred ------eecccccceEEEE-------------ecC Q lcl|Aclame:pro 352 ------VTYRKNVPMAFLV-------------TKG 367 (367) Q Consensus 352 ------v~d~K~i~iv~~~-------------t~g 367 (367) +|.+.+++++..- .+| T Consensus 348 t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g 382 (423) T protein:vir:17 348 TMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEG 382 (423) T ss_pred CeeEEEEecCcceEEEEEcccCCCccceeecccCC Confidence 3555555554321 011 No 145 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=95.95 E-value=0.00092 Score=37.24 Aligned_cols=300 Identities=13% Similarity=0.038 Sum_probs=121.1 Q ss_pred CCC-ccccccce--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MPD-FNNQVRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~-~~~~T~l~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) ... ....+..+ ..++||-+...+...+.+.+.|.+..-+.+. .| +..+|.-.... 
.+.-++|+.+ T Consensus 144 ~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~---------~g-~~~~~~~~~~~-~a~wv~E~~~- 211 (466) T protein:vir:80 144 VRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPL---------KG-TARQNIAGAIP-EGVWTEAVAN- 211 (466) T ss_pred HHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeec---------Cc-eeEeeeecCCc-ceeecccccc- Confidence 000 01112222 3678887777777766666655432222221 22 34555444332 2222333332 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-----HHHHHhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-----AVGVYKSNLAGNFA 152 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-----l~Gvf~~~~a~~~~ 152 (367) ++..+.+=++.....++.+.-+.+++....-+..|-.+.+.+++++-..+..+..+|.= =.|+++........ T Consensus 212 --~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~ 289 (466) T protein:vir:80 212 --LNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQP 289 (466) T ss_pred --cccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccc Confidence 22222222233344455555567777777667777788888888876666666554430 01222211000000 Q ss_pred hhhhhhhhhhhhhcchhhcceeecCccc---chhhcccHHHHHHHHH----HhccccCceeEEEEccHHHHHHHhcchhh Q lcl|Aclame:pro 153 TIKTRGRVPAEVLGTAGDMVIDISGQTN---PADAVFNREAFVDAAF----TMGDHVGSIAAIAVHSMVYKRMTNNDEIE 225 (367) Q Consensus 153 ~~~~~~~~~a~~~~~~~~~v~disa~t~---~a~~~~s~~~l~~A~~----~~GD~~~~l~~~vmhS~v~~~L~k~~li~ 225 (367) ......+....+++.... .....-.+..+.+... +..-.......|+||+.++..|++..+.. 
T Consensus 290 ----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~ 359 (466) T protein:vir:80 290 ----------PNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITF 359 (466) T ss_pred ----------cccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccc Confidence 000000000111111000 0000011112222222 12333445667999999999988775321 Q ss_pred hcccccccc-----cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEE Q lcl|Aclame:pro 226 FIPDSKGQL-----TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYIL 300 (367) Q Consensus 226 ~~~~~~g~~-----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~ 300 (367) .+.|.. .-..++|++|++++.||...- --|.+..|.+ ...+ .+++.+.....-..++..+. T Consensus 360 ---~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~~~-~~g~~~~y~i-------~~r~---~~~i~~~~~~~f~~d~~~~r 425 (466) T protein:vir:80 360 ---NSAGALVASLNNTMPIVGGDIVILDFIPDNDI-IGGYGSLYLL-------AERA---DIKLAQSEHVRFIEDQTVFK 425 (466) T ss_pred ---cCCccccccCCCcccccccceeecCccCccce-eeeccccEEE-------Eeec---ceEEEechhhhhhcCcEEEE Confidence 112211 112478999999999985320 0011111222 1111 23333333332223444555 Q ss_pred EccEEE---eeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceE Q lcl|Aclame:pro 301 ERKEWI---VHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMA 361 (367) Q Consensus 301 ~r~~~~---~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv 361 (367) ...++- ++|..|.-.+-+-.+|.. 
+...-|+-++ .|=| T Consensus 426 ~~~r~dg~~~~~~afv~~~~~~~~~~~--------~~~~~~~~~~---------------~~~~ 466 (466) T protein:vir:80 426 GTARYDGKPVFGEGFVAVNIANANPTT--------SITFAPDEAN---------------VPEV 466 (466) T ss_pred EEEEEccEEeccCceEEEEecCCCccc--------ceeeecCcCc---------------CCCC Confidence 444442 333333321111001110 1111111111 1111 No 146 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=95.72 E-value=0.0016 Score=35.97 Aligned_cols=316 Identities=10% Similarity=0.021 Sum_probs=125.4 Q ss_pred CCCccc-cccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHh-hCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNN-QVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFL-SAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~-~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~-~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) || |. .| +.||++..-+...+.+.+-| +..+-.+- ..+.. ++-|++|+||.-.+.. ..++... +.. T Consensus 1 Ma--N~llT-----~~p~iia~~aL~~l~~~lV~--~~lVnr~y-~~ef~~~k~GDTV~I~~p~~~~--~~d~~~~-~~~ 67 (423) T protein:vir:10 1 MP--NNLDS-----NVSQIVLKKFLPGFMSDLVL--AKTVDRQL-LAGEINSSTGDSVSFKRPHQFS--SLRTPTG-DIS 67 (423) T ss_pred Cc--cchhh-----hhHHHHHHHHHHHHHhhccc--chhhcccC-CCcccccccCCEEEEeeCCcee--eeccCCc-ccc Confidence 98 43 23 47888877666666555444 22332222 11111 2359999888555332 1122111 001 Q ss_pred cccccccchhhhhhhhh-HhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .++++.++..+...++- ....++..+|+-..+.-.|. 
.++.++-..-.+++.+..|++.+.+.- T Consensus 68 ~~~~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~-------------- 132 (423) T protein:vir:10 68 GQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNG-------------- 132 (423) T ss_pred ccccCccccceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhcc-------------- Confidence 12333343333222221 12233444554443333342 333333333344555556665433210 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHHHHhcc-hhhhcc-cccc- Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNND-EIEFIP-DSKG- 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~L~k~~-li~~~~-~~~g- 232 (367) .+ ..+..+.. .-.++.+.++..+|.+..= .=+.++|.|..++.|.+.. .+..-+ ..+. T Consensus 133 ------------~~---~~gt~~t~--~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~a 195 (423) T protein:vir:10 133 ------------AL---SLGSPNTP--ITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTA 195 (423) T ss_pred ------------cc---ccccCCcc--cchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhh Confidence 00 01111111 1236788888888865422 2367899999999988653 221111 1111 Q ss_pred --cccc-hhhcCcEEEEeCCCcccCCCCCceEEEEEE---ecceeeeeccCCCcceeeeeehhh--cCCceeEEEEEccE Q lcl|Aclame:pro 233 --QLTI-PTYMGKVVIVDDGMPVFGTGADKTYLSILF---GGAAFGYADGAPQVPVAVGRRELR--GNGSGLEYILERKE 304 (367) Q Consensus 233 --~~~i-~t~~G~~VivdD~~pv~~t~~~~~yttyl~---~~GAi~~~~~~~~~~~e~~rd~~~--~~~~g~~~l~~r~~ 304 (367) +-.| +.+.|+.|..|..+|..+.+..+...+... .+|+...+... ..+...+.... +--.-.|.+----. 
T Consensus 196 lr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~--~~~~~~~~~~~~~~~l~~GD~~t~aGv 273 (423) T protein:vir:10 196 WENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQ--FTVTLTGATASVTGFLKAGDQVKFTNT 273 (423) T ss_pred hhhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccce--eeeeeeeccccccCceeecceEEecce Confidence 1135 689999999999999754333222111111 11111111100 00111111000 00000011111112 Q ss_pred EEeeeeee------------eecccccccccccccccccccccCCC-------------ChHHhcCCcccee-------- Q lcl|Aclame:pro 305 WIVHPGGF------------NWLDADVTIPDNTGSPSGITSGPPAI-------------TLANLANPDNWER-------- 351 (367) Q Consensus 305 ~~~hp~G~------------s~~~~~~~~~~~~~~~~~~~~~~~sP-------------t~a~L~~~~NW~~-------- 351 (367) +.+||.-. .|. +++..... ..+..+-..+| =.+..++++.|+. T Consensus 274 ~~v~~~tk~~~~~~~t~~~~~~~---v~a~~~~~-~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~ 349 (423) T protein:vir:10 274 YWLQQQTKQALYNGATPISFTAT---VTADANSD-SGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTM 349 (423) T ss_pred eeecccccccccccccCcceEEE---EEeeeeec-cCCceeeeccCccccccCCcccccccccccCCceeeccccccCCe Confidence 23333322 221 11100000 00001111112 1233444444544 Q ss_pred ----eecccccceEEEE-------------ecC Q lcl|Aclame:pro 352 ----VTYRKNVPMAFLV-------------TKG 367 (367) Q Consensus 352 ----v~d~K~i~iv~~~-------------t~g 367 (367) +|.+.+++++..- .+| T Consensus 350 ~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g 382 (423) T protein:vir:10 350 KPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEG 382 (423) T ss_pred eEEEEecCcceEEEEEcccCCCccceeeccccC Confidence 4555555554321 011 No 147 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=95.04 E-value=0.0029 Score=34.50 Aligned_cols=285 Identities=10% Similarity=-0.034 Sum_probs=112.5 Q ss_pred CCCccccc----cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCc Q 
lcl|Aclame:pro 1 MPDFNNQV----RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T----~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) ++-....| .--.+++|+.+..-+.+.+.+.+-+.+.|... +..+-..++ .+++|....- +.+.-+.|+.. T Consensus 332 ~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~----~~~~~~~~~-~~~ip~~t~~-~~a~wv~Eg~~ 405 (645) T protein:vir:93 332 SAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGG----IPALRQVPF-NIRVHAQVSG-GAAGWVGEGKT 405 (645) T ss_pred hhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhcccc----ccccccccC-ceeeeeeecC-cceEEeccCcc Confidence 11000111 11356788887766666666655554433221 000111122 3566754321 23333445433 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) ++..+.+=++.....++.+--..++++-..-+.-|-...+.+++++...+..++.+|. | ..++..... T Consensus 406 ---~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~---g----~g~~~~~~~-- 473 (645) T protein:vir:93 406 ---KPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVD---P----KKAAVADVS-- 473 (645) T ss_pred ---ccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhc---C----CCcccCCcc-- Confidence 3333333333223333334334455544333444555677888887777766655542 1 000000000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCc--eeEEEEccHHHHHHHhcchhhhcccc-ccc Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGS--IAAIAVHSMVYKRMTNNDEIEFIPDS-KGQ 233 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~--l~~~vmhS~v~~~L~k~~li~~~~~~-~g~ 233 (367) -........+.. ........+..+...+-+..-. -.+++||+..+..|++..--+-.+.. +.. 
T Consensus 474 -----------p~gi~~~~~~~~---~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~ 539 (645) T protein:vir:93 474 -----------PASITHDVKGTA---SSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMT 539 (645) T ss_pred -----------ccceeccccccc---cccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCC Confidence 000011111110 0111223444444444322222 24799999999999886422111110 001 Q ss_pred ccchhhcCcEEEEeCCCcccCCCCCceEEEEEE-ecceeeeeccCCCcceeeeeehhhc----CC--------ceeEEEE Q lcl|Aclame:pro 234 LTIPTYMGKVVIVDDGMPVFGTGADKTYLSILF-GGAAFGYADGAPQVPVAVGRRELRG----NG--------SGLEYIL 300 (367) Q Consensus 234 ~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~-~~GAi~~~~~~~~~~~e~~rd~~~~----~~--------~g~~~l~ 300 (367) ..=++++|++|++++.||-.-. -+.+.-+++ -.|.+.+.... ...+++.-.+... .+ ..+..+- T Consensus 540 ~~~~tL~G~PV~~s~~vp~~~~--~gd~s~~~ig~~~~v~i~~s~-~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vair 616 (645) T protein:vir:93 540 LLGGSFQGLPVIVSQYVGDQLV--LVNAPDIYLADDGGVAVDMSR-EASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIR 616 (645) T ss_pred CCCceeeceeeEEeccCCccee--EeccccEEEEEecceEEEeec-ceeEEEeecccccccccccccchhHhhcCceEEE Confidence 1125899999999999983210 011111222 22333332211 1122222111100 00 0000011 Q ss_pred --EccEEE-eeee------eeeeccccccccccccccccccccc Q lcl|Aclame:pro 301 --ERKEWI-VHPG------GFNWLDADVTIPDNTGSPSGITSGP 335 (367) Q Consensus 301 --~r~~~~-~hp~------G~s~~~~~~~~~~~~~~~~~~~~~~ 335 (367) .|--|. .||. 
|+.|-.++ ++ T Consensus 617 a~~r~d~~~~~p~a~~~lt~~~~g~~~---------------~~ 645 (645) T protein:vir:93 617 AERWINWRRRRTAAVAVITGVNYGSAS---------------GG 645 (645) T ss_pred EEEEEcceeeCccceEEEecccCCccc---------------CC Confidence 111121 2444 44553332 22 No 148 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=94.72 E-value=0.0037 Score=33.94 Aligned_cols=296 Identities=11% Similarity=-0.018 Sum_probs=126.5 Q ss_pred CCC-ccccccceec-------cchHHHHHHHhhhhH-HhhhHhhcccccccHHHHH-HhhCCCceEEeeeeccCCCcccc Q lcl|Aclame:pro 1 MPD-FNNQVRLVDA-------VIPEVYTSYTAIDRP-ELTAFFLSGAVASNDFLSQ-FLSAPGRLINIPFWRDLDSLEPN 70 (367) Q Consensus 1 Ma~-~~~~T~l~d~-------i~PEVf~~yv~~~~~-~~~~f~~SGi~~~~~~l~~-~~~~~G~~i~~P~~~~l~g~~~~ 70 (367) |-+ .-+.|-...+ -+||-+.-...++-. ...+++..++.+..-..++ .-..+|++|.||.+.. .|- -+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~-~gl-~D 78 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT-TEL-KD 78 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecc-ccc-cc Confidence 332 1111111100 123322222222111 1112222222221111111 1224899999999985 343 35 Q ss_pred cCCCCccccccccccchhhhhhhhh-HhhcccchhHHHHHhhccc--HHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhh Q lcl|Aclame:pro 71 YGSDNPNVEAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSN--PMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNL 147 (367) Q Consensus 71 ~~~~~~~~~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~D--Pm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~ 147 (367) |..+. ..+.+.++.....-++- .|..+|.+.|.-..-+..+ .....+.+...--....+...++.|.+ ... 
T Consensus 79 Y~R~~---g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~---~a~ 152 (319) T protein:vir:97 79 YKRNA---TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR---NKA 152 (319) T ss_pred ccCCC---CcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHh---hcc Confidence 64332 24455565555544332 2455555555444433221 111112222211222233333333321 100 Q ss_pred hhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-CceeEEEEccHHHHHHHhcchhhh Q lcl|Aclame:pro 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-GSIAAIAVHSMVYKRMTNNDEIEF 226 (367) Q Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~~l~~~vmhS~v~~~L~k~~li~~ 226 (367) . . .. .+.+ +.-.++.|.++..+|-+.. ..=.+++|.|.+|..|++.. .| T Consensus 153 ~----------------------~-~~-~~~t----~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~--~f 202 (319) T protein:vir:97 153 K----------------------H-LT-VGTG----SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFV--IA 202 (319) T ss_pred c----------------------c-cc-cccC----HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhh--hh Confidence 0 0 00 0011 1123788889988886543 22368899999999998763 34 Q ss_pred cccccc------cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEe-cceeeeeccCCCcceeeeeehhhcCCceeEEE Q lcl|Aclame:pro 227 IPDSKG------QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFG-GAAFGYADGAPQVPVAVGRRELRGNGSGLEYI 299 (367) Q Consensus 227 ~~~~~g------~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~-~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l 299 (367) .+..+. +-.|+.+.|+.|+...+. ..+-.-|+++ ++|+.....-.. 
++..|.+....+...+.+ T Consensus 203 ~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~-------~~k~in~i~~h~~A~~~~~k~~~--~~~~~p~~~~~a~~v~gr 273 (319) T protein:vir:97 203 LPQGDTRQQVLGKGVQGELDGFVIVKVPTK-------LLQGLQAIAVVGEVLASPIQADL--AKTNSNIPGMFGTLAEQL 273 (319) T ss_pred hccccccccceeeeeceeecCeEEEEeccc-------ccccceEEEEcCCeeeeeeeeee--eeccCCCccccceeeeee Confidence 443321 235778889988864221 1122235554 667765443222 333343222222222333 Q ss_pred EEccEEEeeee--eeeecccccccccccccccccccccCCCChHHhcCCcccee Q lcl|Aclame:pro 300 LERKEWIVHPG--GFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER 351 (367) Q Consensus 300 ~~r~~~~~hp~--G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~ 351 (367) ..=..|++-|. |+ |.... ........+++.|.+.+-.-....+. T Consensus 274 ~y~d~~V~~~k~~~I-y~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 274 LYTGAFVPEHLQKYI-FTIGG-------TEVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred eeeeeEEeccccceE-EEeec-------CCcccCCCccccccccccCCcccccC Confidence 33333444444 44 43222 11223445666676665444444444 No 149 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=94.72 E-value=0.0037 Score=33.94 Aligned_cols=296 Identities=11% Similarity=-0.018 Sum_probs=126.5 Q ss_pred CCC-ccccccceec-------cchHHHHHHHhhhhH-HhhhHhhcccccccHHHHH-HhhCCCceEEeeeeccCCCcccc Q lcl|Aclame:pro 1 MPD-FNNQVRLVDA-------VIPEVYTSYTAIDRP-ELTAFFLSGAVASNDFLSQ-FLSAPGRLINIPFWRDLDSLEPN 70 (367) Q Consensus 1 Ma~-~~~~T~l~d~-------i~PEVf~~yv~~~~~-~~~~f~~SGi~~~~~~l~~-~~~~~G~~i~~P~~~~l~g~~~~ 70 (367) |-+ .-+.|-...+ -+||-+.-...++-. ...+++..++.+..-..++ .-..+|++|.||.+.. 
.|- -+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~-~gl-~D 78 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT-TEL-KD 78 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecc-ccc-cc Confidence 332 1111111100 123322222222111 1112222222221111111 1224899999999985 343 35 Q ss_pred cCCCCccccccccccchhhhhhhhh-HhhcccchhHHHHHhhccc--HHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhh Q lcl|Aclame:pro 71 YGSDNPNVEAPIDGLGSGEMKTTKT-WLNKAYGAMDLTAELAGSN--PMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNL 147 (367) Q Consensus 71 ~~~~~~~~~~t~~kitt~~~~a~i~-~r~kg~~~tDla~~~~g~D--Pm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~ 147 (367) |..+. ..+.+.++.....-++- .|..+|.+.|.-..-+..+ .....+.+...--....+...++.|.+ ... T Consensus 79 Y~R~~---g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~---~a~ 152 (319) T protein:vir:94 79 YKRNA---TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR---NKA 152 (319) T ss_pred ccCCC---CcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHh---hcc Confidence 64332 24455565555544332 2455555555444433221 111112222211222233333333321 100 Q ss_pred hhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-CceeEEEEccHHHHHHHhcchhhh Q lcl|Aclame:pro 148 AGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-GSIAAIAVHSMVYKRMTNNDEIEF 226 (367) Q Consensus 148 a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~~l~~~vmhS~v~~~L~k~~li~~ 226 (367) . . .. .+.+ +.-.++.|.++..+|-+.. ..=.+++|.|.+|..|++.. 
.| T Consensus 153 ~----------------------~-~~-~~~t----~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~--~f 202 (319) T protein:vir:94 153 K----------------------H-LT-VGTG----SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFV--IA 202 (319) T ss_pred c----------------------c-cc-cccC----HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhh--hh Confidence 0 0 00 0011 1123788889988886543 22368899999999998763 34 Q ss_pred cccccc------cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEe-cceeeeeccCCCcceeeeeehhhcCCceeEEE Q lcl|Aclame:pro 227 IPDSKG------QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFG-GAAFGYADGAPQVPVAVGRRELRGNGSGLEYI 299 (367) Q Consensus 227 ~~~~~g------~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~-~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l 299 (367) .+..+. +-.|+.+.|+.|+...+. ..+-.-|+++ ++|+.....-.. ++..|.+....+...+.+ T Consensus 203 ~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~-------~~k~in~i~~h~~A~~~~~k~~~--~~~~~p~~~~~a~~v~gr 273 (319) T protein:vir:94 203 LPQGDTRQQVLGKGVQGELDGFVIVKVPTK-------LLQGLQAIAVVGEVLASPIQADL--AKTNSNIPGMFGTLAEQL 273 (319) T ss_pred hccccccccceeeeeceeecCeEEEEeccc-------ccccceEEEEcCCeeeeeeeeee--eeccCCCccccceeeeee Confidence 443321 235778889988864221 1122235554 667765443222 333343222222222333 Q ss_pred EEccEEEeeee--eeeecccccccccccccccccccccCCCChHHhcCCcccee Q lcl|Aclame:pro 300 LERKEWIVHPG--GFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER 351 (367) Q Consensus 300 ~~r~~~~~hp~--G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~ 351 (367) ..=..|++-|. |+ |.... ........+++.|.+.+-.-....+. 
T Consensus 274 ~y~d~~V~~~k~~~I-y~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 274 LYTGAFVPEHLQKYI-FTIGG-------TEVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred eeeeeEEeccccceE-EEeec-------CCcccCCCccccccccccCCcccccC Confidence 33333444444 44 43222 11223445666676665444444444 No 150 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=287 Identities=11% Similarity=-0.002 Sum_probs=121.8 Q ss_pred CCC---------------ccccc-cceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccC Q lcl|Aclame:pro 1 MPD---------------FNNQV-RLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDL 64 (367) Q Consensus 1 Ma~---------------~~~~T-~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l 64 (367) |-+ |.++. -.-.+.--|.|.+.+.+...+.+ + -+..+++ .++. ..+|++|.||.+... T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~-~-s~~~~~N-~~~e---~~~g~tVkIp~i~~~ 85 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANS-Y-SAPAVIS-NDAI---FMQGRSFTVIKGDVT 85 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhc-e-eeeeecc-ccee---eccCcEEEEeeeccc Confidence 221 11111 00111122344444444332221 1 1112222 1111 348999999999753 Q ss_pred CCcccccCCCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhh-----cccHHHHHHHHHHHHHhhhhhHHHHHH Q lcl|Aclame:pro 65 DSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELA-----GSNPMTRIRNRFGVYWTRQWQRRIIAM 138 (367) Q Consensus 65 ~g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~-----g~DPm~~i~~qia~yw~~~~q~~lla~ 138 (367) |- -+|.-+. ..+.+.++...+.-++-+ |.-+|.+.|.-..-+ ..++|++ +...--....++..++. 
T Consensus 86 -gl-~DY~R~~---g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~---~~~~~v~pEiDay~~sk 157 (329) T protein:vir:10 86 -EL-KDYKRNA---TNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAK---QASEVVAPYLDNLRFAT 157 (329) T ss_pred -cc-ccccCCC---CccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHH---HHHHHhhhHHHHHHHHH Confidence 43 3564332 244555555554443322 444445444333322 2333332 11111122223333333 Q ss_pred HHHHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-CceeEEEEccHHHHH Q lcl|Aclame:pro 139 AVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-GSIAAIAVHSMVYKR 217 (367) Q Consensus 139 l~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~~l~~~vmhS~v~~~ 217 (367) |.+ .... . .-.+. .+.-.++.|.++..+|.+.. ..=..++|.|.+|.. T Consensus 158 la~---~a~~----------------------~--~~~~~----t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~ 206 (329) T protein:vir:10 158 LAR---NKAK----------------------H--LTVGS----GADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKG 206 (329) T ss_pred HHh---hccc----------------------c--ccccc----CHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHH Confidence 321 0000 0 00011 11124788999999987753 234689999999999 Q ss_pred HHhcchhhhcccccc------cccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhc Q lcl|Aclame:pro 218 MTNNDEIEFIPDSKG------QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRG 291 (367) Q Consensus 218 L~k~~li~~~~~~~g------~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~ 291 (367) |++... |.+..+. +-.|+.+.|++|+...+.... .+...+.-++|+.+...-. .+|..|.+... 
T Consensus 207 Lk~~~~--f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k------~in~ii~~~~A~~~~~K~~--~~~~~~p~~~~ 276 (329) T protein:vir:10 207 IKKFVI--ELPQGDNRQQVLGKGVQGELDGFTIVKVPSKMLQ------GVEAMAVIGEVMASPIQAN--EAKLNSNVPGM 276 (329) T ss_pred HHhhhh--hhccccccccceeeeeeeeecCeEEEEecCCccc------ceeEEEEcCCceeeeeeee--eeeeeCCCCcc Confidence 988643 3333221 235788899999865432221 1222444567776654332 24444433222 Q ss_pred CCceeEEEEEccEEEeeee--eeeecccccccccccccccccccccCCCChHHhcCCccceeee Q lcl|Aclame:pro 292 NGSGLEYILERKEWIVHPG--GFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVT 353 (367) Q Consensus 292 ~~~g~~~l~~r~~~~~hp~--G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~ 353 (367) .+...+.+..=..|++.|. |+ |.......+..- ++++.=| +++++-|+.=. T Consensus 277 ~a~~v~gr~yyd~~V~~~k~~~I-~~~~~~a~~~~~-------~~~~~~~---~~~~~~~~~~~ 329 (329) T protein:vir:10 277 FGTLAEQMLYTGAFVPEHLQKYI-FTIGGKEVETNR-------DGVDAHA---DETNASADTGA 329 (329) T ss_pred chheeeeeeeeeeEEEccccCEE-EEecccCcccCC-------CCCCccc---cccccccccCC Confidence 2223333333344556666 44 333222111111 1111111 22333333222 No 151 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=93.60 E-value=0.0039 Score=33.78 Aligned_cols=316 Identities=11% Similarity=0.048 Sum_probs=147.4 Q ss_pred CCCccccccce------------eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRLV------------DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE 68 (367) Q Consensus 1 Ma~~~~~T~l~------------d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~ 68 (367) |..+|.-|+-. .+|.-||+..|... +-| .+.-.+++ -.+|+++.+|+.+... . 
T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~-----si~------~~~~~vRt--I~~gkS~qf~~lG~s~--a 65 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKG-----ENI------MSYFDVQT--VTGTNTVSNKYLGETE--L 65 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHH-----hhh------cccceeee--ecccceEEEEEeeeeE--E Confidence 88888777642 34455555554432 222 22222221 1589999999986432 1 Q ss_pred cccCCCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhccc-HHHHHHHHHHHHHhhhhhHHHHH-HHHHHHhh Q lcl|Aclame:pro 69 PNYGSDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSN-PMTRIRNRFGVYWTRQWQRRIIA-MAVGVYKS 145 (367) Q Consensus 69 ~~~~~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~D-Pm~~i~~qia~yw~~~~q~~lla-~l~Gvf~~ 145 (367) ....-+.. +-...+.+.+.+-+|=. .---..+-||-....--| +=.+++++++..-++..+..+|. ++.+-++. T Consensus 66 ~y~~pG~~---ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~ 142 (400) T protein:vir:10 66 QVLAPGQS---PAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIAN 142 (400) T ss_pred eeecCCCC---cCCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 11111111 11111222211110000 001124556656666666 66788888888888877766665 33333222 Q ss_pred hhhhhhhhhhhhhhhhhhhhcchhhcc--eeecCcccchhhcccHHHHH----HHHHHh--ccccCceeEEEEccHHHHH Q lcl|Aclame:pro 146 NLAGNFATIKTRGRVPAEVLGTAGDMV--IDISGQTNPADAVFNREAFV----DAAFTM--GDHVGSIAAIAVHSMVYKR 217 (367) Q Consensus 146 ~~a~~~~~~~~~~~~~a~~~~~~~~~v--~disa~t~~a~~~~s~~~l~----~A~~~~--GD~~~~l~~~vmhS~v~~~ 217 (367) ...-.. ......+. ..+++.+ .....+...|. 
+|.+.| -|-...-.++.|.+..|.- T Consensus 143 t~~~~~-------------~~~g~~~g~s~~v~~~~--~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~ 207 (400) T protein:vir:10 143 TQAKRT-------------NPRVKGHGFSVNVEVNE--GEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNV 207 (400) T ss_pred cccccc-------------cCCccccccceeecccc--cccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHH Confidence 111100 11122222 2333332 22334555555 333333 2222223466666666666 Q ss_pred HHhc-chhhhcc--cccc---cccchhhcCcEEEEeCCCcccCCC-------------------CCceEEEEEEecceee Q lcl|Aclame:pro 218 MTNN-DEIEFIP--DSKG---QLTIPTYMGKVVIVDDGMPVFGTG-------------------ADKTYLSILFGGAAFG 272 (367) Q Consensus 218 L~k~-~li~~~~--~~~g---~~~i~t~~G~~VivdD~~pv~~t~-------------------~~~~yttyl~~~GAi~ 272 (367) |+.. +|++-.. ...+ .-.+..++|++|+.+..+|..... ...+-...+|-+-|++ T Consensus 208 Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~ 287 (400) T protein:vir:10 208 LRDADRIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALL 287 (400) T ss_pred HHhCCcccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheE Confidence 6554 4775432 2222 224678999999999999863210 1112235788888888 Q ss_pred eeccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceee Q lcl|Aclame:pro 273 YADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERV 352 (367) Q Consensus 273 ~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v 352 (367) .....+ ...|+.||.... .+.+...+.|...| ..|.-+.+..-..+..+. 
..++ ++..-..| T Consensus 288 tvk~~~-lt~~~~~d~r~~----~~~id~~~a~G~g~--~RPeaa~vv~~~~~~~~~--~~~~---------~~~~~~~~ 349 (400) T protein:vir:10 288 VGRSID-VIGDIFYEKKEK----TYYIDTFMSEGAIP--DRWEAVSVVTTKRQSTGA--VDSG---------NAAQHTQV 349 (400) T ss_pred EEEeec-cccccccchhhH----HHHHHHHHHhCCcc--cchhheEEEEecCCcccc--cccC---------cchhHHHH Confidence 766554 335556665543 24455555555444 456554443222221111 1111 11112223 Q ss_pred ecccccceEEEEecC Q lcl|Aclame:pro 353 TYRKNVPMAFLVTKG 367 (367) Q Consensus 353 ~d~K~i~iv~~~t~g 367 (367) ..+-+-..+.+|+-| T Consensus 350 ~~~~~~~~~~~~~~~ 364 (400) T protein:vir:10 350 LNRAQRKAVYVKNAA 364 (400) T ss_pred HhhcccceEEEeccc Confidence 333334445566666 No 152 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=93.56 E-value=0.0071 Score=32.35 Aligned_cols=313 Identities=10% Similarity=0.010 Sum_probs=117.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHh-hCCCceEEeeeeccCC---CcccccCCCCc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFL-SAPGRLINIPFWRDLD---SLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~-~~~G~~i~~P~~~~l~---g~~~~~~~~~~ 76 (367) || |..|. ++||++.+-..+.+.+.+-| +..+-.+- ..+.. ++-|++|+||.=.... 
+...++...++ T Consensus 1 MA--Nsl~~----l~p~iia~~al~~l~~~lV~--~~lV~r~y-~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~ 71 (423) T protein:vir:10 1 MA--NNLDA----NVSQIVLKKFLPGFMSDLVL--CKTVDRQL-LAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSK 71 (423) T ss_pred Cc--ccccc----ccHHHHHHHHHHHHHhhccc--chhhccCC-CccccccccCCEEEEeeCCceeeecccCcccCcccc Confidence 98 33322 78999988777766665544 22332222 11111 2359999988544321 00011111100 Q ss_pred ---cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 77 ---NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFAT 153 (367) Q Consensus 77 ---~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~ 153 (367) .+...+-+|.+.+.. ++.++|.-..+.-.|. +++.++-..-.+++.+..|...+... T Consensus 72 ~~l~e~~v~l~id~~k~~--------a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~----------- 131 (423) T protein:vir:10 72 NSLISAKATGEVGNYITV--------AVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKH----------- 131 (423) T ss_pred cccccceEEEEecceeee--------eeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhc----------- Confidence 011123344444333 3444444433333333 23322222223333344443222110 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccC--ceeEEEEccHHHHHHHhc-chhhh-ccc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNN-DEIEF-IPD 229 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~--~l~~~vmhS~v~~~L~k~-~li~~-~~~ 229 (367) ..+. .+.++.. .-.++.+.+|..+|.+..= .=+.++|.|..+..|.+. ..... ... 
T Consensus 132 ---------------~~~~---vgt~~t~--~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~ 191 (423) T protein:vir:10 132 ---------------GALS---LGSPNTP--IKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQL 191 (423) T ss_pred ---------------cccc---ccccccc--cccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc Confidence 0011 1111111 1136788898888865422 236789999999998764 32222 111 Q ss_pred ccc---cccc-hhhcCcEEEEeCCCcccCCCCCc-eEE--EEEEecceeeeeccCCCcceeeeeehhhcCC-ceeEEEEE Q lcl|Aclame:pro 230 SKG---QLTI-PTYMGKVVIVDDGMPVFGTGADK-TYL--SILFGGAAFGYADGAPQVPVAVGRRELRGNG-SGLEYILE 301 (367) Q Consensus 230 ~~g---~~~i-~t~~G~~VivdD~~pv~~t~~~~-~yt--tyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~-~g~~~l~~ 301 (367) ... +-.| +.+.|+.+..|..+|..+.+..+ ..+ .+....|+=.-....+.......-....+-- .| |.+-- T Consensus 192 ~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~G-D~~t~ 270 (423) T protein:vir:10 192 VRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVG-DQLQF 270 (423) T ss_pred chHHHHhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEec-ceEee Confidence 111 1234 78999999999999965433222 111 1111122100000000000000000000000 00 11111 Q ss_pred ccEEEeeeeee------------eecccccccccccccccccccccCCCCh-------------HHhcCCcc-------- Q lcl|Aclame:pro 302 RKEWIVHPGGF------------NWLDADVTIPDNTGSPSGITSGPPAITL-------------ANLANPDN-------- 348 (367) Q Consensus 302 r~~~~~hp~G~------------s~~~~~~~~~~~~~~~~~~~~~~~sPt~-------------a~L~~~~N-------- 348 (367) --.+.+||.=. .|. ++...++..+... +-..+|.. +.+++++. 
T Consensus 271 aGv~~v~~~tk~~l~~~~~~~~~~~~---V~~~~~~~a~~~~-tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~ 346 (423) T protein:vir:10 271 DDTHWLNQQSKQTLYNGASALSFTAT---VMEDANAHSSGDV-TVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSK 346 (423) T ss_pred cceeeecccccceeecccCCcceEEE---EEecccccccCce-EEEeccccccccCcccccceeccccCCceeEEeeccC Confidence 12223332211 111 1100000011000 01111211 11111111 Q ss_pred ----ceeeecccccceEEEE-------------ecC Q lcl|Aclame:pro 349 ----WERVTYRKNVPMAFLV-------------TKG 367 (367) Q Consensus 349 ----W~~v~d~K~i~iv~~~-------------t~g 367 (367) =+++|.+.+++++..- .+| T Consensus 347 ~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g 382 (423) T protein:vir:10 347 QAMKPNLFYNKLFCGLGTIPLPKLHSIDSAVATYEG 382 (423) T ss_pred CceeEEEEecCcceEEEEEcccCCCccceeeccccc Confidence 1234555555554321 011 No 153 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=93.39 E-value=0.0015 Score=36.02 Aligned_cols=324 Identities=15% Similarity=0.117 Sum_probs=140.0 Q ss_pred CCCc---ccccc-------ceeccchH--HHHHHHhhhhH---HhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC Q lcl|Aclame:pro 1 MPDF---NNQVR-------LVDAVIPE--VYTSYTAIDRP---ELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD 65 (367) Q Consensus 1 Ma~~---~~~T~-------l~d~i~PE--Vf~~yv~~~~~---~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~ 65 (367) |--+ +...+ ....=+|. .+...+..... .+-.+.+.+-=.|-..+..|-.+.|+.|+++.-..|. 
T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 1100 00000 00001111 01111101000 1111223332223333445556899999999999997 Q ss_pred CcccccCCCCccccccccccchhhhhhhhhHhhcccchh-HHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 66 SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAM-DLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYK 144 (367) Q Consensus 66 g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~t-Dla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~ 144 (367) |+. +.++... +---+.|+-..+..+|-....++... ..+..-+--|...+....++.||++..+..++-.|.|.-+ T Consensus 81 g~g--v~Gd~~l-EGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:10 81 KRP--TMGDERV-EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred cCC--cccCcee-eccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 643 2222221 22334566666666666555555322 3344445567777888999999999999999988888655 Q ss_pred hhhhhhhhhhhhhhh------hhhhhhcchhhcceeecCccc----chhhcccHHHHHHHHHHh-------------ccc Q lcl|Aclame:pro 145 SNLAGNFATIKTRGR------VPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTM-------------GDH 201 (367) Q Consensus 145 ~~~a~~~~~~~~~~~------~~a~~~~~~~~~v~disa~t~----~a~~~~s~~~l~~A~~~~-------------GD~ 201 (367) .-..... .+.+... ..........++.+-...++. +..-.|+.+.+-++...+ ||. 
T Consensus 158 ~~~n~~~-~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~ 236 (404) T protein:vir:10 158 DFVADDT-ILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) T ss_pred ccccccc-eeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEecccc Confidence 3211111 0000000 000000111111111111111 122357777776665443 332 Q ss_pred ---cCceeEEEEccHHHHHHHhc----chhhhcccc----cc--cc----cchhhcCcEEEEeCCCccc----------- Q lcl|Aclame:pro 202 ---VGSIAAIAVHSMVYKRMTNN----DEIEFIPDS----KG--QL----TIPTYMGKVVIVDDGMPVF----------- 253 (367) Q Consensus 202 ---~~~l~~~vmhS~v~~~L~k~----~li~~~~~~----~g--~~----~i~t~~G~~VivdD~~pv~----------- 253 (367) .+..-+++|||-+++.|+.+ +..+..+.. .| ++ .++.|+|+.|.---.+|+- T Consensus 237 ~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~ 316 (404) T protein:vir:10 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) T ss_pred ccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecC Confidence 11258999999999999998 244544421 11 22 3568888777643333310 Q ss_pred -------CCCC--CceEEEEEEecceeeeeccCCC----cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccc Q lcl|Aclame:pro 254 -------GTGA--DKTYLSILFGGAAFGYADGAPQ----VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVT 320 (367) Q Consensus 254 -------~t~~--~~~yttyl~~~GAi~~~~~~~~----~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~ 320 (367) ...+ ..+=..+|+|.=|.+++-+.+. .-.|-..|-.. -..+.+. .++...=..|.+.... 
T Consensus 317 n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~----~~~i~~~---~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:10 317 NNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN----RTEIAIS---WINGLKKIRFPEKSGK 389 (404) T ss_pred CccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCc----hhhhhhH---HHhhhhhccccCCCCc Confidence 0001 1112458888877655533321 11222223221 1122221 1222222233211100 Q ss_pred cccccccccccccccCCCChHHh Q lcl|Aclame:pro 321 IPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 321 ~~~~~~~~~~~~~~~~sPt~a~L 343 (367) ..+-. -..=||-+-| T Consensus 390 -~~DfG-------vi~idta~~~ 404 (404) T protein:vir:10 390 -MQDHG-------VIAVDTAVKL 404 (404) T ss_pred -eeeEE-------EEEecccccC Confidence 00000 0001222222 No 154 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=93.39 E-value=0.0015 Score=36.02 Aligned_cols=324 Identities=15% Similarity=0.117 Sum_probs=140.0 Q ss_pred CCCc---ccccc-------ceeccchH--HHHHHHhhhhH---HhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC Q lcl|Aclame:pro 1 MPDF---NNQVR-------LVDAVIPE--VYTSYTAIDRP---ELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD 65 (367) Q Consensus 1 Ma~~---~~~T~-------l~d~i~PE--Vf~~yv~~~~~---~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~ 65 (367) |--+ +...+ ....=+|. .+...+..... .+-.+.+.+-=.|-..+..|-.+.|+.|+++.-..|. 
T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 1100 00000 00001111 01111101000 1111223332223333445556899999999999997 Q ss_pred CcccccCCCCccccccccccchhhhhhhhhHhhcccchh-HHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 66 SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAM-DLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYK 144 (367) Q Consensus 66 g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~t-Dla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~ 144 (367) |+. +.++... +---+.|+-..+..+|-....++... ..+..-+--|...+....++.||++..+..++-.|.|.-+ T Consensus 81 g~g--v~Gd~~l-EGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:81 81 KRP--TMGDERV-EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred cCC--cccCcee-eccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 643 2222221 22334566666666666555555322 3344445567777888999999999999999988888655 Q ss_pred hhhhhhhhhhhhhhh------hhhhhhcchhhcceeecCccc----chhhcccHHHHHHHHHHh-------------ccc Q lcl|Aclame:pro 145 SNLAGNFATIKTRGR------VPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTM-------------GDH 201 (367) Q Consensus 145 ~~~a~~~~~~~~~~~------~~a~~~~~~~~~v~disa~t~----~a~~~~s~~~l~~A~~~~-------------GD~ 201 (367) .-..... .+.+... ..........++.+-...++. +..-.|+.+.+-++...+ ||. 
T Consensus 158 ~~~n~~~-~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~ 236 (404) T protein:vir:81 158 DFVADDT-ILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) T ss_pred ccccccc-eeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEecccc Confidence 3211111 0000000 000000111111111111111 122357777776665443 332 Q ss_pred ---cCceeEEEEccHHHHHHHhc----chhhhcccc----cc--cc----cchhhcCcEEEEeCCCccc----------- Q lcl|Aclame:pro 202 ---VGSIAAIAVHSMVYKRMTNN----DEIEFIPDS----KG--QL----TIPTYMGKVVIVDDGMPVF----------- 253 (367) Q Consensus 202 ---~~~l~~~vmhS~v~~~L~k~----~li~~~~~~----~g--~~----~i~t~~G~~VivdD~~pv~----------- 253 (367) .+..-+++|||-+++.|+.+ +..+..+.. .| ++ .++.|+|+.|.---.+|+- T Consensus 237 ~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~ 316 (404) T protein:vir:81 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) T ss_pred ccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecC Confidence 11258999999999999998 244544421 11 22 3568888777643333310 Q ss_pred -------CCCC--CceEEEEEEecceeeeeccCCC----cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccc Q lcl|Aclame:pro 254 -------GTGA--DKTYLSILFGGAAFGYADGAPQ----VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVT 320 (367) Q Consensus 254 -------~t~~--~~~yttyl~~~GAi~~~~~~~~----~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~ 320 (367) ...+ ..+=..+|+|.=|.+++-+.+. .-.|-..|-.. -..+.+. .++...=..|.+.... 
T Consensus 317 n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~----~~~i~~~---~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:81 317 NNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN----RTEIAIS---WINGLKKIRFPEKSGK 389 (404) T ss_pred CccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCc----hhhhhhH---HHhhhhhccccCCCCc Confidence 0001 1112458888877655533321 11222223221 1122221 1222222233211100 Q ss_pred cccccccccccccccCCCChHHh Q lcl|Aclame:pro 321 IPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 321 ~~~~~~~~~~~~~~~~sPt~a~L 343 (367) ..+-. -..=||-+-| T Consensus 390 -~~DfG-------vi~idta~~~ 404 (404) T protein:vir:81 390 -MQDHG-------VIAVDTAVKL 404 (404) T ss_pred -eeeEE-------EEEecccccC Confidence 00000 0001222222 No 155 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=93.39 E-value=0.0015 Score=36.02 Aligned_cols=324 Identities=15% Similarity=0.117 Sum_probs=140.0 Q ss_pred CCCc---ccccc-------ceeccchH--HHHHHHhhhhH---HhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC Q lcl|Aclame:pro 1 MPDF---NNQVR-------LVDAVIPE--VYTSYTAIDRP---ELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD 65 (367) Q Consensus 1 Ma~~---~~~T~-------l~d~i~PE--Vf~~yv~~~~~---~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~ 65 (367) |--+ +...+ ....=+|. .+...+..... .+-.+.+.+-=.|-..+..|-.+.|+.|+++.-..|. 
T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 1100 00000 00001111 01111101000 1111223332223333445556899999999999997 Q ss_pred CcccccCCCCccccccccccchhhhhhhhhHhhcccchh-HHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 66 SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAM-DLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYK 144 (367) Q Consensus 66 g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~t-Dla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~ 144 (367) |+. +.++... +---+.|+-..+..+|-....++... ..+..-+--|...+....++.||++..+..++-.|.|.-+ T Consensus 81 g~g--v~Gd~~l-EGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:10 81 KRP--TMGDERV-EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred cCC--cccCcee-eccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 643 2222221 22334566666666666555555322 3344445567777888999999999999999988888655 Q ss_pred hhhhhhhhhhhhhhh------hhhhhhcchhhcceeecCccc----chhhcccHHHHHHHHHHh-------------ccc Q lcl|Aclame:pro 145 SNLAGNFATIKTRGR------VPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTM-------------GDH 201 (367) Q Consensus 145 ~~~a~~~~~~~~~~~------~~a~~~~~~~~~v~disa~t~----~a~~~~s~~~l~~A~~~~-------------GD~ 201 (367) .-..... .+.+... ..........++.+-...++. +..-.|+.+.+-++...+ ||. 
T Consensus 158 ~~~n~~~-~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~ 236 (404) T protein:vir:10 158 DFVADDT-ILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) T ss_pred ccccccc-eeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEecccc Confidence 3211111 0000000 000000111111111111111 122357777776665443 332 Q ss_pred ---cCceeEEEEccHHHHHHHhc----chhhhcccc----cc--cc----cchhhcCcEEEEeCCCccc----------- Q lcl|Aclame:pro 202 ---VGSIAAIAVHSMVYKRMTNN----DEIEFIPDS----KG--QL----TIPTYMGKVVIVDDGMPVF----------- 253 (367) Q Consensus 202 ---~~~l~~~vmhS~v~~~L~k~----~li~~~~~~----~g--~~----~i~t~~G~~VivdD~~pv~----------- 253 (367) .+..-+++|||-+++.|+.+ +..+..+.. .| ++ .++.|+|+.|.---.+|+- T Consensus 237 ~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~ 316 (404) T protein:vir:10 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) T ss_pred ccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecC Confidence 11258999999999999998 244544421 11 22 3568888777643333310 Q ss_pred -------CCCC--CceEEEEEEecceeeeeccCCC----cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccc Q lcl|Aclame:pro 254 -------GTGA--DKTYLSILFGGAAFGYADGAPQ----VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVT 320 (367) Q Consensus 254 -------~t~~--~~~yttyl~~~GAi~~~~~~~~----~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~ 320 (367) ...+ ..+=..+|+|.=|.+++-+.+. .-.|-..|-.. -..+.+. .++...=..|.+.... 
T Consensus 317 n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~----~~~i~~~---~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:10 317 NNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN----RTEIAIS---WINGLKKIRFPEKSGK 389 (404) T ss_pred CccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCc----hhhhhhH---HHhhhhhccccCCCCc Confidence 0001 1112458888877655533321 11222223221 1122221 1222222233211100 Q ss_pred cccccccccccccccCCCChHHh Q lcl|Aclame:pro 321 IPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 321 ~~~~~~~~~~~~~~~~sPt~a~L 343 (367) ..+-. -..=||-+-| T Consensus 390 -~~DfG-------vi~idta~~~ 404 (404) T protein:vir:10 390 -MQDHG-------VIAVDTAVKL 404 (404) T ss_pred -eeeEE-------EEEecccccC Confidence 00000 0001222222 No 156 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=93.39 E-value=0.0015 Score=36.02 Aligned_cols=324 Identities=15% Similarity=0.117 Sum_probs=140.0 Q ss_pred CCCc---ccccc-------ceeccchH--HHHHHHhhhhH---HhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC Q lcl|Aclame:pro 1 MPDF---NNQVR-------LVDAVIPE--VYTSYTAIDRP---ELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD 65 (367) Q Consensus 1 Ma~~---~~~T~-------l~d~i~PE--Vf~~yv~~~~~---~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~ 65 (367) |--+ +...+ ....=+|. .+...+..... .+-.+.+.+-=.|-..+..|-.+.|+.|+++.-..|. 
T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 1100 00000 00001111 01111101000 1111223332223333445556899999999999997 Q ss_pred CcccccCCCCccccccccccchhhhhhhhhHhhcccchh-HHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 66 SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAM-DLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYK 144 (367) Q Consensus 66 g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~t-Dla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~ 144 (367) |+. +.++... +---+.|+-..+..+|-....++... ..+..-+--|...+....++.||++..+..++-.|.|.-+ T Consensus 81 g~g--v~Gd~~l-EGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg 157 (404) T protein:vir:32 81 KRP--TMGDERV-EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) T ss_pred cCC--cccCcee-eccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 643 2222221 22334566666666666555555322 3344445567777888999999999999999988888655 Q ss_pred hhhhhhhhhhhhhhh------hhhhhhcchhhcceeecCccc----chhhcccHHHHHHHHHHh-------------ccc Q lcl|Aclame:pro 145 SNLAGNFATIKTRGR------VPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTM-------------GDH 201 (367) Q Consensus 145 ~~~a~~~~~~~~~~~------~~a~~~~~~~~~v~disa~t~----~a~~~~s~~~l~~A~~~~-------------GD~ 201 (367) .-..... .+.+... ..........++.+-...++. +..-.|+.+.+-++...+ ||. 
T Consensus 158 ~~~n~~~-~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~ 236 (404) T protein:vir:32 158 DFVADDT-ILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) T ss_pred ccccccc-eeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEecccc Confidence 3211111 0000000 000000111111111111111 122357777776665443 332 Q ss_pred ---cCceeEEEEccHHHHHHHhc----chhhhcccc----cc--cc----cchhhcCcEEEEeCCCccc----------- Q lcl|Aclame:pro 202 ---VGSIAAIAVHSMVYKRMTNN----DEIEFIPDS----KG--QL----TIPTYMGKVVIVDDGMPVF----------- 253 (367) Q Consensus 202 ---~~~l~~~vmhS~v~~~L~k~----~li~~~~~~----~g--~~----~i~t~~G~~VivdD~~pv~----------- 253 (367) .+..-+++|||-+++.|+.+ +..+..+.. .| ++ .++.|+|+.|.---.+|+- T Consensus 237 ~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~ 316 (404) T protein:vir:32 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) T ss_pred ccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecC Confidence 11258999999999999998 244544421 11 22 3568888777643333310 Q ss_pred -------CCCC--CceEEEEEEecceeeeeccCCC----cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccc Q lcl|Aclame:pro 254 -------GTGA--DKTYLSILFGGAAFGYADGAPQ----VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVT 320 (367) Q Consensus 254 -------~t~~--~~~yttyl~~~GAi~~~~~~~~----~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~ 320 (367) ...+ ..+=..+|+|.=|.+++-+.+. .-.|-..|-.. -..+.+. .++...=..|.+.... 
T Consensus 317 n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~----~~~i~~~---~i~G~kK~rF~~~~g~ 389 (404) T protein:vir:32 317 NNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN----RTEIAIS---WINGLKKIRFPEKSGK 389 (404) T ss_pred CccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCc----hhhhhhH---HHhhhhhccccCCCCc Confidence 0001 1112458888877655533321 11222223221 1122221 1222222233211100 Q ss_pred cccccccccccccccCCCChHHh Q lcl|Aclame:pro 321 IPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 321 ~~~~~~~~~~~~~~~~sPt~a~L 343 (367) ..+-. -..=||-+-| T Consensus 390 -~~DfG-------vi~idta~~~ 404 (404) T protein:vir:32 390 -MQDHG-------VIAVDTAVKL 404 (404) T ss_pred -eeeEE-------EEEecccccC Confidence 00000 0001222222 No 157 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=93.30 E-value=0.008 Score=32.07 Aligned_cols=288 Identities=10% Similarity=0.038 Sum_probs=108.9 Q ss_pred CCC------------cccccccee-----ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeecc Q lcl|Aclame:pro 1 MPD------------FNNQVRLVD-----AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRD 63 (367) Q Consensus 1 Ma~------------~~~~T~l~d-----~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~ 63 (367) ||+ ++..+. ++ .+.||+...+ ..+..+.+.|++--=+ .........+|.|+. 
T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~-~~~~~g~~v~~~~~~~l-~~~i~e~s~~l~~i~v---------~~v~~~~~~i~~~~~ 69 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTV-DDLDAGGTLPDPLWDEF-WTDMIEETPLLDAIRT---------ETVGAKKTRIPTLNI 69 (321) T ss_pred CchHHHHHHHHHHHHhccccc-cccCCcceeCHHHHHHH-HHHHHHhhhhhhhcee---------eeccCcceeeeeecc Confidence 332 222211 11 4556654443 3445555555441111 112234456677652 Q ss_pred CCCcccccC-CCCccccccccccchhhhhhhhhHhhcccchhHHHHHhh--cccHHHHHHHHHHHHHhhhhhHHHHHHHH Q lcl|Aclame:pro 64 LDSLEPNYG-SDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELA--GSNPMTRIRNRFGVYWTRQWQRRIIAMAV 140 (367) Q Consensus 64 l~g~~~~~~-~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~--g~DPm~~i~~qia~yw~~~~q~~lla~l~ 140 (367) .+...-.. +++ ....+.+.+-++..-..++.+--+.+++.-..-. +.|..+.+.+++++-+.+..+...+ . T Consensus 70 -~~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~---n 143 (321) T protein:vir:31 70 -GERHRRPQDEGE--WNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAA---N 143 (321) T ss_pred -CCcccccccccc--cccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhhee---e Confidence 22111111 111 1111222222222223333333344444433322 4577788888888666555544433 2 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCc--eeEEEEccHHHHHH Q lcl|Aclame:pro 141 GVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGS--IAAIAVHSMVYKRM 218 (367) Q Consensus 141 Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~--l~~~vmhS~v~~~L 218 (367) | +..+.+.....-.+-+. ....++.-+. .+...++.+.|.+....+-..... 
=-+++||+.++..+ T Consensus 144 G---d~~~~~~~~~~n~G~l~-----~a~~~~~~~~----~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~ 211 (321) T protein:vir:31 144 G---DEDAEDSFENQNDGFIT-----VAEGDVETID----AADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSY 211 (321) T ss_pred c---cccCCCcccccchhhhh-----hhcccccccc----ccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHH Confidence 2 22222110000000000 0011111111 112346788898888887543321 22688999988765 Q ss_pred Hhcchhhh----cccccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEec---ceeeeeccCCCcceeeeeehhhc Q lcl|Aclame:pro 219 TNNDEIEF----IPDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG---AAFGYADGAPQVPVAVGRRELRG 291 (367) Q Consensus 219 ~k~~li~~----~~~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~---GAi~~~~~~~~~~~e~~rd~~~~ 291 (367) ++. |.+- .+..-....-.+++|++|++++.||-.. .+|++ =++++.. ...++..|+...- T Consensus 212 ~~~-l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~~---------il~t~~~nl~~~~~~---~~~~~~~~~~~~~ 278 (321) T protein:vir:31 212 HYT-LTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDDK---------AMFTDPQNLIYALYR---DLEIDVLTESDKV 278 (321) T ss_pred HHH-HhcCCCccccchhhccccccccceeEEEcCCCCCCc---------EEEeccccEEEEEee---ccEEEEeecCccc Confidence 542 1111 1110001123479999999999999421 22211 1111111 1112222222111 Q ss_pred C---CceeEEEEEccEEEeeeee-eeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 292 N---GSGLEYILERKEWIVHPGG-FNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 292 ~---~~g~~~l~~r~~~~~hp~G-~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) . ..-+.++..+-.|++--.+ +.+-. 
.+ +-+-+.--.+|+ T Consensus 279 ~~~~~~~~~~~~~~~~~~ve~~~a~a~~~-~i--------~~~~~~~~~~~~ 321 (321) T protein:vir:31 279 SERDLHARYFMRGDDDFAIENTEAVVLAE-GL--------GDPLEHLEEETS 321 (321) T ss_pred cccceeeEeeeeeecceeEeccccEEEEe-cC--------CcchhcccCCCC Confidence 0 0111222222233322111 11110 00 000111112222 No 158 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=93.04 E-value=0.009 Score=31.80 Aligned_cols=323 Identities=10% Similarity=-0.003 Sum_probs=144.8 Q ss_pred CCCccccccce------ec-cchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCC Q lcl|Aclame:pro 1 MPDFNNQVRLV------DA-VIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGS 73 (367) Q Consensus 1 Ma~~~~~T~l~------d~-i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~ 73 (367) |..+|..|+-. +. +-=|+|.-=|.....+++.| .+.-.+++ + .+|+++.+|+.+...= ..+.- T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~------~~~~~vrt-i-~~GkS~qf~~iG~~~a--~y~~~ 70 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENI------LSYFDVQT-V-TGTNTVSNKYLGETEL--QVLAP 70 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhh------cCcceeee-e-cccceEEEEEEeeeEE--eeecc Confidence 98888777531 11 11133433333333333333 22222221 1 4899999999874421 11111 Q ss_pred CCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhccc-HHHHHHHHHHHHHhhhhhHHHHHHHHH-HHhhhhhhh Q lcl|Aclame:pro 74 DNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSN-PMTRIRNRFGVYWTRQWQRRIIAMAVG-VYKSNLAGN 150 (367) Q Consensus 74 ~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~D-Pm~~i~~qia~yw~~~~q~~lla~l~G-vf~~~~a~~ 150 (367) +. .+-...+.+.+.+-+|=. .--...+.|+-....--| +=.+++++.+..-++..++.++..++. -.+....-. 
T Consensus 71 G~---~ldg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~ 147 (402) T protein:vir:97 71 GQ---SPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) T ss_pred cc---ccCCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 10 011111222211111100 001123667766666666 556888888888888888877764432 111110000 Q ss_pred hhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHH----HHHHHhc--cccCceeEEEEccHHHHHHHhc-ch Q lcl|Aclame:pro 151 FATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFV----DAAFTMG--DHVGSIAAIAVHSMVYKRMTNN-DE 223 (367) Q Consensus 151 ~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~----~A~~~~G--D~~~~l~~~vmhS~v~~~L~k~-~l 223 (367) .. ..........++ ..+. +.+..++..+. +|.+.|- |-...=..++|.|.+|..|.+. +| T Consensus 148 ~~--------~~~~~~g~s~~~----~~t~-~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl 214 (402) T protein:vir:97 148 NK--------PRVKGHGFSINV----NVTE-SEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRI 214 (402) T ss_pred cc--------Cccccccccccc----cccc-chhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccc Confidence 00 000000001111 0111 11223454444 5555553 3333347999999999999887 56 Q ss_pred hh--hcccccc---cccchhhcCcEEEEeCCCcccCCC------------C-------CceEEEEEEecceeeeeccCCC Q lcl|Aclame:pro 224 IE--FIPDSKG---QLTIPTYMGKVVIVDDGMPVFGTG------------A-------DKTYLSILFGGAAFGYADGAPQ 279 (367) Q Consensus 224 i~--~~~~~~g---~~~i~t~~G~~VivdD~~pv~~t~------------~-------~~~yttyl~~~GAi~~~~~~~~ 279 (367) ++ |.....+ .-.+...+|.+|+.+..+|..++. . 
..+-..++|-+-|++.....+- T Consensus 215 ~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~v 294 (402) T protein:vir:97 215 VDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEV 294 (402) T ss_pred cchhhccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeecc Confidence 53 4322222 235788999999999999964310 0 0122457788888877665542 Q ss_pred cceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccc Q lcl|Aclame:pro 280 VPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVP 359 (367) Q Consensus 280 ~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~ 359 (367) ..++.|+..+. .+.+.+.+.|...| ..|.-+-++--. ... +.+ .-.+|.+.-+=-..- -+=. T Consensus 295 -T~~~~~d~r~~----~~~id~~~a~G~g~--~RPeaa~vv~~~-----~~~-t~~---~~~~~~~~~~~~~~~--~~~~ 356 (402) T protein:vir:97 295 -TGDIFYEKKEK----TYYIDTFMAEGAIP--DRWEAVSVVTTK-----RDA-TTG---DAGGPGDDHATVLAR--AQRK 356 (402) T ss_pred -ccchhhchhHH----HHHHHHHHHhCCcc--cCccceEEEEEe-----ccc-ccc---cCCccccchhhhhcc--cccc Confidence 24444454432 12333343343333 233322221000 000 000 011222222211111 1112 Q ss_pred eEEEEecC Q lcl|Aclame:pro 360 MAFLVTKG 367 (367) Q Consensus 360 iv~~~t~g 367 (367) .+..+|-| T Consensus 357 ~~~~~~~~ 364 (402) T protein:vir:97 357 AVYVKTEG 364 (402) T ss_pred eEEEeccc Confidence 34566777 No 159 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=92.63 E-value=0.0052 Score=33.09 Aligned_cols=328 Identities=11% Similarity=0.050 Sum_probs=146.6 Q ss_pred CCCccccccc--eeccchHHHHHHHhhhhHHh----hhHhhc----------------ccccccHHHHHHhhCCCceEEe Q lcl|Aclame:pro 1 MPDFNNQVRL--VDAVIPEVYTSYTAIDRPEL----TAFFLS----------------GAVASNDFLSQFLSAPGRLINI 58 (367) Q Consensus 1 
Ma~~~~~T~l--~d~i~PEVf~~yv~~~~~~~----~~f~~S----------------Gi~~~~~~l~~~~~~~G~~i~~ 58 (367) |--+ +|.+ -+-.=..++..-+-....++ ++|... +.=.|-..+..|-...|+.|++ T Consensus 1 ~~~a--~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf 78 (430) T protein:vir:10 1 MTAS--KTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRF 78 (430) T ss_pred Ccce--eeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEE Confidence 5432 2322 11111223333333333332 344332 2211122333444578999999 Q ss_pred eeeccCCCcccccCCCCccccccccccchhhhhhhhhHhhcccchhH-HHHHhhcccHHHHHHHHHHHHHhhhhhHHHHH Q lcl|Aclame:pro 59 PFWRDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMD-LTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA 137 (367) Q Consensus 59 P~~~~l~g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tD-la~~~~g~DPm~~i~~qia~yw~~~~q~~lla 137 (367) +.-..|.|+. .. ++.. -+---+.|+.+.+..+|=...+++.... .+..-+--|...+...+++.||++..+..++- T Consensus 79 ~L~~~L~g~g-v~-Gd~~-lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v 155 (430) T protein:vir:10 79 HFVQPANAFP-IM-GSEY-AEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLV 155 (430) T ss_pred eEeeccccCc-ee-cCce-eeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999997643 22 2222 1233456777777777666666655443 23444456777788889999999988888888 Q ss_pred HHHHHHhhhhhhhhhhhh-----hhhhhhhhhhcchh-hcce-eecCcc----------cchhhcccHHHHHHHHHHhcc Q lcl|Aclame:pro 138 MAVGVYKSNLAGNFATIK-----TRGRVPAEVLGTAG-DMVI-DISGQT----------NPADAVFNREAFVDAAFTMGD 200 (367) Q Consensus 138 ~l~Gvf~~~~a~~~~~~~-----~~~~~~a~~~~~~~-~~v~-disa~t----------~~a~~~~s~~~l~~A~~~~GD 200 (367) -|.|.-+.....+..... .............. .|.+ .-.+.+ -++.-.++.+.+-+|...... 
T Consensus 156 ~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~ 235 (430) T protein:vir:10 156 HLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQ 235 (430) T ss_pred HHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHh Confidence 888864433222211000 00000000000111 1222 000000 012235788877776554322 Q ss_pred ----------ccC------ceeEEEEccHHHHHHHhcc-hhhhcc-----cccc--cc----cchhhcCcEEEEeCCCcc Q lcl|Aclame:pro 201 ----------HVG------SIAAIAVHSMVYKRMTNND-EIEFIP-----DSKG--QL----TIPTYMGKVVIVDDGMPV 252 (367) Q Consensus 201 ----------~~~------~l~~~vmhS~v~~~L~k~~-li~~~~-----~~~g--~~----~i~t~~G~~VivdD~~pv 252 (367) .++ .+-+++|||.+++.|+.+- ..++.. ..+| ++ .++.|+|+.|.--- .|+ T Consensus 236 ~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~-~vi 314 (430) T protein:vir:10 236 IELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMP-KPI 314 (430) T ss_pred hCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCC-cee Confidence 122 2589999999999999983 333311 1112 22 46788888776321 111 Q ss_pred ----------cC----------------CCCCceEEEEEEecceeeeeccCCC---cc---eeeeeehhhcCCceeEEEE Q lcl|Aclame:pro 253 ----------FG----------------TGADKTYLSILFGGAAFGYADGAPQ---VP---VAVGRRELRGNGSGLEYIL 300 (367) Q Consensus 253 ----------~~----------------t~~~~~yttyl~~~GAi~~~~~~~~---~~---~e~~rd~~~~~~~g~~~l~ 300 (367) .. .+...+=..+|+|.-|++.+.+... .+ .|...|-. 
...++.+ T Consensus 315 rf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g----~~~~i~~ 390 (430) T protein:vir:10 315 RFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHG----DKLELLI 390 (430) T ss_pred eecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccC----chhhhhh Confidence 00 0011122467788777766655421 11 22222221 1122222 Q ss_pred EccEEEeeeeeeeecccccc--cccccccccccccccCCCChHHhcCCccceeeeccc Q lcl|Aclame:pro 301 ERKEWIVHPGGFNWLDADVT--IPDNTGSPSGITSGPPAITLANLANPDNWERVTYRK 356 (367) Q Consensus 301 ~r~~~~~hp~G~s~~~~~~~--~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K 356 (367) . .++...=..|...+.. ...+-.. ..=||-+-|--+ +| T Consensus 391 ~---~i~G~kK~rF~~~~~~~~~~~DfGv-------i~idtaa~~~~~--------~~ 430 (430) T protein:vir:10 391 G---AILGCSKIRFAVEATNGLEYTDHGV-------MAIDTAVKIIGP--------RK 430 (430) T ss_pred h---HHhccceeeecCCCCCCceeeeeEE-------EEhhhhhhhhcC--------CC Confidence 1 2333333334321100 0000000 000111111111 11 No 160 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=92.00 E-value=0.0029 Score=34.50 Aligned_cols=263 Identities=14% Similarity=0.109 Sum_probs=120.4 Q ss_pred CC---Ccccc-c------cceeccchH--HHHHHHhhh---hHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCC Q lcl|Aclame:pro 1 MP---DFNNQ-V------RLVDAVIPE--VYTSYTAID---RPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD 65 (367) Q Consensus 1 Ma---~~~~~-T------~l~d~i~PE--Vf~~yv~~~---~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~ 65 (367) |. ..+-. . +..-.=+|. +|..-+... ...+-.|.+.|-=.|-..+..|-.+.|+.|+++.-..|. 
T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~ 80 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccc Confidence 22 10000 0 000001111 122222111 111224445442222333344445799999999999997 Q ss_pred CcccccCCCCccccccccccchhhhhhhhhHhhcccchhH-HHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 66 SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMD-LTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYK 144 (367) Q Consensus 66 g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tD-la~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~ 144 (367) |+. .. ++.. -+---+.|+-..+..+|=....++.... .+..-+--|...+....++.||++..+..++-.|.|.-+ T Consensus 81 g~g-v~-Gd~~-lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg 157 (318) T protein:vir:27 81 KRP-TM-GDER-VEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (318) T ss_pred cCc-cc-cCce-eeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 643 22 2222 1223345666666666655555543221 122223347677788899999999999999999988765 Q ss_pred hhhhhhhhhh-----hhhhhhhhhhhcchhhcceeecCccc----chhhcccHHHHHHHHHHh-------------ccc- Q lcl|Aclame:pro 145 SNLAGNFATI-----KTRGRVPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTM-------------GDH- 201 (367) Q Consensus 145 ~~~a~~~~~~-----~~~~~~~a~~~~~~~~~v~disa~t~----~a~~~~s~~~l~~A~~~~-------------GD~- 201 (367) .-...+.... ...............++.+-...++. +..-.|+.+.+-++...+ ||. 
T Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~ 237 (318) T protein:vir:27 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (318) T ss_pred ccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccc Confidence 3211111000 00000000111112222222111111 122356666665654332 222 Q ss_pred --cCceeEEEEccHHHHHHHhcc----hhhhcccc----ccc--c----cchhhcCcEEEEeCCCcccCCCCCceEEEEE Q lcl|Aclame:pro 202 --VGSIAAIAVHSMVYKRMTNND----EIEFIPDS----KGQ--L----TIPTYMGKVVIVDDGMPVFGTGADKTYLSIL 265 (367) Q Consensus 202 --~~~l~~~vmhS~v~~~L~k~~----li~~~~~~----~g~--~----~i~t~~G~~VivdD~~pv~~t~~~~~yttyl 265 (367) .+..-+++|||-+++.|+.+- ..++.+.. +|. + .++.|+|+.+.---.+|+- T Consensus 238 ~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIr------------ 305 (318) T protein:vir:27 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR------------ 305 (318) T ss_pred cCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEE------------ Confidence 112589999999999999872 44444421 111 1 3566666666544444431 Q ss_pred EecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccE Q lcl|Aclame:pro 266 FGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE 304 (367) Q Consensus 266 ~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~ 304 (367) |-+|. 
.+...|- + T Consensus 306 f~~G~----------~v~~~~~----------------~ 318 (318) T protein:vir:27 306 FYQGQ----------RFWYQRI----------------T 318 (318) T ss_pred EcCCC----------eeeeeec----------------C Confidence 21221 1112222 1 No 161 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=88.20 E-value=0.034 Score=28.65 Aligned_cols=183 Identities=16% Similarity=0.132 Sum_probs=96.6 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~ 156 (367) ++.+ |.+ -+.+.|+-..-+..|.+.++.+|.+..-++..++.++..+...-........ T Consensus 1 iD~l----L~a------------~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~----- 59 (221) T protein:vir:17 1 MDDL----LVA------------SQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTG----- 59 (221) T ss_pred CCcc----hhH------------HHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc----- Confidence 0111 111 2467888888888999999999999999998888877665433211110000 Q ss_pred hhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhc--cccCceeEEEEccHHHHHHHh---cchhhhccc-c Q lcl|Aclame:pro 157 RGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMG--DHVGSIAAIAVHSMVYKRMTN---NDEIEFIPD-S 230 (367) Q Consensus 157 ~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~G--D~~~~l~~~vmhS~v~~~L~k---~~li~~~~~-~ 230 (367) +.....+....+.+.+..+ + ++.|.+|.++|- +-...=..+++.+..|+.|.+ ..+++.... 
+ T Consensus 60 ---------~~~g~~~~~~a~~t~~~~~-l-~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s 128 (221) T protein:vir:17 60 ---------QDGGFSVNIGAGNTNNAQA-I-VDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNT 128 (221) T ss_pred ---------cccCcceeccccccCCHHH-H-HHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccc Confidence 0011111111122222222 2 577777777763 233345678889999999876 345554222 3 Q ss_pred cccc----cchhhcCcEEEEeCCCcccCCC----C----------Cce-------EEEEEEecceeeeecc--CCCcc-e Q lcl|Aclame:pro 231 KGQL----TIPTYMGKVVIVDDGMPVFGTG----A----------DKT-------YLSILFGGAAFGYADG--APQVP-V 282 (367) Q Consensus 231 ~g~~----~i~t~~G~~VivdD~~pv~~t~----~----------~~~-------yttyl~~~GAi~~~~~--~~~~~-~ 282 (367) .+.. .++.+.|.+|+.+..+|..... . .++ -...+|-+-|++.-.. .|..| . T Consensus 129 ~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~~~ 208 (221) T protein:vir:17 129 QGDMNTGKGLYVNAGIRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRPPL 208 (221) T ss_pred cccccccceeeeecCcEEEEeccCCcccccccccCCccccccccccccccccccceEEEEEcchheeeeeeecCCCCCce Confidence 3322 4677899999999999963211 0 011 2245566666554332 22222 0 Q ss_pred e----eeeehhhc Q lcl|Aclame:pro 283 A----VGRRELRG 291 (367) Q Consensus 283 e----~~rd~~~~ 291 (367) - .-|.|.+. 
T Consensus 209 ~~~~~~~~~~~~~ 221 (221) T protein:vir:17 209 VISMFSIRRPDRR 221 (221) T ss_pred eeeeeeccCCCCC Confidence 0 01222111 No 162 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=88.15 E-value=0.034 Score=28.63 Aligned_cols=286 Identities=11% Similarity=0.058 Sum_probs=143.7 Q ss_pred CCCccccccceeccchHHHHHHHhh---hhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAI---DRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~---~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~ 77 (367) |+. --+.+.+|.|+-.-|.+ ++.--++|+|- .. | .-|..-.+|.++-| -+-+++|+ T Consensus 74 mtt-----~~a~IliP~vis~v~~Eaaepl~~~~kl~qk------~~----L-~~Grsm~F~~~g~~--Ra~~IgEG--- 132 (393) T protein:vir:79 74 MAT-----PSAQILIPRVIVGTMREAAEPLYIGTKMLQK------IR----L-KSGQSMIFPSIGIM--RAYDVAEG--- 132 (393) T ss_pred hcC-----CCcceechhhhhhhhhhcccchhHHHHHHHH------Hh----h-hcCcceeccchhee--eecccccc--- Confidence 543 23667889888665554 22222222210 00 0 23555566666544 23334333 Q ss_pred ccccccccc---hhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLG---SGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 78 ~~~t~~kit---t~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) ++++...|. -+.-...+.+.|-...++|+...=+|=|-|+-.-++-+....|..+...+.-++.- .. 
T Consensus 133 gE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~----gh------ 202 (393) T protein:vir:79 133 QEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSH----GH------ 202 (393) T ss_pred ccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcc----cc------ Confidence 122222222 22222334455667889999988888888887777777777777666666544321 00 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHH-HHHhccccCceeEEEEccHHHHHHHhcchhhhcccc-cc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDA-AFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDS-KG 232 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A-~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~~~-~g 232 (367) .+.++. .+....|+.=.+ ..+-....+|++.|.|= .+.+-+. =....++||+--.....|+.+.+.++.. -| T Consensus 203 ---tvfDa~-st~t~ahptGr~-~~~~qNGTlSleDllDm~~av~~~h-yt~svi~MHPLAWnv~AKna~me~~~~na~g 276 (393) T protein:vir:79 203 ---TVFDNY-STNKLAHTTGLD-KNGVQNDTFSAEDFLDLIIAVMANE-YTPSDLMMHPLAWTVFAKNELMGSLQANPYG 276 (393) T ss_pred ---eeeecc-ccCccceeecCC-ccccccccccHHHHHHHHHHHhccc-CCcceEEEcCchhhhhhhhhhhcceeecccc Confidence 000000 000011111000 00112346899999994 4555332 2467899999999999999888765542 23 Q ss_pred ccc---c--------hhhcCc-----EEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCcee Q lcl|Aclame:pro 233 QLT---I--------PTYMGK-----VVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGL 296 (367) Q Consensus 233 ~~~---i--------~t~~G~-----~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~ 296 (367) +.. + ..++|| -|+++-=+|+.. ++.+|--|+.-..-+++-.+.+ ...+||-..+- .+. 
T Consensus 277 N~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~--k~~rFd~~~Vd~NnvgvlLV~D--~i~tdq~ddk~--rdi 350 (393) T protein:vir:79 277 NYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDK--KSRRFDVYAVDRNNVGVLLVRD--DLKTDQWDEKA--RGL 350 (393) T ss_pred ccCccccchhhhhchhhhccccccceeEEEeccccccc--ccceeeEEEeecCCceEEEEec--Ccceecccccc--ccc Confidence 221 1 124454 788888888765 4556655666555544444443 34455544433 444 Q ss_pred EEEEEccEEEeee----------eeeeecccccccccccccccccccccCCCChHHhcCCcc Q lcl|Aclame:pro 297 EYILERKEWIVHP----------GGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDN 348 (367) Q Consensus 297 ~~l~~r~~~~~hp----------~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~N 348 (367) +-+--|.||.+++ +-++++++- |..--+.+-.| T Consensus 351 q~iKl~ERYG~gvLn~gkaiavakNI~~~k~y-------------------~~P~~~~~~~~ 393 (393) T protein:vir:79 351 QNIKMIERYGIGILNEGKAIAVAKNISMDKSY-------------------AEPMLIKNVGN 393 (393) T ss_pred eeeeeeeeeceeeeeCCceEEEEecceeeccc-------------------ccchhhhccCC Confidence 4444444454422 233443221 11111222222 No 163 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=86.00 E-value=0.049 Score=27.78 Aligned_cols=314 Identities=12% Similarity=0.035 Sum_probs=137.9 Q ss_pred CCCccccccce------------eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRLV------------DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE 68 (367) Q Consensus 1 Ma~~~~~T~l~------------d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~ 68 (367) |..+|.-|+-. .+|.-||+..|... +-| .+.-.+++ -.+|+++.+|+.+... . 
T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~-----si~------~~~~~vRt--i~~gkS~qf~~~G~s~--~ 65 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKG-----ENI------MSYFDVQT--VTGTNTVSNKYLGETE--L 65 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHH-----hhh------cccceeee--ecccceEEEEEeeeeE--e Confidence 88888777531 34555555555432 222 22222221 1589999999987442 1 Q ss_pred cccCCCCccccccccccchhhhhhhhhH-hhcccchhHHHHHhhccc-HHHHHHHHHHHHHhhhhhHHHHHHHH--HHHh Q lcl|Aclame:pro 69 PNYGSDNPNVEAPIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSN-PMTRIRNRFGVYWTRQWQRRIIAMAV--GVYK 144 (367) Q Consensus 69 ~~~~~~~~~~~~t~~kitt~~~~a~i~~-r~kg~~~tDla~~~~g~D-Pm~~i~~qia~yw~~~~q~~lla~l~--Gvf~ 144 (367) ..+.-+. .+....+.+.+.+-+|=. .---..+.|+-...+--| +=.+++++++..-++..+..++..++ |+ + T Consensus 66 ~~~~pG~---~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~-a 141 (401) T protein:vir:70 66 QVLAPGQ---SPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGI-A 141 (401) T ss_pred eeecCCC---CcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-c Confidence 1111111 111122222221111100 011124566666666656 44588888887778877776665542 22 1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHH----HHHHHh--ccccCceeEEEEccHHHHHH Q lcl|Aclame:pro 145 SNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFV----DAAFTM--GDHVGSIAAIAVHSMVYKRM 218 (367) Q Consensus 145 ~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~----~A~~~~--GD~~~~l~~~vmhS~v~~~L 218 (367) ...+.+. ..........+++.+.. .....+...|. 
+|.+.| -|-...-.++.|.+..|.-| T Consensus 142 na~~~~~-----------~p~~~~~G~~i~v~~~~--~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~L 208 (401) T protein:vir:70 142 NTQAKRT-----------NPRVKGHGFSINVEVAE--GEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVL 208 (401) T ss_pred ccccccc-----------CCCcCCCceEEeccccc--cccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHH Confidence 1111000 01111112223333322 22234444455 444433 22222334444455555555 Q ss_pred Hhc-chhhhc--ccccc---cccchhhcCcEEEEeCCCcccCC------------C-------CCceEEEEEEecceeee Q lcl|Aclame:pro 219 TNN-DEIEFI--PDSKG---QLTIPTYMGKVVIVDDGMPVFGT------------G-------ADKTYLSILFGGAAFGY 273 (367) Q Consensus 219 ~k~-~li~~~--~~~~g---~~~i~t~~G~~VivdD~~pv~~t------------~-------~~~~yttyl~~~GAi~~ 273 (367) .+. +|++-. ....+ .-.+....|++|+.+..+|..+. + ...+-...+|-+-|++. T Consensus 209 l~~d~L~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~t 288 (401) T protein:vir:70 209 RDADRIVDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLV 288 (401) T ss_pred HhcCcccchhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEE Confidence 553 577532 22222 23567889999999999996431 1 11122357788888877 Q ss_pred eccCCCcceeeeeehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCC---ccce Q lcl|Aclame:pro 274 ADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANP---DNWE 350 (367) Q Consensus 274 ~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~---~NW~ 350 (367) ....+- ..|+.||..+. .+.+..++.|.. ....|.-+.+.... ...-|-+.+.+. .+-. 
T Consensus 289 vk~~~l-t~~~~~d~r~~----~~~id~~~a~g~--g~~RPeaa~vv~~k-----------~~~~~~~~~~~~~~~~~~~ 350 (401) T protein:vir:70 289 GRSIDV-TGDIFYEKKEK----TYYIDTFMAEGA--IPDRWEAVSVVTTK-----------RNTTTGAVEGTDGAQHTIV 350 (401) T ss_pred EEeecc-ccchhhhhhhh----HHHHHHHHHhCC--cccchhheEEEeec-----------CcccccccccCCcchhhhh Confidence 655542 34444554432 233334433433 33455444432111 111111111111 0111 Q ss_pred ee-ecccccceEEEEecC Q lcl|Aclame:pro 351 RV-TYRKNVPMAFLVTKG 367 (367) Q Consensus 351 ~v-~d~K~i~iv~~~t~g 367 (367) ++ ..+|.+ -+++-+ T Consensus 351 ~~~~~~~~~---~~~~~~ 365 (401) T protein:vir:70 351 KNRAQRKAV---YVKNAA 365 (401) T ss_pred hhhccceeE---Eecccc Confidence 11 122222 334444 No 164 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=79.86 E-value=0.1 Score=26.06 Aligned_cols=290 Identities=12% Similarity=-0.065 Sum_probs=110.6 Q ss_pred CCCccccccc-eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCcccc Q lcl|Aclame:pro 1 MPDFNNQVRL-VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE 79 (367) Q Consensus 1 Ma~~~~~T~l-~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~ 79 (367) +-.....|.- .-.++|+-+..-+.+.+.+.+-+.+-.-+ ...+| ...+|....- +.+.-+.+...... 
T Consensus 83 ~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v---------~~~~~-~~~i~~~~~~-~~a~w~~e~~~~~~ 151 (395) T protein:vir:95 83 FNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINF---------QNAGI-KTRVIKADPA-GQAVWGKVFGEIKG 151 (395) T ss_pred HHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhcee---------EecCC-ceEEEEecCC-cceEEeecccccCc Confidence 0000001111 12457777777777776666655442211 11234 3577765433 22222222111000 Q ss_pred ccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-------HHHHHhhhhhhhhh Q lcl|Aclame:pro 80 APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-------AVGVYKSNLAGNFA 152 (367) Q Consensus 80 ~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-------l~Gvf~~~~a~~~~ 152 (367) ...-+++. -.-..++...-..++..-..-+..|-...+.+++++-..+..++.+|.= =.|+++....... T Consensus 152 ~~~~~f~~--i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~- 228 (395) T protein:vir:95 152 QLDAAFRE--ENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSG- 228 (395) T ss_pred ccccccee--eeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeeccccccc- Confidence 00011111 1112222332234555544445556667888888866666555433310 0122211100000 Q ss_pred hhhhhhhhhhhhhcchhhcceee-cCcccchhhcccHHHHHHHHHHhc-------cccCceeEEEEccHHHHHHHhcchh Q lcl|Aclame:pro 153 TIKTRGRVPAEVLGTAGDMVIDI-SGQTNPADAVFNREAFVDAAFTMG-------DHVGSIAAIAVHSMVYKRMTNNDEI 224 (367) Q Consensus 153 ~~~~~~~~~a~~~~~~~~~v~di-sa~t~~a~~~~s~~~l~~A~~~~G-------D~~~~l~~~vmhS~v~~~L~k~~li 224 (367) ..+-.. ......+.....+..+.+....+. 
.....-..++||+..+.+++.+-+ T Consensus 229 -----------------~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~- 290 (395) T protein:vir:95 229 -----------------AVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYT- 290 (395) T ss_pred -----------------ccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcce- Confidence 000000 000000111223344444332210 012234568999999887665432 Q ss_pred hhcccccccccchhhc--CcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEc Q lcl|Aclame:pro 225 EFIPDSKGQLTIPTYM--GKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILER 302 (367) Q Consensus 225 ~~~~~~~g~~~i~t~~--G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r 302 (367) .+++.|.+ .+.+ |.+|++++.||-.. -.-|.+.-|.++. ...+++++.....-..++..+..+ T Consensus 291 --~~~~~G~~--~~~lg~g~~v~~~~~~p~~~-i~fgdfs~y~i~~----------r~~~~i~~~~~~~~~~d~~~f~~~ 355 (395) T protein:vir:95 291 --YLTANGGF--VTVLPYNVTIITSEFVPEGK-LVAFVTDRYNAVR----------GGGLTVKKFDQTLALEDAVLFTAK 355 (395) T ss_pred --eccCCCcc--eeccCCcceEEEcCCCCCCc-EEEEecccEEEEE----------ecceEEEeccchhhhCCcEEEEEE Confidence 23344542 3443 67899999998321 0111122222211 112333444333222344444444 Q ss_pred cEE---Eeeeeeeeecccccc--cccccccccccccccCCCChHHhcCC Q lcl|Aclame:pro 303 KEW---IVHPGGFNWLDADVT--IPDNTGSPSGITSGPPAITLANLANP 346 (367) Q Consensus 303 ~~~---~~hp~G~s~~~~~~~--~~~~~~~~~~~~~~~~sPt~a~L~~~ 346 (367) .++ ++++..|....-++. 
.+..+.+ .+..+|= +-+ T Consensus 356 ~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~-----~~~~~~~----~~~ 395 (395) T protein:vir:95 356 TFAYGQPDDNKASAVYDLKVASAPRRQTSA-----GGTTDGI----AEA 395 (395) T ss_pred EEECCEEeccccEEEEEeeccCCCCCCCCC-----CCCCCcc----ccC Confidence 333 334444444322211 1111111 1222222 222 No 165 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=79.00 E-value=0.11 Score=25.87 Aligned_cols=277 Identities=12% Similarity=0.028 Sum_probs=114.6 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||-.| + +|.|.+.+.++..+.+- +|.+...+.....--.||++|.||....- |. -+|.-++. -- T Consensus 1 MA~~n--------~-a~~~~~~Ld~~~~~~l~---~~~L~~~~~~~~v~~~gg~tVkI~~i~~~-gl-~DY~R~~~--g~ 64 (299) T protein:vir:79 1 MAALN--------Y-AKEYSNVLAQAYPYTLN---FGDLYATPNNGRYRWTGSKTIEIPTISTT-GR-VDSNRDTI--AV 64 (299) T ss_pred Cccch--------h-HHHHHHHHHHHHHhhce---eeeeccCcccceeeecCCCEEEEeccccc-cc-cccccCCC--cc Confidence 99533 2 37788888777666543 45555444322211257999999988642 32 23432211 01 Q ss_pred cccccchhhh-hhhhhHhhcccchhHHHHHhh-cccHHHHHHHHHHHHHh-hhhhHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEM-KTTKTWLNKAYGAMDLTAELA-GSNPMTRIRNRFGVYWT-RQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) Q Consensus 81 t~~kitt~~~-~a~i~~r~kg~~~tDla~~~~-g~DPm~~i~~qia~yw~-~~~q~~lla~l~Gvf~~~~a~~~~~~~~~ 157 (367) .+..++...+ ...-+.|..+|.+.++-..-+ +.-.++.+.+++.+... -..++-.++.| .+.... 
T Consensus 65 ~~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl---~~~a~~--------- 132 (299) T protein:vir:79 65 AQRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKI---YADWTA--------- 132 (299) T ss_pred cccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHH---HHhhhh--------- Confidence 1122322221 122234566667763332222 22234444444432221 11222223322 110000 Q ss_pred hhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc--CceeEEEEccHHHHHHHhcchh-hhccccc--c Q lcl|Aclame:pro 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNNDEI-EFIPDSK--G 232 (367) Q Consensus 158 ~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~--~~l~~~vmhS~v~~~L~k~~li-~~~~~~~--g 232 (367) ... ..++.+-+++ --++.|.++..+|=+.. ..=..++|.|.++.-|++...+ ....... + T Consensus 133 -----------~g~--~~~~~~~T~~--n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~ 197 (299) T protein:vir:79 133 -----------LGN--TADTTVLTTT--NVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGT 197 (299) T ss_pred -----------cCC--cccccccCHH--HHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccc Confidence 000 0011111111 12577888888776542 3347899999999999886422 2222211 1 Q ss_pred --cccchhhcCcEEEE--eCCCccc-----C--CCCCceEEEEEEe-cceeeeeccCCCcceeeeeehhhcCCceeEEEE Q lcl|Aclame:pro 233 --QLTIPTYMGKVVIV--DDGMPVF-----G--TGADKTYLSILFG-GAAFGYADGAPQVPVAVGRRELRGNGSGLEYIL 300 (367) Q Consensus 233 --~~~i~t~~G~~Viv--dD~~pv~-----~--t~~~~~yttyl~~-~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~ 300 (367) +..|+.+.|+.|+. ++.|+.. + ++..++-.-|++. ++|+.--.... .+... 
.|..+ +.| +.|+ T Consensus 198 ~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~in~ii~~~~a~~~~~K~~--~~~~~-~P~~~-~~~-~~~~ 272 (299) T protein:vir:79 198 SLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQIFMSLVHPSAIITPVSYQ--FSKLD-EPTAV-TEG-KYFY 272 (299) T ss_pred eeeeeeeeecceEEEEechhhcCccceeccCccccCcccccceEEEcCCeeeeeEeee--eEEee-cCCCC-Ccc-ceee Confidence 23578889998875 4445421 0 1122232234433 33332111111 11111 13222 333 3444 Q ss_pred EccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccce-EEEEecC Q lcl|Aclame:pro 301 ERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPM-AFLVTKG 367 (367) Q Consensus 301 ~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~i-v~~~t~g 367 (367) .-|.| +--| |.+.|.=.| |-+++=| T Consensus 273 ~~r~y-----~d~~-------------------------------------v~~nk~~~i~~~~~~a~ 298 (299) T protein:vir:79 273 FEESF-----EDVF-------------------------------------ILNKKADAIQFVVEGAG 298 (299) T ss_pred eeeee-----eeee-------------------------------------eeccccCeEEEEeeecC Confidence 32211 1111 112222111 1111111 No 166 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=72.33 E-value=0.18 Score=24.61 Aligned_cols=303 Identities=10% Similarity=0.013 Sum_probs=135.4 Q ss_pred CCCcccccc--cee---ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCC Q lcl|Aclame:pro 1 MPDFNNQVR--LVD---AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDN 75 (367) Q Consensus 1 Ma~~~~~T~--l~d---~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~ 75 (367) |-++|..+. ++. -.-||+=.-|-.++....-+.. 
=++-+.++...+=..-|.||.+=++.+|..+---..|+- T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~--lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv 78 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKE--QYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGI 78 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhh--hhhhhcccccccccccCCeEEEEecccccccccchhcCC Confidence 655554431 111 1345543445444433332221 111222222222234699999888888854333333332 Q ss_pred c-----------------ccccccc--------------ccchhhhhhhhhHhhcccchhHHHHHhhcccHHHH-HHHHH Q lcl|Aclame:pro 76 P-----------------NVEAPID--------------GLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTR-IRNRF 123 (367) Q Consensus 76 ~-----------------~~~~t~~--------------kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~-i~~qi 123 (367) + ...+|.+ +++--...+.+++.|.=.++||.+.+..-++-+.+ +...+ T Consensus 79 ~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~el 158 (401) T protein:vir:95 79 DASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSREL 158 (401) T ss_pred CcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHH Confidence 1 1111111 22222233344444544588998877666655543 44444 Q ss_pred HHHH----hhhhhHHHHHHHHH-HHhhhhhhhhhhhhhhhhhhhhhhcchhhcceeec-CcccchhhcccHHHHHHHHHH Q lcl|Aclame:pro 124 GVYW----TRQWQRRIIAMAVG-VYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDIS-GQTNPADAVFNREAFVDAAFT 197 (367) Q Consensus 124 a~yw----~~~~q~~lla~l~G-vf~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~dis-a~t~~a~~~~s~~~l~~A~~~ 197 (367) -..- .+..|++||+...- +++ ...+.|.+ +....+...++.+.+-++... 
T Consensus 159 l~g~~~~t~d~i~~dll~ag~~viyA------------------------g~ats~At~~~~~~~~t~vt~~~l~rl~~~ 214 (401) T protein:vir:95 159 MNGATQITEAVLQKDLLAAAGTVLYA------------------------GAATSDATITGEGSTPSVVSYKNLMRLDQI 214 (401) T ss_pred hhhhhhhHHHHHHHHHHhhcCeeecC------------------------CccceeeeccccccccceechhHHHHHHHH Confidence 3333 22334444422100 010 01112221 111223445666666666544 Q ss_pred hcc-------------------ccCceeEEEEccHHHHHHHhc-------chhhhccccccc----ccchhhcCcEEEEe Q lcl|Aclame:pro 198 MGD-------------------HVGSIAAIAVHSMVYKRMTNN-------DEIEFIPDSKGQ----LTIPTYMGKVVIVD 247 (367) Q Consensus 198 ~GD-------------------~~~~l~~~vmhS~v~~~L~k~-------~li~~~~~~~g~----~~i~t~~G~~Vivd 247 (367) +=+ --+.-.+.+|||.....|+.. ++++..++.+.. -+|+.+-+.|+|+. T Consensus 215 L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~ 294 (401) T protein:vir:95 215 LTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQV 294 (401) T ss_pred HHhcccccchhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEec Confidence 322 123345689999776666533 577778887654 36889999999987 Q ss_pred CC--------CcccC------------CCCCceEEEEEEecceeeeeccCCC-----cceeeeeehhhcCCceeEEEEEc Q lcl|Aclame:pro 248 DG--------MPVFG------------TGADKTYLSILFGGAAFGYADGAPQ-----VPVAVGRRELRGNGSGLEYILER 302 (367) Q Consensus 248 D~--------~pv~~------------t~~~~~yttyl~~~GAi~~~~~~~~-----~~~e~~rd~~~~~~~g~~~l~~r 302 (367) .- +|..+ .+...+|-++++|.-|++.-..... ..+.+.|- -++- ...+--+.. 
T Consensus 295 p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~p-G~~~-ad~~DPlgQ 372 (401) T protein:vir:95 295 PEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMP-GKET-ADRNDPYGE 372 (401) T ss_pred ccceeecCCcccccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecC-CcCC-CCCCCcccc Confidence 65 33322 2345588889999888865442211 12222222 1100 000111111 Q ss_pred cEEEeeeeeeeecccccccccccccccccccccCCCC Q lcl|Aclame:pro 303 KEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) Q Consensus 303 ~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt 339 (367) +-+ -|++|--+..+...-. + ..-.+.+|- T Consensus 373 ~g~----vgwK~~~a~~vL~~e~--m--~~ies~a~~ 401 (401) T protein:vir:95 373 TGF----SSIKWYYGILVKRPER--L--ALIKTVAPL 401 (401) T ss_pred eeh----hhhhhhhhhheeccce--e--EEEEeecCC Confidence 111 3444433322211100 0 001122232 No 167 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=69.69 E-value=0.22 Score=24.20 Aligned_cols=287 Identities=11% Similarity=0.024 Sum_probs=108.9 Q ss_pred CCCccccccc-eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcc--cccCCCCcc Q lcl|Aclame:pro 1 MPDFNNQVRL-VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE--PNYGSDNPN 77 (367) Q Consensus 1 Ma~~~~~T~l-~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~--~~~~~~~~~ 77 (367) ||- .|.. +....+.-...-|.+...+.+.+++- -|.-.+ .||. ....--+-+.+.+ ....+... 
T Consensus 1 mpa---ltLaea~k~~~d~l~~~ViE~~~~~s~lL~~---LpF~~v-----eg~~-~~ynR~~~~~~~~~~~v~~~~~~- 67 (310) T protein:vir:97 1 MAS---VTLAESAKLAQDELVAGVIENIITVNRMFDV---LPFDSI-----EGNS-LAYNRENVLGDVIMAGVGTTFSG- 67 (310) T ss_pred Ccc---cchHHHhhcCcchHHHHHHHHHhccchHHHh---CCcccc-----cCCc-ceeeEeeccCCcccccccccccC- Confidence 883 2311 11233444445555555555554321 010001 1221 1111001110100 00000000 Q ss_pred ccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHH---HHHhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 78 VEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFG---VYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) Q Consensus 78 ~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia---~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~ 154 (367) ....+..-+..+......-.+.-..+.-.-..+-.++|+.++..|+. +...++.+..||. | + .+++ .|. T Consensus 68 ~g~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lIN---G---D-~a~n-~F~ 139 (310) T protein:vir:97 68 AGAGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLIN---G---N-GAGN-EFA 139 (310) T ss_pred CCccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhc---c---c-cCCC-ccc Confidence 00111222222222222222333333333334445889999988886 4455555555553 1 1 1111 122 Q ss_pred hhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhc-------chhhhc Q lcl|Aclame:pro 155 KTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNN-------DEIEFI 227 (367) Q Consensus 155 ~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~-------~li~~~ 227 (367) .+...+ ....+.|..+. .+.++.+.|-..+.+.=+....-..+.||++.+.+++.. +...-. 
T Consensus 140 GL~~~~-------~~~q~i~~~~~----gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~ 208 (310) T protein:vir:97 140 GLIQLC-------ASGQKATTGAT----GSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVV 208 (310) T ss_pred chhhcC-------CccceeecCCC----CCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCcc Confidence 222111 11223332211 123566655444444312233456899999764443322 111111 Q ss_pred ccccccccchhhcCcEEEEeCCCcccCC--CCCceEEEEEEecc-------eeeeeccCCCcceeeeeehhh-cCCceeE Q lcl|Aclame:pro 228 PDSKGQLTIPTYMGKVVIVDDGMPVFGT--GADKTYLSILFGGA-------AFGYADGAPQVPVAVGRRELR-GNGSGLE 297 (367) Q Consensus 228 ~~~~g~~~i~t~~G~~VivdD~~pv~~t--~~~~~yttyl~~~G-------Ai~~~~~~~~~~~e~~rd~~~-~~~~g~~ 297 (367) .... .-.+.+|+|.+++..|.+|+..+ .+.+....|...-| -+|+-.. ..+...-|..-. ....... T Consensus 209 ~~~~-G~~v~~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~--~~~glsVr~~G~~~~~~v~~ 285 (310) T protein:vir:97 209 ELPS-GAEVPAYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTAT--QAAGIQVVDVGESEDSDEHI 285 (310) T ss_pred ccCC-CCEEeeeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccC--CccceeEEeCCcccCCccee Confidence 2212 23678999999999999998642 23444444555433 3332211 122223333221 1112223 Q ss_pred EEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEec Q lcl|Aclame:pro 298 YILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTK 366 (367) Q Consensus 298 ~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~ 366 (367) +++.- | .|.. 
|..+|++...+=++| T Consensus 286 ~~V~~--Y----~~~a--------------------------------------v~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 286 WRVKW--Y----CGLA--------------------------------------LFSEKGLACADGITN 310 (310) T ss_pred EEEEE--e----eeEE--------------------------------------EecccceeeeccccC Confidence 33321 1 1111 111222222222222 No 168 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=62.94 E-value=0.33 Score=23.25 Aligned_cols=313 Identities=9% Similarity=-0.045 Sum_probs=111.7 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHH----hhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQF----LSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~----~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) ||. +.|+|++..++.|+.+..+....|+....+-..+..+.. -+..+..+..||-.+-.+. ... T Consensus 1 M~~------i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~-~~~----- 68 (348) T protein:vir:96 1 MGL------IYDKVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNV-TIR----- 68 (348) T ss_pred Ccc------hhhccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCc-cee----- Confidence 985 467899999999997654444556555555433322221 1233444455655433221 111 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHH----hhcccH-HHHHHHHHH-------HHHhhhhhHHHHHHHH-HHH Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAE----LAGSNP-MTRIRNRFG-------VYWTRQWQRRIIAMAV-GVY 143 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~----~~g~DP-m~~i~~qia-------~yw~~~~q~~lla~l~-Gvf 143 (367) .-..+++..-.-...+......+.|+-.+ .++.++ +..+.++++ ....++.+......|. 
|-+ T Consensus 69 ----~r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki 144 (348) T protein:vir:96 69 ----DRVSAEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKI 144 (348) T ss_pred ----cccceeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCee Confidence 11111221111222222334455554332 112222 345555544 2233333333333332 322 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcc- Q lcl|Aclame:pro 144 KSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNND- 222 (367) Q Consensus 144 ~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~- 222 (367) .....+. . . ..+....+.|.+..+.+=.++.+.+ ...|.+....+=+.+.....++|.++++..|.++. T Consensus 145 ~~~~~~~----~----~-~vdfg~~~~~~~t~~~~W~~~~adp-~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~ 214 (348) T protein:vir:96 145 AFTSDGV----N----K-DIDYGVKADHKKQVSKSWAEPGATP-LADLEDAIETARELGLNPERAIMNAKTFGLIRKAAS 214 (348) T ss_pred EeecCCe----e----E-EEeccCCcccceeeccccCCCCCCH-HHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHH Confidence 1111000 0 0 0011112233333222211111111 12222222222223445678999999999998874 Q ss_pred hhhhcccccccc----------cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCC------cceeeee Q lcl|Aclame:pro 223 EIEFIPDSKGQL----------TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQ------VPVAVGR 286 (367) Q Consensus 223 li~~~~~~~g~~----------~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~------~~~e~~r 286 (367) ..+.+....+.. -++++.|..|++=|.- +.. .+|+.+ .++-+|.+.+.....- ...| +. 
T Consensus 215 v~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~-y~d--~~G~~~-~~~p~~~v~l~~~~~~G~~~yg~~~e-~~ 289 (348) T protein:vir:96 215 TVKAIKPLAGDGSSVTKAELQNYVADNYGVEIVLENGT-YRN--EKGEVS-KFFPDGHLTLIPNGPLGNTVFGTTPE-ES 289 (348) T ss_pred HHHHHhccCCccccccHHHHHHHHhhhcCceEEEEccE-EEe--cCCcEe-ccccCCeEEEEcCCCceeEEeccChh-hh Confidence 445444333211 1345567666654432 111 123321 2233333332211100 0001 11 Q ss_pred ehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeec-ccccceEEEEe Q lcl|Aclame:pro 287 RELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTY-RKNVPMAFLVT 365 (367) Q Consensus 287 d~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d-~K~i~iv~~~t 365 (367) +...+...-.++-.... +..-.+|.+. .|....+...++.=-|.. +..+-++.+.+ T Consensus 290 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~-------------------dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~ 346 (348) T protein:vir:96 290 DLFADNTVNADVEIVDS----GIAVTTTKTT-------------------DPVNVQTKVSMVALPSFERLGDVYMLTVIP 346 (348) T ss_pred hhhhcccccccceecCC----eeEEEeeecC-------------------CCceEEEEEeeeeeccccCCCcEEEEEEec Confidence 11111000001100000 0111233221 233332222222222211 12222222211 Q ss_pred cC Q lcl|Aclame:pro 366 KG 367 (367) Q Consensus 366 ~g 367 (367) == T Consensus 347 ~~ 348 (348) T protein:vir:96 347 GV 348 (348) T ss_pred CC Confidence 11 No 169 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=61.00 E-value=0.36 Score=23.01 Aligned_cols=313 Identities=10% Similarity=-0.025 Sum_probs=113.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHH----HhhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQ----FLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~----~~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) ||. |.|+|+|..+..|+.+.......|+....+-.....+. +-...|..+-.||-..-.+....-. 
T Consensus 1 M~~------l~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r---- 70 (348) T protein:vir:49 1 MGL------IYDKVTASNIAGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDR---- 70 (348) T ss_pred Ccc------hhhhcCHHHHHHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeeecCCCCcceecc---- Confidence 994 56889999999999865444445554444433332221 1123444555555554322211111 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHHhhc---ccH--HHHHHHHHH-------HHHhhhhhHHHHHHHH-HHH Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG---SNP--MTRIRNRFG-------VYWTRQWQRRIIAMAV-GVY 143 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g---~DP--m~~i~~qia-------~yw~~~~q~~lla~l~-Gvf 143 (367) ..+++..-.-...+...-..+.|.-.+... ..| ...+.++++ +...++.+......|. |-+ T Consensus 71 ------~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki 144 (348) T protein:vir:49 71 ------VSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKI 144 (348) T ss_pred ------cceeeeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeE Confidence 112222222222333444566674433222 221 223334433 2233333333333332 322 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcc- Q lcl|Aclame:pro 144 KSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNND- 222 (367) Q Consensus 144 ~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~- 222 (367) .... +.. . . ..+-...+.|.+-.+..=.++.+.+ ...|-+.+..+=+.+.....++|.++++..|+++. 
T Consensus 145 ~i~~--~g~--~----~-~vdyg~~~~~~~t~~~~W~~~~adp-~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~ 214 (348) T protein:vir:49 145 AFTS--DGV--N----K-DIDYGVKPDHKKQVSKSWAEPGATP-LADLEDAIETARELGLNPERAVMNAKTFGLIRKAAS 214 (348) T ss_pred EEec--CCc--e----E-EEeecCCcccceeeeeccCCCCCCH-HHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHH Confidence 1110 000 0 0 0001111223222221111111111 12222222222233445778999999999998874 Q ss_pred hhhhccccccc---c-------cchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCC--Ccc----eeeee Q lcl|Aclame:pro 223 EIEFIPDSKGQ---L-------TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAP--QVP----VAVGR 286 (367) Q Consensus 223 li~~~~~~~g~---~-------~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~--~~~----~e~~r 286 (367) ..+.+....+. + -+.++.|..|++=|.- +.. .+|+.+ .+|-++.|.+..... ... .| +. T Consensus 215 v~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~i~~y~~~-y~d--~dG~~~-~~~p~~~v~l~~~~~~G~~~yg~~~e-~~ 289 (348) T protein:vir:49 215 TVKVIKPLAGDGSSVTKAELDNYIADNFGVTVVLENGT-YRN--EKGEVS-KFFPDGHLTLIPNGPLGNTVFGTTPE-ES 289 (348) T ss_pred HHHHhhccCcccccccHHHHHHHHHhhcCceEEEEeeE-EEe--cCCcEe-eeecCCeEEEecCCCcceeEEecChh-hh Confidence 33444333221 1 1234556666654442 211 123221 223333333322111 000 11 01 Q ss_pred ehhhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceee-ecccccceEEEEe Q lcl|Aclame:pro 287 RELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERV-TYRKNVPMAFLVT 365 (367) Q Consensus 287 d~~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v-~d~K~i~iv~~~t 365 (367) +...+.....+.-..+..+.++ +|.+. 
.|....+...+..=-| .++..+-++.+++ T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~-------------------dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 346 (348) T protein:vir:49 290 DLFADNTVNADVEIVDNGIAVT----TTKTT-------------------DPVNVQTKVSMVALPSFERLDDVYMLTVIP 346 (348) T ss_pred hhccccccccceeecCCeEEEe----eeecC-------------------CCceEEEEEeeeccccccCCCcEEEEEEec Confidence 1111111111222222222221 23221 1222222221211112 2233333333333 Q ss_pred cC Q lcl|Aclame:pro 366 KG 367 (367) Q Consensus 366 ~g 367 (367) == T Consensus 347 ~~ 348 (348) T protein:vir:49 347 AV 348 (348) T ss_pred CC Confidence 22 No 170 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=43.72 E-value=0.83 Score=21.02 Aligned_cols=268 Identities=9% Similarity=-0.004 Sum_probs=99.5 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) ||- |=. +.+.+.+.++...++.. |.++.+.-.. ..--.||++|.||-..-.+|-. +|.-.. .. 
T Consensus 1 Mai-n~~---------~k~~~~ld~~~~~~~~~--~~l~~~~n~~-~~~~~gak~VkIp~ist~~gl~-dY~R~~---g~ 63 (285) T protein:vir:79 1 MTV-VLD---------SKDLARIDEEYKADSQV--WSYLTGGNGV-TQRFRGHNEVRINKLSGFVDAT-AYKRGQ---DN 63 (285) T ss_pred Ccc-hhh---------HHHHHHHHHHHHHhhhh--hhhcccCCcc-eeEecCCCEEEEeeeccccccc-cccccc---Cc Confidence 884 211 23444455544443222 2233322100 0012479999999885333322 243221 12 Q ss_pred cccccchhhhhhhh-hHhhcccchhHHHHHhhcccHHHHHHHHHHHHHh-hhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWT-RQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 81 t~~kitt~~~~a~i-~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~-~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) +.+.++...+.-++ +-|+..|.+...-..-.+.=.+++|.+++.+... -..++--+|.|- +.+.. T Consensus 64 ~~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla---~~a~~---------- 130 (285) T protein:vir:79 64 ARKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLF---DSAAK---------- 130 (285) T ss_pred cccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHH---hhccc---------- Confidence 33444444333322 2255555544111111222224444444432221 112222233221 11100 Q ss_pred hhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhcccc-CceeEEEEccHHHHHHHhcchhhhcccc-----cc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV-GSIAAIAVHSMVYKRMTNNDEIEFIPDS-----KG 232 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~-~~l~~~vmhS~v~~~L~k~~li~~~~~~-----~g 232 (367) . ...+-+++. -++.|.+|+.+|=+.. ..=.+++|.|.+|.-|++...+...... 
.+ T Consensus 131 ------------~----~~~~~T~~n--v~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~ 192 (285) T protein:vir:79 131 ------------K----ATDSITKDN--ALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVIN 192 (285) T ss_pred ------------c----cccccCHHH--HHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceecc Confidence 0 000111111 2667777777775442 1235889999999988877533221111 11 Q ss_pred c--ccchhhcC-cEEEE--eCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeee--eehhhcCCceeEEEEE-ccE Q lcl|Aclame:pro 233 Q--LTIPTYMG-KVVIV--DDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVG--RRELRGNGSGLEYILE-RKE 304 (367) Q Consensus 233 ~--~~i~t~~G-~~Viv--dD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~--rd~~~~~~~g~~~l~~-r~~ 304 (367) . ..++.+-| ..++. +|.|...+ ..+-.-|++.+....+. +.. .+.- -+|..+ .+|--.|+. |+- T Consensus 193 ~i~~~V~~lDg~v~ii~Vps~r~kt~~---~~k~Infiiv~~~a~i~---~~K-~~~~~~f~P~~~-~~~d~~~~~~R~Y 264 (285) T protein:vir:79 193 GIDRRVAQLDGGVPIVRVSSDRLKGLG---ITNHVNFILTPLSAIAP---IVK-YDSVSVIDPSTD-RSGNRWTIKGLSY 264 (285) T ss_pred ceeeeeccccceeEEEEcchhhccCcC---cchhccEEEecCceecc---cee-eeeeEeECCCCC-CCcceeeeeeeee Confidence 1 13455555 44443 23343211 11222355544321111 110 1111 122222 122223333 322 Q ss_pred EEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 305 WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 305 ~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) +-+- |.+.|.=.|-.-...| T Consensus 265 ~d~f-------------------------------------------v~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 265 YDAI-------------------------------------------VLDNAKKGIYVAATAG 284 (285) T ss_pred eeee-------------------------------------------ehhhccceeeeeeccc Confidence 2111 1222222222222222 No 171 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=41.10 E-value=0.94 Score=20.73 
Aligned_cols=281 Identities=8% Similarity=-0.118 Sum_probs=104.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |-. +.+.=...++|+-+..-+.+.+.+.+-+.+ ...+ ...+|+ ..+|.-... +.+.-+.+...... T Consensus 76 ~~~--~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~------~~~v---~~~~~~-~~i~~~~~~-~~a~w~~e~~~~~~- 141 (381) T protein:vir:95 76 INK--NVNYKEEKLLPEETIDRIFEDLTTNHPLLA------DLGI---KNAGLR-LKFLKSETS-GVAVWGKIYGEIKG- 141 (381) T ss_pred Hhc--ccCCCCceecCHHHHHHHHHHHHhhcccee------heee---EecCcc-eEEEEecCC-cceeeecccccccc- Confidence 111 001112356777777767776666655533 2211 122343 466643322 22222222211100 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ....+=++.....++.+.-..++..-..-+..|-.+.+.+++++-..+..+..+| .| +......+... .. T Consensus 142 -~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i---~G---~G~~qP~Gil~---~~ 211 (381) T protein:vir:95 142 -QLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL---KG---TGKDQPIGLNR---QV 211 (381) T ss_pred -cccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeE---ec---cCCCCceeeee---cc Confidence 0011111112223333333344444444445566677888887555444433322 11 00000000000 00 Q ss_pred hhhhhcchhhcce-e-ecCcc-cchhhcccHHHHHHHHHHhc---cc----cCceeEEEEccHHHHHHHhcchhhhcccc Q lcl|Aclame:pro 161 PAEVLGTAGDMVI-D-ISGQT-NPADAVFNREAFVDAAFTMG---DH----VGSIAAIAVHSMVYKRMTNNDEIEFIPDS 230 (367) Q Consensus 161 ~a~~~~~~~~~v~-d-isa~t-~~a~~~~s~~~l~~A~~~~G---D~----~~~l~~~vmhS~v~~~L~k~~li~~~~~~ 230 (367) ... ......+. + .+..+ ........++.|.+-...+. .. ...-..++||+..+.+|+++.. 
.+++ T Consensus 212 ~~~--~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~---~~~~ 286 (381) T protein:vir:95 212 QKG--VSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNA 286 (381) T ss_pred Ccc--cccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc---cCCC Confidence 000 00000000 0 00000 00111122334433332222 11 1222467999999999887642 2344 Q ss_pred cccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Ee Q lcl|Aclame:pro 231 KGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IV 307 (367) Q Consensus 231 ~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~ 307 (367) +|...-....|..|+.++.||... -.-|.+.-|.++ +.. .+++++.....=..++..+..+.++ .+ T Consensus 287 ~G~~v~~l~~g~~vv~s~~~p~~~-iifgDfs~Y~i~-------~r~---~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~ 355 (381) T protein:vir:95 287 NGVYVTALPFNLNVIESTVQEAGK-VLTYVKGLYDGY-------LAG---GINVQKFKETLALDDMDLYTAKQFAYGKAK 355 (381) T ss_pred CCceeecCCCCceEEecCCCCcCc-EEEEecccEEEE-------Eec---ccEEEeechhHhhcCCeEEEEEEEEcCEEe Confidence 554322233578899999998421 011111112221 111 1233333322212233333333332 23 Q ss_pred eeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) |+..|. 
+..|+.+| T Consensus 356 ~~~A~~----------------------------------------------v~~l~~~~ 369 (381) T protein:vir:95 356 DNKVAA----------------------------------------------VWKLDLKG 369 (381) T ss_pred cCceEE----------------------------------------------EEEEEecC Confidence 444433 33333333 No 172 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=41.10 E-value=0.94 Score=20.73 Aligned_cols=281 Identities=8% Similarity=-0.118 Sum_probs=104.4 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccccc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~~~ 80 (367) |-. +.+.=...++|+-+..-+.+.+.+.+-+.+ ...+ ...+|+ ..+|.-... +.+.-+.+...... T Consensus 76 ~~~--~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~------~~~v---~~~~~~-~~i~~~~~~-~~a~w~~e~~~~~~- 141 (381) T protein:vir:10 76 INK--NVNYKEEKLLPEETIDRIFEDLTTNHPLLA------DLGI---KNAGLR-LKFLKSETS-GVAVWGKIYGEIKG- 141 (381) T ss_pred Hhc--ccCCCCceecCHHHHHHHHHHHHhhcccee------heee---EecCcc-eEEEEecCC-cceeeecccccccc- Confidence 111 001112356777777767776666655533 2211 122343 466643322 22222222211100 Q ss_pred cccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRV 160 (367) Q Consensus 81 t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~~~ 160 (367) ....+=++.....++.+.-..++..-..-+..|-.+.+.+++++-..+..+..+| .| +......+... .. 
T Consensus 142 -~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i---~G---~G~~qP~Gil~---~~ 211 (381) T protein:vir:10 142 -QLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL---KG---TGKDQPIGLNR---QV 211 (381) T ss_pred -cccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeE---ec---cCCCCceeeee---cc Confidence 0011111112223333333344444444445566677888887555444433322 11 00000000000 00 Q ss_pred hhhhhcchhhcce-e-ecCcc-cchhhcccHHHHHHHHHHhc---cc----cCceeEEEEccHHHHHHHhcchhhhcccc Q lcl|Aclame:pro 161 PAEVLGTAGDMVI-D-ISGQT-NPADAVFNREAFVDAAFTMG---DH----VGSIAAIAVHSMVYKRMTNNDEIEFIPDS 230 (367) Q Consensus 161 ~a~~~~~~~~~v~-d-isa~t-~~a~~~~s~~~l~~A~~~~G---D~----~~~l~~~vmhS~v~~~L~k~~li~~~~~~ 230 (367) ... ......+. + .+..+ ........++.|.+-...+. .. ...-..++||+..+.+|+++.. .+++ T Consensus 212 ~~~--~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~---~~~~ 286 (381) T protein:vir:10 212 QKG--VSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNA 286 (381) T ss_pred Ccc--cccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc---cCCC Confidence 000 00000000 0 00000 00111122334433332222 11 1222467999999999887642 2344 Q ss_pred cccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEEEccEE---Ee Q lcl|Aclame:pro 231 KGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW---IV 307 (367) Q Consensus 231 ~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~---~~ 307 (367) +|...-....|..|+.++.||... -.-|.+.-|.++ +.. 
.+++++.....=..++..+..+.++ .+ T Consensus 287 ~G~~v~~l~~g~~vv~s~~~p~~~-iifgDfs~Y~i~-------~r~---~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~ 355 (381) T protein:vir:10 287 NGVYVTALPFNLNVIESTVQEAGK-VLTYVKGLYDGY-------LAG---GINVQKFKETLALDDMDLYTAKQFAYGKAK 355 (381) T ss_pred CCceeecCCCCceEEecCCCCcCc-EEEEecccEEEE-------Eec---ccEEEeechhHhhcCCeEEEEEEEEcCEEe Confidence 554322233578899999998421 011111112221 111 1233333322212233333333332 23 Q ss_pred eeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 308 HPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 308 hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) |+..|. +..|+.+| T Consensus 356 ~~~A~~----------------------------------------------v~~l~~~~ 369 (381) T protein:vir:10 356 DNKVAA----------------------------------------------VWKLDLKG 369 (381) T ss_pred cCceEE----------------------------------------------EEEEEecC Confidence 444433 33333333 No 173 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=39.52 E-value=1 Score=20.55 Aligned_cols=315 Identities=10% Similarity=-0.042 Sum_probs=110.0 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHH----hhCCCceEEeeeeccCCCcccccCCCCc Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQF----LSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~----~~~~G~~i~~P~~~~l~g~~~~~~~~~~ 76 (367) ||. |.|+|+|..+..|+.+..+....|+....+.+....+.. -+..+..+..||-..-.+. +... 
T Consensus 1 M~~------i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~-~~~~---- 69 (348) T protein:vir:27 1 MGL------IYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNV-TIRD---- 69 (348) T ss_pred Ccc------hhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCc-ceec---- Confidence 995 678999999999997654444445544454333322211 1223334444554432211 1111 Q ss_pred cccccccccchhhhhhhhhHhhcccchhHHHHH---hhcccH--HHHHHHHHH-------HHHhhhhhHHHHHHHH-HHH Q lcl|Aclame:pro 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAE---LAGSNP--MTRIRNRFG-------VYWTRQWQRRIIAMAV-GVY 143 (367) Q Consensus 77 ~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~---~~g~DP--m~~i~~qia-------~yw~~~~q~~lla~l~-Gvf 143 (367) -..+++..-.-...+...-..+.|+-.+ .....| +..+.++++ ....++.+......|. |.+ T Consensus 70 -----r~~~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki 144 (348) T protein:vir:27 70 -----RVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKI 144 (348) T ss_pred -----ccceeeeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCee Confidence 1111111111112222333455554332 222222 223434433 2223333333333332 322 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcc- Q lcl|Aclame:pro 144 KSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNND- 222 (367) Q Consensus 144 ~~~~a~~~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~- 222 (367) .....+. . .. .+..-.+.|.+..+..=.++.+.+ ...|.+.+.++=+.+-....++|.++++..|.++. 
T Consensus 145 ~i~~~~~----~----~~-vdfg~~~~~~~t~~~~W~~~~adp-~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~ 214 (348) T protein:vir:27 145 AFTSDGV----N----KD-IDYGVKPDHKKQVSKSWAEPGATP-LADLEDAIETARELGLNPERAVMNAKTFGLIRKAAS 214 (348) T ss_pred EEecCCe----e----EE-EeecCCcccceeeeeccCCCCCCH-HHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHH Confidence 1111000 0 00 001111233322221111111111 12333333333233446778999999999999874 Q ss_pred hhhhcccccc---cc-------cchhhcCcEEEEeCCCcccCCCCCce----EEEEEEecceeeeeccCCCcceeeeeeh Q lcl|Aclame:pro 223 EIEFIPDSKG---QL-------TIPTYMGKVVIVDDGMPVFGTGADKT----YLSILFGGAAFGYADGAPQVPVAVGRRE 288 (367) Q Consensus 223 li~~~~~~~g---~~-------~i~t~~G~~VivdD~~pv~~t~~~~~----yttyl~~~GAi~~~~~~~~~~~e~~rd~ 288 (367) ..+.+....+ .+ -++++.|..|++=|.-=....|+... -+..++..|.+|.-.=++ . .| +-+. T Consensus 215 v~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~-~-~e-~~~~ 291 (348) T protein:vir:27 215 TVKVIKPLAGDGSAVTKAELENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGT-T-PE-ESDL 291 (348) T ss_pred HHHHhcccCccccccCHHHHHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEecc-C-cc-hhhh Confidence 3344433221 11 13456677666655421111121111 122333333322111011 1 11 1111 Q ss_pred hhcCCceeEEEEEccEEEeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeee-cccccceEEEEecC Q lcl|Aclame:pro 289 LRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVT-YRKNVPMAFLVTKG 367 (367) Q Consensus 289 ~~~~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~-d~K~i~iv~~~t~g 367 (367) ..+.....++-+...++ .--+|.. ..|....+...+..=-|. 
++..+-++.+.+== T Consensus 292 ~~~~~~~~~~~~~~~~~----~~~~~~~-------------------~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 292 FADNTVNAEVEIVDNGI----AVTTTKT-------------------TDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred hhccccccceeeeCCee----EEEeeec-------------------CCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 11111111111111111 1112321 112222222222211121 12222222222211 No 174 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=38.43 E-value=1.1 Score=20.43 Aligned_cols=307 Identities=13% Similarity=0.083 Sum_probs=117.6 Q ss_pred CCCccccccceeccchHHHHHHHhhhhHHhh----------hHhhcccccccHHHHHHhh---CCCceEEeeeeccCC-- Q lcl|Aclame:pro 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELT----------AFFLSGAVASNDFLSQFLS---APGRLINIPFWRDLD-- 65 (367) Q Consensus 1 Ma~~~~~T~l~d~i~PEVf~~yv~~~~~~~~----------~f~~SGi~~~~~~l~~~~~---~~G~~i~~P~~~~l~-- 65 (367) |+.-.+.|-..+..+|+. . +. ..|+ -...-|++.. ..|+..+. .+-+.+ =||++|. T Consensus 1 ~~~~~~~~~~~~~~~~~~-~----e~-~~KS~~tg~g~~p~~q~~~gAlR~-esL~~~i~~Lt~~~~~~--~~~~~i~k~ 71 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKF-Q----EE-VMKSYQTGYGITPDTQVDAGALRR-EILDDQITMLTWTQDDL--IFYREISRR 71 (462) T ss_pred Cccccccchhhhhhhchh-h----HH-HHHHHhcCCCcCCccccccchhhh-hhhhhhhheeeecccch--hhhhhcCCc Confidence 887444443333333332 1 11 1122 1111233332 44554332 343333 4587775 Q ss_pred ------------------CcccccCCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhc-ccHHHHHHHHHHHH Q lcl|Aclame:pro 66 ------------------SLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG-SNPMTRIRNRFGVY 126 (367) Q Consensus 66 ------------------g~~~~~~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g-~DPm~~i~~qia~y 126 (367) |+...+.|... 
...+.-+-.+...++++.+..=.++..+.+..+ .||| ++..|=+ T Consensus 72 ~a~sTv~~y~~~~~~G~~g~~~f~~E~g~---~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~-~~~~~da-- 145 (462) T protein:vir:96 72 PAQSTVQKYDVYLRHGNVGHSRFVREVGV---APVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPM-QILTEDA-- 145 (462) T ss_pred hhhhhhhhheeeeccCccccccccccccc---cccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHH-HHHHHHH-- Confidence 33333333321 222222223344555666666667777777666 6887 3333333 Q ss_pred HhhhhhHHHHHHHHHH-----Hhhhhhhh------hhhhhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHH Q lcl|Aclame:pro 127 WTRQWQRRIIAMAVGV-----YKSNLAGN------FATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAA 195 (367) Q Consensus 127 w~~~~q~~lla~l~Gv-----f~~~~a~~------~~~~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~ 195 (367) ++.+++-+ +++..-.+ ..|..+.. -...++|+|.-+.. ++-+.|+.|. T Consensus 146 --------i~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~-------lI~~~NViDarG~~------Ls~~~ln~aa 204 (462) T protein:vir:96 146 --------IAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAK-------LIDKDNVIDAKGES------LTETLLNRSA 204 (462) T ss_pred --------HHHHHHHHHHHHhhhhcccCCCccccccchhhhhh-------hcCCCceeecCCCC------ccHHHHhhhh Confidence 33333332 23322222 22333222 22568899976543 7888999999 Q ss_pred HHhccccCceeEEEEccHHHHHHHhcchhhh--ccccc-ccccch------------------hhcCcEEEEeCCCcccC Q lcl|Aclame:pro 196 FTMGDHVGSIAAIAVHSMVYKRMTNNDEIEF--IPDSK-GQLTIP------------------TYMGKVVIVDDGMPVFG 254 (367) Q Consensus 196 ~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~--~~~~~-g~~~i~------------------t~~G~~VivdD~~pv~~ 254 (367) .+.|-....-+-++||+.+.++|.+..|=.. +...+ +...++ ++++..-+.+-+++.. 
T Consensus 205 ~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~- 283 (462) T protein:vir:96 205 VLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPL- 283 (462) T ss_pred hhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccC- Confidence 8888888888999999999999987754221 11111 211111 1112222222111100 Q ss_pred CCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCcee-EEEEEccEEEeeeeeeeeccccccccccccccc-ccc Q lcl|Aclame:pro 255 TGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGL-EYILERKEWIVHPGGFNWLDADVTIPDNTGSPS-GIT 332 (367) Q Consensus 255 t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~-~~l~~r~~~~~hp~G~s~~~~~~~~~~~~~~~~-~~~ 332 (367) |++ ...+.+ ..+-+...++..+.+ ++ ..|-+.+...|....+...+..+..-+ ... T Consensus 284 -------------p~a----p~~~~v-saTv~t~~~g~f~~~~d~----~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~ 341 (462) T protein:vir:96 284 -------------PNA----PQPATV-KATVETGKKGLFTDEHDR----AELTYKVVVNSDDAQSAPSEAVTATVNNATD 341 (462) T ss_pred -------------CCC----CCCCce-eEEEEeCCCCCCCCccCc----eeEEEEEEEECCCCccccceeeEeeeecccc Confidence 000 000001 111111112211221 11 223333333333322211110000000 000 Q ss_pred cccC--CCChHHhcCCccceeee-c---------ccccceEEEEecC Q lcl|Aclame:pro 333 SGPP--AITLANLANPDNWERVT-Y---------RKNVPMAFLVTKG 367 (367) Q Consensus 333 ~~~~--sPt~a~L~~~~NW~~v~-d---------~K~i~iv~~~t~g 367 (367) .... +++....+. 
.-|=+|| + -|.+|.-....-| T Consensus 342 gv~ltIt~~a~~~~~-~~~~~IYRk~~~sg~y~li~rv~~~~~n~~g 387 (462) T protein:vir:96 342 GVKLEISVNAMYQQQ-PQFVSIYRQGRKTGDFYLIKRLGMKEVNDEG 387 (462) T ss_pred cceEEEEEcCCcccc-ceEEEEEeecCCccccceeeeeeceeecCCc Confidence 0000 001000000 1133333 1 1111111111111 No 175 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=21.86 E-value=2.5 Score=18.37 Aligned_cols=238 Identities=15% Similarity=0.097 Sum_probs=92.9 Q ss_pred CCCc-ccccccee---ccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCce---EEee--eeccCCCccccc Q lcl|Aclame:pro 1 MPDF-NNQVRLVD---AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRL---INIP--FWRDLDSLEPNY 71 (367) Q Consensus 1 Ma~~-~~~T~l~d---~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~---i~~P--~~~~l~g~~~~~ 71 (367) ||-. ++.=+|.| ...|.-..+-|.+.+.+.+.++ ...|-++.-...++.+ .++| -|-.|. . T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL-----~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN-~---- 70 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVL-----QDMTAIEGNLPTGHRTSVRTGLPTPTWRKLY-G---- 70 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHH-----hhcchhhccCCcccceeEEeecCCchhhhcC-C---- Confidence 7753 22223444 4555433344555555555442 2222222211223333 3455 555552 1 Q ss_pred CCCCccccccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHH---HHHHHhhhhhHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 72 GSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNR---FGVYWTRQWQRRIIAMAVGVYKSNLA 148 (367) Q Consensus 72 ~~~~~~~~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~q---ia~yw~~~~q~~lla~l~Gvf~~~~a 148 (367) .+++++-++.+......-.+ |..+=|....-..++....-++| +.+-..++.++.+|- ++... 
T Consensus 71 -------g~~~s~~tt~qvt~~l~ilg-g~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iy------GD~a~ 136 (330) T protein:vir:10 71 -------GVLPNKSSTAQVTDNCGMLE-AYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFY------GNDGI 136 (330) T ss_pred -------ccccccceEEEEEEEeEEec-chhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhcc------CCCCC Confidence 12233333333333333223 33333443333344444333333 334444445554442 11111 Q ss_pred hhhhhhhhhhhhh----------hhh---------------------------------hcchhh-cc------------ Q lcl|Aclame:pro 149 GNFATIKTRGRVP----------AEV---------------------------------LGTAGD-MV------------ 172 (367) Q Consensus 149 ~~~~~~~~~~~~~----------a~~---------------------------------~~~~~~-~v------------ 172 (367) +...|..+..... ..+ -.+.+. .+ T Consensus 137 ~p~~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~ 216 (330) T protein:vir:10 137 APAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEG 216 (330) T ss_pred ChhhccchhhhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeE Confidence 1122222111110 000 000000 00 Q ss_pred ------------------------eeecCcccchhhcccHHHHHHHHHHhccccCceeEEEEccHHHHHHHhcchhhhcc Q lcl|Aclame:pro 173 ------------------------IDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIP 228 (367) Q Consensus 173 ------------------------~disa~t~~a~~~~s~~~l~~A~~~~GD~~~~l~~~vmhS~v~~~L~k~~li~~~~ 228 (367) +|+|..+.++.+.-=.+.+++|..++=--...-.+++||-.+...|++|..- T Consensus 217 ~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~---- 292 (330) T protein:vir:10 217 YRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVD---- 292 (330) T ss_pred EeeeeeeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhh---- Confidence 1222221111000001233334433322222346799999999999998421 Q ss_pred cccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecceee Q lcl|Aclame:pro 229 DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFG 272 (367) Q Consensus 229 
~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GAi~ 272 (367) -.+-.+.+.++-|++|..-+++|+-.++ +++-.+-++. T Consensus 293 k~n~~l~~~~~~g~~~t~~~gipir~~D------ail~tE~~vv 330 (330) T protein:vir:10 293 KIANNLTWETVSGERVMTFDGIPVQRTD------ALLNTESRVV 330 (330) T ss_pred cccceeeeeecCCeeeEEECCeEEEEEe------eeecCccccC Confidence 1222345566677777777777764322 1222222221 No 176 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=21.44 E-value=2.6 Score=18.31 Aligned_cols=279 Identities=6% Similarity=-0.109 Sum_probs=103.0 Q ss_pred CCCccccccce--eccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc Q lcl|Aclame:pro 1 MPDFNNQVRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) Q Consensus 1 Ma~~~~~T~l~--d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~ 78 (367) .+ +-..+.-+ ..++|+-+..-+.+.+.+.+-+++ ...+ ...+| ...+|.-. -++.+.-+.+..... T Consensus 76 ~~-~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~------~~~v---~~~~~-~~~i~~~~-~~~~a~wv~e~~~~~ 143 (377) T protein:vir:96 76 ND-IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK------VINF---KNTSL-RLKALTAE-TSGTAVWGDIFGEIK 143 (377) T ss_pred HH-HHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhh------hcee---EecCC-ceEEEEec-CCcceeEeecccccc Confidence 00 00001112 346777766666666655554433 2211 11233 35666422 223333233322111 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHH-----HHHHHhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAM-----AVGVYKSNLAGNFAT 153 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~-----l~Gvf~~~~a~~~~~ 153 (367) ....-++. +..-..++...-..++..-..-+..|-.+.+.+++++-..+..++.++.= =.|++......... 
T Consensus 144 ~~~~~~f~--~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~- 220 (377) T protein:vir:96 144 GQLKQAFK--EQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVD- 220 (377) T ss_pred cccCccce--eEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccc- Confidence 00001111 11122222232334555544455667778888888866665555444320 00111111000000 Q ss_pred hhhhhhhhhhhhcchhhcceeecCcccchhhcccHHHHHHHHHH-------hccc----cCceeEEEEccHHHHHHHhcc Q lcl|Aclame:pro 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-------MGDH----VGSIAAIAVHSMVYKRMTNND 222 (367) Q Consensus 154 ~~~~~~~~a~~~~~~~~~v~disa~t~~a~~~~s~~~l~~A~~~-------~GD~----~~~l~~~vmhS~v~~~L~k~~ 222 (367) .........+++.....++ ...++++.+.+-... .|.. ...-..++||+.++..++.+- T Consensus 221 ---------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~ 290 (377) T protein:vir:96 221 ---------QSTGRDITTYKTDKEAIAD-LSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF 290 (377) T ss_pred ---------ccccccccceeeccccccc-cccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccc Confidence 0000001111111111111 112344444443221 1211 112246899999998775442 Q ss_pred hhhhcccccccccchhhcC--cEEEEeCCCcccCCCCCceEEEEEEecceeeeeccCCCcceeeeeehhhcCCceeEEEE Q lcl|Aclame:pro 223 EIEFIPDSKGQLTIPTYMG--KVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYIL 300 (367) Q Consensus 223 li~~~~~~~g~~~i~t~~G--~~VivdD~~pv~~t~~~~~yttyl~~~GAi~~~~~~~~~~~e~~rd~~~~~~~g~~~l~ 300 (367) + .++++|. ..+.+| .+|+.++.||... -.-|.+.-|.++ ... .++++|.....=..++..+. 
T Consensus 291 ~---~~~~~G~--~~~~l~~p~~v~~s~~~p~~~-i~fgdf~~Y~i~-------~r~---~~~i~~~~~~~~~~d~~~f~ 354 (377) T protein:vir:96 291 T---SRNQFGE--YVTVLPHGITILESLAVETGK-AIAFVANRYDAF-------MAT---ASTIEEYDQTFAMEDLQLYL 354 (377) T ss_pred c---ccCCCCC--ceeccCCCceEEecCCCCccc-EEEEEcCcEEEE-------Eec---ccEEEeehhhhhhcCCeEEE Confidence 1 2334453 334444 5688888888421 111122223222 211 23333333222122333333 Q ss_pred EccEE---EeeeeeeeecccccccccccccccccccccCCCChHHhcCCccceeeecccccceEEEEecC Q lcl|Aclame:pro 301 ERKEW---IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRKNVPMAFLVTKG 367 (367) Q Consensus 301 ~r~~~---~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L~~~~NW~~v~d~K~i~iv~~~t~g 367 (367) .+.++ .+++..+.- +. ++.| T Consensus 355 ~~~r~dG~~~d~~a~~v----------------------------------------------l~-l~~~ 377 (377) T protein:vir:96 355 TKNYFYGKAKDNHTAAL----------------------------------------------LT-LAGG 377 (377) T ss_pred EEEEEcCEEecCCcEEE----------------------------------------------EE-EecC Confidence 33222 222222221 11 1222 No 177 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=20.60 E-value=2.7 Score=18.18 Aligned_cols=293 Identities=9% Similarity=-0.069 Sum_probs=104.9 Q ss_pred CCCccccccc-eeccchHHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhCCCceEEeeeeccCCCcccccCCCCccc- Q lcl|Aclame:pro 1 MPDFNNQVRL-VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV- 78 (367) Q Consensus 1 Ma~~~~~T~l-~d~i~PEVf~~yv~~~~~~~~~f~~SGi~~~~~~l~~~~~~~G~~i~~P~~~~l~g~~~~~~~~~~~~- 78 (367) ....+..|.- ....+|+-|..-+.+.+.+.+-+++ .+.+ ...+|. ..+|.-... +.+.-..+.+... 
T Consensus 73 ~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~------~a~v---~~~~~~-~~i~~~~~~-~~a~W~~e~~~~~~ 141 (381) T protein:vir:10 73 FMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLA------DLGI---KNAGLR-LKFLKSETS-GVAVWGKIYGEIKG 141 (381) T ss_pred HHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceee------eeee---EecCcc-eEEEeecCC-cceEEeeccccccc Confidence 0001111111 1256777776666666666554432 2221 122343 456643322 2222112111100 Q ss_pred cccccccchhhhhhhhhHhhcccchhHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhhhhhhhhhh Q lcl|Aclame:pro 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) Q Consensus 79 ~~t~~kitt~~~~a~i~~r~kg~~~tDla~~~~g~DPm~~i~~qia~yw~~~~q~~lla~l~Gvf~~~~a~~~~~~~~~~ 158 (367) ..+| +++ +..-..++.+.-..++..-..-+..|-.+.+.+++++-..+..+..++ .| +......+... T Consensus 142 ~~~~-~f~--~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi---~G---dG~~qP~Gil~--- 209 (381) T protein:vir:10 142 QLDA-AFS--EETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL---KG---TGKDQPIGLNR--- 209 (381) T ss_pred ccCc-cce--eEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeE---ec---ccCCCceeeee--- Confidence 0000 111 111222223333344444444445565667788877555554443222 22 10000000000 Q ss_pred hhhhhhhcchhhcceeecCccc---chhhcccHHHH---HHHHHHhccc----cCceeEEEEccHHHHHHHhcchhhhcc Q lcl|Aclame:pro 159 RVPAEVLGTAGDMVIDISGQTN---PADAVFNREAF---VDAAFTMGDH----VGSIAAIAVHSMVYKRMTNNDEIEFIP 228 (367) Q Consensus 159 ~~~a~~~~~~~~~v~disa~t~---~a~~~~s~~~l---~~A~~~~GD~----~~~l~~~vmhS~v~~~L~k~~li~~~~ 228 (367) .... .......+.......+ .......+..+ ..+....+.. ...=..++||+..+.+|++... 
.+ T Consensus 210 ~~~~--~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~---~~ 284 (381) T protein:vir:10 210 QVQK--GVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HL 284 (381) T ss_pred cCCc--cccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccc---cC Confidence 0000 0000000000000000 00000011112 1222222211 1112357899999999887542 23 Q ss_pred cccccccchhhcCcEEEEeCCCcccCCCCCceEEEEEEecce-eeeeccCCCcceeeeeehhhcCCceeEEEEEccEE-- Q lcl|Aclame:pro 229 DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRGNGSGLEYILERKEW-- 305 (367) Q Consensus 229 ~~~g~~~i~t~~G~~VivdD~~pv~~t~~~~~yttyl~~~GA-i~~~~~~~~~~~e~~rd~~~~~~~g~~~l~~r~~~-- 305 (367) +++|...-....|.+|+.++.||-. + .+||.=. ..+.+.. .+.++|.....=..++..+..+.++ T Consensus 285 ~~~G~~v~~lp~g~~vv~~~~~p~~------~---i~fGDfs~Y~i~~r~---~~~i~~~~~~~~~~d~~~f~a~~r~dG 352 (381) T protein:vir:10 285 NANGVYVTALPFNLNVIESTVQEAG------K---VLTYVKGLYDGYLAG---GINVQKFKETLALDDMDLYTAKQFAYG 352 (381) T ss_pred CCCCceeecCCCCceeEEcCCCCcC------c---EEEEEcccEEEEEec---ccEEEeechhhhhcCceEEEEEEEEcC Confidence 4555432223358899999999842 1 2232211 1111211 2333443333222344444443333 Q ss_pred -EeeeeeeeecccccccccccccccccccccCCCChHHh Q lcl|Aclame:pro 306 -IVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) Q Consensus 306 -~~hp~G~s~~~~~~~~~~~~~~~~~~~~~~~sPt~a~L 343 (367) .+|+..+..-.-+ ..+ +.+.-+.|. +-| T Consensus 353 ~~~~~~A~~v~~l~-------~~~--~~~~~~~~~-~~~ 381 (381) T protein:vir:10 353 KAKDNKVAAVWKLD-------LKG--HKPALEDTE-ETL 381 (381) T ss_pred EEecCCcEEEEEEe-------ecC--Ccccccccc-ccC Confidence 4555555442111 111 111111111 111 Done!