Query         lcl|Aclame:protein:vir:2504|NCBI_annot:major capsid subunit gp9|genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268
Match_columns 305
No_of_seqs    124 out of 1200
Neff          9.7
Searched_HMMs 1612
Date          Sat Nov 30 06:07:40 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:2504 Length: 305 # 100.0 6.3E-74 3.9E-77 421.9 31.4 305 1-305 1-305 (305)
2 protein:vir:80684 Length: 315 100.0 7.6E-61 4.7E-64 350.2 28.7 292 1-305 1-313 (315)
3 protein:vir:105905 Length: 304 100.0 1.3E-60 8E-64 348.9 29.7 286 1-297 9-304 (304)
4 protein:vir:94142 Length: 304 100.0 1.3E-60 8E-64 348.9 29.7 286 1-297 9-304 (304)
5 protein:vir:8187 Length: 311 # 100.0 3E-60 1.8E-63 346.9 29.4 287 1-299 1-311 (311)
6 protein:vir:7771 Length: 330 # 100.0 8.7E-60 5.4E-63 344.4 29.9 297 1-305 1-329 (330)
7 protein:vir:41 Length: 299 # N 100.0 1.5E-59 9.5E-63 343.0 28.7 284 1-299 6-299 (299)
8 protein:vir:5739 Length: 366 # 100.0 1.4E-59 8.7E-63 343.2 28.3 291 1-298 64-366 (366)
9 protein:vir:104085 Length: 320 100.0 5.9E-59 3.7E-62 339.8 29.1 291 1-305 14-320 (320)
10 protein:vir:9759 Length: 303 # 100.0 8.6E-59 5.4E-62 338.9 29.3 285 1-298 1-303 (303)
11 protein:vir:97148 Length: 324 100.0 1.5E-58 9.5E-62 337.5 29.8 289 1-305 27-323 (324)
12 protein:vir:9574 Length: 300 # 100.0 1.3E-58 8.3E-62 337.9 29.4 285 1-298 1-300 (300)
13 protein:vir:1638 Length: 298 # 100.0 1.4E-58 8.4E-62 337.8 29.0 285 1-297 1-298 (298)
14 protein:vir:2344 Length: 397 # 100.0 1.8E-58 1.1E-61 337.2 28.4 288 1-305 10-317 (397)
15 protein:vir:105038 Length: 428 100.0 1.7E-58 1.1E-61 337.2 26.9 291 1-298 125-428 (428)
16 protein:vir:78830 Length: 324 100.0 6.8E-58 4.2E-61 334.0 29.6 289 1-305 27-323 (324)
17 protein:vir:96392 Length: 324 100.0 6.8E-58 4.2E-61 334.0 29.6 289 1-305 27-323 (324)
18 protein:vir:80376 Length: 435 100.0 3.2E-58 2E-61 335.8 27.5 291 1-300 132-435 (435)
19 protein:vir:103955 Length: 324 100.0 8.9E-58 5.5E-61 333.4 29.7 289 1-305 27-323 (324)
20 protein:vir:1433 Length: 435 # 100.0 2.4E-58 1.5E-61 336.5 26.4 291 1-300 132-435 (435)
21 protein:vir:78223 Length: 333 100.0 9.9E-58 6.1E-61 333.1 29.7 298 1-299 10-333 (333)
22 protein:vir:485 Length: 407 # 100.0 5.1E-58 3.2E-61 334.7 27.7 285 1-305 106-407 (407)
23 protein:vir:99749 Length: 324 100.0 1.2E-57 7.4E-61 332.7 29.6 289 1-305 27-323 (324)
24 protein:vir:9309 Length: 324 # 100.0 1.6E-57 1E-60 331.9 29.8 289 1-305 27-323 (324)
25 protein:vir:4226 Length: 326 # 100.0 1.6E-57 9.8E-61 332.0 28.8 290 1-301 20-326 (326)
26 protein:vir:78523 Length: 338 100.0 2.5E-57 1.5E-60 330.9 29.5 300 1-305 10-338 (338)
27 protein:vir:2430 Length: 318 # 100.0 2.6E-57 1.6E-60 330.8 28.9 289 1-303 14-318 (318)
28 protein:vir:96223 Length: 324 100.0 7.5E-57 4.7E-60 328.3 29.6 289 1-305 27-323 (324)
29 protein:vir:95763 Length: 297 100.0 6.8E-57 4.2E-60 328.5 29.3 282 1-301 9-297 (297)
30 protein:vir:94771 Length: 298 100.0 5.9E-57 3.6E-60 328.9 28.7 285 1-297 1-298 (298)
31 protein:vir:99920 Length: 311 100.0 2E-56 1.3E-59 325.9 28.6 286 1-298 1-311 (311)
32 protein:vir:100247 Length: 425 100.0 1.9E-56 1.2E-59 326.0 27.1 279 1-299 130-425 (425)
33 protein:vir:4456 Length: 401 # 100.0 2.4E-56 1.5E-59 325.6 26.6 275 1-298 107-401 (401)
34 protein:vir:101650 Length: 497 100.0 6.8E-56 4.2E-59 323.0 26.5 281 1-305 151-497 (497)
35 protein:vir:7855 Length: 497 # 100.0 6.8E-56 4.2E-59 323.0 26.5 281 1-305 151-497 (497)
36 protein:vir:6242 Length: 390 # 100.0 2.7E-55 1.6E-58 319.8 26.9 271 1-299 110-390 (390)
37 protein:vir:104256 Length: 458 100.0 6.7E-55 4.1E-58 317.6 27.5 283 1-298 162-458 (458)
38 protein:vir:93616 Length: 645 100.0 1.1E-54 6.9E-58 316.4 27.3 284 1-304 338-645 (645)
39 protein:vir:8102 Length: 543 # 100.0 8.9E-55 5.5E-58 316.9 26.2 280 1-299 250-543 (543)
40 protein:vir:1328 Length: 392 # 100.0 1.9E-54 1.2E-57 315.1 27.3 273 1-299 110-392 (392)
41 protein:vir:95376 Length: 425 100.0 1.5E-54 9.3E-58 315.7 26.4 276 1-304 138-425 (425)
42 protein:vir:97053 Length: 390 100.0 6.4E-54 4E-57 312.2 27.1 270 1-296 113-390 (390)
43 protein:vir:100135 Length: 418 100.0 7.4E-54 4.6E-57 311.9 27.4 276 1-303 135-418 (418)
44 protein:vir:4511 Length: 409 # 100.0 7.5E-54 4.7E-57 311.8 27.2 280 1-304 117-409 (409)
45 protein:vir:4339 Length: 395 # 100.0 1.3E-53 7.9E-57 310.6 27.2 275 1-298 113-395 (395)
46 protein:vir:4997 Length: 397 # 100.0 1.2E-53 7.4E-57 310.7 26.9 269 1-305 109-392 (397)
47 protein:vir:1025 Length: 408 # 100.0 1.9E-53 1.2E-56 309.6 27.8 270 1-305 116-400 (408)
48 protein:vir:81070 Length: 390 100.0 1.6E-53 1E-56 310.0 27.2 270 1-296 113-390 (390)
49 protein:vir:10364 Length: 390 100.0 3.1E-53 1.9E-56 308.4 27.8 270 1-296 113-390 (390)
50 protein:vir:3845 Length: 395 # 100.0 3.1E-53 1.9E-56 308.5 27.2 270 1-305 107-390 (395)
51 protein:vir:102082 Length: 392 100.0 5.1E-53 3.1E-56 307.3 27.8 269 1-305 106-391 (392)
52 protein:vir:107593 Length: 392 100.0 5.1E-53 3.1E-56 307.3 27.8 269 1-305 106-391 (392)
53 protein:vir:105004 Length: 392 100.0 5.1E-53 3.1E-56 307.3 27.8 269 1-305 106-391 (392)
54 protein:vir:102873 Length: 392 100.0 5.1E-53 3.1E-56 307.3 27.8 269 1-305 106-391 (392)
55 protein:vir:1886 Length: 385 # 100.0 3.8E-53 2.3E-56 308.0 26.7 273 1-299 105-385 (385)
56 protein:vir:191 Length: 385 # 100.0 3.8E-53 2.3E-56 308.0 26.7 273 1-299 105-385 (385)
57 protein:vir:4953 Length: 397 # 100.0 4.6E-53 2.9E-56 307.5 26.9 269 1-305 109-392 (397)
58 protein:vir:102119 Length: 404 100.0 1.3E-52 7.9E-56 305.1 28.2 283 1-304 110-404 (404)
59 protein:vir:81160 Length: 371 100.0 1.1E-52 7E-56 305.4 27.6 262 1-298 91-371 (371)
60 protein:vir:4856 Length: 293 # 100.0 1.2E-52 7.3E-56 305.3 27.3 268 1-305 5-287 (293)
61 protein:vir:4092 Length: 390 # 100.0 1.1E-52 7E-56 305.4 26.5 282 1-305 84-374 (390)
62 protein:vir:4830 Length: 397 # 100.0 1.7E-52 1.1E-55 304.4 27.2 269 1-305 109-392 (397)
63 protein:vir:7409 Length: 408 # 100.0 2.5E-52 1.6E-55 303.5 27.6 270 1-305 116-400 (408)
64 protein:vir:1268 Length: 397 # 100.0 1.9E-52 1.2E-55 304.1 26.9 262 1-298 123-397 (397)
65 protein:vir:81227 Length: 413 100.0 2.5E-52 1.5E-55 303.5 27.2 275 1-303 118-413 (413)
66 protein:vir:4700 Length: 415 # 100.0 4.8E-52 3E-55 301.9 28.7 278 1-305 121-410 (415)
67 protein:vir:4600 Length: 415 # 100.0 4.8E-52 3E-55 301.9 28.7 278 1-305 121-410 (415)
68 protein:vir:96762 Length: 632 100.0 1.3E-52 7.8E-56 305.1 24.8 272 1-297 357-632 (632)
69 protein:vir:3991 Length: 404 # 100.0 5.2E-52 3.2E-55 301.7 27.6 270 1-305 116-400 (404)
70 protein:vir:6212 Length: 434 # 100.0 3.3E-52 2.1E-55 302.8 26.2 280 1-304 141-434 (434)
71 protein:vir:98635 Length: 377 100.0 3.8E-53 2.4E-56 308.0 20.8 276 1-298 79-377 (377)
72 protein:vir:79987 Length: 415 100.0 1.9E-51 1.2E-54 298.6 28.6 279 1-305 121-410 (415)
73 protein:vir:81100 Length: 415 100.0 1.9E-51 1.2E-54 298.6 28.6 279 1-305 121-410 (415)
74 protein:vir:98339 Length: 415 100.0 1.9E-51 1.2E-54 298.6 28.6 279 1-305 121-410 (415)
75 protein:vir:101607 Length: 379 100.0 9.1E-52 5.7E-55 300.4 26.2 262 1-298 106-379 (379)
76 protein:vir:9410 Length: 415 # 100.0 5E-51 3.1E-54 296.4 28.2 279 1-305 121-410 (415)
77 protein:vir:1383 Length: 421 # 100.0 8.6E-51 5.4E-54 295.1 26.0 267 1-305 114-395 (421)
78 protein:vir:100632 Length: 381 100.0 3.7E-51 2.3E-54 297.1 23.9 279 1-305 76-373 (381)
79 protein:vir:9509 Length: 381 # 100.0 4.4E-51 2.7E-54 296.7 24.0 281 1-305 76-377 (381)
80 protein:vir:101291 Length: 381 100.0 4.4E-51 2.7E-54 296.7 24.0 281 1-305 76-377 (381)
81 protein:vir:100884 Length: 389 100.0 2.1E-50 1.3E-53 292.9 27.7 266 1-305 109-389 (389)
82 protein:vir:100172 Length: 394 100.0 5.3E-50 3.3E-53 290.7 28.3 266 1-305 111-391 (394)
83 protein:vir:8420 Length: 477 # 100.0 6.3E-50 3.9E-53 290.3 25.7 291 1-304 156-477 (477)
84 protein:vir:94673 Length: 419 100.0 2.1E-49 1.3E-52 287.5 26.5 279 1-302 123-419 (419)
85 protein:vir:3870 Length: 400 # 100.0 1.6E-49 1E-52 288.1 25.6 257 1-299 133-400 (400)
86 protein:vir:9704 Length: 394 # 100.0 7.2E-49 4.4E-52 284.5 26.3 258 1-304 128-394 (394)
87 protein:vir:95963 Length: 395 100.0 6.7E-49 4.2E-52 284.7 25.7 279 1-305 86-388 (395)
88 protein:vir:9643 Length: 377 # 100.0 5.5E-49 3.4E-52 285.2 25.0 272 1-298 79-377 (377)
89 protein:vir:1084 Length: 437 # 100.0 5.2E-49 3.2E-52 285.3 24.5 267 1-305 156-435 (437)
90 protein:vir:78640 Length: 352 100.0 1.2E-48 7.4E-52 283.3 22.9 266 1-304 83-352 (352)
91 protein:vir:78350 Length: 383 100.0 8.1E-48 5E-51 278.8 23.6 279 1-305 83-383 (383)
92 protein:vir:9361 Length: 402 # 100.0 3.5E-48 2.2E-51 280.7 20.7 266 1-304 133-402 (402)
93 protein:vir:80128 Length: 466 100.0 1.2E-47 7.6E-51 277.8 22.5 282 1-305 148-456 (466)
94 protein:vir:93881 Length: 387 100.0 3.5E-47 2.2E-50 275.2 23.1 266 1-304 118-387 (387)
95 protein:vir:2685 Length: 387 # 100.0 2.6E-47 1.6E-50 276.0 21.6 266 1-304 118-387 (387)
96 protein:vir:96978 Length: 387 100.0 2.6E-47 1.6E-50 276.0 21.6 266 1-304 118-387 (387)
97 protein:vir:94424 Length: 387 100.0 2.6E-47 1.6E-50 276.0 21.6 266 1-304 118-387 (387)
98 protein:vir:962 Length: 397 # 100.0 6.2E-46 3.8E-49 268.4 23.5 255 1-298 132-397 (397)
99 protein:vir:4197 Length: 314 # 100.0 5.5E-39 3.4E-42 230.3 24.4 281 1-301 14-314 (314)
100 protein:vir:4159 Length: 315 # 100.0 1.9E-38 1.2E-41 227.4 23.0 274 1-295 19-315 (315)
101 protein:vir:3158 Length: 321 # 100.0 7.9E-36 4.9E-39 213.0 24.9 286 1-305 18-320 (321)
102 protein:vir:97397 Length: 517 100.0 3.4E-35 2.1E-38 209.6 20.8 272 1-303 239-517 (517)
103 protein:vir:3033 Length: 272 # 100.0 4.7E-32 2.9E-35 192.3 23.4 258 1-301 1-272 (272)
104 protein:vir:9820 Length: 272 # 100.0 4.7E-32 2.9E-35 192.3 23.4 258 1-301 1-272 (272)
105 protein:vir:4074 Length: 480 # 99.9 1.8E-30 1.1E-33 183.7 12.7 260 1-301 210-480 (480)
106 protein:vir:3613 Length: 272 # 99.9 3.1E-24 1.9E-27 149.5 19.5 258 1-298 1-272 (272)
107 protein:vir:94933 Length: 330 99.9 5E-23 3.1E-26 142.8 21.1 277 1-299 25-330 (330)
108 protein:vir:93742 Length: 274 99.9 7.3E-23 4.6E-26 141.9 21.5 260 1-303 1-274 (274)
109 protein:vir:80930 Length: 278 99.8 6.2E-22 3.8E-25 136.8 21.1 262 1-299 1-278 (278)
110 protein:vir:105334 Length: 276 99.8 5.6E-22 3.5E-25 137.1 20.2 262 1-305 1-276 (276)
111 protein:vir:96123 Length: 274 99.8 2.6E-21 1.6E-24 133.5 21.2 260 1-302 1-274 (274)
112 protein:vir:96833 Length: 275 99.8 2.9E-21 1.8E-24 133.2 19.7 260 1-302 1-275 (275)
113 protein:vir:79928 Length: 393 99.8 2E-21 1.2E-24 134.1 17.4 292 1-305 59-387 (393)
114 protein:vir:97433 Length: 274 99.8 1.5E-20 9.3E-24 129.3 21.9 260 1-302 1-274 (274)
115 protein:vir:94494 Length: 274 99.8 1.5E-20 9.3E-24 129.3 21.9 260 1-302 1-274 (274)
116 protein:vir:1239 Length: 274 # 99.8 1.5E-19 9E-23 123.8 20.4 260 1-303 1-274 (274)
117 protein:vir:95107 Length: 270 99.8 9.6E-20 5.9E-23 124.8 18.9 259 1-303 1-270 (270)
118 protein:vir:97255 Length: 310 99.7 7.3E-19 4.5E-22 120.0 22.7 278 1-298 1-310 (310)
119 protein:vir:95898 Length: 274 99.7 9E-19 5.6E-22 119.5 21.2 260 1-303 1-274 (274)
120 protein:vir:96262 Length: 274 99.7 9E-19 5.6E-22 119.5 21.2 260 1-303 1-274 (274)
121 protein:vir:739 Length: 231 # 99.6 1E-16 6.5E-20 108.2 16.0 222 35-298 1-231 (231)
122 protein:vir:99424 Length: 360 99.5 5E-15 3.1E-18 99.0 19.5 283 1-305 20-360 (360)
123 protein:vir:7990 Length: 273 # 99.4 1.1E-13 6.8E-17 91.6 18.9 255 1-298 1-273 (273)
124 protein:vir:108211 Length: 318 99.4 5.1E-14 3.1E-17 93.5 16.4 280 1-303 1-318 (318)
125 protein:vir:102605 Length: 273 99.3 3.3E-13 2E-16 89.0 19.5 255 1-298 1-273 (273)
126 protein:vir:105822 Length: 273 99.3 3.3E-13 2E-16 89.0 19.5 255 1-298 1-273 (273)
127 protein:vir:6324 Length: 335 # 99.2 4.8E-12 3E-15 82.6 18.9 281 1-305 1-335 (335)
128 protein:vir:100057 Length: 375 99.2 1.8E-11 1.1E-14 79.5 21.8 289 1-305 1-374 (375)
129 protein:vir:5974 Length: 324 # 99.2 6.9E-12 4.3E-15 81.8 18.9 269 1-305 1-295 (324)
130 protein:vir:8324 Length: 410 # 99.2 1.1E-12 6.9E-16 86.1 14.5 260 1-296 131-410 (410)
131 protein:vir:94622 Length: 341 99.2 1.6E-12 9.9E-16 85.3 15.0 286 1-300 1-341 (341)
132 protein:vir:102944 Length: 330 99.2 9.2E-12 5.7E-15 81.1 19.1 278 1-305 1-301 (330)
133 protein:vir:78935 Length: 335 99.1 1.4E-11 8.9E-15 80.0 18.3 281 1-305 1-335 (335)
134 protein:vir:10450 Length: 344 99.1 7.9E-12 4.9E-15 81.4 16.7 280 1-298 1-344 (344)
135 protein:vir:80213 Length: 334 99.1 1.6E-11 9.8E-15 79.8 17.7 283 1-300 1-334 (334)
136 protein:vir:94711 Length: 347 99.1 9.1E-12 5.6E-15 81.1 15.2 279 1-299 1-347 (347)
137 protein:vir:94576 Length: 347 99.1 3.7E-11 2.3E-14 77.8 18.3 282 1-298 1-347 (347)
138 protein:vir:8885 Length: 347 # 99.1 1.5E-11 9E-15 80.0 15.7 283 1-299 1-347 (347)
139 protein:vir:2201 Length: 345 # 99.1 4.6E-11 2.9E-14 77.2 18.3 279 1-298 1-345 (345)
140 protein:vir:1583 Length: 351 # 99.0 5.5E-11 3.4E-14 76.8 17.7 274 1-305 1-336 (351)
141 protein:vir:103323 Length: 364 99.0 5.6E-10 3.5E-13 71.3 22.9 284 1-305 1-344 (364)
142 protein:vir:78739 Length: 332 99.0 1.1E-10 6.6E-14 75.2 15.9 273 1-296 7-332 (332)
143 protein:vir:3364 Length: 347 # 98.9 4.4E-10 2.7E-13 71.9 17.8 282 1-300 1-347 (347)
144 protein:vir:80180 Length: 381 98.9 4.7E-10 2.9E-13 71.7 17.0 281 1-305 1-336 (381)
145 protein:vir:95318 Length: 328 98.9 7.8E-10 4.8E-13 70.5 17.3 225 1-232 1-328 (328)
146 protein:vir:1541 Length: 347 # 98.8 2.6E-09 1.6E-12 67.7 18.9 281 1-300 1-347 (347)
147 protein:vir:93858 Length: 400 98.7 6E-10 3.7E-13 71.1 13.5 267 1-296 117-400 (400)
148 protein:vir:9927 Length: 295 # 98.7 2.7E-09 1.7E-12 67.6 16.6 266 1-305 1-293 (295)
149 protein:vir:99675 Length: 324 98.7 2.3E-09 1.4E-12 67.9 15.7 251 34-305 1-317 (324)
150 protein:vir:97031 Length: 402 98.6 5.2E-09 3.2E-12 66.0 15.6 293 1-305 1-342 (402)
151 protein:vir:3136 Length: 322 # 98.6 6.8E-09 4.2E-12 65.4 15.9 286 1-303 1-322 (322)
152 protein:vir:103285 Length: 296 98.6 3.1E-08 2E-11 61.7 17.7 274 1-299 1-296 (296)
153 protein:vir:102655 Length: 322 98.6 1.8E-08 1.1E-11 63.1 16.2 281 1-299 1-322 (322)
154 protein:vir:9875 Length: 296 # 98.5 2E-08 1.2E-11 62.8 16.4 269 1-299 1-296 (296)
155 protein:vir:103759 Length: 330 98.5 1.3E-08 8.1E-12 63.8 15.4 225 1-232 1-330 (330)
156 protein:vir:7019 Length: 401 # 98.5 1.3E-08 8.1E-12 63.8 15.2 292 1-305 1-339 (401)
157 protein:vir:105645 Length: 400 98.5 3.4E-08 2.1E-11 61.5 16.8 292 1-305 1-344 (400)
158 protein:vir:98525 Length: 331 98.5 3.9E-08 2.4E-11 61.2 17.0 225 1-232 1-331 (331)
159 protein:vir:107826 Length: 331 98.5 3.9E-08 2.4E-11 61.2 17.0 225 1-232 1-331 (331)
160 protein:vir:107388 Length: 331 98.5 3.9E-08 2.4E-11 61.2 17.0 225 1-232 1-331 (331)
161 protein:vir:107687 Length: 319 98.5 6.4E-08 4E-11 60.0 18.0 273 1-296 21-319 (319)
162 protein:vir:80068 Length: 301 98.4 1.7E-07 1.1E-10 57.6 18.0 270 3-296 1-301 (301)
163 protein:vir:104342 Length: 314 98.3 9.2E-08 5.7E-11 59.2 15.4 274 1-299 19-314 (314)
164 protein:vir:79642 Length: 329 98.3 2.3E-07 1.4E-10 57.0 17.5 276 1-299 26-329 (329)
165 protein:vir:106647 Length: 303 98.3 7.9E-08 4.9E-11 59.5 14.4 273 1-304 1-303 (303)
166 protein:vir:7324 Length: 335 # 98.3 1.2E-07 7.6E-11 58.5 15.4 226 1-231 1-335 (335)
167 protein:vir:79548 Length: 652 98.2 6E-07 3.8E-10 54.7 16.3 274 1-295 359-652 (652)
168 protein:vir:8843 Length: 317 # 98.0 3E-06 1.9E-09 50.8 17.6 275 1-300 1-317 (317)
169 protein:vir:99075 Length: 392 98.0 5.4E-06 3.3E-09 49.5 17.7 275 1-305 1-315 (392)
170 protein:vir:95512 Length: 693 97.9 3.4E-06 2.1E-09 50.6 16.1 277 1-296 394-693 (693)
171 protein:vir:80446 Length: 367 97.8 1.2E-05 7.4E-09 47.6 16.8 287 1-305 1-330 (367)
172 protein:vir:94070 Length: 339 97.7 8.1E-06 5E-09 48.5 15.5 278 1-296 46-339 (339)
173 protein:vir:94989 Length: 349 97.5 5.3E-05 3.3E-08 44.0 19.4 281 1-305 1-327 (349)
174 protein:vir:78387 Length: 349 97.5 5.8E-05 3.6E-08 43.8 18.7 274 1-305 1-327 (349)
175 protein:vir:101557 Length: 336 97.1 1.7E-05 1E-08 46.8 10.6 274 1-296 34-336 (336)
176 protein:vir:95131 Length: 325 97.1 0.00016 1E-07 41.3 18.4 274 1-305 1-298 (325)
177 protein:vir:96792 Length: 315 97.1 0.00017 1E-07 41.3 16.1 264 1-305 1-285 (315)
178 protein:vir:98566 Length: 355 97.0 0.00022 1.4E-07 40.6 18.3 287 1-305 16-353 (355)
179 protein:vir:3643 Length: 336 # 97.0 2E-05 1.3E-08 46.3 9.8 274 1-296 34-336 (336)
180 protein:vir:1153 Length: 338 # 96.9 0.00026 1.6E-07 40.3 17.5 282 1-300 16-338 (338)
181 protein:vir:108303 Length: 418 96.9 0.0003 1.8E-07 39.9 19.4 268 1-305 1-325 (418)
182 protein:vir:78558 Length: 336 96.8 0.00011 6.8E-08 42.3 12.7 273 1-296 31-336 (336)
183 protein:vir:79157 Length: 339 96.8 0.00032 2E-07 39.7 17.3 283 1-305 16-339 (339)
184 protein:vir:5255 Length: 304 # 96.7 0.00039 2.4E-07 39.3 14.6 271 1-295 1-304 (304)
185 protein:vir:1829 Length: 355 # 96.6 0.00046 2.9E-07 38.9 18.0 287 1-305 16-353 (355)
186 protein:vir:78777 Length: 358 96.5 0.00055 3.4E-07 38.5 16.1 282 1-305 20-352 (358)
187 protein:vir:103886 Length: 302 96.4 0.00061 3.8E-07 38.2 17.1 272 1-298 1-302 (302)
188 protein:vir:107732 Length: 379 96.3 0.00068 4.2E-07 37.9 14.1 277 1-296 56-379 (379)
189 protein:vir:104011 Length: 337 96.2 0.00087 5.4E-07 37.4 19.0 281 1-304 16-337 (337)
190 protein:vir:79171 Length: 337 96.2 0.00094 5.8E-07 37.2 19.0 281 1-304 16-337 (337)
191 protein:vir:106734 Length: 336 96.0 0.00045 2.8E-07 38.9 11.6 273 1-296 31-336 (336)
192 protein:vir:96079 Length: 382 96.0 0.00091 5.7E-07 37.3 13.0 278 1-296 63-382 (382)
193 protein:vir:5694 Length: 357 # 95.9 0.0013 8.3E-07 36.3 16.2 287 1-305 16-357 (357)
194 protein:vir:78186 Length: 337 95.8 0.0014 8.9E-07 36.2 17.0 281 1-304 16-337 (337)
195 protein:vir:6061 Length: 357 # 95.7 0.0016 9.9E-07 35.9 16.5 287 1-305 16-350 (357)
196 protein:vir:100331 Length: 342 95.5 0.002 1.2E-06 35.4 17.4 282 1-304 16-342 (342)
197 protein:vir:98856 Length: 343 95.1 0.0027 1.7E-06 34.7 17.3 282 1-305 16-341 (343)
198 protein:vir:3746 Length: 336 # 95.0 0.0029 1.8E-06 34.5 17.6 280 1-305 13-335 (336)
199 protein:vir:3783 Length: 336 # 95.0 0.003 1.9E-06 34.4 17.4 280 1-305 13-335 (336)
200 protein:vir:2016 Length: 357 # 94.8 0.0036 2.2E-06 34.0 16.4 287 1-305 16-350 (357)
201 protein:vir:1781 Length: 221 # 94.5 0.0042 2.6E-06 33.6 12.2 184 88-305 1-209 (221)
202 protein:vir:348 Length: 321 # 94.3 0.0047 2.9E-06 33.3 14.5 280 1-296 1-321 (321)
203 protein:vir:270 Length: 341 # 94.0 0.0058 3.6E-06 32.8 16.2 278 1-305 20-332 (341)
204 protein:vir:99576 Length: 388 94.0 0.0058 3.6E-06 32.8 13.0 284 1-296 72-388 (388)
205 protein:vir:3525 Length: 423 # 93.4 0.0078 4.8E-06 32.1 18.5 271 1-305 1-335 (423)
206 protein:vir:95875 Length: 401 91.9 0.014 8.5E-06 30.8 16.3 291 1-301 9-401 (401)
207 protein:vir:105374 Length: 423 91.0 0.018 1.1E-05 30.1 20.5 271 1-305 1-335 (423)
208 protein:vir:861 Length: 318 # 89.3 0.0058 3.6E-06 32.9 6.4 270 1-296 35-318 (318)
209 protein:vir:105522 Length: 423 89.0 0.029 1.8E-05 29.0 20.5 278 1-305 1-335 (423)
210 protein:vir:1663 Length: 393 # 88.1 0.0061 3.8E-06 32.7 5.7 270 1-296 110-393 (393)
211 protein:vir:174 Length: 423 # 87.7 0.037 2.3E-05 28.4 18.3 274 1-305 1-317 (423)
212 protein:vir:107120 Length: 329 86.5 0.045 2.8E-05 28.0 22.6 269 1-305 30-314 (329)
213 protein:vir:93966 Length: 400 86.2 0.011 7.1E-06 31.2 6.1 270 1-296 117-400 (400)
214 protein:vir:80835 Length: 464 83.6 0.067 4.2E-05 27.0 9.9 273 1-305 22-335 (464)
215 protein:vir:106286 Length: 534 82.6 0.076 4.7E-05 26.7 17.5 286 1-305 87-523 (534)
216 protein:vir:96442 Length: 418 82.5 0.076 4.7E-05 26.7 17.2 293 1-305 69-418 (418)
217 protein:vir:80986 Length: 528 81.6 0.084 5.2E-05 26.5 15.1 277 1-305 174-508 (528)
218 protein:vir:96666 Length: 462 81.5 0.085 5.3E-05 26.5 13.4 275 1-305 26-338 (462)
219 protein:vir:99311 Length: 463 81.5 0.085 5.3E-05 26.4 13.5 283 1-305 26-338 (463)
220 protein:vir:95603 Length: 463 81.5 0.085 5.3E-05 26.4 13.5 283 1-305 26-338 (463)
221 protein:vir:5942 Length: 523 # 79.7 0.1 6.3E-05 26.0 13.1 268 1-303 219-523 (523)
222 protein:vir:103463 Length: 521 78.7 0.11 6.9E-05 25.8 17.7 287 1-305 79-500 (521)
223 protein:vir:97331 Length: 319 77.1 0.13 8E-05 25.5 21.7 268 1-305 1-301 (319)
224 protein:vir:94800 Length: 319 77.1 0.13 8E-05 25.5 21.7 268 1-305 1-301 (319)
225 protein:vir:6901 Length: 522 # 76.9 0.13 8.1E-05 25.4 17.8 286 1-305 80-510 (522)
226 protein:vir:95451 Length: 313 73.7 0.17 0.0001 24.8 15.9 273 1-300 1-313 (313)
227 protein:vir:104915 Length: 470 69.1 0.23 0.00014 24.1 16.7 282 1-305 69-457 (470)
228 protein:vir:80491 Length: 467 64.4 0.3 0.00019 23.4 12.1 281 1-305 31-331 (467)
229 protein:vir:100603 Length: 529 63.6 0.31 0.00019 23.3 16.1 287 1-305 79-515 (529)
230 protein:vir:98143 Length: 524 63.5 0.32 0.0002 23.3 14.4 281 1-305 159-504 (524)
231 protein:vir:63741 Length: 468 62.9 0.33 0.0002 23.3 11.9 281 1-305 32-332 (468)
232 protein:vir:94870 Length: 318 62.0 0.34 0.00021 23.1 8.2 270 1-296 35-318 (318)
233 protein:vir:101811 Length: 529 59.3 0.4 0.00025 22.8 17.3 287 1-305 79-518 (529)
234 protein:vir:6601 Length: 528 # 54.7 0.5 0.00031 22.2 18.5 284 1-305 78-508 (528)
235 protein:vir:103370 Length: 418 54.1 0.51 0.00032 22.2 15.8 293 1-305 69-417 (418)
236 protein:vir:101039 Length: 529 52.2 0.56 0.00035 22.0 17.4 287 1-305 79-518 (529)
237 protein:vir:79008 Length: 299 51.0 0.59 0.00037 21.8 21.0 265 1-300 1-299 (299)
238 protein:vir:100851 Length: 514 47.7 0.69 0.00043 21.5 10.1 255 1-305 45-319 (514)
239 protein:vir:106998 Length: 468 44.0 0.82 0.00051 21.0 17.8 283 1-305 63-455 (468)
240 protein:vir:96490 Length: 348 42.1 0.9 0.00056 20.8 12.7 269 1-305 1-338 (348)
241 protein:vir:103181 Length: 457 40.7 0.96 0.0006 20.7 17.6 281 1-305 59-445 (457)
242 protein:vir:7214 Length: 521 # 40.4 0.97 0.0006 20.6 17.8 288 1-305 79-508 (521)
243 protein:vir:107947 Length: 519 36.5 1.2 0.00073 20.2 15.9 286 1-305 77-498 (519)
244
protein:vir:93696 Length: 364 36.3 1.2 0.00073 20.2 15.2 288 1-305 1-364 (364) 245 protein:vir:102823 Length: 470 35.6 1.2 0.00076 20.1 10.4 262 1-305 18-303 (470) 246 protein:vir:5670 Length: 514 # 33.2 1.4 0.00085 19.8 17.6 287 1-305 76-497 (514) 247 protein:vir:104549 Length: 462 29.9 1.6 0.001 19.4 12.3 284 1-305 107-449 (462) 248 protein:vir:99888 Length: 309 29.0 1.7 0.0011 19.3 12.8 281 1-299 1-309 (309) 249 protein:vir:1991 Length: 305 # 20.7 2.7 0.0017 18.2 11.5 229 1-305 1-268 (305) No 1 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=6.3e-74 Score=421.91 Aligned_cols=305 Identities=100% Similarity=1.344 Sum_probs=293.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ||++++++||.+||+++.++|++.+++.++|+++++++++.++++++|+.++.+.+.|++|++..++++++.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) ++++||++++++||+|+++|+.++++++|.++|++++++++|++||+|+|++.+.++.++.+.................+ T 
Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchhh Confidence 99999999999999999999999999999999999999999999999999999888888888887777777777777788 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecccccCccceEecCccccCCCCceEEEEehhhEE Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVK 240 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~ 240 (305) .++.+.+..+...+...++.++.|+||+.++..|+++||++|+|+|++++++|+|++++++++.+.++++++||||++|+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~ 240 (305) T protein:vir:25 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVK 240 (305) T ss_pred hHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCCcccccceEEcCccCCCCCccEEEEEecceEE Confidence 88888898888888888888899999999999999999999999999999999999999999988889999999999999 Q ss_pred EEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 241 IGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 241 ~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) +++++++++++++++++.+++..+++|++|++++|++.|+||.|.||+++++++++|+++|+||| T Consensus 241 i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 241 IGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred EEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: 
genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=7.6e-61 Score=350.17 Aligned_cols=292 Identities=15% Similarity=0.181 Sum_probs=243.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ||++++++||+++|+++.++|++.+++.++++++|+++++.++.+++|++++.+.++|++|++. +++++++|+++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~-----~~~s~~~f~~v 75 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEV-----KPSASVDVSAF 75 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCcc-----ccccccceeee Confidence 9999999999999999999999999999999999999999999999999999999999999975 67788999999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHH----HHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVA----VLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~----~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ++++||++++++||+|+++++..+ ++++|.++|++++++++|+++|+|+|.+.+..+.++........+.+.. 
+ T Consensus 76 ~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~--~ 153 (315) T protein:vir:80 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDA--T 153 (315) T ss_pred EeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeec--c Confidence 999999999999999999988765 7899999999999999999999999876665555555544333332222 2 Q ss_pred chhhhHHHHHHHHHHHHhhh-ccccceEEEEchHHHHHHHHhhccCCc-----eeec------ccccCccceEecCcccc Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVAS-AGWAPDTLLSSLALRYEVANIRDANGN-----PVFR------DDSFAGFRTFFNRNGAW 224 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~l~~~kd~~G~-----~l~~------~~~l~G~pv~~~~~~~~ 224 (305) ...+.++. +++..+.. .+...++|+||+.++..|+++||.+|+ ++|. +.+++|+||+++++++. T Consensus 154 ~~~~~d~~----~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~ 229 (315) T protein:vir:80 154 DSATADLV----KAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSG 229 (315) T ss_pred ccchHHHH----HHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCc Confidence 22333433 34444433 334557899999999999999877665 4553 24799999999999975 Q ss_pred CC-----CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 225 DA-----DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 225 ~~-----~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) .. ++..++||||++++++.+++++++++++.+ .+...+++|++|++++|++.|+||+|.||++|++++.+.+. 
T Consensus 230 ~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~--~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~ 307 (315) T protein:vir:80 230 APEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGD--PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) T ss_pred ccccccccccEEEEeecccEEEEEecCeeEEEecccc--ccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCC Confidence 43 345789999999999999999999998764 34456789999999999999999999999999999999888 Q ss_pred cccCCC Q lcl|Aclame:pro 300 VVAPAA 305 (305) Q Consensus 300 ~v~~a~ 305 (305) ..+|.| T Consensus 308 ~~~~~~ 313 (315) T protein:vir:80 308 KPNPPA 313 (315) T ss_pred CCCCCC Confidence 889999 No 3 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=1.3e-60 Score=348.92 Aligned_cols=286 Identities=24% Similarity=0.332 Sum_probs=247.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+++++||.+||+++.++|++.+++.++|+++|+++|++++.+++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~-----~~~~~~~~~~i 83 (304) T protein:vir:10 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETER-----IQTSKPEYAQA 83 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcc-----cccccceeeEE Confidence 5566788889999999999999999999999999999999999999999999999999999975 66778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc--ccccccccccccccceeecccch Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW--VSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++|++++++||+|+++||.++++++|.++|++++++++|+++++|+|++.+. .+.++........ .... T Consensus 84 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~------~~~~ 157 (304) T protein:vir:10 84 EMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG------NVVT 157 (304) T ss_pred EEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc------cccc Confidence 9999999999999999999999999999999999999999999999999976433 2222222222111 1111 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc--cccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD--DSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~--~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) .....++++.++..++...+..+++|+||+.++..|+++||++|+|+|++ ++++|+||+++++++.+.+++.++|||| T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~ 237 (304) T protein:vir:10 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALMGDW 237 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEecccccCCCCcEEEEEeh Confidence 22334556667777788888888999999999999999999999999986 5899999999999999888999999999 Q ss_pred 
hhEEEEeecCcEEEEeecceecc------CcceeeeeecCcEEEEEEEEEccEeecccceEEEeccc Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGT------GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~------~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~ 297 (305) +++++++|+++++++++++.+.. +...+++|++|++++|+++|+|+++.+|+||++++.+. T Consensus 238 ~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 238 DYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred hhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999999999999999976543 33466889999999999999999999999999999988 No 4 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=1.3e-60 Score=348.92 Aligned_cols=286 Identities=24% Similarity=0.332 Sum_probs=247.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+++++||.+||+++.++|++.+++.++|+++|+++|++++.+++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~-----~~~~~~~~~~i 83 (304) T protein:vir:94 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETER-----IQTSKPEYAQA 83 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcc-----cccccceeeEE Confidence 5566788889999999999999999999999999999999999999999999999999999975 66778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc--ccccccccccccccceeecccch Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW--VSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++|++++++||+|+++||.++++++|.++|++++++++|+++++|+|++.+. .+.++........ .... T Consensus 84 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~------~~~~ 157 (304) T protein:vir:94 84 EMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG------NVVT 157 (304) T ss_pred EEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc------cccc Confidence 9999999999999999999999999999999999999999999999999976433 2222222222111 1111 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc--cccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD--DSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~--~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) .....++++.++..++...+..+++|+||+.++..|+++||++|+|+|++ ++++|+||+++++++.+.+++.++|||| T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~ 237 (304) T protein:vir:94 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALMGDW 237 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEecccccCCCCcEEEEEeh Confidence 22334556667777788888888999999999999999999999999986 5899999999999999888999999999 Q ss_pred 
hhEEEEeecCcEEEEeecceecc------CcceeeeeecCcEEEEEEEEEccEeecccceEEEeccc Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGT------GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~------~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~ 297 (305) +++++++|+++++++++++.+.. +...+++|++|++++|+++|+|+++.+|+||++++.+. T Consensus 238 ~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 238 DYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred hhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999999999999999976543 33466889999999999999999999999999999988 No 5 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=3e-60 Score=346.95 Aligned_cols=287 Identities=18% Similarity=0.219 Sum_probs=243.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ||++++ ||+++|+++.++|++.+++.++++++|++++++++.+++|+.++.+.++|++|++. 
+|+++++|+++ T Consensus 1 mat~~~--gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~-----~~~~~~~f~~v 73 (311) T protein:vir:81 1 MVALAT--GTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQ-----KSESTATFAPV 73 (311) T ss_pred CceecC--CceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcc-----cccccceeeEE Confidence 998766 79999999999999999999999999999999999999999999999999999875 67778899999 Q ss_pred EeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++++|+++++++|+|+++ |+..+++++|.+++++++++++|+++++|++++.+..+.++.+......+........ T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~ 153 (311) T protein:vir:81 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGT 153 (311) T ss_pred EEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccc Confidence 9999999999999999996 5667899999999999999999999999998777766777776655444433332222 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC----- Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD----- 225 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~----- 225 (305) . ......+..+...+...++.+++|+||+.++..|+++||++|+|+|++ .+++|+||++++.++.. 
T Consensus 154 ~--~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~ 231 (311) T protein:vir:81 154 S--ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) T ss_pred c--chHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccc Confidence 2 122334555666666677788899999999999999999999999963 57999999999887632 Q ss_pred ---------CCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 226 ---------ADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 226 ---------~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .++..++||||++|+++.+++++++++++.. .+..+++|++|++++|++.|+||.|.||+||++++.+ T Consensus 232 ~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD---PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) T ss_pred cccchhcccCCccEEEEEecccEEEEEeccceEEEeccCC---CCcchhhhhcCcEEEEEEEEeccEeecccceEEEEee Confidence 2345689999999999999999999988763 2345678999999999999999999999999999998 Q ss_pred ccc Q lcl|Aclame:pro 297 PVA 299 (305) Q Consensus 297 ~~a 299 (305) ..| T Consensus 309 ~~~ 311 (311) T protein:vir:81 309 DES 311 (311) T ss_pred ccC Confidence 766 No 6 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=8.7e-60 Score=344.36 Aligned_cols=297 Identities=23% Similarity=0.307 Sum_probs=248.3 Q ss_pred CC--------CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MA--------DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma--------~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |+ 
.+++.++|.++|+++.++|++.+++.++|++++++++++++.+++|+.++.+.+.|++|++. +++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~ 75 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAER-----KPI 75 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCc-----ccc Confidence 22 23455567788889999999999999999999999999999999999999999999999875 667 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccc--c Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQ--A 150 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~--~ 150 (305) ++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++.+. .++++....... . T Consensus 76 ~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~--~g~~~~~~~~~~~~~ 153 (330) T protein:vir:77 76 TKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAF--KGYLAETTKVVSLAD 153 (330) T ss_pred ccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcc--ccccccccccceeec Confidence 788999999999999999999999999999999999999999999999999999999976543 444443322211 1 Q ss_pred eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------------cccCccceEe Q lcl|Aclame:pro 151 VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------------DSFAGFRTFF 218 (305) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------------~~l~G~pv~~ 218 (305) ...........+.++++..++..+...+...++|+||+.++..|+++||++|+|+|++ .+++|+||++ T Consensus 154 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~ 233 (330) T protein:vir:77 154 TNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYV 233 (330) T ss_pred ccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEE Confidence 2223333445556677777788888888888899999999999999999999999974 
3689999999 Q ss_pred cCccccCC--CCceEEEEehhhEEEEeecCcEEEEeecceeccCc--------ceeeeeecCcEEEEEEEEEccEeeccc Q lcl|Aclame:pro 219 NRNGAWDA--DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGE--------NQINLAERDMVALRLKARFAYVLGVSA 288 (305) Q Consensus 219 ~~~~~~~~--~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~--------~~~~~~~~~~~~~r~~~r~~~~v~~p~ 288 (305) +++++... ++..+++|||++++++++++++++++++.++..+. ..+++|++|++++|++.|+|+++.+|+ T Consensus 234 ~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 313 (330) T protein:vir:77 234 ADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKD 313 (330) T ss_pred eccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEeccc Confidence 99988543 45679999999999999999999999999877643 456789999999999999999999999 Q ss_pred ceEEEeccccccccCCC Q lcl|Aclame:pro 289 TAQGANKTPVAVVAPAA 305 (305) Q Consensus 289 a~~~~~~t~~a~v~~a~ 305 (305) ||++++.+.++. +|-- T Consensus 314 a~~~i~~~~~~~-~~~~ 329 (330) T protein:vir:77 314 AFVKLTDQVAGT-DPEE 329 (330) T ss_pred ceEEEEeccCCc-CCCC Confidence 999998875433 4444 No 7 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.5e-59 Score=343.03 Aligned_cols=284 Identities=20% Similarity=0.303 Sum_probs=248.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+.++++++|.+||++++++|++.+++.++++++|+++|++++..++|+.++ +.+.|++|++. 
+|+++++|+++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~-----~~~~~~~f~~v 79 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG-VGAFWVDEAER-----IQTSKPTFTKA 79 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC-CceeeeecCcc-----ccccccceeEE Confidence 8899999999999999999999999999999999999999999999998865 77999999875 66778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|++++++||+|+++||..+++++|.++|++++++++|+++|+|+|++. +.+++.......+.. ..+... T Consensus 80 ~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~---~~gil~~~~~~~~~~--~~~~~~- 153 (299) T protein:vir:41 80 KMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPY---NWNILKSATDASNLV--EETANK- 153 (299) T ss_pred EEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc---cccccccccccceee--cccccc- Confidence 99999999999999999999999999999999999999999999999999764 345555443332221 122222 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEEE Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVIA 234 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~g 234 (305) ++++.++..++...++.+++|+||+.++..|+++||++|+|+|++ ++++|+||++++.++.+.++..++|| T Consensus 154 ---~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~g 230 (299) T protein:vir:41 154 ---YDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVG 230 (299) T ss_pred ---HHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEE Confidence 345666677778888888999999999999999999999999975 47899999999999988888899999 Q ss_pred 
ehhhEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 235 DSSRVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 235 df~~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) ||++++++++++++++++++.++.. +...+++|++|++++|++.|+||.+.+|+||++++.+.+- T Consensus 231 dfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 231 DWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred ecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 9999999999999999999987654 3446778999999999999999999999999999987654 No 8 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.4e-59 Score=343.24 Aligned_cols=291 Identities=16% Similarity=0.155 Sum_probs=242.7 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQGSTVLSA-FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |+. +++++||++||+++.++|++.+++.++++++ ++++++.++.+++|+.++.+.+.|++|++. 
+|+++++|+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~-----~~~s~~~f~ 138 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKD-----VVATGATFD 138 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCcc-----cccccccee Confidence 333 2445689999999999999999999999998 899999999999999999999999999875 677888999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|+++++++|+|+++||.++++++|.++|++++++++|++||+|+|.+. .|.|+++.............+.. T Consensus 139 ~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~--~p~Gi~~~~~~~~~~~~~~~t~~ 216 (366) T protein:vir:57 139 DVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGD--TPKGMKAVATAANRLVAWTGTAI 216 (366) T ss_pred EEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--cccceeeccccccceeecccccc Confidence 9999999999999999999999999999999999999999999999999998643 56777765554443333333333 Q ss_pred hhhHHHHHHHHH--HHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---cccCccceEecCccccC----CCCc Q lcl|Aclame:pro 159 NESDIVGATNRA--AKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTFFNRNGAWD----ADAA 229 (305) Q Consensus 159 ~~~~~~~~~~~~--~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---~~l~G~pv~~~~~~~~~----~~~~ 229 (305) +..++..++..+ .......+...+.|+||+.++..|+++||++|+|+|++ ++|+|+||++++++|.+ .+.. 
T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~ 296 (366) T protein:vir:57 217 NLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNES 296 (366) T ss_pred chhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccccCCCcc Confidence 333332222222 22333445667899999999999999999999999963 57999999999998854 3456 Q ss_pred eEEEEehhhEEEEeecCcEEEEeecceeccCc-ceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 230 IEVIADSSRVKIGVRQDITVKFLDQATLGTGE-NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 230 ~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~-~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) .++||||++|++++|+++++++++++++.+.+ ..+++|++|++++|++.|+||++.||++|+.++...= T Consensus 297 ~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 297 EIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 79999999999999999999999998877644 4578899999999999999999999999999998877 No 9 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=5.9e-59 Score=339.81 Aligned_cols=291 Identities=20% Similarity=0.237 Sum_probs=248.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+.++++++|.+||+++.++|++.+++.++|+++|+++++.++++++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~-----~~~~~~~f~~v 88 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDM-----KPITKGNMTSQ 88 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCcc-----ccccccceeEE Confidence 8888888888899999999999999999999999999999999999999999999999999875 67788899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++.+..+.++.+.......... ..... T Consensus 89 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 165 (320) T protein:vir:10 89 NIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGA---TASDL 165 (320) T ss_pred EEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccc---ccccc Confidence 999999999999999999999999999999999999999999999999998776655555444333222111 11222 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------------cccCccceEecCccccCCCC Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------------DSFAGFRTFFNRNGAWDADA 228 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------------~~l~G~pv~~~~~~~~~~~~ 228 (305) ....+.+.++...+...+..+++|+|||+++..|+++||++|+|+|++ .+++|+|++++++++. 
++ T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~--~~ 243 (320) T protein:vir:10 166 TAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVAD--GT 243 (320) T ss_pred ccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCC--Cc Confidence 222334556677777788889999999999999999999999999964 3588999999998764 45 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCc----ceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGE----NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~----~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ..++||||++++++++++++++++++.+++.++ ..+++|++|++++|++.|+||.+.||++|+++++.. +|. T Consensus 244 ~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~----ap~ 319 (320) T protein:vir:10 244 TVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVV----TPD 319 (320) T ss_pred eEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEecc----CCC Confidence 568899999999999999999999999877643 356789999999999999999999999999998753 355 Q ss_pred C Q lcl|Aclame:pro 305 A 305 (305) Q Consensus 305 ~ 305 (305) | T Consensus 320 ~ 320 (320) T protein:vir:10 320 A 320 (320) T ss_pred C Confidence 5 No 10 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=8.6e-59 Score=338.91 Aligned_cols=285 Identities=14% Similarity=0.149 Sum_probs=235.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |++.+ 
+||.+||++++++|++.+++.++++++|+++++.++..++|+.++.+.+.|++|++. +|+++++|+++ T Consensus 1 m~t~t--~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~-----~~~s~~~f~~v 73 (303) T protein:vir:97 1 MGTET--SKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGK-----KTHGGLSLEPV 73 (303) T ss_pred CcccC--CCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCcc-----ccccccceeeE Confidence 99544 578999999999999999999999999999999999999999999999999999975 77788999999 Q ss_pred EeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc--ccccccccccccccceeecc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW--VSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~--~~~~~~~~~~~~~~~~~~~~ 155 (305) ++++||+++++++|+|+++ |+.++++++|.++|++++++++|+++++|++++.+. .+.+................ T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (303) T protein:vir:97 74 TIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTE 153 (303) T ss_pred EeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccccccc Confidence 9999999999999999994 678899999999999999999999999997654332 12221111111112222222 Q ss_pred cchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc--------cccCccceEecCccccCC- Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD--------DSFAGFRTFFNRNGAWDA- 226 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~--------~~l~G~pv~~~~~~~~~~- 226 (305) + ...++++.+++..+...++.+++|+|||.++..|+++||++|+|+|++ ++++|+||+++++++... 
T Consensus 154 ~----~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 229 (303) T protein:vir:97 154 S----EDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGAD 229 (303) T ss_pred c----cchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccc Confidence 2 233456666667777777888999999999999999999999999964 379999999999987532 Q ss_pred ---CCceEEEEehh-hEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 227 ---DAAIEVIADSS-RVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 227 ---~~~~~~~gdf~-~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ++..++||||+ .+.++.|++++++++++.. .++..+++|++|++++|+++|+||+|.+|+||++++.+++ T Consensus 230 ~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~--~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 230 EAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGD--PDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred cCCCccEEEEeeccccEEEEEecCcEEEEeeccC--CCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 34568999995 5679999999999987543 3455778999999999999999999999999999999887 No 11 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.5e-58 Score=337.54 Aligned_cols=289 Identities=20% Similarity=0.259 Sum_probs=246.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+++++++.+||+++.++|++.+++.++|+++|+++|++++++++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~-----~~~~~~~f~~v 101 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcceeEeccCcc-----ccccccceeEE Confidence 4555667789999999999999999999999999999999999999999999999999999875 67788999999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++........ ....+..++ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~gi~~~~~~~~---~~~~~~~~~ 176 (324) T protein:vir:97 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTN---KVIKGDFTQ 176 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--cCccccccccccc---eeccccCCH Confidence 99999999999999999999999999999999999999999999999998653 4455544322221 122233333 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---cccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) +++.++..++...++.+++|+||+.++..|+++||++|+|+|++ ++++|+||++++..+ .+++.++||||+ T Consensus 177 ----~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~~~~tl~G~PV~~~~~~~--~~~~~~~~gd~~ 250 (324) T protein:vir:97 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) T ss_pred ----HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeeEeecCCC--CCcceEEEEecc Confidence 44555667777888888999999999999999999999999974 579999999876543 467889999999 Q ss_pred 
hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEecc-ccccccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKT-PVAVVAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t-~~a~v~~a~ 305 (305) ++++++++++++++++++++.. +...+++|++|++++|++.|+|+.+.+|+||++++.+ +....+||- T Consensus 251 ~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) T protein:vir:97 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCC Confidence 9999999999999999987664 3457889999999999999999999999999999977 444556666 No 12 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.3e-58 Score=337.87 Aligned_cols=285 Identities=14% Similarity=0.156 Sum_probs=235.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ||+++++. |.+||+++..+|++.+++.++++++|+++++.++.+++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 1 ma~~t~~~-G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~-----~~~s~~~f~~v 74 (300) T protein:vir:95 1 MSEAQLSK-GNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGK-----KTHGGVSLDPV 74 (300) T ss_pred CcccccCC-cceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcc-----cccccccceee Confidence 99977766 5689999999999999999999999999999999999999999999999999875 77888999999 Q ss_pred EeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc--ccccccccccccccceeecc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW--VSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~--~~~~~~~~~~~~~~~~~~~~ 155 (305) ++++||++++++||+|+++ |+.++++++|.+++++++++++|+++|+|++++.+. .+.+............ .. T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~- 152 (300) T protein:vir:95 75 TIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTV-PF- 152 (300) T ss_pred EeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceee-cc- Confidence 9999999999999999994 678999999999999999999999999997644332 2222222111111111 11 Q ss_pred cchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec-------ccccCccceEecCccccCC-- Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR-------DDSFAGFRTFFNRNGAWDA-- 226 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~-------~~~l~G~pv~~~~~~~~~~-- 226 (305) +.....+.+.++...+...++.+++|+|||.++..|+++||++|+|+|+ +.+++|+||++++.++... 
T Consensus 153 ---~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~ 229 (300) T protein:vir:95 153 ---KDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTD 229 (300) T ss_pred ---cccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCC Confidence 1223345566667777777788889999999999999999999999995 3679999999999988654 Q ss_pred CCceEEEEehhhEE-EEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVK-IGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 227 ~~~~~~~gdf~~~~-~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ++..+++|||++++ ++.|++++++++++.. .++..+++|++|++++|+++|+||++.+|++|+++++++= T Consensus 230 ~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~--~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 230 PKNTAIVGDFETMFKWGYAKEVPMEIIKYGD--PDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CccEEEEeeccceEEEEEecccEEEEeeccC--CCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 34568899999865 8999999999988664 3445678999999999999999999999999999987643 No 13 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=1.4e-58 Score=337.84 Aligned_cols=285 Identities=14% Similarity=0.171 Sum_probs=236.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ||. +||.++|+++.++|++.++++++++++|+++++.++.+++|+.++.+.++|++|++. 
+|+++++|+++ T Consensus 1 ma~----~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~-----~~~~~~~f~~v 71 (298) T protein:vir:16 1 MVL----NKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGK-----KTHGGVTLAPQ 71 (298) T ss_pred Ccc----cCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCcc-----ccccccceeEE Confidence 773 358899999999999999999999999999999999999999999999999999875 77788999999 Q ss_pred EeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++||+++++++|+|+++ |+.++++++|.++|++++++++|+++++|++.+.+.... +................. T Consensus 72 ~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~-~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 72 TMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASA-VIGTNHFDSKVTQKVEAP 150 (298) T ss_pred EEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccc-cccccccccccccccccc Confidence 9999999999999999995 567899999999999999999999999997654432221 111111111112222223 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC--CCC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD--ADA 228 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~--~~~ 228 (305) ....+..+++.++..++...++.+++|+||++++..|+++||++|+|+|++ .+++|+||+++++++.. 
.++ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:16 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred cccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCc Confidence 333444566777777777778888899999999999999999999999975 47999999999998853 345 Q ss_pred ceEEEEehhhEE-EEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccc Q lcl|Aclame:pro 229 AIEVIADSSRVK-IGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) Q Consensus 229 ~~~~~gdf~~~~-~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~ 297 (305) ..+++|||++++ ++.+++++++++++.. .++..+++|++|++++|+++|+||++.||++|++++.+. T Consensus 231 ~~~~~GDfs~~~~~~~~~~~~~~~~~~~~--~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGD--PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeeccC--CcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 679999999864 8999999999988753 344567899999999999999999999999999999876 No 14 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1.8e-58 Score=337.20 Aligned_cols=288 Identities=20% Similarity=0.274 Sum_probs=241.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+..++++++.++|+++.++|++.+++.++|++++++++++++++++|++++.+.+.|++|++. 
+++++++|+++ T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~-----~~~s~~~f~~v 84 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDM-----KPITKGNMTKR 84 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcc-----ccccccceeEE Confidence 7777777778888889999999999999999999999999999999999999999999999875 66778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) ++++||++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++.+.. ++..... . .........+ T Consensus 85 ~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~--~~~~~~~-~---~~~~~~~~~~ 158 (397) T protein:vir:23 85 DVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQ--GYLDQSN-K---TQSISPNAYQ 158 (397) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccc--ccccccc-c---eeeecccchh Confidence 99999999999999999999999999999999999999999999999999876432 2222111 1 1112222223 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc------------ccCccceEecCccccCCCC Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD------------SFAGFRTFFNRNGAWDADA 228 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~------------~l~G~pv~~~~~~~~~~~~ 228 (305) + .+.++...+...++..+.|+||+..+..|+++||++|+|+|++. +++|+|++++++++. 
++ T Consensus 159 ~----~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~--g~ 232 (397) T protein:vir:23 159 G----LGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAE--GD 232 (397) T ss_pred H----HHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCC--Cc Confidence 3 33344455666777888999999999999999999999999753 689999999999873 45 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccC----cceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc-- Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTG----ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA-- 302 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~----~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~-- 302 (305) ..+++|||++++++++++++++++++.+++.. ...+++|++|++++|++.|+||++.+|++|++++.++..... T Consensus 233 ~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~ 312 (397) T protein:vir:23 233 VVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYAL 312 (397) T ss_pred eEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeee Confidence 67899999999999999999999999987754 357889999999999999999999999999999987543321 Q ss_pred --CCC Q lcl|Aclame:pro 303 --PAA 305 (305) Q Consensus 303 --~a~ 305 (305) |.+ T Consensus 313 ~~~~~ 317 (397) T protein:vir:23 313 DLDGA 317 (397) T ss_pred ccccc Confidence 222 No 15 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.7e-58 Score=337.24 Aligned_cols=291 Identities=15% Similarity=0.139 Sum_probs=239.3 Q ss_pred CC-CccCCccceEccHHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MA-DISRAEVASLIQEAYSDTLLAAAKQGSTVLSA-FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma-~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 
(305) ++ .++++.||++||+++.++|++.+++.++|+++ ++++++.++.+++|+.++.+.+.|++|++. +|+++++|+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~f~ 199 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQD-----AKVSEARFD 199 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCcc-----cccccccee Confidence 22 33445688999999999999999999999999 788999889999999999999999999875 677788999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc-c Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG-V 157 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 157 (305) ++++.++|++++++||+|+++|+.+++++||.++|++++++++|++||+|+|++ ..|.|+++.+............ . T Consensus 200 ~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~--~~p~Gi~~~~~~~~~~~~~~~~~~ 277 (428) T protein:vir:10 200 DVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTG--DTPIGMKARATQWNRLLPWAADAA 277 (428) T ss_pred eEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--cccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999864 3567777665544332322222 2 Q ss_pred hhhhHHHHHHH--HHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---cccCccceEecCccccCC----CC Q lcl|Aclame:pro 158 ANESDIVGATN--RAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTFFNRNGAWDA----DA 228 (305) Q Consensus 158 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---~~l~G~pv~~~~~~~~~~----~~ 228 (305) .+.+.+...+. .+.......+...+.|+||+.++..|+++||++|+|+|++ ++++|+||++++++|.+. 
++ T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~ 357 (428) T protein:vir:10 278 VNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKE 357 (428) T ss_pred ccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCCcc Confidence 22222212222 1223334445567899999999999999999999999964 479999999999987643 45 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccC-cceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTG-ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ..++||||++|++++++++++++++++.+... ...+.+|++|++++|++.|+||++.+|++|+.++...- T Consensus 358 ~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 358 SEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 67999999999999999999999999865544 34678899999999999999999999999999998877 No 16 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=6.8e-58 Score=334.00 Aligned_cols=289 Identities=20% Similarity=0.253 Sum_probs=246.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..++++++|.+||+++.++|++.+++.++|++++++++++++++++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 101 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCcc-----ccccccceeEE Confidence 6667778889999999999999999999999999999999999999999999999999999875 67788899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++........ ....+..+ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~~~~~~~~---~~~~~~~t- 175 (324) T protein:vir:78 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTN---KVIKGDFT- 175 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCccccccccccc---eecccccc- Confidence 99999999999999999999999999999999999999999999999998653 3444444322221 11222233 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) .+.+.++..++...++..++|+||+.++..|+++||++|+|++. +.+++|+||+++.... 
.+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~--~~~~~~~~gd~~ 250 (324) T protein:vir:78 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeCCCC--CCcceEEEEecc Confidence 44455566677778888899999999999999999999999986 4579999999876543 567889999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc-ccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV-VAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~-v~~a~ 305 (305) ++++++++++++++++++++.. +...+++|++|++++|++.|+||.+.+|+||++++++..+. +||+- T Consensus 251 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:78 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 9999999999999999987554 45678899999999999999999999999999999864444 67777 No 17 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=6.8e-58 Score=334.00 Aligned_cols=289 Identities=20% Similarity=0.253 Sum_probs=246.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..++++++|.+||+++.++|++.+++.++|++++++++++++++++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 101 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCcc-----ccccccceeEE Confidence 6667778889999999999999999999999999999999999999999999999999999875 67788899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++........ ....+..+ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~~~~~~~~---~~~~~~~t- 175 (324) T protein:vir:96 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTN---KVIKGDFT- 175 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCccccccccccc---eecccccc- Confidence 99999999999999999999999999999999999999999999999998653 3444444322221 11222233 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) .+.+.++..++...++..++|+||+.++..|+++||++|+|++. +.+++|+||+++.... 
.+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~--~~~~~~~~gd~~ 250 (324) T protein:vir:96 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeCCCC--CCcceEEEEecc Confidence 44455566677778888899999999999999999999999986 4579999999876543 567889999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc-ccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV-VAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~-v~~a~ 305 (305) ++++++++++++++++++++.. +...+++|++|++++|++.|+||.+.+|+||++++++..+. +||+- T Consensus 251 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 9999999999999999987554 45678899999999999999999999999999999864444 67777 No 18 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=3.2e-58 Score=335.82 Aligned_cols=291 Identities=17% Similarity=0.202 Sum_probs=242.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSA-FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +..++++.||++||+++.++|++.+++.++++++ ++++++.++.+++|+.++.+.+.|++|++. 
+|+++++|++ T Consensus 132 ~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~-----~~~~~~~f~~ 206 (435) T protein:vir:80 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTD-----IPTTQQQFDD 206 (435) T ss_pred hcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCcc-----ccccccceee Confidence 5567777899999999999999999999999998 889999999999999999999999999875 6777889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++.++|+++++++|+|+++|+.+ +++++|.++|++++++++|++||+|+|++. .|.|+++.......... ... T Consensus 207 i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~--~p~Gi~~~~~~~~~~~~--~~~ 282 (435) T protein:vir:80 207 LKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTAN--TPKGLRFWALPGNVITA--SDG 282 (435) T ss_pred EEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC--cccceeecccccceeec--ccc Confidence 999999999999999999999854 799999999999999999999999998653 46676665544332222 222 Q ss_pred hhhhHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccCC----CC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDA----DA 228 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~~----~~ 228 (305) .+......++.+++..+.. .++..++|+||+.++..|+++||++|+|+|+ +++++|+||++++.+|... +. 
T Consensus 283 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~ 362 (435) T protein:vir:80 283 STLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAGKE 362 (435) T ss_pred cchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccCCCCc Confidence 2222223333344333332 3456788999999999999999999999995 4689999999999987543 34 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCc-ceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGE-NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~-~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) ++++||||++|++++|+++++++++++++.+.. ..+++|++|++++|++.|+||.+.+|++|+.+++..-++ T Consensus 363 ~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 363 SEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred ceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 579999999999999999999999999776654 356789999999999999999999999999999986655 No 19 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=8.9e-58 Score=333.35 Aligned_cols=289 Identities=20% Similarity=0.253 Sum_probs=244.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+++++++.+||++++++|++.+++.++|+++|+++|+.++++++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 101 (324) T protein:vir:10 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCcc-----ccccccceeEE Confidence 4445556677899999999999999999999999999999999999999999999999999975 67778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++........ ....+..+ T Consensus 102 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~--~~~~i~~~~~~~~---~~~~~~~t- 175 (324) T protein:vir:10 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTN---KVIKGDFT- 175 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCccccccccccc---eeccccCC- Confidence 99999999999999999999999999999999999999999999999998753 4445444332221 11222233 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---cccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) ++++.+++..+...++.+++|+|||.++..|+++||++|+|+|++ .+++|+||++++... 
.+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PV~~~~~~~--~~~~~~~~gd~~ 250 (324) T protein:vir:10 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCCCCccccceeEEeecCCC--CCcceEEEEecc Confidence 345556677777788888999999999999999999999999864 579999999876543 467889999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc-cccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA-VVAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a-~v~~a~ 305 (305) ++++++++++++++++++++.. +...+++|++|++++|++.|+||.+.+|+||++++.+.++ ..+||. T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:10 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC Confidence 9999999999999999987554 3456788999999999999999999999999999977443 457777 No 20 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.4e-58 Score=336.52 Aligned_cols=291 Identities=16% Similarity=0.202 Sum_probs=243.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSA-FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++.+||++||+++.++|++.+++.++++++ ++++++.++.+++|+.++.+.+.|++|++. 
+++++++|++ T Consensus 132 ~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~-----~~~~~~~f~~ 206 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTD-----IPTTQQQFDD 206 (435) T ss_pred cccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCcc-----ccccccceeE Confidence 6677888899999999999999999999999998 889999988999999999999999999875 6677889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++.++|+++++++|+|+++|+.+ +++++|.++|++++++++|++|++|+|++. .|.|++........ ...... T Consensus 207 i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~--~p~Gi~~~~~~~~~--~~~~~~ 282 (435) T protein:vir:14 207 LKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTAN--TPKGLRFWALPSNV--ITASDA 282 (435) T ss_pred EEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--cccceeecccccce--eccccc Confidence 999999999999999999999854 699999999999999999999999998654 46666654333222 122222 Q ss_pred hhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccC----CCC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWD----ADA 228 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~----~~~ 228 (305) .+.+....++.++...+... ++..++|+||+.++..|+++||++|+|+|+ +++++|+||++++.+|.. .+. 
T Consensus 283 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~ 362 (435) T protein:vir:14 283 STLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKE 362 (435) T ss_pred cchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEeeccccccccCCCcc Confidence 33334444555555444432 456778999999999999999999999995 368999999999998764 234 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCc-ceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGE-NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~-~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) +.++||||++|++++|+++++++++++.+.... ..+.+|++|++++|++.|+||++.+|++|+.+++.+.+. T Consensus 363 ~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 363 SEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred ceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 579999999999999999999999998776644 356789999999999999999999999999999988766 No 21 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=9.9e-58 Score=333.11 Aligned_cols=298 Identities=20% Similarity=0.237 Sum_probs=243.6 Q ss_pred CCCccCCccc------eEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhh---cccccc Q lcl|Aclame:pro 1 MADISRAEVA------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATD---PKGVKP 71 (305) Q Consensus 1 Ma~~t~~~gg------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~---~~~~~~ 71 (305) |+.++..+|+ .++|+++.++|++.+++.++++++|++++++++.+++|+.++.+.+.|++|++.. 
+.+.++ T Consensus 10 ~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~ 89 (333) T protein:vir:78 10 NSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKP 89 (333) T ss_pred hcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCccccccccccccc Confidence 4444443433 4899999999999999999999999999999999999999999999999998753 234578 Q ss_pred cccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccce Q lcl|Aclame:pro 72 TSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 72 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) +++++|+++++++||+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++.+..+.++.+......... T Consensus 90 ~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~ 169 (333) T protein:vir:78 90 LSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTN 169 (333) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccccc Confidence 89999999999999999999999999999999999999999999999999999999999887777777665443332211 Q ss_pred eecccchhhhHHHHHHHHHHHHhhhc-cccceEEEEchHHHHHHHH---hhccCCceeecc-------cccCccceEecC Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVASA-GWAPDTLLSSLALRYEVAN---IRDANGNPVFRD-------DSFAGFRTFFNR 220 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~---~kd~~G~~l~~~-------~~l~G~pv~~~~ 220 (305) . ..........++.+.+++..+... 
++.++.|+|||..+..|++ ++|++|+|+|++ .+++|+||++++ T Consensus 170 ~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~ 248 (333) T protein:vir:78 170 V-DYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGR 248 (333) T ss_pred c-cccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEcc Confidence 1 111112222344555555555443 4567789999999987765 679999999974 479999999999 Q ss_pred ccccCC-----CCceEEEEehhhEEEEeecCcEEEEeecceecc-CcceeeeeecCcEEEEEEEEEccEeecccceEEEe Q lcl|Aclame:pro 221 NGAWDA-----DAAIEVIADSSRVKIGVRQDITVKFLDQATLGT-GENQINLAERDMVALRLKARFAYVLGVSATAQGAN 294 (305) Q Consensus 221 ~~~~~~-----~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~-~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~ 294 (305) +++.+. ++..+++|||++|++++++++++++++++++.. +...+++|++|++++|+++|+||.+.+|++|++++ T Consensus 249 ~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~ 328 (333) T protein:vir:78 249 AVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFV 328 (333) T ss_pred ccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEe Confidence 988653 345799999999999999999999999987655 44578899999999999999999999999999998 Q ss_pred ccccc Q lcl|Aclame:pro 295 KTPVA 299 (305) Q Consensus 295 ~t~~a 299 (305) .+.++ T Consensus 329 ~~~a~ 333 (333) T protein:vir:78 329 DDEQP 333 (333) T ss_pred ccCCC Confidence 87543 No 22 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=5.1e-58 Score=334.67 Aligned_cols=285 Identities=15% Similarity=0.109 Sum_probs=238.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccc-ccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTS-KVTWAN 79 (305) Q Consensus 1 
Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~-~~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++.++.+++|+..+.+.+.|++|++.. |.+ .++|++ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-----~~~~~~~f~~ 180 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDAR-----PETATSKLGL 180 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeecccccc-----ccccccccee Confidence 88888999999999999999999999999999999999999999999999999999999998763 433 368999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee------ Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV------ 153 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~------ 153 (305) +++.++|+++++++|+|+++|+.++++++|.++|++++++++|.+|++|+|++ +|.|+++........... T Consensus 181 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~ 257 (407) T protein:vir:48 181 IEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSK---KPKGFLAYESTDEDDKTRAFGKLQ 257 (407) T ss_pred EEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCC---ccceeeeccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999975 466776554433221110 Q ss_pred -cccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcccc- Q lcl|Aclame:pro 154 -VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAW- 224 (305) Q Consensus 154 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~- 224 (305) ........-.++++.+++..+...+...++|+||+.++..|+++||++|||+|++ .+++|+||++++++|. 
T Consensus 258 ~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~ 337 (407) T protein:vir:48 258 HIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDI 337 (407) T ss_pred ccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCc Confidence 0111111112445566666777777888899999999999999999999999975 3799999999999875 Q ss_pred CCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 225 DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 225 ~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) +.+...++||||++ |.+++|.++++..++ ++++|++.+|++.|+|+++.+|+||++++.++++.... T Consensus 338 ~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~------------~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~~~~ 405 (407) T protein:vir:48 338 AADAKAIAFGNFKRGYTIVDRIGTRILRDP------------YTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATRQKA 405 (407) T ss_pred cCCccEEEEEeccccEEEEEeeceEEEeec------------cccCCcEEEEEEEEeccEEecccceEEEEeeccCCCCC Confidence 34556789999986 678889998876432 35789999999999999999999999999999988777 Q ss_pred CC Q lcl|Aclame:pro 304 AA 305 (305) Q Consensus 304 a~ 305 (305) || T Consensus 406 ~~ 407 (407) T protein:vir:48 406 AA 407 (407) T ss_pred CC Confidence 77 No 23 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.2e-57 Score=332.67 Aligned_cols=289 Identities=19% Similarity=0.249 Sum_probs=244.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) 
+..+++.+++.+||+++.++|++.+++.++|+++|+++|+.++++++|+.++.+.+.|++|++. +|+++++|+++ T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 101 (324) T protein:vir:99 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCcc-----ccccccceeEE Confidence 4445556677899999999999999999999999999999999999999999999999999875 67788899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++........ ....+..+ T Consensus 102 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~--~~~~~~~~~~~~~---~~~~~~~~- 175 (324) T protein:vir:99 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTN---KVIKGDFT- 175 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCccccccccccc---eeccccCC- Confidence 99999999999999999999999999999999999999999999999998653 4444444322221 11222233 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) .+++.+++..+...++.++.|+|||.++..|+++||++|+|+|. +++++|+||++++.+. 
.+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PVv~~~~~~--~~~~~~i~gd~~ 250 (324) T protein:vir:99 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeEEeecCCC--CCcceEEEEecc Confidence 44556667777888888899999999999999999999999986 3579999999887554 467789999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEecc-ccccccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKT-PVAVVAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t-~~a~v~~a~ 305 (305) ++++++++++++++++++++.. +...+++|++|++++|++.|+||.+.||++|++++.+ +....+||. T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~ 323 (324) T protein:vir:99 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC Confidence 9999999999999999987654 4456789999999999999999999999999999977 334446666 No 24 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=1.6e-57 Score=331.93 Aligned_cols=289 Identities=20% Similarity=0.255 Sum_probs=244.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+++++++.+||++++++|++.+++.++++++|++++++++.++||+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~-----~~~~~~~f~~i 101 (324) T protein:vir:93 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCcc-----ccccccceeEE Confidence 5556666678899999999999999999999999999999999999999999999999999875 67778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++. .+.++......... ...+..+ T Consensus 102 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~~~~~~~~~~~~---~~~~~~~- 175 (324) T protein:vir:93 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIEKTNK---VIKGDFT- 175 (324) T ss_pred EEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--cCccccccccccce---ecccccc- Confidence 99999999999999999999999999999999999999999999999988643 34444443322211 1222223 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) .+++.+++..+...++..+.|+||+.++..|++++|++|+|++. +.+++|+||+++... 
..+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PVv~~~~~--~~~~~~i~~gdfs 250 (324) T protein:vir:93 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSS--NLKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCC--CCCcceEEEEecc Confidence 44566677777888888899999999999999999999999986 457999999987643 3567889999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc-ccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV-VAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~-v~~a~ 305 (305) ++++++++++++++++++++.. +...+++|++|++++|++.|+||.+.+|++|++++.+.... +||+- T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:93 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCC Confidence 9999999999999999987654 34567899999999999999999999999999999764333 66766 No 25 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.6e-57 Score=331.99 Aligned_cols=290 Identities=19% Similarity=0.231 Sum_probs=238.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |...+ +++|.++|+++.++|++.+++.++++++|++++++++.+++|+.++.+.+.|++|++. 
+|+++++|+++ T Consensus 20 ~~~~~-~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~f~~i 93 (326) T protein:vir:42 20 AQTGD-SMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEGDM-----KPITKGNMTSQ 93 (326) T ss_pred eeccc-cCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEecCCcc-----ccccccceeEE Confidence 54444 4456689999999999999999999999999999999999999999999999999875 67778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|+++++++|+|+++||.++++++|.++|++++++++|+++|+|+|++.+ .+++................... T Consensus 94 ~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p---~gi~~~~~~~~~~~~~~~~~~~~ 170 (326) T protein:vir:42 94 TIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFP---TFLAQTTKEVSLVDPDGTGSNAD 170 (326) T ss_pred EEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc---ccccccccccceeeccccccccc Confidence 999999999999999999999999999999999999999999999999997643 44443333222222222211111 Q ss_pred hHHHH-HHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc------------ccCccceEecCccccCCC Q lcl|Aclame:pro 161 SDIVG-ATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD------------SFAGFRTFFNRNGAWDAD 227 (305) Q Consensus 161 ~~~~~-~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~------------~l~G~pv~~~~~~~~~~~ 227 (305) ....+ .+..+...+...++..+.|+||+.++..|+++||++|+|+|++. ++.|+|++++++++. 
+ T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~--~ 248 (326) T protein:vir:42 171 LTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVAS--G 248 (326) T ss_pred chhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCC--C Confidence 11111 12334445556677788999999999999999999999999752 588999999998874 5 Q ss_pred CceEEEEehhhEEEEeecCcEEEEeecceeccC----cceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc Q lcl|Aclame:pro 228 AAIEVIADSSRVKIGVRQDITVKFLDQATLGTG----ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) Q Consensus 228 ~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~----~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v 301 (305) +..+++|||++++++++++++++++++.+++.. ...+++|++|++++|++.|+||.+.||++|++++..+++.- T Consensus 249 ~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 249 TVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred ceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 567789999999999999999999999987654 34678899999999999999999999999999998865442 No 26 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=2.5e-57 Score=330.93 Aligned_cols=300 Identities=20% Similarity=0.232 Sum_probs=242.1 Q ss_pred CCCccCCc------cceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhh---cccccc Q lcl|Aclame:pro 1 MADISRAE------VASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATD---PKGVKP 71 (305) Q Consensus 1 Ma~~t~~~------gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~---~~~~~~ 71 (305) |+.++... ++.+||++++++|++.+++.++|+++|++++++++.+++|+.++.+.+.|++++... 
+.+.++ T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~ 89 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTKP 89 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccccccccccccccc Confidence 44333333 344899999999999999999999999999999999999999988887777654321 234578 Q ss_pred cccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccce Q lcl|Aclame:pro 72 TSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 72 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) +++++|++++++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++.+.++.++.+......... T Consensus 90 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~ 169 (338) T protein:vir:78 90 LSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTN 169 (338) T ss_pred ccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccccc Confidence 88899999999999999999999999999999999999999999999999999999999887777777766544433221 Q ss_pred eecccchhhhHHHHHHHHHHHHhhh-ccccceEEEEchHHHHHHH---HhhccCCceeecc-------cccCccceEecC Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVAS-AGWAPDTLLSSLALRYEVA---NIRDANGNPVFRD-------DSFAGFRTFFNR 220 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~l~---~~kd~~G~~l~~~-------~~l~G~pv~~~~ 220 (305) . ..........++.+.++...+.. 
..+..++|+||+.++..|+ +++|++|+|+|++ .+++|+||++++ T Consensus 170 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~ 248 (338) T protein:vir:78 170 V-DYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGK 248 (338) T ss_pred c-ccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEEcc Confidence 1 11222233445556666555543 4456778999999988774 5789999999964 479999999999 Q ss_pred ccccC-----CCCceEEEEehhhEEEEeecCcEEEEeecceeccCc----ceeeeeecCcEEEEEEEEEccEeecccceE Q lcl|Aclame:pro 221 NGAWD-----ADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGE----NQINLAERDMVALRLKARFAYVLGVSATAQ 291 (305) Q Consensus 221 ~~~~~-----~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~----~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~ 291 (305) ++|.. .++..++||||++|++++++++++++++++++.... ..+++|++|++++|++.|+||++.||++|+ T Consensus 249 ~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~ 328 (338) T protein:vir:78 249 AVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFV 328 (338) T ss_pred ccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceE Confidence 98753 234679999999999999999999999999877653 457889999999999999999999999999 Q ss_pred EEeccccccccCCC Q lcl|Aclame:pro 292 GANKTPVAVVAPAA 305 (305) Q Consensus 292 ~~~~t~~a~v~~a~ 305 (305) +++...++. 
| T Consensus 329 ~l~~~~~~~----~ 338 (338) T protein:vir:78 329 KFVDDEDPD----A 338 (338) T ss_pred EEecccCCC----C Confidence 999864433 2 No 27 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=2.6e-57 Score=330.76 Aligned_cols=289 Identities=19% Similarity=0.225 Sum_probs=243.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+.++++++|.+||+++.++|++.+++.++|+++|+++++.++.+++|+.++.+.+.|++|++. +++++++|+++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~-----~~~~~~~f~~i 88 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDM-----KPITKGNMTSQ 88 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCcc-----ccccccceeEE Confidence 8888888899999999999999999999999999999999999999999999999999999876 66778899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) ++++||+++++++|+|+++||.++++++|.++|++++++++|+++++|+|++.+.. +................ 
T Consensus 89 ~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~---~~~~~~~~~~~~~~~~~---- 161 (318) T protein:vir:24 89 TIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTY---IGQTTKAISIADTTGAT---- 161 (318) T ss_pred EEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcc---ccccccccccccccccc---- Confidence 99999999999999999999999999999999999999999999999999775433 33222211111111111 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc------------ccCccceEecCccccCCCC Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD------------SFAGFRTFFNRNGAWDADA 228 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~------------~l~G~pv~~~~~~~~~~~~ 228 (305) ....+.+..+...+...++..+.|+|||..+..|+++||++|+|+|++. ++.|+|+++++.++. ++ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~--~~ 239 (318) T protein:vir:24 162 TVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVE--GT 239 (318) T ss_pred chHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCC--Cc Confidence 1122334445566677778888999999999999999999999999753 588899999888753 56 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCc----ceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGE----NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~----~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..+++|||++++++++++++++++++++++... ..+++|++|++++|+++|+||.+.+|++|++++...++.-.. 
T Consensus 240 ~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 240 TVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred cEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 678999999999999999999999999877643 457789999999999999999999999999999876655444 No 28 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=7.5e-57 Score=328.27 Aligned_cols=289 Identities=20% Similarity=0.254 Sum_probs=242.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+.+++++.+||++++++|++.++++++++++++++|+++++++||+.++.+.+.|++|++. +|+++++|+++ T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~f~~v 101 (324) T protein:vir:96 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCcc-----ccccccceeEE Confidence 3344455677899999999999999999999999999999999999999999999999999875 67788899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) +++++|++++++||+|+++|+..+++++|.++|++++++++|+++|+|+|++. .+.++........ 
....+..++ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~--~~~~~~~~~~~~~---~~~~~~~~~ 176 (324) T protein:vir:96 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIKKTN---KVIKGDFTQ 176 (324) T ss_pred EEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCC--cCccccccccccc---eecccccch Confidence 99999999999999999999999999999999999999999999999988653 3444443322211 122222333 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec---ccccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) +++.+++.++...++.+++|+||+.++..|+++||++|+|++. +.+++|+||+++.... .+++.+++|||+ T Consensus 177 ----~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~~~~--~~~~~~~~gd~s 250 (324) T protein:vir:96 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) T ss_pred ----HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCC--CCcceEEEEecc Confidence 4455566677778888899999999999999999999999986 4589999999865443 467789999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEecc-ccccccCCC Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKT-PVAVVAPAA 305 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t-~~a~v~~a~ 305 (305) ++++++++++++++++++++.. 
+...+++|++|++++|++.|+||.+.+|++|++++.+ +...++|+- T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCC Confidence 9999999999999999987654 3457889999999999999999999999999999976 444556666 No 29 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=6.8e-57 Score=328.50 Aligned_cols=282 Identities=21% Similarity=0.271 Sum_probs=239.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCC-ceEEEEEeCCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTK-TTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..++++++|.+||++++++|++.+++.++|+++|++++++++ ...+|+..+.+.+.|++|++. +++++++|++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~f~~ 83 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEK-----IKTDKPEVVP 83 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcc-----ccccccceeE Confidence 7788888899999999999999999999999999999999765 467888899999999999875 6677789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|+++++++|+|+++|+.++++++|.+++++++++++|+++|+|+|++. +.++++...... 
....+..+ T Consensus 84 v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~---~~gi~~~~~~~~---~~~~~~~t 157 (297) T protein:vir:95 84 VTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPF---ANSVAKAAKDAN---KVIGGPIN 157 (297) T ss_pred EEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc---cccccccccccc---eecccccC Confidence 999999999999999999999999999999999999999999999999999764 345554433222 12222334 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc--cccCccceEecCccccCCCCceEEEEehh Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD--DSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~--~~l~G~pv~~~~~~~~~~~~~~~~~gdf~ 237 (305) +++ +.++..++...++.+++|+||+..+..|++++|++|+|+|++ .+++|+|++++...+ .+++.++||||+ T Consensus 158 ~~~----i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~~~~l~G~Pv~~~~~~~--~~~~~~~~gd~s 231 (297) T protein:vir:95 158 YDN----ILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKAANTIDGITTVDLKSAR--FEKGDLLAGDFD 231 (297) T ss_pred HHH----HHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCCCCcccceeeEeecCCC--CCCceEEEEecc Confidence 444 455666777778888999999999999999999999999974 579999998776543 467789999999 Q ss_pred hEEEEeecCcEEEEeecceecc----CcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc Q lcl|Aclame:pro 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) Q Consensus 238 ~~~~~~~~~i~v~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v 301 (305) +++++++++++++++++.++.. 
+...+++|++|++++|++.|+||++.+|++|++++.+ .+| T Consensus 232 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a--t~~ 297 (297) T protein:vir:95 232 NLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPA--ERV 297 (297) T ss_pred cEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeec--CCC Confidence 9999999999999999987654 3456788999999999999999999999999999754 345 No 30 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=5.9e-57 Score=328.87 Aligned_cols=285 Identities=14% Similarity=0.172 Sum_probs=236.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+. +||.+||+++.++|++.++++++++++|++++++++.+++|+.++.+.+.|++|++. +|+++++|+++ T Consensus 1 ma~----~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~-----~~~~~~~f~~v 71 (298) T protein:vir:94 1 MVL----NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGK-----KTHGGVTLAPQ 71 (298) T ss_pred Cee----ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCcc-----ccccccceeEE Confidence 665 458899999999999999999999999999999999999999999999999999875 77788999999 Q ss_pred EeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++++|+++++++|+|+++ ++..+++++|.++|++++++++|+++++|++.+.+....+........ ......... 
T Consensus 72 ~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~-~~~~~~~~~ 150 (298) T protein:vir:94 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDS-KVTQKVEAP 150 (298) T ss_pred EEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccc-ccccccccc Confidence 9999999999999999996 456789999999999999999999999996544433322222111111 111112222 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC--CCC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD--ADA 228 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~--~~~ 228 (305) ....+..+++.+++.++...+..+++|+||+.++..|+++||++|+|+|++ .+++|+||++++.++.. .++ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:94 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred cccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCc Confidence 233445567778888888888888899999999999999999999999975 47999999999998754 345 Q ss_pred ceEEEEehhhEE-EEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccc Q lcl|Aclame:pro 229 AIEVIADSSRVK-IGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) Q Consensus 229 ~~~~~gdf~~~~-~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~ 297 (305) ..+++|||++++ ++.+++++++++++.. .++..+++|++|++++|+++|+||.+.||++|++++++. 
T Consensus 231 ~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~--~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGD--PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeecCC--CcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 679999999864 8999999999988663 345567899999999999999999999999999999876 No 31 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=2e-56 Score=325.93 Aligned_cols=286 Identities=19% Similarity=0.224 Sum_probs=228.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ||+++ +++|++||+++.++|++.+++.++|+++|+++|++++..++|+.++.+.++|++|++. +|+++++|+++ T Consensus 1 Mat~t-t~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~-----~~~~~~~f~~v 74 (311) T protein:vir:99 1 MATFG-TGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQ-----KSSTTGEFDFV 74 (311) T ss_pred Cceec-CCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcc-----cccccceeeEE Confidence 99766 5678899999999999999999999999999999998999999999999999999975 67778899999 Q ss_pred EeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++++|+++++++|+||++ |+.++++++|.++|++++++++|+++|+|+|++.+..+.+...........++.... 
T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~- 153 (311) T protein:vir:99 75 TSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTAD- 153 (311) T ss_pred EEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccc- Confidence 9999999999999999994 778999999999999999999999999999977665555544433323232222221 Q ss_pred hhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC--- Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD--- 225 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~--- 225 (305) .......++..++..+... .+..+.|+||+.++..|+++||++|+|+|++ .+++|+|++++++++.. T Consensus 154 -~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~ 232 (311) T protein:vir:99 154 -TIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEA 232 (311) T ss_pred -ccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeeccccccccc Confidence 1112223334444433332 3456679999999999999999999999975 37999999999877521 Q ss_pred ---------CCCceEEEEehhhE-EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEec Q lcl|Aclame:pro 226 ---------ADAAIEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANK 295 (305) Q Consensus 226 ---------~~~~~~~~gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~ 295 (305) .+...+++|||+++ .++.++++++++++++. .+..+++|++|++++|++.|+||+|.||+ |++++. 
T Consensus 233 ~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~d~~~~r~~~r~d~~v~~~~-~v~~~~ 308 (311) T protein:vir:99 233 DPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGD---PDGQGDLKRHNQIALRLEIVYGWYVFTDR-FVVIEN 308 (311) T ss_pred ccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCC---CCcchhhhhcCcEEEEEEEeecceecChh-Heeeec Confidence 23446789999985 48899999999987653 34567889999999999999999999975 556655 Q ss_pred ccc Q lcl|Aclame:pro 296 TPV 298 (305) Q Consensus 296 t~~ 298 (305) ..| T Consensus 309 ~~A 311 (311) T protein:vir:99 309 AVA 311 (311) T ss_pred ccC Confidence 544 No 32 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=1.9e-56 Score=326.02 Aligned_cols=279 Identities=14% Similarity=0.101 Sum_probs=231.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccccc-cccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSK-VTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~-~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++.++..++|+.++.+.+.|++|++. 
+|.++ ++|++ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~-----~~~~~~~~f~~ 204 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQ-----RPQTNAATFQP 204 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeeccccc-----cccccccccce Confidence 8888999999999999999999999999999999999999999999999999999999999976 34443 58999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee------- Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE------- 152 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~------- 152 (305) ++++++|++++++||+|+++|+.++++++|.++|++++++++|++||+|+|++ .|.|+++.......... T Consensus 205 v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~ 281 (425) T protein:vir:10 205 LSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTN---KPNGLLTYIAGGANAAKHPFGAIE 281 (425) T ss_pred eeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCC---Ccceeeeccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999965 46677665443322111 Q ss_pred ecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC Q lcl|Aclame:pro 153 VVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD 225 (305) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~ 225 (305) ...........++.+.+++..+...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++++++|.. 
T Consensus 282 ~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~ 361 (425) T protein:vir:10 282 VVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDV 361 (425) T ss_pred cccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCc Confidence 01111112222444555666777777888899999999999999999999999975 37999999999998743 Q ss_pred -CCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 226 -ADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 226 -~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) .+..+++||||++ |++++|.++++... . +|.+|++.+|+..|+|+.+.+|+||+.++.+.+- T Consensus 362 ~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d--~----------~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 362 AANSTPILFGDFQQTYLIIDRIGVRVLRD--P----------YTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred cCCccEEEEEehhccEEEEEecceEEEec--c----------cccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 4556799999998 57888888776432 2 2568999999999999999999999999987544 No 33 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=2.4e-56 Score=325.56 Aligned_cols=275 Identities=15% Similarity=0.107 Sum_probs=231.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+.+++++||++||+++.++|++.+++.++|+++|++++++++.+++|+..+.+.+.|++|++..++ ...++|+++ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~----~~~~~~~~v 182 (401) T protein:vir:44 107 
LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQ----TATSRLGLI 182 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccCc----cccccceee Confidence 8889999999999999999999999999999999999999999999999999999999999975332 234689999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccc---------- Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQA---------- 150 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~---------- 150 (305) ++++||+++++++|+|+++|+.++++++|.++|++++++++|.+||+|+|++ .|.|+++........ T Consensus 183 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~---~p~Gil~~~~~~~~~~~~~~~~~~~ 259 (401) T protein:vir:44 183 EPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTK---KPKGFLAYESTEESDKARAFGKLQH 259 (401) T ss_pred eeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCC---ccceeeccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999975 456666544332211 Q ss_pred -eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcc Q lcl|Aclame:pro 151 -VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNG 222 (305) Q Consensus 151 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~ 222 (305) .+...+..+ ++.+.+++..+...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++++++ T Consensus 260 ~~t~~~~~~~----~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~ 335 (401) T protein:vir:44 260 IVSGEATAVT----ADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQM 335 (401) T ss_pred cccccccccC----HHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCc Confidence 111122222 445556666777777778899999999999999999999999974 36999999999998 Q ss_pred cc-CCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 223 AW-DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 
298 (305) Q Consensus 223 ~~-~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) |. ++++..++||||++ |.+.+|.++++..++ +|++|++.+|++.|+|+++.+|++|++++.+.+ T Consensus 336 p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~------------~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 336 PDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP------------YTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred CCccCCccEEEEeehhccEEEEEecceEEeeec------------cccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 75 34556688999986 678899998875432 367899999999999999999999999999876 No 34 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=6.8e-56 Score=323.03 Aligned_cols=281 Identities=12% Similarity=0.052 Sum_probs=226.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++..+|++.+++.++++++++++++++++++||+.++ .+.+.||+|++. +|+++++|++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~-----~~~s~~~f~~ 225 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGT-----YPFSSEEFAR 225 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcc-----ccccccccee Confidence 7788889999999999999999999999999999999999999999999876 478999999975 6778899999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecc---- Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG---- 155 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~---- 155 (305) +++.+||++++++||+||++|+. 
++++||.++|++++++++|.+||+|+|++. |.|+++............. T Consensus 226 i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~---p~Gil~~~~~~~~~~~~~~~~~~ 301 (497) T protein:vir:10 226 VYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRSTGFTASSASSLFGAT 301 (497) T ss_pred eEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCccc---ccccccccccccccccccchhhh Confidence 99999999999999999999975 699999999999999999999999999763 5666554432211110000 Q ss_pred ---------------------------------------------cchhhhHHHHHHHHHHHHhhh-ccccceEEEEchH Q lcl|Aclame:pro 156 ---------------------------------------------GVANESDIVGATNRAAKAVAS-AGWAPDTLLSSLA 189 (305) Q Consensus 156 ---------------------------------------------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~ 189 (305) ...+..+....+..+...+.. .++.+++|+||+. T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~ 381 (497) T protein:vir:10 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) T ss_pred hhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchH Confidence 000111222333333333333 3455678999999 Q ss_pred HHHHHHHhhccCCceeeccc-------------ccCccceEecCccccCCCCceEEEEehhh--EEEEeecCcEEEEeec Q lcl|Aclame:pro 190 LRYEVANIRDANGNPVFRDD-------------SFAGFRTFFNRNGAWDADAAIEVIADSSR--VKIGVRQDITVKFLDQ 254 (305) Q Consensus 190 ~~~~l~~~kd~~G~~l~~~~-------------~l~G~pv~~~~~~~~~~~~~~~~~gdf~~--~~~~~~~~i~v~~~~~ 254 (305) ++..|+++||++|+|+|+++ +++|+||++++.++ .+.++||||++ |.+++|++++|+++++ T Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~----~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~ 457 (497) T protein:vir:10 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP----LGTILVGHFAPSVIQTARREGVTMQMTNS 457 (497) T ss_pred HHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC----CCceEEeecccceEEEEEecccEEEeecc Confidence 99999999999999999752 68999999999987 35689999987 4468899999998765 Q ss_pred ceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 255 
ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .. +.|++|++++|++.|+|+.|.+|+||++++.+.++. || T Consensus 458 ~~--------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~---~~ 497 (497) T protein:vir:10 458 NG--------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT---GS 497 (497) T ss_pred cc--------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCcc---CC Confidence 32 349999999999999999999999999999975422 33 No 35 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=6.8e-56 Score=323.03 Aligned_cols=281 Identities=12% Similarity=0.052 Sum_probs=226.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++..+|++.+++.++++++++++++++++++||+.++ .+.+.||+|++. +|+++++|++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~-----~~~s~~~f~~ 225 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGT-----YPFSSEEFAR 225 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcc-----ccccccccee Confidence 7788889999999999999999999999999999999999999999999876 478999999975 6778899999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecc---- Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG---- 155 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~---- 155 (305) +++.+||++++++||+||++|+. ++++||.++|++++++++|.+||+|+|++. |.|+++............. 
T Consensus 226 i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~---p~Gil~~~~~~~~~~~~~~~~~~ 301 (497) T protein:vir:78 226 VYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRSTGFTASSASSLFGAT 301 (497) T ss_pred eEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCccc---ccccccccccccccccccchhhh Confidence 99999999999999999999975 699999999999999999999999999763 5666554432211110000 Q ss_pred ---------------------------------------------cchhhhHHHHHHHHHHHHhhh-ccccceEEEEchH Q lcl|Aclame:pro 156 ---------------------------------------------GVANESDIVGATNRAAKAVAS-AGWAPDTLLSSLA 189 (305) Q Consensus 156 ---------------------------------------------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~ 189 (305) ...+..+....+..+...+.. .++.+++|+||+. T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~ 381 (497) T protein:vir:78 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) T ss_pred hhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchH Confidence 000111222333333333333 3455678999999 Q ss_pred HHHHHHHhhccCCceeeccc-------------ccCccceEecCccccCCCCceEEEEehhh--EEEEeecCcEEEEeec Q lcl|Aclame:pro 190 LRYEVANIRDANGNPVFRDD-------------SFAGFRTFFNRNGAWDADAAIEVIADSSR--VKIGVRQDITVKFLDQ 254 (305) Q Consensus 190 ~~~~l~~~kd~~G~~l~~~~-------------~l~G~pv~~~~~~~~~~~~~~~~~gdf~~--~~~~~~~~i~v~~~~~ 254 (305) ++..|+++||++|+|+|+++ +++|+||++++.++ .+.++||||++ |.+++|++++|+++++ T Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~----~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~ 457 (497) T protein:vir:78 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP----LGTILVGHFAPSVIQTARREGVTMQMTNS 457 (497) T ss_pred HHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC----CCceEEeecccceEEEEEecccEEEeecc Confidence 99999999999999999752 68999999999987 35689999987 4468899999998765 Q ss_pred ceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 255 ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 
(305) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .. +.|++|++++|++.|+|+.|.+|+||++++.+.++. || T Consensus 458 ~~--------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~---~~ 497 (497) T protein:vir:78 458 NG--------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT---GS 497 (497) T ss_pred cc--------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCcc---CC Confidence 32 349999999999999999999999999999975422 33 No 36 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.7e-55 Score=319.79 Aligned_cols=271 Identities=18% Similarity=0.162 Sum_probs=219.2 Q ss_pred CCCccCCccceEccHHHHHH-HHHHHHhhhhhhhhcceeecCCC-ceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDT-LLAAAKQGSTVLSAFQNVNMGTK-TTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~-i~~~~~~~~~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) ...++++++|.++|+++.++ |++.+++.++++++|+++++.++ .+.+|+.++.+.+.|++|++. +|+++++|+ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~-----~~~~~~~f~ 184 (390) T protein:vir:62 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAE-----IPESYPATA 184 (390) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeeccccc-----cccccccee Confidence 23345555555555555555 55667777788899999999764 589999999999999999876 667788999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee-ecccc Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE-VVGGV 157 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 157 (305) ++++++||++++++||+|+++||.++++++|.++|++++++++|.+|++|+|+|. |+++.......... ...+. 
T Consensus 185 ~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~-----Gi~~~~~~~~~~~~~~~~~~ 259 (390) T protein:vir:62 185 QRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPR-----GILTDASPATATFLATDTDS 259 (390) T ss_pred eeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccc-----cccccccccccceecccccc Confidence 9999999999999999999999999999999999999999999999999999654 44443332222221 11222 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc-------ccCccceEecCccccCCCCce Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD-------SFAGFRTFFNRNGAWDADAAI 230 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~-------~l~G~pv~~~~~~~~~~~~~~ 230 (305) .++++ +.++...+...+...+.|+||+..+..|+++||++|+|+|+++ +|+|+||++++++| ... T Consensus 260 ~~~~~----l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p----~~~ 331 (390) T protein:vir:62 260 KVSDA----LIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMP----ADK 331 (390) T ss_pred cchHH----HHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEecCCC----Ccc Confidence 33344 4445555566666667899999999999999999999999753 69999999999886 456 Q ss_pred EEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 231 EVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 231 ~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) ++||||++|+++++++++++.+.+. 
+|.+|++.+|++.|+|+++.+|+||+.++.+++| T Consensus 332 i~~gd~s~~~i~~~~~~~v~~~~~~----------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 332 ILFADLSKYRVRFAGSLRVDRSVDA----------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred EEEeeccceeEEeecceEEEeeccc----------cccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 8999999999999999999987764 3889999999999999999999999999998877 No 37 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=6.7e-55 Score=317.59 Aligned_cols=283 Identities=12% Similarity=0.077 Sum_probs=236.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccc-cccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVK-PTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~-~~~~~~f~~ 79 (305) ...++.++||.++|+++.++|++.+++.++++++|+++|++++...+|+..+.+.+.|++|++..++.+. ..++++|++ T Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~ 241 (458) T protein:vir:10 162 NQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKE 241 (458) T ss_pred hhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeeccccccccccccccccccccee Confidence 2334556789999999999999999999999999999999999999999999999999999988776554 345678999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee-ecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE-VVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 158 (305) +++.++|++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|++ .|.|+++.......... ...... 
T Consensus 242 i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~---~p~Gi~~~~~~~~~~~~~~~~~~~ 318 (458) T protein:vir:10 242 IHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSG---KPKGLLTLASEDSAKVVTEAKADG 318 (458) T ss_pred eEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC---ccceeeecccccccceeecccccc Confidence 99999999999999999999999999999999999999999999999999875 56777765543332211 112111 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-----------cccCccceEecCccccCCC Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-----------DSFAGFRTFFNRNGAWDAD 227 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-----------~~l~G~pv~~~~~~~~~~~ 227 (305) .....++.+.+++..+...++.++.|+||+.++..|+++||++|+|+|++ .+++|+||++++.+|...+ T Consensus 319 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~ 398 (458) T protein:vir:10 319 SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKAN 398 (458) T ss_pred cccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccC Confidence 11222455566677777788888999999999999999999999999853 3699999999999998888 Q ss_pred CceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 228 AAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 228 ~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) +..++||||++ |.++++.++++.+++ ++.+|++.+|++.|+|+.|.+|++|++.+.+.+ T Consensus 399 ~~~~~~~~f~~~~~~~~~~~~~v~~d~------------~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 399 SAEFAVIVYKDNFVMPRQRAVTVERER------------QAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CcceEEEEecccEEEEEeeceEEEeec------------ccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 88999999965 678999999886532 245789999999999999999999999877655 No 38 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: 
family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=1.1e-54 Score=316.37 Aligned_cols=284 Identities=14% Similarity=0.118 Sum_probs=228.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecC----CCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG----TKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~----~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) |...+.+.||+++|+++.++|++.+++.+++++++...... .+++++|+.++.+.++||+|++. +|+++++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~-----~~~s~~~ 412 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKT-----KPLTKFD 412 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCcc-----ccccccc Confidence 33444455889999999999999999999999997653322 24689999999999999999875 7788899 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccC-cCcccccccccccccccceeecc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKP-ASWVSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~-~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) |+++++++||+++++++|+||++|+.++++++|.++|++++++++|.+||+|+|.+ .+..|.+++.... .... 
T Consensus 413 f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~------~~~~ 486 (645) T protein:vir:93 413 FESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVK------GTAS 486 (645) T ss_pred eeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceecccc------cccc Confidence 99999999999999999999999999999999999999999999999999998864 3344555443221 1111 Q ss_pred cchhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHHhhccCCceeec-----ccccCccceEecCccccCCCC Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVANIRDANGNPVFR-----DDSFAGFRTFFNRNGAWDADA 228 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~-----~~~l~G~pv~~~~~~~~~~~~ 228 (305) +.....++.. ++..+... ....++|+|||.++..|+++||++|+|+|. +++|+|+||+++++++. T Consensus 487 ~~~~~~d~~~----~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~---- 558 (645) T protein:vir:93 487 SGNPDADAEA----AFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMTLLGGSFQGLPVIVSQYVGD---- 558 (645) T ss_pred ccchHHHHHH----HHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCCCCCceeeceeeEEeccCCc---- Confidence 2222333333 33333322 234568999999999999999999999984 35899999999999874 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCc------------ceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGE------------NQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~------------~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .++||||++++++.++++.+.+++++++.... ..+++|++||+++|+++|+||++.||+||++++.. 
T Consensus 559 -~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 559 -QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred -ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 36899999999999999999999999875332 34678999999999999999999999999999987 Q ss_pred ccccccCC Q lcl|Aclame:pro 297 PVAVVAPA 304 (305) Q Consensus 297 ~~a~v~~a 304 (305) .-+.-..+ T Consensus 638 ~~g~~~~~ 645 (645) T protein:vir:93 638 NYGSASGG 645 (645) T ss_pred cCCcccCC Confidence 66554444 No 39 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=8.9e-55 Score=316.90 Aligned_cols=280 Identities=11% Similarity=0.083 Sum_probs=232.4 Q ss_pred CCCccCCccceEccHHHHHHHH-HHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLL-AAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~-~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) ...+++++||++||+++..+++ +.+++.++++++++++++ ++.+.+|+.++.+.+.|++|++. 
+++++++|++ T Consensus 250 ~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~~~ 323 (543) T protein:vir:81 250 AMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWHGVSSAAVQWSWDAEFEE-----VSDDSPEFGQ 323 (543) T ss_pred hcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEEEEecCCcceeecccCcc-----ccccccccce Confidence 2346777899999999998876 667888999999998776 56789999999999999999976 5677889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee-ecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE-VVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 158 (305) ++++++|++++++||+|+++|+ +++.++|.++|++++++++|.+||+|+|++. .|.|+++.......... ...... T Consensus 324 i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~--~p~Gi~~~~~~~~~~~~~~~~~~~ 400 (543) T protein:vir:81 324 PEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGN--QPTGIVTALAGTAAEIAPVTAETF 400 (543) T ss_pred eeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--ccccchhhcccccccccccccccc Confidence 9999999999999999999998 6999999999999999999999999998653 56777665443322221 222222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccC------C Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWD------A 226 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~------~ 226 (305) .++++..++..+...+...+.|+||+.++..|+++||++|+|+|.+ ++++|+||+++++++.. . 
T Consensus 401 ----~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~ 476 (543) T protein:vir:81 401 ----ALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASA 476 (543) T ss_pred ----cHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccccceeeEEeccccccccccccC Confidence 3445556666777777777899999999999999999999999974 47999999999998754 2 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) +..+++||||++|+++++++++++++.+.... ..|.+|++.+|++.|+||.+.+|+||++++.+.+| T Consensus 477 ~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~------~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 477 DNFVLLYGNFQNYVIADRIGMTVEFIPHLFGT------NRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred CcceEEEeeccceeEEeecccEEEEecccccc------chhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 55679999999999999999999988765422 23778999999999999999999999999998877 No 40 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.9e-54 Score=315.11 Aligned_cols=273 Identities=18% Similarity=0.174 Sum_probs=224.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHh-hhhhhhhcceeecCCC-ceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQ-GSTVLSAFQNVNMGTK-TTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~-~~~l~~l~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) ...++++++|.++|+++.++++..+.. .++++++++++++.++ .+.+|+.++.+.+.|++|++. 
+|+++++|+ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~f~ 184 (392) T protein:vir:13 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAE-----IPESYPATT 184 (392) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeeccccc-----cccccccee Confidence 334555666667777777777666555 5667788999988654 589999999999999999975 677888999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee-ecccc Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE-VVGGV 157 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 157 (305) +++++++|++++++||+|+++|+.++++++|.++|++++++++|.+||+|+|++ .|.|+++.......... ...+. T Consensus 185 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~---~p~Gil~~~~~~~~~~~~~~~~~ 261 (392) T protein:vir:13 185 QRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTG---QPRGILTDATGANAAFGEADADS 261 (392) T ss_pred eEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCc---ccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999865 46777766543332222 11222 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccCCCCce Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWDADAAI 230 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~~~~~~ 230 (305) ..+ +.+.++...+...+...+.|+||+.++..|+++||++|+|+|++ .+|+|+||+++++++ +++ T Consensus 262 ~~~----d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~----~~~ 333 (392) T protein:vir:13 262 KVS----DALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMP----ADK 333 (392) T ss_pred ccH----HHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCCC----CCc Confidence 333 34445555666666777889999999999999999999999975 379999999999987 456 Q ss_pred 
EEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 231 EVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 231 ~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) ++||||++|+++++++++++.+.+. +|.+|++.+|++.|+|+.+.||+||+.++.+++| T Consensus 334 i~~Gdf~~~~i~~~~~~~i~~~~~~----------~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 334 VLFADLSKYRVRFAGSLRVDRSVDA----------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EEEeeccceeEEeecceEEEeeccc----------cccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 9999999999999999999887664 3789999999999999999999999999998877 No 41 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1.5e-54 Score=315.67 Aligned_cols=276 Identities=19% Similarity=0.166 Sum_probs=227.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccccc-cccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSK-VTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~-~~f~~ 79 (305) ++..++++||++||+++.++|++.+++.++|+++|+++++++ .+++|+..+.+.+.|++|++.. 
|.++ ++|++ T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~~~~a~~v~E~~~~-----~~~~~~~f~~ 211 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTDTSPATWIEQSGAL-----PTGDVGTIAS 211 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecCCcccccccccccc-----ccccccccce Confidence 455566779999999999999999999999999999999865 6799999999999999999864 4343 47999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) +++++||++++++||+|+++|+.+++++||.++|++++++++|+++|+|+|++++ .|.|+++....... ........+ T Consensus 212 i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~-~p~Gil~~~~~~~~-~~~~~~~~~ 289 (425) T protein:vir:95 212 IDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANK-QPLGIIPSLPPENQ-VTVEADNNL 289 (425) T ss_pred eeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc-ccceeecccccccc-cccccccch Confidence 9999999999999999999999999999999999999999999999999997644 56777765443332 222333444 Q ss_pred hhHHHHHHHHHHHHhhhcc--ccceEEEEchHHH----HHHHHhhccCCceeec-----ccccCccceEecCccccCCCC Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAG--WAPDTLLSSLALR----YEVANIRDANGNPVFR-----DDSFAGFRTFFNRNGAWDADA 228 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~--~~~~~~v~~~~~~----~~l~~~kd~~G~~l~~-----~~~l~G~pv~~~~~~~~~~~~ 228 (305) ++++.+.+ ..+...+ ...+.|+||+.++ ..|+++||++|+|+|+ .++++|+||++++.++ + T Consensus 290 ~~~~~~~~----~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~~----~ 361 (425) T protein:vir:95 290 LKNLVKQI----GLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLD----D 361 (425) T ss_pred HHHHHHHH----HhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcCC----C Confidence 44444443 3333322 3456799998864 3467889999999986 3579999999999986 4 Q ss_pred 
ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++++||||++|++++|++++++++++.+ |.+|++++|+..|+|+.+.+|+||+.++.|+ |+.|| T Consensus 362 ~~i~~Gd~~~~~~~~~~~~~i~~~~~~~----------f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~--~~~g~ 425 (425) T protein:vir:95 362 DTVLFGEFEQYTLVERENITIDSSTHVK----------FTEDQTAFRGKGRFDGKPVKPEAFVLVTITD--PVQGA 425 (425) T ss_pred ccEEEEecccEEEEeecceEEEeecccc----------cccCceEEEEEEeeCcEeecccceEEEEecC--cCCCC Confidence 5699999999999999999999987753 7889999999999999999999999999874 67788 No 42 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=6.4e-54 Score=312.19 Aligned_cols=270 Identities=16% Similarity=0.106 Sum_probs=232.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC-Cceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL-PEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +.+.+++++|.++|+++.++|++.+++.++|++++++++++++.+++|+.++. +.+.|++|++. 
+|+++++|++ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~~~ 187 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGAL-----KPESSLKFAK 187 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCcc-----ccccccceeE Confidence 66778888899999999999999999999999999999999999999999764 78999999875 6777889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|+++++++|+|+++|+. +++++|.++|++++++++|.+||+|+|++. .+.|+++.+...... .... T Consensus 188 i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~~~d~a~l~G~g~~~--~p~Gi~~~~~~~~~~-----~~~~ 259 (390) T protein:vir:97 188 KTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIPQATTYAAP-----TTIA 259 (390) T ss_pred EEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCc--cccceeecccccccc-----cccc Confidence 99999999999999999999975 799999999999999999999999998754 467777654333221 1122 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) ....++.+.++...+...++.+++|+|||.++..|+++||++|+|+|++ ++++|+||++++.++ +++++| T Consensus 260 ~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~----~~~~~~ 335 (390) T protein:vir:97 260 GATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMA----PGEFLV 335 (390) T ss_pred ccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceeeEEcCCCC----CCcEEE Confidence 2334556677777888888899999999999999999999999999964 479999999999886 457999 Q ss_pred 
Eehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 234 gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) |||++ |.++++++++++.+++.. .|++|++++|++.|+||.+.+|+||++++.+ T Consensus 336 gd~~~~~~~~~~~~~~i~~~~~~~---------~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 336 GAFDLAAQIFDQWDARVEIGYVND---------DFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EeccceEEEEEecceEEEEeeccc---------ccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 99997 667899999999876542 3889999999999999999999999999998 No 43 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=7.4e-54 Score=311.87 Aligned_cols=276 Identities=14% Similarity=0.137 Sum_probs=231.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) ....+++++|.+||+++.++|++.+++.++|++++++++++++++++|+..+ .+.+.|++|++. 
+++++++|++ T Consensus 135 ~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~f~~ 209 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQ-----KPTSDLKFNL 209 (418) T ss_pred hccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCcc-----ccccccceee Confidence 4455677789999999999999999999999999999999999999999876 588999999875 6777889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) +++.++|++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++. .|.|+++......... ...+. T Consensus 210 v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~--~p~Gi~~~~~~~~~~~-~~~~~-- 283 (418) T protein:vir:10 210 KNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEEGQILKGDGTGA--NILGILPQASAFMPSI-TLANA-- 283 (418) T ss_pred EEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccccccccccccc-ccccc-- Confidence 99999999999999999999875 899999999999999999999999998764 3667776554332221 11222 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) ..++++..++..+...++.+++|+|||.++..|+++||++|+|+|.+ ++++|+||++++++| .+.++| T Consensus 284 --~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p----~~~~~~ 357 (418) T protein:vir:10 284 --TPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMT----ANEFLV 357 (418) T ss_pred --ccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCC----CCcEEE Confidence 23445556666777788888899999999999999999999999963 579999999999987 456899 Q ss_pred 
Eehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 234 gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) |||++ |+++++++++++++++.. .+|++|++.+|++.|+||.+.+|++|++++.+++ +.+ T Consensus 358 gd~s~~~~~~~~~~~~i~~~~~~~--------~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~--~~g 418 (418) T protein:vir:10 358 GAFSMAAQIFDRMEIEVLLSTENV--------DDFEKNMVSIRAEERLALAVYRPESFVTGALVEQ--AGG 418 (418) T ss_pred eeccceEEEEEecceEEEEecccc--------hhhhcCceEEEEEEeeccEEecccceEEEEeccC--CCC Confidence 99997 668889999998876542 3589999999999999999999999999998743 222 No 44 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=7.5e-54 Score=311.82 Aligned_cols=280 Identities=14% Similarity=0.110 Sum_probs=231.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCC-Cceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATL-PEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~-~~~p~~~~~-~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |..+++++||++||+++.++|++.+++.++|+++|++++++++. ..+|+..+. ..+.|++|++. +|+++.+|. 
T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~-----~~~~~~~f~ 191 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEE-----AGEEDTDFG 191 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcccccccccccc-----ccccccccc Confidence 77788889999999999999999999999999999999998765 445555543 56789999875 677788999 Q ss_pred eEEeeeeeEE-EeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 79 NRTLVAEEIA-VIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 79 ~v~~~~~k~~-~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++.++|++ +++++|+|+++|+.+++++||.++|++++++++|++||+|+|++.+..|.|+++....... ....+. T Consensus 192 ~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~--~~~~~~ 269 (409) T protein:vir:45 192 MGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQ--TAAANA 269 (409) T ss_pred eeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccc--cccccc Confidence 9999999985 6789999999999999999999999999999999999999999888888888876554322 222333 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceE--EEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcccc-CCC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDT--LLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAW-DAD 227 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~-~~~ 227 (305) .+++++. +++..+...+...+. |+||+.++..|+++||++|+|+|++ .+++|+||++++++|. 
+++ T Consensus 270 ~~~d~i~----~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~ 345 (409) T protein:vir:45 270 VKWQEIL----ALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAG 345 (409) T ss_pred cchHHHH----HHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCC Confidence 4444444 444555555544444 5789999999999999999999975 3799999999999885 345 Q ss_pred CceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 228 AAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 228 ~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) +.+++||||++|++++++++.++.+++.+ |++|++.||++.|+|+.+.+|+||+.++.++++- | T Consensus 346 ~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~----------~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~---~ 409 (409) T protein:vir:45 346 KKFMFCGDFDRFIIRRVRYMILKRLVERY----------AEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVG---G 409 (409) T ss_pred ccEEEEeehhhhheeeccceEEEEeeccc----------ccCCcEEEEEEEEeccEeechhheEEEEeccCCC---C Confidence 56789999999999999999999877653 6789999999999999999999999999875422 1 No 45 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.3e-53 Score=310.58 Aligned_cols=275 Identities=15% Similarity=0.116 Sum_probs=234.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +...+++++|.++|+++.++|++.+++.++|+++|++++++++.+++|+.++ .+.+.|++|++. 
+|+++++|++ T Consensus 113 ~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~~ 187 (395) T protein:vir:43 113 AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQ-----KPYSDLTFEL 187 (395) T ss_pred hhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCcc-----ccccccceeE Confidence 4455667788999999999999999999999999999999999999999876 478999999875 6778889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++.+ +.|+++........ ...... T Consensus 188 i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~~~~~~~~~---~~~~~~ 261 (395) T protein:vir:43 188 ENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNGTGAN--LHGIIPQAQAYAPP---SGVVVT 261 (395) T ss_pred EEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--cccccccccccccc---cccccc Confidence 99999999999999999999875 6999999999999999999999999987543 56666654433322 122233 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) ....++.+.+++..+...++.+++|+|||.++..|+++||++|+|+|++ ++++|+||+++++++ ++.++| T Consensus 262 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~----~~~~~~ 337 (395) T protein:vir:43 262 AEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTPTLWRLPVVETQAIT----QDEFLT 337 (395) T ss_pred cchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCceecceeeEEcCCCC----CCcEEE Confidence 3345667777888888888888999999999999999999999999963 479999999999886 456899 Q ss_pred 
Eehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 234 gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) |||++ |.+++|++++|+++++. ...|++|++.+|++.|+||++.+|++|++++.+++ T Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~--------~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 338 GAFSLGAQIFDRMDIEVLVSTEN--------DKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EeccceEEEEEecceEEEEeccc--------cchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 99998 55788999999887653 23589999999999999999999999999999877 No 46 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=1.2e-53 Score=310.74 Aligned_cols=269 Identities=14% Similarity=0.044 Sum_probs=225.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceE--EEEEeC-CCceeeeecchhhccccccccc-cc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH--LPVLAT-LPEADWVGESATDPKGVKPTSK-VT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~--~p~~~~-~~~a~~v~E~~~~~~~~~~~~~-~~ 76 (305) |+.+++++||.+||+++.++|++.+++.++|+++|+++++++++.+ +|+..+ .+.+.|++|++. 
+|+++ ++ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~ 183 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQ-----IGQNDDPK 183 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccc-----cccccccc Confidence 8899999999999999999999999999999999999999876554 555543 467999999876 44443 68 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+. .+ T Consensus 184 ~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~-------------------~~ 244 (397) T protein:vir:49 184 LSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNK-------------------PT 244 (397) T ss_pred eeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999999875431 11 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecC--cccc-CC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNR--NGAW-DA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~--~~~~-~~ 226 (305) ..+++ .+.+++.++...+..++.|+|||.++..|+++||++|+|+|++ .+++|+||++.+ .++. .. 
T Consensus 245 ~~~~d----~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~ 320 (397) T protein:vir:49 245 LAKWD----DIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTG 320 (397) T ss_pred ccCHH----HHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccccccC Confidence 12333 4455666777788888999999999999999999999999964 479999998754 3443 34 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |+++++++++++++++.. .+|++|++.+|++.|+|+.+.+|++|++++.++.+..+|+. T Consensus 321 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 392 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG--------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAKL 392 (397) T ss_pred CceeEEEeeccceEEEEeecccEEEEecccc--------chhhcCeeeEEEEEeeccEEecccceEEEEecccccccCcc Confidence 567899999997 668999999999877542 45889999999999999999999999999988766545544 No 47 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1.9e-53 Score=309.63 Aligned_cols=270 Identities=14% Similarity=0.079 Sum_probs=224.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE---eCCCceeeeecchhhccccccc-cccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL---ATLPEADWVGESATDPKGVKPT-SKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~---~~~~~a~~v~E~~~~~~~~~~~-~~~~ 76 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++++++..+|+. +..+.+.|++|++. 
+|+ +.++ T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~ 190 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGK-----IPDLDNPQ 190 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccc-----cccccCcc Confidence 88889999999999999999999999999999999999998776666554 34477899999976 443 3478 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++... .+ T Consensus 191 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~-------------------~~ 251 (408) T protein:vir:10 191 LTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-------------------PT 251 (408) T ss_pred eeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999999865421 11 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc--ccc-CC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GAW-DA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~--~~~-~~ 226 (305) ..+++++.+.+ ...+...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++.++ ++. +. 
T Consensus 252 ~~~~~~l~~~~---~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~ 328 (408) T protein:vir:10 252 IAKFDDVITMI---NTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGS 328 (408) T ss_pred cccHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCC Confidence 12334443332 34455666677789999999999999999999999975 3799999998654 443 34 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |.+++|++++++++++.+ ..|++|++.+|++.|+|+.+.+|++|++++.+++++..|.. T Consensus 329 ~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~--------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 400 (408) T protein:vir:10 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGA--------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) T ss_pred CceEEEEEehhccEEEEEecceEEEEccccc--------chhhcCceEEEEEEeeccEEeccccEEEEEeeccccCCCCC Confidence 556799999997 568999999999877643 45899999999999999999999999999999877655555 No 48 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.6e-53 Score=310.02 Aligned_cols=270 Identities=16% Similarity=0.118 Sum_probs=231.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC-Cceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL-PEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +..++++++|.++|+++..+|++.+++.++|+++|++++++++.+++|+.++. +.+.|++|++. 
+|+++++|++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~~~ 187 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGAL-----KPESSLKFAK 187 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCcc-----cccccceeeE Confidence 55667778888999999999999999999999999999999999999999765 68999999875 6778889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|+++.+++|+|+++|+. +++++|.++|++++++++|.+||+|+|++. .+.|+++......... ... T Consensus 188 i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~--~~~Gi~~~~~~~~~~~-----~~~ 259 (390) T protein:vir:81 188 KTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIPQATTYAAPT-----TIA 259 (390) T ss_pred EEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cccceeeccccccccc-----ccc Confidence 99999999999999999999975 799999999999999999999999998754 3567766544332211 122 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) ....++.+.+++..+...++.+++|+|||.++..|+++||++|+|+|++ ++++|+||++++.+| +++++| T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p----~~~~~~ 335 (390) T protein:vir:81 260 GATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMA----PGEFLV 335 (390) T ss_pred cchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCCC----CCcEEE Confidence 2233556677778888888889999999999999999999999999975 479999999999886 456999 Q ss_pred 
Eehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 234 gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) |||++ |++.++++++++.+++.. +|++|++.+|++.|+||.+.+|+||++++.+ T Consensus 336 gd~~~~~~~~~~~~~~v~~~~~~~---------~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 336 GAFDLAAQIFDQWDARVEIGYVGE---------DFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EehhceEEEEEecceEEEEecccc---------hhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 99997 567889999998876532 4889999999999999999999999999998 No 49 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=3.1e-53 Score=308.42 Aligned_cols=270 Identities=16% Similarity=0.111 Sum_probs=229.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC-Cceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL-PEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +...+++++|.++|+++.++|++.+++.++|+++|++++++++++++|+.++. +.+.|++|++. 
+|+++++|++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~~~ 187 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGAL-----KPESSLKFAK 187 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCcc-----ccccccceeE Confidence 44455556677888889999999999999999999999999999999998864 78999999875 6677889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|+++++++|+|+++|+. +++++|.++|++++++++|+++|+|+|++. .|.|+++.+........ .. T Consensus 188 i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~~~--~p~Gi~~~~~~~~~~~~-----~~ 259 (390) T protein:vir:10 188 KTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAND--GLLGLIPQATTYAAPTT-----IA 259 (390) T ss_pred EEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCc--ccccccccccccccccc-----cc Confidence 99999999999999999999976 899999999999999999999999998754 46777766543332211 12 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) .....+.+..++..+...++..++|+|||.++..|+++||++|+|+|++ ++++|+||++++.+| .+.++| T Consensus 260 ~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p----~~~~~~ 335 (390) T protein:vir:10 260 GATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMA----PGEFLV 335 (390) T ss_pred ccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCC----CCcEEE Confidence 2233456667777888888889999999999999999999999999974 478999999999887 456899 Q ss_pred 
EehhhE-EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 234 ADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 234 gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) |||+++ .+.++++++++.+++. ..|++|++.+|++.|+||.+.+|+||++++.+ T Consensus 336 gdf~~~~~~~~~~~~~i~~~~~~---------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 336 GAFDLAAQIFDQWDARVEIGYVN---------DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EeccceEEEEEecceEEEEeecc---------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 999974 5788999999887654 23889999999999999999999999999998 No 50 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=3.1e-53 Score=308.48 Aligned_cols=270 Identities=14% Similarity=0.076 Sum_probs=223.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE--Ee-CCCceeeeecchhhcccccccc-ccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPV--LA-TLPEADWVGESATDPKGVKPTS-KVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~--~~-~~~~a~~v~E~~~~~~~~~~~~-~~~ 76 (305) .+.+++++||++||+++.++|++.+++.++|+++|+++++++++..+++ .. ..+.+.|++|++. 
++++ .++ T Consensus 107 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~ 181 (395) T protein:vir:38 107 SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESAL-----IGDNDDPE 181 (395) T ss_pred hccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccc-----cccccccc Confidence 4455666789999999999999999999999999999999876656554 33 3467899999976 4433 478 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.. + T Consensus 182 f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~-------------------~ 242 (395) T protein:vir:38 182 LTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKP-------------------T 242 (395) T ss_pred eeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-------------------c Confidence 999999999999999999999999999999999999999999999999999998764321 1 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccc--cCCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGA--WDAD 227 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~--~~~~ 227 (305) ..+++++.+.+. 
..+...+...+.|+|||.++..|+++||++|+|+|++ .+++|+||+++++++ ...+ T Consensus 243 ~~~~~~i~~~~~---~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 319 (395) T protein:vir:38 243 ISQFDNIKDLEN---NTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSG 319 (395) T ss_pred cccHHHHHHHHH---HhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCC Confidence 123334433322 3455566677889999999999999999999999964 479999999987643 3456 Q ss_pred CceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 228 AAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 228 ~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) +..++||||++ |+++++++++++++++.. ..|++|++.+|++.|+|+.+.+|.+|++++.++++..+|++ T Consensus 320 ~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 390 (395) T protein:vir:38 320 SHPLYFGDLKQGITLFDRQQMQIDTTNVGA--------GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQGT 390 (395) T ss_pred cceEEEEeccccEEEEEecceEEEEecccc--------chhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCCc Confidence 67899999997 678999999999877642 45899999999999999999999999999999887766666 No 51 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=5.1e-53 Score=307.29 Aligned_cols=269 Identities=16% Similarity=0.136 Sum_probs=222.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) |+.+++++||.+||+++.++|++.+++.++|+++|+++++++++ +.+|+.++.+.+.|++|++. ++. 
+.++| T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGE-----IPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccc-----ccccccccc Confidence 88888899999999999999999999999999999999998665 45666677889999999876 443 34799 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++... +. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~--------------------~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--------------------AI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------Cc Confidence 9999999999999999999999999999999999999999999999999999865321 12 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEec-Ccc-c----c Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFN-RNG-A----W 224 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~-~~~-~----~ 224 (305) .+++++.+.+. ..+...+..++.|+|||.++..|+++||++|+|+|++ .+++|.|+++. +.+ + . 
T Consensus 241 ~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 241 KSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred cCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 23344444332 3556666777889999999999999999999999964 37899876543 222 2 2 Q ss_pred CCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 225 DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 225 ~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..++..++||||++ |++++|++++++++++.. ..|++|++.+|++.|+||++.+|++|++++.++++|..+ T Consensus 318 ~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~--------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred cCCceEEEEEehhceEEEEeecceEEEEecccc--------chhhcCceEEEEEEeeccEEecccceEEEEecccccccC Confidence 34566799999998 568899999999876532 358999999999999999999999999999988777764 Q ss_pred CC Q lcl|Aclame:pro 304 AA 305 (305) Q Consensus 304 a~ 305 (305) .+ T Consensus 390 ~~ 391 (392) T protein:vir:10 390 PQ 391 (392) T ss_pred CC Confidence 44 No 52 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=5.1e-53 Score=307.29 Aligned_cols=269 Identities=16% Similarity=0.136 Sum_probs=222.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) 
|+.+++++||.+||+++.++|++.+++.++|+++|+++++++++ +.+|+.++.+.+.|++|++. ++. +.++| T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGE-----IPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccc-----ccccccccc Confidence 88888899999999999999999999999999999999998665 45666677889999999876 443 34799 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++... +. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~--------------------~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--------------------AI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------Cc Confidence 9999999999999999999999999999999999999999999999999999865321 12 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEec-Ccc-c----c Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFN-RNG-A----W 224 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~-~~~-~----~ 224 (305) .+++++.+.+. ..+...+..++.|+|||.++..|+++||++|+|+|++ .+++|.|+++. +.+ + . 
T Consensus 241 ~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 241 KSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred cCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 23344444332 3556666777889999999999999999999999964 37899876543 222 2 2 Q ss_pred CCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 225 DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 225 ~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..++..++||||++ |++++|++++++++++.. ..|++|++.+|++.|+||++.+|++|++++.++++|..+ T Consensus 318 ~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~--------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred cCCceEEEEEehhceEEEEeecceEEEEecccc--------chhhcCceEEEEEEeeccEEecccceEEEEecccccccC Confidence 34566799999998 568899999999876532 358999999999999999999999999999988777764 Q ss_pred CC Q lcl|Aclame:pro 304 AA 305 (305) Q Consensus 304 a~ 305 (305) .+ T Consensus 390 ~~ 391 (392) T protein:vir:10 390 PQ 391 (392) T ss_pred CC Confidence 44 No 53 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=5.1e-53 Score=307.29 Aligned_cols=269 Identities=16% Similarity=0.136 Sum_probs=222.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) 
|+.+++++||.+||+++.++|++.+++.++|+++|+++++++++ +.+|+.++.+.+.|++|++. ++. +.++| T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGE-----IPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccc-----ccccccccc Confidence 88888899999999999999999999999999999999998665 45666677889999999876 443 34799 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++... +. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~--------------------~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--------------------AI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------Cc Confidence 9999999999999999999999999999999999999999999999999999865321 12 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEec-Ccc-c----c Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFN-RNG-A----W 224 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~-~~~-~----~ 224 (305) .+++++.+.+. ..+...+..++.|+|||.++..|+++||++|+|+|++ .+++|.|+++. +.+ + . 
T Consensus 241 ~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 241 KSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred cCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 23344444332 3556666777889999999999999999999999964 37899876543 222 2 2 Q ss_pred CCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 225 DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 225 ~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..++..++||||++ |++++|++++++++++.. ..|++|++.+|++.|+||++.+|++|++++.++++|..+ T Consensus 318 ~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~--------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred cCCceEEEEEehhceEEEEeecceEEEEecccc--------chhhcCceEEEEEEeeccEEecccceEEEEecccccccC Confidence 34566799999998 568899999999876532 358999999999999999999999999999988777764 Q ss_pred CC Q lcl|Aclame:pro 304 AA 305 (305) Q Consensus 304 a~ 305 (305) .+ T Consensus 390 ~~ 391 (392) T protein:vir:10 390 PQ 391 (392) T ss_pred CC Confidence 44 No 54 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=5.1e-53 Score=307.29 Aligned_cols=269 Identities=16% Similarity=0.136 Sum_probs=222.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) 
|+.+++++||.+||+++.++|++.+++.++|+++|+++++++++ +.+|+.++.+.+.|++|++. ++. +.++| T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGE-----IPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccc-----ccccccccc Confidence 88888899999999999999999999999999999999998665 45666677889999999876 443 34799 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++... +. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~--------------------~~ 240 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--------------------AI 240 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------Cc Confidence 9999999999999999999999999999999999999999999999999999865321 12 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEec-Ccc-c----c Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFN-RNG-A----W 224 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~-~~~-~----~ 224 (305) .+++++.+.+. ..+...+..++.|+|||.++..|+++||++|+|+|++ .+++|.|+++. +.+ + . 
T Consensus 241 ~~~d~i~~~~~---~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 241 KSLDDIKDVLN---VKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred cCHHHHHHHHH---HhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 23344444332 3556666777889999999999999999999999964 37899876543 222 2 2 Q ss_pred CCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 225 DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 225 ~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..++..++||||++ |++++|++++++++++.. ..|++|++.+|++.|+||++.+|++|++++.++++|..+ T Consensus 318 ~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~--------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~ 389 (392) T protein:vir:10 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) T ss_pred cCCceEEEEEehhceEEEEeecceEEEEecccc--------chhhcCceEEEEEEeeccEEecccceEEEEecccccccC Confidence 34566799999998 568899999999876532 358999999999999999999999999999988777764 Q ss_pred CC Q lcl|Aclame:pro 304 AA 305 (305) Q Consensus 304 a~ 305 (305) .+ T Consensus 390 ~~ 391 (392) T protein:vir:10 390 PQ 391 (392) T ss_pred CC Confidence 44 No 55 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=3.8e-53 Score=308.00 Aligned_cols=273 Identities=15% Similarity=0.148 Sum_probs=231.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |.. 
+++.+|.++|+++..+|++.+++.++|+++|++++++++++++|+.++ .+.+.|++|++. +|+++++|++ T Consensus 105 ~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~~ 178 (385) T protein:vir:18 105 LGS-DADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKAL-----KPESDITFSK 178 (385) T ss_pred hcc-ccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCcc-----ccccccceeE Confidence 444 344456788899999999999999999999999999998999999875 578999999865 6778889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++.+ +.|+++......... ... T Consensus 179 ~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~~~~~~~~~~-----~~~ 250 (385) T protein:vir:18 179 QTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDN--LEGLNKVATAYDTSL-----NAT 250 (385) T ss_pred EEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--ccccccccccccccc-----ccc Confidence 99999999999999999999875 6999999999999999999999999987654 456665544332221 122 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) ....++.+.+++.++...++.++.|+|||.++..|+++||++|+|+|++ .+++|+||++++.+| ++.++| T Consensus 251 ~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p----~~~~~~ 326 (385) T protein:vir:18 251 GDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQA----AGTFTV 326 (385) T ss_pred ccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCC----CCcEEE Confidence 
2334566777778888888889999999999999999999999999964 579999999999986 456999 Q ss_pred Eehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 234 gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) |||++ |+++++++++|+++++.. ++|++|++.+|++.|+|+.+.+|.+|++++.+.++ T Consensus 327 gd~~~~~~~~~~~~~~v~~~~~~~--------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 327 GGFDMASQVWDRMDATVEVSREDR--------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred eecccEEEEEEecceEEEEecccc--------chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 99987 678899999998866542 45899999999999999999999999999999877 No 56 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=3.8e-53 Score=308.00 Aligned_cols=273 Identities=15% Similarity=0.148 Sum_probs=231.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |.. +++.+|.++|+++..+|++.+++.++|+++|++++++++++++|+.++ .+.+.|++|++. 
+|+++++|++ T Consensus 105 ~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~~~ 178 (385) T protein:vir:19 105 LGS-DADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKAL-----KPESDITFSK 178 (385) T ss_pred hcc-ccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCcc-----ccccccceeE Confidence 444 344456788899999999999999999999999999998999999875 578999999865 6778889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++.+ +.|+++......... ... T Consensus 179 ~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~Gi~~~~~~~~~~~-----~~~ 250 (385) T protein:vir:19 179 QTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDN--LEGLNKVATAYDTSL-----NAT 250 (385) T ss_pred EEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--ccccccccccccccc-----ccc Confidence 99999999999999999999875 6999999999999999999999999987654 456665544332221 122 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccCCCCceEEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWDADAAIEVI 233 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~~~~~~~~~ 233 (305) ....++.+.+++.++...++.++.|+|||.++..|+++||++|+|+|++ .+++|+||++++.+| ++.++| T Consensus 251 ~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p----~~~~~~ 326 (385) T protein:vir:19 251 GDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQA----AGTFTV 326 (385) T ss_pred ccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCC----CCcEEE Confidence 2334566777778888888889999999999999999999999999964 579999999999986 456999 Q ss_pred 
Eehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 234 ADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 234 gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) |||++ |+++++++++|+++++.. ++|++|++.+|++.|+|+.+.+|.+|++++.+.++ T Consensus 327 gd~~~~~~~~~~~~~~v~~~~~~~--------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 327 GGFDMASQVWDRMDATVEVSREDR--------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred eecccEEEEEEecceEEEEecccc--------chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 99987 678899999998866542 45899999999999999999999999999999877 No 57 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=4.6e-53 Score=307.50 Aligned_cols=269 Identities=14% Similarity=0.052 Sum_probs=224.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEe-CCCceeeeecchhhccccccc-cccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLA-TLPEADWVGESATDPKGVKPT-SKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~--~~~p~~~-~~~~a~~v~E~~~~~~~~~~~-~~~~ 76 (305) |+.+++++||++||+++.++|++.+++.++|+++|+++++++++ +.+|+.. ..+.+.|++|++. ++. 
+.++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~ 183 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGK-----IADVDDPK 183 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccc-----cccccccc Confidence 88889999999999999999999999999999999999997554 5566654 4477999999976 443 4679 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++.... + T Consensus 184 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~-------------------~ 244 (397) T protein:vir:49 184 LSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKP-------------------T 244 (397) T ss_pred eeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------c Confidence 999999999999999999999999999999999999999999999999999998754321 1 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc--ccc-CC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GAW-DA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~--~~~-~~ 226 (305) ..++ +.+.+++..+...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++.+. ++. .. 
T Consensus 245 ~~~~----d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~ 320 (397) T protein:vir:49 245 LTKW----DDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTG 320 (397) T ss_pred cccH----HHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccC Confidence 1223 34555666677777888999999999999999999999999975 3799999987553 443 34 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |++++|++++++++++.. +.|++|++.+|++.|+|+.+.+|.+|++++.+.++.-+|-. T Consensus 321 ~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 392 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG--------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNL 392 (397) T ss_pred CceeEEEeeccceEEEEeecceEEEEecccc--------chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCc Confidence 567799999997 568899999999876532 45899999999999999999999999999988654433332 No 58 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=1.3e-52 Score=305.08 Aligned_cols=283 Identities=9% Similarity=0.038 Sum_probs=229.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCC--CceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT--KTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~--~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |..+++++||++||+++.++|++.+++.++|+++++++++++ +.+.+|+.++.+.+.|++|++..+.. 
..+++|+ T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~---~~~~~f~ 186 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTN---GDNGKLE 186 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecccccccccc---cccccee Confidence 888888999999999999999999999999999999999874 46778888899999999999763321 1357899 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+|+++|+.++++++|.++|++++++++|.+||+|+|++.+ +.|+......... ...+.. T Consensus 187 ~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~--~~gi~~~~~~~~~---~~~~~~ 261 (404) T protein:vir:10 187 RFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEH--ATGIMTANKFKKI---TLPKSP 261 (404) T ss_pred eeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCc--ccceeecccccee---eccccc Confidence 99999999999999999999999999999999999999999999999999987543 4455544333222 122223 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEec-Cccc-cCCCCc Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFN-RNGA-WDADAA 229 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~-~~~~-~~~~~~ 229 (305) .++++.+.+ ...+...+...++|+|||.++..|+++||++|+|+|++ ++++|+||++. +.++ .+.++. 
T Consensus 262 ~~~~~~~~~---~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~ 338 (404) T protein:vir:10 262 ALKDFKKCK---NVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAI 338 (404) T ss_pred cHHHHHHHH---HhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCcc Confidence 333333322 22455666667789999999999999999999999975 37999999854 3333 345677 Q ss_pred eEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 230 IEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 230 ~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) .++||||++ |.++.|++++++++++. +..|++|++.+|++.|+|+.+.+|++|++++.+.++ .|| T Consensus 339 ~~~~gd~s~~~~~~~~~~~~i~~~~~~--------~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa--~~~ 404 (404) T protein:vir:10 339 PVLLGDTKEAYKYVSDGAYELATTNIG--------AGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVES--VQA 404 (404) T ss_pred EEEEEeccccEEEEEecceEEEEeccc--------cchhhcCceEEEEEEeeccEEecccceEEEEeeccc--CCC Confidence 899999997 56888999999887654 245889999999999999999999999999998764 466 No 59 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1.1e-52 Score=305.37 Aligned_cols=262 Identities=12% Similarity=0.072 Sum_probs=221.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceE--EEEEeCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH--LPVLATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~--~p~~~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) |+.+++++||.+||+++..+|++.+++.++|+++++++++++++.+ +++..+.+.+.|++|++. 
+|+ +.++| T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~f 165 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAA-----IGEKATPQF 165 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccc-----cccccccce Confidence 9999999999999999999999999999999999999999876655 555566788999999875 443 56799 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+. + . T Consensus 166 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~---~-----------------~ 225 (371) T protein:vir:81 166 TLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKT---A-----------------I 225 (371) T ss_pred eeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---c-----------------c Confidence 9999999999999999999999999999999999999999999999999999865321 1 1 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC----- Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD----- 225 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~----- 225 (305) .+++++... 
+...+...+...+.|+|||.++..|+++||++|+|+|++ ++++|+||+++++++.+ T Consensus 226 ~~~~~i~~~---~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 302 (371) T protein:vir:81 226 ADLDGLKQI---INVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDG 302 (371) T ss_pred ccHHHHHHH---HHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccc Confidence 223333222 223445566677899999999999999999999999974 47999999999987633 Q ss_pred ---CCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 226 ---ADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 226 ---~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) .+...++||||++ |.+++|++++++++++.. +.|++|++.+|++.|+|+.+.+|++|++++.+.+ T Consensus 303 ~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~--------~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 303 GTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM--------DAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred cccCCcceEEEEehhceEEEEeecceEEEEecccc--------chhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 3456799999998 567889999999876543 4589999999999999999999999999999877 No 60 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1.2e-52 Score=305.27 Aligned_cols=268 Identities=15% Similarity=0.069 Sum_probs=221.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEe-CCCceeeeecchhhccccccc-cccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLA-TLPEADWVGESATDPKGVKPT-SKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~--~~~p~~~-~~~~a~~v~E~~~~~~~~~~~-~~~~ 76 (305) |+.+++++||++||+++.++|++.++++++|+++|+++++++++ +.+|+.. ..+.+.|++|++. 
+++ ++++ T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~ 79 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGK-----IADIDDPK 79 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcc-----cccccccc Confidence 99999999999999999999999999999999999999997654 5566664 4578999999976 443 4578 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|+++++++|+|+++|+.++++++|.+++++++++++|++|++|+|+.... .+ T Consensus 80 ~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~-------------------~~ 140 (293) T protein:vir:48 80 LSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK-------------------PT 140 (293) T ss_pred eeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999998754321 12 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc--ccc-CC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GAW-DA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~--~~~-~~ 226 (305) ..++++ +.+++.++...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++.+. ++. .. 
T Consensus 141 ~~~~d~----i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~ 216 (293) T protein:vir:48 141 LTKWDD----IIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASS 216 (293) T ss_pred ccCHHH----HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccC Confidence 233444 444555666667778899999999999999999999999975 3799999987543 332 34 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |+++++++++++++++.. +.|++|++.+|++.|+|+.+.+|++|++++.+.++. +||- T Consensus 217 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~-~~~~ 287 (293) T protein:vir:48 217 GVMPLYFGDLKQAVTLFDRQQMSLLSTNIGG--------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIAD-QKGN 287 (293) T ss_pred CceEEEEEeccceEEEEEecceEEEEecccc--------hhhhcCeEEEEEEEeeCcEEecccceEEEEeecccc-CCcc Confidence 566799999998 568899999999877542 458999999999999999999999999999775433 2222 No 61 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.1e-52 Score=305.37 Aligned_cols=282 Identities=12% Similarity=0.043 Sum_probs=229.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) ++.+++++||++||+++.++|++.+++.++|+++|+++|++++...+|+.++.+.+.|++|++..+ +.++++|+++ T Consensus 84 
~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~----~~~~~~f~~i 159 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIK----EVLDNGFDKI 159 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccC----ccccccceee Confidence 777888899999999999999999999999999999999999999999999999999999986532 3456799999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccc--eeecccch Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQA--VEVVGGVA 158 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 158 (305) ++++||++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++ .|.|+++........ ........ T Consensus 160 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~ 236 (390) T protein:vir:40 160 QTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKD---QPIGMMRDLNNVTAGEHPVKTATPL 236 (390) T ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCC---ccceeeecccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999965 466766644322211 11222223 Q ss_pred hhhHHHHHHHHHHHHhhh---ccccceEEEEchHHHH----HHHHhhccCCceeecccccCccceEecCccccCCCCceE Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVAS---AGWAPDTLLSSLALRY----EVANIRDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIE 231 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~~~----~l~~~kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~ 231 (305) ++.+..+.+..+...+.. .....++|+||+.++. .+++++|.+|+|++.. 
...|+||+++++++ ++.+ T Consensus 237 t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~-~~~g~pvv~~~~~p----~~~i 311 (390) T protein:vir:40 237 TDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI-LPVPLEIVQSVAVP----VGKA 311 (390) T ss_pred chhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc-CCCceeEEEcCCCC----CCcE Confidence 444444444444443332 2345678999997642 4557899999999865 45799999999986 4569 Q ss_pred EEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 232 VIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 232 ~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) +||||++|++++|++++|+++++.+ |.+|++.+|+..|+|+++.+|+||+.++.++++. +|+. T Consensus 312 ~~Gd~s~~~i~~~~~~~v~~~~~~~----------f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~-~~~~ 374 (390) T protein:vir:40 312 VAGRAKDYFMGIGSEQVIRTSTEYR----------LLDDETLYYAKQYANGRPKDNSSFLVFDITGLEG-SPAI 374 (390) T ss_pred EEEeeceEEEEeecceEEEecchhh----------hhcCcEEEEEEEEeCCEEecccceEEEEeeccCC-CCCC Confidence 9999999999999999999887653 7899999999999999999999999999887754 3333 No 62 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.7e-52 Score=304.37 Aligned_cols=269 Identities=14% Similarity=0.051 Sum_probs=224.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE---eCCCceeeeecchhhcccccccc-ccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL---ATLPEADWVGESATDPKGVKPTS-KVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~---~~~~~a~~v~E~~~~~~~~~~~~-~~~ 76 (305) |+..++++||++||+++.++|++.+++.++|+++|+++++++++..+|+. +..+.+.|++|++. 
++++ .++ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~-----~~~~~~~~ 183 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGS-----IGTNDDPK 183 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccc-----cccccccc Confidence 88888889999999999999999999999999999999999887776654 34467999999976 4433 478 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++... .+ T Consensus 184 ~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~-------------------~~ 244 (397) T protein:vir:48 184 LYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTK-------------------PT 244 (397) T ss_pred eeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999999875431 11 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc--cc-cCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GA-WDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~--~~-~~~ 226 (305) ..++ +.+.++..++...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++.+. ++ ... 
T Consensus 245 ~~~~----d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~ 320 (397) T protein:vir:48 245 LTKW----DDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASS 320 (397) T ss_pred cccH----HHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCC Confidence 1223 33445566667777888999999999999999999999999974 3799999987653 33 345 Q ss_pred CCceEEEEehhhE-EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||+++ .+++|++++++++++.. .+|.+|++.+|++.|+|+.+.+|++|++++.+.++.-+|.- T Consensus 321 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 392 (397) T protein:vir:48 321 GAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG--------GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNL 392 (397) T ss_pred CceEEEEEeccceEEEEeecceEEEEeccch--------hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCCc Confidence 6778999999975 58899999999876542 45889999999999999999999999999998765423322 No 63 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=2.5e-52 Score=303.46 Aligned_cols=270 Identities=14% Similarity=0.094 Sum_probs=222.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEE--EEEeC-CCceeeeecchhhccccccc-cccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHL--PVLAT-LPEADWVGESATDPKGVKPT-SKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~--p~~~~-~~~a~~v~E~~~~~~~~~~~-~~~~ 76 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++++++..+ ++..+ ...+.|++|++. 
+++ +.++ T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~-----~~~~~~~~ 190 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGK-----IPDLDNPR 190 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccc-----cccccccc Confidence 77888899999999999999999999999999999999998765554 44443 466789999875 443 4579 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++... .+ T Consensus 191 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~-------------------~~ 251 (408) T protein:vir:74 191 LTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK-------------------PT 251 (408) T ss_pred eeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999999875321 11 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc--ccc-CC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GAW-DA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~--~~~-~~ 226 (305) ..+++++.+.+ ...+...+...+.|+|||.++..|+++||++|+|+|++ .+++|+||++.++ ++. +. 
T Consensus 252 ~~~~~~i~~~~---~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 328 (408) T protein:vir:74 252 IANFDDVITMI---NTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGS 328 (408) T ss_pred cccHHHHHHHH---HHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccC Confidence 22334443332 34556666777889999999999999999999999974 4799999998754 443 45 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |.+++|++++++++++. +..|++|++.+|++.|+|+.+.+|++|++++.++++.-.|+. T Consensus 329 ~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~--------~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 400 (408) T protein:vir:74 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIG--------AGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGNF 400 (408) T ss_pred CcceEEEEehhccEEEEEecceEEEEeccc--------cchhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCCC Confidence 667899999997 56889999999987653 235889999999999999999999999999998665533333 No 64 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.9e-52 Score=304.11 Aligned_cols=262 Identities=13% Similarity=0.049 Sum_probs=221.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCC--ceEEEEEeCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTK--TTHLPVLATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~--~~~~p~~~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) |+.+++++||.+||+++.++|++.+++.++|+++|++++++++ .+.+|+.++.+.+.|++|++. +|. 
+.++| T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~~ 197 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGN-----LPEIDQPRF 197 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeeccccc-----ccccccccc Confidence 8888999999999999999999999999999999999999754 556677788889999999975 443 45799 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.+ .++ T Consensus 198 ~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~---~g~----------------- 257 (397) T protein:vir:12 198 TKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKK---VDI----------------- 257 (397) T ss_pred eeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccc----------------- Confidence 999999999999999999999999999999999999999999999999999987532 221 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccc--cCCCC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGA--WDADA 228 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~--~~~~~ 228 (305) .+++++.+. +...+...+...+.|+|||.++..|+++||++|+|+|++ .+++|+||++.++.. 
.+.++ T Consensus 258 ~~~~~i~~~---~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~ 334 (397) T protein:vir:12 258 DGLDGIKKA---LNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGK 334 (397) T ss_pred ccHHHHHHH---HhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCc Confidence 223333332 223556667778899999999999999999999999974 379999998776532 34567 Q ss_pred ceEEEEehhhE-EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 229 AIEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 229 ~~~~~gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ..++||||+++ .+++|++++++++++. +..|++|++.+|++.|+|+.+.+|++|++++.|.. T Consensus 335 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~--------~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 335 APLIIGNLKEAIVLFDREQQSIASTDTG--------AGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred cEEEEEehhceEEEEeecceEEEEeccc--------cchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 78999999985 5788999999887654 24589999999999999999999999999999865 No 65 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.5e-52 Score=303.49 Aligned_cols=275 Identities=13% Similarity=0.156 Sum_probs=224.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC----Cceeeeecchhhccccccccc-c Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL----PEADWVGESATDPKGVKPTSK-V 75 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~----~~a~~v~E~~~~~~~~~~~~~-~ 75 (305) ++.+++++++.++|+++.++|++.+++.++|++++++++++++++++|+.... ..+.|++|++. +|+++ . 
T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~ 192 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGK-----KPYMRFA 192 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCccc-----ccccCcc Confidence 55667778999999999999999999999999999999999999999997643 46899999875 55555 5 Q ss_pred cceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecc Q lcl|Aclame:pro 76 TWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 76 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) +|+++++.++|++++++||+|+++|+. .+++||.++|++++++++|++||+|+|++.+ +.|+++......... .+ T Consensus 193 ~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~--~~Gi~~~~~~~~~~~--~~ 267 (413) T protein:vir:81 193 DFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLLGDGTGNN--LTGLLKRDGIQTLAV--SN 267 (413) T ss_pred cceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCc--ccccccccccccccc--cc Confidence 799999999999999999999999986 4999999999999999999999999987653 456665443332211 11 Q ss_pred cchhhhHHHHHHHHHHHHhh-hccccceEEEEchHHHHHHHHhhccCCceeecc--------------cccCccceEecC Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVA-SAGWAPDTLLSSLALRYEVANIRDANGNPVFRD--------------DSFAGFRTFFNR 220 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~--------------~~l~G~pv~~~~ 220 (305) ....++.+..++.... 
..++.+++|+||+.++..|+++||++|+|+|++ .+++|+||++++ T Consensus 268 ----~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~ 343 (413) T protein:vir:81 268 ----KDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQ 343 (413) T ss_pred ----cchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcC Confidence 1223344444443333 334566779999999999999999999999963 268999999999 Q ss_pred ccccCCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 221 NGAWDADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 221 ~~~~~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) .++ .+.++||||++ |++++|++++++++++.. .+|++|++.+|++.|+|+.+.+|.+|++++.++ T Consensus 344 ~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~-- 409 (413) T protein:vir:81 344 VVP----VGKPVVGAFRSAASVLRKGGVRIDSTNTNV--------DDFENNLITVRAEERVGLMVTFPEAIVQLDVAE-- 409 (413) T ss_pred CCC----cccEEEEecccEEEEEEecceEEEEecccc--------chhhcCcEEEEEEEeeccEEecccceEEEEecC-- Confidence 886 45799999997 567889999999877643 358999999999999999999999999999875 Q ss_pred cccC Q lcl|Aclame:pro 300 VVAP 303 (305) Q Consensus 300 ~v~~ 303 (305) +++| T Consensus 410 ~~~p 413 (413) T protein:vir:81 410 VVTP 413 (413) T ss_pred CCCC Confidence 4677 No 66 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=4.8e-52 Score=301.91 Aligned_cols=278 Identities=12% Similarity=0.073 Sum_probs=225.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--eCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL--ATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 
Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~--~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) .+.+++++||.+||+++.++|++.+++.++|+++|++++++++..++|+. ++...+.|++|++. +|+ +.++| T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~-----~~~~~~~~~ 195 (415) T protein:vir:47 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE-----NPELAVKPF 195 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccc-----cccccccce Confidence 34456677899999999999999999999999999999999888777765 56678899999976 443 45789 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++++.++|++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++.+....... .... ......+. T Consensus 196 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~---~~~~-~~~~~~~~ 271 (415) T protein:vir:47 196 FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF---EKEG-KKLEVKKA 271 (415) T ss_pred eeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccc---cccc-ceeccccc Confidence 9999999999999999999999999999999999999999999999999999876553322111 1111 11122233 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcccc-CCCCc Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAW-DADAA 229 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~-~~~~~ 229 (305) .++ +++.+++.++...++.++.|+||+.++..|+++||++|+|+|++ .+++|+||+++++++. +.++. 
T Consensus 272 ~~~----~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:47 272 KSL----DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred cch----HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCcc Confidence 333 44555666667777888999999999999999999999999964 4799999999988774 34567 Q ss_pred eEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 230 IEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 230 ~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .++||||++ |++++|++++++.++ |.++.+.+|+..|+|+.+.+|++|+.++.++++. .|++ T Consensus 348 ~~~~gd~~~~~~~~~~~~~~v~~~~-------------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~ 410 (415) T protein:vir:47 348 TLIIGNLKDAIVLFDRSQYQASWTD-------------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGD 410 (415) T ss_pred EEEEEehhccEEEEeecceEEEeec-------------cccCceEEEEEEEeccEEeccccEEEEEeeccCC-CCCC Confidence 799999998 567889999988754 4456788999999999999999999999987654 5666 No 67 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=4.8e-52 Score=301.91 Aligned_cols=278 Identities=12% Similarity=0.073 Sum_probs=225.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--eCCCceeeeecchhhccccccc-ccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL--ATLPEADWVGESATDPKGVKPT-SKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~--~~~~~a~~v~E~~~~~~~~~~~-~~~~f 77 (305) .+.+++++||.+||+++.++|++.+++.++|+++|++++++++..++|+. ++...+.|++|++. 
+|+ +.++| T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~-----~~~~~~~~~ 195 (415) T protein:vir:46 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE-----NPELAVKPF 195 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccc-----cccccccce Confidence 34456677899999999999999999999999999999999888777765 56678899999976 443 45789 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) +++++.++|++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++.+....... .... ......+. T Consensus 196 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~---~~~~-~~~~~~~~ 271 (415) T protein:vir:46 196 FQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF---EKEG-KKLEVKKA 271 (415) T ss_pred eeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccc---cccc-ceeccccc Confidence 9999999999999999999999999999999999999999999999999999876553322111 1111 11122233 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcccc-CCCCc Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAW-DADAA 229 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~-~~~~~ 229 (305) .++ +++.+++.++...++.++.|+||+.++..|+++||++|+|+|++ .+++|+||+++++++. +.++. 
T Consensus 272 ~~~----~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:46 272 KSL----DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred cch----HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCcc Confidence 333 44555666667777888999999999999999999999999964 4799999999988774 34567 Q ss_pred eEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 230 IEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 230 ~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .++||||++ |++++|++++++.++ |.++.+.+|+..|+|+.+.+|++|+.++.++++. .|++ T Consensus 348 ~~~~gd~~~~~~~~~~~~~~v~~~~-------------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~ 410 (415) T protein:vir:46 348 TLIIGNLKDAIVLFDRSQYQASWTD-------------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGD 410 (415) T ss_pred EEEEEehhccEEEEeecceEEEeec-------------cccCceEEEEEEEeccEEeccccEEEEEeeccCC-CCCC Confidence 799999998 567889999988754 4456788999999999999999999999987654 5666 No 68 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=1.3e-52 Score=305.13 Aligned_cols=272 Identities=15% Similarity=0.169 Sum_probs=229.2 Q ss_pred CCCccCCccceEccHHH-HHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAY-SDTLLAAAKQGSTVLSA-FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~-~~~i~~~~~~~~~l~~l-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |..+|+++||++||+++ .++|++.+++.++++++ ++++++.++++++|+.++.+.++|++|++. 
++.++++|+ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~-----~~~s~~~f~ 431 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDED-----VQDSDFDFT 431 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCcc-----cccccccee Confidence 77788888999999886 58999999999999998 788999999999999999999999999875 667788999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+||++|+.++++++|.++|++++++++|.++|+|+|... .|.|+++...... +....... T Consensus 432 ~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~--~p~Gi~~~~~~~~--~~~~~~~~ 507 (632) T protein:vir:96 432 TLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAN--DPVGLLNMTGVPA--LTYPAGGV 507 (632) T ss_pred eEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCC--ccceeeecccccc--eecccccC Confidence 9999999999999999999999999999999999999999999999999998543 5677776544332 22333344 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHH--hhccCCceeecccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN--IRDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~--~kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) ++.++.++...+.... .....+.|+||+.....|++ ++|++|+|+|++++++|+|++++++++. 
+.++|||| T Consensus 508 ~~~~i~~~~~~i~~~~--~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l~G~pv~~s~~ip~----~~~~~gd~ 581 (632) T protein:vir:96 508 DWASVVDMETKISTFN--ADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNGYRAEASNQIPA----DTWIFGDW 581 (632) T ss_pred CHHHHHHHHHHHhhcc--cccCccEEEEchhHHHHHHHHhccCCCCceeecCCeecccceEecccccc----CcEEEeec Confidence 5555555443333221 22345689999998877765 7799999999999999999999999874 45999999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccc Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~ 297 (305) +++++++++++++.++++.+ |.+|++.+|+..|+|+++.+|++|+.++... T Consensus 582 s~~~i~~~~~~~i~~~~~~~----------~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 582 SQIVIAMWGVLDLKVDPYTK----------AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ceEEEEEecceEEEEccccc----------cccCceEEEEEeecCceeechhhhhheeecC Confidence 99999999999999877653 6789999999999999999999999999875 No 69 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=5.2e-52 Score=301.74 Aligned_cols=270 Identities=14% Similarity=0.087 Sum_probs=221.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE---eCCCceeeeecchhhccccccc-cccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL---ATLPEADWVGESATDPKGVKPT-SKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~---~~~~~a~~v~E~~~~~~~~~~~-~~~~ 76 (305) |..+++++||++||+++.++|++.+++.++|+++|+++|++++...+|+. +..+.+.|++|++. 
+|+ +.++ T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~~~~~~ 190 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGK-----IPDLDNPR 190 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccc-----cccccccc Confidence 78889999999999999999999999999999999999998776665554 34477899999975 443 4679 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) |++++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++... .+ T Consensus 191 f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~-------------------~~ 251 (404) T protein:vir:39 191 LTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK-------------------PT 251 (404) T ss_pred eeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999999875321 11 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc--ccc-CC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GAW-DA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~--~~~-~~ 226 (305) ..+++++.+.+ ...+...+...+.|+|||.++..|+++||++|+|+|++ .+++|+||++.++ ++. .. 
T Consensus 252 ~~~~~~i~~~~---~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 328 (404) T protein:vir:39 252 IAKFDDVITMI---NTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGS 328 (404) T ss_pred cccHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCC Confidence 12333443333 23444555667789999999999999999999999975 3799999998765 332 23 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..+++|||++ |++++|++++++++++.. +.|++|++.+|++.|+|+.+.+|++|++++.++++.-..+. T Consensus 329 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~ 400 (404) T protein:vir:39 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGA--------GAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNF 400 (404) T ss_pred CccEEEEEeccccEEEEeecceEEEEeccch--------hhhhhceeeEEEEeeeccEEecccceEEEEeeccccCCCCC Confidence 556799999997 567889999998877542 45889999999999999999999999999998764433333 No 70 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=3.3e-52 Score=302.79 Aligned_cols=280 Identities=14% Similarity=0.107 Sum_probs=221.8 Q ss_pred CC-CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MA-DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma-~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) ++ .+++++||++||+++.++|++.++++++|+++|+++++++ +.++|+....+.+.|+.+.+. 
...++.++++|++ T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a~~~~~~~e--~~~~~~~~~~f~~ 217 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEAQGHKNERT--NNEMPETDIEFDE 217 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcccceecccc--cccccccccceee Confidence 22 2455678999999999999999999999999999998764 689999988888888765432 2346778899999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) +++++||+++++++|+|+++|+.+++++||.++|++++++++|++||+|+|++.+. .+++...... .... T Consensus 218 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~--~g~~~~~~~~--------~~~~ 287 (434) T protein:vir:62 218 IELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNIN--DGALAKKAVE--------FKTD 287 (434) T ss_pred EEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccc--cceeeccccc--------cccc Confidence 99999999999999999999999999999999999999999999999999976532 3333322211 1112 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---------cccCccceEecCccccC--CCC Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---------DSFAGFRTFFNRNGAWD--ADA 228 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---------~~l~G~pv~~~~~~~~~--~~~ 228 (305) .....+++.++...+...+...+.|+||+.++..|+++||++|+|+|++ .+++|+||++++.++.. .+. 
T Consensus 288 ~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~ 367 (434) T protein:vir:62 288 EKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDT 367 (434) T ss_pred ccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCc Confidence 2233556666777777777778899999999999999999999999964 26999999999988743 233 Q ss_pred ceEEEEehhhEEEEeec-CcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeec-ccceEEEeccccccccCC Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQ-DITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGV-SATAQGANKTPVAVVAPA 304 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~-~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~-p~a~~~~~~t~~a~v~~a 304 (305) ..++||||++|++++|. .++++++.+.+ |.+|++.+|++.|+|+.+.+ |.++..++.+..++ +.| T Consensus 368 ~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~----------~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~-~~~ 434 (434) T protein:vir:62 368 PVFYFGDFSKFYIQDVIGSLEVQKLVELF----------SRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAP-TGA 434 (434) T ss_pred eEEEEeeccceEEEEeeceeEEEeehhhh----------cccCceEEEEEeeecceeecCcccceEEEEEeccC-CCC Confidence 45889999999998875 47788766543 67899999999999999765 87777776653322 222 No 71 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=3.8e-53 Score=307.95 Aligned_cols=276 Identities=10% Similarity=-0.026 Sum_probs=228.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..++.++||++||+++.++|++.+.+.++++++|+++++++ ..++|+.++.+.+.|++|++... 
++++++|+++ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~~~~~~~~~~a~w~~e~~~~~----~~~~~~f~~i 153 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIK----GQLKQAFKEQ 153 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc-ceEEEEecCCcceeEeecccccC----cccCccceeE Confidence 788889999999999999999999999999999999999865 57999999999999999976532 3457799999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhh Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) ++.+||+++++++|+||++||.+++++||.+++++++++++|++|++|+|++ +|.|+++................+. T Consensus 154 ~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~---qP~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) T protein:vir:98 154 DFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLL---QPVGLLKDLSQPTVDQSTGRDITTY 230 (377) T ss_pred eecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCC---cceeeeecccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999965 6778876544333222222222222 Q ss_pred hHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec----------c-----------cccCccce--E Q lcl|Aclame:pro 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR----------D-----------DSFAGFRT--F 217 (305) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~----------~-----------~~l~G~pv--~ 217 (305) ....+.+.++...+...+...++|+||+.+...++++||.+|+|+|. 
+ .+++|+|+ + T Consensus 231 ~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv 310 (377) T protein:vir:98 231 KTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITIL 310 (377) T ss_pred cchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEE Confidence 22334455566666666667778999999999999999999999993 1 14566664 4 Q ss_pred ecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccc Q lcl|Aclame:pro 218 FNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) Q Consensus 218 ~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~ 297 (305) .+++++ ++.++||||++|++++|++++++++++.+ |.+|++.+|+..|+|+.+.+|++|++++.+. T Consensus 311 ~s~~~p----~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~----------~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 311 ESLAVE----TGKAIAFVANRYDAFMATASTIEEYDQTF----------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAG 376 (377) T ss_pred ecCCCC----cccEEEEEecceeEEeecceEEEeechhh----------hhcCceEEEEEEEEcCEEeccCcEEEEEEec Confidence 455554 56699999999999999999999988764 7889999999999999999999999999875 Q ss_pred c Q lcl|Aclame:pro 298 V 298 (305) Q Consensus 298 ~ 298 (305) = T Consensus 377 ~ 377 (377) T protein:vir:98 377 G 377 (377) T ss_pred C Confidence 4 No 72 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1.9e-51 Score=298.63 Aligned_cols=279 Identities=11% Similarity=0.071 Sum_probs=225.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE--EEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLP--VLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p--~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) .+.+++++||.+||+++.++|++.+++.++|+++|++++|++++.++| 
+.++...+.|++|++..++ .+.++|+ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~----~~~~~~~ 196 (415) T protein:vir:79 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE----LAVKPFF 196 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCc----cccccee Confidence 445667778999999999999999999999999999999987765555 4567788999999876332 2356899 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+....... ..... .....+.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~---~~~~~-~~~~~~~~ 272 (415) T protein:vir:79 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF---EKEGK-KLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc---ccccc-cccccccc Confidence 999999999999999999999999999999999999999999999999999876554322211 11111 12222233 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC-CCCce Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD-ADAAI 230 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~-~~~~~ 230 (305) + ++++.+++.++...++..++|+||+.++..|+++||++|+|+|++ .+++|+||++.++++.. 
.++.+ T Consensus 273 ~----~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:79 273 S----LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred c----hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 3 344555666677777888999999999999999999999999975 37999999998887753 45678 Q ss_pred EEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 231 ~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++||||++ |+++++++++++.++ |.++.+.+|+..|+|+.+.+|++|++++.++++. .|++ T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~-------------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~ 410 (415) T protein:vir:79 349 LIIGNLKDAIVLFDRSQYQASWTD-------------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec-------------cccCceEEEEEEEeccEEeccccEEEEEEeccCC-CCCc Confidence 99999998 557889999998754 3455678999999999999999999999987644 6666 No 73 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1.9e-51 Score=298.63 Aligned_cols=279 Identities=11% Similarity=0.071 Sum_probs=225.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE--EEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLP--VLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p--~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) .+.+++++||.+||+++.++|++.+++.++|+++|++++|++++.++| +.++...+.|++|++..++ .+.++|+ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~----~~~~~~~ 196 (415) T protein:vir:81 121 
GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE----LAVKPFF 196 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCc----cccccee Confidence 445667778999999999999999999999999999999987765555 4567788999999876332 2356899 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+....... ..... .....+.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~---~~~~~-~~~~~~~~ 272 (415) T protein:vir:81 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF---EKEGK-KLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc---ccccc-cccccccc Confidence 999999999999999999999999999999999999999999999999999876554322211 11111 12222233 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC-CCCce Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD-ADAAI 230 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~-~~~~~ 230 (305) + ++++.+++.++...++..++|+||+.++..|+++||++|+|+|++ .+++|+||++.++++.. 
.++.+ T Consensus 273 ~----~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:81 273 S----LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred c----hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 3 344555666677777888999999999999999999999999975 37999999998887753 45678 Q ss_pred EEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 231 ~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++||||++ |+++++++++++.++ |.++.+.+|+..|+|+.+.+|++|++++.++++. .|++ T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~-------------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~ 410 (415) T protein:vir:81 349 LIIGNLKDAIVLFDRSQYQASWTD-------------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec-------------cccCceEEEEEEEeccEEeccccEEEEEEeccCC-CCCc Confidence 99999998 557889999998754 3455678999999999999999999999987644 6666 No 74 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1.9e-51 Score=298.63 Aligned_cols=279 Identities=11% Similarity=0.071 Sum_probs=225.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE--EEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLP--VLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p--~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) .+.+++++||.+||+++.++|++.+++.++|+++|++++|++++.++| +.++...+.|++|++..++ .+.++|+ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~----~~~~~~~ 196 (415) T protein:vir:98 121 
GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE----LAVKPFF 196 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCc----cccccee Confidence 445667778999999999999999999999999999999987765555 4567788999999876332 2356899 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+....... ..... .....+.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~---~~~~~-~~~~~~~~ 272 (415) T protein:vir:98 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGF---EKEGK-KLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccc---ccccc-cccccccc Confidence 999999999999999999999999999999999999999999999999999876554322211 11111 12222233 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC-CCCce Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD-ADAAI 230 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~-~~~~~ 230 (305) + ++++.+++.++...++..++|+||+.++..|+++||++|+|+|++ .+++|+||++.++++.. 
.++.+ T Consensus 273 ~----~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:98 273 S----LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred c----hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 3 344555666677777888999999999999999999999999975 37999999998887753 45678 Q ss_pred EEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 231 ~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++||||++ |+++++++++++.++ |.++.+.+|+..|+|+.+.+|++|++++.++++. .|++ T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~-------------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~ 410 (415) T protein:vir:98 349 LIIGNLKDAIVLFDRSQYQASWTD-------------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec-------------cccCceEEEEEEEeccEEeccccEEEEEEeccCC-CCCc Confidence 99999998 557889999998754 3455678999999999999999999999987644 6666 No 75 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=9.1e-52 Score=300.40 Aligned_cols=262 Identities=12% Similarity=0.037 Sum_probs=220.5 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC--Cceeeeecchhhcccccccccccc Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL--PEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~--~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) .++ +++++++.++|+++..+|++.+++.++++++|+++++.+++++||+.++. ..+.|++|++. 
+|+++++| T Consensus 106 ~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~-----~~~~~~~f 180 (379) T protein:vir:10 106 VGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGAT-----KGQKDYDI 180 (379) T ss_pred hcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCcc-----ccccccce Confidence 222 45555667899999999999999999999999999999999999998754 55678999864 67788999 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++++++++|++++++||+|+++|+. ++++||.++|++++++++|.+|+.|+|........ ..... T Consensus 181 ~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~--------------~~~~~ 245 (379) T protein:vir:10 181 SMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANATASTE--------------IITNK 245 (379) T ss_pred eeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccccc--------------cccCc Confidence 9999999999999999999999986 69999999999999999999999988754211100 01111 Q ss_pred hhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---------cccCccceEecCccccCCCC Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---------DSFAGFRTFFNRNGAWDADA 228 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---------~~l~G~pv~~~~~~~~~~~~ 228 (305) ..++.+.+++..+...++.++.|+|||.++..|+++||++|+|+|++ .+++|+||++++.++ . 
T Consensus 246 ----~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~----a 317 (379) T protein:vir:10 246 ----NKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLA----A 317 (379) T ss_pred ----ccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCC----C Confidence 12345666666777788888999999999999999999999999974 269999999998876 4 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) +.++||||+++++.+++++.++++++. .+.|++|++.+|++.|+|+.|.||++|++++.+.+ T Consensus 318 g~~~~gdf~~~~~~~~~~~~i~~~~~~--------~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 318 NKYYVGDWTRVTKVTTEGLSLEFSEVE--------GTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred CceEEeecccEEEEEEeceEEEEeecc--------cccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 569999999999999999999887654 23599999999999999999999999999999976 No 76 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=5e-51 Score=296.36 Aligned_cols=279 Identities=11% Similarity=0.072 Sum_probs=224.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE--EEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLP--VLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p--~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) -+.+++++||.+||+++.++|++.+++.++|+++|+++++++++.++| +.++.+.+.|++|++..++ .+.++|+ T Consensus 121 ~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~----~~~~~~~ 196 (415) T protein:vir:94 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE----LAVKPFF 196 (415) T 
ss_pred hhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccc----cccccce Confidence 334566778999999999999999999999999999999987765554 5567788999999976332 2356899 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+........ .... ........ T Consensus 197 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~---~~~~-~~~~~~~~ 272 (415) T protein:vir:94 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE---KEGK-KLEVKKAK 272 (415) T ss_pred eeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccc---cccc-cccccccc Confidence 9999999999999999999999999999999999999999999999999998765543222111 1111 11222223 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC-CCCce Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD-ADAAI 230 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~-~~~~~ 230 (305) . ++++.++...+...++.++.|+|||.++..|+++||++|+|+|++ .+++|+||++++.++.. .++.. 
T Consensus 273 ~----~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:94 273 S----LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred c----hHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 3 344555666667777888999999999999999999999999964 47999999999887743 45667 Q ss_pred EEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 231 ~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++||||++ |+++++++++++.++ |.++.+.+|++.|+|+.+.+|++|++++.++++. .|++ T Consensus 349 i~~gd~~~~~~~~~~~~~~v~~~~-------------~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~-~~~~ 410 (415) T protein:vir:94 349 LIIGNLKDAIVLFDRSQYQASWTD-------------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec-------------cccCceEEEEEEEeccEEeccccEEEEEEeccCC-CCCc Confidence 99999998 567889999998654 3456788999999999999999999999987643 6666 No 77 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=8.6e-51 Score=295.05 Aligned_cols=267 Identities=10% Similarity=0.155 Sum_probs=219.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCc--eeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPE--ADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~--a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) .+..++++||++||+++.++|++.+++.++|+++|+++++.+++.++|+...... +.|++|+.. 
++.++++|+ T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~-----~~~s~~~f~ 188 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTE-----LVKAMLKTQ 188 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeecccccc-----cccccccee Confidence 4557778899999999999999999999999999999999999999999876655 455778754 677889999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++. +.|+++. .+.. T Consensus 189 ~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~--------~~g~~~~-----------~~~~ 249 (421) T protein:vir:13 189 PMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQ--------AKAVLAE-----------ETIN 249 (421) T ss_pred EEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhh--------hhhcccc-----------cccc Confidence 99999999999999999999999999999999999999999999988742 2333221 1122 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc------cccCccceEecCccccC-CCCceE Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------DSFAGFRTFFNRNGAWD-ADAAIE 231 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~------~~l~G~pv~~~~~~~~~-~~~~~~ 231 (305) ++++ +.+++.++...++..+.|+||+.++..|+++||++|+|+|++ .+++|+||+++++++.. 
.+...+ T Consensus 250 ~~d~----i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~ 325 (421) T protein:vir:13 250 DYAG----LVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKF 325 (421) T ss_pred chHH----HHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEE Confidence 3334 445555666677788999999999999999999999999975 47999999999987743 456789 Q ss_pred EEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc-ccc----CCC Q lcl|Aclame:pro 232 VIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA-VVA----PAA 305 (305) Q Consensus 232 ~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a-~v~----~a~ 305 (305) +||||++ |++++|++++++++++. .|++|++.+|++.|+|+++.+|++|+.+..+..+ .|+ |++ T Consensus 326 ~~gd~~~~~~~~~~~~~~v~~~~~~----------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~ 395 (421) T protein:vir:13 326 IVSDFKTLIKFMDRKQYLIDQSKEA----------GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKS 395 (421) T ss_pred EEEeccccEEEEEecceEEEeeccc----------ccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCC Confidence 9999997 67899999999988765 3899999999999999999999887666654322 222 222 No 78 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=3.7e-51 Score=297.08 Aligned_cols=279 Identities=14% Similarity=0.013 Sum_probs=221.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |...++++||++||+++.++|++.+++.|++|++|+++++++ ..++|+.++.+.+.|++|..... 
.+++++|+++ T Consensus 76 ~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~W~~e~~~~~----~~~~~~f~~i 150 (381) T protein:vir:10 76 INKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIK----GQLDAAFSEE 150 (381) T ss_pred HhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-ceEEEeecCCcceEEeecccccc----cccCccceeE Confidence 778888899999999999999999999999999999999865 67999999999999999876532 3456899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee------c Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV------V 154 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~------~ 154 (305) ++..||+++++++|+||++|+.+++++||.++++++|++++|++|++|+|++ +|.|+++........... . T Consensus 151 ~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~---qP~Gil~~~~~~~~~~~g~~~~~~~ 227 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPIGLNRQVQKGVSVTDGAYPEKEE 227 (381) T ss_pred eecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCC---CceeeeecCCccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999975 567776543322111111 1 Q ss_pred ccchhhhHHH---HHHHHHHHHhh-------hccccceEEEEchHHHHHHHHhh---ccCCceeecccccCccceEecCc Q lcl|Aclame:pro 155 GGVANESDIV---GATNRAAKAVA-------SAGWAPDTLLSSLALRYEVANIR---DANGNPVFRDDSFAGFRTFFNRN 221 (305) Q Consensus 155 ~~~~~~~~~~---~~~~~~~~~~~-------~~~~~~~~~v~~~~~~~~l~~~k---d~~G~~l~~~~~l~G~pv~~~~~ 221 (305) .+..++.+.. +.+..+...+. ..+.....|+||+.++..|++++ +++|+|+|..+ .|.|++.++. 
T Consensus 228 ~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp--~g~~vv~~~~ 305 (381) T protein:vir:10 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP--FNLNVIESTV 305 (381) T ss_pred cccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCC--CCceeEEcCC Confidence 1111222222 22222221111 12344567999999999888654 88999998633 4778999888 Q ss_pred cccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) Q Consensus 222 ~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v 301 (305) || ++.++||||++|++++|++++++++++.+ |.+|++.+|+..|+|+.+.+|+||+.++.+-.+ T Consensus 306 ~p----~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~----------~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~-- 369 (381) T protein:vir:10 306 QE----AGKVLTYVKGLYDGYLAGGINVQKFKETL----------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG-- 369 (381) T ss_pred CC----cCcEEEEEcccEEEEEecccEEEeechhh----------hhcCceEEEEEEEEcCEEecCCcEEEEEEeecC-- Confidence 86 45699999999999999999999988764 788999999999999999999999998887555 Q ss_pred cCCC Q lcl|Aclame:pro 302 APAA 305 (305) Q Consensus 302 ~~a~ 305 (305) +|.+ T Consensus 370 ~~~~ 373 (381) T protein:vir:10 370 HKPA 373 (381) T ss_pred Cccc Confidence 4444 No 79 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=4.4e-51 Score=296.66 Aligned_cols=281 Identities=14% Similarity=0.023 Sum_probs=223.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) 
|...++++||++||+++.++|++.+++.++++++|+++++++ ..++|+.++.+.+.|++|.+..+ .+++++|+++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~----~~~~~~f~~i 150 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIK----GQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-ceEEEEecCCcceeeeccccccc----ccccccceee Confidence 778888999999999999999999999999999999999875 57999999999999999976543 2456799999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee------ec Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE------VV 154 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~------~~ 154 (305) ++.+||+++++++|+||++|+.+++++||.+++++++++++|++|++|+|++ +|.|+++.......... .. T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~---qP~Gil~~~~~~~~~~~g~~~~~~~ 227 (381) T protein:vir:95 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCC---CceeeeeccCccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999965 56777654332211111 01 Q ss_pred ccchhhhH---HHHHHHHHHHHhhh-------ccccceEEEEchHHHHHHHHhh---ccCCceeecccccCccceEecCc Q lcl|Aclame:pro 155 GGVANESD---IVGATNRAAKAVAS-------AGWAPDTLLSSLALRYEVANIR---DANGNPVFRDDSFAGFRTFFNRN 221 (305) Q Consensus 155 ~~~~~~~~---~~~~~~~~~~~~~~-------~~~~~~~~v~~~~~~~~l~~~k---d~~G~~l~~~~~l~G~pv~~~~~ 221 (305) .+..++.+ ..+.+..+...+.. .+.....|+||+.++..|++++ +++|+|+|..+ .|.+++.++. 
T Consensus 228 ~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~--~g~~vv~s~~ 305 (381) T protein:vir:95 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP--FNLNVIESTV 305 (381) T ss_pred ccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCC--CCceEEecCC Confidence 11111222 22333333333321 2344567999999999888765 67899987532 3666888887 Q ss_pred cccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc--ccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT--PVA 299 (305) Q Consensus 222 ~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t--~~a 299 (305) ++ ++.++||||++|++++|++++++++++.. |.+|++.+|+..|+|+.+.+|+||+.++.+ .+. T Consensus 306 ~p----~~~iifgDfs~Y~i~~r~~~~i~~~~~~~----------~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:95 306 QE----AGKVLTYVKGLYDGYLAGGINVQKFKETL----------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred CC----cCcEEEEecccEEEEEecccEEEeechhH----------hhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCC Confidence 76 45699999999999999999999988764 889999999999999999999999996655 577 Q ss_pred cccCCC Q lcl|Aclame:pro 300 VVAPAA 305 (305) Q Consensus 300 ~v~~a~ 305 (305) +++++. 
T Consensus 372 ~~~~~~ 377 (381) T protein:vir:95 372 PALEGT 377 (381) T ss_pred cCcccc Confidence 777777 No 80 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=4.4e-51 Score=296.66 Aligned_cols=281 Identities=14% Similarity=0.023 Sum_probs=223.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |...++++||++||+++.++|++.+++.++++++|+++++++ ..++|+.++.+.+.|++|.+..+ .+++++|+++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~----~~~~~~f~~i 150 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIK----GQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-ceEEEEecCCcceeeeccccccc----ccccccceee Confidence 778888999999999999999999999999999999999875 57999999999999999976543 2456799999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee------ec Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE------VV 154 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~------~~ 154 (305) ++.+||+++++++|+||++|+.+++++||.+++++++++++|++|++|+|++ +|.|+++.......... .. 
T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~---qP~Gil~~~~~~~~~~~g~~~~~~~ 227 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCC---CceeeeeccCccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999965 56777654332211111 01 Q ss_pred ccchhhhH---HHHHHHHHHHHhhh-------ccccceEEEEchHHHHHHHHhh---ccCCceeecccccCccceEecCc Q lcl|Aclame:pro 155 GGVANESD---IVGATNRAAKAVAS-------AGWAPDTLLSSLALRYEVANIR---DANGNPVFRDDSFAGFRTFFNRN 221 (305) Q Consensus 155 ~~~~~~~~---~~~~~~~~~~~~~~-------~~~~~~~~v~~~~~~~~l~~~k---d~~G~~l~~~~~l~G~pv~~~~~ 221 (305) .+..++.+ ..+.+..+...+.. .+.....|+||+.++..|++++ +++|+|+|..+ .|.+++.++. T Consensus 228 ~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~--~g~~vv~s~~ 305 (381) T protein:vir:10 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP--FNLNVIESTV 305 (381) T ss_pred ccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCC--CCceEEecCC Confidence 11111222 22333333333321 2344567999999999888765 67899987532 3666888887 Q ss_pred cccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc--ccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT--PVA 299 (305) Q Consensus 222 ~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t--~~a 299 (305) ++ ++.++||||++|++++|++++++++++.. |.+|++.+|+..|+|+.+.+|+||+.++.+ .+. 
T Consensus 306 ~p----~~~iifgDfs~Y~i~~r~~~~i~~~~~~~----------~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:10 306 QE----AGKVLTYVKGLYDGYLAGGINVQKFKETL----------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred CC----cCcEEEEecccEEEEEecccEEEeechhH----------hhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCC Confidence 76 45699999999999999999999988764 889999999999999999999999996655 577 Q ss_pred cccCCC Q lcl|Aclame:pro 300 VVAPAA 305 (305) Q Consensus 300 ~v~~a~ 305 (305) +++++. T Consensus 372 ~~~~~~ 377 (381) T protein:vir:10 372 PALEGT 377 (381) T ss_pred cCcccc Confidence 777777 No 81 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=2.1e-50 Score=292.88 Aligned_cols=266 Identities=16% Similarity=0.120 Sum_probs=217.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |+.+++++||++||+++..+|++.++++++|+++|+++|+++++.++|+... 
...+.|++|++..+ +.++++|++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~----~~~~~~~~~ 184 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENP----KLAEPEFNK 184 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCcccccccccccc----cccccccee Confidence 8889999999999999999999999999999999999999999999999864 46667898886532 245789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) +++.++|+++++++|+|+++||.+++++||.++|++++++++|.+|++|+|.+... ...+..+ T Consensus 185 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~-----------------~~~~~~~ 247 (389) T protein:vir:10 185 VDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAK-----------------KTTTDTL 247 (389) T ss_pred eeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc-----------------ccccccc Confidence 99999999999999999999999999999999999999999999999998754321 1122233 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-----------cccCccceEecCc--cccCC Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-----------DSFAGFRTFFNRN--GAWDA 226 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-----------~~l~G~pv~~~~~--~~~~~ 226 (305) ++++.+.+. ..+...+ .+.|+||+.++..|+++||++|+|+|++ .+|+|+||++.+. ++... 
T Consensus 248 ~d~l~~~~~---~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~ 322 (389) T protein:vir:10 248 VDSLKHILN---VDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLA 322 (389) T ss_pred HHHHHHHHH---hhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCC Confidence 344433332 1222222 4689999999999999999999999964 2699999987654 34455 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |++++|++++++++++.++ ...+|+..|+|+++.+|+||++++.++++..+|+= T Consensus 323 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-------------~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 323 GDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIY-------------GKYLGAAFRFGVQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred CceEEEEeeccccEEEEeecceEEEeeccccc-------------cceEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 677899999998 6789999999998876543 23578999999999999999999999887766666 No 82 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=5.3e-50 Score=290.72 Aligned_cols=266 Identities=15% Similarity=0.134 Sum_probs=216.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +..+++++||++||+++.++|++.+++.++|+++|+++++++++.++|+... 
...+.|++|++..++ ++.++|++ T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPA----LAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccc----ccccccee Confidence 7788999999999999999999999999999999999999999999998864 477899999875322 35679999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.... ..+... T Consensus 187 v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~-----------------~~~~~~ 249 (394) T protein:vir:10 187 VDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKA-----------------TTTDTL 249 (394) T ss_pred EEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-----------------cccccc Confidence 999999999999999999999999999999999999999999999999988653221 111222 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-----------cccCccceEecCc--cccCC Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-----------DSFAGFRTFFNRN--GAWDA 226 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-----------~~l~G~pv~~~~~--~~~~~ 226 (305) .+++.+. ........+ .+.|+||+.++..|+++||++|||+|++ .+|+|+||++.+. ++... 
T Consensus 250 ~d~l~~~----~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~ 324 (394) T protein:vir:10 250 VDSLKHI----LNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAA 324 (394) T ss_pred HHHHHHH----HHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCC Confidence 3333332 222222222 4689999999999999999999999964 3699999988654 45556 Q ss_pred CCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ++..++||||++ |+++++++++++.+++.+ |. ..+|+..|+|+.+.+|.+|+.++.++++.=++|+ T Consensus 325 ~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~----------~~---~~~~~~~r~d~~~~~~~ai~~~~~~~~~~~~~~~ 391 (394) T protein:vir:10 325 GDQKAFVGDLKRGVLFADRQQVTLAWEDSKI----------YG---RYLGAAFRFGVKQADSNAGYFVTNTDAASGSTSG 391 (394) T ss_pred CceEEEEeeccccEEEEeecceEEEEecccc----------cc---eeEEEEEEeccEEeccccEEEEEeecccCCCCCC Confidence 777899999998 567889999998876543 22 3579999999999999999999999887755555 No 83 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=6.3e-50 Score=290.32 Aligned_cols=291 Identities=14% Similarity=0.099 Sum_probs=226.8 Q ss_pred CCCccCCccceEccHHH-HHHHHHHHHhhhhhhhhcceeecCC--CceEEEEEeCC-Cceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAY-SDTLLAAAKQGSTVLSAFQNVNMGT--KTTHLPVLATL-PEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~-~~~i~~~~~~~~~l~~l~~~~~~~~--~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) .-+++++.||++||+++ .++|++.+++.++++++++++++++ +++++|+.++. 
..+.|++|+...++..+|.++++ T Consensus 156 ~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~ 235 (477) T protein:vir:84 156 DLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLT 235 (477) T ss_pred cccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccc Confidence 11344555788887774 6889999999999999999998764 46899997655 45778999998888889999999 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeec-c Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVV-G 155 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~-~ 155 (305) |++++++++|++++++||+|+++||.+++++||.++|++++++++|.+||+|+|+.. +|.|+++.........+.. . T Consensus 236 f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~--~p~Gi~~~~~~~~~~~~~~~~ 313 (477) T protein:vir:84 236 DGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNN--QVVGVRATAGITQVTATSAGS 313 (477) T ss_pred eeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCC--ccceeeecccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998643 5677776554333222211 1 Q ss_pred cchhhhHHHHHHHHHHHHhhhccc-cceEEEEchHHHHHHHHhhccCCceeeccc--------------------ccCcc Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGW-APDTLLSSLALRYEVANIRDANGNPVFRDD--------------------SFAGF 214 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~--------------------~l~G~ 214 (305) +........+.+.++...+...+. 
....|+|||.++..|+++||++|+|+|+++ +++|+ T Consensus 314 t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~ 393 (477) T protein:vir:84 314 ALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGL 393 (477) T ss_pred chhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhccc Confidence 122233344555555555555544 345799999999999999999999999752 78999 Q ss_pred ceEecCccccCC----CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEee-cccc Q lcl|Aclame:pro 215 RTFFNRNGAWDA----DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLG-VSAT 289 (305) Q Consensus 215 pv~~~~~~~~~~----~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~-~p~a 289 (305) ||++++.+|.+. +...++||||++++++. .++.++++++.+ +.+.++.+|+..++++..+ +|++ T Consensus 394 pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~----------~~~~~~~~~v~~~~~~~~~r~~~a 462 (477) T protein:vir:84 394 PVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETR----------AENLSVLLQVYGYLAFTAARFPQS 462 (477) T ss_pred ceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccc----------cccceeeeeehhhhhhhhhccccc Confidence 999999998653 33479999999998886 477887765543 3456677788777777555 5999 Q ss_pred eEEEeccccccccCC Q lcl|Aclame:pro 290 AQGANKTPVAVVAPA 304 (305) Q Consensus 290 ~~~~~~t~~a~v~~a 304 (305) |+.++++..+..|-| T Consensus 463 fv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 463 VVEIGGTALTAPTFA 477 (477) T ss_pred eEEeecccccccccC Confidence 999999977666666 No 84 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=2.1e-49 Score=287.46 Aligned_cols=279 Identities=11% Similarity=0.079 Sum_probs=229.0 Q ss_pred CCCcc-CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC--------CCceeeeecchhhcccccc Q lcl|Aclame:pro 1 MADIS-RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT--------LPEADWVGESATDPKGVKP 71 (305) Q Consensus 1 
Ma~~t-~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~--------~~~a~~v~E~~~~~~~~~~ 71 (305) +...+ +..++.++|+.+.+.|+...+....++++|+++++.++.+++|+.++ ...+.|++|++. +| T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~ 197 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTA-----KP 197 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCcc-----cc Confidence 33333 34455677788777788888888899999999999999999988654 345889999875 67 Q ss_pred cccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccce Q lcl|Aclame:pro 72 TSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 72 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) +++++|+++++++||++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++ +|.|+++......... T Consensus 198 ~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~aii~G~G~~---~p~Gi~~~~~~~~~~~ 273 (419) T protein:vir:94 198 QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGST---EMQGILTTPGIGTYQQ 273 (419) T ss_pred ccccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcc---cccceecccccccccc Confidence 7889999999999999999999999999875 79999999999999999999999999975 5677776555444333 Q ss_pred eecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCcee-ec-------ccccCccceEecCccc Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPV-FR-------DDSFAGFRTFFNRNGA 223 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l-~~-------~~~l~G~pv~~~~~~~ 223 (305) .......+....++++.+++..+...++.+++|+|||.++..|++++|++|+++ ++ +.+++|+||++++.++ T Consensus 274 ~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~ 353 (419) T protein:vir:94 274 PKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA 353 (419) T ss_pred 
cccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCC Confidence 333444455566778888888888888888999999999999999999877654 44 2489999999999886 Q ss_pred cCCCCceEEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc Q lcl|Aclame:pro 224 WDADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) Q Consensus 224 ~~~~~~~~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~ 302 (305) +++++||||++ |++.++++++++++++.. ++|.+|++.+|++.|+|+.+.+|++|++++.+++. | T Consensus 354 ----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~--------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~--~ 419 (419) T protein:vir:94 354 ----QGTALVGGFRQGATLWSRQGITVLMTDSHA--------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT--T 419 (419) T ss_pred ----CccEEEeeccceEEEEEecceEEEEecccc--------chhhcCcEEEEEEEeeccEEeccccEEEEEeccCC--C Confidence 56799999998 567889999998876542 35889999999999999999999999999998753 3 No 85 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.6e-49 Score=288.08 Aligned_cols=257 Identities=19% Similarity=0.200 Sum_probs=213.0 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |.. +++++||++||+++.++|++.+++.++|+++++++++++++.++|+.. 
..+.+.|++|++..+ ..++++|+ T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~----~~~~~~f~ 208 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNP----AMAKPEFK 208 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCcccccccccccc----ccccccce Confidence 333 577789999999999999999999999999999999999999999986 457789999987632 23567999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) ++++.++|++++++||+|+++||.+++++||.++++++++.++|.++++|+|.+... +.. T Consensus 209 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~--------------------~~~ 268 (400) T protein:vir:38 209 PVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAK--------------------TIS 268 (400) T ss_pred eeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc--------------------ccc Confidence 999999999999999999999999999999999999999999999999998854321 111 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccC-CCCce Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWD-ADAAI 230 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~-~~~~~ 230 (305) +++++.+.+. ..... ...+.|+|||.++..|+++||++|+|+|++ .+++|+||+++++++.. .++.. 
T Consensus 269 ~~~~~~~~~~----~~~~~-~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~ 343 (400) T protein:vir:38 269 SVDDLKHINN----VDLDP-AYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAH 343 (400) T ss_pred cHHHHHHHHH----hhhhh-hhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceE Confidence 2333333322 21222 234689999999999999999999999974 47999999999987743 46778 Q ss_pred EEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 231 ~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) ++||||++ |++++|++++++.+++.+ +...+|+..|+|+.+.+|.+|++++.+|+| T Consensus 344 ~~~gd~s~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 344 AFLGDIKRAILFANRADFMVRWVDDQI-------------YGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEeccccEEEEeecceEEEEecccc-------------cceeEEEEEEeccEEecccceEEEEeecCC Confidence 99999998 567789999999877543 235789999999999999999999999887 No 86 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=7.2e-49 Score=284.53 Aligned_cols=258 Identities=19% Similarity=0.156 Sum_probs=210.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) ...+++++||+++|+++.++|++.+++.++|+++|+++++.+++.++|+.. 
+...+.|++|++..++ .++++|++ T Consensus 128 ~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~----~~~~~~~~ 203 (394) T protein:vir:97 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPA----LAKPDFKD 203 (394) T ss_pred ccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccc----ccccccee Confidence 445677889999999999999999999999999999999999999999986 4567899999976322 34679999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) +++.+||++++++||+|+++|+.+++++||.++|++++++++|.+|++|.+++.+. +..+ T Consensus 204 v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~--------------------~~~~ 263 (394) T protein:vir:97 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK--------------------TVKN 263 (394) T ss_pred EEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------cccc Confidence 99999999999999999999999999999999999999999999999987754321 1123 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccCCCCceEE Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWDADAAIEV 232 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~~~~~~~~ 232 (305) ++++.+.+... ... ...+.|+|||.++..|+++||++|+|+|++ .+++|+||++.+... 
.++++++ T Consensus 264 ~~~~~~~~~~~----~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~~~ 336 (394) T protein:vir:97 264 LDEIKALLNGG----FDP-AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV--LGANKAF 336 (394) T ss_pred HHHHHHHHHhh----hhh-hhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEecccc--cCCccEE Confidence 34444433322 222 234689999999999999999999999975 379999999876533 4567799 Q ss_pred EEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 233 IADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 233 ~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ||||++ |++++|++++++.+++. ++...+|+..|+|+.+.+|++|++++.++++ +|= T Consensus 337 ~gd~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~--~p~ 394 (394) T protein:vir:97 337 IGDFKRGVLFADRKDLGLRWADNE-------------IYGQYLQAVLRFGVSKVDDKAGYYVTFTPEP--LPL 394 (394) T ss_pred EeeccccEEEEEecceEEEEeccc-------------ccceeEEEEEEEccEEecccceEEEEecccc--cCC Confidence 999987 56888999999876543 2345789999999999999999999998652 233 No 87 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.7e-49 Score=284.68 Aligned_cols=279 Identities=14% Similarity=0.049 Sum_probs=215.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |...+.++||++||+++.++|++.+++.++++++|+++++++ ..++|+.++.+.+.|++|.... 
+++++++|+++ T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~----~~~~~~~f~~i 160 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI-KTRVIKADPAGQAVWGKVFGEI----KGQLDAAFREE 160 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeeccccc----Cccccccceee Confidence 777889999999999999999999999999999999999865 6799999999999999876543 24567899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee--ecccch Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE--VVGGVA 158 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 158 (305) ++++||++++++||+||++|+.+++++||.++|++++++++|++|++|+|++.. +|.|+++.......... ...... T Consensus 161 ~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~-qP~Gil~~~~~~~~~~~~~~~~~~~ 239 (395) T protein:vir:95 161 NFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKT-QPVGLMKDVNTNSGAVTDKASSGTL 239 (395) T ss_pred eeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCc-Cceeeeecccccccccccccccchh Confidence 999999999999999999999999999999999999999999999999997643 57777765433322111 111122 Q ss_pred hhhHHH---HHHHHHHHHh-------hhccccceEEEEchHHHHHHHHhhccCCceeecc-----cccC--ccceEecCc Q lcl|Aclame:pro 159 NESDIV---GATNRAAKAV-------ASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-----DSFA--GFRTFFNRN 221 (305) Q Consensus 159 ~~~~~~---~~~~~~~~~~-------~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-----~~l~--G~pv~~~~~ 221 (305) +++++. ..+..+...+ ...+.....|+||+.++. |..|+|+|++ .+++ |+|++.++. 
T Consensus 240 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~ 313 (395) T protein:vir:95 240 TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQARYTYLTANGGFVTVLPYNVTIITSEF 313 (395) T ss_pred hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCCcceeccCCCcceeccCCcceEEEcCC Confidence 222222 2222222221 112233457999998764 4567888865 2444 556788888 Q ss_pred cccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc----- Q lcl|Aclame:pro 222 GAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT----- 296 (305) Q Consensus 222 ~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t----- 296 (305) +| ++.++||||++|++++|++++++++++.+ |.+|++.+|+..|+|+.+.|+.||+.++.+ T Consensus 314 ~p----~~~i~fgdfs~y~i~~r~~~~i~~~~~~~----------~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~ 379 (395) T protein:vir:95 314 VP----EGKLVAFVTDRYNAVRGGGLTVKKFDQTL----------ALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAP 379 (395) T ss_pred CC----CCcEEEEecccEEEEEecceEEEeccchh----------hhCCcEEEEEEEEECCEEeccccEEEEEeeccCCC Confidence 76 45699999999999999999999988754 788999999999999999999999998877 Q ss_pred ccccccCCC Q lcl|Aclame:pro 297 PVAVVAPAA 305 (305) Q Consensus 297 ~~a~v~~a~ 305 (305) +....+||. 
T Consensus 380 ~~~~~~~~~ 388 (395) T protein:vir:95 380 RRQTSAGGT 388 (395) T ss_pred CCCCCCCCC Confidence 333334444 No 88 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=5.5e-49 Score=285.17 Aligned_cols=272 Identities=12% Similarity=0.016 Sum_probs=215.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..++.++||++||+++.++|++.+.+.++++++|+++++++ ..++|+.++.+.+.|++|++... ++++++|+++ T Consensus 79 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~wv~e~~~~~----~~~~~~f~~i 153 (377) T protein:vir:96 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIK----GQLKQAFKEQ 153 (377) T ss_pred HhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceeEeecccccc----cccCccceeE Confidence 677888899999999999999999999999999999999865 68999999999999999986532 3457899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee------- Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV------- 153 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~------- 153 (305) ++.+||++++++||+||++|+.+++++||.+++++++++++|++|++|+|++ +|.|+++........... 
T Consensus 154 ~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) T protein:vir:96 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL---QPVGLLKDLSQPTVDQSTGRDITTY 230 (377) T ss_pred eeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCC---cceeeeeccccccccccccccccce Confidence 9999999999999999999999999999999999999999999999999976 577777643322211110 Q ss_pred --------cccchhhhHHHHHHHHHHHHhhhcc-------ccceEEEEchHHHHHH---HHhhccCCceeecccccCccc Q lcl|Aclame:pro 154 --------VGGVANESDIVGATNRAAKAVASAG-------WAPDTLLSSLALRYEV---ANIRDANGNPVFRDDSFAGFR 215 (305) Q Consensus 154 --------~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~v~~~~~~~~l---~~~kd~~G~~l~~~~~l~G~p 215 (305) .....+.+.+.+.+..+...+...+ ...+.|+||+.++..+ ...++++|+|. +++|+| T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~----~~l~~p 306 (377) T protein:vir:96 231 KTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV----TVLPHG 306 (377) T ss_pred eeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccCCCCCce----eccCCC Confidence 1111223334444444444333221 2345799999988776 34566777664 677777 Q ss_pred e--EecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEE Q lcl|Aclame:pro 216 T--FFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGA 293 (305) Q Consensus 216 v--~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~ 293 (305) + +.++.+| ++.++||||++|++++|++++++.+++.. 
|.+|++.+|+..|+|+.+.+|++|+.+ T Consensus 307 ~~v~~s~~~p----~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~----------~~~d~~~f~~~~r~dG~~~d~~a~~vl 372 (377) T protein:vir:96 307 ITILESLAVE----TGKAIAFVANRYDAFMATASTIEEYDQTF----------AMEDLQLYLTKNYFYGKAKDNHTAALL 372 (377) T ss_pred ceEEecCCCC----cccEEEEEcCcEEEEEecccEEEeehhhh----------hhcCCeEEEEEEEEcCEEecCCcEEEE Confidence 4 4555565 45699999999999999999999988754 788999999999999999999999999 Q ss_pred ecccc Q lcl|Aclame:pro 294 NKTPV 298 (305) Q Consensus 294 ~~t~~ 298 (305) +.+.- T Consensus 373 ~l~~~ 377 (377) T protein:vir:96 373 TLAGG 377 (377) T ss_pred EEecC Confidence 98754 No 89 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=5.2e-49 Score=285.31 Aligned_cols=267 Identities=13% Similarity=0.032 Sum_probs=213.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) ++..+++++|++||+++.+.|. .+++.+.+++++++++++++..++|+.. 
..+.+.|++|++..++ .+.++|++ T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e----~~~~~~~~ 230 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTK----NATPVITP 230 (437) T ss_pred hhhcccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccc----ccccccee Confidence 6777888999999999987665 5678889999999999999999999985 4578999999876432 34578999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) +++.++|+++++++|+|+++|+.+++++||.++|++++++++|.+|++|+|++... ..+... T Consensus 231 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~------------------~~~~~~ 292 (437) T protein:vir:10 231 ILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKK------------------TTSTYL 292 (437) T ss_pred eeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc------------------cccccc Confidence 99999999999999999999999999999999999999999999999999865321 011112 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcc--c-cCCCCc Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNG--A-WDADAA 229 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~--~-~~~~~~ 229 (305) .+++.+.+. ..+...+...+.|+||+.++..|+++||++|+|+|++ .+|+|+||++++++ | ...++. 
T Consensus 293 ~~~~~~~~~---~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 369 (437) T protein:vir:10 293 LGDLKKVLN---VTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDV 369 (437) T ss_pred hhhHHHHHH---hhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCce Confidence 233333222 2455566667789999999999999999999999964 47999999987654 3 234667 Q ss_pred eEEEEehhhE-EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc-ccccccCCC Q lcl|Aclame:pro 230 IEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT-PVAVVAPAA 305 (305) Q Consensus 230 ~~~~gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t-~~a~v~~a~ 305 (305) +++||||+++ .+.+|.+++++.++. |..+...+|+..|+|+.+.+|+||++++++ ++.+++++| T Consensus 370 ~~~~gd~~~~~~~~~r~~~~~~~~~~------------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~ 435 (437) T protein:vir:10 370 NIVVAPLKKAVINFKLTEITGQFQDT------------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQST 435 (437) T ss_pred EEEEeeccccEEEEeeeceEEEEecc------------cccccceeeEEEEEccEEecccceEEEEeeccccccCCCC Confidence 7999999975 578899999976532 345567889999999999999999999966 444455555 No 90 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=1.2e-48 Score=283.32 Aligned_cols=266 Identities=12% Similarity=0.116 Sum_probs=214.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++++ ..+|+.+. .+.+.|++|++. 
+++++++|++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~-----~~~~~~~f~~ 155 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET-----AKELKLKGDT 155 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceEEEEecCCCcccccccccc-----ccccccccee Confidence 888889999999999999999999999999999999998764 56787654 478999999876 5667789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHH-HHHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQ-AVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~-a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++.+||++++++||+|+++||.+++++||.++|+++++++++. .+.+|+|.+ ++.+++...... ..++.. T Consensus 156 v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~---~~~g~l~~~~~~-----~~t~~~ 227 (352) T protein:vir:78 156 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---LEHMSFYNGSVK-----EVEGAN 227 (352) T ss_pred eeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCc---ccccceeccccc-----cccccc Confidence 99999999999999999999999999999999999999998655 444555544 234444332211 112222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec--ccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) . ++.+.++...+...+...+.|+||+.++..|++++|.+|+|+|. 
+.+++|+||++++.+ ..++|||| T Consensus 228 ~----~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~~~~~llG~PV~~~~~~------~~~~~Gdf 297 (352) T protein:vir:78 228 M----YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDAA------VKPIVGDF 297 (352) T ss_pred h----HHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccccCCccccccceEEecCC------CceeEeeh Confidence 2 44555666677777777889999999999999999999999985 568999999988753 45789999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++|++. +.++.++..++. ..+++.+++..|+|+++.||+||+.++.++++.-.|+ T Consensus 298 ~~~~~~-~~~~~~~~~~~~------------~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 298 NYFGIN-YDGTTYDTDKDV------------KKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred hhhhhh-hhhheeeeeccc------------cCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 998775 455666554432 3578999999999999999999999999988888999 No 91 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=8.1e-48 Score=278.77 Aligned_cols=279 Identities=14% Similarity=0.051 Sum_probs=212.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |...++++||++||+++.++|++.+++.++++++|+++++++ ..++|+.++.+.+.|++|..... 
..++++|+++ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~----~~~~~~f~~i 157 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL-RTKFLKSETSGVAVWGKIFGEIK----GQLDATFSDE 157 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC-ceEEEEEcCCcceEEeecccccc----cccCcceeeE Confidence 889999999999999999999999999999999999999876 47999999999999999976532 3456799999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee------c Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV------V 154 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~------~ 154 (305) ++.+||+++++++|+||++|+.+++++||.++++++|++++|++|++|+|++ +|.|+++........... . T Consensus 158 ~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~---qP~Gil~~~~~~~~~~~~~~~~~~~ 234 (383) T protein:vir:78 158 ESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGND---KPIGLNRKVGKGSTVVDGVYAEKAA 234 (383) T ss_pred eecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCC---CceeeeeccCCcccccccccccccc Confidence 9999999999999999999999999999999999999999999999999965 577777643322221111 1 Q ss_pred ccchhhhHHHHHHHHHHHHh--hh--------ccccceEEEEchHHHHHHH---HhhccCCceeecccccCccc--eEec Q lcl|Aclame:pro 155 GGVANESDIVGATNRAAKAV--AS--------AGWAPDTLLSSLALRYEVA---NIRDANGNPVFRDDSFAGFR--TFFN 219 (305) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~--~~--------~~~~~~~~v~~~~~~~~l~---~~kd~~G~~l~~~~~l~G~p--v~~~ 219 (305) .+..++.++......+.... .. .......|+||+..+..+. ...+.+|+|. 
+++|+| ++.+ T Consensus 235 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~----t~l~~~~~iv~s 310 (383) T protein:vir:78 235 TGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYV----TALPFNLNIIES 310 (383) T ss_pred cchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCcee----eecCCCceEEec Confidence 11222222222211111100 00 0112345888886554332 2346677765 455666 5566 Q ss_pred CccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc-cc Q lcl|Aclame:pro 220 RNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT-PV 298 (305) Q Consensus 220 ~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t-~~ 298 (305) +.++ ++.++||||++|++++|++++++++++.+ |.+|++.+|+..|+|+.+.||+||+.++.+ .. T Consensus 311 ~~~p----~~~iifgdfs~Y~i~~r~~~~i~~~~~~~----------f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~ 376 (383) T protein:vir:78 311 LFVP----EKKAISYVAERYDALIGGPLDIGTYDQTL----------AIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINP 376 (383) T ss_pred CCCC----cccEEEeeccceEEEecccceEEecchhh----------hhcCceEEEEEEEEcCEEecCCeEEEEEEEecC Confidence 6665 45689999999999999999999988764 788999999999999999999999997755 33 Q ss_pred ccccCCC Q lcl|Aclame:pro 299 AVVAPAA 305 (305) Q Consensus 299 a~v~~a~ 305 (305) +..+|+- T Consensus 377 ~~~~~~~ 383 (383) T protein:vir:78 377 AEQTPEG 383 (383) T ss_pred CCCCCCC Confidence 4556766 No 92 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=3.5e-48 Score=280.74 Aligned_cols=266 Identities=12% Similarity=0.129 Sum_probs=212.6 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 
79 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++++ ..+|+.+ +...+.|++|++. ++.++++|++ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~-----~~~~~~~f~~ 205 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET-----AKELKAKGDT 205 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCcccccccccc-----ccccccccce Confidence 788889999999999999999999999999999999999864 5678765 4578999999876 5566789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHH-HHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQA-VIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a-~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++.+||++++++||+||++||.+++++||.++|+++++++++.. |..|+|.+ ++.+++..... ..+.+. T Consensus 206 i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---~p~g~~~~~~~-----~~~~~~- 276 (402) T protein:vir:93 206 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---LEHMSFYNGSV-----KEVEGA- 276 (402) T ss_pred eeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---ccceeeecccc-----cccccc- Confidence 999999999999999999999999999999999999999997654 44566544 34444433221 111222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec--ccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) ..++.+.+++..+...+...+.|+||+.++..++++++.+|+++|. 
+.+++|+||++++.+ .+++|||| T Consensus 277 ---~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~~~~~~llG~PV~~t~~~------~~i~~GDf 347 (402) T protein:vir:93 277 ---DMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDAA------VKPIVGDF 347 (402) T ss_pred ---chHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecCC------Cceeeech Confidence 2345566677777777777889999999988776666666777774 568999999998754 35899999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++|++.. .++.++.++++ ..+++.+|+..|+|+.+.+|+||+.++.++++..+|. T Consensus 348 ~~~~~~~-~~~~~~~~~~~------------~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 348 NYFGINY-DGTTYDTDKDV------------KKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred hhhhhhh-hhhhhhhhhcc------------cCCceEEEEEEEeCcEEechhheEEEEeecCCCCCCC Confidence 9987654 34545544332 2478999999999999999999999999999999999 No 93 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=1.2e-47 Score=277.78 Aligned_cols=282 Identities=13% Similarity=0.096 Sum_probs=216.6 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) +.. .+.++++.+||+++.+.|++.+++.++++++|++.++++ ..++|+....+.+.|++|++. 
+++++++|++ T Consensus 148 ~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g-~~~~~~~~~~~~a~wv~E~~~-----~~~~~~~f~~ 221 (466) T protein:vir:80 148 AQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG-TARQNIAGAIPEGVWTEAVAN-----LNELSLSFSQ 221 (466) T ss_pred hhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc-eeEeeeecCCcceeecccccc-----cccccccccc Confidence 222 233445689999999999999999999999999999875 578999888899999999875 5667889999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc-- Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV-- 157 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 157 (305) +++.+||++++++||+||++||.+++++||.++|++++++++|.+||+|+|++. |.|+++............... T Consensus 222 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~---P~Gil~~~~~~~~~~~~~~~~~~ 298 (466) T protein:vir:80 222 IEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKM---PVGIVTRLAQTTQPPNWGTKAPA 298 (466) T ss_pred eeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCC---cceeeeccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999763 556665432221111110000 Q ss_pred ---hhhhHH-------------HHHHHHHHHHhhhcc-ccceEEEEchHHHHHHHHhh---ccCCceeecc---cccCcc Q lcl|Aclame:pro 158 ---ANESDI-------------VGATNRAAKAVASAG-WAPDTLLSSLALRYEVANIR---DANGNPVFRD---DSFAGF 214 (305) Q Consensus 158 ---~~~~~~-------------~~~~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~~k---d~~G~~l~~~---~~l~G~ 214 (305) .+..++ +..+..........+ .....|+||+.++..|.+++ +.+|.+++.+ .+++|+ T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~ 378 (466) T protein:vir:80 299 WTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNTMPIVGG 378 (466) T ss_pred ccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCCccccccc Confidence 000000 111111111112222 33346999999999999887 6778887754 348899 Q ss_pred 
ceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEe Q lcl|Aclame:pro 215 RTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGAN 294 (305) Q Consensus 215 pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~ 294 (305) ||+++++++ ++.+++|||+.|++++|+++++.++++.. |.+|++.+|+..|+|+++.+|++|+.++ T Consensus 379 pvv~s~~~~----~~~~~~g~~~~y~i~~r~~~~i~~~~~~~----------f~~d~~~~r~~~r~dg~~~~~~afv~~~ 444 (466) T protein:vir:80 379 DIVILDFIP----DNDIIGGYGSLYLLAERADIKLAQSEHVR----------FIEDQTVFKGTARYDGKPVFGEGFVAVN 444 (466) T ss_pred ceeecCccC----ccceeeeccccEEEEeecceEEEechhhh----------hhcCcEEEEEEEEEccEEeccCceEEEE Confidence 999999886 45599999999999999999999887643 7889999999999999999999999998 Q ss_pred ccc-cccccCCC Q lcl|Aclame:pro 295 KTP-VAVVAPAA 305 (305) Q Consensus 295 ~t~-~a~v~~a~ 305 (305) .+. ..+|++.+ T Consensus 445 ~~~~~~~~~~~~ 456 (466) T protein:vir:80 445 IANANPTTSITF 456 (466) T ss_pred ecCCCcccceee Confidence 763 33455555 No 94 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=3.5e-47 Score=275.24 Aligned_cols=266 Identities=12% Similarity=0.117 Sum_probs=210.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++|+++++++ ..+|+.. +...+.|++|++. 
++.++++|++ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~-----~~~~~~~f~~ 190 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET-----AKELKLKGDT 190 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCccc-----ccccccccce Confidence 888899999999999999999999999999999999999864 5678765 4578999999876 5566789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHH-HHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQA-VIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a-~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++.++|++++++||+||++||.+++++||.++++++++++++.. |.+|+|.+. +.+++..... ..+.+. T Consensus 191 v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~---p~g~l~~~~~-----~~v~~~- 261 (387) T protein:vir:93 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGL---DHMSFYNGSV-----KEVEGA- 261 (387) T ss_pred eeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccc---cceeeecccc-----cccccc- Confidence 999999999999999999999999999999999999999997765 445666543 3444432211 111222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHH-HHhhccCCceee-cccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEV-ANIRDANGNPVF-RDDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~~kd~~G~~l~-~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) ..++.+.++...+...+...+.|+||+.++..+ ++++|.+|++++ .+.+++|+||++++.. 
.+++|||| T Consensus 262 ---~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~~~~~~llG~PV~~~~~~------~~~~~GDf 332 (387) T protein:vir:93 262 ---DMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDAA------VKPIVGDF 332 (387) T ss_pred ---chHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecCC------Cceeeeeh Confidence 234556666777777777888999999887665 555666555444 3568999999998753 35799999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++|++. +.++.++..++ +.++++.+++..|+|+.+.+|+||+.++.++++.-+|+ T Consensus 333 ~~~~~~-~~~~~~~~~~~------------~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 333 NYFGIN-YDGTTYDTDKD------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred hhhhee-hhhheeeeccc------------ccCCceeEEEEeeeCceeechhheEEEEeecCCCCCCC Confidence 998775 44565554433 34678999999999999999999999999888888999 No 95 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=2.6e-47 Score=276.02 Aligned_cols=266 Identities=12% Similarity=0.123 Sum_probs=212.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++++++++++ ..+|+.. +...+.|++|++. 
+++++++|++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~-----~~~~~~~f~~ 190 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET-----AKELKAKGDT 190 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCcccccccccc-----ccccccccce Confidence 788888999999999999999999999999999999999864 5678765 4578999999886 5667789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHH-HHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQA-VIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a-~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++.++|++++++||+||++||.+++++||.++|+++++++++.. |.+|+|++ ++.+++..... ....+.. T Consensus 191 v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---~~~g~~~~~~~-----~~~~~~~ 262 (387) T protein:vir:26 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---LEHMSFYNGSV-----KEVEGAD 262 (387) T ss_pred eeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---ccceeeecccc-----ccccccc Confidence 999999999999999999999999999999999999999997655 44555544 23444432211 1122222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec--ccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) .++.+.++...+...+...+.|+||+.++..+.++++.+|+++|. +.+++|+||++++.. 
.+++|||| T Consensus 263 ----~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~~~llG~PV~~~~~~------~~~~~GDf 332 (387) T protein:vir:26 263 ----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDAA------VKPIVGDF 332 (387) T ss_pred ----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecCC------Cceeeech Confidence 345566666677777777889999999988877777777778874 568999999998753 35899999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++|++.. .++.++.+++. ..+++.+|+..|+|+.+.+|+||+.++.++++-.+|- T Consensus 333 ~~~~~~~-~~~~~~~~~~~------------~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 333 NYFGINY-DGTTYDTDKDV------------KKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhhhh-hhhhheecccc------------cCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 9987654 45555554432 2578999999999999999999999999988888998 No 96 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=2.6e-47 Score=276.02 Aligned_cols=266 Identities=12% Similarity=0.123 Sum_probs=212.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++++++++++ ..+|+.. +...+.|++|++. 
+++++++|++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~-----~~~~~~~f~~ 190 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET-----AKELKAKGDT 190 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCcccccccccc-----ccccccccce Confidence 788888999999999999999999999999999999999864 5678765 4578999999886 5667789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHH-HHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQA-VIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a-~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++.++|++++++||+||++||.+++++||.++|+++++++++.. |.+|+|++ ++.+++..... ....+.. T Consensus 191 v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---~~~g~~~~~~~-----~~~~~~~ 262 (387) T protein:vir:96 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---LEHMSFYNGSV-----KEVEGAD 262 (387) T ss_pred eeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---ccceeeecccc-----ccccccc Confidence 999999999999999999999999999999999999999997655 44555544 23444432211 1122222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec--ccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) .++.+.++...+...+...+.|+||+.++..+.++++.+|+++|. +.+++|+||++++.. 
.+++|||| T Consensus 263 ----~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~~~llG~PV~~~~~~------~~~~~GDf 332 (387) T protein:vir:96 263 ----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDAA------VKPIVGDF 332 (387) T ss_pred ----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecCC------Cceeeech Confidence 345566666677777777889999999988877777777778874 568999999998753 35899999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++|++.. .++.++.+++. ..+++.+|+..|+|+.+.+|+||+.++.++++-.+|- T Consensus 333 ~~~~~~~-~~~~~~~~~~~------------~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 333 NYFGINY-DGTTYDTDKDV------------KKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhhhh-hhhhheecccc------------cCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 9987654 45555554432 2578999999999999999999999999988888998 No 97 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=2.6e-47 Score=276.02 Aligned_cols=266 Identities=12% Similarity=0.123 Sum_probs=212.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) |..+++++||++||+++.++|++.+++.++|+++++++++++ ..+|+.. +...+.|++|++. 
+++++++|++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~-----~~~~~~~f~~ 190 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET-----AKELKAKGDT 190 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCcccccccccc-----ccccccccce Confidence 788888999999999999999999999999999999999864 5678765 4578999999886 5667789999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHH-HHcCcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQA-VIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a-~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) +++.++|++++++||+||++||.+++++||.++|+++++++++.. |.+|+|++ ++.+++..... ....+.. T Consensus 191 v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---~~~g~~~~~~~-----~~~~~~~ 262 (387) T protein:vir:94 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---LEHMSFYNGSV-----KEVEGAD 262 (387) T ss_pred eeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---ccceeeecccc-----ccccccc Confidence 999999999999999999999999999999999999999997655 44555544 23444432211 1122222 Q ss_pred hhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec--ccccCccceEecCccccCCCCceEEEEeh Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~~~~~~~~~~gdf 236 (305) .++.+.++...+...+...+.|+||+.++..+.++++.+|+++|. +.+++|+||++++.. 
.+++|||| T Consensus 263 ----~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~~~llG~PV~~~~~~------~~~~~GDf 332 (387) T protein:vir:94 263 ----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDAA------VKPIVGDF 332 (387) T ss_pred ----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecCC------Cceeeech Confidence 345566666677777777889999999988877777777778874 568999999998753 35899999 Q ss_pred hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 237 ~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) ++|++.. .++.++.+++. ..+++.+|+..|+|+.+.+|+||+.++.++++-.+|- T Consensus 333 ~~~~~~~-~~~~~~~~~~~------------~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 333 NYFGINY-DGTTYDTDKDV------------KKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhhhh-hhhhheecccc------------cCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 9987654 45555554432 2578999999999999999999999999988888998 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=6.2e-46 Score=268.44 Aligned_cols=255 Identities=14% Similarity=0.131 Sum_probs=209.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCCceeeeecchhhccccccccccccee Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~~~~~f~~ 79 (305) ++.+++.+++.++|+++.+.|++ +++...++++|+++++++++..+|+.. 
+...+.|++|++..+ ..++++|++ T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~----~~~~~~~~~ 206 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNP----QLANPKMVE 206 (397) T ss_pred hhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCcccccccccccc----ccccccccc Confidence 77788888999999999999987 567788999999999999999999875 457788999987632 235689999 Q ss_pred EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchh Q lcl|Aclame:pro 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) Q Consensus 80 v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) ++++++|+++++++|+|+++|+.++++++|.++|++++++++|.+|++|+|.+.+. +..+ T Consensus 207 i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~--------------------~~~~ 266 (397) T protein:vir:96 207 IDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAK--------------------SVVG 266 (397) T ss_pred eeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------cccc Confidence 99999999999999999999999999999999999999999999999998865321 1223 Q ss_pred hhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCcc--ccCCCCce Q lcl|Aclame:pro 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNG--AWDADAAI 230 (305) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~--~~~~~~~~ 230 (305) ++++.+.+... ...+ ..+.|+|||.++..|+++||++|+|+|++ .+|+|+||++.+.. ....++.. 
T Consensus 267 ~d~~~~~~~~~----~~~~-~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 341 (397) T protein:vir:96 267 VDGLKDLINKE----IKKV-YDVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVV 341 (397) T ss_pred hHHHHHHHHHh----hhhh-cCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceE Confidence 44444443322 2222 35689999999999999999999999964 37999999876653 23456678 Q ss_pred EEEEehhh-EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 231 EVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 231 ~~~gdf~~-~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ++||||++ |++++|+++++..+++.+ ....+|+..|+|+.+.+|++|++++.+.+ T Consensus 342 ~~~gd~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 342 GFIGDAKAFASFFDRKQVSVSWVDNNI-------------YGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEeehhcceEeEeecceEEEEecccc-------------cceeEEEEEEEccEEecccceEEEEeecC Confidence 99999997 568899999998876543 24578999999999999999999999876 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=5.5e-39 Score=230.32 Aligned_cols=281 Identities=9% Similarity=0.034 Sum_probs=211.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeCC-Cceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-MGTKTTHLPVLATL-PEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |.. +..+||+|+|++. +++++.+++.+++++++++++ +++....+|+...+ ....|..|++.. 
...++++++|+ T Consensus 14 it~-~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~--~~~~~~~~tf~ 89 (314) T protein:vir:41 14 IDV-PDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTK--VAPTADEVTVS 89 (314) T ss_pred ccc-ccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCC--ccCCccccccc Confidence 644 4556899999887 689999999999999999985 56778899887533 222333322221 12456778999 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHcCcccCcC-----cccccccccccccccce Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPAS-----WVSPALIPAAVTAGQAV 151 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~-----~~~~~~~~~~~~~~~~~ 151 (305) ++++..||+...++||+|+|+|+.. +|+++|.++|++++++.++..+++|+|+... .++.|+++.+....... T Consensus 90 ~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~ 169 (314) T protein:vir:41 90 TNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDA 169 (314) T ss_pred ceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeec Confidence 9999999999999999999999965 9999999999999999999999999986321 25667776543221111 Q ss_pred eecccchhhhHHHHHHHHHHHHhhhcccc---ceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCc Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVASAGWA---PDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN 221 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~ 221 (305) ...++....+.+.+++..+...++. ..+|+||+.+...++++++.+|+++|++ .++.|+||+.... 
T Consensus 170 ----~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 170 ----EPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPA 245 (314) T ss_pred ----CccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEeccc Confidence 1122334455666777777776653 4579999999999999999999999853 4788999998887 Q ss_pred ccc-CCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 222 GAW-DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 222 ~~~-~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) ++. ..++.+++||||++++++.+..++++..+++ .++++.+.+..|+|..+..+.+.++.....+.. T Consensus 246 ~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a------------~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~ 313 (314) T protein:vir:41 246 LDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDA------------AMRRTEYIASLRADCNYEDENAAVAAVIDMSSG 313 (314) T ss_pred ccccCCCCceEEEechhheEEEeeceeEEeecccC------------cCCeEEEEEEEEeceEEEEcCcEEEEEeeccCC Confidence 764 4578899999999999999888877764432 467899999999999988776665555433222 Q ss_pred c Q lcl|Aclame:pro 301 V 301 (305) Q Consensus 301 v 301 (305) = T Consensus 314 ~ 314 (314) T protein:vir:41 314 G 314 (314) T ss_pred C Confidence 1 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=1.9e-38 Score=227.36 Aligned_cols=274 Identities=12% Similarity=0.115 Sum_probs=200.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeCC----Cceeeeecchhhcccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-MGTKTTHLPVLATL----PEADWVGESATDPKGVKPTSKV 75 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-~~~~~~~~p~~~~~----~~a~~v~E~~~~~~~~~~~~~~ 75 
(305) |. +++.+||+++|++. +++++.+++.++++++|++++ +++....+++.... +...|.+|.. ..+++.+ T Consensus 19 ~t-~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~-----~~~~~~~ 91 (315) T protein:vir:41 19 ID-VPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKL-----APPESTA 91 (315) T ss_pred cC-CcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCcC-----CCCCCcc Confidence 43 45667888888776 679999999999999999865 44445555543211 1233444443 3556778 Q ss_pred cceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCc-C--cccccccccccccccc Q lcl|Aclame:pro 76 TWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPA-S--WVSPALIPAAVTAGQA 150 (305) Q Consensus 76 ~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~-~--~~~~~~~~~~~~~~~~ 150 (305) +|+++++..+|+...+.+|+|+|+|+. ++++++|.+++++++++.++.++++|+|+.. + ..+.|+++.+...... T Consensus 92 ~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~ 171 (315) T protein:vir:41 92 EVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTE 171 (315) T ss_pred ccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccc Confidence 999999999999999999999999986 4999999999999999999999999988532 1 2456766644332211 Q ss_pred eeecccchhhhHHHHHHHHHHHHhhhcccc---ceEEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecC Q lcl|Aclame:pro 151 VEVVGGVANESDIVGATNRAAKAVASAGWA---PDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNR 220 (305) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~ 220 (305) .. ..........+.+.++...+...++. 
.+.|+||+.+...++++||.+|+|+|++ .+++|+||...+ T Consensus 172 ~~--~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~ 249 (315) T protein:vir:41 172 SD--VDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSILYDGRPVQYVP 249 (315) T ss_pred cc--cccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCceecccceEecc Confidence 11 11111122245566677777766653 4679999999999999999999999964 579999999888 Q ss_pred cccc-CCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccc--eEEEec Q lcl|Aclame:pro 221 NGAW-DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSAT--AQGANK 295 (305) Q Consensus 221 ~~~~-~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a--~~~~~~ 295 (305) .++. ..++..++||||++|+++.+.+++++..+++ .++.+.+.+..|+|..+..+.+ +..++. T Consensus 250 ~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a------------~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 250 ALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDA------------EMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred cccccCCCCccEEEecccceEEEeccccEEEeeecC------------CCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 8764 3468899999999999999999998876553 2355778888899987655444 333333 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=7.9e-36 Score=213.02 Aligned_cols=286 Identities=12% Similarity=0.133 Sum_probs=213.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +..+++.++|++||+++.++|++.+.+.++++++++++++.+...++|.....+.+.|+++... 
...+.++++|+++ T Consensus 18 ~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~~---~~~~~~~~~~~~~ 94 (321) T protein:vir:31 18 ALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEGE---WNENESDVSTGTI 94 (321) T ss_pred cccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccccccc---cccccccceeeee Confidence 4444666788999999999999999999999999999999999999999877777778764322 2234567899999 Q ss_pred EeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc---ccccccccccccccceeecc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW---VSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~---~~~~~~~~~~~~~~~~~~~~ 155 (305) ++..+|+...++||+|+|+|+. .+++++|.+.+++++++.++.++|+|+|...+. .+.|.+..+........... T Consensus 95 ~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~ 174 (321) T protein:vir:31 95 DISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAAD 174 (321) T ss_pred eeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccccccccc Confidence 9999999999999999999975 589999999999999999999999999875432 23465554433333322222 Q ss_pred cchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHH-hhccCCceeec-------ccccCccceEecCccccC Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVAN-IRDANGNPVFR-------DDSFAGFRTFFNRNGAWD 225 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~-~kd~~G~~l~~-------~~~l~G~pv~~~~~~~~~ 225 (305) ...+. +.+.++...++..++. ..+|+||+.+...+++ +++. +.++|. 
+.++.|+|++..+++| T Consensus 175 ~~~~~----d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~~~~~~tl~G~pvv~~~~mP-- 247 (321) T protein:vir:31 175 DILDN----DLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIMGEADVNPFSFPIIGSGLWP-- 247 (321) T ss_pred cccCH----HHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhhccccccccceeEEEcCCCC-- Confidence 22332 3445556666665542 3579999999887765 5554 456664 3469999999999887 Q ss_pred CCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc--ccccccC Q lcl|Aclame:pro 226 ADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT--PVAVVAP 303 (305) Q Consensus 226 ~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t--~~a~v~~ 303 (305) ++.++++||++++++.++++++++..+..... .+++.+......++|+.|.++++++.++.- |...+.+ T Consensus 248 --~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~~~~ 318 (321) T protein:vir:31 248 --DDKAMFTDPQNLIYALYRDLEIDVLTESDKVS-------ERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEHLEE 318 (321) T ss_pred --CCcEEEeccccEEEEEeeccEEEEeecCcccc-------ccceeeEeeeeeecceeEeccccEEEEecCCcchhcccC Confidence 56799999999999999999888766542111 122334444566799999999999999965 4445555 Q ss_pred CC Q lcl|Aclame:pro 304 AA 305 (305) Q Consensus 304 a~ 305 (305) .. 
T Consensus 319 ~~ 320 (321) T protein:vir:31 319 ET 320 (321) T ss_pred CC Confidence 55 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=3.4e-35 Score=209.56 Aligned_cols=272 Identities=16% Similarity=0.089 Sum_probs=191.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) +...+...++++.|+.+...+...+...+++++++++.++. ...+|.......+.|+.|+.. +|+++.+|+++ T Consensus 239 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~~~~a~~~~eG~~-----kp~s~~tf~~~ 311 (517) T protein:vir:97 239 AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNALTQGTGHTTGTD-----KTESNITLQTR 311 (517) T ss_pred eecccccccccccchHHHHHHHHhhhhhccceeeeeecccc--ceeeecccccceeeeeecCCc-----ccccccceeeE Confidence 22223344688999999999999999999888887765554 466777777777889998865 78888999999 Q ss_pred EeeeeeEEEeehhhHHHhhcCHHH----HHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDDATVA----VLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~----~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ++.++++++++++|+++++|+.++ +++||.++|+++++++++.+|++|+|++.. ..++++.+..... . .... 
T Consensus 312 ~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~--~~gi~~~a~~~~~-~-~~~~ 387 (517) T protein:vir:97 312 VLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS--ETQIYPVVGDAWA-T-NVTG 387 (517) T ss_pred EeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcc--ccccccccccccc-c-cccc Confidence 999999999999999999998887 999999999999999999999999997543 2344433221111 1 1111 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecccccCccceEe---cCccccCCCCceEEE Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDDSFAGFRTFF---NRNGAWDADAAIEVI 233 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~~l~G~pv~~---~~~~~~~~~~~~~~~ 233 (305) .....+++.. +...+.. ...+.|+||+.++..|+++||++|||||++....+-|... .+.++. ...+...+ T Consensus 388 ~~~~~d~i~~---l~~a~~~--a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~-~~~~~~~~ 461 (517) T protein:vir:97 388 TTNIQELLEK---LSVATPK--AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQS-VAVDEKTA 461 (517) T ss_pred cchHHHHHHH---HHHHhhh--ccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCccccccc-cccCceeE Confidence 2222233333 3222222 2356799999999999999999999999864322222111 111111 11223345 Q ss_pred EehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 234 ADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 234 gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ++++.|+++.+.++.+..+ + .+.+|+..++.++|+++.|..|+++++...+|. 
|.+ T Consensus 462 ~~~~~y~i~~~~g~~~~~~----f--------d~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~--~~~ 517 (517) T protein:vir:97 462 VSLSGYVTNGSRGMEFEQG----T--------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP--VAG 517 (517) T ss_pred eeccccEEEeecceeeeee----e--------ecccCceeEeeeeeeccccccccceEEEEEcCC--CCC Confidence 5678888888887654321 1 134688889999999999999999999988864 333 No 103 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=4.7e-32 Score=192.31 Aligned_cols=258 Identities=14% Similarity=0.092 Sum_probs=203.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV----NMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.++|+.+..++|+.+...+++.+.+.+.+.+++.+- ...+++++||+++..+.+.|++|++. ++.++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~-----i~~~~~~ 75 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEA-----IPMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCc-----ccccccc Confidence 99988888899999999999999999999988887652 23466799999988889999999875 6677889 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ++++++.+++++..+++|+|+..++..++.+++.+++++++++++|+.++..-... . ...+. 
T Consensus 76 ~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a---------------~---~~~~~ 137 (272) T protein:vir:30 76 FKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKS---------------T---QTVEA 137 (272) T ss_pred cceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------c---ccccc Confidence 99999999999999999999999999999999999999999999999998532110 0 01111 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccC-------Cceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDAN-------GNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~-------G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ..++ +.+.++...+...+.....|+|||.++..|++.+..+ |..... -++++|+||+++++++ T Consensus 138 ~~t~----d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p--- 210 (272) T protein:vir:30 138 TATV----DGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCP--- 210 (272) T ss_pred ccCH----HHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCC--- Confidence 2233 3344455566666677789999999999998763211 111111 2479999999999986 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v 301 (305) ++++++.+...+.++.+++++++..++.. 
+....++...|+++.+.+|+++++++.++++-- T Consensus 211 -~~t~~~~~~~a~~~~~~~~~~ve~~r~~~------------~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 211 -KGTAYMVRKGALRIMLKRNTMVETDRDIT------------KAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred -cceEEEEcCCeEEEEecCCceeeeccccc------------cceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 45677778888888888898888766542 345778999999999999999999999877665 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=4.7e-32 Score=192.31 Aligned_cols=258 Identities=14% Similarity=0.092 Sum_probs=203.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV----NMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.++|+.+..++|+.+...+++.+.+.+.+.+++.+- ...+++++||+++..+.+.|++|++. ++.++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~-----i~~~~~~ 75 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEA-----IPMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCc-----ccccccc Confidence 99988888899999999999999999999988887652 23466799999988889999999875 6677889 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ++++++.+++++..+++|+|+..++..++.+++.+++++++++++|+.++..-... . ...+. 
T Consensus 76 ~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a---------------~---~~~~~ 137 (272) T protein:vir:98 76 FKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKS---------------T---QTVEA 137 (272) T ss_pred cceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------c---ccccc Confidence 99999999999999999999999999999999999999999999999998532110 0 01111 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccC-------Cceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDAN-------GNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~-------G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ..++ +.+.++...+...+.....|+|||.++..|++.+..+ |..... -++++|+||+++++++ T Consensus 138 ~~t~----d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p--- 210 (272) T protein:vir:98 138 TATV----DGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCP--- 210 (272) T ss_pred ccCH----HHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCC--- Confidence 2233 3344455566666677789999999999998763211 111111 2479999999999986 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v 301 (305) ++++++.+...+.++.+++++++..++.. 
+....++...|+++.+.+|+++++++.++++-- T Consensus 211 -~~t~~~~~~~a~~~~~~~~~~ve~~r~~~------------~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 211 -KGTAYMVRKGALRIMLKRNTMVETDRDIT------------KAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred -cceEEEEcCCeEEEEecCCceeeeccccc------------cceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 45677778888888888898888766542 345778999999999999999999999877665 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.94 E-value=1.8e-30 Score=183.71 Aligned_cols=260 Identities=14% Similarity=0.037 Sum_probs=158.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) -+.+.....+..+|+.+.+.+.......+++...++.. ..+.....|++|.....+... ..++.+. T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~~~~---~~~~~~~ 275 (480) T protein:vir:40 210 GADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTDKNK---SQTATKR 275 (480) T ss_pred hccccccccccccccchhhheeechhhhhhhhhcceee-----------eccccceeeeeeeeccccccc---ccccccc Confidence 11111122223345555444444444444433333221 223345667777654332211 1123334 Q ss_pred Eee---eeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccc Q lcl|Aclame:pro 81 TLV---AEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) Q Consensus 81 ~~~---~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) ++. .++++...+.|.++++|+. ++++||.++|++.++++++.+|++|+|++.. .+.++..... ..+.. 
T Consensus 276 ~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~-~~~g~~~~~~-------~~~~~ 346 (480) T protein:vir:40 276 SLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSN-GFYGLKTATD-------GWTKQ 346 (480) T ss_pred hhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCcc-ccccceeecc-------ccccc Confidence 443 4688888899999999977 7999999999999999999999999776542 2233322111 11111 Q ss_pred hhhhHHHHHHHHHHHHhhhccccce-EEEEchHHHHHHHHhhccCCceeecc-------cccCccceEecCccccCCCCc Q lcl|Aclame:pro 158 ANESDIVGATNRAAKAVASAGWAPD-TLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWDADAA 229 (305) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~l~~~kd~~G~~l~~~-------~~l~G~pv~~~~~~~~~~~~~ 229 (305) .+.++. +.++..++...+..++ .|+||+.+|..|+++||++|+|||++ .+++|+||++...... .+. T Consensus 347 ~~~~d~---id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~--~~~ 421 (480) T protein:vir:40 347 IEYTDL---FEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMP--KDE 421 (480) T ss_pred chhHHH---HHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeecccc--CCc Confidence 222333 3334445555555555 69999999999999999999999985 4789999887654321 122 Q ss_pred eEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc Q lcl|Aclame:pro 230 IEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV 301 (305) Q Consensus 230 ~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v 301 (305) +.+..+...+++++++ ++. .++. 
.+..|+..++++.|+++.+.+|.+++.++....==| T Consensus 422 ~~~~~~~~~~~~~d~~-~~~--~~~~----------~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 422 VAVYNHDEYVLIGDLN-VEN--YNDF----------DLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred ceeeeCCccEEEEecc-cce--eccc----------ccccchhhhhhhhhhceeeEccccEEEEEeccCcCC Confidence 3333333445667653 222 1111 245788899999999999999999999998744333 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.88 E-value=3.1e-24 Score=149.49 Aligned_cols=258 Identities=12% Similarity=0.070 Sum_probs=188.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNM----GTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.+.|.-.-.++|+.+...+.+.+.+...+.+++..-+. .++++++|++....++.++.|+.. ++..+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~-----i~~~~lt 75 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGE-----ISLDKIG 75 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCc-----cChhhcC Confidence 9998888888899999999999999999888888866442 356899999987777888888865 5666778 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) .++.++..++.+..+.++++...++..++.+.+.++++..+++++|+.++..-... ....+. 
T Consensus 76 ~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~------------------~~~~~~ 137 (272) T protein:vir:36 76 TTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTT------------------SQTVST 137 (272) T ss_pred CcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------------cccccc Confidence 88889999999999999999998998999999999999999999999987432100 001112 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhc------cCCceeecc---cccCccceEecCccccCCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRD------ANGNPVFRD---DSFAGFRTFFNRNGAWDAD 227 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd------~~G~~l~~~---~~l~G~pv~~~~~~~~~~~ 227 (305) ..+++. +.++...+.........+++||..+..|++... ..|..+... ++++|++|++++.+|.+.+ T Consensus 138 ~~~~d~----i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~ 213 (272) T protein:vir:36 138 KANVDG----VQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA 213 (272) T ss_pred cccHHH----HHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCce Confidence 223333 444445555555566789999999999986532 333333222 5799999999999986544 Q ss_pred Cce-EEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 228 AAI-EVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 228 ~~~-~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ... ++++ -..+.++..+++++|..|+.. 
+..-.++...+++..+.+|+++++++.+.+ T Consensus 214 ~~~~~~~~-~gA~~~~~~~~~~vE~~R~~~------------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 214 LMFKIVSN-SPALKLVLKRGVQVETDRDIV------------TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eEEEEEec-ccceeeeecCCcccccccchh------------hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 222 2333 222334556677777655442 223468888999999999999999999987 No 107 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.86 E-value=5e-23 Score=142.84 Aligned_cols=277 Identities=14% Similarity=0.122 Sum_probs=204.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |+++|-++++.+.|......|+|.+.+++.++++.++.++.++.+++++...-+.+.|...++..+ ++...+|.++ T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~r~~n~~~~----~~~~~Tf~q~ 100 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQFLAVGGTIT----AKNPATFTKV 100 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcceeeecccccc----ccCcceeeee Confidence 999999999999999999999999999999999999998989999999999999999987655422 2234579999 Q ss_pred EeeeeeEEEeehhhHHHh--hcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee--ccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVI--DDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV--VGG 156 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell--~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~--~~~ 156 (305) +.+.+.++..+.|.+++. ..+..+...+..+...++++++++.++|||++++.. ..|+....... +.+.. 
.++ T Consensus 101 t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~--F~GL~~~~~~~-q~i~tg~~gg 177 (330) T protein:vir:94 101 TSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNS--FQGMMGLVAAS-QTISAGANGG 177 (330) T ss_pred eechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc--ccchhhcCCcc-cEEecCCCCC Confidence 999999999999999995 445678888889999999999999999999877553 34454443322 22222 233 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc----------cccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD----------DSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~----------~~l~G~pv~~~~~~~~~~ 226 (305) ..+.+ ++..+...+......++.|+||+.....++.+....|++-..+ ..+.|.|++..+.++.+. T Consensus 178 ~~T~d----~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~ 253 (330) T protein:vir:94 178 TLTFE----LLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNM 253 (330) T ss_pred CCCHH----HHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCC Confidence 33433 3344444444444567899999999999999887666543321 256788988888887643 Q ss_pred ------CCceEEEEehh-----hEEEEee----cCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceE Q lcl|Aclame:pro 227 ------DAAIEVIADSS-----RVKIGVR----QDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQ 291 (305) Q Consensus 227 ------~~~~~~~gdf~-----~~~~~~~----~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~ 291 (305) +...|++..|. +.+.|.. .|+.++... .. -++.....|++.|++.++.+|+|+. 
T Consensus 254 ~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G--~~---------~~k~v~~~~v~~y~~~av~~~~a~~ 322 (330) T protein:vir:94 254 TQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVG--AK---------ENADETITRVKMYCGFANFSQLGLA 322 (330) T ss_pred CcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCC--Cc---------cccceeeEEEEEeeeeEEechhhee Confidence 23467777763 4566663 244443211 11 1335577899999999999999999 Q ss_pred EEeccccc Q lcl|Aclame:pro 292 GANKTPVA 299 (305) Q Consensus 292 ~~~~t~~a 299 (305) ++..-..+ T Consensus 323 ~L~~V~~g 330 (330) T protein:vir:94 323 AIKGLIPG 330 (330) T ss_pred eeccccCC Confidence 99985444 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.85 E-value=7.3e-23 Score=141.93 Aligned_cols=260 Identities=15% Similarity=0.091 Sum_probs=194.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNM----GTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-+-.++|+.+...+.+.+.+...+.+++...+- .++++++|++....++.++.|++. 
++..+.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~-----i~~~~it 75 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK-----IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCc-----ccccccc Confidence 9999999899999999999999999999888888766432 355899999986678888888765 5667788 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) +++.++..++.+..+.++++...++..++.+.+.+++++++++++|+.++..-.... .. .... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~---------------~~--~~~~ 138 (274) T protein:vir:93 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LT--VNAD 138 (274) T ss_pred cceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------cc--cccc Confidence 899999999999999999999999888999999999999999999999985322110 00 0011 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhh------ccC-Cceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR------DAN-GNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~k------d~~-G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ...++. +.++...+.........+++||.++..|++.. ++. |..+.. 
-++++|++|++++.+| T Consensus 139 ~~~~d~----i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--- 211 (274) T protein:vir:93 139 ITKLNG----LQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--- 211 (274) T ss_pred ccCHHH----HHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCC--- Confidence 122333 34444455555556678999999999998531 111 222222 2479999999999886 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) .++.++.....+.++.+.++.++..++.. +....++...+++..+.+|.++++++++.+.. +- T Consensus 212 -~~t~~l~~~gai~~~~~~~~~vE~~Rd~~------------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~-~~ 274 (274) T protein:vir:93 212 -AGTAILAKKGAVKLILKRDFFLEVARDAS------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSL-EM 274 (274) T ss_pred -cceEEEEeCCeEEEEecCCcccccccchh------------hcccEEEEEEEEEEEEEcCCceEEEeeCcccc-CC Confidence 45577777777777777888887666543 22357889999999999999999999865433 22 No 109 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.83 E-value=6.2e-22 Score=136.85 Aligned_cols=262 Identities=11% Similarity=0.047 Sum_probs=185.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-+-.++|+.+...+.+.+++...+.+++.... ..++++++|++.....+.++.|++. 
++..+.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~-----i~~~~lt 75 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAA-----IDYSALE 75 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCc-----Ccccccc Confidence 999888888899999999999999999888888875533 2356899999986677788888765 5556778 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCc-ccCcCcccccccccccccccceeecc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT-DKPASWVSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~-g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) +++.++..++.+..+.++++....+..++.+.+.+++++.+++++|+.++..- |... ...... T Consensus 76 ~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~----------------~~~~~~ 139 (278) T protein:vir:80 76 TESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL----------------EVKGAI 139 (278) T ss_pred cceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------cccccc Confidence 88888888898888999999999998999999999999999999999888532 1110 000011 Q ss_pred cchhhhHHHHHHHHHHHHhhhccc-cceEEEEchHHHHHHHHhhc-------cCCceeec---ccccCccceEecCcccc Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGW-APDTLLSSLALRYEVANIRD-------ANGNPVFR---DDSFAGFRTFFNRNGAW 224 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~~kd-------~~G~~l~~---~~~l~G~pv~~~~~~~~ 224 (305) ........++.+.++..++...+. ....+++||..+..|++... ..|..+.+ -+++.|++|++++++|. 
T Consensus 140 t~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~ 219 (278) T protein:vir:80 140 NIGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLAD 219 (278) T ss_pred ccchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCc Confidence 111222233444444444443322 23358899999999986531 11333332 24789999999999873 Q ss_pred CCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 225 DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 225 ~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) ++.++-.-..+-++...+++++..++.. +..-.++...+++..+.||.+++++++...- T Consensus 220 ----~t~~l~~~gAi~~~~~~~~~vE~~Rd~~------------~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 220 ----GNALAVKAGALKTFLKRNLLAESGRDMD------------HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ----ceEEEEeccceeeeecCCcccccccchh------------hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 3444444445546666777776655442 2234678889999999999999999986443 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.83 E-value=5.6e-22 Score=137.11 Aligned_cols=262 Identities=14% Similarity=0.096 Sum_probs=190.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-.-.++|+.+...+.+.+.+...+.+++..-+ ..++++++|++....++.++.|+.. 
++..+.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~-----i~~~~lt 75 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQK-----IPVDKIE 75 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCc-----cCccccc Confidence 998888888889999999999999999999988886543 3567899999987778888888865 5666788 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) .++.....++.+..+.+++|....+..|+.+.+.++++..+++++|+.++.=-.. .. .. .... T Consensus 76 ~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~------------~~---~~--~~~~ 138 (276) T protein:vir:10 76 TNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRG------------TK---LT--VSAD 138 (276) T ss_pred cceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc------------cc---cc--cccc Confidence 8889999999999999999999998889999999999999999999988731100 00 00 0111 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhc------c-CCceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRD------A-NGNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd------~-~G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ..+++ .+.++...+.........++|||..+..|++..+ + .|..... 
-+.++|++|++++.++ T Consensus 139 ~~t~d----~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--- 211 (276) T protein:vir:10 139 IGTLA----GLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLD--- 211 (276) T ss_pred ccCHH----HHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCC--- Confidence 12233 3444444555555567789999999999987532 1 1222222 2478999999999886 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .++.++..-..+-++..+++++|..++... ..-.+++..+++..+.+|.+++++++.+-.. +-+| T Consensus 212 -~~t~~l~~~gAi~~~~~~~~~vE~dRd~~~------------~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~-~~~~ 276 (276) T protein:vir:10 212 -EGEAILAKRGAVKLITKRDFFLETDRDPST------------KTTALYSDKHYVAYLYDESKAVKVTKGAGTT-DSGA 276 (276) T ss_pred -cceEEEEeccceeeeecCCceeecccchhh------------cccEEEEeeEEEEEEEcCcceEEEecCCcCC-cCCC Confidence 344444444455566677888887766531 2345788889999999999999999776322 2222 No 111 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.82 E-value=2.6e-21 Score=133.46 Aligned_cols=260 Identities=13% Similarity=0.084 Sum_probs=189.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-.-.++|+.+...+.+.+.+...+.+++...+ ..++++++|++.....+..+.|+.. 
++..+.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~-----i~~~~it 75 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEK-----IPVDQIG 75 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCc-----Cchhhcc Confidence 999888888899999999999999998888888776532 1366899999975566666777654 5566778 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) +...++..++.+..+.++++....+..++.+.+.++++..+++++|+.++.--.... .. .... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~------------~~-----~~~~ 138 (274) T protein:vir:96 76 TSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT------------LT-----VEAD 138 (274) T ss_pred cceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------------CC-----cCcc Confidence 888888889988889999999988888999999999999999999998885321100 00 0011 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhh------cc-CCceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR------DA-NGNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~k------d~-~G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ..+ ++.+.++...+.........++|||..+..|++.. ++ .|..+.+ -++++|++|++++++|. 
T Consensus 139 ~~~----~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~-- 212 (274) T protein:vir:96 139 ITK----LDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNK-- 212 (274) T ss_pred ccc----HHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCCc-- Confidence 112 33444555555555556778999999999998753 11 1222222 24689999999999874 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~ 302 (305) ++.++.....+.++.+.++.++..++.. +..-.++...++|..+.||.+++++++..+--|- T Consensus 213 --~t~~l~~~gA~~~~~~~~~~vE~~Rd~~------------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 213 --GEALLAKKGAVKLITKRDFFLEKDRDAS------------RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred --ceEEEEeCcceeeeecCCcccccccchh------------hcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 3445445555556667777776555442 2234678889999999999999999998666655 No 112 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.81 E-value=2.9e-21 Score=133.16 Aligned_cols=260 Identities=14% Similarity=0.070 Sum_probs=186.1 Q ss_pred CCCcc-CCccceEccHHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCCceeeeecchhhcccccccccc Q lcl|Aclame:pro 1 MADIS-RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNM----GTKTTHLPVLATLPEADWVGESATDPKGVKPTSKV 75 (305) Q Consensus 1 Ma~~t-~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~ 75 (305) ||..+ |.-.-.++|+.+...+.+.+.+...+.+++.+-+. .++++++|++....++.++.|+.. ++..+. 
T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~-----i~~~~l 75 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEE-----IPIDLI 75 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCC-----cchhhc Confidence 66544 33345677999999999999999999888866443 366899999987677777888765 556677 Q ss_pred cceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecc Q lcl|Aclame:pro 76 TWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 76 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) +.++.++..++.+..+.++++....+..|+.+.+.++++..+++++|+.++.--++.. . +... T Consensus 76 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~---------------~--~~~~ 138 (275) T protein:vir:96 76 ETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT---------------L--KVEA 138 (275) T ss_pred ccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------c--cccc Confidence 8888888999999999999999888888899999999999999999999884221110 0 0011 Q ss_pred cchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhh-------ccCCceeecc---cccCccceEecCccccC Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR-------DANGNPVFRD---DSFAGFRTFFNRNGAWD 225 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~k-------d~~G~~l~~~---~~l~G~pv~~~~~~~~~ 225 (305) ...+++ .+.++...+.........+++||..+..|++.. +..|..+.+. ++++|++|++++.++. 
T Consensus 139 ~~~~~d----~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~- 213 (275) T protein:vir:96 139 DITKLA----GLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKE- 213 (275) T ss_pred cccCHH----HHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCc- Confidence 112333 344444555445556678999999999998752 1223333332 4689999999998863 Q ss_pred CCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc Q lcl|Aclame:pro 226 ADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) Q Consensus 226 ~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~ 302 (305) ++.++..-..+.++.+.++++|..|+.. +..-.++...+++..+.+|.++++++++|++.=- T Consensus 214 ---~t~~i~~~gA~~~~~~~~~~vE~~Rd~~------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 214 ---GEAILAKRGAVKLITKRDFFLETERHAS------------HKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred ---ceEEEEeccceeeeecCCcccccccchh------------hcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 3334333444556667777777666553 2235678888999999999999999999876633 No 113 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.80 E-value=2e-21 Score=134.06 Aligned_cols=292 Identities=16% Similarity=0.141 Sum_probs=187.1 Q ss_pred CC------------CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhcc Q lcl|Aclame:pro 1 MA------------DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPK 67 (305) Q Consensus 1 Ma------------~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~ 67 (305) |+ .+|+.++..+||+.+++-+.+.++.-....++...+... +.+..+|-. 
+.-.+.-|+||++.++ T Consensus 59 m~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~-g~~Ra~~IgEGgE~~~ 137 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSI-GIMRAYDVAEGQEIPE 137 (393) T ss_pred hcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccch-heeeeccccccccccc Confidence 11 156677899999999999999999988888888888885 445555543 4567788999988776 Q ss_pred cccccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccc Q lcl|Aclame:pro 68 GVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTA 147 (305) Q Consensus 68 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 147 (305) .. .+..++++++++..|.|..+.+|+|+++||.+|+.++....+.++++++.++.++++.-+.....-.++.+ .... T Consensus 138 ~s--ld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st-~t~a 214 (393) T protein:vir:79 138 DS--IDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYST-NKLA 214 (393) T ss_pred cc--hhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeecccc-Cccc Confidence 44 34468899999999999999999999999999999999999999999999999999875433211111111 1111 Q ss_pred ccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhh---ccCCcee-------e------ccccc Q lcl|Aclame:pro 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR---DANGNPV-------F------RDDSF 211 (305) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~k---d~~G~~l-------~------~~~~l 211 (305) ..+.-+-.+..+..-..+++.++..++++.++.++.++|||..|+.+.+-. ...-.+. 
+ .|+.+ T Consensus 215 hptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i 294 (393) T protein:vir:79 215 HTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSI 294 (393) T ss_pred eeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhh Confidence 111111112333334456677777888899999999999999999887642 1111111 1 12233 Q ss_pred Cc-----cceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeec Q lcl|Aclame:pro 212 AG-----FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGV 286 (305) Q Consensus 212 ~G-----~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~ 286 (305) .| +.|+++..+|.+.....+ +|+..++..+.|-.-++ .+.. ..++...++.+.|+...|+|++|+| T Consensus 295 ~~~~~~nlnv~~sPfvp~d~k~~rF------d~~~Vd~NnvgvlLV~D-~i~t--dq~ddk~rdiq~iKl~ERYG~gvLn 365 (393) T protein:vir:79 295 QGRLPFNFNVNLSPFIPLDKKSRRF------DVYAVDRNNVGVLLVRD-DLKT--DQWDEKARGLQNIKMIERYGIGILN 365 (393) T ss_pred ccccccceeEEEeccccccccccee------eEEEeecCCceEEEEec-Ccce--eccccccccceeeeeeeeeceeeee Confidence 33 357788888876543332 23333333333222111 1111 1222345688899999999999988 Q ss_pred cc-ceEEEeccccccc--cCCC Q lcl|Aclame:pro 287 SA-TAQGANKTPVAVV--APAA 305 (305) Q Consensus 287 p~-a~~~~~~t~~a~v--~~a~ 305 (305) .. 
+++..+.-..+-- .|-- T Consensus 366 ~gkaiavakNI~~~k~y~~P~~ 387 (393) T protein:vir:79 366 EGKAIAVAKNISMDKSYAEPML 387 (393) T ss_pred CCceEEEEecceeecccccchh Confidence 53 2222221111111 2222 No 114 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.80 E-value=1.5e-20 Score=129.26 Aligned_cols=260 Identities=15% Similarity=0.092 Sum_probs=190.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-.-.++|+.+...+.+.+.+...+.+++..-+ .+++++++|++....++..+.|+.. ++..+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~-----i~~~~lt 75 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK-----IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCc-----ccccccc Confidence 999888888999999999999999988887778876533 2366899999976566777777764 5566778 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) .++.++..++.+..+.++++....+..++.+.+.+++++++++++|+.++.--.+.. ... ... 
T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~---------------~~~--~~~ 138 (274) T protein:vir:97 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LTV--NAD 138 (274) T ss_pred cceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC---------------ccc--ccc Confidence 888899999999899999999988888999999999999999999999884321110 000 011 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hcc-CCceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDA-NGNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~-~G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ..++ +.+.++...+........+++|||..+..|++. +.+ .|..+.. -++++|++|++++.+| T Consensus 139 ~~~~----d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--- 211 (274) T protein:vir:97 139 ITKL----NGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--- 211 (274) T ss_pred ccCH----HHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCC--- Confidence 1223 334445555555555677899999999999863 111 2333333 2478999999999987 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~ 302 (305) .++.++.....+.++.+.++.++..++... 
..-.++...+++..+.+|.++++++++.+..-- T Consensus 212 -~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~------------~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 212 -AGTAILAKKGAVKLILKRDFFLEVARDAST------------KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred -cceEEEEeCcceEeeecCCceeccccchhh------------cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 345566666666677777888877665531 223577888999999999999999987543322 No 115 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.80 E-value=1.5e-20 Score=129.26 Aligned_cols=260 Identities=15% Similarity=0.092 Sum_probs=190.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-.-.++|+.+...+.+.+.+...+.+++..-+ .+++++++|++....++..+.|+.. ++..+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~-----i~~~~lt 75 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK-----IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCc-----ccccccc Confidence 999888888999999999999999988887778876533 2366899999976566777777764 5566778 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) .++.++..++.+..+.++++....+..++.+.+.+++++++++++|+.++.--.+.. ... ... 
T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~---------------~~~--~~~ 138 (274) T protein:vir:94 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------------LTV--NAD 138 (274) T ss_pred cceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC---------------ccc--ccc Confidence 888899999999899999999988888999999999999999999999884321110 000 011 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hcc-CCceeec---ccccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDA-NGNPVFR---DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~-~G~~l~~---~~~l~G~pv~~~~~~~~~~ 226 (305) ..++ +.+.++...+........+++|||..+..|++. +.+ .|..+.. -++++|++|++++.+| T Consensus 139 ~~~~----d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--- 211 (274) T protein:vir:94 139 ITKL----NGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLE--- 211 (274) T ss_pred ccCH----HHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCC--- Confidence 1223 334445555555555677899999999999863 111 2333333 2478999999999987 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~ 302 (305) .++.++.....+.++.+.++.++..++... 
..-.++...+++..+.+|.++++++++.+..-- T Consensus 212 -~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~------------~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 212 -AGTAILAKKGAVKLILKRDFFLEVARDAST------------KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred -cceEEEEeCcceEeeecCCceeccccchhh------------cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 345566666666677777888877665531 223577888999999999999999987543322 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.75 E-value=1.5e-19 Score=123.85 Aligned_cols=260 Identities=15% Similarity=0.082 Sum_probs=185.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.-.-.++|+.+...+.+.+.+...+.+++..-. ..++++++|++....++..+.|+.. ++..+.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~-----i~~~~lt 75 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEK-----IPTDILE 75 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCc-----cchhhcc Confidence 999888888899999999999999988877777776532 2466899999976667777777764 4556677 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ..+..+..++.+..+.++++....+..|+.+.+.++++..+++++|+.++.--.+... +.... 
T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~-----------------~~~~~ 138 (274) T protein:vir:12 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------------TVNAD 138 (274) T ss_pred cceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------------ccccc Confidence 8888888899999999999988888888999999999999999999988853221100 00111 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hccC-Cceeecc---cccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDAN-GNPVFRD---DSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~~-G~~l~~~---~~l~G~pv~~~~~~~~~~ 226 (305) ..++ +.+.++...+........+++|||..+..|++. ++++ |..+.++ ++++|++|++++.+|..+ T Consensus 139 a~~~----d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t 214 (274) T protein:vir:12 139 ITKL----NGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGT 214 (274) T ss_pred ccCH----HHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCCcce Confidence 1223 334444555554445667899999999999864 1222 3333332 468999999999987432 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) .++|| ...+.++...++++|..++... ..-.++...+++..+.||.+++++++..+.. 
+- T Consensus 215 ---~~l~~-~gA~~~~~~~~~~vE~~Rd~~~------------~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~-~~ 274 (274) T protein:vir:12 215 ---AILAK-KGAVKLILKRDFFLEVARDAST------------KTTALYSDKHYVAYLYDESKAVKITKGSGSL-EM 274 (274) T ss_pred ---EEEEe-ccceeeeecCCceeccccchhh------------cccEEEeeeEEEEEEEcCCceEEEEcCCccc-cC Confidence 23444 3444455667778877666531 2236788899999999999999999754332 22 No 117 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.75 E-value=9.6e-20 Score=124.85 Aligned_cols=259 Identities=10% Similarity=0.027 Sum_probs=185.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNM----GTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.+.-++ .++|+.+...+.+++.+...+.+++..-+. .+..+++|.+.-..++.-+.|++. ++..+.+ T Consensus 1 Ma~T~~~d--~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~-----i~~~~lt 73 (270) T protein:vir:95 1 MTQTKKAN--LINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVA-----MDTTQMS 73 (270) T ss_pred CCceehhh--hcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCc-----cchhhcc Confidence 99877654 478999999999999998888888876332 466899999986667766777764 5566778 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) .++.....++.+..+.++++....+..|....+.++++..+++++|+.++.-- .+ +. . ..+. 
T Consensus 74 ~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l--------~~----a~---~---~~~~ 135 (270) T protein:vir:95 74 MTTTKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAEL--------NK----SK---Q---TATV 135 (270) T ss_pred cchheeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHh--------cc----cc---c---cccc Confidence 88888889999999999999888877788999999999999999999887310 00 00 0 0111 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhc---c-CCceee---cccccCccceEecCccccCCCCc Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRD---A-NGNPVF---RDDSFAGFRTFFNRNGAWDADAA 229 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd---~-~G~~l~---~~~~l~G~pv~~~~~~~~~~~~~ 229 (305) ..+.+ .+.++...+.........++|||..+..|++... . .|.-+. .-+.++|++|++++..+. ++ T Consensus 136 ~~t~~----~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~---~~ 208 (270) T protein:vir:95 136 SADAT----GILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRVS---EN 208 (270) T ss_pred ccCHH----HHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCCC---ce Confidence 22333 3344445555566667789999999999986431 1 111111 125789999999887653 44 Q ss_pred eEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 230 IEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 230 ~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..++-....+-++..+++.+|..|+... 
....++...+++..+.+|..+++++..|++...- T Consensus 209 ~~~l~~~gAi~~~~~~~~~vEtdRd~~~------------~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 209 TAFLQRYGAMEIVNKKKPEAYTDFDILK------------RTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred eEEEEeccceeeeecCCceeeeccchhh------------cccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 4444444455567777888887766532 2335677788999999999999999988776655 No 118 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.74 E-value=7.3e-19 Score=120.01 Aligned_cols=278 Identities=14% Similarity=0.071 Sum_probs=190.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeE Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v 80 (305) |..+|-++.+.+.+..+...|+|.+.+.+.|+++.++.++.++.+.+.+....+.+.+.+..........+.+..+|+++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 99999999999999999999999999999999999999999999999998877776665433333222345567889999 Q ss_pred EeeeeeEEEeehhhHHHhhc--C-HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee-ccc Q lcl|Aclame:pro 81 TLVAEEIAVIIPVHENVIDD--A-TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV-VGG 156 (305) Q Consensus 81 ~~~~~k~~~~~~is~ell~d--s-~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~-~~~ 156 (305) +...+-+++.+.|.+.+.+- + ..+...+-.++..+++.++.+..+|||+.+.+++ .|++............ 
.++ T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F--~GL~~~~~~~q~i~~~~~gg 158 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEF--AGLIQLCASGQKATTGATGS 158 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcc--cchhhcCCccceeecCCCCC Confidence 99999999999999865442 2 4455555567788999999999999999876543 3444443332222111 223 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh-h------------ccCCceeecccccCccceEecCccc Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-R------------DANGNPVFRDDSFAGFRTFFNRNGA 223 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~-k------------d~~G~~l~~~~~l~G~pv~~~~~~~ 223 (305) ..+. +++..+...+......+..++|||++..+++.+ + +..|+++ ..+.|.|++..+.+| T Consensus 159 ~~t~----d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v---~~~~GiPi~~~d~ip 231 (310) T protein:vir:97 159 AISF----AILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEV---PAYSGTPIFRNDYIP 231 (310) T ss_pred CCCH----HHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEE---eeeCCeEEEEeCccC Confidence 3333 334444444444445677899999876666533 2 2233333 378899999999887 Q ss_pred cC------CCCceEEEEehh-----hEEEEee----cCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeeccc Q lcl|Aclame:pro 224 WD------ADAAIEVIADSS-----RVKIGVR----QDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSA 288 (305) Q Consensus 224 ~~------~~~~~~~~gdf~-----~~~~~~~----~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~ 288 (305) .+ .+...|++.-|. +.++|.. .++.++.. 
..+ =+......|++.+++.++.+|+ T Consensus 232 ~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~--G~~---------~~~~v~~~~V~~Y~~~av~~~~ 300 (310) T protein:vir:97 232 TNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDV--GES---------EDSDEHIWRVKWYCGLALFSEK 300 (310) T ss_pred CCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeC--Ccc---------cCCcceeEEEEEeeeEEEeccc Confidence 64 234456666554 3455542 23433321 111 1345677899999999999999 Q ss_pred ceEEEecccc Q lcl|Aclame:pro 289 TAQGANKTPV 298 (305) Q Consensus 289 a~~~~~~t~~ 298 (305) |+..+..-.- T Consensus 301 A~a~L~~V~~ 310 (310) T protein:vir:97 301 GLACADGITN 310 (310) T ss_pred ceeeeccccC Confidence 9999987543 No 119 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.73 E-value=9e-19 Score=119.51 Aligned_cols=260 Identities=15% Similarity=0.083 Sum_probs=183.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.=.-.++|+.+...+.+.+.+...+.+++..-+ ..++++++|++....++..+.|+.. 
++..+.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~-----i~~~~lt 75 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEK-----IPTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCc-----cchhhcc Confidence 999887777888899999999999988888878765433 2467999999976666777777754 4555677 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ....++..++.+..+.++++....+..++.+.+.++++..+++++|+.++.--.+.. ..+ ... T Consensus 76 ~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~---------------~~~--~~~ 138 (274) T protein:vir:95 76 TKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK---------------LTV--EAD 138 (274) T ss_pred cceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------ccc--ccc Confidence 788888888988899999998888888999999999999999999998884221110 000 011 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hccC-Cceeecc---cccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDAN-GNPVFRD---DSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~~-G~~l~~~---~~l~G~pv~~~~~~~~~~ 226 (305) ..+++ .+.++...+........+++|||..+..|++. ++++ |..+.++ +++.|++|++++.++.. 
T Consensus 139 ~~~~d----~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~- 213 (274) T protein:vir:95 139 ITKLT----GLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG- 213 (274) T ss_pred ccCHH----HHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCc- Confidence 12233 34444455544445667899999999999874 1222 2333332 46899999999987632 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..+++| ...+.++...++++|..++.. +..-.++...++++.+.||.+++++++ +++..+- T Consensus 214 --t~~l~~-~gA~~~~~~~~~~vE~~Rd~~------------~~~d~i~~~~~y~~~~~~~~~~v~~tk-~~~~~~~ 274 (274) T protein:vir:95 214 --TAILAK-KGAVKLITKRDFFLETDRDPS------------TKTTALYSDKHYVAYLYDESKAVKITK-GSGSLEM 274 (274) T ss_pred --eEEEEe-ccceeeeecCCcccccccccc------------cccCEEEEeEEEEEEEEcCCcEEEEEc-CCccccC Confidence 224444 334445566777777666553 233457888999999999999999994 3344444 No 120 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.73 E-value=9e-19 Score=119.51 Aligned_cols=260 Identities=15% Similarity=0.083 Sum_probs=183.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+|.=.-.++|+.+...+.+.+.+...+.+++..-+ ..++++++|++....++..+.|+.. 
++..+.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~-----i~~~~lt 75 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEK-----IPTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCc-----cchhhcc Confidence 999887777888899999999999988888878765433 2467999999976666777777754 4555677 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) ....++..++.+..+.++++....+..++.+.+.++++..+++++|+.++.--.+.. ..+ ... T Consensus 76 ~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~---------------~~~--~~~ 138 (274) T protein:vir:96 76 TKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK---------------LTV--EAD 138 (274) T ss_pred cceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------------ccc--ccc Confidence 788888888988899999998888888999999999999999999998884221110 000 011 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hccC-Cceeecc---cccCccceEecCccccCC Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDAN-GNPVFRD---DSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~~-G~~l~~~---~~l~G~pv~~~~~~~~~~ 226 (305) ..+++ .+.++...+........+++|||..+..|++. ++++ |..+.++ +++.|++|++++.++.. 
T Consensus 139 ~~~~d----~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~- 213 (274) T protein:vir:96 139 ITKLT----GLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG- 213 (274) T ss_pred ccCHH----HHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCc- Confidence 12233 34444455544445667899999999999874 1222 2333332 46899999999987632 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccC Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~ 303 (305) ..+++| ...+.++...++++|..++.. +..-.++...++++.+.||.+++++++ +++..+- T Consensus 214 --t~~l~~-~gA~~~~~~~~~~vE~~Rd~~------------~~~d~i~~~~~y~~~~~~~~~~v~~tk-~~~~~~~ 274 (274) T protein:vir:96 214 --TAILAK-KGAVKLITKRDFFLETDRDPS------------TKTTALYSDKHYVAYLYDESKAVKITK-GSGSLEM 274 (274) T ss_pred --eEEEEe-ccceeeeecCCcccccccccc------------cccCEEEEeEEEEEEEEcCCcEEEEEc-CCccccC Confidence 224444 334445566777777666553 233457888999999999999999994 3344444 No 121 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.57 E-value=1e-16 Score=108.20 Aligned_cols=222 Identities=12% Similarity=0.094 Sum_probs=154.9 Q ss_pred cceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHH Q lcl|Aclame:pro 35 FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGG 114 (305) Q Consensus 35 ~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la 114 (305) -+-++ .++++++|.+.+ ++.-+.|+.. 
++..++++++.+...++.+..++|++|....+..|......++++ T Consensus 1 ~~~~~-~Gdtit~P~~iG--da~~v~eG~~-----i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~ 72 (231) T protein:vir:73 1 ENGIN-LANLCEYPNDIG--DAADVAEGGE-----ISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) T ss_pred Ccccc-CCceEEeccccc--chhhhcCCCc-----CChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHH Confidence 11223 255899997743 4555777765 666778899999999999999999999988888899999999999 Q ss_pred HHHHHHHHHHHHcCcccCcCcccccccccccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHH Q lcl|Aclame:pro 115 QAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEV 194 (305) Q Consensus 115 ~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l 194 (305) .++++++|..++.--.. +. . ...+..+. +.+.++...+.........++|||..+..| T Consensus 73 ~~iA~kvD~di~~~~~~------------a~---l---~~~~~~t~----d~i~~A~~~fgde~~~~~vivv~p~~~~~L 130 (231) T protein:vir:73 73 LSLANKVDDDLLKAAKT------------TS---Q---TVSTKANV----DGVQAALDIFNDEDAQAYVLIVNPKDAAKI 130 (231) T ss_pred HHHHHhhhHHHHHhhcc------------cc---c---cccccccH----HHHHHHHHHhccccccceEEEEcchHHHhh Confidence 99999999998831100 00 0 11122333 334444555555555667899999999999 Q ss_pred HHhhc------cCCceeecc---cccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceee Q lcl|Aclame:pro 195 ANIRD------ANGNPVFRD---DSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQIN 265 (305) Q Consensus 195 ~~~kd------~~G~~l~~~---~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~ 265 (305) |+..+ ..|..++.. +.+.|++|++++.++.+.+-..-++.-...+.+...++++++..|+.. 
T Consensus 131 rk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~--------- 201 (231) T protein:vir:73 131 RKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--------- 201 (231) T ss_pred hhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecccccc--------- Confidence 98443 223333332 578999999999988543322223333344556777888888776653 Q ss_pred eeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 266 LAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 266 ~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) +....+++...++..+.+|..+++++.+.+ T Consensus 202 ---~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 202 ---TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred ---ccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 234567788899999999999999999987 No 122 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.50 E-value=5e-15 Score=98.99 Aligned_cols=283 Identities=12% Similarity=0.088 Sum_probs=171.5 Q ss_pred CCCccCC-ccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceee-eecchh-hcccccccccccc Q lcl|Aclame:pro 1 MADISRA-EVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADW-VGESAT-DPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~~t~~-~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~-v~E~~~-~~~~~~~~~~~~f 77 (305) =++++.+ -++.+++++...+|++.+++.++++++++++++.+.+..+++...+..... -.|... ....+ .+. 
T Consensus 20 k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~~~~~-----~~~ 94 (360) T protein:vir:99 20 QKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRTENSE-----AES 94 (360) T ss_pred hhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccccccCCCCCcCCc-----Ccc Confidence 1112212 257889999999999999999999999999999998888887654322111 112211 11111 222 Q ss_pred eeEEee-eeeEEEeehhhHHHhhcCH----HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcC-----------ccccccc Q lcl|Aclame:pro 78 ANRTLV-AEEIAVIIPVHENVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIFGTDKPAS-----------WVSPALI 141 (305) Q Consensus 78 ~~v~~~-~~k~~~~~~is~ell~ds~----~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~-----------~~~~~~~ 141 (305) ..+.+. ..++-..+.++.+-+++.. ..+++.|.+.|++++++.++.-.++|+.+... ....|.+ T Consensus 95 ~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwl 174 (360) T protein:vir:99 95 GSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWI 174 (360) T ss_pred ccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHH Confidence 333332 2355566677777766643 35779999999999999999999999754221 1111222 Q ss_pred ccccccccceeecc-------------------------cchh-hhHHHHHHHHHHHHhhhcccc----ceEEEEchHHH Q lcl|Aclame:pro 142 PAAVTAGQAVEVVG-------------------------GVAN-ESDIVGATNRAAKAVASAGWA----PDTLLSSLALR 191 (305) Q Consensus 142 ~~~~~~~~~~~~~~-------------------------~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~~v~~~~~~ 191 (305) ..+....+.+..+. +... .......+..+...++..++. .-.|+|++... 
T Consensus 175 Kka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~ 254 (360) T protein:vir:99 175 ARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQV 254 (360) T ss_pred HHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchH Confidence 22211110000000 0000 111233455677777777654 22799999876 Q ss_pred HHHHHhh-c---cCCceeec---ccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCccee Q lcl|Aclame:pro 192 YEVANIR-D---ANGNPVFR---DDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQI 264 (305) Q Consensus 192 ~~l~~~k-d---~~G~~l~~---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~ 264 (305) ...+..- + .-|...+. .-...|+|++....+| ++.++|-++++++++.+++++++.+.+... T Consensus 255 ~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~p----d~~~mlT~p~NLi~g~~~~iri~~~~e~~~------- 323 (360) T protein:vir:99 255 QSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFP----DEYMMFTDPNNLAFGLYEEMELDQSTDTDK------- 323 (360) T ss_pred HHHHHHHhccCcccchhheecccccccceeeeEEcCCCC----CCceEEeccCceeEEeeeeeEEeecccchh------- Confidence 6554432 2 22332222 2246799998777765 567999999999999999999987655321 Q ss_pred eeeecCcEEEE--EEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 265 NLAERDMVALR--LKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 265 ~~~~~~~~~~r--~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .......+| ....+|+.+.+++|++.++.-+. 
|-| T Consensus 324 --~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~----~~~ 360 (360) T protein:vir:99 324 --VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLET----PTA 360 (360) T ss_pred --hhhhceeeeEEEEEEeeEEEEecccEEEEecCCC----CCC Confidence 111222222 45679999999999999998754 333 No 123 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.38 E-value=1.1e-13 Score=91.65 Aligned_cols=255 Identities=15% Similarity=0.085 Sum_probs=151.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcce----eecCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQN----VNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.. .++|+.+..++++.+++..++.+++.. +...++++++|+......+....++... +..+.+ T Consensus 1 MA~~------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~-----~~~~~~ 69 (273) T protein:vir:79 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT-----SADAIS 69 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCcc-----Cccccc Confidence 8873 368999999999999999988888643 3334668999997655555556565542 233445 Q ss_pred ceeEEeeeee-EEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCc-ccCcCcccccccccccccccceeec Q lcl|Aclame:pro 77 WANRTLVAEE-IAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT-DKPASWVSPALIPAAVTAGQAVEVV 154 (305) Q Consensus 77 f~~v~~~~~k-~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~-g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (305) ..++++...| .+.-+.|+++-...+..++.+ +.+++++++++++|+-++.=- +.+.. .... 
T Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~--------------~~~~-- 132 (273) T protein:vir:79 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------------LTGS-- 132 (273) T ss_pred cceEEEEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------------cccc-- Confidence 5666666655 244456776333344557876 557788999999998665210 00000 0000 Q ss_pred ccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhh------ccCCc-eeec---ccccCccceEecCcc Q lcl|Aclame:pro 155 GGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR------DANGN-PVFR---DDSFAGFRTFFNRNG 222 (305) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~k------d~~G~-~l~~---~~~l~G~pv~~~~~~ 222 (305) ...+....++.+..+...+...+. ..-.++++|..+..|.+.. +..|. -.++ -.++.|++++.++++ T Consensus 133 -~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~l 211 (273) T protein:vir:79 133 -APSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred -cccchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccc Confidence 111122334455555555544432 2236889999998886532 12222 1222 257999999999999 Q ss_pred ccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 223 AWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 223 ~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) |...+. ..+.+--+.+.+.. +...++..++.. 
.| -..++....+|..+.||++++.++.+.+ T Consensus 212 p~~~~~-~~~a~~~~A~~~a~-~~~~~e~~r~~~---------~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 212 RDTDDE-QFVAFHPSAAAYVS-QIDTVEALRDQD---------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cccCce-EEEEEeccceeeee-ehhhhhcccCcc---------cc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 865432 23333222222221 112232222111 12 3457888999999999999999999887 No 124 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.37 E-value=5.1e-14 Score=93.47 Aligned_cols=280 Identities=15% Similarity=0.063 Sum_probs=158.8 Q ss_pred CCC----ccCCccce-----Ec--cHHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeC---CCceeeeecchhh Q lcl|Aclame:pro 1 MAD----ISRAEVAS-----LI--QEAYSDTLLAAAKQGSTVLSAFQNVNM-GTKTTHLPVLAT---LPEADWVGESATD 65 (305) Q Consensus 1 Ma~----~t~~~gg~-----li--p~~~~~~i~~~~~~~~~l~~l~~~~~~-~~~~~~~p~~~~---~~~a~~v~E~~~~ 65 (305) |.. ++..+|+. ++ |+.+-+.+.+.+...-+.-.+.+.... .++.+.+-.... ..++.-|.|+++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggE- 79 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGE- 79 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCccc- Confidence 542 22333432 22 666667777777666555555555433 355555543221 234555677765 Q ss_pred cccccccccccceeEEe-eeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccc Q lcl|Aclame:pro 66 PKGVKPTSKVTWANRTL-VAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAA 144 (305) Q Consensus 66 ~~~~~~~~~~~f~~v~~-~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~ 144 (305) +|++...++...+ ..+|.|..++||+|++..+..+..+-...++++.|+++.|...+.---++.. +. ++.. 
T Consensus 80 ----iP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t-~~---~~~s 151 (318) T protein:vir:10 80 ----IPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIV-PT---LAVP 151 (318) T ss_pred ----ccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cc---ccCC Confidence 6667777766665 4569999999999999999999999999999999999999877742111100 00 0000 Q ss_pred cccccce-eecccchhhhHHHH---HHHHH--HHHhhhccccceEEEEchHHHHHHHHhhc------cCCceeec----- Q lcl|Aclame:pro 145 VTAGQAV-EVVGGVANESDIVG---ATNRA--AKAVASAGWAPDTLLSSLALRYEVANIRD------ANGNPVFR----- 207 (305) Q Consensus 145 ~~~~~~~-~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~~v~~~~~~~~l~~~kd------~~G~~l~~----- 207 (305) ....... ...+.....+.... +.+.+ ...-.+.+|.++.++|||..+..|++-++ .++.+++. T Consensus 152 ~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~t 231 (318) T protein:vir:10 152 TAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWT 231 (318) T ss_pred cCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccc Confidence 0000000 00000000000000 11100 01113567899999999999999965433 23333321 Q ss_pred ---ccccCccceEecCccccCCCCceEEEEehhhE-EEEeecCcEEEEeecceeccCcceeeeeec-CcEEEEEEEEEcc Q lcl|Aclame:pro 208 ---DDSFAGFRTFFNRNGAWDADAAIEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAER-DMVALRLKARFAY 282 (305) Q Consensus 208 ---~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~-~~~~~r~~~r~~~ 282 (305) +..++|+.++.+.++|. +++++.+-... .+.+.+.++.+-.. ..++.+ .... ....+|+..+-.. 
T Consensus 232 g~~~g~~lGl~vi~s~~~p~----~~alvlq~g~vG~~~d~~pl~~t~~~----~egg~~--~g~~~~s~~~~~~~~~~~ 301 (318) T protein:vir:10 232 GNFPGSVMGLNVIRSRTFPI----DRVLIMERGTVGFYSDTRPLQFTALY----PEGNGP--NGGPTESYRADASHKRAL 301 (318) T ss_pred ccccceeeceEEeecCccCC----CeeEEEecCCcceeeccccceeeecc----cCCCCC--CCCcchhhheehheeeee Confidence 34578999999999874 33555554332 34444555443322 111111 1222 2455688888889 Q ss_pred EeecccceEEEeccccccccC Q lcl|Aclame:pro 283 VLGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 283 ~v~~p~a~~~~~~t~~a~v~~ 303 (305) .|.+|+|+++||+- ++| T Consensus 302 ~V~~PkA~~~itgi----~~~ 318 (318) T protein:vir:10 302 AVDQPKAALWLTGI----VTP 318 (318) T ss_pred eeeCcceeEEEeec----cCC Confidence 99999999999984 566 No 125 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.35 E-value=3.3e-13 Score=89.02 Aligned_cols=255 Identities=14% Similarity=0.072 Sum_probs=150.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcce----eecCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQN----VNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.. .++|+.+..++++.+++.+++..++.. ....++++++|+......+....++... 
+..+.+ T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-----~~~~~~ 69 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT-----SADAIS 69 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCcc-----Cccccc Confidence 8872 468999999999999999988888743 1223668999997655555555454432 222334 Q ss_pred ceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCc-ccCcCcccccccccccccccceeec Q lcl|Aclame:pro 77 WANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT-DKPASWVSPALIPAAVTAGQAVEVV 154 (305) Q Consensus 77 f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~-g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (305) -.++++...+. +.-+.|++.-...+..++.+ +.++++++++.++|.-++.=- +.+. . .. . T Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~------------~--~~---~ 131 (273) T protein:vir:10 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT------------A--LT---G 131 (273) T ss_pred cceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc------------c--cc---c Confidence 45555554432 33445666333334557877 567789999999998776310 0000 0 00 0 Q ss_pred ccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhh----c--cCC-ceeec---ccccCccceEecCcc Q lcl|Aclame:pro 155 GGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR----D--ANG-NPVFR---DDSFAGFRTFFNRNG 222 (305) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~k----d--~~G-~~l~~---~~~l~G~pv~~~~~~ 222 (305) ....+...+++.+..+...+..... ..-.++++|..+..|.+.. 
+ ..| ...++ -.++.|++++.++++ T Consensus 132 ~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 132 SAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred ccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEeccc Confidence 1111223345556666655554433 2236899999999886532 2 212 12222 257899999999999 Q ss_pred ccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 223 AWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 223 ~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) |...+ ..++.+--+.+.+..+ ...++..+... .| ...+++...+|..+.||++++.++.+.+ T Consensus 212 p~~~~-~~~~~~~~~A~~~a~q-~~~~e~~r~~~---------~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 212 RDTDD-EQFVAFHPSAAAYVSQ-IDTVEALRDQD---------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccCCc-cEEEEEeccceeeeee-eehhhcccCCC---------cc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 86543 3344444333322221 11222221111 12 2357888999999999999999999887 No 126 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.35 E-value=3.3e-13 Score=89.02 Aligned_cols=255 Identities=14% Similarity=0.072 Sum_probs=150.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcce----eecCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQN----VNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||.. .++|+.+..++++.+++.+++..++.. ....++++++|+......+....++... 
+..+.+ T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-----~~~~~~ 69 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT-----SADAIS 69 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCcc-----Cccccc Confidence 8872 468999999999999999988888743 1223668999997655555555454432 222334 Q ss_pred ceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCc-ccCcCcccccccccccccccceeec Q lcl|Aclame:pro 77 WANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT-DKPASWVSPALIPAAVTAGQAVEVV 154 (305) Q Consensus 77 f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~-g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (305) -.++++...+. +.-+.|++.-...+..++.+ +.++++++++.++|.-++.=- +.+. . .. . T Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~------------~--~~---~ 131 (273) T protein:vir:10 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT------------A--LT---G 131 (273) T ss_pred cceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc------------c--cc---c Confidence 45555554432 33445666333334557877 567789999999998776310 0000 0 00 0 Q ss_pred ccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhh----c--cCC-ceeec---ccccCccceEecCcc Q lcl|Aclame:pro 155 GGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR----D--ANG-NPVFR---DDSFAGFRTFFNRNG 222 (305) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~k----d--~~G-~~l~~---~~~l~G~pv~~~~~~ 222 (305) ....+...+++.+..+...+..... ..-.++++|..+..|.+.. 
+ ..| ...++ -.++.|++++.++++ T Consensus 132 ~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 132 SAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred ccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEeccc Confidence 1111223345556666655554433 2236899999999886532 2 212 12222 257899999999999 Q ss_pred ccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 223 AWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 223 ~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) |...+ ..++.+--+.+.+..+ ...++..+... .| ...+++...+|..+.||++++.++.+.+ T Consensus 212 p~~~~-~~~~~~~~~A~~~a~q-~~~~e~~r~~~---------~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 212 RDTDD-EQFVAFHPSAAAYVSQ-IDTVEALRDQD---------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccCCc-cEEEEEeccceeeeee-eehhhcccCCC---------cc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 86543 3344444333322221 11222221111 12 2357888999999999999999999887 No 127 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.21 E-value=4.8e-12 Score=82.64 Aligned_cols=281 Identities=15% Similarity=0.066 Sum_probs=162.8 Q ss_pred CCCcc----------CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MADIS----------RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~~t----------~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (305) |+... .++-...| +++..++.+.....+.++.+..+.++.+ +++.+|+. +...+....-|++..... 
T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCCCC Confidence 66543 22222334 9999999999999999999998888774 58899986 566676666666654432 Q ss_pred cccccccceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHH----cCccc--CcCcc---ccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVI----FGTDK--PASWV---SPA 139 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l----~G~g~--~~~~~---~~~ 139 (305) ...++..+..-.+ .....|.+----++..|+-+.+.+++.+++|+..|++++ .+... +.+.. ..| T Consensus 79 -----~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G 153 (335) T protein:vir:63 79 -----VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPG 153 (335) T ss_pred -----ccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCC Confidence 2234444444332 111122221111345689999999999999999999765 33221 11110 111 Q ss_pred ccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc-----ceEEEEchHHHHHHHHhhc--------cCCc--e Q lcl|Aclame:pro 140 LIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA-----PDTLLSSLALRYEVANIRD--------ANGN--P 204 (305) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~~kd--------~~G~--~ 204 (305) +... ...+..+...+...+++.+..+..++...+.. .-..+++|..+..|.+-+. ++|. 
+ T Consensus 154 ~~~~-----~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~ 228 (335) T protein:vir:63 154 VLEK-----LDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDY 228 (335) T ss_pred ccee-----eeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccc Confidence 1111 11112222234566677777777777765543 1368999999999876421 1111 1 Q ss_pred ee-cccccCccceEecCccccCCCC-------ceEEEEehhhEE----------EEeecCcEEEEeecceeccCcceeee Q lcl|Aclame:pro 205 VF-RDDSFAGFRTFFNRNGAWDADA-------AIEVIADSSRVK----------IGVRQDITVKFLDQATLGTGENQINL 266 (305) Q Consensus 205 l~-~~~~l~G~pv~~~~~~~~~~~~-------~~~~~gdf~~~~----------~~~~~~i~v~~~~~~~~~~~~~~~~~ 266 (305) .. +-..+.|+||+.++++|..... ...+-|||+... .+.-.++..++.++.. . T Consensus 229 ~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~---------~ 299 (335) T protein:vir:63 229 VKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE---------K 299 (335) T ss_pred cCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc---------h Confidence 11 1246899999999998854322 234556664432 1122222222221111 0 Q ss_pred eecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 267 AERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 267 ~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) |. 
..+.+..-+|..+.||++++.++.|..+++.--| T Consensus 300 ~~---~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 300 FS---WVLDTFQMYNIGARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred hh---HHhHHHHHcCCcccccceEEEEEEcCCCceeecC Confidence 11 1233444578889999999999988877776666 No 128 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.20 E-value=1.8e-11 Score=79.46 Aligned_cols=289 Identities=12% Similarity=0.052 Sum_probs=156.4 Q ss_pred CCCccCCccc-----------------eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecc Q lcl|Aclame:pro 1 MADISRAEVA-----------------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGES 62 (305) Q Consensus 1 Ma~~t~~~gg-----------------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~ 62 (305) |++.+.+--| .+.=+.+..++.+.....+.++.+.++..+. ++++++|+. +...+....-+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecCC Confidence 4433322211 2334888999999999999999999988777 558889886 55555555545 Q ss_pred hhhccccccccccccee--EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----CcccCcCcc Q lcl|Aclame:pro 63 ATDPKGVKPTSKVTWAN--RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDKPASWV 136 (305) Q Consensus 63 ~~~~~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~~~~~~ 136 (305) ++.... +..+.+..+ ++++..|+.. ..|.+==--++..++.+.+.+++++++++.+|+.++. +.....+.. 
T Consensus 80 ~~i~~~--~~~d~~~te~~l~ID~~~y~~-~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 80 TPILGN--ADKAPPVAEKTIVMDDLLISS-AFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred cCcCCc--cccCCCCCceEEEecchhhhh-hhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 443221 111222233 3444333322 1222211123456899999999999999999998863 211111100 Q ss_pred cc-cccccccc-cccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHHhhccC--------Cce Q lcl|Aclame:pro 137 SP-ALIPAAVT-AGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIRDAN--------GNP 204 (305) Q Consensus 137 ~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~~kd~~--------G~~ 204 (305) .. ...++... .....+......+...+++.+.++...+...+.. .=..+++|..+..|.+-+|.+ |.- T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~ 236 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSA 236 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccc Confidence 00 00000000 0011112222234556677777777766655432 225789999999887655432 221 Q ss_pred eecc---cccCccceEecCccccCCCCc---------------------------------eEEEEeh---hh--EEEE- Q lcl|Aclame:pro 205 VFRD---DSFAGFRTFFNRNGAWDADAA---------------------------------IEVIADS---SR--VKIG- 242 (305) Q Consensus 205 l~~~---~~l~G~pv~~~~~~~~~~~~~---------------------------------~~~~gdf---~~--~~~~- 242 (305) +... ..+.|++|+.++++|..++.+ ..+-+|| +. ..+. 
T Consensus 237 ~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~ 316 (375) T protein:vir:10 237 LQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQ 316 (375) T ss_pred eeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEc Confidence 2222 368899999999888543211 1233344 11 1111 Q ss_pred -------eecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 243 -------VRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 243 -------~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .-.++++++++ ..|+ -.+-...|.+.+-+|-.+.||++++.+.... ..|+| T Consensus 317 ~~A~g~v~~~~~~~~~~~--------~~~~-~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~---~~~~~ 374 (375) T protein:vir:10 317 KEAAGVVEAIGPQVQVTN--------GDVS-VIYQGDVILGRMAMGADYLNPAAAVELYIGA---TAPSA 374 (375) T ss_pred hhheeeeeeecccccccc--------chhh-heeeeeeeeeeeeeccCccCceeEEEEecCc---Ccccc Confidence 22333333321 0011 1122334677778888999999999997653 46777 No 129 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.19 E-value=6.9e-12 Score=81.79 Aligned_cols=269 Identities=9% Similarity=0.036 Sum_probs=151.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhc---------ceee--cCCCceEEEEEeCC-Cceeeeecchhhccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAF---------QNVN--MGTKTTHLPVLATL-PEADWVGESATDPKG 68 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~---------~~~~--~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~ 68 (305) ||.+.- .-.++|+.+..-+.+...+.+.+.+-. .... .++..+++|.+..- .++.-+.|+.. 
T Consensus 1 MA~T~l--sd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~---- 74 (324) T protein:vir:59 1 MAYTKI--SDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDD---- 74 (324) T ss_pred CCceee--eceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcc---- Confidence 995544 345778888777877777777664421 2221 34668899998643 45666666654 Q ss_pred ccccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccc Q lcl|Aclame:pro 69 VKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG 148 (305) Q Consensus 69 ~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~ 148 (305) ++..+.+.++-....++.+.-+.++++...-+..+....+.+++++.++++.++.+|.--. +........ . .. T Consensus 75 -i~~~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~---g~~~~~~~~--~-~~ 147 (324) T protein:vir:59 75 -LVPQKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELA---GVFSNDDMK--D-NK 147 (324) T ss_pred -cchhhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHH---Hhhhccccc--c-ce Confidence 4555666666666677788888999987777778899999999999999999987763210 000000000 0 00 Q ss_pred ccee-ecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hccCCceeecccccCccceEecCc Q lcl|Aclame:pro 149 QAVE-VVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDANGNPVFRDDSFAGFRTFFNRN 221 (305) Q Consensus 149 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~~G~~l~~~~~l~G~pv~~~~~ 221 (305) ..+. ..+...+ .+.+.++...+.........++||+.++..|++. +.++|..- -+.++|++|++++. 
T Consensus 148 ~dvsa~~~~~~s----~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~--i~~~~G~~VivdD~ 221 (324) T protein:vir:59 148 LDISGTADGIYS----AETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIR--FPTYMNKRVIVDDS 221 (324) T ss_pred eeeeccccceec----HHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhccccccCce--eeeecccEEEEeCC Confidence 0000 0111111 2345555555555556678999999999999865 34443321 25689999999999 Q ss_pred cccCCCC--c----eEEEEehhhEEEEe-ecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEe Q lcl|Aclame:pro 222 GAWDADA--A----IEVIADSSRVKIGV-RQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGAN 294 (305) Q Consensus 222 ~~~~~~~--~----~~~~gdf~~~~~~~-~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~ 294 (305) +|..... . ..+|+. ..+.+.. +..+.++..++.. .....+..+.++. .+|.++.... T Consensus 222 ~p~~~~~~~~~~y~s~l~~~-GAi~~~~~~~~v~vE~dRd~~------------~g~~~l~~r~~~~---~~p~G~s~~~ 285 (324) T protein:vir:59 222 MPVETLEDGTKVFTSYLFGA-GALGYAEGQPEVPTETARNAL------------GSQDILINRKHFV---LHPRGVKFTE 285 (324) T ss_pred CCccccCCCCceEEEEEEec-CeEEEeecCCCcceecccCcc------------ccceEEEEeeEEE---eEeeeEEecc Confidence 8864321 1 233332 1122222 2345555554431 1223344444433 3444433322 Q ss_pred ccccccccCCC Q lcl|Aclame:pro 295 KTPVAVVAPAA 305 (305) Q Consensus 295 ~t~~a~v~~a~ 305 (305) .+ .+-..|-- T Consensus 286 ~~-~~~~sPt~ 295 (324) T protein:vir:59 286 NA-MAGTTPTD 295 (324) T ss_pred cc-cCCCCCCh Confidence 11 11122222 No 130 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.19 E-value=1.1e-12 Score=86.12 Aligned_cols=260 Identities=13% Similarity=0.122 Sum_probs=170.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceee-eecc-hhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADW-VGES-ATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~-v~E~-~~~~~~~~~~~~~~f~ 78 
(305) ....++.+...+||++++...++.+.+..++..+....|..+.+++||+.+....... ..++ +..+.+..+..+.+|+ T Consensus 131 ~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~ 210 (410) T protein:vir:83 131 ADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVID 210 (410) T ss_pred hccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeee Confidence 3445666666788888999999999999999999888999999999988765543221 1122 2334444688888899 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHH---HHcCcccCcCcccccccccccccccceeecc Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQA---VIFGTDKPASWVSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a---~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) ..+...+.+|+...+|++.++.|.+...+...+-|+.+++.+-+.+ +|..+-++ . ... T Consensus 211 t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~-----------~--------~a~ 271 (410) T protein:vir:83 211 RLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG-----------A--------VGY 271 (410) T ss_pred eccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----------h--------hhh Confidence 9999999999999999999999999999999999999998877643 44322110 0 011 Q ss_pred cchhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHHh--------hccCC---ceee--cccccCccceEecC Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVANI--------RDANG---NPVF--RDDSFAGFRTFFNR 220 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~~--------kd~~G---~~l~--~~~~l~G~pv~~~~ 220 (305) ...+.+.+...+.++...+... +.....+.++|++...+.++ +|+.| .++. .-+.+.+.||++.. 
T Consensus 272 ~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~ 351 (410) T protein:vir:83 272 GNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSA 351 (410) T ss_pred hhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEec Confidence 1223445555555555555554 44455678899887655443 23333 1111 12467888998776 Q ss_pred ccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 221 NGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 221 ~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .. ..+++.|-|...+.+-+..+=.+++.++..+ |.+.=.+ .|+.+.++.+.+++-+.+. T Consensus 352 ~a----~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~------------nLt~~yS-gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 352 AL----GSGDAYLFSTAAIECFEQRVGTLQVVEPSVF------------GLQVAYA-GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred CC----CcCeeeEeccceeeeeecCCceeEeeCCchh------------hhhhhhe-eeeeeccccccceeeeccC Confidence 54 4566777777666555544322333322211 1111112 5778889999999888887 No 131 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.18 E-value=1.6e-12 Score=85.27 Aligned_cols=286 Identities=12% Similarity=0.002 Sum_probs=152.9 Q ss_pred CCCccCCcc--------ceEccHHHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MADISRAEV--------ASLIQEAYSDTLLAAAKQGSTVLSAFQNVN---MGTKTTHLPVLATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~~t~~~g--------g~lip~~~~~~i~~~~~~~~~l~~l~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (305) ||-+.+-+| ..++|+.+..++++.+++..++.++++..+ ..+++++||+.. .+.+.-..++...+... 
T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 775555544 447899999999999999998888876443 236689999864 55565566665543332 Q ss_pred cccccccceeEEeeeee-EEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEE-IAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG 148 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k-~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~ 148 (305) .+-.++++...+ ...-+.|+++-..++..++.+.+.++.++++++++|+.++.---............ .. T Consensus 80 -----~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~----~~ 150 (341) T protein:vir:94 80 -----VNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFS----SS 150 (341) T ss_pred -----ccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcccc----Cc Confidence 333444555433 34446777755555677899999999999999999988774211111000000000 00 Q ss_pred cceeecccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhh-----ccCCceeecc---cccCccceEe Q lcl|Aclame:pro 149 QAVEVVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR-----DANGNPVFRD---DSFAGFRTFF 218 (305) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~k-----d~~G~~l~~~---~~l~G~pv~~ 218 (305) .. ..........++.+..+...+....- ..=.++++|..+..|.+.. |..|.-.++. ..+.|++|+. 
T Consensus 151 ~~---~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~ 227 (341) T protein:vir:94 151 NG---AITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIR 227 (341) T ss_pred cc---cccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEE Confidence 00 00001111122334444444443322 2225788999999997532 2223322332 4799999999 Q ss_pred cCccccCCCCce------E-----------------EEEehhhE--EEEeecCc-EEEEeecceecc-------Ccceee Q lcl|Aclame:pro 219 NRNGAWDADAAI------E-----------------VIADSSRV--KIGVRQDI-TVKFLDQATLGT-------GENQIN 265 (305) Q Consensus 219 ~~~~~~~~~~~~------~-----------------~~gdf~~~--~~~~~~~i-~v~~~~~~~~~~-------~~~~~~ 265 (305) ++++|....... . .-+|++.. +++-+..+ .++..+...+.. ....++ T Consensus 228 Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~ 307 (341) T protein:vir:94 228 TSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFE 307 (341) T ss_pred eccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccch Confidence 998875432110 0 00111111 11111111 111111000000 000000 Q ss_pred eeecCcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 266 LAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 266 ~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) -.+-...+++..-+|.++.||++.+.+....+.+ T Consensus 308 -~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 308 -NREQVWLMVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred -hhhhhhhhhhhhhhcccccCcceeEEEecCcCCC Confidence 0111234566777899999999998888776655 No 132 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.18 E-value=9.2e-12 Score=81.09 Aligned_cols=278 Identities=10% Similarity=0.045 Sum_probs=155.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcc---------eeecCCCceEEEEEeC-CCceeeeecchhhccccc Q 
lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQ---------NVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVK 70 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~---------~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~ 70 (305) ||..+|.-.-.++|+.+..-+.+...+.+.+++-.- ....++..+++|.+.. ..++.-+.|++. ++ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~----~i 76 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDK----AL 76 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcc----cc Confidence 998766666778899997777777777666644221 1223577899999863 345555556542 24 Q ss_pred ccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccc--ccccccccc Q lcl|Aclame:pro 71 PTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPA--LIPAAVTAG 148 (305) Q Consensus 71 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~--~~~~~~~~~ 148 (305) +..+.+-++-....++.+..+.++++...-+..|....+.+++++.+++..++.++.--. +.+... ......... T Consensus 77 ~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~---gvf~~~~~~~~~~~~~~ 153 (330) T protein:vir:10 77 ETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLN---GIFATGTAGEKGALEET 153 (330) T ss_pred chhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHH---hhhhhhhcccchhhhhh Confidence 555566666667777888889999988777888999999999999999988877663110 000000 000000000 Q ss_pred cceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hccCCceeecccccCccceEecCcc Q lcl|Aclame:pro 149 QAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDANGNPVFRDDSFAGFRTFFNRNG 222 (305) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~~G~~l~~~~~l~G~pv~~~~~~ 222 (305) ........... .-.+.+.++...+.........++||+.++..|++. +++++.. 
.-+.++|++|++++.+ T Consensus 154 ~~~~~~~~~a~--~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~--~i~~~~G~~VivdD~~ 229 (330) T protein:vir:10 154 HVSDQSKASTG--IDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATI--NIPTYLGYRVIIDDGI 229 (330) T ss_pred heecccccccc--cCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhcccccCc--ccccccceEEEEeCCC Confidence 00000011110 112345555555555555677999999999999874 2333321 1267899999999999 Q ss_pred ccCCCCceE-EEEehhhEEEEe---ecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc-c Q lcl|Aclame:pro 223 AWDADAAIE-VIADSSRVKIGV---RQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT-P 297 (305) Q Consensus 223 ~~~~~~~~~-~~gdf~~~~~~~---~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t-~ 297 (305) |...+.... +|+. ..+.+.. ...+.+|..|+.. .....+....++ +.+|.++.....+ + T Consensus 230 p~~~~~yt~yl~~~-GAi~~~~~~~~~~v~~EtdRd~~------------~g~~~l~~r~~~---~~hp~G~s~~~~~~~ 293 (330) T protein:vir:10 230 APTGDIYTSYLFRT-GSIGLNTGNPSGLTTFETSREAA------------KGNDMIYTRRAL---VMHPYGVKWTGAEVD 293 (330) T ss_pred CCCCCceeEEEEec-CceeeecccCCccccccccCCcc------------ccceEEEEeeEE---Eeeeeeeeecccccc Confidence 876555443 3331 1112222 1123455444432 111222222232 3456555554321 1 Q ss_pred cccccCCC Q lcl|Aclame:pro 298 VAVVAPAA 305 (305) Q Consensus 298 ~a~v~~a~ 305 (305) .+-..|.- T Consensus 294 ~~~~sPt~ 301 (330) T protein:vir:10 294 AGNITPSN 301 (330) T ss_pred cCcCCcCh Confidence 11122322 No 133 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.14 E-value=1.4e-11 Score=80.04 Aligned_cols=281 Identities=14% Similarity=0.074 Sum_probs=159.6 Q ss_pred CCCcc----------CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MADIS----------RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 
Ma~~t----------~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (305) |+... .++-...| +.+..++.+.....+.++++..+.++. ++++.+|+. +...+....-|++..... T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCCCC Confidence 66433 22223344 899999999999999999999888876 458999976 666666666666654432 Q ss_pred cccccccceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHH----cCcc--cCcCc---cccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVI----FGTD--KPASW---VSPA 139 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l----~G~g--~~~~~---~~~~ 139 (305) ...++..+..-.+ .....|.+-=--++..|+.+.+.+++++++++..|++++ .+.. ++... ...| T Consensus 79 -----~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G 153 (335) T protein:vir:78 79 -----VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPG 153 (335) T ss_pred -----cccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCC Confidence 2234444443332 111122221112345689999999999999999999776 2222 11111 1111 Q ss_pred ccccccccccceeecccchhhhHHHHHHHHHHHHhhhccccc-----eEEEEchHHHHHHHHhhc--------cCCceee Q lcl|Aclame:pro 140 LIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAP-----DTLLSSLALRYEVANIRD--------ANGNPVF 206 (305) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~v~~~~~~~~l~~~kd--------~~G~~l~ 206 (305) ++... ..+..+...+...+.+.+..+...+...+... -..+++|..+..|..-+. 
++|.-.+ T Consensus 154 ~~~~~-----~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~ 228 (335) T protein:vir:78 154 VLEKL-----DLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDY 228 (335) T ss_pred cceee-----eeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccc Confidence 11111 11112223345566777777777666554421 258899999999876421 1121111 Q ss_pred ---cccccCccceEecCccccCCCC-------ceEEEEehhhEE----------EEeecCcEEEEeecceeccCcceeee Q lcl|Aclame:pro 207 ---RDDSFAGFRTFFNRNGAWDADA-------AIEVIADSSRVK----------IGVRQDITVKFLDQATLGTGENQINL 266 (305) Q Consensus 207 ---~~~~l~G~pv~~~~~~~~~~~~-------~~~~~gdf~~~~----------~~~~~~i~v~~~~~~~~~~~~~~~~~ 266 (305) .-..+.|+||+.++++|..... ...+-+||+.-. .+.-.++..++.++. . . T Consensus 229 ~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~----~-----~ 299 (335) T protein:vir:78 229 VKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDH----D-----Q 299 (335) T ss_pred ccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeecc----c-----h Confidence 1246899999999999855322 223334553321 121122222222111 1 1 Q ss_pred eecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 267 AERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 267 ~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) |. 
-.+.+..-+|..+.||++.+.++.|..+++.--| T Consensus 300 ~~---~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 300 FS---WVLDTFQMYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred hh---HhhhHHHHcCCcccCcceEEEEEecCCCcccccC Confidence 11 1234444578889999999999988776666656 No 134 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.13 E-value=7.9e-12 Score=81.44 Aligned_cols=280 Identities=13% Similarity=0.049 Sum_probs=154.3 Q ss_pred CCCccCC------ccce----------EccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecch Q lcl|Aclame:pro 1 MADISRA------EVAS----------LIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESA 63 (305) Q Consensus 1 Ma~~t~~------~gg~----------lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~ 63 (305) ||.+++. .++. .| +.+..++.+...+.+.++++.++.++. ++++++|+. +...+.....|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCC Confidence 8865332 1222 34 889999999999999999999988877 558899976 555666666666 Q ss_pred hhcccccccccccceeEEeeeee--EEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----Ccc--cCcCc Q lcl|Aclame:pro 64 TDPKGVKPTSKVTWANRTLVAEE--IAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTD--KPASW 135 (305) Q Consensus 64 ~~~~~~~~~~~~~f~~v~~~~~k--~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g--~~~~~ 135 (305) ...... .++.-.+++|...+ +.. ..|.+-=--++..++.+.+.+++++++++..|+.++. +.. ++... 
T Consensus 79 ~l~~t~---~~~~~~e~~l~ID~~~y~~-~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 79 NLDDIR---KDIKHTEKVITIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred CCCCCC---CCcccceEEEEEcchhhhh-hhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 643321 11233444444333 222 1222211123456899999999999999999988852 111 11111 Q ss_pred ccccccccccc--cccceeecccchhhhHHHHHHHHHHHHhhhccccc--eEEEEchHHHHHHHHhhccC-----Cceee Q lcl|Aclame:pro 136 VSPALIPAAVT--AGQAVEVVGGVANESDIVGATNRAAKAVASAGWAP--DTLLSSLALRYEVANIRDAN-----GNPVF 206 (305) Q Consensus 136 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~~kd~~-----G~~l~ 206 (305) .+.+....... ................+++.+.++...+...+-.. =..+++|..+..|.+-+.-+ |.-.. T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~ 234 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDP 234 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccce Confidence 11111111100 11111112222333456677777777666654322 25788999999886543211 11111 Q ss_pred c---ccccCccceEecCccccCC--C---------------CceEEEEehhhEE----------EEeecCcEEEEeecce Q lcl|Aclame:pro 207 R---DDSFAGFRTFFNRNGAWDA--D---------------AAIEVIADSSRVK----------IGVRQDITVKFLDQAT 256 (305) Q Consensus 207 ~---~~~l~G~pv~~~~~~~~~~--~---------------~~~~~~gdf~~~~----------~~~~~~i~v~~~~~~~ 256 (305) . -..+.|++|+.++++|... + .+-.+..+|+... .+...+++++..++.. 
T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 1 1368999999999887421 0 1112223443321 1222333444333221 Q ss_pred eccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 257 LGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 257 ~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) .|. -.+++..-+|..+.||++.+.+..++- T Consensus 315 ---------~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 315 ---------FQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred ---------HHH---HHHHHHhhcccceecccceEEEEeecC Confidence 122 246777888999999998866666654 No 135 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.12 E-value=1.6e-11 Score=79.81 Aligned_cols=283 Identities=13% Similarity=-0.004 Sum_probs=157.8 Q ss_pred CCCccCC---------ccc-eEcc-HHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhccc Q lcl|Aclame:pro 1 MADISRA---------EVA-SLIQ-EAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKG 68 (305) Q Consensus 1 Ma~~t~~---------~gg-~lip-~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (305) |+....+ .++ .-+. +++..++.+...+.+.++++.++.++. ++++.||+. +...++...-+++.... 
T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~~~ 79 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELVVQ 79 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCCCC Confidence 7765211 112 2333 899999999999999999999988887 558999976 66666666666655433 Q ss_pred ccccccccceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----Cccc--Cc---Ccccc Q lcl|Aclame:pro 69 VKPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDK--PA---SWVSP 138 (305) Q Consensus 69 ~~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~--~~---~~~~~ 138 (305) . .+-.+.++....+ .....|.+-=--++..|+.+.+.++++++++++.|++++. +... +. ..+.. T Consensus 80 ~-----~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~ 154 (334) T protein:vir:80 80 K-----NVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHD 154 (334) T ss_pred C-----cccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccC Confidence 2 2334545444442 1222222211123456899999999999999999998752 2211 11 11111 Q ss_pred cccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc-----ceEEEEchHHHHHHHHhhc---c-----C-Cce Q lcl|Aclame:pro 139 ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA-----PDTLLSSLALRYEVANIRD---A-----N-GNP 204 (305) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~~kd---~-----~-G~~ 204 (305) |+.......+ +......+.+.+++.+..+...+...+.. .=..+++|..+..|..-+. . + +.. 
T Consensus 155 G~~~~~~~~g---~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~ 231 (334) T protein:vir:80 155 GILLPSTISG---LAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNS 231 (334) T ss_pred Ccceeecccc---cccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccccc Confidence 2222111111 12223334445566666666666655443 2368999999999875421 1 1 111 Q ss_pred eec--ccccCccceEecCccccCC-------CCceEEEEehhhEEE--EeecCc-EEEEee---cceeccCcceeeeeec Q lcl|Aclame:pro 205 VFR--DDSFAGFRTFFNRNGAWDA-------DAAIEVIADSSRVKI--GVRQDI-TVKFLD---QATLGTGENQINLAER 269 (305) Q Consensus 205 l~~--~~~l~G~pv~~~~~~~~~~-------~~~~~~~gdf~~~~~--~~~~~i-~v~~~~---~~~~~~~~~~~~~~~~ 269 (305) +-. -..++|++|+.++++|... +....+-|||+.... .-++-+ .++..+ +.+.+.. .|.. T Consensus 232 ~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-----~~~d 306 (334) T protein:vir:80 232 FVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK-----DFGH 306 (334) T ss_pred ccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh-----hHHH Confidence 111 2468999999999998542 122466777766432 112221 111111 1111100 0111 Q ss_pred CcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 270 DMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 270 ~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) .+.+..-+|-++.||++++.+..+.+-| T Consensus 307 ---~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 307 ---YLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred ---HHHHHHHcCCceeccceEEEEEEeeecC Confidence 1233345678899998888888775433 No 136 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.09 E-value=9.1e-12 Score=81.12 Aligned_cols=279 Identities=13% Similarity=0.062 Sum_probs=150.5 Q ss_pred CCCccCCcc------c-------eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhc Q lcl|Aclame:pro 1 
MADISRAEV------A-------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDP 66 (305) Q Consensus 1 Ma~~t~~~g------g-------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (305) ||.++.+.- | .+.=+++..+++....+.+.++.+.++.++. ++++.||+. +...+.....++... T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERLS 79 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCcC Confidence 776554322 1 1222688899999988889999998888766 558889986 555666566665543 Q ss_pred ccccccccccceeEEeeeeeEEEeehhhHHHhh-----cCHHHHHHHHHHHHHHHHHHHHHHHHHc--C--cc--cCcCc Q lcl|Aclame:pro 67 KGVKPTSKVTWANRTLVAEEIAVIIPVHENVID-----DATVAVLTEVAELGGQAIGKKLDQAVIF--G--TD--KPASW 135 (305) Q Consensus 67 ~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-----ds~~~~~~~v~~~la~~~a~~~d~a~l~--G--~g--~~~~~ 135 (305) .... ..+-.+.++...+. .+.+.++. ++..++.+.+.++.++++++..|+.++. . .. .+... T Consensus 80 ~~~~---~~~~~e~~itID~~----~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~ 152 (347) T protein:vir:94 80 DKRK---GIKHTEKVITIDGL----LTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNE 152 (347) T ss_pred CCCC---CCCcceEEEEecch----hhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 2211 12233433433332 12222332 2455788999999999999999998862 1 11 00111 Q ss_pred cccccccccccc-ccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHHhhccCC-----ceeec Q lcl|Aclame:pro 136 VSPALIPAAVTA-GQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIRDANG-----NPVFR 207 (305) Q Consensus 136 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~~kd~~G-----~~l~~ 207 (305) ...+........ .+.............+++.+.++...+...+-. .=..+++|..+..|..-++-+. 
.-..+ T Consensus 153 ~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 153 NIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPE 232 (347) T ss_pred ccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccccccc Confidence 111111111111 111111111223345566676666666655432 2268899999887754332111 11111 Q ss_pred ---ccccCccceEecCccccCCC------Cc-e-------E--------EEEehhhEE--E--------EeecCcEEEEe Q lcl|Aclame:pro 208 ---DDSFAGFRTFFNRNGAWDAD------AA-I-------E--------VIADSSRVK--I--------GVRQDITVKFL 252 (305) Q Consensus 208 ---~~~l~G~pv~~~~~~~~~~~------~~-~-------~--------~~gdf~~~~--~--------~~~~~i~v~~~ 252 (305) -..++|++|+.++++|.... .+ . . +-+||+.-. + +...+++++.. T Consensus 233 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~ 312 (347) T protein:vir:94 233 TGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERD 312 (347) T ss_pred ccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccch Confidence 14789999999999884211 11 1 1 223332221 1 11112223322 Q ss_pred ecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 253 DQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) ++. 
+.| .-.+++...+|..+.||++.+.++.+.+- T Consensus 313 r~~---------~~~---~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 313 RDV---------DAQ---GDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hch---------hhH---HHHhhhhhhhcCcccccceeEEEEecCCC Confidence 211 112 23578888999999999999999887554 No 137 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.08 E-value=3.7e-11 Score=77.78 Aligned_cols=282 Identities=11% Similarity=0.030 Sum_probs=153.9 Q ss_pred CCCccCCc--------c---c---eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhh Q lcl|Aclame:pro 1 MADISRAE--------V---A---SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATD 65 (305) Q Consensus 1 Ma~~t~~~--------g---g---~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 65 (305) ||.+.++. + | .+.=+.+..++.+...+.+.++.+.++..+. ++++++|+. +...+.....++.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G~~l 79 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPGENL 79 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecCcCC Confidence 77544332 0 1 1233889999999999999999999887765 568889865 44556666666653 Q ss_pred cccccccccccceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----Cccc--CcCcccc Q lcl|Aclame:pro 66 PKGVKPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDK--PASWVSP 138 (305) Q Consensus 66 ~~~~~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~--~~~~~~~ 138 (305) ... ..++...+.++...++ .....|.+-=--++..|+.+.+.+++++++++..|+.++. +... +....+. 
T Consensus 80 ~~~---~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:94 80 DDK---RKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIA 156 (347) T ss_pred CCC---cCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 321 1123455555544443 1112232211123456789999999999999999998862 2111 0000000 Q ss_pred ccccc--ccccccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHHhhc-cCCceee------- Q lcl|Aclame:pro 139 ALIPA--AVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIRD-ANGNPVF------- 206 (305) Q Consensus 139 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~~kd-~~G~~l~------- 206 (305) +.... ...........+.......+++.+.++...+...+-. .-.++++|..+..|.+..+ ..+.+.. T Consensus 157 g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G 236 (347) T protein:vir:94 157 GLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTG 236 (347) T ss_pred cCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccc Confidence 00000 0000001111122233445566677777666655432 2246778988888765432 2222211 Q ss_pred cccccCccceEecCccccCCC------Cc---------------eEEEEehhhEE--E--------EeecCcEEEEeecc Q lcl|Aclame:pro 207 RDDSFAGFRTFFNRNGAWDAD------AA---------------IEVIADSSRVK--I--------GVRQDITVKFLDQA 255 (305) Q Consensus 207 ~~~~l~G~pv~~~~~~~~~~~------~~---------------~~~~gdf~~~~--~--------~~~~~i~v~~~~~~ 255 (305) .-..+.|++|+.++++|.... .+ .-+=+||++.. + +...++.+++.++. 
T Consensus 237 ~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~ 316 (347) T protein:vir:94 237 SIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRA 316 (347) T ss_pred eeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeech Confidence 124789999999999874321 00 11334444321 1 22233344443322 Q ss_pred eeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 256 TLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) .+ -...+.+..-+|-.+.||++.+.+..+.+ T Consensus 317 ~~------------~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 317 NF------------QADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hh------------hhhhhhhhhhhcCcccccceeEEEEecCC Confidence 21 12245677778889999999998887765 No 138 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.07 E-value=1.5e-11 Score=80.00 Aligned_cols=283 Identities=13% Similarity=0.058 Sum_probs=154.0 Q ss_pred CCCccCCc--------c---c---eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhh Q lcl|Aclame:pro 1 MADISRAE--------V---A---SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATD 65 (305) Q Consensus 1 Ma~~t~~~--------g---g---~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 65 (305) ||..+++. | + .+.=+++..++.+...+.+.++.+.++.++. ++++.+|+. +...+.....++.. 
T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i-G~~~~~~~~~g~~l 79 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee-cceeeeeeccccCC Confidence 66433221 1 1 2333888999999999999999998887765 558889865 44445545555542 Q ss_pred cccccccccccceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----CcccC--cCcccc Q lcl|Aclame:pro 66 PKGVKPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDKP--ASWVSP 138 (305) Q Consensus 66 ~~~~~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~~--~~~~~~ 138 (305) .. +..++...++++...+. .....|.+-=.-++..|+.+.+.+++++++++..|+.++. +.... ...... T Consensus 80 ~~---~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 80 DD---KRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred CC---CCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 21 11234455666655553 1122333321223345788899999999999999998863 21111 011111 Q ss_pred ccccccccc-ccceeecccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhhcc-CCcee----e---c Q lcl|Aclame:pro 139 ALIPAAVTA-GQAVEVVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIRDA-NGNPV----F---R 207 (305) Q Consensus 139 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~kd~-~G~~l----~---~ 207 (305) |........ ...............+++.+.++...+....- ..=.++++|..+..|.+.+.. ...+. + . 
T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcce Confidence 111111111 00111112222333445666666666555432 223688999988887653211 11111 1 1 Q ss_pred ccccCccceEecCccccCCCC---------------------ceEEEEehhhEEE----------EeecCcEEEEeecce Q lcl|Aclame:pro 208 DDSFAGFRTFFNRNGAWDADA---------------------AIEVIADSSRVKI----------GVRQDITVKFLDQAT 256 (305) Q Consensus 208 ~~~l~G~pv~~~~~~~~~~~~---------------------~~~~~gdf~~~~~----------~~~~~i~v~~~~~~~ 256 (305) -..+.|++|+.++++|..... ..-+.+||+...- +...++.++..++.. T Consensus 237 vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~ 316 (347) T protein:vir:88 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE 316 (347) T ss_pred eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechh Confidence 246889999999988742111 0113345543211 112223333332221 Q ss_pred eccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccc Q lcl|Aclame:pro 257 LGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 257 ~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a 299 (305) .| .-.+++.+.+|..+.||++.+.++.+++| T Consensus 317 ---------~~---~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 317 ---------FQ---ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred ---------hH---HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 11 23578888999999999999999998877 No 139 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.07 E-value=4.6e-11 Score=77.24 Aligned_cols=279 Identities=14% Similarity=0.068 Sum_probs=155.6 Q ss_pred 
CCCccCC-------ccc--------eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchh Q lcl|Aclame:pro 1 MADISRA-------EVA--------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESAT 64 (305) Q Consensus 1 Ma~~t~~-------~gg--------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~ 64 (305) |++.++. ..| .+.=+.+..++.+...+.+.++++.++.++. ++++++|+. +...+.....|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCCC Confidence 7665441 001 2334888999999999999999999988887 558889976 6666777777765 Q ss_pred hccccccccccccee--EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----Ccc--cCcCc- Q lcl|Aclame:pro 65 DPKGVKPTSKVTWAN--RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTD--KPASW- 135 (305) Q Consensus 65 ~~~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g--~~~~~- 135 (305) ..... .+++..+ ++++..++... .|.+-=--++..|+.+.+.+++++++++..|+.++. +.. ++... T Consensus 80 l~~~~---~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 80 LDDKR---KDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred CCCCC---CCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 43321 1233455 33333333222 222211123456899999999999999999998873 111 00111 Q ss_pred ---ccccccccccccccceeecccchhhhHHHHHHHHHHHHhhhccccc--eEEEEchHHHHHHHHhhccC-Ccee---- Q lcl|Aclame:pro 136 ---VSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAP--DTLLSSLALRYEVANIRDAN-GNPV---- 205 (305) Q Consensus 136 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~~kd~~-G~~l---- 205 (305) +..+++......+.. ..........+++.+.++...+...+-.. =.++++|..+..|.+-+.-+ ..+. 
T Consensus 156 ~~~~~~~~~~~~~~~g~~--~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~ 233 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAA--LTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALID 233 (345) T ss_pred cccccccccccccccccc--ccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccc Confidence 111111111111111 11112233456677777766666554332 25899999999886543221 1111 Q ss_pred ec---ccccCccceEecCccccCCCCc----------------------------eEEEEehhhEEEEeecCcEEEEeec Q lcl|Aclame:pro 206 FR---DDSFAGFRTFFNRNGAWDADAA----------------------------IEVIADSSRVKIGVRQDITVKFLDQ 254 (305) Q Consensus 206 ~~---~~~l~G~pv~~~~~~~~~~~~~----------------------------~~~~gdf~~~~~~~~~~i~v~~~~~ 254 (305) .. -..+.|++|+.++++|...... ..++.-.+.+..+...+++++..++ T Consensus 234 ~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 313 (345) T protein:vir:22 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARR 313 (345) T ss_pred cccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeec Confidence 11 1368899999998876421111 1111122222233333444444433 Q ss_pred ceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 255 ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) ... |. -.+++..-+|..+.||++.+.++.+-. 
T Consensus 314 ~~~---------~~---d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 314 ANF---------QA---DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hhH---------HH---HHHHHHHhcCCcccccceeEEEEEeeC Confidence 221 11 246777788999999999999998744 No 140 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.04 E-value=5.5e-11 Score=76.82 Aligned_cols=274 Identities=10% Similarity=0.019 Sum_probs=143.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhc---------ceeecCCCceEEEEEeC-CCceeeeecchhhccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAF---------QNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVK 70 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~---------~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~ 70 (305) ||.+.- .-.++|+.+..-+.++..+.+.+++-. ....-++..+++|.+.. ..++.-+.|+.. + T Consensus 1 MA~T~l--sd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~-----i 73 (351) T protein:vir:15 1 MAETHL--SDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDD-----I 73 (351) T ss_pred CCceee--eeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcc-----c Confidence 996544 345778888777777776666664421 11223467899999864 245656666654 4 Q ss_pred ccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccc Q lcl|Aclame:pro 71 PTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQA 150 (305) Q Consensus 71 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~ 150 (305) +..+.+-.+-....++.+..+.++++...-+..+....+.+++++.+++..++.+|.--- +.+.............. 
T Consensus 74 ~~~kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~---gv~~~~~~~~~~~~d~t 150 (351) T protein:vir:15 74 DVNNLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLK---GVMGVTKIANSKVYDQT 150 (351) T ss_pred chheecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHH---HHhhchhhcccceeccc Confidence 444555555556667788888999987777777899999999999999999988774100 00000000000000000 Q ss_pred -eeecccchhhhHHHHHHHHHHHHhhhcc-ccceEEEEchHHHHHHHHhh------ccCCceeecccccCccceEecCcc Q lcl|Aclame:pro 151 -VEVVGGVANESDIVGATNRAAKAVASAG-WAPDTLLSSLALRYEVANIR------DANGNPVFRDDSFAGFRTFFNRNG 222 (305) Q Consensus 151 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~~k------d~~G~~l~~~~~l~G~pv~~~~~~ 222 (305) ......... .+.+.++..++.... -....|+||+.++..|++.. .++|..- -+.++|++|++++.+ T Consensus 151 ~~~~~~~~is----~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~--i~t~~G~~VivdD~~ 224 (351) T protein:vir:15 151 KVSPSEPMFG----AKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGATP--FEAYNGLRIVLDDDI 224 (351) T ss_pred cccccccccC----HHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhccccccCcc--cceecceEEEEcCCC Confidence 000111111 244555555554432 23578999999999998643 4444321 267999999999998 Q ss_pred ccCCCC--c----eEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 223 AWDADA--A----IEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 223 ~~~~~~--~----~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) |.+... . ..+||.=. +.++. ++..+++.++.....+. 
-.+..+.++ +.+|.++..-..+ T Consensus 225 p~~~~~~~~~~ytsyl~~~GA-i~~~~-~~~~ve~~rd~~~~~g~----------d~l~~r~~~---~~hp~G~s~~~~~ 289 (351) T protein:vir:15 225 EIDLTDKTKPVSTSYIFAPGA-VRYST-NMRSTETKYDPLINGGQ----------DVIVQKRVG---TIHVAGTSIKASF 289 (351) T ss_pred ccccCCCCCceeEEEEEecce-eeeec-CCcCcceeecccCCCCc----------eEEEEeeee---eeeeeeeeecccc Confidence 864322 1 22333211 11222 23334444444322111 111111111 1233333322111 Q ss_pred -ccc-------------------------------------cccCCC Q lcl|Aclame:pro 297 -PVA-------------------------------------VVAPAA 305 (305) Q Consensus 297 -~~a-------------------------------------~v~~a~ 305 (305) +.+ .+.||- T Consensus 290 ~~~~~~sPt~~~L~~~~NW~~v~~~d~k~I~iv~~~~~~~~~~~~~~ 336 (351) T protein:vir:15 290 SPSKASFPTIDELAKSSTWEVVDGIDVRSIGVVAYTAQLDPALTPGA 336 (351) T ss_pred cccCcCCcChHHhcCCcccccccCCCccccceEEEEEecCcccccCC Confidence 001 111111 No 141 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.03 E-value=5.6e-10 Score=71.31 Aligned_cols=284 Identities=10% Similarity=-0.029 Sum_probs=152.5 Q ss_pred CCCccCCccce---------EccHHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCCceeeeecchhhccccc Q lcl|Aclame:pro 1 MADISRAEVAS---------LIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGVK 70 (305) Q Consensus 1 Ma~~t~~~gg~---------lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (305) |+..++...+. +.=+++..++.+.....+.++.+..+.++.+ +++++|+. +..+++...-|+.... . 
T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~-~- 77 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDA-S- 77 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCC-C- Confidence 88765554332 3338888999999999999999998888774 58999987 5555555555554322 2 Q ss_pred ccccccceeEEeeeee--EEEeehhhHHHhhcCHHH-HHHHHHHHHHHHHHHHHHHHHHc-C-cccCcCccc---ccccc Q lcl|Aclame:pro 71 PTSKVTWANRTLVAEE--IAVIIPVHENVIDDATVA-VLTEVAELGGQAIGKKLDQAVIF-G-TDKPASWVS---PALIP 142 (305) Q Consensus 71 ~~~~~~f~~v~~~~~k--~~~~~~is~ell~ds~~~-~~~~v~~~la~~~a~~~d~a~l~-G-~g~~~~~~~---~~~~~ 142 (305) .+.-++.+|..-+ +.. ..|-+=---++..+ +-+.+.+++++++++..|+.++. . .+.+....+ .+... T Consensus 78 ---~~~~~k~~itID~ll~a~-~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~ 153 (364) T protein:vir:10 78 ---PTEFDKNRLVVDTTVIAR-NTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVA 153 (364) T ss_pred ---CcccCcEEEEecceeeec-hhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCccc Confidence 2233444444433 221 12222001123455 67889999999999999998852 0 000000000 00111 Q ss_pred ccccc-ccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHHhhc---------cCCceee-ccc Q lcl|Aclame:pro 143 AAVTA-GQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIRD---------ANGNPVF-RDD 209 (305) Q Consensus 143 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~~kd---------~~G~~l~-~~~ 209 (305) ..... ....+..+...+...+.+.+..+...+...+.. .-..+++|..+..|.+-.. .+|.+.. +-. 
T Consensus 154 ~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~ 233 (364) T protein:vir:10 154 GHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVL 233 (364) T ss_pred CCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeE Confidence 11100 011122222334456666666777666655432 2358899999988876321 1111111 113 Q ss_pred ccCccceEecCccccCCCC-----------------ceE--EEEehhhE--EE--------EeecCcEEEEeecceeccC Q lcl|Aclame:pro 210 SFAGFRTFFNRNGAWDADA-----------------AIE--VIADSSRV--KI--------GVRQDITVKFLDQATLGTG 260 (305) Q Consensus 210 ~l~G~pv~~~~~~~~~~~~-----------------~~~--~~gdf~~~--~~--------~~~~~i~v~~~~~~~~~~~ 260 (305) .+.|+||+.++++|...+. +.- ..+||+.. .+ +...++..++.++.. T Consensus 234 ~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~---- 309 (364) T protein:vir:10 234 KSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK---- 309 (364) T ss_pred EEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc---- Confidence 6899999999998742211 101 12444332 12 222333333332211 Q ss_pred cceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 261 ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 261 ~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .|. .-+.+..-+|..+.||++++.++... 
.-+||- T Consensus 310 -----~~~---~~ida~~a~G~g~lRPeaa~~i~~~~--~~~~~~ 344 (364) T protein:vir:10 310 -----EKT---WYIDTFLAEGAIPDRWEAVAVVTAAD--TAELAT 344 (364) T ss_pred -----eee---eeeeeehcccCcccCccceEEEEecC--CCCCcc Confidence 111 12334555788899999999997653 335665 No 142 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.95 E-value=1.1e-10 Score=75.24 Aligned_cols=273 Identities=14% Similarity=0.082 Sum_probs=152.5 Q ss_pred CCCccCC-------ccc---eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MADISRA-------EVA---SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~~t~~-------~gg---~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (305) |+....+ ++- .+.=+.+..++++...+.+.++.+.++.++. +++++||+. +...+.....++..... T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~~l~~~- 84 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGTPIVGD- 84 (332) T ss_pred ccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCCCCCCC- Confidence 4432222 111 1333888999999999999999998877766 568999987 44445444444433211 Q ss_pred cccccccceeEEeeeee--EEEeehhhHHHh-hcCHHHHHHHHHHHHHHHHHHHHHHHHHc----CcccCcCcccccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEE--IAVIIPVHENVI-DDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDKPASWVSPALIP 142 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k--~~~~~~is~ell-~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~~~~~~~~~~~~ 142 (305) .+++-.++++...+ +.. ..|.+ +- .++..++.+.+.++.++++++.+|+.++. +..... +.+... 
T Consensus 85 ---~~~~~~~~~l~ID~~ky~~-~~Vdd-iD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~---~~~~~~ 156 (332) T protein:vir:78 85 ---AGIKANEKTLVMDDLLVSS-QFVYS-LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS---PVTGEP 156 (332) T ss_pred ---CCCCCceEEEEEehhhhhH-HHHHh-HHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccC---cccccc Confidence 11222333444333 222 12222 21 13456899999999999999999987763 211100 000000 Q ss_pred cccccccceeecccchhhhHHHHHHHHHHHHhhhccccce--EEEEchHHHHHHHHhhc----------cCCceeec--- Q lcl|Aclame:pro 143 AAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPD--TLLSSLALRYEVANIRD----------ANGNPVFR--- 207 (305) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~~kd----------~~G~~l~~--- 207 (305) .... .....+...+...+++.+.++...+...+-... .++++|..+..|.+.+| .+|. +.. T Consensus 157 g~~~---~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~-~~~g~~ 232 (332) T protein:vir:78 157 GGFH---VNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGD-MNSGKG 232 (332) T ss_pred cccc---cccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccc-eeccee Confidence 0000 011122233445677778888877776654322 46779999998876433 1221 221 Q ss_pred ccccCccceEecCccccCCCC----------ceEEEEehhhEE--EEeec--------CcEEEEeecceeccCcceeeee Q lcl|Aclame:pro 208 DDSFAGFRTFFNRNGAWDADA----------AIEVIADSSRVK--IGVRQ--------DITVKFLDQATLGTGENQINLA 267 (305) Q Consensus 208 ~~~l~G~pv~~~~~~~~~~~~----------~~~~~gdf~~~~--~~~~~--------~i~v~~~~~~~~~~~~~~~~~~ 267 (305) -..+.|++|+.++++|...+. ...+-|+|+... +.-+. ++.+++.+... 
.-..| T Consensus 233 i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~------~~~~~ 306 (332) T protein:vir:78 233 LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF------NVQYQ 306 (332) T ss_pred eeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhccc------chhhh Confidence 246889999999999854322 224556665521 11112 22222211100 00111 Q ss_pred ecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 268 ERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 268 ~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .-.++....+|..+.||++++.++.+ T Consensus 307 ---~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 307 ---GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred ---HhhhhhhhhhcCceecccceEEEeeC Confidence 23467777899999999999999987 No 143 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.91 E-value=4.4e-10 Score=71.89 Aligned_cols=282 Identities=11% Similarity=0.028 Sum_probs=151.9 Q ss_pred CCCccCCc-------cc--------eEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchh Q lcl|Aclame:pro 1 MADISRAE-------VA--------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESAT 64 (305) Q Consensus 1 Ma~~t~~~-------gg--------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~ 64 (305) ||.+.+.. -| ..| +.+..++.+..++.+.++.++++.++. ++++.||+.. ...+.....++. 
T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG-~~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecc-ceeeeeecCCCC Confidence 77544332 01 244 889999999999999999998876655 5688888764 344454555544 Q ss_pred hcccccccccccceeEEeeee--eEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc-----CcccCcCccc Q lcl|Aclame:pro 65 DPKGVKPTSKVTWANRTLVAE--EIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF-----GTDKPASWVS 137 (305) Q Consensus 65 ~~~~~~~~~~~~f~~v~~~~~--k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~-----G~g~~~~~~~ 137 (305) .... ..+.+..+.++... |+.. ..|.+-=-.++..++.+.+.++.++++++..|+.++. +......... T Consensus 79 l~~~---~~~~~~~e~~ltiD~~~y~~-~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:33 79 LDDK---RKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNEN 154 (347) T ss_pred CCCC---CCCCccceEEEEechhhhhh-HHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 3221 11123344444333 3221 1222211123456788999999999999999998872 1111110000 Q ss_pred cccccccc----ccccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHHhhc-----cCCceee Q lcl|Aclame:pro 138 PALIPAAV----TAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIRD-----ANGNPVF 206 (305) Q Consensus 138 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~~kd-----~~G~~l~ 206 (305) ........ ......+..+...+...+++.+.++...+...+-. .=.++++|..+..|.+... 
..|.-.+ T Consensus 155 ~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~ 234 (347) T protein:vir:33 155 IEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDP 234 (347) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccccc Confidence 00000000 00011111112223345667777777666655432 2358899999998875432 2221111 Q ss_pred cc---cccCccceEecCccccCCCCc------------------eEEEEehhhE--E------E--EeecCcEEEEeecc Q lcl|Aclame:pro 207 RD---DSFAGFRTFFNRNGAWDADAA------------------IEVIADSSRV--K------I--GVRQDITVKFLDQA 255 (305) Q Consensus 207 ~~---~~l~G~pv~~~~~~~~~~~~~------------------~~~~gdf~~~--~------~--~~~~~i~v~~~~~~ 255 (305) .. ..+.|++|+.++++|.....+ ..+-++|+.. + + ....++.++..++. T Consensus 235 ~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~ 314 (347) T protein:vir:33 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred ccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch Confidence 11 368999999999987532211 1122333221 1 1 11222233333322 Q ss_pred eeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 256 TLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) . 
+-.-.++....+|..+.||++.+.++....+- T Consensus 315 ~------------~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 315 N------------YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred h------------hhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 1 11234677788899999999999998876544 No 144 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.88 E-value=4.7e-10 Score=71.70 Aligned_cols=281 Identities=11% Similarity=0.030 Sum_probs=137.1 Q ss_pred CCCcc-----------CCccceEccHHHHHHHHHHHHhhhhhhhhcceeec---CCCceEEEEEeCCCceeeeecchhhc Q lcl|Aclame:pro 1 MADIS-----------RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNM---GTKTTHLPVLATLPEADWVGESATDP 66 (305) Q Consensus 1 Ma~~t-----------~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~---~~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (305) ||.+- ++....++|+.+..++++.+++..++..+++.... .+.++++|+.. .+.+....++.... T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i~ 79 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPVN 79 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCccc Confidence 44332 22234688999999999999999888888765332 35689999864 55666677766543 Q ss_pred ccccccccccceeEEeeeeeE-EEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcc----cCcCccccccc Q lcl|Aclame:pro 67 KGVKPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTD----KPASWVSPALI 141 (305) Q Consensus 67 ~~~~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g----~~~~~~~~~~~ 141 (305) .. +.+..++++...+. ..-..|++.-...+..++.+.+.++++.+++++.|+.++.--. ...+.. .. 
T Consensus 80 ~~-----~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~---~t 151 (381) T protein:vir:80 80 LQ-----ARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRI---YS 151 (381) T ss_pred cc-----ccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---cc Confidence 32 33344445544332 2335777754555566899999999999999999998874211 111000 00 Q ss_pred ccccccccceeecccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhh-----ccCCceeecc---ccc Q lcl|Aclame:pro 142 PAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR-----DANGNPVFRD---DSF 211 (305) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~k-----d~~G~~l~~~---~~l 211 (305) ..............+.......++.+..+...+....- ..-.++++|..+..|.+.. |..+...++. .++ T Consensus 152 ~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i 231 (381) T protein:vir:80 152 YDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTI 231 (381) T ss_pred ccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEE Confidence 00000000111111112223344555566655554432 2226889999999987642 1122222222 479 Q ss_pred CccceEecCccccCCCCceE-EEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecc-cc Q lcl|Aclame:pro 212 AGFRTFFNRNGAWDADAAIE-VIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVS-AT 289 (305) Q Consensus 212 ~G~pv~~~~~~~~~~~~~~~-~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p-~a 289 (305) .|++++.++++|.....+.. .+|-..... ..+.- ..+. ..|.++..++|....+|..+... .. 
T Consensus 232 ~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~----~~~~~-----~~~~------g~~s~~a~av~~~k~yd~~~~~~~~~ 296 (381) T protein:vir:80 232 LGMEVIVTTQIGINSLTGYVNGQGAPTQPT----PGVLG-----SPYL------PDQAGTANVVNTGSASDLAVSLSYFG 296 (381) T ss_pred cceEEEeecccccccccceeeecccccccc----ccccc-----cccc------cccccceeeeeeeeeeceeeeeeecc Confidence 99999999998864322211 111000000 00000 0000 00111122333333333322110 00 Q ss_pred eEEEe-------------c------ccccccc-----CCC Q lcl|Aclame:pro 290 AQGAN-------------K------TPVAVVA-----PAA 305 (305) Q Consensus 290 ~~~~~-------------~------t~~a~v~-----~a~ 305 (305) +-..+ + ..++.|- +|+ T Consensus 297 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (381) T protein:vir:80 297 LPVFSGAGATAADGGQTLGSFGGANRWATAVVCHPDWLAV 336 (381) T ss_pred ceeeecceeeecCCCceeeeehhhhhhhhhcccccccccc Confidence 00000 0 1111111 222 No 145 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.85 E-value=7.8e-10 Score=70.53 Aligned_cols=225 Identities=14% Similarity=0.089 Sum_probs=145.1 Q ss_pred CC-----CccCCc-cceEccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhcccccccc Q lcl|Aclame:pro 1 MA-----DISRAE-VASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) Q Consensus 1 Ma-----~~t~~~-gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~ 73 (305) |+ ..|-.+ ..-+-|......|+|.+.+.++|++..++.... +..+.+.+.++-|.+.|..=++. 
.+.+ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g-----~~~s 75 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYG-----VQPS 75 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCc-----cCcc Confidence 44 445555 444667778889999999999999999998885 44588899999999999665543 5567 Q ss_pred cccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccc--------- Q lcl|Aclame:pro 74 KVTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIP--------- 142 (305) Q Consensus 74 ~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~--------- 142 (305) +.++.+++-..+-+++.+.+.+.+.+... .++...-.....+++++++...||+|+.+..+..-.|+.. T Consensus 76 ~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~ 155 (328) T protein:vir:95 76 KSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGN 155 (328) T ss_pred cceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcccccc Confidence 78999999999999999999998887653 2333444566889999999999999965422111111000 Q ss_pred ----------ccc----------------------ccc------------------------------------------ Q lcl|Aclame:pro 143 ----------AAV----------------------TAG------------------------------------------ 148 (305) Q Consensus 143 ----------~~~----------------------~~~------------------------------------------ 148 (305) +.. 
..+ T Consensus 156 a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvr 235 (328) T protein:vir:95 156 AQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVR 235 (328) T ss_pred ccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEE Confidence 000 000 Q ss_pred --cc-eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhh-ccCCceeec-------ccccCccceE Q lcl|Aclame:pro 149 --QA-VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR-DANGNPVFR-------DDSFAGFRTF 217 (305) Q Consensus 149 --~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~k-d~~G~~l~~-------~~~l~G~pv~ 217 (305) +. +..-.......++.+++..+...+++.......|+||++....|++.. +.....+-. ...+.|.|+. T Consensus 236 I~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir 315 (328) T protein:vir:95 236 IANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIR 315 (328) T ss_pred EecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEE Confidence 00 000001113344666667777777666666778999999999998753 333322221 2357788877 Q ss_pred ecCccccCCCCceEE Q lcl|Aclame:pro 218 FNRNGAWDADAAIEV 232 (305) Q Consensus 218 ~~~~~~~~~~~~~~~ 232 (305) ..+.+..+ +..++ T Consensus 316 ~~dai~~t--E~~vv 328 (328) T protein:vir:95 316 ETDALLET--EARVV 328 (328) T ss_pred EEeeeecC--ccccC Confidence 66654421 11111 No 146 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.81 E-value=2.6e-09 Score=67.68 Aligned_cols=281 Identities=12% Similarity=0.068 Sum_probs=149.4 Q ss_pred CCCccCCc-------cce-------EccHHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhh Q lcl|Aclame:pro 1 MADISRAE-------VAS-------LIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATD 65 (305) Q Consensus 1 
Ma~~t~~~-------gg~-------lip~~~~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 65 (305) ||.+.+.. -|. +.=+.+..++++..++.+.++.+.++.++. ++++.||+... ..+.....+... T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~-~t~~~~~~g~~l 79 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR-TKAAYLKPGENL 79 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc-eeeeeeccCCCC Confidence 88755432 011 222677888999999999999998877755 56888987643 445545555433 Q ss_pred cccccccccccceeEEeeee--eEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcC-----c---ccCcCc Q lcl|Aclame:pro 66 PKGVKPTSKVTWANRTLVAE--EIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFG-----T---DKPASW 135 (305) Q Consensus 66 ~~~~~~~~~~~f~~v~~~~~--k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G-----~---g~~~~~ 135 (305) ... ..+.+..+.++... |+.. ..|.+-=-.++..++.+.+.++.++++++..|+.++.= + .+..+. T Consensus 80 ~~~---~~~~~~~e~~ltID~~~~~~-~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:15 80 DDK---RKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENI 155 (347) T ss_pred CCC---CCCCccceEEEEechhhhhh-HHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 221 11223444444333 3222 12222111234567899999999999999999988721 0 000000 Q ss_pred cc---ccccccccccccceeecccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhhcc-----CCcee Q lcl|Aclame:pro 136 VS---PALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIRDA-----NGNPV 205 (305) Q Consensus 136 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~kd~-----~G~~l 205 (305) .. .++.... ................+++.+.++...+....- ..=.++++|..+..|.+-.+. +|.-. 
T Consensus 156 ~~~g~~~~~~~~--~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~ 233 (347) T protein:vir:15 156 EGLGKPTVLTLV--KPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALID 233 (347) T ss_pred cccCcccccccc--ccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccccc Confidence 00 0000000 000011111122344556666666655555443 222578899999988754321 12111 Q ss_pred ecc---cccCccceEecCccccCCCCc----------eE--------EEEehhhE--------E--EEeecCcEEEEeec Q lcl|Aclame:pro 206 FRD---DSFAGFRTFFNRNGAWDADAA----------IE--------VIADSSRV--------K--IGVRQDITVKFLDQ 254 (305) Q Consensus 206 ~~~---~~l~G~pv~~~~~~~~~~~~~----------~~--------~~gdf~~~--------~--~~~~~~i~v~~~~~ 254 (305) ++. ..++|++|+.++++|...+.. -. +-++|+.. . .+...++.++..++ T Consensus 234 ~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~ 313 (347) T protein:vir:15 234 HERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARR 313 (347) T ss_pred ccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeeccc Confidence 222 468999999999987532211 01 12222211 1 11222333443332 Q ss_pred ceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccc Q lcl|Aclame:pro 255 ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~ 300 (305) .. 
+-.-.++....+|..++||++.+.+.....+- T Consensus 314 ~~------------~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 314 AN------------YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ch------------hhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 21 12234677778899999999999998776544 No 147 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=98.75 E-value=6e-10 Score=71.13 Aligned_cols=267 Identities=13% Similarity=0.063 Sum_probs=154.5 Q ss_pred CCC--ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MAD--ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~--~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) ++. +|.++.-..+|.-+...|-..++.+.++.+..++.++++- -+-+......-.|+. ..+.++..+..+|. T Consensus 117 l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l--~V~~~~dt~~qa~gH----k~G~~K~eq~~tl~ 190 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGAL--LVSRSFDSANEAQVH----KDGQTKTEQAATLT 190 (400) T ss_pred hhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCce--eeecchhhhccccee----ccCCcccceeeeee Confidence 332 3334444578999999999999999999998888887432 221111111122211 12234667777888 Q ss_pred eEEeeeeeEEEeehhhHHHhh--cCHHHHHHHHHHHHHHHHHH-HHHHHHHcCcccCcCcccc----cccccccccccce Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVID--DATVAVLTEVAELGGQAIGK-KLDQAVIFGTDKPASWVSP----ALIPAAVTAGQAV 151 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~--ds~~~~~~~v~~~la~~~a~-~~d~a~l~G~g~~~~~~~~----~~~~~~~~~~~~~ 151 (305) .-++.+.-+..+.++.+-..+ .+.-.|.+||+++|...+-. ..+.+++-|+|+.. +... .+-+.+..+ .. 
T Consensus 191 ~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ng-f~~~dk~t~Ik~I~~dt--~k 267 (400) T protein:vir:93 191 IDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKIT--TK 267 (400) T ss_pred eeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccc-cCCCcchhhhhhhhhhh--hh Confidence 888888877777777433322 23456899999999999995 57999999988642 1000 001111100 00 Q ss_pred eecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc-------ccCccceEe-cCccc Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD-------SFAGFRTFF-NRNGA 223 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~-------~l~G~pv~~-~~~~~ 223 (305) +-.++...+.++... +.....+.....-.++++|..++.|+.+||++|.+.|+.. .-.|+--++ ...++ T Consensus 268 t~~a~~~~~qdl~E~---~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~ 344 (400) T protein:vir:93 268 AKSAGKTPFADAIEE---AVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSK 344 (400) T ss_pred hhhcCCccHHHHHHH---HHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCC Confidence 111222333333333 2323333333445789999999999999999999988532 112322221 22222 Q ss_pred cCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 224 WDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 224 ~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) . ..+.++.|-..++ . -.| ++.. .-..|.+|+-.+.++..+++-+.-|.+-+.++.. 
T Consensus 345 ~---~kp~V~VDek~~i-~-~~~--~~t~----------~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 345 A---LKPTVLVDQKYHI-D-MQD--LTKV----------DAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred C---CCceeeeehhhhc-c-ccC--ceec----------cceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 1 1223334544442 1 111 1111 1112667788889999999999888888777766 No 148 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.73 E-value=2.7e-09 Score=67.57 Aligned_cols=266 Identities=14% Similarity=0.055 Sum_probs=140.0 Q ss_pred CCCccCCccceEccHHH---HHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAY---SDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~---~~~i~~~~~~~~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) ||+...+..-.|.+.+. .+.+-+-+.+-..++...+.+|+. +.++++|++.....+.-|+||+. +|.++.+ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~-----Iplskvt 75 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGET-----IPLSKVT 75 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcc-----cchhhhe Confidence 99876665555663332 344433333333344445788888 45899999988888888999986 6666666 Q ss_pred ce---eEEeeeeeEEEeehhhHHHhhcC-HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee Q lcl|Aclame:pro 77 WA---NRTLVAEEIAVIIPVHENVIDDA-TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE 152 (305) Q Consensus 77 f~---~v~~~~~k~~~~~~is~ell~ds-~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 152 (305) .+ ..+++.+|.+.- +|+|.++.| .-+....-.++|..+++.++|..++.--.+++. .. 
T Consensus 76 ~~~~~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~---------------t~- 137 (295) T protein:vir:99 76 RTKDKDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPT---------------KV- 137 (295) T ss_pred eeeeeeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCce---------------ee- Confidence 54 366777787765 499999644 446788889999999999999999953221110 00 Q ss_pred ecccchhhhHHHHHHHHHHHHhh---hccccceEEEEchHHHHHHHHhhcc--CCceeec-c--cccCccc-eEecCccc Q lcl|Aclame:pro 153 VVGGVANESDIVGATNRAAKAVA---SAGWAPDTLLSSLALRYEVANIRDA--NGNPVFR-D--DSFAGFR-TFFNRNGA 223 (305) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~v~~~~~~~~l~~~kd~--~G~~l~~-~--~~l~G~p-v~~~~~~~ 223 (305) +.+.+...+..+..++. ..+-.....++||.....|++-..- +..-.|- . ..++|+- ++.+..++ T Consensus 138 ------tg~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nfLG~q~II~S~kv~ 211 (295) T protein:vir:99 138 ------KGVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNFLGMQNVIVMPSVP 211 (295) T ss_pred ------ehhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhhhhccceEEEcccCC Confidence 11122222222222222 2222345788899999988754321 1110110 0 1378987 77887775 Q ss_pred cCCCCceEEEEehhhEE---EEee-cCcEEEEeecceeccCcceeeeeecCcEE--EEEEE--EEccE--eecccceEEE Q lcl|Aclame:pro 224 WDADAAIEVIADSSRVK---IGVR-QDITVKFLDQATLGTGENQINLAERDMVA--LRLKA--RFAYV--LGVSATAQGA 293 (305) Q Consensus 224 ~~~~~~~~~~gdf~~~~---~~~~-~~i~v~~~~~~~~~~~~~~~~~~~~~~~~--~r~~~--r~~~~--v~~p~a~~~~ 293 (305) ++.++.---.++. +..+ +++. .--.+..+....--+.+|... +-++. .-++. ..+++++++. 
T Consensus 212 ----~G~~~aT~~~Ni~~ay~~~~~g~l~----~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~ 283 (295) T protein:vir:99 212 ----EGKIYSTAVENLVFASLNVKGGDLG----GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEA 283 (295) T ss_pred ----CceEEEeeccceEEEEecCCchhhh----hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEE Confidence 3333322222221 1111 1111 000011111111111111110 00111 11111 3567888988 Q ss_pred eccccccccCCC Q lcl|Aclame:pro 294 NKTPVAVVAPAA 305 (305) Q Consensus 294 ~~t~~a~v~~a~ 305 (305) +.+.. -.|+. T Consensus 284 tI~~~--~~~~~ 293 (295) T protein:vir:99 284 TIEAA--AVPGI 293 (295) T ss_pred EEecC--cCCCC Confidence 88533 34555 No 149 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.71 E-value=2.3e-09 Score=67.93 Aligned_cols=251 Identities=12% Similarity=0.021 Sum_probs=125.2 Q ss_pred hcceeecCCCceEEEEEeCCCceeeeecchhhccccccccccccee--EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHH Q lcl|Aclame:pro 34 AFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWAN--RTLVAEEIAVIIPVHENVIDDATVAVLTEVAE 111 (305) Q Consensus 34 l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~ 111 (305) +++.+. +++++++|+. +...+....-|++..... 
.+..-.+ ++++..++..+ .|.+-=--++..|+.+...+ T Consensus 1 ~vr~i~-~g~s~~~~~i-G~~~~~~~~~G~~l~~~~---~~~~~~e~~itID~~l~~~~-~VdDiD~~qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVM-GRTKARYLKQGQSLDDGR---EDIKHTEKVITIDGLLTTDV-LIYDIEDAMNHYDVRSEYST 74 (324) T ss_pred Ceeeee-cCceEEEeee-eeeEeccccCCCCcCCCc---CCcCcccEEEEecchhhhhh-hhhhHHHHhcCccchhHHHH Confidence 555554 3668999987 555666555555443211 1112223 34444343222 22221112345689999999 Q ss_pred HHHHHHHHHHHHHHHc----Cc--ccC---cCcccccccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc-- Q lcl|Aclame:pro 112 LGGQAIGKKLDQAVIF----GT--DKP---ASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA-- 180 (305) Q Consensus 112 ~la~~~a~~~d~a~l~----G~--g~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 180 (305) ++++++++.+|+.++. +. .++ .+....+........ ............+++.+.++...+...+.. T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~ 151 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKIT---GKKEDPAKYGTQVIQALTYARAAFAKKYIPAG 151 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccc---ccccccccCHHHHHHHHHHHHHHHhhcCCCCC Confidence 9999999999988752 11 111 111111111111111 111122233445677777777666655432 Q ss_pred ceEEEEchHHHHHHHHhhcc-CCcee----ecc---cccCccceEecCccccCCCCce---------------------E Q lcl|Aclame:pro 181 PDTLLSSLALRYEVANIRDA-NGNPV----FRD---DSFAGFRTFFNRNGAWDADAAI---------------------E 231 (305) Q Consensus 181 ~~~~v~~~~~~~~l~~~kd~-~G~~l----~~~---~~l~G~pv~~~~~~~~~~~~~~---------------------~ 231 (305) .=+++++|..+..|..-+.. ++.+. +.. ..++|++|+.++++|...+... 
- T Consensus 152 gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~k 231 (324) T protein:vir:99 152 DRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGK 231 (324) T ss_pred CCEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccc Confidence 22589999999877543221 11111 111 4689999999999885422110 1 Q ss_pred EEEehhhE----------EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc--ccc Q lcl|Aclame:pro 232 VIADSSRV----------KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT--PVA 299 (305) Q Consensus 232 ~~gdf~~~----------~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t--~~a 299 (305) +-+||+.. ..+...++..+..++. .+-.-.++....+|..+.||++++.++.. .+. T Consensus 232 y~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~------------~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~ 299 (324) T protein:vir:99 232 MTVGADNVVGLFVHRSAVATLKLKDMALERARRP------------EYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETP 299 (324) T ss_pred cccccCceeEEEEehhheEEEeeecceecceech------------hhHHHhhhhhhhhcCcccccceEEEEEEccCccc Confidence 33444322 1122222233332221 11123456667788899999888666532 111 Q ss_pred cccC------------CC Q lcl|Aclame:pro 300 VVAP------------AA 305 (305) Q Consensus 300 ~v~~------------a~ 305 (305) .|+| |+ T Consensus 300 ~~~~~~~~~~~~~~~~~~ 317 (324) T protein:vir:99 300 AVAPDVITGVASFAAPAS 317 (324) T ss_pred cccchhhhhhccccCccc Confidence 1222 22 No 150 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.64 E-value=5.2e-09 Score=65.98 Aligned_cols=293 Identities=11% Similarity=-0.006 Sum_probs=148.3 Q ss_pred CCCccCCccce---------EccHHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCCceeeeecchhhccccc Q lcl|Aclame:pro 1 MADISRAEVAS---------LIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGVK 70 (305) Q Consensus 1 
Ma~~t~~~gg~---------lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (305) |+..++...+. +.=+++..++.+.....+.++++..+.++.+ +++++|+. +..+++...-|+... +.. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ld-g~~ 78 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPN-ATP 78 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccC-CCC Confidence 88765554332 3338888999999999999999988888764 58999987 555566655555432 222 Q ss_pred ccccccceeEEeeeeeE-EEeehhhHHHhhcCHHH-HHHHHHHHHHHHHHHHHHHHHHc-----Ccc--cCcCccccccc Q lcl|Aclame:pro 71 PTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVA-VLTEVAELGGQAIGKKLDQAVIF-----GTD--KPASWVSPALI 141 (305) Q Consensus 71 ~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~-~~~~v~~~la~~~a~~~d~a~l~-----G~g--~~~~~~~~~~~ 141 (305) +.-++..|..-.+ .....|.+=---++..+ +-+.+.+++++++++..|+.++. +.- .+....+.+.. T Consensus 79 ----~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 79 ----TQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred ----cccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 2334444443332 11112221001123455 67889999999999999998763 110 00000011111 Q ss_pred ccccccccceeecccchhhhHHHHHHHHHHHHhhhccccc--eEEEEchHHHHHHHHhhc--------c-CCceeec-cc Q lcl|Aclame:pro 142 PAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAP--DTLLSSLALRYEVANIRD--------A-NGNPVFR-DD 209 (305) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~~kd--------~-~G~~l~~-~~ 209 (305) .. .......+......+...+++.+..+...+...+... -.++++|..+..|.+-.+ + +|.+... -. 
T Consensus 155 ~g-~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~ 233 (402) T protein:vir:97 155 HG-FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) T ss_pred cc-cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeE Confidence 00 1111112222223455666777777776666554332 268899999998875421 1 2222211 14 Q ss_pred ccCccceEecCccccCCCC-----------ceE--EEEehhhE--EEEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 210 SFAGFRTFFNRNGAWDADA-----------AIE--VIADSSRV--KIGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 210 ~l~G~pv~~~~~~~~~~~~-----------~~~--~~gdf~~~--~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .+.|+||+.++++|...+. +.. +-|||+.- ++..++-+ .++..+ .+.+... ..+.|.+ . T Consensus 234 ~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~-vT~~~~~-d~r~~~~---~ 308 (402) T protein:vir:97 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE-VTGDIFY-EKKEKTY---Y 308 (402) T ss_pred EEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeec-cccchhh-chhHHHH---H Confidence 6999999999998853211 111 23565432 22223222 122211 1100000 0000100 1 Q ss_pred EEEEEEEccEeecccceEEEeccc--cccccCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKTP--VAVVAPAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t~--~a~v~~a~ 305 (305) +-+..-+|..+.||+++..++..- ++.++|.- T Consensus 309 id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~ 342 (402) T protein:vir:97 309 IDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) T ss_pred HHHHHHhCCcccCccceEEEEEecccccccCCcc Confidence 222334566677887776664331 22233322 No 151 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.63 E-value=6.8e-09 Score=65.38 Aligned_cols=286 Identities=11% Similarity=-0.025 Sum_probs=143.6 Q ss_pred CCCccCCc-cceEc-cHHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCCceeeeecchhhcccccccccccc Q 
lcl|Aclame:pro 1 MADISRAE-VASLI-QEAYSDTLLAAAKQGSTVLSAFQNVNM-GTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~~t~~~-gg~li-p~~~~~~i~~~~~~~~~l~~l~~~~~~-~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) |+.+..+. +..+| |+.++.+|..-+.+......+.++... .+++++||..... ...--.+.....-.+..+.+ + T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~-tV~dY~~~~~i~~d~ltt~~--~ 77 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTP-VVRSRPEQGDFTFDNLDTGE--I 77 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccccc-ccccccCCCCcccccCCCce--E Confidence 99866544 55666 999999999888887766666654443 3668999876432 22111222221111111111 1 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc--CcccC--cCcccccccccccccccceee Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF--GTDKP--ASWVSPALIPAAVTAGQAVEV 153 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~--G~g~~--~~~~~~~~~~~~~~~~~~~~~ 153 (305) .+.++..|+.++ .++++..+ ...+|.+...++++++++..+|+.+.. -+|.. .........+.. ........ T Consensus 78 -~l~IDq~KYfaf-~VdDD~~Q-a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~-~~~iv~~g 153 (322) T protein:vir:31 78 -SIILRDEVYAGN-AISKKLRQ-DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGV-PHRFVGTG 153 (322) T ss_pred -EEEEehhhhhcc-ccchhHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCC-ccceeccC Confidence 345565566544 58886654 566899999999999999999876632 11110 000000000000 00011111 Q ss_pred cccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHH-------hhccCCceee-----------cccccCc Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVAN-------IRDANGNPVF-----------RDDSFAG 213 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~-------~kd~~G~~l~-----------~~~~l~G 213 (305) .+....+ +.+.++..++....- ..=.++++|.....|.. ++| +|+.. 
.-..+.| T Consensus 154 t~~~~ay----~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D--~rf~~i~~sG~a~g~~~Vg~~~G 227 (322) T protein:vir:31 154 TDQTMDV----TDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNN--PRWEGIVESGIAPDMQFVRSVYG 227 (322) T ss_pred CCchhhH----HHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcc--ccccccccccchhhHHHHHHHhc Confidence 1222333 444444444433321 12257888998887743 233 22211 1257889 Q ss_pred cceEecCccccC-------CCCceEEEEehhhEEEEeecCc-E-EEEeecceeccCcceeeeeecCcEEEEEEEEEccEe Q lcl|Aclame:pro 214 FRTFFNRNGAWD-------ADAAIEVIADSSRVKIGVRQDI-T-VKFLDQATLGTGENQINLAERDMVALRLKARFAYVL 284 (305) Q Consensus 214 ~pv~~~~~~~~~-------~~~~~~~~gdf~~~~~~~~~~i-~-v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v 284 (305) +-|+.++.++.. .+......|-++.+.....-+. . +.-.++.. .+...+.-.+..-..|..+|.|.++ T Consensus 228 F~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~---~~e~~r~~~~~~d~~~~~~~~g~g~ 304 (322) T protein:vir:31 228 IDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMP---TTKSFIDDYNDDLNTATTARWGNGL 304 (322) T ss_pred eeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhh---hhhcccCccccccceeeeeeeccee Confidence 999999987421 0111122222222221100000 0 00000000 0000111112234578999999999 Q ss_pred ecccceEEEeccccccccC Q lcl|Aclame:pro 285 GVSATAQGANKTPVAVVAP 303 (305) Q Consensus 285 ~~p~a~~~~~~t~~a~v~~ 303 (305) .||+..+.+.... 
+++|- T Consensus 305 ~r~e~l~~~~a~~-~~~~~ 322 (322) T protein:vir:31 305 VRDENLVCVLANA-DKVTF 322 (322) T ss_pred ecccceEEEEecc-ccccC Confidence 9999999887653 24444 No 152 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.55 E-value=3.1e-08 Score=61.71 Aligned_cols=274 Identities=12% Similarity=0.056 Sum_probs=156.5 Q ss_pred CCCccCCccceEccH---HHHHHHHHHHHhhhhhhhhcceee-cC--CCceEEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQE---AYSDTLLAAAKQGSTVLSAFQNVN-MG--TKTTHLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~~t~~~gg~lip~---~~~~~i~~~~~~~~~l~~l~~~~~-~~--~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) |.---..++|.++-+ .+...+++...+.-..+++..+.. .+ ..+..+.+.+....+.|++... .++|..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~----~dip~v~ 76 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYT----DDLPLVD 76 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCc----cccceee Confidence 664433345555553 445667776766666666655432 22 2256677777777788877554 2467777 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhcC---HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccce Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDDA---TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) ..+++.....+.++..+.++.+=++.+ ..++..--....++++++++|+.+|+|+..- ...|+++....... 
T Consensus 77 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~---g~~GLlN~p~v~~~-- 151 (296) T protein:vir:10 77 ALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAH---GIPSVFDYPNINNV-- 151 (296) T ss_pred ccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccc---cceeEeecCCCccc-- Confidence 778888888888888888887544433 4578888888899999999999999997542 23344443222111 Q ss_pred eecccchhhhHHHHHHHHHHHHhhh---ccccceEEEEchHHHHHHHHhhccCCceeec-------ccccCccceEecCc Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVAS---AGWAPDTLLSSLALRYEVANIRDANGNPVFR-------DDSFAGFRTFFNRN 221 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~-------~~~l~G~pv~~~~~ 221 (305) .......+...+++++..+..++.. ....+..++++|..+..|...-...|.-++. +..+.+.|.+.... T Consensus 152 ~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a~ 231 (296) T protein:vir:10 152 VSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDYN 231 (296) T ss_pred cccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeeccCC Confidence 1111122333566667666665543 2356778999999999987654444433221 11232333322111 Q ss_pred cccCCCCceEEEEeh--hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc-cEeecccceEEEecccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADS--SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA-YVLGVSATAQGANKTPV 298 (305) Q Consensus 222 ~~~~~~~~~~~~gdf--~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~-~~v~~p~a~~~~~~t~~ 298 (305) ..++..+++.+. ..+-+.....++.- ..+ ...-...+++..|.+ ..+.+|.+++.+++-.. 
T Consensus 232 ---~~g~~~~v~~~~~~~~~~~~v~~~~~~~-----~~e--------~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~ 295 (296) T protein:vir:10 232 ---GTGTSAAIAYEKDPNNMAIEIPEATNAL-----PAQ--------PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITF 295 (296) T ss_pred ---CCcceEEEEEEcCCceEEEEcCcceeee-----ccc--------ccCceEEEeeEeeEEEEEEECCceeEEEeeeec Confidence 112222333222 22222222222211 001 011134567788886 67899999999998666 Q ss_pred c Q lcl|Aclame:pro 299 A 299 (305) Q Consensus 299 a 299 (305) | T Consensus 296 ~ 296 (296) T protein:vir:10 296 A 296 (296) T ss_pred C Confidence 5 No 153 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.55 E-value=1.8e-08 Score=63.11 Aligned_cols=281 Identities=13% Similarity=0.064 Sum_probs=145.4 Q ss_pred CCCcc---------CCccceEccHHHHHHHHHHHHhh-hhhhhhcceeecCCCceEEEEEeCCCceeeeecchhh---cc Q lcl|Aclame:pro 1 MADIS---------RAEVASLIQEAYSDTLLAAAKQG-STVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATD---PK 67 (305) Q Consensus 1 Ma~~t---------~~~gg~lip~~~~~~i~~~~~~~-~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~---~~ 67 (305) |+... ++=....+ +++..++....++. +.|++-++...-.+++-.+-.+. .....-++++... 
.+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLA-SMDPDAVKRKRSRQQSAD 78 (322) T ss_pred CcccceeeeeeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeecc-cccccccccccccccccC Confidence 32211 11122233 66666665555555 44555544333333322211111 1111112222111 11 Q ss_pred c--ccccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccc Q lcl|Aclame:pro 68 G--VKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAV 145 (305) Q Consensus 68 ~--~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~ 145 (305) + +.|.....+...............|.+.-......|+.+...+..+.+++|+.|+.++.+-=.+......+ ..... T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~g-t~v~~ 157 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTG-QPVEF 157 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccc-ccccc Confidence 1 23444444444444444444456777755555677889999999999999999998886421111100000 00111 Q ss_pred ccccceeecccchhhhHHHHHHHHHHHHhhhccccc---eEEEEchHHHHHHHHhhc-----cCC-ceeec---ccccCc Q lcl|Aclame:pro 146 TAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAP---DTLLSSLALRYEVANIRD-----ANG-NPVFR---DDSFAG 213 (305) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~v~~~~~~~~l~~~kd-----~~G-~~l~~---~~~l~G 213 (305) .....+.......+++.+.+ +...+....-.. -.++.+|..+..|..... -.| ..+.. 
..+++| T Consensus 158 ~ss~~i~~g~~g~t~~kl~~----a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lG 233 (322) T protein:vir:10 158 LATQEIGDGTKPISFDYVTE----ITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMG 233 (322) T ss_pred CCCcccccCccchhHHHHHH----HHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeee Confidence 11111222222333444333 333333322221 258889999988765432 111 22322 357999 Q ss_pred cceEecCccccCCC--------------CceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 214 FRTFFNRNGAWDAD--------------AAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) Q Consensus 214 ~pv~~~~~~~~~~~--------------~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 279 (305) +.++.++++|...+ ....+++.-+.+.++.+.+++.++...... .+-..+++..- T Consensus 234 f~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~-----------~~a~~I~~~~~ 302 (322) T protein:vir:10 234 YTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA-----------SFAWRIYSAFT 302 (322) T ss_pred EEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCc-----------chhhhhhhhhh Confidence 99999998873211 224666666777777777777776544421 12234677788 Q ss_pred EccEeecccceEEEeccccc Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKTPVA 299 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t~~a 299 (305) +|..+.+|++++.+.-..+= T Consensus 303 ~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 303 ADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hCceEeccCcEEEEEEeccC Confidence 89999999999999987653 No 154 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.55 E-value=2e-08 Score=62.79 Aligned_cols=269 Identities=10% Similarity=0.016 Sum_probs=140.3 Q ss_pred CCC---------ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCC-ce-EEEEEeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MAD---------ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTK-TT-HLPVLATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 
Ma~---------~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~-~~-~~p~~~~~~~a~~v~E~~~~~~~~ 69 (305) |-. +++++-+....-++.+++-.-+.+-..++...+.+||..| .+ .+|.|+....+.-|+||+. T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~----- 75 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEV----- 75 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcc----- Confidence 321 2222333334455666665555555455556688999855 56 4566887788888999986 Q ss_pred ccccccccee---EEeeeeeEEEeehhhHHHhhcC-HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccc Q lcl|Aclame:pro 70 KPTSKVTWAN---RTLVAEEIAVIIPVHENVIDDA-TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAV 145 (305) Q Consensus 70 ~~~~~~~f~~---v~~~~~k~~~~~~is~ell~ds-~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~ 145 (305) +|.++.+.+. .+++.+|.+..+ |+|.++.| .-+....-.++|..+++++++..++.--.++++ T Consensus 76 Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~----------- 142 (296) T protein:vir:98 76 IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG----------- 142 (296) T ss_pred cchhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc----------- Confidence 6666666543 667777877764 99999644 446788889999999999999999953221110 Q ss_pred ccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeec--c-cccCccceEecCcc Q lcl|Aclame:pro 146 TAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--D-DSFAGFRTFFNRNG 222 (305) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~-~~l~G~pv~~~~~~ 222 (305) ..+ ..+..-...+...+.++.......+-..-..++||.+...+++-..-.-+-.|- - ..++|.-++.+..+ T Consensus 143 ----t~~-~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~fG~tyl~nfLG~~II~S~kV 217 (296) T protein:vir:98 143 ----TQD-ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTVIISTNDV 217 (296) T ss_pred ----eee-echhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccchhheechhhhhhccccEEEEcCcC 
Confidence 000 111111112222334444444444444567888998888776322111111111 0 13778878888877 Q ss_pred ccCCCCceEEEEehhhEEE---EeecCcEEEEeecceeccCcceeeeeecCcEE--EEEEE--EEccE--eecccceEEE Q lcl|Aclame:pro 223 AWDADAAIEVIADSSRVKI---GVRQDITVKFLDQATLGTGENQINLAERDMVA--LRLKA--RFAYV--LGVSATAQGA 293 (305) Q Consensus 223 ~~~~~~~~~~~gdf~~~~~---~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~--~r~~~--r~~~~--v~~p~a~~~~ 293 (305) + ++.++..--.++.+ -.+.+ ++...-.+..+....--+.++... +-++. .-++. ..+++++++. T Consensus 218 ~----~G~~~~T~~~Ni~~ay~~~~~~---~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ 290 (296) T protein:vir:98 218 T----KGEIWATVPENIIFAYINPNNS---ELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKV 290 (296) T ss_pred C----CceEEEeeecceEEEeeccccc---chhhhhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEE Confidence 6 33343333333221 11111 111111111111111111111110 00111 11111 3567889999 Q ss_pred eccccc Q lcl|Aclame:pro 294 NKTPVA 299 (305) Q Consensus 294 ~~t~~a 299 (305) +.+++- T Consensus 291 tI~~~~ 296 (296) T protein:vir:98 291 TLTPGV 296 (296) T ss_pred EecCCC Confidence 987653 No 155 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.54 E-value=1.3e-08 Score=63.80 Aligned_cols=225 Identities=17% Similarity=0.122 Sum_probs=138.0 Q ss_pred CCCc-----cCCc-cceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCCceeeeecchhhcccccccc Q lcl|Aclame:pro 1 MADI-----SRAE-VASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPTS 73 (305) Q Consensus 1 Ma~~-----t~~~-gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~~ 73 (305) |+.+ |-.+ ..-+-|......|+|.+.+.+.|++..++......+ ....+.++-|.+.|..=.+. 
.+.+ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g-----~~~s 75 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGG-----VLPN 75 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCc-----cccc Confidence 6654 3333 233556677788999999999999988886533222 22345577788888543332 4556 Q ss_pred cccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccc--------- Q lcl|Aclame:pro 74 KVTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIP--------- 142 (305) Q Consensus 74 ~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~--------- 142 (305) +.++.+++...+-+++...+-+.+.+... -++.....+...+++.+++...+|+|+.+..+..-.|+.. T Consensus 76 ~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~ 155 (330) T protein:vir:10 76 KSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) T ss_pred cceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCc Confidence 78899999999999999999998876533 2344556677889999999999999965322111111100 Q ss_pred -------------------------------------ccc---ccccceeecc--------------------------- Q lcl|Aclame:pro 143 -------------------------------------AAV---TAGQAVEVVG--------------------------- 155 (305) Q Consensus 143 -------------------------------------~~~---~~~~~~~~~~--------------------------- 155 (305) +.. 
.........+ T Consensus 156 ~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~v 235 (330) T protein:vir:10 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) T ss_pred hhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccE Confidence 000 0000000011 Q ss_pred ------------cchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh-hccCCceeec-------ccccCccc Q lcl|Aclame:pro 156 ------------GVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-RDANGNPVFR-------DDSFAGFR 215 (305) Q Consensus 156 ------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~-kd~~G~~l~~-------~~~l~G~p 215 (305) ......++.+.++.+...+++.......|+||++....|++. .+.++..+-. ...+.|.| T Consensus 236 vRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gip 315 (330) T protein:vir:10 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIP 315 (330) T ss_pred EEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeE Confidence 111233566777777777777766778899999999999875 3333322211 13577888 Q ss_pred eEecCccccCCCCceEE Q lcl|Aclame:pro 216 TFFNRNGAWDADAAIEV 232 (305) Q Consensus 216 v~~~~~~~~~~~~~~~~ 232 (305) +...+.+..+- ..++ T Consensus 316 ir~~Dail~tE--~~vv 330 (330) T protein:vir:10 316 VQRTDALLNTE--SRVV 330 (330) T ss_pred EEEEeeeecCc--cccC Confidence 77666554221 1111 No 156 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.54 E-value=1.3e-08 Score=63.81 Aligned_cols=292 Identities=9% Similarity=-0.020 Sum_probs=149.3 Q ss_pred CCCccCCccc---------eEccHHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCCceeeeecchhhccccc Q lcl|Aclame:pro 1 MADISRAEVA---------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGVK 70 (305) Q Consensus 1 
Ma~~t~~~gg---------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (305) |+.......+ .+.=+.+..++.+...+.+.++++..+.++.+ +++++|+. +..+++...-|+.... . T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~-~- 77 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAA-T- 77 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCC-C- Confidence 8865544422 24448888999999999999999999988875 48899987 6666776666665432 2 Q ss_pred ccccccceeEEeeeeeE-EEeehhhHHHhhcCHHH-HHHHHHHHHHHHHHHHHHHHHHc-----Ccc--cCcCccccccc Q lcl|Aclame:pro 71 PTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVA-VLTEVAELGGQAIGKKLDQAVIF-----GTD--KPASWVSPALI 141 (305) Q Consensus 71 ~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~-~~~~v~~~la~~~a~~~d~a~l~-----G~g--~~~~~~~~~~~ 141 (305) .+..++..|..-.+ .....|.+=---++..+ +.+.+.+++.+++++.+|+.++. |-. ++....+.+.. T Consensus 78 ---~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~ 154 (401) T protein:vir:70 78 ---STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKG 154 (401) T ss_pred ---CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCC Confidence 23344444444332 11112211001123455 67889999999999999987743 210 01111111111 Q ss_pred ccccccccceeecccchhhhHHHHHHHHHHHHhhhccccc--eEEEEchHHHHHHHHh---h--c----cCCceeec-cc Q lcl|Aclame:pro 142 PAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAP--DTLLSSLALRYEVANI---R--D----ANGNPVFR-DD 209 (305) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~~---k--d----~~G~~l~~-~~ 209 (305) .+.... ..........+...+.+.+..+...+...+... -++++.|..+..|..- - | .+|.+... -. 
T Consensus 155 ~G~~i~-v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~ 233 (401) T protein:vir:70 155 HGFSIN-VEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTL 233 (401) T ss_pred CceEEe-ccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEE Confidence 110000 111122223344557777777777766554332 3567777777776542 1 1 12222211 13 Q ss_pred ccCccceEecCccccCCCC-----------ceE--EEEehhhEE--EEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 210 SFAGFRTFFNRNGAWDADA-----------AIE--VIADSSRVK--IGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 210 ~l~G~pv~~~~~~~~~~~~-----------~~~--~~gdf~~~~--~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .+.|+||+.++++|...+. +.. +-|||+.-. +..+.-+ .++..+ .+.+... ..+-|.+ . T Consensus 234 ~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~-lt~~~~~-d~r~~~~---~ 308 (401) T protein:vir:70 234 SSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSID-VTGDIFY-EKKEKTY---Y 308 (401) T ss_pred EEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeec-cccchhh-hhhhhHH---H Confidence 6899999999998753211 222 236665432 2222222 122211 1100000 0000111 1 Q ss_pred EEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) +-+..-+|..+.||+++..++.+-. .++|+. 
T Consensus 309 id~~~a~g~g~~RPeaa~vv~~k~~-~~~~~~ 339 (401) T protein:vir:70 309 IDTFMAEGAIPDRWEAVSVVTTKRN-TTTGAV 339 (401) T ss_pred HHHHHHhCCcccchhheEEEeecCc-cccccc Confidence 2233455777899988887754322 224443 No 157 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.51 E-value=3.4e-08 Score=61.50 Aligned_cols=292 Identities=11% Similarity=0.004 Sum_probs=151.5 Q ss_pred CCCccCCccc---------eEccHHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCCceeeeecchhhccccc Q lcl|Aclame:pro 1 MADISRAEVA---------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGVK 70 (305) Q Consensus 1 Ma~~t~~~gg---------~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (305) |+......-+ .+.=+.+..++.+...+.+.++++..+.++.+ +++++|+. +..+++...-|++..... T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg~~- 78 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATS- 78 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCCCC- Confidence 8765443321 24458888999999999999999999988875 48889886 667777777777643322 Q ss_pred ccccccceeEEeeeeeE-EEeehhhHHHhhcCHHH-HHHHHHHHHHHHHHHHHHHHHHc----C----cccCcCcccccc Q lcl|Aclame:pro 71 PTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVA-VLTEVAELGGQAIGKKLDQAVIF----G----TDKPASWVSPAL 140 (305) Q Consensus 71 ~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~-~~~~v~~~la~~~a~~~d~a~l~----G----~g~~~~~~~~~~ 140 (305) ...++..|..-.+ .....|.+=---++..| +-+.+.+++.+++++.+|+.++. + +..+.+ .+.+. 
T Consensus 79 ----~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~-~~~g~ 153 (400) T protein:vir:10 79 ----TQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRT-NPRVK 153 (400) T ss_pred ----cccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-cCCcc Confidence 2334444443332 12222222101123455 68899999999999999988763 2 111111 11111 Q ss_pred cccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHHHHHhh-----c---c-CCceeec-c Q lcl|Aclame:pro 141 IPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYEVANIR-----D---A-NGNPVFR-D 208 (305) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~~k-----d---~-~G~~l~~-~ 208 (305) ....... ..........+...+.+.+..+...+...+.. --++++.|..+..|..-. | + +|.+... - T Consensus 154 ~~g~s~~-v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v 232 (400) T protein:vir:10 154 GHGFSVN-VEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFV 232 (400) T ss_pred cccccee-ecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceE Confidence 1111100 01112222234455666666666666544322 236777888887775421 2 1 1222211 1 Q ss_pred cccCccceEecCccccCCCC-----------ceE--EEEehhhEE--EEeecCc-EEEEeecceeccCcceeeeeecCcE Q lcl|Aclame:pro 209 DSFAGFRTFFNRNGAWDADA-----------AIE--VIADSSRVK--IGVRQDI-TVKFLDQATLGTGENQINLAERDMV 272 (305) Q Consensus 209 ~~l~G~pv~~~~~~~~~~~~-----------~~~--~~gdf~~~~--~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~ 272 (305) ..+.|+||+.++++|...+. +.. +-|||+.-. +..+.-+ .++..+ .+.+... ..+.| .. 
T Consensus 233 ~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~-lt~~~~~-d~r~~---~~ 307 (400) T protein:vir:10 233 LSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSID-VIGDIFY-EKKEK---TY 307 (400) T ss_pred EEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeec-ccccccc-chhhH---HH Confidence 36899999999998753211 212 336765432 2222222 122211 1100000 00001 11 Q ss_pred EEEEEEEEccEeecccceEEEeccc--ccccc--CCC Q lcl|Aclame:pro 273 ALRLKARFAYVLGVSATAQGANKTP--VAVVA--PAA 305 (305) Q Consensus 273 ~~r~~~r~~~~v~~p~a~~~~~~t~--~a~v~--~a~ 305 (305) .+-+..-+|..+.||+++..++..- ...+. |++ T Consensus 308 ~id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~ 344 (400) T protein:vir:10 308 YIDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAA 344 (400) T ss_pred HHHHHHHhCCcccchhheEEEEecCCcccccccCcch Confidence 1233445677789999888887552 22222 333 No 158 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.51 E-value=3.9e-08 Score=61.23 Aligned_cols=225 Identities=13% Similarity=0.127 Sum_probs=137.7 Q ss_pred CCCc-----cCCccceEc-cHH-HHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADI-----SRAEVASLI-QEA-YSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~-----t~~~gg~li-p~~-~~~~i~~~~~~~~~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |++. |-.+....+ |.. +...|+|.+.+.++|++..+++....++ ....+.++-|.+.|..=++. .+. 
T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g-----~~~ 75 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYG-----VQP 75 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCc-----cCc Confidence 6654 333332222 433 4567999999999999999988654333 45667788899999554443 456 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccc-------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIP-------- 142 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~-------- 142 (305) ++.++.+++...+-+++.+.+.+.+.+... .++...-.+...+++.+++...||+|+.+..+..-.|+.. T Consensus 76 s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~ 155 (331) T protein:vir:98 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) T ss_pred ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccc Confidence 778899999999999999999998887643 2344445667888999999999999974311100000000 Q ss_pred -------ccc--------------------------ccc----------------------------------------- Q lcl|Aclame:pro 143 -------AAV--------------------------TAG----------------------------------------- 148 (305) Q Consensus 143 -------~~~--------------------------~~~----------------------------------------- 148 (305) ... 
..+ T Consensus 156 ~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ 235 (331) T protein:vir:98 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) T ss_pred cccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEE Confidence 000 000 Q ss_pred ---cc--eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh-hccCCceeec--------ccccCcc Q lcl|Aclame:pro 149 ---QA--VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-RDANGNPVFR--------DDSFAGF 214 (305) Q Consensus 149 ---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~-kd~~G~~l~~--------~~~l~G~ 214 (305) +. ........+..++.+++..+...+++.......|+||++....|++. .+......+. ...+.|. T Consensus 236 ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gi 315 (331) T protein:vir:98 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) T ss_pred EEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCe Confidence 00 00000011234566777777777776666667899999999999875 3332222221 1357788 Q ss_pred ceEecCccccCCCCceEE Q lcl|Aclame:pro 215 RTFFNRNGAWDADAAIEV 232 (305) Q Consensus 215 pv~~~~~~~~~~~~~~~~ 232 (305) |+...+.+..+ +..++ T Consensus 316 pir~~dai~~t--E~~Vv 331 (331) T protein:vir:98 316 PCRRTDALLLT--EARVV 331 (331) T ss_pred eEEEeeeeecC--ccccC Confidence 87766654431 11111 No 159 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.51 E-value=3.9e-08 Score=61.23 Aligned_cols=225 Identities=13% Similarity=0.127 Sum_probs=137.7 Q ss_pred CCCc-----cCCccceEc-cHH-HHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADI-----SRAEVASLI-QEA-YSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 
Ma~~-----t~~~gg~li-p~~-~~~~i~~~~~~~~~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |++. |-.+....+ |.. +...|+|.+.+.++|++..+++....++ ....+.++-|.+.|..=++. .+. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g-----~~~ 75 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYG-----VQP 75 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCc-----cCc Confidence 6654 333332222 433 4567999999999999999988654333 45667788899999554443 456 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccc-------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIP-------- 142 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~-------- 142 (305) ++.++.+++...+-+++.+.+.+.+.+... .++...-.+...+++.+++...||+|+.+..+..-.|+.. T Consensus 76 s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~ 155 (331) T protein:vir:10 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) T ss_pred ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccc Confidence 778899999999999999999998887643 2344445667888999999999999974311100000000 Q ss_pred -------ccc--------------------------ccc----------------------------------------- Q lcl|Aclame:pro 143 -------AAV--------------------------TAG----------------------------------------- 148 (305) Q Consensus 143 -------~~~--------------------------~~~----------------------------------------- 148 (305) ... 
..+ T Consensus 156 ~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ 235 (331) T protein:vir:10 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) T ss_pred cccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEE Confidence 000 000 Q ss_pred ---cc--eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh-hccCCceeec--------ccccCcc Q lcl|Aclame:pro 149 ---QA--VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-RDANGNPVFR--------DDSFAGF 214 (305) Q Consensus 149 ---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~-kd~~G~~l~~--------~~~l~G~ 214 (305) +. ........+..++.+++..+...+++.......|+||++....|++. .+......+. ...+.|. T Consensus 236 ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gi 315 (331) T protein:vir:10 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) T ss_pred EEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCe Confidence 00 00000011234566777777777776666667899999999999875 3332222221 1357788 Q ss_pred ceEecCccccCCCCceEE Q lcl|Aclame:pro 215 RTFFNRNGAWDADAAIEV 232 (305) Q Consensus 215 pv~~~~~~~~~~~~~~~~ 232 (305) |+...+.+..+ +..++ T Consensus 316 pir~~dai~~t--E~~Vv 331 (331) T protein:vir:10 316 PCRRTDALLLT--EARVV 331 (331) T ss_pred eEEEeeeeecC--ccccC Confidence 87766654431 11111 No 160 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.51 E-value=3.9e-08 Score=61.23 Aligned_cols=225 Identities=13% Similarity=0.127 Sum_probs=137.7 Q ss_pred CCCc-----cCCccceEc-cHH-HHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADI-----SRAEVASLI-QEA-YSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 
Ma~~-----t~~~gg~li-p~~-~~~~i~~~~~~~~~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |++. |-.+....+ |.. +...|+|.+.+.++|++..+++....++ ....+.++-|.+.|..=++. .+. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g-----~~~ 75 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYG-----VQP 75 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCc-----cCc Confidence 6654 333332222 433 4567999999999999999988654333 45667788899999554443 456 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccc-------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIP-------- 142 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~-------- 142 (305) ++.++.+++...+-+++.+.+.+.+.+... .++...-.+...+++.+++...||+|+.+..+..-.|+.. T Consensus 76 s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~ 155 (331) T protein:vir:10 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) T ss_pred ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccc Confidence 778899999999999999999998887643 2344445667888999999999999974311100000000 Q ss_pred -------ccc--------------------------ccc----------------------------------------- Q lcl|Aclame:pro 143 -------AAV--------------------------TAG----------------------------------------- 148 (305) Q Consensus 143 -------~~~--------------------------~~~----------------------------------------- 148 (305) ... 
..+ T Consensus 156 ~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ 235 (331) T protein:vir:10 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) T ss_pred cccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEE Confidence 000 000 Q ss_pred ---cc--eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh-hccCCceeec--------ccccCcc Q lcl|Aclame:pro 149 ---QA--VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-RDANGNPVFR--------DDSFAGF 214 (305) Q Consensus 149 ---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~-kd~~G~~l~~--------~~~l~G~ 214 (305) +. ........+..++.+++..+...+++.......|+||++....|++. .+......+. ...+.|. T Consensus 236 ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gi 315 (331) T protein:vir:10 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) T ss_pred EEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCe Confidence 00 00000011234566777777777776666667899999999999875 3332222221 1357788 Q ss_pred ceEecCccccCCCCceEE Q lcl|Aclame:pro 215 RTFFNRNGAWDADAAIEV 232 (305) Q Consensus 215 pv~~~~~~~~~~~~~~~~ 232 (305) |+...+.+..+ +..++ T Consensus 316 pir~~dai~~t--E~~Vv 331 (331) T protein:vir:10 316 PCRRTDALLLT--EARVV 331 (331) T ss_pred eEEEeeeeecC--ccccC Confidence 87766654431 11111 No 161 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.50 E-value=6.4e-08 Score=60.02 Aligned_cols=273 Identities=7% Similarity=-0.004 Sum_probs=151.9 Q ss_pred CCCc--cCCccceEcc---HHHHHHHHHHHHhhhhhhhhcceee-cC--CCceEEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADI--SRAEVASLIQ---EAYSDTLLAAAKQGSTVLSAFQNVN-MG--TKTTHLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 
Ma~~--t~~~gg~lip---~~~~~~i~~~~~~~~~l~~l~~~~~-~~--~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |... +..+.|.+.. +.+...+++...+.-..+++..+.. .+ ..+..+.+.+....+.|++.... ++|. T Consensus 21 ~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~----dip~ 96 (319) T protein:vir:10 21 AGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTD----DLPL 96 (319) T ss_pred ccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccceeeecCccc----cccc Confidence 2221 1122344544 3444567777777777777766542 22 22566666666677888876542 4666 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhc---CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccc Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDD---ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQ 149 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~d---s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~ 149 (305) .+..++......+.++..+.++..=++. ...++..--....++++++++|+.+|+|+..- ...|+++....... T Consensus 97 v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~---g~~GLlN~p~~~~~ 173 (319) T protein:vir:10 97 VDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPH---KIVSVFNHPNITKI 173 (319) T ss_pred eeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccc---cceeEEeCCCceee Confidence 6777777788888888888887743332 34577888888999999999999999997532 23444443332211 Q ss_pred cee-ec-ccchhhhHHHHHHHHHHHHhhh---ccccceEEEEchHHHHHHHHhhccCCceeec-------ccccCccceE Q lcl|Aclame:pro 150 AVE-VV-GGVANESDIVGATNRAAKAVAS---AGWAPDTLLSSLALRYEVANIRDANGNPVFR-------DDSFAGFRTF 217 (305) Q Consensus 150 ~~~-~~-~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~-------~~~l~G~pv~ 217 (305) ... .. ....+.+.+++++..+..++.. ....+..++++|..+..|.......|..++. 
+..+.+.|.+ T Consensus 174 ~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel 253 (319) T protein:vir:10 174 TSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAEL 253 (319) T ss_pred ecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeee Confidence 111 11 1112445677777777776653 2345778999999999997555444533321 1223333333 Q ss_pred ecCccccCCCCceEEEEehh-hE-EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc-cEeecccceEEEe Q lcl|Aclame:pro 218 FNRNGAWDADAAIEVIADSS-RV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA-YVLGVSATAQGAN 294 (305) Q Consensus 218 ~~~~~~~~~~~~~~~~gdf~-~~-~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~-~~v~~p~a~~~~~ 294 (305) .... ..++..+++...+ ++ -+.....++.. ..+. ..-...+.+..|++ ..+.+|.+++.++ T Consensus 254 ~~ag---~~g~~~~v~y~~~~~~~~~~v~~~~~~~-----~~e~--------~~l~~~~~~~~r~~Gv~i~~P~ai~~~d 317 (319) T protein:vir:10 254 EDID---GAGTKGVLVYEKNPMNMSIEIPEAFNML-----PAQP--------KDLHFKVPCTSKCTGLTIYRPMTIVLIT 317 (319) T ss_pred cccC---CCcceEEEEEecCCceEEEecCcceeee-----eeee--------cCceEEEeeeeeeEEEEEEccceeEeee Confidence 2211 1112222222221 11 12222222110 0000 01123455677776 4478899999999 Q ss_pred cc Q lcl|Aclame:pro 295 KT 296 (305) Q Consensus 295 ~t 296 (305) +- T Consensus 318 GI 319 (319) T protein:vir:10 318 GV 319 (319) T ss_pred cC Confidence 97 No 162 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.40 E-value=1.7e-07 Score=57.65 Aligned_cols=270 Identities=14% Similarity=0.095 Sum_probs=151.3 Q ss_pred CccCCccceEcc--HHHHHHHHHHHHhhhhhhhhccee-ecC--CCceEEEEEeCCCceeeeecchhhcccccccccccc Q lcl|Aclame:pro 3 DISRAEVASLIQ--EAYSDTLLAAAKQGSTVLSAFQNV-NMG--TKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 3 ~~t~~~gg~lip--~~~~~~i~~~~~~~~~l~~l~~~~-~~~--~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) -.+.++|.++.. +.+.+.+++.+.+.-..+++..+. 
+++ .....+.+.+....+.|++.+.. ++|..+..+ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~----dip~~~~~~ 76 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGAD----DLPLVDVDM 76 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccc----ccccccccc Confidence 333444443321 445577888888887788776553 222 33566666666677788776553 466667777 Q ss_pred eeEEeeeeeEEEeehhhHHHhhc---CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccc--ee Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDD---ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQA--VE 152 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~d---s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~--~~ 152 (305) +......+.++.-+.++..=++. ...++..--....++++++++|+.+|+|+..- ...|+++........ .+ T Consensus 77 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~---g~~GLlN~p~~~~~~~~~~ 153 (301) T protein:vir:80 77 VRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKY---AIKGAFEATGIQIDVSPTT 153 (301) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccc---cceeeecCCCcccccccCc Confidence 78888888888888888743333 34578888888999999999999999997642 234444433221110 01 Q ss_pred ecc-----cchhhhHHHHHHHHHHHHhhh---ccccceEEEEchHHHHHHHHhh--ccCCceeec-------ccccCccc Q lcl|Aclame:pro 153 VVG-----GVANESDIVGATNRAAKAVAS---AGWAPDTLLSSLALRYEVANIR--DANGNPVFR-------DDSFAGFR 215 (305) Q Consensus 153 ~~~-----~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~~k--d~~G~~l~~-------~~~l~G~p 215 (305) ... ...+.+.+++++..+..++.. ....+..++++|..+..|...+ +..|..++. 
...+...| T Consensus 154 ~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 154 GVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred ccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcc Confidence 111 112445667777777776643 2235678999999999997543 344433321 11233333 Q ss_pred eEecCccccCCCCceEE-EEe-hhhEEEEeecCcEEEEeecceeccCcceeeeeecC-cEEEEEEEEEc-cEeecccceE Q lcl|Aclame:pro 216 TFFNRNGAWDADAAIEV-IAD-SSRVKIGVRQDITVKFLDQATLGTGENQINLAERD-MVALRLKARFA-YVLGVSATAQ 291 (305) Q Consensus 216 v~~~~~~~~~~~~~~~~-~gd-f~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~-~~~~r~~~r~~-~~v~~p~a~~ 291 (305) .+.... ..++..++ +.+ ...+.+...+.++. .. .-.++ ...+.+..|++ ..+.+|.+++ T Consensus 234 ~L~~~g---~~g~~~~v~~~~~~d~~~~~v~~~~~~--------~~------~e~~~~~~~~~~~~r~~Gv~i~~P~ai~ 296 (301) T protein:vir:80 234 DLAGMG---TAGSDSFAVIHDSNETAELIIPMDITR--------HP------EEYSFPRTKVPFEERTAGVVVRFPAAIV 296 (301) T ss_pred eeccCC---CCcccEEEEEecCCcEEEEEecCceee--------ec------ceecCceeEeeeeeeeEEEEEEccceEE Confidence 332211 01122221 111 11111222122111 00 01122 23345567774 5678899999 Q ss_pred EEecc Q lcl|Aclame:pro 292 GANKT 296 (305) Q Consensus 292 ~~~~t 296 (305) .+++- T Consensus 297 ~~~GI 301 (301) T protein:vir:80 297 RVDGI 301 (301) T ss_pred EEecC Confidence 99997 No 163 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.34 E-value=9.2e-08 Score=59.17 Aligned_cols=274 Identities=10% Similarity=0.010 Sum_probs=152.3 Q ss_pred CCCccCCccceEccH---HHHHHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQE---AYSDTLLAAAKQGSTVLSAFQNVNMG---TKTTHLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~~t~~~gg~lip~---~~~~~i~~~~~~~~~l~~l~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) |-..+-..+|.++-. 
.+...|++.....-..+++..+..-. ..+..+...+....+.|++.... ++|..+ T Consensus 19 ~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~----dip~vd 94 (314) T protein:vir:10 19 MGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSD----DLPLVD 94 (314) T ss_pred hcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccceeeeCCccc----ccceee Confidence 433333334555543 44456777666665555655443211 22566767777777888776542 467777 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhcC---HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccce Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDDA---TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) ..+++.....+.++..+.++..=++.+ ..++..--....++++++.+|+.+|+|+..- ...|+++........ T Consensus 95 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~---g~~GLlN~p~v~~~~- 170 (314) T protein:vir:10 95 AFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPH---GIVSVFDQPNINNVV- 170 (314) T ss_pred cccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccc---cceeEeecCCCcccc- Confidence 778888888888888888876433322 4567888888899999999999999997532 234444433222111 Q ss_pred eecccchhhhHHHHHHHHHHHHhhhc---cccceEEEEchHHHHHHHHhhccCCceeec-------ccccCccceEecCc Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVASA---GWAPDTLLSSLALRYEVANIRDANGNPVFR-------DDSFAGFRTFFNRN 221 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~-------~~~l~G~pv~~~~~ 221 (305) ..... .+.+.+++++..+..++... ...+..++++|..+..|...-+..|.-++. +-.+.+.|-+.+.. 
T Consensus 171 ~~~~W-aT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~ag 249 (314) T protein:vir:10 171 ATPNW-SVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTRNNPGLTIRFLQFLDNYD 249 (314) T ss_pred CCCCc-ccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHHhCCCcEEEEcccccccC Confidence 11122 35567778888877777642 245678999999998886544444433321 12233333322211 Q ss_pred cccCCCCceEEEEeh--hhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc-cEeecccceEEEecccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADS--SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA-YVLGVSATAQGANKTPV 298 (305) Q Consensus 222 ~~~~~~~~~~~~gdf--~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~-~~v~~p~a~~~~~~t~~ 298 (305) ..++..+++-.- ..+.+.....++. ... .. ..-...+.+..|++ ..+.+|.+++.+++-.. T Consensus 250 ---~~g~~~~v~y~~~~~~~~~~vp~~~~~--------l~~--e~---~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~ 313 (314) T protein:vir:10 250 ---GAGGKAALAFEKSPLNMSIEIPEVTNV--------LPA--QP---KDLHFRYPVTSKATGLIVYRPLTMAVIKGITF 313 (314) T ss_pred ---CCcceEEEEEecCCcEEEEecCcccee--------ecc--ee---cCceEEEcceeeeEEEEEECcceeEeeeeeec Confidence 011111221111 1111111111111 000 00 01123445667775 55788999999998766 Q ss_pred c Q lcl|Aclame:pro 299 A 299 (305) Q Consensus 299 a 299 (305) | T Consensus 314 ~ 314 (314) T protein:vir:10 314 A 314 (314) T ss_pred C Confidence 5 No 164 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.34 E-value=2.3e-07 Score=56.98 Aligned_cols=276 Identities=9% Similarity=0.057 Sum_probs=152.8 Q ss_pred CCCccC--CccceEcc---HHHHHHHHHHHHhhhhhhhhcceee-cC--CCceEEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADISR--AEVASLIQ---EAYSDTLLAAAKQGSTVLSAFQNVN-MG--TKTTHLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t~--~~gg~lip---~~~~~~i~~~~~~~~~l~~l~~~~~-~~--~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |+..+. .+.+.++- +.+...|++...+.-..+++..+.. 
.+ ..+..+.+.+....+.|++.... ++|. T Consensus 26 ~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~----dip~ 101 (329) T protein:vir:79 26 LRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTD----DLST 101 (329) T ss_pred cccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeeeeecCccc----ccce Confidence 333222 22344444 3455778887777777777765532 22 23566777777777888775432 4666 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhc---CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccc Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDD---ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQ 149 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~d---s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~ 149 (305) .+..+++.....+.++..+.++..=++. ...++..--....++++++++|+.+|+|++.- ...|+++.-..... T Consensus 102 vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~---g~~GLlN~p~v~~~ 178 (329) T protein:vir:79 102 VDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPH---KIISVFEHPNLTTI 178 (329) T ss_pred eecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccc---cceeeecCCCcccc Confidence 6677777777777888888887633332 24578888888899999999999999997632 22344443222211 Q ss_pred cee----ecccchhhhHHHHHHHHHHHHhhhc--c-ccceEEEEchHHHHHHHHhhccCCceeec-------ccccCccc Q lcl|Aclame:pro 150 AVE----VVGGVANESDIVGATNRAAKAVASA--G-WAPDTLLSSLALRYEVANIRDANGNPVFR-------DDSFAGFR 215 (305) Q Consensus 150 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~v~~~~~~~~l~~~kd~~G~~l~~-------~~~l~G~p 215 (305) ... ..-...+.+.+++++..+..++... + ..+..++++|..+..|.......|.-++. 
+-.+.+.| T Consensus 179 ~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk~~~~~l~I~~~~ 258 (329) T protein:vir:79 179 NSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLDYFKQQNGGITIESIS 258 (329) T ss_pred ccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHHHHHHhCCCcEEEEcc Confidence 100 0011124456677777777766542 2 34678999999999886544444543322 11233333 Q ss_pred eEecCccccCCCCceEEEEehhh--EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc-cEeecccceEE Q lcl|Aclame:pro 216 TFFNRNGAWDADAAIEVIADSSR--VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA-YVLGVSATAQG 292 (305) Q Consensus 216 v~~~~~~~~~~~~~~~~~gdf~~--~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~-~~v~~p~a~~~ 292 (305) -+.+. ...+...+++-+.+. +-+.....++. . .. . ...-...+.+..|++ ..+.+|.+++. T Consensus 259 el~~a---g~~g~~~~v~y~~~~~~~~~~vp~~~~~--l------~~--q---~~~~~~~v~~~~r~~Gv~i~~P~ai~~ 322 (329) T protein:vir:79 259 ELEDI---DGAGTKAALVYEKDPMNMSIEIPEAFNM--L------TA--Q---PKDLHFKVPCTSKCTGLTIYRPLTLVL 322 (329) T ss_pred ccccc---CCCCceEEEEEecCCceEEEecCcceee--e------ec--e---ecCceEEEceeeeEEEEEEECcceeee Confidence 22111 011222233222221 11221111111 0 00 0 001123445667776 45788999999 Q ss_pred Eeccccc Q lcl|Aclame:pro 293 ANKTPVA 299 (305) Q Consensus 293 ~~~t~~a 299 (305) +++-.++ T Consensus 323 ~dGI~~~ 329 (329) T protein:vir:79 323 IKGLVVG 329 (329) T ss_pred eeeeeeC Confidence 9997666 No 165 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.31 E-value=7.9e-08 Score=59.52 Aligned_cols=273 Identities=10% Similarity=0.008 Sum_probs=142.7 Q ss_pred CCCc----cCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCC-ce---EEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADI----SRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTK-TT---HLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~----t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~-~~---~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) |+.- 
++.+-+....-++.+++-.-+.+-..++...+.+||..+ .+ ++|+++....+.-|+||+. +|. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~-----Ipl 75 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDV-----IPL 75 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcc-----cch Confidence 7642 223334445566667776666665556666688898855 34 4555566677888999986 666 Q ss_pred ccccce---eEEeeeeeEEEeehhhHHHhhcC-HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccc Q lcl|Aclame:pro 73 SKVTWA---NRTLVAEEIAVIIPVHENVIDDA-TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG 148 (305) Q Consensus 73 ~~~~f~---~v~~~~~k~~~~~~is~ell~ds-~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~ 148 (305) ++.+.+ ..+++.+|.+..+ |.|.++.+ .-+....-.++|.++++.+++..|+.--.++++ T Consensus 76 skvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~-------------- 139 (303) T protein:vir:10 76 TKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIE-------------- 139 (303) T ss_pred hhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccc-------------- Confidence 666643 4677788887755 99999644 446778889999999999999999853211110 Q ss_pred cceeecccchhhhHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhhcc--C----CceeecccccCccceEecC Q lcl|Aclame:pro 149 QAVEVVGGVANESDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIRDA--N----GNPVFRDDSFAGFRTFFNR 220 (305) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~kd~--~----G~~l~~~~~l~G~pv~~~~ 220 (305) ....+.......+.+.+.+.....++.. ......++++||.+...+++-..- + |--+++ .++|.-++.+. 
T Consensus 140 t~~~t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~--nfLG~~II~S~ 217 (303) T protein:vir:10 140 NGKRTNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLT--PYVGVKIVEFA 217 (303) T ss_pred ccccccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhh--hhhcceEEEec Confidence 0001111122333344444333322211 122345788999999988743211 1 111112 38888888888 Q ss_pred ccccCCCCceEEEEehhhE---EEEeecCcEEEEeecceeccCcceeeeeecCcEE--EEEEEE--EccE--eecccceE Q lcl|Aclame:pro 221 NGAWDADAAIEVIADSSRV---KIGVRQDITVKFLDQATLGTGENQINLAERDMVA--LRLKAR--FAYV--LGVSATAQ 291 (305) Q Consensus 221 ~~~~~~~~~~~~~gdf~~~---~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~--~r~~~r--~~~~--v~~p~a~~ 291 (305) .++. +.++.---.++ +...++++. ..-.+..+....--+.++... +-++.. -++. ..++++++ T Consensus 218 kv~~----G~~~~T~~~Ni~~ay~~~~g~l~----~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv 289 (303) T protein:vir:10 218 DVPQ----GEVWMTVAENLNVAYANPRGELS----RAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVI 289 (303) T ss_pred cCCC----ceEEEeeccceEEEEecCchhhh----hhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEE Confidence 7763 33332222222 111121111 000111111111111111110 001111 1111 35678889 Q ss_pred EEecccc-ccccCC Q lcl|Aclame:pro 292 GANKTPV-AVVAPA 304 (305) Q Consensus 292 ~~~~t~~-a~v~~a 304 (305) +.+.++. 
+.-.|+ T Consensus 290 ~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 290 KVTIKKDEAGELPS 303 (303) T ss_pred EEEEeccccCCCCC Confidence 8888633 344566 No 166 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.31 E-value=1.2e-07 Score=58.49 Aligned_cols=226 Identities=13% Similarity=0.059 Sum_probs=129.8 Q ss_pred CCCccC-----Cc-cceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCCceeeeecchhhcccccccc Q lcl|Aclame:pro 1 MADISR-----AE-VASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPTS 73 (305) Q Consensus 1 Ma~~t~-----~~-gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~~ 73 (305) |+.... .+ ..-+-|......|+|.+.+.+.|++..++......+ ....+.++-|.+.|..=.+. .+.+ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g-----~~~s 75 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQG-----VQPT 75 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCc-----cccc Confidence 666533 33 222445666777999999999999988886533222 22345677788888543332 4556 Q ss_pred cccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccc-------- Q lcl|Aclame:pro 74 KVTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPA-------- 143 (305) Q Consensus 74 ~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~-------- 143 (305) +.++.+++...+-+++...|-+.+.+... -++...-.+...+++.+++...+|+|+.+..+..-.|+... 
T Consensus 76 ~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~ 155 (335) T protein:vir:73 76 KTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSK 155 (335) T ss_pred cceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccc Confidence 78899999999999999999998776543 23445555668899999999999999653322111111000 Q ss_pred ----------ccccc----------------------------------------------------------------- Q lcl|Aclame:pro 144 ----------AVTAG----------------------------------------------------------------- 148 (305) Q Consensus 144 ----------~~~~~----------------------------------------------------------------- 148 (305) ....+ T Consensus 156 a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~ 235 (335) T protein:vir:73 156 AASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRS 235 (335) T ss_pred cCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCccc Confidence 00000 Q ss_pred -----cceeec--ccchhhhHHHHHHHHHHH--HhhhccccceEEEEchHHHHHHHHhhccCCceeecc--------ccc Q lcl|Aclame:pro 149 -----QAVEVV--GGVANESDIVGATNRAAK--AVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD--------DSF 211 (305) Q Consensus 149 -----~~~~~~--~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~--------~~l 211 (305) +...+. .......++.+++..+.. .+++.......|+||++....|++..-..++.-+.. ..+ T Consensus 236 vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~ 315 (335) T protein:vir:73 236 ISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSF 315 (335) T ss_pred EEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEE Confidence 000000 000022334444444432 233334444789999999999987532223322221 246 Q ss_pred CccceEecCccccCCCCceE Q lcl|Aclame:pro 212 AGFRTFFNRNGAWDADAAIE 231 (305) Q Consensus 212 ~G~pv~~~~~~~~~~~~~~~ 231 (305) .|.|+...+.+..+-...+. 
T Consensus 316 ~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 316 LGIPIRRVDAILNTESAVTA 335 (335) T ss_pred CCeEEEEEeeeecCcccccC Confidence 67777766654422111111 No 167 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.16 E-value=6e-07 Score=54.67 Aligned_cols=274 Identities=12% Similarity=0.062 Sum_probs=142.9 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhh-hhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhcccccccccccc Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQG-STVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~-~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) +|- -+|+|.+.++-......+++.-+.. ..+++.|++.+++ ....+..+..+.++..-|.|+.+..-++..+ T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e----- 433 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGD----- 433 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecC----- Confidence 333 3677777766555555554443333 3466777776654 3333445556778888899999887665543 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHH---cCcccCcCcccccccccccccccceeec Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVI---FGTDKPASWVSPALIPAAVTAGQAVEVV 154 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l---~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (305) +.-++...++|..+.||++++-.-..+...-|-..++++.++.+++.++ .++.+-. ..+..+-.-+.. ++..+. 
T Consensus 434 ~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~-~DGk~LF~hA~H-~Nl~~~- 510 (652) T protein:vir:79 434 KQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKIS-TDNVSLFDKAKH-ANVLES- 510 (652) T ss_pred ccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccc-cCCceeeccccc-cccccc- Confidence 3345677899999999999875557788899999999999998886655 2321100 011111101111 111111 Q ss_pred ccchhhhHHHHHHHHHHH-Hhhh---ccccceEEEEchHHHHHHHHhhccCCce---e--ecccccCcc-ceEecCcccc Q lcl|Aclame:pro 155 GGVANESDIVGATNRAAK-AVAS---AGWAPDTLLSSLALRYEVANIRDANGNP---V--FRDDSFAGF-RTFFNRNGAW 224 (305) Q Consensus 155 ~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~v~~~~~~~~l~~~kd~~G~~---l--~~~~~l~G~-pv~~~~~~~~ 224 (305) +....+. +.....++. +-.. -...+..|++.+......+++..+...+ . ....++.|. .+++...+.. T Consensus 511 -aa~~~~~-l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~~ 588 (652) T protein:vir:79 511 -AAMDVAS-LDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLDD 588 (652) T ss_pred -ccCCHHH-HHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCcccccccccccccccccccccccccCC Confidence 1112222 222222222 2111 1235667888888777666654221111 0 001123333 2232222211 Q ss_pred CCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcce----eeeeecCcEEEEEEEEEccEeecccceEEEec Q lcl|Aclame:pro 225 DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQ----INLAERDMVALRLKARFAYVLGVSATAQGANK 295 (305) Q Consensus 225 ~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~----~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~ 295 (305) . ....-++++-.. 
.-+|++ ++++..+.+ -.-|..+-+.+|+...+|.++.|-.+++|.+- T Consensus 589 ~-s~~~wylaa~~~-------~dtiev---~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 589 N-SQTTFYLAASKG-------SDTIEV---AYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred C-CcccEEEecCCC-------CCeEEE---EEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 1 111122221111 001111 222221111 11277888889999999999999999998887 No 168 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.02 E-value=3e-06 Score=50.84 Aligned_cols=275 Identities=11% Similarity=0.010 Sum_probs=143.5 Q ss_pred CCCccCC---ccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCcee-eeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRA---EVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEAD-WVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~~t~~---~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~-~v~E~~~~~~~~~~~~~~~ 76 (305) ||..+.+ ......-+++.++|...-....|+..+.......+...++...+-...+. -..||...+...... ... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~-r~~ 79 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSF-TTM 79 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccC-CEE Confidence 8754322 22345678888888888888899998877666655556665543222211 122554322222110 011 Q ss_pred ceeEEeeeeeEEEeehhhHH--HhhcCH-HHHHHHHHHHHHHHHHHHHHHHHHcCccc-----C-cCccccccccccccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHEN--VIDDAT-VAVLTEVAELGGQAIGKKLDQAVIFGTDK-----P-ASWVSPALIPAAVTA 147 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~e--ll~ds~-~~~~~~v~~~la~~~a~~~d~a~l~G~g~-----~-~~~~~~~~~~~~~~~ 147 (305) +.+++= =+...+.+|.- ...... .+...|-..+-..++.+.+|.++|+|.-. . ..-...|+..-.... 
T Consensus 80 ~~N~tQ---If~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~ 156 (317) T protein:vir:88 80 LNNYCQ---ISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTN 156 (317) T ss_pred eccEEE---EEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccC Confidence 111111 11222344443 222211 23333333444456889999999998532 1 111122222111110 Q ss_pred c--------------ccee-ecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceee--cccc Q lcl|Aclame:pro 148 G--------------QAVE-VVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVF--RDDS 210 (305) Q Consensus 148 ~--------------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~--~~~~ 210 (305) . ...+ ......+ .+++.++..++...+..+..+++++.....+.++-..++.++. .... T Consensus 157 ~~~~~~g~~~~~~~~~~~t~~t~~~lt----e~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~ 232 (317) T protein:vir:88 157 GSLGANGVAPVGDGSNTGTAGDLRLLT----EDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDN 232 (317) T ss_pred ceeccCccccccCCCcccccccccccc----HHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCe Confidence 0 0000 0111112 3345556666677777778899999999999888433444442 2222 Q ss_pred cCc------------cceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEE Q lcl|Aclame:pro 211 FAG------------FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKA 278 (305) Q Consensus 211 l~G------------~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 278 (305) ..| +.++.+.+++ .+.+++.|++++-+..-+.+..+.. -.++ +........ 
T Consensus 233 ~~g~~v~~~~tdfG~v~ii~~r~lp----~~~~~~~D~~~~~l~~Lr~~~~e~l----aKtG---------d~~k~~i~~ 295 (317) T protein:vir:88 233 RIAQTVDVYESDFGKYTIRANRWFH----ENTLFVFDPKMHSLCYLRPFFQHEL----AKTG---------DSEKRQLLV 295 (317) T ss_pred EEEEEEEEEEeCCeEEEEEeCCCCC----CCeEEEEcccccceeecccceeecc----CCCc---------ccceeEEEE Confidence 222 2445555554 5678888988876554344333211 1111 233456677 Q ss_pred EEccEeecccceEEEecccccc Q lcl|Aclame:pro 279 RFAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 279 r~~~~v~~p~a~~~~~~t~~a~ 300 (305) .++..+.+|++..++....+.. T Consensus 296 E~tLe~~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 296 EYTFRVNNEKSGALIRDVVAQL 317 (317) T ss_pred EEEEEEcCccceeEEEEecccC Confidence 8899999999999999887665 No 169 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.95 E-value=5.4e-06 Score=49.48 Aligned_cols=275 Identities=12% Similarity=0.014 Sum_probs=117.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhccee---e---cCCCceEEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV---N---MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~---~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) ||. ..++|+.+..++++.+++..++..++..- . -.+++++||+.... 
.+.+...........+...+ T Consensus 1 Ma~------~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRGAGAERNLTVSD 73 (392) T ss_pred Ccc------ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccc-cceeeeccccccCCcccccc Confidence 884 34899999999999999999988887432 2 13557899875432 22222111111111222223 Q ss_pred ccceeEEeeee-eEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCc-ccCcCccccccccccccccccee Q lcl|Aclame:pro 75 VTWANRTLVAE-EIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT-DKPASWVSPALIPAAVTAGQAVE 152 (305) Q Consensus 75 ~~f~~v~~~~~-k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~-g~~~~~~~~~~~~~~~~~~~~~~ 152 (305) .+-.++++... ..+.-+.++++-......++.+.+.++..++++.++|.-++.-- +.+.. . . T Consensus 74 ~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~-----~----~------- 137 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE-----A----A------- 137 (392) T ss_pred cccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----c----c------- Confidence 33344444442 23344567776555557788888888899999999998876311 11000 0 0 Q ss_pred ecccchhhhHHHHHHHHHHHHhhhcccc-ceEEEEchHHHHHHHHhh-----ccCC---ceeecc---cccCccceEecC Q lcl|Aclame:pro 153 VVGGVANESDIVGATNRAAKAVASAGWA-PDTLLSSLALRYEVANIR-----DANG---NPVFRD---DSFAGFRTFFNR 220 (305) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~~k-----d~~G---~~l~~~---~~l~G~pv~~~~ 220 (305) ...........++.+..+...+...... .-.++++|..+..|.+.. +..| ...++. .++.|++++.+. 
T Consensus 138 ~~~~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~ 217 (392) T protein:vir:99 138 GAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVEST 217 (392) T ss_pred ccccccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeec Confidence 0001111222334444444444433211 126788999988886431 1111 112222 468899999888 Q ss_pred ccccCCCCceEEEEehhhEEEEee-----------------cCcEE--EEeecceeccCcceeeeeecCcEEEEEEEEEc Q lcl|Aclame:pro 221 NGAWDADAAIEVIADSSRVKIGVR-----------------QDITV--KFLDQATLGTGENQINLAERDMVALRLKARFA 281 (305) Q Consensus 221 ~~~~~~~~~~~~~gdf~~~~~~~~-----------------~~i~v--~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~ 281 (305) +++.... +.+..+...+..+ ..+.. ....+.+...+......+. .- .......+ T Consensus 218 ~~~~~t~----~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~-g~--~~v~~~~~ 290 (392) T protein:vir:99 218 LIPHGDA----YLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYF-GL--KVVEDPNG 290 (392) T ss_pred ccccccc----eeeeccccccccccccccccccceeEEecccceecceeecccceeeccccccceeE-EE--EEEeeccc Confidence 7764321 1111111100000 00110 0101111111000000000 00 00000000 Q ss_pred cEeecccceEEEecc-ccccccCCC Q lcl|Aclame:pro 282 YVLGVSATAQGANKT-PVAVVAPAA 305 (305) Q Consensus 282 ~~v~~p~a~~~~~~t-~~a~v~~a~ 305 (305) -.......+.....+ ...++.++. 
T Consensus 291 ~~~~~~~~~~~~~~~v~v~~v~~~~ 315 (392) T protein:vir:99 291 VGFVRARKIHLIPGSIEVAPEAGAN 315 (392) T ss_pred cceeeeeeeeeecceeeeeeeeccc Confidence 000000000000000 000111111 No 170 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.92 E-value=3.4e-06 Score=50.59 Aligned_cols=277 Identities=13% Similarity=0.065 Sum_probs=142.2 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhh-hhhhhhcceeecC-CCceEEEEEeCCCceeeeecchhhcccccccccccc Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQG-STVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~-~~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) ||- -||+|.+.++-......+++.-+.. ..+++.|++..++ ....+.....+-++..-|.|+.+..-++..+.. T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~--- 470 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERG--- 470 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCcc--- Confidence 443 4677777666555544444332222 3456666655543 223333345566777788898877655543322 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcC-cccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFG-TDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G-~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) -++...++|..+.||++++-.-..+...-|-..++++.++.+++.++.= .+++.-..+..+-. +. 
-++..+.+.+ T Consensus 471 --e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFh-ad-H~Nl~tga~s 546 (693) T protein:vir:95 471 --EQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFH-AD-HSNLLTGAAS 546 (693) T ss_pred --ceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceee-cc-cccccccccc Confidence 3456678999999999987665778889999999999999988765521 11111101111111 11 1121222222 Q ss_pred chhhhHHHHHHHHHHHHhh--------hccccceEEEEchHHHHHHHHhhccCCceee-----cccccCccc-eEecCcc Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVA--------SAGWAPDTLLSSLALRYEVANIRDANGNPVF-----RDDSFAGFR-TFFNRNG 222 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~-----~~~~l~G~p-v~~~~~~ 222 (305) ..+.+.+......+..+-. .-...+.-|++++......+++..+...+-- ...++.|+. +++...+ T Consensus 547 als~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL 626 (693) T protein:vir:95 547 ALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGADVNSGIVNPIRAFAQVIGEPRL 626 (693) T ss_pred ccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccccccccccchhcccccccccee Confidence 3333333333222322210 1234567788888888777766533221110 012244432 2222222 Q ss_pred ccCCCCceEEEEehhh--EEEEe---ecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 223 AWDADAAIEVIADSSR--VKIGV---RQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 223 ~~~~~~~~~~~gdf~~--~~~~~---~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) ....+..=.++.|... +-+++ .++..++. . 
..|..+-+.+|+.-.+|.++.|-.+++|-.++ T Consensus 627 ~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~--------~----~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 627 DDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQ--------Q----EGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred cCCCCCceEEecCCCCCeEEEEEecCCCCCeEee--------c----CCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 1111111122222211 11111 12222221 1 12778888999999999999998888887766 No 171 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.77 E-value=1.2e-05 Score=47.59 Aligned_cols=287 Identities=10% Similarity=0.046 Sum_probs=130.2 Q ss_pred CCCccCCc--cceEccHHHHHHHHHHHHhhhhhhhhccee---------ecCCCceEEEEEeCC-Cceeeeecchhhccc Q lcl|Aclame:pro 1 MADISRAE--VASLIQEAYSDTLLAAAKQGSTVLSAFQNV---------NMGTKTTHLPVLATL-PEADWVGESATDPKG 68 (305) Q Consensus 1 Ma~~t~~~--gg~lip~~~~~~i~~~~~~~~~l~~l~~~~---------~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~ 68 (305) |+.....+ .-.++|+.+..-+.+...+.+.|++-.=+. ..++...++|.+..- ....-+.+..... T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~-- 78 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV-- 78 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcc-- Confidence 88644211 225778888766776666666655532222 235667899998532 2222222222111 Q ss_pred ccccccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc---C---cccCcCcccc---c Q lcl|Aclame:pro 69 VKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF---G---TDKPASWVSP---A 139 (305) Q Consensus 69 ~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~---G---~g~~~~~~~~---~ 139 (305) +.+..+.+-++-.-..+..+.-+..++-.-.-+..|..+.|.+++++-..+...+.+|. | +......... 
+ T Consensus 79 ~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~ 158 (367) T protein:vir:80 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) T ss_pred cccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhh Confidence 12222222222111222233333333322223445788999999998777777665553 2 1100000000 0 Q ss_pred c-----cccccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh------hccCCceeecc Q lcl|Aclame:pro 140 L-----IPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI------RDANGNPVFRD 208 (305) Q Consensus 140 ~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~------kd~~G~~l~~~ 208 (305) . ..........++...+.....--.+.+.++...+-.....-+.++||+.++..|+++ ++++|. ..- T Consensus 159 ~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~~--~~i 236 (367) T protein:vir:80 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ--LTI 236 (367) T ss_pred ccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCCc--ccc Confidence 0 000000000111111100001113345555555555555677899999999999875 456653 223 Q ss_pred cccCccceEecCccccCCCC-c----eEEEEehhhEEEEe--ecC-cEEEEeecceeccCcceeeeeecCcEEEEEEEEE Q lcl|Aclame:pro 209 DSFAGFRTFFNRNGAWDADA-A----IEVIADSSRVKIGV--RQD-ITVKFLDQATLGTGENQINLAERDMVALRLKARF 280 (305) Q Consensus 209 ~~l~G~pv~~~~~~~~~~~~-~----~~~~gdf~~~~~~~--~~~-i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~ 280 (305) ++++|++|++++.||..... . ..+||. ..++. ... ..+++.++.....+. 
++-.+....| T Consensus 237 ~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~---GAi~~~~~~~~~~~E~~Rd~~~~~~g--------G~d~L~~Rr~- 304 (367) T protein:vir:80 237 PTYMGKVVIVDDGMPVFGTGADKTYLSILFGG---AAFGYADGAPQVPVAVGRRELRGNGS--------GLEYILERKE- 304 (367) T ss_pred ceecceeEEEeCCCcccccCCCceEEEEEEec---ceeeecccCCccceecccchhhhcCC--------ceEEEEeeee- Confidence 67899999999999975321 1 123333 12221 111 223444444321111 1112222222 Q ss_pred ccEeecccceEEEecccccc---ccCCC Q lcl|Aclame:pro 281 AYVLGVSATAQGANKTPVAV---VAPAA 305 (305) Q Consensus 281 ~~~v~~p~a~~~~~~t~~a~---v~~a~ 305 (305) .+.+|.++.....+-+++ .+|+. T Consensus 305 --~~~hP~G~s~~~~~v~~~~~~~~~~~ 330 (367) T protein:vir:80 305 --WIVHPGGFNWLDADVTIPDNTGSPSG 330 (367) T ss_pred --EEeecceeeecccccccccccccccc Confidence 255777766554321111 11111 No 172 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.73 E-value=8.1e-06 Score=48.49 Aligned_cols=278 Identities=7% Similarity=-0.001 Sum_probs=146.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchhhcccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) |..+..+.-.+...+.+...|++...+.-..+.+..+.+.+. 
.++.+++.+....+.+.+.+...+ .+..+..+ T Consensus 46 ~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~P---l~~~~v~~ 122 (339) T protein:vir:94 46 LQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANG---MSKANVNF 122 (339) T ss_pred cccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCC---ccccccee Confidence 221111111123455556778888888888888888877653 367888888888888887665422 22234567 Q ss_pred eeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccc-cceeec Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG-QAVEVV 154 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~-~~~~~~ 154 (305) .+.++.....+-.+. ..|+..- ...++..--.+..++++.+++|+..++|+..- ...|+++.-.... ...... T Consensus 123 ~~~~v~~~~~g~~y~-~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~---~~~GLlN~P~l~~~v~~s~~ 198 (339) T protein:vir:94 123 ESRQNYRYQTWTEYG-DLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGI---ANYGLMNDPSLPAPVAATVN 198 (339) T ss_pred eEEeEEEEEEEEeec-HHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeeccc---ceEEEEeCCCccccccCCCC Confidence 666666666555443 3344433 34578888888899999999999999997432 2344443211111 010111 Q ss_pred ccchhhhHHHHHHHHHHHHhhhccc------cceEEEEchHHHHHHHHhhccCCceeec--ccccCccceEecCccccCC Q lcl|Aclame:pro 155 GGVANESDIVGATNRAAKAVASAGW------APDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~~~ 226 (305) =...+.+.+++++..+..++..... .+..+++.+..+..|.. ++..|.-++. .....++.++-........ 
T Consensus 199 Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~-~n~~~~Tvl~~lk~n~pnl~i~~~~el~~a~ 277 (339) T protein:vir:94 199 WATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNR-TNNFGLSAGAKIAQTYPNIQFVAVPEFDTAS 277 (339) T ss_pred cccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhccc-CCcCCccHHHHHHHhcCCcEEEEccccccCC Confidence 1123456677888877777654422 24478999999998864 3444433321 1111122222211111111 Q ss_pred CCce-EEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecccceEEEecc Q lcl|Aclame:pro 227 DAAI-EVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVSATAQGANKT 296 (305) Q Consensus 227 ~~~~-~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p~a~~~~~~t 296 (305) ++.. ++.-+.. ......+.+-....... .. ...-...+.+..|.++. +.+|.+++.+++- T Consensus 278 g~~~~~~~~~~~-----~~~~~~~~~p~~~~~lp--vq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 278 GRLVQLWVPEVN-----GQPTGEVAFAEKLRSHS--IE---RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CceEEEEEEecc-----CCcceEEEcchhhhccc--cE---EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 1111 1111110 00111121111111000 00 11123455677786655 6789999999987 No 173 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.51 E-value=5.3e-05 Score=44.02 Aligned_cols=281 Identities=10% Similarity=0.088 Sum_probs=121.6 Q ss_pred CCCccCCccceEccH--HHHHHHHHHHHhhhhhhhhcce---------eecCCCceEEEEEeCC-Cceee-eecchhhcc Q lcl|Aclame:pro 1 MADISRAEVASLIQE--AYSDTLLAAAKQGSTVLSAFQN---------VNMGTKTTHLPVLATL-PEADW-VGESATDPK 67 (305) Q Consensus 1 Ma~~t~~~gg~lip~--~~~~~i~~~~~~~~~l~~l~~~---------~~~~~~~~~~p~~~~~-~~a~~-v~E~~~~~~ 67 (305) ||.+.-+| ..+|+ .+..-+.+.-.+.+.|.+-.=+ ...++..+++|.+..- .++.. +.... .. 
T Consensus 1 Ma~T~l~D--~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt--~~ 76 (349) T protein:vir:94 1 MAITTIGN--IVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDV--YQ 76 (349) T ss_pred CCceEEee--eeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCC--cc Confidence 99766555 35676 3555555555555666552211 1234667889988542 22110 11100 00 Q ss_pred ccccccccc-ceeEEeeeeeEEE--eehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccc--c Q lcl|Aclame:pro 68 GVKPTSKVT-WANRTLVAEEIAV--IIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALI--P 142 (305) Q Consensus 68 ~~~~~~~~~-f~~v~~~~~k~~~--~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~--~ 142 (305) +..+..+.+ ..++....+.--+ .-.++.++ +..+..+.|.+++++-..+...+.+|.=- .+.+..... . T Consensus 77 ~~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~l---sG~dpm~~Ia~~va~yW~r~~q~~Lia~L---~Gvf~~~~~~~~ 150 (349) T protein:vir:94 77 DIATPRAIQTGEMMARVAYLNEGFGQADLTVEL---TSQNPLQSVASRLDNFWQRQAQRRLIATA---LGLYNDNVSATD 150 (349) T ss_pred cccccccccccceeeeeeeeccccchhHHHHHh---hCchHHHHHHHHHHHHHhhHHHHHHHHHH---Hhhhcccccccc Confidence 111112222 2222222222111 12344443 33477899999999988887776665300 000110000 0 Q ss_pred cccccccceee--cccchhhhHHHHHHHHHHHHh-hhccccceEEEEchHHHHHHHHh------hccCCceeecccccCc Q lcl|Aclame:pro 143 AAVTAGQAVEV--VGGVANESDIVGATNRAAKAV-ASAGWAPDTLLSSLALRYEVANI------RDANGNPVFRDDSFAG 213 (305) Q Consensus 143 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~~l~~~------kd~~G~~l~~~~~l~G 213 (305) ........... .........+.+....+-.+. ....-..+.++||+.++..|++. 
++++|..-+ ++++| T Consensus 151 ~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i--~ty~G 228 (349) T protein:vir:94 151 AYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMF--ATYQG 228 (349) T ss_pred cccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccCccc--ceecC Confidence 00000000111 111122223333333322221 11223457899999999999875 345554222 67899 Q ss_pred cceEecCccccCCCCc-----eEEEEehhhEEEEeec-CcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecc Q lcl|Aclame:pro 214 FRTFFNRNGAWDADAA-----IEVIADSSRVKIGVRQ-DITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVS 287 (305) Q Consensus 214 ~pv~~~~~~~~~~~~~-----~~~~gdf~~~~~~~~~-~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p 287 (305) ++|++++.||...+.. ..+||. ..+.++... ...+++.++....... .+-.+..+.|+ +.+| T Consensus 229 ~~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~E~~rd~~~g~~~--------G~d~L~~R~~~---~~hp 296 (349) T protein:vir:94 229 YRVIVDDSMTVVGQDTSRKFISIIFGQ-GAIGYGEGNPEMPLEYEREASRANGG--------GVETLWTRKTW---LLHP 296 (349) T ss_pred cEEEEeCCCccccCCCCceEEEEEeec-ceEEeecCCCCcceeeecccccCCcc--------eeEEEEEeeEE---Eeee Confidence 9999999998654321 123442 111122211 1234444443211111 11222233332 3466 Q ss_pred cceEEEeccccc------cccCC-------C Q lcl|Aclame:pro 288 ATAQGANKTPVA------VVAPA-------A 305 (305) Q Consensus 288 ~a~~~~~~t~~a------~v~~a-------~ 305 (305) .++.......+. ...|. 
+ T Consensus 297 ~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:94 297 FGYSFTSAVITGNGTETIARSASWQDLANAA 327 (349) T ss_pred eeeeecccccCCCccccccCCCChHHhcCCc Confidence 665554421110 01111 1 No 174 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.48 E-value=5.8e-05 Score=43.80 Aligned_cols=274 Identities=11% Similarity=0.110 Sum_probs=120.1 Q ss_pred CCCccCCccceEccH--HHHHHHHHHHHhhhhhhhhcce---------eecCCCceEEEEEeCC-Cce--eeeecchhhc Q lcl|Aclame:pro 1 MADISRAEVASLIQE--AYSDTLLAAAKQGSTVLSAFQN---------VNMGTKTTHLPVLATL-PEA--DWVGESATDP 66 (305) Q Consensus 1 Ma~~t~~~gg~lip~--~~~~~i~~~~~~~~~l~~l~~~---------~~~~~~~~~~p~~~~~-~~a--~~v~E~~~~~ 66 (305) ||.+.-+| ..+|+ .+..-+.++-.+.+.|.+-.=+ ...++..+++|.+..- ..+ .+.+.+. T Consensus 1 Ma~T~l~D--~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~--- 75 (349) T protein:vir:78 1 MAITTIGD--IVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVY--- 75 (349) T ss_pred CCceEEee--eeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCc--- Confidence 99766555 35676 3555555555555655552111 2234667889998532 211 1111110 Q ss_pred ccccccccc-cceeEEeeeeeEEEee---hhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc---CcccCcCccccc Q lcl|Aclame:pro 67 KGVKPTSKV-TWANRTLVAEEIAVII---PVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF---GTDKPASWVSPA 139 (305) Q Consensus 67 ~~~~~~~~~-~f~~v~~~~~k~~~~~---~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~---G~g~~~~~~~~~ 139 (305) .+..+..+. +..++....+. +.-+ .++.++ +..+..+.|.+++++-..+...+.+|. |- +... 
T Consensus 76 ~~~~t~~kitt~~~~a~~~~r-~kaw~~~Dla~~l---sG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gv------f~~~ 145 (349) T protein:vir:78 76 QDIATPRAIQTGEMMARVAYL-NEGFGQADLTVEL---TSQNPLQSVASRLDNFWQRQAQRRLIATALGL------YNDN 145 (349) T ss_pred ccccccccccccceeeeeeee-ccccchhHHHHHh---hCchHHHHHHHHHHHHHhhHHHHHHHHHHHHh------hccc Confidence 011111222 22223222222 2222 334443 344778999999998888777665553 21 1000 Q ss_pred cc--cccccccc-cee-ecccchhhhHHHHHHHHHHHHh-hhccccceEEEEchHHHHHHHHh------hccCCceeecc Q lcl|Aclame:pro 140 LI--PAAVTAGQ-AVE-VVGGVANESDIVGATNRAAKAV-ASAGWAPDTLLSSLALRYEVANI------RDANGNPVFRD 208 (305) Q Consensus 140 ~~--~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~~l~~~------kd~~G~~l~~~ 208 (305) .. ........ ... ......+...+.+....+-... ....-..+.++||+.++..|++. ++++|..- - T Consensus 146 ~~a~~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~--i 223 (349) T protein:vir:78 146 VSATDAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTM--F 223 (349) T ss_pred ccccchhhhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcccCcc--c Confidence 00 00000000 000 0111112223333322222221 11233457899999999999865 34554322 2 Q ss_pred cccCccceEecCccccCCCC-c----eEEEEehhhEEEEeecC---cEEEEeecceeccCcceeeeeecCcEEEEEEEEE Q lcl|Aclame:pro 209 DSFAGFRTFFNRNGAWDADA-A----IEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARF 280 (305) Q Consensus 209 ~~l~G~pv~~~~~~~~~~~~-~----~~~~gdf~~~~~~~~~~---i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~ 280 (305) ++++|++|++++.+|..... . ..+||. 
..++...+ ..++..++.....+ ..+-.+..+.|+ T Consensus 224 ~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~---GAi~~~~~~~~~~~et~rd~~~g~~--------~G~d~l~~R~~~ 292 (349) T protein:vir:78 224 ATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQ---GAIGYGEGNPVMPLEYEREASRANG--------GGVETLWTRKTW 292 (349) T ss_pred ceecCeEEEEeCCCccccCCCCceEEEEEeec---ceEEEccCCCccceeeecccccCCc--------ceeEEEEEeeEE Confidence 67899999999999875432 1 234442 22222111 23444444321111 112223333333 Q ss_pred ccEeecccceEEEeccccc------cccCC-------C Q lcl|Aclame:pro 281 AYVLGVSATAQGANKTPVA------VVAPA-------A 305 (305) Q Consensus 281 ~~~v~~p~a~~~~~~t~~a------~v~~a-------~ 305 (305) +.+|.++.......+. ...|. + T Consensus 293 ---~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:78 293 ---LLHPFGYRFTSAVITGNGTETIARSASWQDLANAT 327 (349) T ss_pred ---EeeeeeeeeccccccCCccccccCCCChHHhcCCc Confidence 3466655554322110 01111 1 No 175 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.14 E-value=1.7e-05 Score=46.77 Aligned_cols=274 Identities=9% Similarity=-0.027 Sum_probs=139.6 Q ss_pred CCC-----ccCCccceEccHHHH----HHHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchhhccc Q lcl|Aclame:pro 1 MAD-----ISRAEVASLIQEAYS----DTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKG 68 (305) Q Consensus 1 Ma~-----~t~~~gg~lip~~~~----~~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (305) .|. +++ .+...+|..+. ..+++.+.......++..+..++. ....+++.+....+.+.+-+. 
T Consensus 34 da~d~~~~~~~-~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~----- 107 (336) T protein:vir:10 34 DAADLSPHLSS-TGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYS----- 107 (336) T ss_pred hhhhccCcccc-CCCchhHHHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccC----- Confidence 010 111 11223454332 334455555555666666655432 245566666566666665443 Q ss_pred ccccccccceeEEeeeeeEEEeehhhH-HHhhcC--HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccc Q lcl|Aclame:pro 69 VKPTSKVTWANRTLVAEEIAVIIPVHE-NVIDDA--TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAV 145 (305) Q Consensus 69 ~~~~~~~~f~~v~~~~~k~~~~~~is~-ell~ds--~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~ 145 (305) +.|..+...+..+-+.+.++....++. |+.+.. ..++..--....++++.+++|+-.+.|+..- ...|.++.-. T Consensus 108 D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~---~~yGllN~P~ 184 (336) T protein:vir:10 108 SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLINDPS 184 (336) T ss_pred CCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEecccc---ceEEEEeCCC Confidence 346666555555666777888888884 554432 4567788888888999999999889997642 2234443221 Q ss_pred cc-ccceeecc-cchhhhHHHHHHHHHHHHhhhcc------ccceEEEEchHHHHHHHHhhccCCceeec--ccccCccc Q lcl|Aclame:pro 146 TA-GQAVEVVG-GVANESDIVGATNRAAKAVASAG------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFR 215 (305) Q Consensus 146 ~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~p 215 (305) .. .....+.. ...+.+.+++++..+..++.... ..+..+++.+..+..|.+ ++..|.-++. .....++. 
T Consensus 185 l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n~Pnl~ 263 (336) T protein:vir:10 185 LSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFPKLE 263 (336) T ss_pred CccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHHHHHhcCccE Confidence 11 11111111 12234667788877777766533 247789999999888854 3333433321 11121222 Q ss_pred eEecCccccCCCCceEEEEehhhEEEEeecC---cEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecccceE Q lcl|Aclame:pro 216 TFFNRNGAWDADAAIEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVSATAQ 291 (305) Q Consensus 216 v~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~---i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p~a~~ 291 (305) ++-........+... .+++....+ ..+.+......... . ...-.....+..|.++. +.+|.+++ T Consensus 264 i~t~pEl~~a~G~~~-------~l~~~~~~~~~t~~~~~p~~~~~l~v--q---~~~~~~~v~~~~rt~Gv~i~~P~ai~ 331 (336) T protein:vir:10 264 FVTIPEYDTASGRLV-------QLWAPRVEGKDTATCGFTEKMRAHSI--E---RYSSYFRQKKSAGTWGAVIFRPFAVA 331 (336) T ss_pred EEEccccccCCCceE-------EEEEEecCCCcceeeecchhhhccce--e---ecCceeEeccccceeeeeeeccchhe Confidence 221111111112111 122211111 11111111100000 0 11123456677788766 57799999 Q ss_pred EEecc Q lcl|Aclame:pro 292 GANKT 296 (305) Q Consensus 292 ~~~~t 296 (305) ++++- T Consensus 332 ~~~GI 336 (336) T protein:vir:10 332 QMIGV 336 (336) T ss_pred eeecC Confidence 99987 No 176 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.12 E-value=0.00016 Score=41.34 Aligned_cols=274 Identities=5% Similarity=-0.063 Sum_probs=113.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcc-------eeecCCCceEEEEEeCCC----ceeeeecchhhcccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQ-------NVNMGTKTTHLPVLATLP----EADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~-------~~~~~~~~~~~p~~~~~~----~a~~v~E~~~~~~~~ 69 (305) ||- 
+|-- +..+.+....+|++.+.......+. -.+..++-+..|.+..-. +...+.+.. + T Consensus 1 m~l---sD~~-vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~-----~ 71 (325) T protein:vir:95 1 MAL---SDLA-VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSG-----T 71 (325) T ss_pred Cch---hhhh-hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCc-----e Confidence 543 3332 2466666777777776544444322 123345556677765321 111122221 1 Q ss_pred ccccc-ccceeEEeeeeeEEEeehhhHHHh---hcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccc Q lcl|Aclame:pro 70 KPTSK-VTWANRTLVAEEIAVIIPVHENVI---DDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAV 145 (305) Q Consensus 70 ~~~~~-~~f~~v~~~~~k~~~~~~is~ell---~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~ 145 (305) +...+ .+...+....+.-.+......+.+ .+....+.+.|.+++++...+.+-+.++.+.... ..... T Consensus 72 vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a--------~~~~~ 143 (325) T protein:vir:95 72 VAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSA--------LSQVS 143 (325) T ss_pred eccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------hcccc Confidence 22222 223344444333323222222221 2223334455555555554444434444222110 00000 Q ss_pred ccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhc--------cCCceeecccccCccceE Q lcl|Aclame:pro 146 TAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRD--------ANGNPVFRDDSFAGFRTF 217 (305) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd--------~~G~~l~~~~~l~G~pv~ 217 (305) .....+....+.....--...+.++..++-.....-..|+||..++..|.+..- ..|...+ +..+|++|+ T Consensus 144 ~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~i--~t~~G~~VI 221 (325) T protein:vir:95 144 DVVYDATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNVV--RDPFGKLLV 221 (325) T ss_pred cceeeeecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcccc--cccCCcEEE Confidence 000001111111000011345666667766666677899999999999986433 3333222 468899999 Q 
ss_pred ecCccccCCCCceEEEEehhhEEEEe-ecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 218 FNRNGAWDADAAIEVIADSSRVKIGV-RQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 218 ~~~~~~~~~~~~~~~~gdf~~~~~~~-~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) +++.+|.......- ....++. .+.+.+....+......... .-++-...+|. |.- -+++|.++..-+ + T Consensus 222 VdD~~p~~~~g~~~-----~ytty~lg~GAi~~~~~~~~~~~~~~~~--~~~~~~~~~~~--~~t-f~lhp~G~sw~~-s 290 (325) T protein:vir:95 222 MTDSPNLFAAGTPN-----VYHILGLVPGGVLIGQNNDFDANEETKN--GDENIIRTYQA--EWS-YNIGVKGFAWDK-A 290 (325) T ss_pred EeCCCCCCCccCce-----eEEEEEEecCeEEecCCCCccccccccC--cccceeeeeee--eee-EEeecceeeeec-c Confidence 99998865322110 0011111 12222221111111111000 00111222232 221 256787777732 2 Q ss_pred ccccccCCC Q lcl|Aclame:pro 297 PVAVVAPAA 305 (305) Q Consensus 297 ~~a~v~~a~ 305 (305) . +-+.|.- T Consensus 291 ~-~g~sPt~ 298 (325) T protein:vir:95 291 N-GGKSPTD 298 (325) T ss_pred c-ccCCcCh Confidence 1 1234544 No 177 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=97.11 E-value=0.00017 Score=41.27 Aligned_cols=264 Identities=6% Similarity=-0.035 Sum_probs=104.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcce--e-----ecCCCceEEEEEe-CCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQN--V-----NMGTKTTHLPVLA-TLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~--~-----~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~ 72 (305) ||++--+|-- +.-+.+....+|++.+.......+.- + ++.++=.+.+.+. ++.. ........++... 
T Consensus 1 ~~~t~~sdl~-vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~----~~rnv~~~~~~t~ 75 (315) T protein:vir:96 1 MATTVNSDLV-IYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAI----ADRDVNSTATVAG 75 (315) T ss_pred Cceeeeccee-eehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccch----hhcccCCCccccc Confidence 9998888854 45677777788887776555543221 1 1222211122111 1100 0000011111111 Q ss_pred ccc-cceeEEeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccc Q lcl|Aclame:pro 73 SKV-TWANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG 148 (305) Q Consensus 73 ~~~-~f~~v~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~ 148 (305) .+. +...+..+..--.+-+..+.+.+. +........|.+.+..++.+.+=...+.|.-..-......+ T Consensus 76 ~kit~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~-------- 147 (315) T protein:vir:96 76 TKIAADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMN-------- 147 (315) T ss_pred eecccccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccc-------- Confidence 111 122222222111111223344443 22333333344444444444333333322211000000000 Q ss_pred cceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh-----hccCCceeec--ccccCccceEecCc Q lcl|Aclame:pro 149 QAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI-----RDANGNPVFR--DDSFAGFRTFFNRN 221 (305) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~-----kd~~G~~l~~--~~~l~G~pv~~~~~ 221 (305) .+..... .-...+.++..++-.....-..|+||..++..|.+- ..+.+.-+.. ++..+|+||++++. 
T Consensus 148 --~~~~~a~----~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~~~~~~~~~lGkrViVdD~ 221 (315) T protein:vir:96 148 --VSGELAT----EGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLYEEAGVVVYGGTPGTLGKPVLVTDQ 221 (315) T ss_pred --ccccccc----cCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhcccccceeEecCcCcccccEEEEECC Confidence 0011111 123455667777766667778999999999998761 1222222222 23456999999998 Q ss_pred cccCCCCceEEEEehhhEEEEee-cCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecccceEEEeccccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADSSRVKIGVR-QDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVSATAQGANKTPVA 299 (305) Q Consensus 222 ~~~~~~~~~~~~gdf~~~~~~~~-~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p~a~~~~~~t~~a 299 (305) ||.. +.++.. +.+.+.-..+......... ..+. +....|..|. ..+|.++..-+. + T Consensus 222 ~P~~-------------~~~gl~~GAi~~~~~~~~~~~~~~~~----g~e~--l~~~~r~e~tf~l~p~G~sw~~~---~ 279 (315) T protein:vir:96 222 CPAT-------------KIFGLVAGAVMITESQAPGMRSYQID----DQEN--LAIGFRAEGTANVEVLGYKWKTK---T 279 (315) T ss_pred CCcc-------------eeeeeecceeeecCCCccccccccCC----Ccce--eEEEEeeeeEeeeeeeeEEeecC---C Confidence 8852 112211 1111211111100000000 0011 1111222221 456666555322 2 Q ss_pred cccCCC Q lcl|Aclame:pro 300 VVAPAA 305 (305) Q Consensus 300 ~v~~a~ 305 (305) ...|-- T Consensus 280 ~~sPt~ 285 (315) T protein:vir:96 280 NVNPAS 285 (315) T ss_pred CcCCCh Confidence 223322 No 178 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=96.99 E-value=0.00022 Score=40.63 Aligned_cols=287 Identities=12% Similarity=0.061 Sum_probs=151.4 Q ss_pred CCC---ccC--CccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MAD---ISR--AEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~---~t~--~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) +|. 
+++ ..-.+-|-+.+...+.+.+.+.+-++++.++++|+--.. .+-.-..++-++...-+... .-.|... T Consensus 16 ~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~--~R~~~~~ 93 (355) T protein:vir:98 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDK--ERQTADF 93 (355) T ss_pred HHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccccCCCCC--Ccccccc Confidence 332 222 123567888888999999999999999999999874322 22222333444332111000 0022223 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc----ccc--------- Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV----SPA--------- 139 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~----~~~--------- 139 (305) ..++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-+-.-.|||+-...... |.+ T Consensus 94 ~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ 173 (355) T protein:vir:98 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQ 173 (355) T ss_pred cccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHH Confidence 44566667777777777888887753 235799999999999999888888899965221111 110 Q ss_pred ---------ccccccc-ccccee---ecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCC Q lcl|Aclame:pro 140 ---------LIPAAVT-AGQAVE---VVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANG 202 (305) Q Consensus 140 ---------~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G 202 (305) +.+.... .+..+. ..+...++..+-.+..++... ++..+.. .-++++.+.... ..-++-.... 
T Consensus 174 ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (355) T protein:vir:98 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ 253 (355) T ss_pred HHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhccC Confidence 1111000 000000 112233455555555555543 3433332 235666665443 2223322233 Q ss_pred ce--------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 203 NP--------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 203 ~~--------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .| +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+. . ++|.+. T Consensus 254 ~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~----p--------~r~rie 317 (355) T protein:vir:98 254 ENSESLAADIIISQKRIGNLPAVRVPYFPA----NAVLVTTLENLSIYFMDESHRRSIDEN----P--------KKDRVE 317 (355) T ss_pred CcHHHHHHHHHHHhhhhCCceeEEccccCC----CceEEeeccccEEEEecCcEEEEEEec----c--------cccccc Confidence 33 22346899999999999874 4478888888866555443 2222111 1 122222 Q ss_pred EEEEEEEccEeecccceEEEecc----ccccccCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKT----PVAVVAPAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t----~~a~v~~a~ 305 (305) -.-..-.||.|.++..++.+... 
+.++..|++ T Consensus 318 ~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~ 353 (355) T protein:vir:98 318 NYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPES 353 (355) T ss_pred chhhhcceeeeeccccEEEeeceeeeCCCCCccccc Confidence 22222345666666666555432 222333333 No 179 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=96.99 E-value=2e-05 Score=46.32 Aligned_cols=274 Identities=8% Similarity=-0.028 Sum_probs=139.7 Q ss_pred CCC-----ccCCccceEccHHHHH----HHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchhhccc Q lcl|Aclame:pro 1 MAD-----ISRAEVASLIQEAYSD----TLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKG 68 (305) Q Consensus 1 Ma~-----~t~~~gg~lip~~~~~----~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (305) .|. .++ .+..-+|..+.+ .+++.+.......++..+..++. ....+++.+....+.+.+-+. T Consensus 34 da~d~~~~~~~-~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~----- 107 (336) T protein:vir:36 34 DAADLSPHLSS-TGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYS----- 107 (336) T ss_pred hhhhccCcccc-CCCcchHHHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccC----- Confidence 011 111 111225555543 44555555555666666655432 245566666556666665443 Q ss_pred ccccccccceeEEeeeeeEEEeehhh-HHHhhcC--HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccc Q lcl|Aclame:pro 69 VKPTSKVTWANRTLVAEEIAVIIPVH-ENVIDDA--TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAV 145 (305) Q Consensus 69 ~~~~~~~~f~~v~~~~~k~~~~~~is-~ell~ds--~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~ 145 (305) +.|..+...+..+-+.+.++....++ .|+.+.. ..++..--....++++.+++|+-.+.|+..- ...|.++.-. 
T Consensus 108 D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~---~~yGllNdP~ 184 (336) T protein:vir:36 108 SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLINDPS 184 (336) T ss_pred CCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEecccc---ceEEEEecCC Confidence 34666655555566677788888887 4555433 3567777888888899999999889987642 2234444221 Q ss_pred cc-ccceeecc-cchhhhHHHHHHHHHHHHhhhcc------ccceEEEEchHHHHHHHHhhccCCceeec--ccccCccc Q lcl|Aclame:pro 146 TA-GQAVEVVG-GVANESDIVGATNRAAKAVASAG------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFR 215 (305) Q Consensus 146 ~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~p 215 (305) .. .....+.. ...+.+.+++++..+..++.... ..+..+++.+..+..|.+ ++..|.-++. .....++. T Consensus 185 l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n~Pnl~ 263 (336) T protein:vir:36 185 LSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFPKLE 263 (336) T ss_pred CccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHHHHHhcCccE Confidence 11 01111111 12334667788887777766532 246789999999888854 3333433321 11111222 Q ss_pred eEecCccccCCCCceEEEEehhhEEEEeecC---cEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecccceE Q lcl|Aclame:pro 216 TFFNRNGAWDADAAIEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVSATAQ 291 (305) Q Consensus 216 v~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~---i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p~a~~ 291 (305) ++-........+... .+++....+ ..+.+......... . ...-.....+..|.++. 
+.+|.+++ T Consensus 264 i~t~pEl~~a~g~~~-------~l~~~~~~~~~t~~~~~p~~~~~l~v--q---~~~~~~~v~~~~rt~Gv~i~~P~ai~ 331 (336) T protein:vir:36 264 FVTIPEYDTASGRLV-------QLWAPRVEGKDTATCGFTEKMRAHSI--E---RYSSYFRQKKSAGTWGAVIFRPFAVA 331 (336) T ss_pred EEEccccccCCCceE-------EEEEEecCCCcceeeecchhhhccce--e---ecCceeEeccccceeeeeeeccchhe Confidence 221111111112211 122211111 11111111100000 0 11123456677788766 57799999 Q ss_pred EEecc Q lcl|Aclame:pro 292 GANKT 296 (305) Q Consensus 292 ~~~~t 296 (305) ++++- T Consensus 332 ~~~GI 336 (336) T protein:vir:36 332 QMIGV 336 (336) T ss_pred eeecC Confidence 99987 No 180 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=96.93 E-value=0.00026 Score=40.28 Aligned_cols=282 Identities=9% Similarity=0.004 Sum_probs=156.8 Q ss_pred CC---CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MA---DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma---~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +| .+...+-.+-|.+.+...+.+.+.+.+-++++.++++|+.-.. .+-.-..++-++...-... ++-.|..-.. 
T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT~~~--~~R~~~~~~~ 93 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDTTGD--GVRKPRDVSA 93 (338) T ss_pred HHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccCCCC--Cccccccccc Confidence 32 3444555778889999999999999999999999999874322 2222233344443321100 0001111124 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc----ccc------------ Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW----VSP------------ 138 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~----~~~------------ 138 (305) ++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-+-.-.|||+-..... .|. T Consensus 94 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 173 (338) T protein:vir:11 94 LDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQY 173 (338) T ss_pred cCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 555567777777777888887753 23589999999999999998888889996522111 011 Q ss_pred ------cccccccccccceeecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCCce---- Q lcl|Aclame:pro 139 ------ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANGNP---- 204 (305) Q Consensus 139 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~---- 204 (305) -+++.........-..+...++..+-.+..++... ++..+.. .-++++.+.... 
..-.+-.....| T Consensus 174 Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~ 253 (338) T protein:vir:11 174 RNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVNKDQPATEKI 253 (338) T ss_pred HhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHhcCCChHHHH Confidence 01111111111111122223455555555555543 3444332 235666665443 222222222222 Q ss_pred ----eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 205 ----VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) Q Consensus 205 ----l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 279 (305) +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. ++|.+.-.-..- T Consensus 254 Aa~~~~s~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s~N 317 (338) T protein:vir:11 254 ATDLILSQKRMGGLPPVEVPYVPE----KGLMVTTLKNLSLYWQIGGRRRYLKEVP------------EKNRIENYESSN 317 (338) T ss_pred HHHHHHHhhhhCCceeEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------ccccccchhhhc Confidence 22245899999999999874 4478888888866555443 22221111 223333222334 Q ss_pred EccEeecccceEEEecccccc Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t~~a~ 300 (305) .||.|.++..++.+.....+- T Consensus 318 e~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 318 DAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred cceeeeccccEEEeecceecC Confidence 577888888888877543322 No 181 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=96.85 E-value=0.0003 Score=39.92 Aligned_cols=268 Identities=10% Similarity=-0.006 Sum_probs=121.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-----cCCCceEEEEEeCCCceeeeecchhhcccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-----MGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKV 
75 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~ 75 (305) ||... +..|-|+.+..++++.+++..++.+++..-. -.+++++||+..... +.++....-. +. T Consensus 1 m~~~~---N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~----v~dg~~~~~~-----~~ 68 (418) T protein:vir:10 1 MAVQD---NNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK----SASGRTLVKQ-----PM 68 (418) T ss_pred CCccc---cccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee----ecccCCcccc-----cc Confidence 77643 5677799999999999999999888876522 125689998843211 2233322211 22 Q ss_pred ccee--EEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee Q lcl|Aclame:pro 76 TWAN--RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) Q Consensus 76 ~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) +-.+ ++++.+|. .-+.++++=...+..++.+.+.+...++++..+|..++.-- .. .... .+. . T Consensus 69 te~~v~l~id~~k~-~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~-~~-------a~~~---~gt---~ 133 (418) T protein:vir:10 69 VDQTIPFKIAYQEH-VGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTL-KK-------AFHS---SGT---P 133 (418) T ss_pred ccceEEEEEecccc-cceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-hh-------cccc---ccc---C Confidence 2233 44544454 34567765444567788888889999999999998876310 00 0000 000 0 Q ss_pred cccchhhhHHHHHHHHHHHHhhhcccc--c-eEEEEchHHHHHHHHhhc----cCC-ceeec---ccccCccceEecCcc Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGWA--P-DTLLSSLALRYEVANIRD----ANG-NPVFR---DDSFAGFRTFFNRNG 222 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~v~~~~~~~~l~~~kd----~~G-~~l~~---~~~l~G~pv~~~~~~ 222 (305) ......+ +.+.++...+...+-. . =..+++|..+..|.+-.. 
..+ .-.++ -.++.|+.++.++++ T Consensus 134 gt~~~~~----~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~ni 209 (418) T protein:vir:10 134 GVRPGAF----IDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNL 209 (418) T ss_pred CcCcchH----HHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCC Confidence 0111123 3344444444443322 1 146789998877753221 111 01122 247999999999998 Q ss_pred ccCC-CC--c-eEEEEehhhE-EEEeecCcEEEEeecceeccCcc--eeeeeecCc---------EEEEEEEEEccEeec Q lcl|Aclame:pro 223 AWDA-DA--A-IEVIADSSRV-KIGVRQDITVKFLDQATLGTGEN--QINLAERDM---------VALRLKARFAYVLGV 286 (305) Q Consensus 223 ~~~~-~~--~-~~~~gdf~~~-~~~~~~~i~v~~~~~~~~~~~~~--~~~~~~~~~---------~~~r~~~r~~~~v~~ 286 (305) |..+ +. + ..+.|-.... -+....+ ..+...++..++. ....+.-|+ ..+++..-+. .. T Consensus 210 p~~tag~~~~t~~v~ga~~~~~~~~~~~~---t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~---~~ 283 (418) T protein:vir:10 210 PKHTVGDHGGTPLVNGTVVNGDTVGFDGG---TASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVD---TD 283 (418) T ss_pred CcccccccccceeeecccccceeEEEeec---ceeeccceeeccEEEECceeecccccccccccceEEEEEeecc---cc Confidence 8432 11 1 1222221111 1111111 0011111111110 000011011 1111111110 00 Q ss_pred ccceEEEeccccc---------------------c--ccCCC Q lcl|Aclame:pro 287 SATAQGANKTPVA---------------------V--VAPAA 305 (305) Q Consensus 287 p~a~~~~~~t~~a---------------------~--v~~a~ 305 (305) -.+-..++..|+- - -.||+ T Consensus 284 ~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~ 325 (418) T protein:vir:10 284 AGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPAD 325 (418) T ss_pred ccCcceeEeccccccccccccccccccccccCCCcccccccC Confidence 0111122222210 0 12222 No 182 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.84 E-value=0.00011 Score=42.30 Aligned_cols=273 Identities=8% Similarity=-0.026 Sum_probs=141.1 Q ss_pred 
CC---C------ccCCccceEccHHHH----HHHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchh Q lcl|Aclame:pro 1 MA---D------ISRAEVASLIQEAYS----DTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESAT 64 (305) Q Consensus 1 Ma---~------~t~~~gg~lip~~~~----~~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~ 64 (305) |+ . .|.+.. -+|..+. .++++.+.......++..+..++. ....+++.+....+.+.+-+. T Consensus 31 ~a~da~d~~~~~~t~~~~--g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~- 107 (336) T protein:vir:78 31 YAMDAADLSPHLSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYS- 107 (336) T ss_pred HHHhhhhhccccccCCCc--chHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeeccc- Confidence 11 1 111111 1444332 344455555555566666655432 256677777667777776543 Q ss_pred hcccccccccccceeEEeeeeeEEEeehhhH-HHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccc Q lcl|Aclame:pro 65 DPKGVKPTSKVTWANRTLVAEEIAVIIPVHE-NVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALI 141 (305) Q Consensus 65 ~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~-ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~ 141 (305) +.|..+...+...-+.+.++..+.++. |+-.- ...++..--....++++.+++++-.++|+..- ...|++ T Consensus 108 ----D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~---~~~Gll 180 (336) T protein:vir:78 108 ----SDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLI 180 (336) T ss_pred ----CCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEecccc---ceEEEE Confidence 346666666677777778888888885 44332 24567888888888899999999899997532 233444 Q ss_pred ccccccc-cceeecc-cchhhhHHHHHHHHHHHHhhhcc------ccceEEEEchHHHHHHHHhhccCCceeec--cccc Q lcl|Aclame:pro 142 PAAVTAG-QAVEVVG-GVANESDIVGATNRAAKAVASAG------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSF 211 (305) Q Consensus 142 ~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l 211 (305) +.-.... ....+.. ...+.+.+++++..+..++.... ..+..+++.+..+..|.+ ++..|.-++. .... 
T Consensus 181 N~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n~ 259 (336) T protein:vir:78 181 NDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIF 259 (336) T ss_pred eCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHhc Confidence 4322211 1111111 12344667888888877765433 134579999999999864 3333432221 1111 Q ss_pred CccceEecCccccCCCCceEEEEehhhEEEEeecC---cEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecc Q lcl|Aclame:pro 212 AGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVS 287 (305) Q Consensus 212 ~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~---i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p 287 (305) .++.++-........+.... ++..+..+ ..+.+-........ . .........+..|.++. +.+| T Consensus 260 Pnl~i~t~pel~~Agg~~~~-------~~~~~~~~~~t~~~~~p~~f~~lpv--q---~~~~~~~v~~~~rt~Gv~i~~P 327 (336) T protein:vir:78 260 PKLEFVTIPEYDTASGRLVQ-------LWAPRVEGKDTATCGFTEKMRAHSI--E---RYSSYFRQKKSAGTWGAVIFRP 327 (336) T ss_pred CccEEEEcccccccCcceEE-------EEEeeccCCcceeeecchhhhccce--e---ecCceeEeccccceeeeeeecc Confidence 12222211111111111111 11111111 11111111100000 0 11123455677777766 5779 Q ss_pred cceEEEecc Q lcl|Aclame:pro 288 ATAQGANKT 296 (305) Q Consensus 288 ~a~~~~~~t 296 (305) .+++++++- T Consensus 328 ~ai~~~~GI 336 (336) T protein:vir:78 328 FAVAQMIGV 336 (336) T ss_pred chheeeccC Confidence 999999987 No 183 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=96.81 E-value=0.00032 Score=39.74 Aligned_cols=283 Identities=13% Similarity=0.067 Sum_probs=157.0 Q ss_pred CC---CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MA---DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 
Ma---~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +| .+...+-.+-|-+.+.+.+.+.+.+.+-++++.++++++--.. ++-.-..++-++...-... .-.|..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~---~R~~~~~~~ 92 (339) T protein:vir:79 16 IAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDTTQQ---DRETSDIST 92 (339) T ss_pred HHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccCCCC---Ccccccccc Confidence 22 3344455678888999999999999999999999998874322 2222233343433211100 011222235 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccc---------------- Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSP---------------- 138 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~---------------- 138 (305) ++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-.-.-.|||+-......+. T Consensus 93 l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~ 172 (339) T protein:vir:79 93 MDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQNL 172 (339) T ss_pred cCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcCccccchhHHHHH Confidence 566667777776677788877753 23579999999999999988888888986532211110 Q ss_pred ------cccccccccccceeecccchhhhHHHHHHHHHHH-Hhhhcccc--ceEEEEchHHHH-HHHHhhccCCce---- Q lcl|Aclame:pro 139 ------ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAK-AVASAGWA--PDTLLSSLALRY-EVANIRDANGNP---- 204 (305) Q Consensus 139 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~---- 204 (305) -+.+........+...+...++..+-.+..++.. .++..+.. .-++++.+.... 
.--.+-.....| T Consensus 173 Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~ 252 (339) T protein:vir:79 173 REQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFPLVNRDRDPVQQI 252 (339) T ss_pred HhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhhHhhcCCChHHHH Confidence 0111111111111111223345555555555554 33444432 234555555443 222222222233 Q ss_pred ----eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 205 ----VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) Q Consensus 205 ----l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 279 (305) +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.+ ++|.+.-.-..- T Consensus 253 Aa~~i~s~k~iGGl~a~~~PfFP~----~~llVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s~N 316 (339) T protein:vir:79 253 AADLIISQKRIGNLPAIRVPYFPA----NGLLVTRLDNLSIYYQEGGRRRTILDNA------------KRDRIENYESSN 316 (339) T ss_pred HHHHHHHhhhhCCceeEEccccCC----CceEEeechhcEEEEecCcEEEEEEecc------------ccccccchhhcc Confidence 22346899999999998874 4478888888766554442 22222211 223333323334 Q ss_pred EccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .||.|.++..++.+.... 
+..|| T Consensus 317 e~YvVEd~~~~a~iEni~---~~~aa 339 (339) T protein:vir:79 317 DAYVIEDLACAAMAENIA---LAAAA 339 (339) T ss_pred ceeeeeccccEEEeeeee---cccCC Confidence 578888888888887542 34444 No 184 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=96.67 E-value=0.00039 Score=39.26 Aligned_cols=271 Identities=11% Similarity=0.031 Sum_probs=143.9 Q ss_pred CCCccCCccceEccHHH---HHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCCcee--eeecchhhccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAY---SDTLLAAAKQGSTVLSAFQNVN---MGTKTTHLPVLATLPEAD--WVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~---~~~i~~~~~~~~~l~~l~~~~~---~~~~~~~~p~~~~~~~a~--~v~E~~~~~~~~~~~ 72 (305) |+. .++++ .++ ...|.+...+.-..+++..+.+ ....++.+...+....+. |++-.. .++|. T Consensus 1 ~~~-----lafl~-~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a----~dip~ 70 (304) T protein:vir:52 1 MSL-----LAYVK-NGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGT----STLDQ 70 (304) T ss_pred Cch-----HHHHH-HHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcC----Cccce Confidence 332 33443 333 3445554444444555554432 223356666666555666 876554 35788 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcC---HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccc Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDA---TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQ 149 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~ 149 (305) .+..+++.....+..+..+.+|.+=++.+ ..++..--.+...+++...+|+..+.|+-...+ ..|+++....... 
T Consensus 71 vd~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g--~~GllN~p~v~~~ 148 (304) T protein:vir:52 71 VEVGFTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSR--LTGLLNNKSVEVY 148 (304) T ss_pred eecccceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccc--eEEEEeCCCccee Confidence 88888888888888888888876433322 335666666777788999999999999642222 2333333322211 Q ss_pred cee--ec---ccchhhhHHHHHHHHHHHHhhhcc---ccceEEEEchHHHHHHHHhh-ccCCceee----c-ccccCccc Q lcl|Aclame:pro 150 AVE--VV---GGVANESDIVGATNRAAKAVASAG---WAPDTLLSSLALRYEVANIR-DANGNPVF----R-DDSFAGFR 215 (305) Q Consensus 150 ~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~v~~~~~~~~l~~~k-d~~G~~l~----~-~~~l~G~p 215 (305) ... .+ -...+.+.+.+++..+..++.... ..++.+++.|..+..|.... ...+.-++ + .....|.| T Consensus 149 ~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~ 228 (304) T protein:vir:52 149 AIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQ 228 (304) T ss_pred eecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCc Confidence 111 01 112255667888888777665432 34678999999999886542 22222222 1 11233555 Q ss_pred eEec----Cccc-cCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCc--EEEEEEEEEccE-eecc Q lcl|Aclame:pro 216 TFFN----RNGA-WDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDM--VALRLKARFAYV-LGVS 287 (305) Q Consensus 216 v~~~----~~~~-~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~--~~~r~~~r~~~~-v~~p 287 (305) +-+- .... ...++..+++.+.+.=.+...--+.+.... ...++. ..+=++.|+|+. 
+.+| T Consensus 229 l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~------------~q~~~~~~~~vp~~~r~gGv~v~~P 296 (304) T protein:vir:52 229 VAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLD------------AQPKGLLAFESGLRMAFGGVTFMEP 296 (304) T ss_pred ceEEEecccccccCCCCceEEEEEecChhheEEecCccccccc------------hhhcCCceEEecceeeeeeEEEEcc Confidence 3211 1111 112233344444332222111111111110 122332 334467888766 5779 Q ss_pred cceEEEec Q lcl|Aclame:pro 288 ATAQGANK 295 (305) Q Consensus 288 ~a~~~~~~ 295 (305) .+++.+.. T Consensus 297 ~a~~y~D~ 304 (304) T protein:vir:52 297 DSALYVDY 304 (304) T ss_pred ceeeeecC Confidence 99999999 No 185 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=96.62 E-value=0.00046 Score=38.88 Aligned_cols=287 Identities=11% Similarity=0.042 Sum_probs=153.4 Q ss_pred CCC---cc--CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MAD---IS--RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH-LPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~---~t--~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~-~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) +|. ++ ..+-.+-|-+.+...+.+.+.+.+-++++.++++|+--... +-.-..++-++...-+.. . .-.|... 
T Consensus 16 ~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~-~-~R~~~~~ 93 (355) T protein:vir:18 16 LAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTTDTSGD-K-ERQTADF 93 (355) T ss_pred HHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeeccccCCC-C-Ccccccc Confidence 322 21 12235678888889999999999999999999998743222 222233344443221100 0 0022233 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc----ccc--------- Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV----SPA--------- 139 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~----~~~--------- 139 (305) ..++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-+-.-.|||+-...... |.+ T Consensus 94 ~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ 173 (355) T protein:vir:18 94 TALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQ 173 (355) T ss_pred cccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHH Confidence 44566667777777777888887753 235799999999999999888888899965222111 110 Q ss_pred ---------ccccccc-ccccee---ecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCC Q lcl|Aclame:pro 140 ---------LIPAAVT-AGQAVE---VVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANG 202 (305) Q Consensus 140 ---------~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G 202 (305) +.+.... .+..+. ..+...++..+-.+..++... ++..+.. .-++++.+.... 
..-++-...+ T Consensus 174 ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (355) T protein:vir:18 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQ 253 (355) T ss_pred HHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhccC Confidence 0111000 000000 112233455555555555543 3443332 235666665443 2222322233 Q ss_pred ce--------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 203 NP--------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 203 ~~--------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .| +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. ++|.+. T Consensus 254 ~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie 317 (355) T protein:vir:18 254 ENTESLAADIIISQKRIGNLPAVRVPYFPA----NAVFVTTLENLSIYFMDESHRRSIDENP------------KKDRVE 317 (355) T ss_pred ChHHHHHHHHHHHHHhhCCceeEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------cccccc Confidence 32 22246899999999999874 4478888888866555443 22221111 122222 Q ss_pred EEEEEEEccEeecccceEEEecc----ccccccCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKT----PVAVVAPAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t----~~a~v~~a~ 305 (305) -.-..-.||.|.++..++.+... 
+.++.+|++ T Consensus 318 ~y~s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~ 353 (355) T protein:vir:18 318 NYESMNIDYVVEAYAAGCLLENITLGDFTAPAAPEG 353 (355) T ss_pred chhhhcceeeeeccccEEEEeeeeecCCCCcccccC Confidence 22223446677777766666533 222233333 No 186 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=96.52 E-value=0.00055 Score=38.47 Aligned_cols=282 Identities=12% Similarity=0.033 Sum_probs=152.6 Q ss_pred CCCcc-----CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MADIS-----RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH-LPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~~t-----~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~-~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) +|... ..+-.+-|.+.+...+.+.+.+.+-++++.++++++--... +-.-..++-++...-+ .+... T Consensus 20 ~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt~tr-------~~~~~ 92 (358) T protein:vir:78 20 LAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRKKGG-------RFKGK 92 (358) T ss_pred HHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceecCCC-------ccccc Confidence 33211 22346788899999999999999999999999998743222 2222333444332221 23334 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhc-----CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc------------- Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDD-----ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV------------- 136 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~d-----s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~------------- 136 (305) ..++.-.+..++.-.-..|+.+.|.. +..+|+..+.+.+.++++.-.-.-.|||+-...... 
T Consensus 93 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~G 172 (358) T protein:vir:78 93 VGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKG 172 (358) T ss_pred cccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchH Confidence 45666777777777777888887753 122699999999999999888888889865322111 Q ss_pred ---------cccccccccccccceeecccchhhhHHHHHHHHHHH-Hhhhcccc--ceEEEEchHHHH-HHHHhhccCCc Q lcl|Aclame:pro 137 ---------SPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAK-AVASAGWA--PDTLLSSLALRY-EVANIRDANGN 203 (305) Q Consensus 137 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~ 203 (305) +.-+...........-..+...++..+-.+..++.. .++..+.. .-++++.+.... .--.+-...+. T Consensus 173 WlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 252 (358) T protein:vir:78 173 WHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYSEATK 252 (358) T ss_pred HHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCC Confidence 111111111111111122222345555555555543 33443333 235556655444 32223222222 Q ss_pred ee---e---cccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEE Q lcl|Aclame:pro 204 PV---F---RDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRL 276 (305) Q Consensus 204 ~l---~---~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~ 276 (305) |- - .-.++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. 
++|.+.-.- T Consensus 253 pTE~~Aa~~i~k~iGGlpa~~~PfFP~----~~ilVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~riE~y~ 316 (358) T protein:vir:78 253 PSEQIAAQQLAKSIAGRKAYIPPFFPG----KRMVVTTLDNLHCYTQRGTRKRKADDNQ------------DSKSFDNQY 316 (358) T ss_pred cHHHHHHHHHHHHhCCCeEEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------ccccccchh Confidence 21 0 014789999999998874 4478888888765554442 22222211 122222222 Q ss_pred EEEEccEeecccceEEEecc-------ccccccCCC Q lcl|Aclame:pro 277 KARFAYVLGVSATAQGANKT-------PVAVVAPAA 305 (305) Q Consensus 277 ~~r~~~~v~~p~a~~~~~~t-------~~a~v~~a~ 305 (305) ..-.||.|.++..++.+... |+.+-..|+ T Consensus 317 s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~ 352 (358) T protein:vir:78 317 WRMEGYALGEHKAYGGFEEADIEIGADPAVLAVEAA 352 (358) T ss_pred hhcceeeeeccccEEEEeeeeeeeCCCCCccccCCc Confidence 23346667777666666543 222222122 No 187 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=96.45 E-value=0.00061 Score=38.20 Aligned_cols=272 Identities=16% Similarity=0.143 Sum_probs=123.0 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhh-hhhhhhcceeecCCCceEEEEEeCCCce-eeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQG-STVLSAFQNVNMGTKTTHLPVLATLPEA-DWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~-~~l~~l~~~~~~~~~~~~~p~~~~~~~a-~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |..++.. =.++-..+...+.+..... ....++|++.+-+...-++.....-|.. .|++|.. .....=. 
T Consensus 1 m~it~~~--l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~~--------~~~l~~~ 70 (302) T protein:vir:10 1 MLINKQS--LNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAKV--------VKNLKAY 70 (302) T ss_pred CcccHHH--HHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCcccccccee--------ecccccc Confidence 6654321 1112223333333333332 3356677777755555555555444443 4555433 2233444 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----Ccc----cCcCccccccccccccccc- Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTD----KPASWVSPALIPAAVTAGQ- 149 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g----~~~~~~~~~~~~~~~~~~~- 149 (305) ..+++.++.+..+.||++.+.+-..++..-+.+.|+++.++.+|+.++. |.+ .+..++...-........+ T Consensus 71 ~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~ 150 (302) T protein:vir:10 71 KYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNK 150 (302) T ss_pred ceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccc Confidence 5678888999999999999998888899999999999999999877764 211 1111111110000000000 Q ss_pred ---ceeecccchhhhHHHHHHHHHHHHhhh-----ccccceEEEEchHHHHHHHHhhccCCceee-cccccCcc-ceEec Q lcl|Aclame:pro 150 ---AVEVVGGVANESDIVGATNRAAKAVAS-----AGWAPDTLLSSLALRYEVANIRDANGNPVF-RDDSFAGF-RTFFN 219 (305) Q Consensus 150 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~-~~~~l~G~-pv~~~ 219 (305) .......... .+.+.....++.+... -...+..+++.|.....-+++-.+ ++.-. ....+.|. .+++. 
T Consensus 151 g~~~~~~~~~~l~-~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~~~g~~Np~~g~~~~vv~ 228 (302) T protein:vir:10 151 GTAPLSNASQAAA-KAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKLADNTPNPYVGTAELVVD 228 (302) T ss_pred cchhhhhcccccc-hHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cccCCCCcceeccceEEEEe Confidence 0000011111 1112222222222211 123456678777777666554211 11100 01122232 23333 Q ss_pred CccccCCCCceEEEEehhhE---EEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc------cEeecccce Q lcl|Aclame:pro 220 RNGAWDADAAIEVIADSSRV---KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA------YVLGVSATA 290 (305) Q Consensus 220 ~~~~~~~~~~~~~~gdf~~~---~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~------~~v~~p~a~ 290 (305) ..+. .+..=.++.|.+.+ ++.-+++..++..+. |..+.+.+|.+..+| .+...+... T Consensus 229 p~L~--s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~------------~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a 294 (302) T protein:vir:10 229 GRIE--SDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVN------------LDSDDVFNLRKLKFGAEARAAAGYGFWQLA 294 (302) T ss_pred eccC--CCCceEEEecCCccceEEEcCccccEEEeccC------------CCCCceEEEEEEEEeeeeeeecchhhhhhh Confidence 3332 12222344454433 233345555554221 233334444433333 233344444 Q ss_pred EEEecccc Q lcl|Aclame:pro 291 QGANKTPV 298 (305) Q Consensus 291 ~~~~~t~~ 298 (305) ..-+++.+ T Consensus 295 ~~s~g~~~ 302 (302) T protein:vir:10 295 YGSTGTGA 302 (302) T ss_pred hccCccCC Confidence 44444333 No 188 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=96.35 E-value=0.00068 Score=37.95 Aligned_cols=277 Identities=11% Similarity=0.002 Sum_probs=131.4 Q ss_pred CCCc------------cCCccceEccHH---HHHHHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecc Q lcl|Aclame:pro 1 MADI------------SRAEVASLIQEA---YSDTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGES 62 (305) Q Consensus 1 Ma~~------------t~~~gg~lip~~---~~~~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~ 62 (305) |..- ++... .-+|.. 
+...+++.+..-..+.++..+.+.+. ....+++.+....+.+.+-+ T Consensus 56 md~~~~~~~~~~~~~l~~~~~-~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~ 134 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSI-PGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDG 134 (379) T ss_pred hccccccccccccCccccccc-cchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccc Confidence 2211 10000 012222 22456666666555666666655432 24556666666667666544 Q ss_pred hhhcccccccccccceeEEeeeeeEEEeehhhH-HHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccc Q lcl|Aclame:pro 63 ATDPKGVKPTSKVTWANRTLVAEEIAVIIPVHE-NVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPA 139 (305) Q Consensus 63 ~~~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~-ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~ 139 (305) .. .|..+...+...-..+.++..+.++. |+..- ...++..--....++++.+++|+..|+|.+.+ +....| T Consensus 135 ~d-----~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~-~~~~yG 208 (379) T protein:vir:10 135 GN-----MALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDG-SGRTFG 208 (379) T ss_pred cC-----CCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCC-CcceEE Confidence 32 34444433333444455666666665 44332 34578888889999999999999999996432 122233 Q ss_pred ccccccccc--cceee----cc-cchhhhHHHHHHHHHHHHhhhccc-------cceEEEEchHHHHHHHHhhccCCcee Q lcl|Aclame:pro 140 LIPAAVTAG--QAVEV----VG-GVANESDIVGATNRAAKAVASAGW-------APDTLLSSLALRYEVANIRDANGNPV 205 (305) Q Consensus 140 ~~~~~~~~~--~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~~kd~~G~~l 205 (305) .++.-.... ...+. .. ...+.+.+++++..+..++..... 
.+..+++.+..+..|..- +..|.-+ T Consensus 209 llNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tv 287 (379) T protein:vir:10 209 FLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSV 287 (379) T ss_pred EEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccH Confidence 333222111 00111 11 122455677777777776553321 233788999999988743 3333322 Q ss_pred ec--ccccCccceEecCccccCCCCc--eEEEEehhhEEEEeecCcEEE-------EeecceeccCcceeeeeecCcEEE Q lcl|Aclame:pro 206 FR--DDSFAGFRTFFNRNGAWDADAA--IEVIADSSRVKIGVRQDITVK-------FLDQATLGTGENQINLAERDMVAL 274 (305) Q Consensus 206 ~~--~~~l~G~pv~~~~~~~~~~~~~--~~~~gdf~~~~~~~~~~i~v~-------~~~~~~~~~~~~~~~~~~~~~~~~ 274 (305) +. .....++.++-...+....+.+ ..++.+ ...+...+ ...+. +..-... ...-.... T Consensus 288 l~~lk~n~Pnl~i~t~pEL~~aggg~~~~~~~~~-------~~~~~~t~~~~~~~~~~p~k-~~~l~ve---~~~~~~~~ 356 (379) T protein:vir:10 288 AQYMRESYPNVTFVSAPELNDANGGSSAIYYYAD-------AVENNGTDDGRTWLQVVPTK-MFTLGVE---KKIKGYAE 356 (379) T ss_pred HHHHHHhcCCcEEEEcccccccCCCccEEEEEee-------ccCCCccCCcceEEEecchh-hhhccce---ecCceeEe Confidence 21 1112122222211121111211 122221 11111110 00000 0000000 01123344 Q ss_pred EEEEEEccE-eecccceEEEecc Q lcl|Aclame:pro 275 RLKARFAYV-LGVSATAQGANKT 296 (305) Q Consensus 275 r~~~r~~~~-v~~p~a~~~~~~t 296 (305) .+..|.++. 
|.+|.+++...++ T Consensus 357 ~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 357 GYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ccccceeeeeeecchhhheecCC Confidence 666777665 5779999999998 No 189 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=96.21 E-value=0.00087 Score=37.37 Aligned_cols=281 Identities=11% Similarity=0.037 Sum_probs=155.5 Q ss_pred CCC---ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MAD---ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~---~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +|. +....-.+-|-+.+...+.+.+.+.+-++++.++++|+--.. .+-.-..++-++...-+.. ...|..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~---~R~~~~~~~ 92 (337) T protein:vir:10 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKA---ARQPIDPTA 92 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCC---ccccccccc Confidence 332 223334567778888999999999999999999999874322 2222233344443332211 012223345 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc----cc------------ Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV----SP------------ 138 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~----~~------------ 138 (305) ++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-+-.-.|||+-...... |. 
T Consensus 93 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 172 (337) T protein:vir:10 93 LDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) T ss_pred cCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 666677777777777888888753 235899999999999999988888899965322110 11 Q ss_pred ------cccccccccccceeecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCCce---- Q lcl|Aclame:pro 139 ------ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANGNP---- 204 (305) Q Consensus 139 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~---- 204 (305) -+++........+ ..+...++..+-.+..++... ++..+.. .-++++.+.... .--.+-.....| T Consensus 173 Re~ap~rV~~~~~~~~~~i-~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~ptE~~ 251 (337) T protein:vir:10 173 RERAAQRVLHEGAKQAGKV-LVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERL 251 (337) T ss_pred HhcchhhhhccccccCcce-eecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcHHHH Confidence 1111111110001 112233455555555555543 3443332 234555555444 222222222232 Q ss_pred ----eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 205 ----VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) Q Consensus 205 ----l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 279 (305) +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. 
++|.+.-.-..- T Consensus 252 Aa~~i~s~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s~N 315 (337) T protein:vir:10 252 AADLIVSQKRIGNLPAVRVPFFPK----RALMVTKLSNLSIYYQEGARRRTLKEVP------------ERDRIENYESSN 315 (337) T ss_pred HHHHHHHhhhhCCceeEEccccCC----CceEEeechhcEEEEecCcEEEEEEEcc------------ccccccchhhcc Confidence 22346899999999999874 4488888888866555443 22221111 223333322334 Q ss_pred EccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) .||.|.++..++.+..-.. ..| T Consensus 316 e~YvVEd~~~~a~ienI~~---~~a 337 (337) T protein:vir:10 316 DAYVVEDFGCGCVAENIEL---AAA 337 (337) T ss_pred ceeeeeccccEEEEeceee---cCC Confidence 5778888888888774321 111 No 190 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=96.15 E-value=0.00094 Score=37.18 Aligned_cols=281 Identities=11% Similarity=0.037 Sum_probs=155.3 Q ss_pred CCC---ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MAD---ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH-LPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~---~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~-~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +|. +....-.+-|-+.+...+.+.+.+.+-++++.++++|+--... +-.-..++-++...-+.. ...|..-.. 
T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~---~R~~~~~~~ 92 (337) T protein:vir:79 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKA---ARQPIDPTA 92 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCC---ccccccccc Confidence 322 2233335667788889999999999999999999998743222 222233344443332211 012223345 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc----cc------------ Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV----SP------------ 138 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~----~~------------ 138 (305) ++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-+-.-.|||+-...... |. T Consensus 93 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 172 (337) T protein:vir:79 93 LDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) T ss_pred cCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 666677777777777888888753 235899999999999999988888899965322110 11 Q ss_pred ------cccccccccccceeecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCCce---- Q lcl|Aclame:pro 139 ------ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANGNP---- 204 (305) Q Consensus 139 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~---- 204 (305) -+++........+ ..+...++..+-.+..++... ++..+.. .-+.++.+.... 
.--.+-.....| T Consensus 173 Re~ap~rV~~~~~~~~~~i-~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~ptE~~ 251 (337) T protein:vir:79 173 RERAAQRVLHEGAKQAGKV-LVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVNATQAPTERL 251 (337) T ss_pred HhcchhhhhccccccCcce-eecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcHHHH Confidence 1111111110001 112333455555555555543 3443332 234555555444 222222222232 Q ss_pred ----eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 205 ----VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) Q Consensus 205 ----l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 279 (305) +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. ++|.+.-.-..- T Consensus 252 Aa~~i~s~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s~N 315 (337) T protein:vir:79 252 AADLIVSQKRIGNLPAVRVPFFPK----RALMVTKLSNLSIYYQEGARRRTLKEVP------------ERDRIENYESSN 315 (337) T ss_pred HHHHHHHhhhhCCceeEEccccCC----CceEEeechhcEEEEecCcEEEEEEEcc------------ccccccchhhcc Confidence 22346899999999999874 4488888888866555443 22221111 223333322334 Q ss_pred EccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) .||.|.++..++.+..-.. 
..| T Consensus 316 e~YvVEd~~~~a~ienI~~---~~a 337 (337) T protein:vir:79 316 DAYVVEDFGCGCVAENIEL---AAA 337 (337) T ss_pred ceeeeeccccEEEEeceee---cCC Confidence 5778888888888774321 112 No 191 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.03 E-value=0.00045 Score=38.94 Aligned_cols=273 Identities=9% Similarity=-0.020 Sum_probs=131.9 Q ss_pred CC---C------ccCCccceEccHHHHH----HHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchh Q lcl|Aclame:pro 1 MA---D------ISRAEVASLIQEAYSD----TLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESAT 64 (305) Q Consensus 1 Ma---~------~t~~~gg~lip~~~~~----~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~ 64 (305) |+ . .|.+.. -+|..+.+ ++++.+.......++..+.+.+. ....+++.+....+.+.+-. T Consensus 31 ~a~da~d~~~~~~t~~~~--g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~-- 106 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDY-- 106 (336) T ss_pred HHHhhhhhccccccCCCc--chHHHHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEcccc-- Confidence 21 1 111111 14444432 23333444433444444444321 12344444444444443322 Q ss_pred hcccccccccccceeEEeeeeeEEEeehhhH-HHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccc Q lcl|Aclame:pro 65 DPKGVKPTSKVTWANRTLVAEEIAVIIPVHE-NVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALI 141 (305) Q Consensus 65 ~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~-ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~ 141 (305) .+.|..+...+.-.-+.+.++....++. 
|+-.- ...++..--....++++.+++++-.+.|+..- ...|++ T Consensus 107 ---~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~---~~~Gll 180 (336) T protein:vir:10 107 ---SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLI 180 (336) T ss_pred ---CCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeeccc---ceEEEe Confidence 2467767666666667778888888886 44332 24567888888888899999999889987642 233444 Q ss_pred ccccccc-cceeecc-cchhhhHHHHHHHHHHHHhhhcc------ccceEEEEchHHHHHHHHhhccCCceeec--cccc Q lcl|Aclame:pro 142 PAAVTAG-QAVEVVG-GVANESDIVGATNRAAKAVASAG------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSF 211 (305) Q Consensus 142 ~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l 211 (305) +.-.... ....+.. ...+.+.+++++..+..++.... ..+..+++.+..+..|.+ ++..|.-++. .... T Consensus 181 N~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n~ 259 (336) T protein:vir:10 181 NDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIF 259 (336) T ss_pred ecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHhC Confidence 4322211 1111111 12344667888888877765433 135579999999999864 3343432221 1111 Q ss_pred CccceEecCccccCCCCceEEEEehhhEEEEeecC---cEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecc Q lcl|Aclame:pro 212 AGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVS 287 (305) Q Consensus 212 ~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~---i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p 287 (305) .++.++-........+....++ ..+..+ ..+.+-........+ .........+..|.++. 
+.+| T Consensus 260 Pnl~i~t~pel~~Agg~~~~~~-------~~~~~~~~t~~~~~P~~f~~lpvq-----~~~~~~~v~~~~rt~Gv~i~rP 327 (336) T protein:vir:10 260 PKLEFVTIPEYDTASGRLVQLW-------APRVEGKDTATCGFTEKMRAHSIE-----RYSSYFRQKKSAGTWGAVIFRP 327 (336) T ss_pred CccEEEEcccccccCCceEEEE-------EecccCCcceeeecChhhhcccee-----ecCceeEeccccceeeeeeecc Confidence 1222222111211112211111 111111 111111100000000 11123455677777666 5779 Q ss_pred cceEEEecc Q lcl|Aclame:pro 288 ATAQGANKT 296 (305) Q Consensus 288 ~a~~~~~~t 296 (305) .++++..+- T Consensus 328 ~ai~~~~GI 336 (336) T protein:vir:10 328 FAVAQMLGV 336 (336) T ss_pred chheeeccC Confidence 999999987 No 192 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=95.97 E-value=0.00091 Score=37.25 Aligned_cols=278 Identities=8% Similarity=0.021 Sum_probs=128.0 Q ss_pred CC-----CccCCccceEccHHHH----HHHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchhhccc Q lcl|Aclame:pro 1 MA-----DISRAEVASLIQEAYS----DTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKG 68 (305) Q Consensus 1 Ma-----~~t~~~gg~lip~~~~----~~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (305) |= -.|+.+.| +|-.+. ..+++.+.......++..+...+. ....+++.+....|.+.+-+.. 
T Consensus 63 mDa~~~~~~t~~~~g--~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D---- 136 (382) T protein:vir:96 63 MDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTN---- 136 (382) T ss_pred cccccCCccccCCcc--HHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccC---- Confidence 21 12222222 465554 445556666555666666655432 3567777777777877765543 Q ss_pred cccccc--ccceeEEeeeeeEEEeehh-hHHHhhcC--HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccc Q lcl|Aclame:pro 69 VKPTSK--VTWANRTLVAEEIAVIIPV-HENVIDDA--TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPA 143 (305) Q Consensus 69 ~~~~~~--~~f~~v~~~~~k~~~~~~i-s~ell~ds--~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~ 143 (305) .|..+ ..+.+.++....+ ...+ ..|+.+.+ ..++.+--....++++.+++|+..|.|+-.+.+....|+++. T Consensus 137 -~Pl~d~~~~~~~r~v~~~~~--g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNd 213 (382) T protein:vir:96 137 -IPLTSWNANFERRTIVRGEL--GLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLND 213 (382) T ss_pred -CCccccccceeEEEEEEEEE--eeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeC Confidence 23333 3455555544444 4455 35655543 456677777888889999999999999643322222344443 Q ss_pred cccc--ccceeecccchhhhHHHHHHHHHHHHhhhccc-------cceEEEEchHHHHHHHHhhccCCceeec--ccccC Q lcl|Aclame:pro 144 AVTA--GQAVEVVGGVANESDIVGATNRAAKAVASAGW-------APDTLLSSLALRYEVANIRDANGNPVFR--DDSFA 212 (305) Q Consensus 144 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~ 212 (305) -... .......-...+.+.+++++..+..++..... .+..+++.+..+..|.. .+..|.-++. ..... 
T Consensus 214 P~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n~P 292 (382) T protein:vir:96 214 PNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSV-TTPYGISVSDWIEQTYP 292 (382) T ss_pred CCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccc-cCccCccHHHHHHHhcC Confidence 2211 11111111233556677888887777754432 12257789888888853 2333322221 11111 Q ss_pred ccceEe-cCccccCC---CCceEEEEehhhEEEEeecCcEE--EEeec---ceeccCcce---eeeeec-CcEEEEEEEE Q lcl|Aclame:pro 213 GFRTFF-NRNGAWDA---DAAIEVIADSSRVKIGVRQDITV--KFLDQ---ATLGTGENQ---INLAER-DMVALRLKAR 279 (305) Q Consensus 213 G~pv~~-~~~~~~~~---~~~~~~~gdf~~~~~~~~~~i~v--~~~~~---~~~~~~~~~---~~~~~~-~~~~~r~~~r 279 (305) ++.++- .+.-.... +...++ +.....+.. ..+.+ +..+..... .....+ -.....+..| T Consensus 293 nl~i~t~peL~~a~~~g~g~~~~~--------~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~ 364 (382) T protein:vir:96 293 KMRIVSAPELSGVQMQGKTPEDAL--------VLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNG 364 (382) T ss_pred CcEEEEccccccccCCCccceeEE--------EEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccc Confidence 111111 11100100 111111 111111100 00000 000000000 000001 1122233344 Q ss_pred Ec-cEeecccceEEEecc Q lcl|Aclame:pro 280 FA-YVLGVSATAQGANKT 296 (305) Q Consensus 280 ~~-~~v~~p~a~~~~~~t 296 (305) .+ ..|.+|.+++..++- T Consensus 365 t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 365 TAGALCKRPWAVVRYLGI 382 (382) T ss_pred eeeeEEEcchhhhhccCC Confidence 44 446789999999987 No 193 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=95.86 E-value=0.0013 Score=36.34 Aligned_cols=287 Identities=12% Similarity=0.051 Sum_probs=149.3 Q ss_pred CCC---ccCC--ccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MAD---ISRA--EVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 
Ma~---~t~~--~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) +|. +..+ .-.+-|-+.+.+.+.+.+.+.+-++++.++++++--.. ++-.-..++-++...-+.. .+ -.|..- T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~-~~-R~~~~~ 93 (357) T protein:vir:56 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGG-TE-RQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCC-CC-cccccc Confidence 332 2222 23567888889999999999999999999998874322 2222233343433211100 00 012111 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccc-------------- Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSP-------------- 138 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~-------------- 138 (305) ..++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-.-.-.|||+-......+. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:56 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQ 173 (357) T ss_pred cccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 34566667777776677788887753 23578999999999999988888888986532211110 Q ss_pred --------cccccccc-cccce---eecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCC Q lcl|Aclame:pro 139 --------ALIPAAVT-AGQAV---EVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANG 202 (305) Q Consensus 139 --------~~~~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G 202 (305) -+++.... .+... -..+...++..+-.+..++... ++..+.. .-++++.+.... 
..-.+-...+ T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:56 174 KYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQ 253 (357) T ss_pred HHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccC Confidence 01111000 00000 0112223455555555555543 3444332 234555555443 2223322333 Q ss_pred ce--------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 203 NP--------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 203 ~~--------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .| +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. + +|.+. T Consensus 254 ~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~----~~llVT~L~NLsIY~Q~gs~RR~~~d~p----~--------r~riE 317 (357) T protein:vir:56 254 DNSEMLAADVIISQKRIGNLPAVRVPYFPA----DAMLITKLENLSIYYMDDSHRRVIEENP----K--------LDRVE 317 (357) T ss_pred ChHHHHHHHHHHHhhhhCCceeEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc----c--------ccccc Confidence 32 22245799999999998874 4478888888765554442 22222211 1 22222 Q ss_pred EEEEEEEccEeecccceEEEeccc--------cccccCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKTP--------VAVVAPAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t~--------~a~v~~a~ 305 (305) -.-..-.||.|.++..++.+.... 
.+...|+| T Consensus 318 ~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~~~~~~a 357 (357) T protein:vir:56 318 NYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATEEPGA 357 (357) T ss_pred chhhhcceeeeeccccEEEeeeeeeccCCCCcccCCCCCC Confidence 222223456666666666555432 22223333 No 194 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=95.80 E-value=0.0014 Score=36.18 Aligned_cols=281 Identities=11% Similarity=0.045 Sum_probs=154.9 Q ss_pred CC---CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MA---DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma---~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +| .+...+-.+-|-+.+.+.+.+.+.+.+-++++.++++++--.. ++-.-..++-++...-+.. .-.|..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~---~R~~~~~~~ 92 (337) T protein:vir:78 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKA---ARQPIDPTA 92 (337) T ss_pred HHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecCCCc---ccccccccc Confidence 33 2233344677888899999999999999999999998874322 2222233343433222111 012222344 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc------------------ Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV------------------ 136 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~------------------ 136 (305) ++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-.-.-.|||+-...... 
T Consensus 93 l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~ 172 (337) T protein:vir:78 93 LDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) T ss_pred cCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHH Confidence 566667777776677888887753 235799999999999999888888889865322111 Q ss_pred ----cccccccccccccceeecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCCce---- Q lcl|Aclame:pro 137 ----SPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANGNP---- 204 (305) Q Consensus 137 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~---- 204 (305) +.-+++........+ ..+...++..+-.+..++... ++..+.. .-++++.+.... ..-.+-...+.| T Consensus 173 Re~ap~rVl~~~~~~~~~i-~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~ 251 (337) T protein:vir:78 173 RERAAQRVLHEGAKQAGKV-LIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERL 251 (337) T ss_pred HhcchhhhhccccccCCce-eecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCcHHHH Confidence 111111111110001 112333455555555555543 3444332 235555655444 222222222333 Q ss_pred ----eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 205 ----VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) Q Consensus 205 ----l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 279 (305) +....++.|+|.+...++|. 
..+++--|+++-+...+|- +=.+.+.+ ++|.+.-.-..- T Consensus 252 Aa~~i~s~k~iGGl~a~~~PfFP~----~~ilVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s~N 315 (337) T protein:vir:78 252 AADLIVSQKRIGNLPAVRVPFFPK----RALMVTKLSNLSIYYQEGARRRTLKEVP------------ERDRIENYESSN 315 (337) T ss_pred HHHHHHHhhhhcCcceEEccccCC----CceEEeechhcEEEEecCcEEEEEEecc------------ccccccchhhcc Confidence 23346899999999998874 4478888888765554442 22222211 223333222334 Q ss_pred EccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) .||.|.++..++.+..-.. ..| T Consensus 316 e~YvVEd~~~~a~iEnI~~---~~a 337 (337) T protein:vir:78 316 DAYVVEDFGCGCVAENIEL---AAA 337 (337) T ss_pred ceeeeeccccEEEEeceee---cCC Confidence 5778888888888774321 112 No 195 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=95.70 E-value=0.0016 Score=35.93 Aligned_cols=287 Identities=12% Similarity=0.057 Sum_probs=150.1 Q ss_pred CCC---ccCC--ccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MAD---ISRA--EVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~---~t~~--~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) +|. +..+ .-.+-|-+.+.+.+.+.+.+.+-++++.++++++--.. ++-.-..++-++...-+.. 
.+ -.|..- T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~-~~-R~~~~~ 93 (357) T protein:vir:60 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGG-TE-RQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcccccccccCCC-CC-cccccc Confidence 332 2222 23567888889999999999999999999998874322 2222233344433211000 00 012222 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccc-------------- Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSP-------------- 138 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~-------------- 138 (305) ..++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-.-.-.|||+-......+. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:60 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQ 173 (357) T ss_pred cccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 34566667777776677788887753 23579999999999999988888888986532211110 Q ss_pred --------cccccccc-cccce---eecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCC Q lcl|Aclame:pro 139 --------ALIPAAVT-AGQAV---EVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANG 202 (305) Q Consensus 139 --------~~~~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G 202 (305) -+++.... .+... -..+...++..+-.+..++... ++..+.. .-++++.+.... 
..-.+-...+ T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:60 174 KYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNREQ 253 (357) T ss_pred HHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCC Confidence 01110000 00000 0112223455555555555543 3444332 235555555443 2222322333 Q ss_pred ce--------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 203 NP--------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 203 ~~--------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .| +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. .+|.+. T Consensus 254 ~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~----~~llVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~riE 317 (357) T protein:vir:60 254 DNSEMLAADVIISQKRIGNLPAVRVPYFPA----DAMLITKLENLSIYYMDDSHRRVIEENP------------KLDRVE 317 (357) T ss_pred ChHHHHHHHHHHHhhhhcCcceEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------cccccc Confidence 33 22345899999999998874 4478888888765554442 22222211 122222 Q ss_pred EEEEEEEccEeecccceEEEeccccccc-cCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKTPVAVV-APAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t~~a~v-~~a~ 305 (305) -.-..-.||.|.++..++.+.....+.. .||. 
T Consensus 318 ~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~ 350 (357) T protein:vir:60 318 NYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAK 350 (357) T ss_pred chhhhcceeeeeccccEEEeeeeeeccCccccc Confidence 2222334666777766666653322111 1222 No 196 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=95.50 E-value=0.002 Score=35.44 Aligned_cols=282 Identities=14% Similarity=0.106 Sum_probs=153.8 Q ss_pred CCC---cc----CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MAD---IS----RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~---~t----~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) +|. +. +.+-.+-|-+.+.+.+.+.+.+.+-++++.++++|+--.. ++-.-..++-++...-.... .-.|. T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~--~R~~~ 93 (342) T protein:vir:10 16 QAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVASTTDTSGDG--ERKTT 93 (342) T ss_pred HHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCcccccccccCCCC--Ccccc Confidence 332 21 2222477888899999999999999999999999874322 23222333444332111000 00122 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccc------------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVS------------- 137 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~------------- 137 (305) .-..++.-.+..++.-.-..|+.+.|.. 
...+|+..+.+.+.+++|.-.-.-.|||+-......+ T Consensus 94 ~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GW 173 (342) T protein:vir:10 94 SIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDRNSNPLLQDVAKGW 173 (342) T ss_pred cccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHH Confidence 2235566667777777777888887753 2357999999999999998888888898653221110 Q ss_pred ---------ccccccccccccceeecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCCce Q lcl|Aclame:pro 138 ---------PALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANGNP 204 (305) Q Consensus 138 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~ 204 (305) .-+.+........ . .+...++..+-.+..++... ++..+.. .-++++.+.... ..-.+-.....| T Consensus 174 lQ~~Re~ap~rv~~~~~~~~~i-~-iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~p 251 (342) T protein:vir:10 174 LQKMREDAKERVMNGESTDNQV-L-VGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQQNAP 251 (342) T ss_pred HHHHHhhhhhhhcccceeccce-e-ecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCh Confidence 0111111111111 1 12223455555555555543 3444332 235555655444 222222222222 Q ss_pred --------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecC-cEEEEeecceeccCcceeeeeecCcEEEE Q lcl|Aclame:pro 205 --------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQD-ITVKFLDQATLGTGENQINLAERDMVALR 275 (305) Q Consensus 205 --------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~-i~v~~~~~~~~~~~~~~~~~~~~~~~~~r 275 (305) +....++.|+|.+...++|. ..+++--|+++-+...+| .+=.+.+.+ ++|.+.-. 
T Consensus 252 tE~~Aa~~i~s~k~iGGl~a~~~PfFP~----~~ilVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y 315 (342) T protein:vir:10 252 TEELAADIVISQKRIGGLKAVRVPFFPA----NAILITKLENLAIYVQEGTTRKHIENVP------------KKDRIETY 315 (342) T ss_pred HHHHHHHHHHhhhhhcCceeEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------ccccccch Confidence 22245899999999998874 447888888876555444 222222211 22333322 Q ss_pred EEEEEccEeecccceEEEeccccccccCC Q lcl|Aclame:pro 276 LKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) Q Consensus 276 ~~~r~~~~v~~p~a~~~~~~t~~a~v~~a 304 (305) -..-.||.|.++..++.+.....+ .|= T Consensus 316 ~s~Ne~YvVEd~~~~a~iE~i~i~--~~~ 342 (342) T protein:vir:10 316 ESENIDYVVEDYGCAALIENITLK--DKE 342 (342) T ss_pred hhhccceeeeccccEEEeecceec--CCC Confidence 233457788888888888755432 232 No 197 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=95.15 E-value=0.0027 Score=34.70 Aligned_cols=282 Identities=13% Similarity=0.040 Sum_probs=148.7 Q ss_pred CCCcc-------CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEE-EEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADIS-------RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHL-PVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t-------~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~-p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) +|... 
..+-.+-|.+.+.+.+.+.+.+.+-++++.+++++.--...+ ....+...++.........+ ..+ T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~r~~t~~~~~~-~~~- 93 (343) T protein:vir:98 16 AAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYGAHDRRTPIQQ-RWT- 93 (343) T ss_pred HHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccCccccCCCccc-ccc- Confidence 33211 222347788899999999999999999999998886322222 11122221211111111000 011 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhc--CHHH-HHHHHHHHHHHHHHHHHHHHHHcCcccCcCc-ccc---------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDD--ATVA-VLTEVAELGGQAIGKKLDQAVIFGTDKPASW-VSP---------- 138 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~d--s~~~-~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~-~~~---------- 138 (305) .+.-.+..++.-.-..|+.+.|.. ...| |+..+.+.+.+++|.-.-.-.|||+-..... .|. T Consensus 94 ----~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nPllqDVN~GWLQ 169 (343) T protein:vir:98 94 ----RQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDPNLADVNKGWIQ 169 (343) T ss_pred ----CCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCcchhhcchHHHH Confidence 111235555665566777777753 1245 8888889998888888878888886532111 111 Q ss_pred --------cccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHHH-HHHhhccCCce--- Q lcl|Aclame:pro 139 --------ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRYE-VANIRDANGNP--- 204 (305) Q Consensus 139 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~~-l~~~kd~~G~~--- 204 (305) -+.+.....+.... .+...++..+-.+..++...++..+.. .-++++.+..... 
.-.+-...+++ T Consensus 170 ~~Re~ap~rVm~~~~~~~~~~~-~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~ptE 248 (343) T protein:vir:98 170 FVRENKATQILTQGATSGEIRL-FGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLIATE 248 (343) T ss_pred HHHhcchhhhhccceeccceeE-ecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCChHH Confidence 11111111111111 122224445444455555555554433 2355566554432 22232233331 Q ss_pred ------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecC-cEEEEeecceeccCcceeeeeecCcEEEEEE Q lcl|Aclame:pro 205 ------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQD-ITVKFLDQATLGTGENQINLAERDMVALRLK 277 (305) Q Consensus 205 ------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~-i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 277 (305) +....++.|+|.+...++|. ..+++--|+++-+...+| .+=.+.+.+ ++|.+.-.=. T Consensus 249 k~Aa~~~~~~k~iGGl~a~~~PfFP~----~~llVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s 312 (343) T protein:vir:98 249 KAALNTHDLMKSFGGMPAMIVPNMPP----RAAIVTSLSNLSIYTQEGSMRRGMKDDD------------DKKAVRDSYY 312 (343) T ss_pred HHHHHHHHHHHhhCCCeeEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------ccccccchhh Confidence 12235799999999998874 447888888876555444 222222211 2233333333 Q ss_pred EEEccEeecccceEEEeccccccccCC-C Q lcl|Aclame:pro 278 ARFAYVLGVSATAQGANKTPVAVVAPA-A 305 (305) Q Consensus 278 ~r~~~~v~~p~a~~~~~~t~~a~v~~a-~ 305 (305) .-.||.|.++..++.+.....+....+ + T Consensus 313 ~Ne~YvVEd~~~~a~iE~i~v~~~~~~g~ 341 (343) T protein:vir:98 313 RNEAYAVEDCGKFMAVDFTKVKLSSGKGT 341 (343) T ss_pred hcceeeeeccccEEEeeeeeeeecCCCCC Confidence 345788899999988887765444432 2 No 198 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=95.03 E-value=0.0029 Score=34.47 Aligned_cols=280 Identities=10% Similarity=0.042 Sum_probs=146.3 Q ss_pred CCC---cc----CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 
MAD---IS----RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~---~t----~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) +|. ++ +.+-.+-|.+.+...+.+.+.+.+-++++.++++|+.-.. ++-.-..++-++...-+ ... T Consensus 13 ~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~-------R~~ 85 (336) T protein:vir:37 13 LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQTG-------RNL 85 (336) T ss_pred HHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccCCC-------ccc Confidence 222 11 2223578889999999999999999999999999874322 22222233333322111 111 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcC--HHHH-HHHHHHHHHHHHHHHHHHHHHcCcccCcC-ccccc--------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDA--TVAV-LTEVAELGGQAIGKKLDQAVIFGTDKPAS-WVSPA--------- 139 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~-~~~v~~~la~~~a~~~d~a~l~G~g~~~~-~~~~~--------- 139 (305) .+..++.-.+..++.-.-..|+.+.|..= ..|+ ...+...+.+++|.-+-.-.|||+-.... ..|.+ T Consensus 86 ~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ 165 (336) T protein:vir:37 86 ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADLSDVNKGWLK 165 (336) T ss_pred cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCcccccchhHHH Confidence 23356666677777777778888887531 2342 34445556666676666777788643211 11111 Q ss_pred ---------ccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHH-HHHHhhccCC-ce-- Q lcl|Aclame:pro 140 ---------LIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRY-EVANIRDANG-NP-- 204 (305) Q Consensus 140 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G-~~-- 204 (305) +++........+...+...++..+-.+..++...++..+.. .-+.++.+.... 
..-.+-..++ .| T Consensus 166 ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~PtE 245 (336) T protein:vir:37 166 LLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPTE 245 (336) T ss_pred HHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHHH Confidence 11111111101111223334555555555555555554433 234555554433 2222333332 22 Q ss_pred ------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEE Q lcl|Aclame:pro 205 ------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLK 277 (305) Q Consensus 205 ------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 277 (305) +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. ++|.+.-.=. T Consensus 246 ~~Aa~~~~~~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s 309 (336) T protein:vir:37 246 KAALGSHNLMGSFGGMNAITPPNFPA----RAAAVTTLKNLSVYTEAESVRRSLRNDE------------DKKGLVTSYY 309 (336) T ss_pred HHHHHHHHHHHhhCCceeEEccccCC----CceEEeechhcEEEEecCcEEEEEEEcc------------ccccccchhh Confidence 12245799999999999874 4488888888866555443 22221111 1223322222 Q ss_pred EEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 278 ARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 278 ~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .--||.|.++..++.+..... 
.-|+- T Consensus 310 ~Ne~YvVEd~~~~a~iE~i~v--~~~~e 335 (336) T protein:vir:37 310 RQEGYVVEDLGLMTAIDHTKV--KLNGE 335 (336) T ss_pred hcceeeeeccccEEEeeeeee--eecCc Confidence 345777888888877775522 12222 No 199 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=94.99 E-value=0.003 Score=34.41 Aligned_cols=280 Identities=11% Similarity=0.048 Sum_probs=145.3 Q ss_pred CCCcc-------CCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccc Q lcl|Aclame:pro 1 MADIS-------RAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t-------~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (305) +|... +.+-.+-|.+.+...+.+.+.+.+-++++.++++|+.-.. ++-.-..++-++...-+ ... T Consensus 13 ~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~-------r~r 85 (336) T protein:vir:37 13 LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQTG-------RNL 85 (336) T ss_pred HHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccCCC-------CCc Confidence 22211 1223578889999999999999999999999999874322 22222233333322211 111 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcC--HHH-HHHHHHHHHHHHHHHHHHHHHHcCcccCcCc-cccc--------- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDA--TVA-VLTEVAELGGQAIGKKLDQAVIFGTDKPASW-VSPA--------- 139 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~-~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~-~~~~--------- 139 (305) ....++.-.+..++.-.-..|+.+.|..= ..| +...+...+.+++|.-+-.-.|||+-..... 
.|.+ T Consensus 86 ~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ 165 (336) T protein:vir:37 86 ATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDLSDVNKGWLK 165 (336) T ss_pred cccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccccccchhHHH Confidence 12235556666777766778888877531 133 3344455556666666667777886432211 1111 Q ss_pred ---------ccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc--ceEEEEchHHHH-HHHHhhccCC-ce-- Q lcl|Aclame:pro 140 ---------LIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA--PDTLLSSLALRY-EVANIRDANG-NP-- 204 (305) Q Consensus 140 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G-~~-- 204 (305) +++........+...+...++..+-.+..++...++..+.. .-++++.+.... ..-.+-..++ .| T Consensus 166 ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~PtE 245 (336) T protein:vir:37 166 LLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPTE 245 (336) T ss_pred HHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHHH Confidence 11111111001111223334555555555555555554433 234555554433 2222323322 22 Q ss_pred ------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEEEEEE Q lcl|Aclame:pro 205 ------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLK 277 (305) Q Consensus 205 ------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 277 (305) +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. ++|.+.-.=. 
T Consensus 246 ~~Aa~~~~~~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~rie~y~s 309 (336) T protein:vir:37 246 KAALGSHNLMGSFGGMNAITPPNFPA----RAAAVTTLKNLSVYTEAESVRRSLRNDE------------DKKGLVTSYY 309 (336) T ss_pred HHHHHHHHHHHhhCCceEEEccccCC----CceEEeeccccEEEEecCcEEEEEEEcc------------ccccccchhh Confidence 12245799999999999874 4478888888866555443 22221111 1233322222 Q ss_pred EEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 278 ARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 278 ~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .-.||.|.++..++.+..... .-|+- T Consensus 310 ~Ne~YvVEd~~~~a~iE~i~v--~~~~e 335 (336) T protein:vir:37 310 RQEGYVVEDLGLMTAIDHTKV--KLNGE 335 (336) T ss_pred hcceeeeeccccEEEeeeeee--ecccc Confidence 345778888888887776532 22222 No 200 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=94.76 E-value=0.0036 Score=34.01 Aligned_cols=287 Identities=11% Similarity=0.054 Sum_probs=150.4 Q ss_pred CCC---ccCC--ccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCce-EEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MAD---ISRA--EVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTT-HLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~---~t~~--~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~-~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) +|. ++.+ .-.+-|-+.+.+.+.+.+.+.+-++++.++++++--.. ++-.-..++-++...-+.. 
.+ -.|..- T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~-~~-R~~~~~ 93 (357) T protein:vir:20 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGG-TE-RQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCC-CC-cccccc Confidence 332 2222 23567888889999999999999999999998874322 2222233343433211100 00 012111 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccc-------------- Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSP-------------- 138 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~-------------- 138 (305) ..++.-.+..++.-.-..|+.+.|.. ...+|+..+.+.+.++++.-.-.-.|||+-......+. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:20 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQ 173 (357) T ss_pred cccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 34566667777776677788887753 23579999999999999988888888986532211110 Q ss_pred --------cccccccc-cccce---eecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCC Q lcl|Aclame:pro 139 --------ALIPAAVT-AGQAV---EVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANG 202 (305) Q Consensus 139 --------~~~~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G 202 (305) -+++.... .+... -..+...++..+-.+..++... ++..+.. .-++++.+.... 
..-.+-...+ T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:20 174 KYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQ 253 (357) T ss_pred HHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccC Confidence 01111000 00000 0112223455555555555543 3444332 234555555443 2223322333 Q ss_pred ce--------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCc-EEEEeecceeccCcceeeeeecCcEE Q lcl|Aclame:pro 203 NP--------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVA 273 (305) Q Consensus 203 ~~--------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-~v~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) .| +....++.|+|.+...++|. ..+++--|+++-+...+|- +=.+.+.. .+|.+. T Consensus 254 ~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~----~~ilVT~L~NLsIY~Q~gs~RR~~~d~p------------~r~riE 317 (357) T protein:vir:20 254 DNSEMLAADVIISQKRIGNLPAVRVPYFPA----DAMLITKLENLSIYYMDDSHRRVIEENP------------KLDRVE 317 (357) T ss_pred ChHHHHHHHHHHHhhhhCCceeEEccccCC----CceEEeeccccEEEEecCcEEEEEEecc------------cccccc Confidence 32 22245799999999998874 4478888888765554442 22222211 122222 Q ss_pred EEEEEEEccEeecccceEEEecccccc-ccCCC Q lcl|Aclame:pro 274 LRLKARFAYVLGVSATAQGANKTPVAV-VAPAA 305 (305) Q Consensus 274 ~r~~~r~~~~v~~p~a~~~~~~t~~a~-v~~a~ 305 (305) -.-..-.||.|.++..++.+.....+. -.||. 
T Consensus 318 ~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~ 350 (357) T protein:vir:20 318 NYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAK 350 (357) T ss_pred chhhhcceeeeeccccEEEeeeeeeccccCCcc Confidence 222234466777777777666432211 11222 No 201 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=94.51 E-value=0.0042 Score=33.60 Aligned_cols=184 Identities=14% Similarity=0.018 Sum_probs=85.2 Q ss_pred EEeehhhHHHhh-----cCHHHHHHHHHHHHHHHHHHHHHHHHHc----CcccCcCcccccccccccccccceeecccch Q lcl|Aclame:pro 88 AVIIPVHENVID-----DATVAVLTEVAELGGQAIGKKLDQAVIF----GTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) Q Consensus 88 ~~~~~is~ell~-----ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) ---.-+++-++. ++..++.+...+++++++++..|+.++. +..+..+. + ...........++... T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~-~-----~~~~g~~~~~~a~~t~ 74 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPV-T-----GQDGGFSVNIGAGNTN 74 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcc-c-----ccccCcceeccccccC Confidence 001123333332 3567899999999999999999988864 22111100 0 0000011111222334 Q ss_pred hhhHHHHHHHHHHHHhhhccccce--EEEEchHHHHHHHHhhc----------cCCceeec---ccccCccceEecCccc Q lcl|Aclame:pro 159 NESDIVGATNRAAKAVASAGWAPD--TLLSSLALRYEVANIRD----------ANGNPVFR---DDSFAGFRTFFNRNGA 223 (305) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~~kd----------~~G~~l~~---~~~l~G~pv~~~~~~~ 223 (305) +...+++.+.++..++...+.... .++++|..+..|.+..| ++|. +-. 
-..+.|++|+.++++| T Consensus 75 ~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~-~~~g~~i~~v~G~~V~~SnnlP 153 (221) T protein:vir:17 75 NAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGD-MNTGKGLYVNAGIRIYKSNVLA 153 (221) T ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeeccccccc-ccccceeeeecCcEEEEeccCC Confidence 556677778878777776654322 46778876666654222 1111 111 1358899999999998 Q ss_pred cCCCCce-EEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccccccc Q lcl|Aclame:pro 224 WDADAAI-EVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) Q Consensus 224 ~~~~~~~-~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~ 302 (305) ...+... ...|+|..- ....+.++.+ | ...+ +.+.+|+|...++.-.-..-. T Consensus 154 ~~~gt~~~~~ag~~~~~-~~~~~~yr~~----------------f---------s~~~-glv~~~~Avgtvkl~~~~~~~ 206 (221) T protein:vir:17 154 SLYGTNLVTDPGDATTS-GENNGSYRPA----------------I---------TDRA-GLVFHKEAADTVEVLLPPSRP 206 (221) T ss_pred cccccccccCCcccccc-cccccccccc----------------c---------cceE-EEEEcchheeeeeeecCCCCC Confidence 6544321 112222100 0000000000 0 0111 234555555444432111111 Q ss_pred CCC Q lcl|Aclame:pro 303 PAA 305 (305) Q Consensus 303 ~a~ 305 (305) |-- T Consensus 207 ~~~ 209 (221) T protein:vir:17 207 PLV 209 (221) T ss_pred cee Confidence 111 No 202 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=94.32 E-value=0.0047 Score=33.32 Aligned_cols=280 Identities=13% Similarity=0.096 Sum_probs=144.5 Q ss_pred CCCccCCccceEcc---HHHHHHHHHHHHhhhhhhhh-c---ceeecCC-CceEEEEEeC-CCceeeeecchhhcccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQ---EAYSDTLLAAAKQGSTVLSA-F---QNVNMGT-KTTHLPVLAT-LPEADWVGESATDPKGVKP 71 (305) Q Consensus 1 Ma~~t~~~gg~lip---~~~~~~i~~~~~~~~~l~~l-~---~~~~~~~-~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~ 71 (305) |.-.. -+.++- .+.+.++.+.+..+++|++. . 
++.+.++ .++..|.... ..++.|-.-.+... . T Consensus 1 mp~~~---lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~----~ 73 (321) T protein:vir:34 1 MPFPN---ISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLP----T 73 (321) T ss_pred CCCch---HHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeec----c Confidence 43311 111111 22334455556666665543 2 3344444 3666776644 67888854333221 1 Q ss_pred cccccceeEEeeeeeEEEeehhhH-HHhhcCH----HHHHHHHHHHHHHHHHHHHHHHHHc-CcccCcCccccc------ Q lcl|Aclame:pro 72 TSKVTWANRTLVAEEIAVIIPVHE-NVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIF-GTDKPASWVSPA------ 139 (305) Q Consensus 72 ~~~~~f~~v~~~~~k~~~~~~is~-ell~ds~----~~~~~~v~~~la~~~a~~~d~a~l~-G~g~~~~~~~~~------ 139 (305) .-...|+.-++..+.+++-+.||- |+++.+. ++|.+.=.+...+.++.+++..+.. |++.+.. +..| T Consensus 74 ~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~-~i~GL~~lv~ 152 (321) T protein:vir:34 74 APQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGR-AINGLDGAVP 152 (321) T ss_pred chhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccc-hhhhhhhhcc Confidence 123478899999999999988887 6665543 4555555566667788888887764 5542211 1111 Q ss_pred ---------cccccc--ccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccC------- Q lcl|Aclame:pro 140 ---------LIPAAV--TAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDAN------- 201 (305) Q Consensus 140 ---------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~------- 201 (305) .+..+. ..-+..+...+..+...+...+..+..+....+..++-|++....+...+...-.. 
T Consensus 153 ~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~ 232 (321) T protein:vir:34 153 VDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAE 232 (321) T ss_pred cCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeecccc Confidence 111110 00112223333334445555566666666656667888999999888776532111 Q ss_pred -CceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceecc-CcceeeeeecCcEEEEEEEE Q lcl|Aclame:pro 202 -GNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGT-GENQINLAERDMVALRLKAR 279 (305) Q Consensus 202 -G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~-~~~~~~~~~~~~~~~r~~~r 279 (305) ++--|..-...|.-++..+......+....+|-|-+.+.+....+=.+......-+.. ++. ...-++.+++.. T Consensus 233 ~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~Nqd----A~~q~I~~~GnL- 307 (321) T protein:vir:34 233 EANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFNQD----AEAQILAWAGNL- 307 (321) T ss_pred cccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccchh----HHhhhhhhhhee- Confidence 2222333456777778777655556677899999998877765543333322221100 000 001112222211 Q ss_pred EccEeecccceEEEecc Q lcl|Aclame:pro 280 FAYVLGVSATAQGANKT 296 (305) Q Consensus 280 ~~~~v~~p~a~~~~~~t 296 (305) ...++.+-.+++.. 
T Consensus 308 ---~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 308 ---TCSGAQFQGRLIAE 321 (321) T ss_pred ---eeecccceeEEeeC Confidence 23344444444333 No 203 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=93.96 E-value=0.0058 Score=32.84 Aligned_cols=278 Identities=10% Similarity=0.010 Sum_probs=147.7 Q ss_pred CCC---ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MAD---ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTH-LPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 Ma~---~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~-~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +|. .....-.+-|-+.+.+.+.+.+.+.+-++++.++++|+.-... +-.-..++-++...-+ ..+ .+.. T Consensus 20 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt~------R~~-r~~~ 92 (341) T protein:vir:27 20 LAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAGG------RFT-KQVG 92 (341) T ss_pred HHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCCC------cee-cccc Confidence 221 2233345677778889999999999999999999988743222 2222233434332211 122 2235 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhc-----CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc----ccc------cc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDD-----ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV----SPA------LI 141 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~d-----s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~----~~~------~~ 141 (305) ++.-.+..++.-.-..|+.+.|.. +..+|+..+.+.+.++++.-+-.-.|+|+-...... 
|.+ =+ T Consensus 93 l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWl 172 (341) T protein:vir:27 93 VGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWI 172 (341) T ss_pred cCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHH Confidence 666677777777777788877742 136789999999999999988888899975221110 100 00 Q ss_pred ----ccc--cccccceeecccchhhhHHHHHHHHHHHH-hhhcccc--ceEEEEchHHHH-HHHHhhccCCcee------ Q lcl|Aclame:pro 142 ----PAA--VTAGQAVEVVGGVANESDIVGATNRAAKA-VASAGWA--PDTLLSSLALRY-EVANIRDANGNPV------ 205 (305) Q Consensus 142 ----~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~~~~~~~-~l~~~kd~~G~~l------ 205 (305) ..+ ..-.......+...++..+-.+..++... ++..+.. .-++++.+.... .--.+-.....|- T Consensus 173 Q~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~ 252 (341) T protein:vir:27 173 AFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPSEQIAAQ 252 (341) T ss_pred HHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCHHHHHHH Confidence 000 00000011112234455554455555443 3333332 134555555443 2222222211110 Q ss_pred ecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEee Q lcl|Aclame:pro 206 FRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLG 285 (305) Q Consensus 206 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~ 285 (305) ....++.|+|.+...++|. ..+++--|+++.+...+|-.=....+ ..+ +|.+.-+ +. +|.|. 
T Consensus 253 ~i~k~iGGlpa~~~PffP~----~~~lVT~L~NLsIY~Q~gs~RR~~~d---~p~--------r~rie~y-es--~YvVE 314 (341) T protein:vir:27 253 KLDKTIAGRPAYVPPFLPD----NAMVVTIPENLQVLTQHGTAQRKAKH---ESD--------RKRSKTH-TG--AWKVT 314 (341) T ss_pred HHHHhhCCCeEEEccccCC----CceEEeeccceEEEEecCcEEEEEEe---ccc--------cccccch-hh--hheee Confidence 0135799999999999874 44888888888766655532211111 111 1222211 11 46677 Q ss_pred cccceEEEeccccccccCCC Q lcl|Aclame:pro 286 VSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 286 ~p~a~~~~~~t~~a~v~~a~ 305 (305) +-.++..+..+.+-. |++ T Consensus 315 dyg~~~~~~~~~vkl--~~~ 332 (341) T protein:vir:27 315 QWVCWKRSPLTTQKK--STS 332 (341) T ss_pred hhhhhhhcccccccc--Ccc Confidence 766666666654322 333 No 204 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=93.96 E-value=0.0058 Score=32.83 Aligned_cols=284 Identities=11% Similarity=-0.018 Sum_probs=128.5 Q ss_pred CCCccCCccceEccHHHHH----HHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCCceeeeecchhhcccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSD----TLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~----~i~~~~~~~~~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~ 73 (305) .+-.|+. +.=+|-.+.+ .|++.+..-....++..+.+.+. ....+++.+....+.+.+-+.. .|.. 
T Consensus 72 ~~~~t~~--~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D-----~Pl~ 144 (388) T protein:vir:99 72 VAPTTQA--SIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTN-----IPLS 144 (388) T ss_pred ccccccC--cccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccC-----CCce Confidence 1122222 2225666654 34444444444555655555432 2556666666667777765433 3444 Q ss_pred cccceeEEeeeeeEEEeehhhH-HHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccc-c Q lcl|Aclame:pro 74 KVTWANRTLVAEEIAVIIPVHE-NVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG-Q 149 (305) Q Consensus 74 ~~~f~~v~~~~~k~~~~~~is~-ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~-~ 149 (305) +...+...-..+.++....++. |+-.- ...++...-....++++.+++|+-.|+|..........|+++.-.... . T Consensus 145 d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v 224 (388) T protein:vir:99 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) T ss_pred eccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCccccc Confidence 4333333333344444556665 33332 345778888888889999999999999953222222334443211110 1 Q ss_pred ceee-----cccchhhhHHHHHHHHHHHHhhhccc-------cceEEEEchHHHHHHHHhhccCCceeec--ccccCccc Q lcl|Aclame:pro 150 AVEV-----VGGVANESDIVGATNRAAKAVASAGW-------APDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFR 215 (305) Q Consensus 150 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~~kd~~G~~l~~--~~~l~G~p 215 (305) ..+. .-...+.+.+++++..+..++..... .+..+++.+..+..|.+. +..|.-++. .....++. 
T Consensus 225 ~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n~Pnl~ 303 (388) T protein:vir:99 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVR 303 (388) T ss_pred ccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHHHHHHHhcCCcE Confidence 1111 11123556778888888777654332 122578899888888533 333322221 11111221 Q ss_pred eE-ecCccccC-CCCce--EEEEeh-hhEEEEee-cCcEEE-EeecceeccCcceeeeeecCcEEEEEEEEEccE-eecc Q lcl|Aclame:pro 216 TF-FNRNGAWD-ADAAI--EVIADS-SRVKIGVR-QDITVK-FLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVS 287 (305) Q Consensus 216 v~-~~~~~~~~-~~~~~--~~~gdf-~~~~~~~~-~~i~v~-~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p 287 (305) ++ +.+.-..+ .+.+. .++.+- .....+-. ...+.. ...+. +..-... ...-.....+..|.++. |.+| T Consensus 304 i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~-~~~l~vq---~~~~~~~~~~~~rt~Gv~ir~P 379 (388) T protein:vir:99 304 VMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSK-FVTLGVE---KRVKNYVEAYSNATAGVMLKRP 379 (388) T ss_pred EEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccc-cccccce---ecCceeEeccccceeeeEEecc Confidence 21 11111111 11121 111111 00000000 000000 00000 0000000 00112334455566555 6779 Q ss_pred cceEEEecc Q lcl|Aclame:pro 288 ATAQGANKT 296 (305) Q Consensus 288 ~a~~~~~~t 296 (305) .+++.+++- T Consensus 380 ~Ai~~~~GI 388 (388) T protein:vir:99 380 WAVVRLIGL 388 (388) T ss_pred chhheeccC Confidence 999999987 No 205 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=93.37 E-value=0.0078 Score=32.14 Aligned_cols=271 Identities=9% Similarity=0.022 Sum_probs=113.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-----cC--CCceEEEEEeCCCceeeeecchhhcccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-----MG--TKTTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) Q Consensus 1 
Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-----~~--~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~ 73 (305) ||..- -..+|+.+.++.++.++++.++.++++.-. .. +++++||+........+ .......+... T Consensus 1 MAN~l----lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~----~~~~~~~~~~~ 72 (423) T protein:vir:35 1 MANNL----ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERT----ETGDITGKDKN 72 (423) T ss_pred Cccch----hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecc----cCcCCCCcccc Confidence 88421 234899999999999999999988876522 11 56888887542221111 11000011111 Q ss_pred cccce--eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccce Q lcl|Aclame:pro 74 KVTWA--NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 74 ~~~f~--~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) +..-. .++++.+|... ++++++=...+..++++++...+ ++++.++|..++.---... + . .+ T Consensus 73 ~~~e~~v~l~id~~k~~a-~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a---~----~-------~v 136 (423) T protein:vir:35 73 GLFSAKATGKVGKYITVA-VEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNG---A----L-------SL 136 (423) T ss_pred ccccceeeEEeccceecc-ceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcc---c----c-------cc Confidence 11112 25555555433 46666544445678887777664 7799999988874210000 0 0 00 Q ss_pred eeccc-chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh----hccC--Cceeec----ccccCccceEecC Q lcl|Aclame:pro 152 EVVGG-VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI----RDAN--GNPVFR----DDSFAGFRTFFNR 220 (305) Q Consensus 152 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~----kd~~--G~~l~~----~~~l~G~pv~~~~ 220 (305) ...+. ...++++.+.-..+.....+.. 
.-..+++|..+..|.+- ...+ +.--++ .+++.|+.++.++ T Consensus 137 gt~~t~~~~~~~i~~a~~~Ld~~~vP~~--~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Sn 214 (423) T protein:vir:35 137 GSPNTAIKKWADVAQTASFIKDIGIKTG--ENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSN 214 (423) T ss_pred ccccCCcchHHHHHHHHHHHHHhcCCcC--CCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcC Confidence 00111 1223343333333322222221 22568899988877531 1111 111121 2478999999999 Q ss_pred ccccCC-CC--ceEE-----------EEehhhEEEEeecCcEEEEeecceeccCcceeeeee-------c---------- Q lcl|Aclame:pro 221 NGAWDA-DA--AIEV-----------IADSSRVKIGVRQDITVKFLDQATLGTGENQINLAE-------R---------- 269 (305) Q Consensus 221 ~~~~~~-~~--~~~~-----------~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~-------~---------- 269 (305) ++|..+ +. +.+. ..+.+...++..+ .+ .....++..++. ..... . T Consensus 215 nvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~-~~--~~~~g~l~~GD~-~t~aGv~~v~~~t~~~~~~~~t~ 290 (423) T protein:vir:35 215 GLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTG-AT--PSKTGFLKAGDQ-LKFTSTHWLNQQSKQTLYNGSTA 290 (423) T ss_pred CCccccccccccceeeccccccccccccccccceeeeee-ee--eccCCcEEecce-EEeeeeeeccccccceeecccCC Confidence 988532 11 1111 0111111111110 00 111111111110 00000 0 Q ss_pred CcEEEEEEEEEccEeecccceEEEeccccccc-------------cCCC Q lcl|Aclame:pro 270 DMVALRLKARFAYVLGVSATAQGANKTPVAVV-------------APAA 305 (305) Q Consensus 270 ~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v-------------~~a~ 305 (305) ....+++..-. 
.....+...++..|+ ++ +||+ T Consensus 291 ~~~~~~V~~~~---~~~a~g~~~v~i~p~-~~~~~~~~~~~~v~a~~a~ 335 (423) T protein:vir:35 291 MSFTATVLEET---NSTASGDVTVKLSGV-PIYDEKNSQYNAVDAKVKA 335 (423) T ss_pred ceeEEEEeccc---cccccCceeEEcccc-ccccCCCcccccccccccC Confidence 01111111000 000011112222222 11 1222 No 206 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=91.91 E-value=0.014 Score=30.81 Aligned_cols=291 Identities=12% Similarity=0.096 Sum_probs=133.1 Q ss_pred CCCccCCc--cceEccH-HHHHHHHHHHHhhhhhhhhcceeecCCC---ceEEEEEeCCCcee-eeecchhhccccc--- Q lcl|Aclame:pro 1 MADISRAE--VASLIQE-AYSDTLLAAAKQGSTVLSAFQNVNMGTK---TTHLPVLATLPEAD-WVGESATDPKGVK--- 70 (305) Q Consensus 1 Ma~~t~~~--gg~lip~-~~~~~i~~~~~~~~~l~~l~~~~~~~~~---~~~~p~~~~~~~a~-~v~E~~~~~~~~~--- 70 (305) -+..++.+ .|.-+-. .+....+..+++.-.+.+++...|++.+ +.++-+...-+.+. ...||..-...+. T Consensus 9 ~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g 88 (401) T protein:vir:95 9 DGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNG 88 (401) T ss_pred ccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCcccccccCc Confidence 12222222 2334434 3345556666666888899999998743 44444433222211 2334432111100 Q ss_pred -------ccc-------------------cccceeEEeeeeeEEEeehhhHHHhh-cCHHHHHHHH-HHHHHHHHHHHH- Q lcl|Aclame:pro 71 -------PTS-------------------KVTWANRTLVAEEIAVIIPVHENVID-DATVAVLTEV-AELGGQAIGKKL- 121 (305) Q Consensus 71 -------~~~-------------------~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~~~~~v-~~~la~~~a~~~- 121 (305) ..+ ..+-..+..+.+++|.+..+|+++.. 
++...+.+.+ .+.|.-+..+.+ T Consensus 89 ~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d 168 (401) T protein:vir:95 89 NLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEA 168 (401) T ss_pred cccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHH Confidence 000 11223455678899999999998765 4455666654 344444433333 Q ss_pred --HHHHHcCcccCcCcccccccccccccccceeecccchhhhHHHHHHHHHHHHhh---------hccc------cceEE Q lcl|Aclame:pro 122 --DQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVA---------SAGW------APDTL 184 (305) Q Consensus 122 --d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~------~~~~~ 184 (305) -..+|++-+.-- -+..... ......-..+.+..+.+++......+..... +... ..-.- T Consensus 169 ~i~~dll~ag~~vi--yAg~ats--~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va 244 (401) T protein:vir:95 169 VLQKDLLAAAGTVL--YAGAATS--DATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVM 244 (401) T ss_pred HHHHHHHhhcCeee--cCCccce--eeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEE Confidence 355664322100 0000000 0000111223333344444333222221000 0000 11135 Q ss_pred EEchHHHHHHHHhhccCCceeecc---------------cccCccceEecCccc--------cC-------------CCC Q lcl|Aclame:pro 185 LSSLALRYEVANIRDANGNPVFRD---------------DSFAGFRTFFNRNGA--------WD-------------ADA 228 (305) Q Consensus 185 v~~~~~~~~l~~~kd~~G~~l~~~---------------~~l~G~pv~~~~~~~--------~~-------------~~~ 228 (305) ++|+.....|+.++|-.|.|-|.+ +.+.+++++++..+. .. .+. 
T Consensus 245 ~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~ 324 (401) T protein:vir:95 245 YVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEH 324 (401) T ss_pred EEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccccCCCc Confidence 779999999988888777666643 356677777665421 10 011 Q ss_pred c----eEEEEehhhEEEEeecC-cE--EEE-eecceec--cCcceeeeeecCcEEEEEEEEEccEeecccceEEEecccc Q lcl|Aclame:pro 229 A----IEVIADSSRVKIGVRQD-IT--VKF-LDQATLG--TGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) Q Consensus 229 ~----~~~~gdf~~~~~~~~~~-i~--v~~-~~~~~~~--~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~ 298 (305) . .+++|+=....++..++ .. +++ -+...+. +.++++. |+..+.++ ...++.+++++-.+++... T Consensus 325 ~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlg--Q~g~vgwK--~~~a~~vL~~e~m~~ies~-- 398 (401) T protein:vir:95 325 YDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYG--ETGFSSIK--WYYGILVKRPERLALIKTV-- 398 (401) T ss_pred ceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCccc--ceehhhhh--hhhhhheeccceeEEEEee-- Confidence 1 24566654444544322 11 111 1222222 1223322 22223322 2446668888888887753 Q ss_pred ccc Q lcl|Aclame:pro 299 AVV 301 (305) Q Consensus 299 a~v 301 (305) +++ T Consensus 399 a~~ 401 (401) T protein:vir:95 399 APL 401 (401) T ss_pred cCC Confidence 233 No 207 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=90.96 E-value=0.018 Score=30.13 Aligned_cols=271 Identities=10% Similarity=-0.000 Sum_probs=114.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-----c--CCCceEEEEEeCCCceeeee-cchhhccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-----M--GTKTTHLPVLATLPEADWVG-ESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-----~--~~~~~~~p~~~~~~~a~~v~-E~~~~~~~~~~~ 72 (305) ||..- -..+|+.+.++.++.+++..++.+++..-. . 
.+++++|++........+-+ .+......+..+ T Consensus 1 MaN~l----lT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MPNNL----DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccch----hhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 88421 224899999999999999999888876521 1 36688887654222211111 111111111111 Q ss_pred ccccceeEEeeeeeEEEeehhhH-HHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcC-cccCcCcccccccccccccccc Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHE-NVIDDATVAVLTEVAELGGQAIGKKLDQAVIFG-TDKPASWVSPALIPAAVTAGQA 150 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~-ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G-~g~~~~~~~~~~~~~~~~~~~~ 150 (305) .--.++++.+|...+ ++++ |+. ....++++++... .++++..+|..++.- .+.+... .... T Consensus 77 ---~~v~l~id~~k~va~-~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~--------~gt~--- 139 (423) T protein:vir:10 77 ---GKATGRVGNYITVAV-EYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALS--------LGSP--- 139 (423) T ss_pred ---ceeEEEeeceeeeee-eechHHHh-cChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccccc--------cccC--- Confidence 111356666665444 4555 544 4456787766555 688999999988742 1111100 0000 Q ss_pred eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh----h--ccCCceeec----ccccCccceEecC Q lcl|Aclame:pro 151 VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI----R--DANGNPVFR----DDSFAGFRTFFNR 220 (305) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~----k--d~~G~~l~~----~~~l~G~pv~~~~ 220 (305) ......++++.+.-..+.....+.. .=..+++|..+..|.+. . 
+..+.--++ ..++.|+.++.++ T Consensus 140 ---~t~~~a~~~i~~a~~~Ld~~~vP~~--~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Sn 214 (423) T protein:vir:10 140 ---NTPITKWSDVAQTASFLKDLGVNEG--ENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSN 214 (423) T ss_pred ---CcccchHHHHHHHHHHHHhccCCcC--CCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeC Confidence 0011223444333333322222221 22578999988877532 1 111111122 2478899999999 Q ss_pred ccccCCCC---ce--EEEEeh-hhEEEEeecCcEEEE-----eecceeccCcceeeeeecCc------------------ Q lcl|Aclame:pro 221 NGAWDADA---AI--EVIADS-SRVKIGVRQDITVKF-----LDQATLGTGENQINLAERDM------------------ 271 (305) Q Consensus 221 ~~~~~~~~---~~--~~~gdf-~~~~~~~~~~i~v~~-----~~~~~~~~~~~~~~~~~~~~------------------ 271 (305) ++|..+.. +. .-.+-+ .+.......+..+.+ ....++..++ .|..+- T Consensus 215 nip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD----~~t~aGv~~v~~~tk~~~~~~~t~ 290 (423) T protein:vir:10 215 GLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGD----QVKFTNTYWLQQQTKQALYNGATP 290 (423) T ss_pred CCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecc----eEEecceeeecccccccccccccC Confidence 88853211 01 111110 000000111111111 1111111111 111100 Q ss_pred --EEEEEEEEEccEeecccceEEEecccccccc-------------CCC Q lcl|Aclame:pro 272 --VALRLKARFAYVLGVSATAQGANKTPVAVVA-------------PAA 305 (305) Q Consensus 272 --~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~-------------~a~ 305 (305) ..+++..-. ..+-.+...++..|+ ++. 
||+ T Consensus 291 ~~~~~~v~a~~---~~~~~g~~tv~i~p~-~i~~~~~~~~~~v~a~~a~ 335 (423) T protein:vir:10 291 ISFTATVTADA---NSDSGGDVTVTLSGV-PIYDTTNPQYNSVSRQVEA 335 (423) T ss_pred cceEEEEEeee---eeccCCceeeeccCc-cccccCCcccccccccccC Confidence 111111100 000001112222221 111 111 No 208 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=89.29 E-value=0.0058 Score=32.85 Aligned_cols=270 Identities=13% Similarity=0.110 Sum_probs=103.1 Q ss_pred CCC--ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccc-ccccccccc Q lcl|Aclame:pro 1 MAD--ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKG-VKPTSKVTW 77 (305) Q Consensus 1 Ma~--~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~-~~~~~~~~f 77 (305) ++. +|-+|.-..+|+.+...|...+..+.++++...+..++.--++.. +..+. |.+...+| ++.+...+| T Consensus 35 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~A------eAq~HkdGqTK~eqa~~~ 107 (318) T protein:vir:86 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSSA------EAQVHKDGQTKTEQAATL 107 (318) T ss_pred hhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhhh-hhhhh------hhhhhccCCccccceeee Confidence 332 333555678999999999999999999988666665543222211 11222 33333222 233333334 Q ss_pred eeEEeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHH-HHHHHHHHcCcccCcCcccccccccccccccceee Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIG-KKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a-~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) .--++.+.-+.....+ -|+.+ .+...+..+|..+|+.+|. +..|.+++.|+|+.. 
+....-...........+ T Consensus 108 ~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~-f~~~DK~advK~I~k~Tt- 184 (318) T protein:vir:86 108 TIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGSNG-FKSIDKEADVKKIKKITT- 184 (318) T ss_pred eeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheeecCCCC-ccchhhHHHHHHHHHHhh- Confidence 3333333322222222 24443 3455679999999999999 899999999999643 111000000000000000 Q ss_pred cccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCc---eeecccccCccceEecCccccCCCC-- Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGN---PVFRDDSFAGFRTFFNRNGAWDADA-- 228 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~---~l~~~~~l~G~pv~~~~~~~~~~~~-- 228 (305) ....+....+...+..+..-+.+...+.--++-.......|..++-+..+ .+-.+++-....|-+.+.+.....+ T Consensus 185 kaksagttpfanaieeavdfvrptagrrylivkaedrkalldelrqatanahvriknddteiasevgvdeiivytgskal 264 (318) T protein:vir:86 185 KAKSAGTTPFANAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANAHVRIKNDDTEIASEVGVDEIIVYTGSKAL 264 (318) T ss_pred hhhccCCCchhhHHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhcccceeEEeccchhhhhhcCcceeeeeeccccc Confidence 00000001111222222222222111111122222222333333322221 1112221111111111111111111 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeee--eecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINL--AERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~--~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .+-++.|.+ |.+. -.++ ..++. |..|.-.+.++.--.+-|.--.+-+.++.. 
T Consensus 265 kptvlvdqk-yhid-mqdl--------------tkvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 265 KPTVLVDQK-YHID-MQDL--------------TKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred cceeeeccc-eecc-hhhh--------------hhhhcceeccCCceEEEeecccCcceeecCceeEEeC Confidence 112222221 1110 0010 01112 222333333343333333222222222222 No 209 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=89.03 E-value=0.029 Score=29.03 Aligned_cols=278 Identities=10% Similarity=0.016 Sum_probs=113.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-----c--CCCceEEEEEeCCCceeeeecchhhcccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-----M--GTKTTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-----~--~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~ 73 (305) ||.. -..++|+.+++++++.+++..++.+++..-. . .+++++||+........ ..+......+.... T Consensus 1 MANs----l~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d--~~~~~~t~~~~~~l 74 (423) T protein:vir:10 1 MANN----LDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSER--TMDGDITGKSKNSL 74 (423) T ss_pred Cccc----cccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeec--ccCcccCccccccc Confidence 8832 2348999999999999999999988876522 2 25688887643211110 00000000000000 Q ss_pred cccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceee Q lcl|Aclame:pro 74 KVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) Q Consensus 74 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) ...--.++++.+|...+ +++++=...+..++++++... .++++..+|+.+......... . ..+.. 
T Consensus 75 ~e~~v~l~id~~k~~a~-~v~d~E~~l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~-------~---~vgt~--- 139 (423) T protein:vir:10 75 ISAKATGEVGNYITVAV-EYRQIEEALKLNQLDQILVPI-NERMVTDLETELALFMMKHGA-------L---SLGSP--- 139 (423) T ss_pred ccceEEEEecceeeeee-eeChHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhccc-------c---ccccc--- Confidence 00012455666665444 565543335677887766554 688999999988632211000 0 00000 Q ss_pred cccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHH----hhccC--Cceeec----ccccCccceEecCccc Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN----IRDAN--GNPVFR----DDSFAGFRTFFNRNGA 223 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~----~kd~~--G~~l~~----~~~l~G~pv~~~~~~~ 223 (305) ......++++.+.-..+.....+.. .=..+++|.....|.+ +...+ +.--++ ..++.|+.++.++++| T Consensus 140 ~t~~~a~~~~a~a~~~L~~~~vP~~--~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp 217 (423) T protein:vir:10 140 NTPIKKWSDVAQTASFLKDLGINSG--ENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLA 217 (423) T ss_pred ccccccHHHHHHHHHHHhhccCCcC--CCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCc Confidence 0011223344333332322222222 2257899998888753 22211 111122 2478899999999887 Q ss_pred cC-CCCce--------E-EEEeh-------hhEEEEeecC--cEEEEeecceeccCcceeeeeecC---------cEEEE Q lcl|Aclame:pro 224 WD-ADAAI--------E-VIADS-------SRVKIGVRQD--ITVKFLDQATLGTGENQINLAERD---------MVALR 275 (305) Q Consensus 224 ~~-~~~~~--------~-~~gdf-------~~~~~~~~~~--i~v~~~~~~~~~~~~~~~~~~~~~---------~~~~r 275 (305) .- .+... . +-|+- ....+..-.. -.|..-|..++..- ...+...+. 
...++ T Consensus 218 ~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv-~~v~~~tk~~l~~~~~~~~~~~~ 296 (423) T protein:vir:10 218 SRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDT-HWLNQQSKQTLYNGASALSFTAT 296 (423) T ss_pred ccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecce-eeecccccceeecccCCcceEEE Confidence 42 11111 1 10110 0000000000 00111111111000 000000000 00111 Q ss_pred EEEEEccEeecccceEEEeccccc--cc----------cCCC Q lcl|Aclame:pro 276 LKARFAYVLGVSATAQGANKTPVA--VV----------APAA 305 (305) Q Consensus 276 ~~~r~~~~v~~p~a~~~~~~t~~a--~v----------~~a~ 305 (305) +.. +....-+.++ .++..|+- .+ .||+ T Consensus 297 V~~--~~~~~a~~~~-tv~i~p~~~~~~~~~~~~~V~a~~a~ 335 (423) T protein:vir:10 297 VME--DANAHSSGDV-TVKISGVPIFDAGYPQYNAVDRLLAE 335 (423) T ss_pred EEe--cccccccCce-EEEeccccccccCcccccceeccccC Confidence 111 0000011111 12222211 00 1222 No 210 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=88.13 E-value=0.0061 Score=32.74 Aligned_cols=270 Identities=13% Similarity=0.112 Sum_probs=105.1 Q ss_pred CCC--ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccc-ccccccccc Q lcl|Aclame:pro 1 MAD--ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKG-VKPTSKVTW 77 (305) Q Consensus 1 Ma~--~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~-~~~~~~~~f 77 (305) ++. +|-+|.-..+|+.+...|...+..+.++++...+..++.-.++.. +.... 
|.+...+| ++.+...+| T Consensus 110 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~~------eAq~HkdGqTK~eqa~~~ 182 (393) T protein:vir:16 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSAN------EAQVHKDGQTKTEQAATL 182 (393) T ss_pred HhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhh-hhhhh------hhhhhccCCccccceeee Confidence 332 333555678999999999999999999988666655543222211 11222 22322222 233323334 Q ss_pred eeEEeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHH-HHHHHHHHcCcccCcCcccccccccccccccceee Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIG-KKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a-~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) .--++.+.-+.....+ -|+.. .+...+..+|..+|+.+|. +..|.+++.|+|+.. +....-.+........ +. T Consensus 183 ~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~-f~~~DK~advK~I~k~-Tt 259 (393) T protein:vir:16 183 TIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKI-TT 259 (393) T ss_pred eeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCC-ccchhhHHHHHHHHHH-hh Confidence 3333333322222222 24443 3455679999999999999 899999999998643 1110000000000000 00 Q ss_pred cccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhcc--CCc-eeecccccCccceEecCccccCCCC-- Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDA--NGN-PVFRDDSFAGFRTFFNRNGAWDADA-- 228 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~--~G~-~l~~~~~l~G~pv~~~~~~~~~~~~-- 228 (305) ....+....+.+.+..+..-+.+...+.--++-.......|..++-+ +.+ .+-.+++-....|-+.+.+.....+ T Consensus 260 kaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddteiasevgvdeiivytgskal 339 (393) T protein:vir:16 260 KAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAL 339 (393) T ss_pred hhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeeccccc Confidence 01111111223333333333332211111222222222333333322 211 2222222111111111111111111 Q ss_pred 
ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeee--eecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINL--AERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~--~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .+-++.|.++- +. -.++ ..++. |..|.-.+.++.--.+-|.--.+-+.++.. T Consensus 340 kptvlvdqkyh-id-mqdl--------------tkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 340 KPTVLVDQKYH-ID-MQDL--------------TKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred cceeeeccccc-cc-hhhh--------------hhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 11222232211 10 0010 01112 222333333333333333222222222222 No 211 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=87.72 E-value=0.037 Score=28.44 Aligned_cols=274 Identities=9% Similarity=-0.004 Sum_probs=115.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceee-----c--CCCceEEEEEeCCCceeeee-cchhhccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVN-----M--GTKTTHLPVLATLPEADWVG-ESATDPKGVKPT 72 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~-----~--~~~~~~~p~~~~~~~a~~v~-E~~~~~~~~~~~ 72 (305) ||..- -..+|+.+.++.++.+++..++.+++..-. . 
.+++++||+........+-+ .+......+..+ T Consensus 1 MaN~l----lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPNNL----DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccch----hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 88532 124899999999999999999888876522 1 35688888633211111100 110001111111 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcC-cccCcCcccccccccccccccce Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFG-TDKPASWVSPALIPAAVTAGQAV 151 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G-~g~~~~~~~~~~~~~~~~~~~~~ 151 (305) .--.++++.+|...+ +++++=......++++++... .++++..+|..++.- .+.+.. . .+... T Consensus 77 ---~~v~l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~--------~---~gt~~ 140 (423) T protein:vir:17 77 ---GKATGRVGNYITVAV-EYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGAL--------S---LGSPN 140 (423) T ss_pred ---ceeEEEeeceeeeee-eecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccc--------c---cccCC Confidence 112466666665444 555543334566787766555 588999999887732 111100 0 00000 Q ss_pred eecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHh----hc--cCCceeec----ccccCccceEecCc Q lcl|Aclame:pro 152 EVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANI----RD--ANGNPVFR----DDSFAGFRTFFNRN 221 (305) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~----kd--~~G~~l~~----~~~l~G~pv~~~~~ 221 (305) .....++++.+.-..+.....+.. .=..+++|..+..|.+. .. 
..+.--++ ..++.|+.++.+++ T Consensus 141 ---t~~~a~~~i~~a~~~Ld~~~vP~~--~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snn 215 (423) T protein:vir:17 141 ---TPITKWSDVAQTASFLKDLGVNEG--ENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNG 215 (423) T ss_pred ---cccccHHHHHHHHHHHHhccCCcC--CCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCC Confidence 011124444333333332222221 22578999998877532 11 11111122 24788999999999 Q ss_pred cccCCCCc---eEE--EEeh-hhEEEEeecC--cEEEE---eecceeccCcceeeeeecCcEEEEEEEEEccEee----- Q lcl|Aclame:pro 222 GAWDADAA---IEV--IADS-SRVKIGVRQD--ITVKF---LDQATLGTGENQINLAERDMVALRLKARFAYVLG----- 285 (305) Q Consensus 222 ~~~~~~~~---~~~--~gdf-~~~~~~~~~~--i~v~~---~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~----- 285 (305) +|..+... .+. .+.. .......... +.+.. ....++..++ .|... .++.-.++...+. T Consensus 216 ip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD----~~t~a--Gv~~v~~~tk~v~~~~~t 289 (423) T protein:vir:17 216 LASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGD----QVKFT--NTYWLQQQTKQALYNGAT 289 (423) T ss_pred CccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecc----eEEec--ceeeeccccccccccccc Confidence 88532111 111 1111 0111111000 01110 0111111111 11111 1111122222221 Q ss_pred -cccceEEEecc-c------cccccCCC Q lcl|Aclame:pro 286 -VSATAQGANKT-P------VAVVAPAA 305 (305) Q Consensus 286 -~p~a~~~~~~t-~------~a~v~~a~ 305 (305) ++.-|...... . 
+=.+.|+- T Consensus 290 ~~~~~~~v~~~~~~~a~~~~tv~i~p~~ 317 (423) T protein:vir:17 290 PISFTATVTADANSDSSGDVTVTLSGVP 317 (423) T ss_pred ccceEEEEEecccccccCceEEEecCcc Confidence 12222211100 0 00111221 No 212 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=86.49 E-value=0.045 Score=27.96 Aligned_cols=269 Identities=9% Similarity=-0.025 Sum_probs=114.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhh-hcc--eeecCCCceEEEEEeCCCceeeeecchhhcccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLS-AFQ--NVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~-l~~--~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) .+.-+-.-+...+-+-+...+-+.+...+.-.. +++ .....+++++||+.....-..+ .-......+.+ +.+. T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY-~R~~g~~~g~v---t~~~ 105 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDY-KRNATNEFDHP---QIQE 105 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccc-cCCCCcccccc---ccce Confidence 222111112222333343444333333221111 122 3455678999999865432222 11111111222 2244 Q ss_pred eeEEeeeeeEEEeehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeecc Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG 155 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) ...+++..|.-.+. |.+==..++.. .+...+.+...+.++..+|.-.+.---...+ . ... 
T Consensus 106 ~t~tidqdR~~~F~-VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~-----------~------~~~ 167 (329) T protein:vir:10 106 TTYFLDQEKYWGRF-VDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKA-----------K------HLT 167 (329) T ss_pred eEEEeecccceeee-cchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcc-----------c------ccc Confidence 45556555543332 11100122222 2345555666667777777554421100000 0 001 Q ss_pred cchhhhHHHHHHHHHHHHhhhccc-cceEEEEchHHHHHHHHhh----c--cCCceee--cccccCccceEecCccccCC Q lcl|Aclame:pro 156 GVANESDIVGATNRAAKAVASAGW-APDTLLSSLALRYEVANIR----D--ANGNPVF--RDDSFAGFRTFFNRNGAWDA 226 (305) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~~k----d--~~G~~l~--~~~~l~G~pv~~~~~~~~~~ 226 (305) ...+.+..++.+.++...+..... ..=.++++|.++..|.+.. . .....+. +-..+.|++|+...... . T Consensus 168 ~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~--~ 245 (329) T protein:vir:10 168 VGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPSKM--L 245 (329) T ss_pred cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeeeeecCeEEEEecCCc--c Confidence 112333455556666555554432 1225788999998887532 1 1111111 22568999988543211 1 Q ss_pred CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEe--ccccccccCC Q lcl|Aclame:pro 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGAN--KTPVAVVAPA 304 (305) Q Consensus 227 ~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~--~t~~a~v~~a 304 (305) .+..+++|..+.......-. .+++.+.. .. ++--.++...+.|.-|.+|++..... 
.+..+....| T Consensus 246 k~in~ii~~~~A~~~~~K~~-~~~~~~p~-----~~------~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~~~~ 313 (329) T protein:vir:10 246 QGVEAMAVIGEVMASPIQAN-EAKLNSNV-----PG------MFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETNRDG 313 (329) T ss_pred cceeEEEEcCCceeeeeeee-eeeeeCCC-----Cc------cchheeeeeeeeeeEEEccccCEEEEecccCcccCCCC Confidence 12234455444333222111 12221110 00 11236778889999999987444333 3222222222 Q ss_pred C Q lcl|Aclame:pro 305 A 305 (305) Q Consensus 305 ~ 305 (305) + T Consensus 314 ~ 314 (329) T protein:vir:10 314 V 314 (329) T ss_pred C Confidence 2 No 213 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=86.21 E-value=0.011 Score=31.22 Aligned_cols=270 Identities=13% Similarity=0.109 Sum_probs=104.4 Q ss_pred CCC--ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccc-ccccccccc Q lcl|Aclame:pro 1 MAD--ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKG-VKPTSKVTW 77 (305) Q Consensus 1 Ma~--~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~-~~~~~~~~f 77 (305) ++. +|-+|.-..+|+.+...|...+..+.++++...+..++.--++.. +.... |.+...+| ++.+...+| T Consensus 117 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~~------~Aq~HkdGqTK~eqa~~~ 189 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSAN------EAQVHKDGQTKTEQAATL 189 (400) T ss_pred HhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhh-hhhhh------hhhhhccCCccccceeee Confidence 332 333555678999999999999999999988666665543222211 11222 22322222 233333334 Q ss_pred eeEEeeeeeEEEeehhhHHHhh---cCHHHHHHHHHHHHHHHHH-HHHHHHHHcCcccCcCcccccccccccccccceee Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIG-KKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~ell~---ds~~~~~~~v~~~la~~~a-~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) .--++.+.-+.....+ -|+.. 
.+...+..+|..+|+.+|. +..|.+++.|+|+.. +....-.+........ +. T Consensus 190 ~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~-f~~~DK~advK~I~~~-Tt 266 (400) T protein:vir:93 190 TIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKI-TT 266 (400) T ss_pred eeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCC-ccchhhHHHHHHHHHH-hh Confidence 3333333322222222 23433 4555679999999999999 899999999998643 1110000000000000 00 Q ss_pred cccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCce---eecccccCccceEecCccccCCCC-- Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNP---VFRDDSFAGFRTFFNRNGAWDADA-- 228 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~---l~~~~~l~G~pv~~~~~~~~~~~~-- 228 (305) ....+....+.+.+..+..-+.+...+.--++-.......|..++-+..+. +-.++.-....|-+.+.+.....+ T Consensus 267 kaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatanahvriknddaeiasevgvdeiivytgskal 346 (400) T protein:vir:93 267 KAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANAHVRIKNDDAEIASEVGVDEIIVYTGSKAL 346 (400) T ss_pred hhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhccccceEeecchhhhhhhcCcceeeeeeccccc Confidence 011111112223333333333222111112222222223333343222211 111111100011111111111100 Q ss_pred ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeee--eecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINL--AERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~--~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .+-++.|.++- +. -.++ ..++. |..|.-.+.++.--.+-|.--.+-+.++.. 
T Consensus 347 kptvlvdqkyh-id-mqdl--------------tkvdafewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 347 KPTVLVDQKYH-ID-MQDL--------------TKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred cceeeeccccc-cc-hhhh--------------hhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 11222222111 10 0000 01112 222333333333333333222222222222 No 214 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=83.59 E-value=0.067 Score=27.01 Aligned_cols=273 Identities=12% Similarity=0.123 Sum_probs=114.5 Q ss_pred CCC------ccCCccceEccHHHHHHHHHHHHhhh--hhhhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MAD------ISRAEVASLIQEAYSDTLLAAAKQGS--TVLSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~------~t~~~gg~lip~~~~~~i~~~~~~~~--~l~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~ 69 (305) |.+ .+-.+++++=-+.+.++|........ .+.+-..+.+..+--.++-. ......+.+++|+.. T Consensus 22 ~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~----- 96 (464) T protein:vir:80 22 FTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGV----- 96 (464) T ss_pred HHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccc----- Confidence 221 22233555555666666654444333 23333344444443222222 223356777888753 Q ss_pred cccccccceeEEeeeeeEEEe--ehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc-------CcCcccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEIAVI--IPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK-------PASWVSPAL 140 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~~~~--~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~-------~~~~~~~~~ 140 (305) .+.+++.+.+...+.+=+..- +.+-.++. 
++..+-.....+.-...+++.+|.+.|.|+-+ +.+.+--|+ T Consensus 97 ~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lv-n~~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl 175 (464) T protein:vir:80 97 APISDPNLRQKTVNMKYVSDTKNMSIATGLV-NNIEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGL 175 (464) T ss_pred cccCCCceEEEEEEeeeeecceeeeeehhhh-cchhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhh Confidence 566777787777665433222 22333333 45667777888888888999999999999753 233343444 Q ss_pred cccccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHH-HHhhccCCceeecc--cccCccceE Q lcl|Aclame:pro 141 IPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEV-ANIRDANGNPVFRD--DSFAGFRTF 217 (305) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~~kd~~G~~l~~~--~~l~G~pv~ 217 (305) . ......++...-+..... +.+..+...+...+..++-++|+..+.+.+ .+.-+.+-+.+..+ ....|+|+ T Consensus 176 ~-~lI~~~NViDarG~~Ls~----~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~~~n~~~~~~G~~v- 249 (464) T protein:vir:80 176 A-KLIDKHNVLDAKGASLTE----ALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVISDNGQNATMGFNV- 249 (464) T ss_pred H-hhcCCCceeecCCCCcCH----HHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEEcCCCCcceeeeec- Confidence 4 333444555555544442 333444444555666677788888887664 55544433322211 11223332 Q ss_pred ecCccccCCCCceEEEEehhhEEEEeecCcEEEEee----cceeccCcc-eeeeeecCcEEEEEEEEEc--cEeec---- Q lcl|Aclame:pro 218 FNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLD----QATLGTGEN-QINLAERDMVALRLKARFA--YVLGV---- 286 (305) Q Consensus 218 ~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~----~~~~~~~~~-~~~~~~~~~~~~r~~~r~~--~~v~~---- 286 (305) ..+ +.-++.+.+.-+. +..+..... ....++--.++.-.+.--. 
|.-.+ T Consensus 250 -------------------~~f-~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~ 309 (464) T protein:vir:80 250 -------------------KGF-NSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTID 309 (464) T ss_pred -------------------ccc-cccccceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccce Confidence 111 1112222221100 000000000 0000000000000000000 00000 Q ss_pred -ccceEEEecc-ccc-----cccCCC Q lcl|Aclame:pro 287 -SATAQGANKT-PVA-----VVAPAA 305 (305) Q Consensus 287 -p~a~~~~~~t-~~a-----~v~~a~ 305 (305) --+++....- ..+ -++.++ T Consensus 310 ~~Ykv~~vn~~GeS~ps~~~~~ti~~ 335 (464) T protein:vir:80 310 TEYKVVVVSDDAESAPSDVASVVIDD 335 (464) T ss_pred eEEEEEEECCCCccccceeeeeeecC Confidence 0000000000 000 011111 No 215 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=82.56 E-value=0.076 Score=26.72 Aligned_cols=286 Identities=10% Similarity=0.097 Sum_probs=117.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-------CC------------Cceeeeec Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-------TL------------PEADWVGE 61 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-------~~------------~~a~~v~E 61 (305) .+..+++..=.-.-+.+. .+..++..+-+..+++.+-||++++.-|.-.. +. +++.|-+. 
T Consensus 87 ia~s~~s~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~ 165 (534) T protein:vir:10 87 IASGETSGSITNVGPAVM-GLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGR 165 (534) T ss_pred ccccccccccccccchhh-hHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCcccccccccccccccccccc Confidence 232222221111222221 23344445555667888888876542221110 00 11111000 Q ss_pred c------------------------------------------------------------------------------- Q lcl|Aclame:pro 62 S------------------------------------------------------------------------------- 62 (305) Q Consensus 62 ~------------------------------------------------------------------------------- 62 (305) + T Consensus 166 ~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~ 245 (534) T protein:vir:10 166 GAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAF 245 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccccchhh Confidence 0 Q ss_pred -hh------hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCccc Q lcl|Aclame:pro 63 -AT------DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDK 131 (305) Q Consensus 63 -~~------~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~ 131 (305) +. ......++...+++++++..+...-...+|-||.+|- ..|.++.|.+-|+-.|...+++.||.---. 
T Consensus 246 AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~ 325 (534) T protein:vir:10 246 AELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINA 325 (534) T ss_pred HhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhh Confidence 00 0011256667778888888877777889999999984 457899999999999999999888853211 Q ss_pred ------CcCcccccccccccccccceeecccchhhhH---HHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHh--- Q lcl|Aclame:pro 132 ------PASWVSPALIPAAVTAGQAVEVVGGVANESD---IVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANI--- 197 (305) Q Consensus 132 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~--- 197 (305) ..+..+.....+............+-...+. ++-.+......+.. .....+-+++++++...|... T Consensus 326 ~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l 405 (534) T protein:vir:10 326 TAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDML 405 (534) T ss_pred hhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccch Confidence 0000000000000000000000001111111 11222222222322 223467789999999998642 Q ss_pred -------------hccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccC Q lcl|Aclame:pro 198 -------------RDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTG 260 (305) Q Consensus 198 -------------kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~ 260 (305) +|.++. 
++..-.-.|++|+++.+.+.+ -++ +|.++...+ ....+.-+..- T Consensus 406 ~~~~~~~~~~~~~~d~~~~-~~~G~l~~~~~vy~D~y~~~d----y~~--------vG~KG~~~~~~glfyaPYv~l~~~ 472 (534) T protein:vir:10 406 MTPAVMGANTTMNTDTTSS-LFAGVLAGKYRVYIDQYAVED----YFT--------VGYKGASEMDAGLYYCPYVALTPL 472 (534) T ss_pred hccccccccccccccCCCc-eEEEEecCceEEEecCCCCcc----eEE--------EEEeCCcccccceeeccccccccc Confidence 122221 111222345677777765432 122 222222111 11111000000 Q ss_pred -cceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCC----------C Q lcl|Aclame:pro 261 -ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA----------A 305 (305) Q Consensus 261 -~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a----------~ 305 (305) ...-..|+- .+=...|++.. .+|-+ ...+..+.+.+.-+ + T Consensus 473 ~~~dp~sfqP---~~g~~tRY~l~-~NP~~-~~~~~~~~~~i~~g~~~~~~~ag~n 523 (534) T protein:vir:10 473 RGTDPKNFQP---VLGFKTRYGVK-LHPMA-DATQNKGFAKISNGMPQHTNMFGKN 523 (534) T ss_pred cccCCccccc---eeeeeeeecee-ecCcc-cccCCccccccccCCcchhhhcccc Confidence 000011222 12234466543 34421 11111222222211 1 No 216 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=82.54 E-value=0.076 Score=26.72 Aligned_cols=293 Identities=12% Similarity=0.038 Sum_probs=130.2 Q ss_pred CCCccCCc-cceEccHHHHHHHHHHHHhhhhh-----hhhcceeecCCCceEEEEEeCCCceeeeecchh--hccccccc Q lcl|Aclame:pro 1 MADISRAE-VASLIQEAYSDTLLAAAKQGSTV-----LSAFQNVNMGTKTTHLPVLATLPEADWVGESAT--DPKGVKPT 72 (305) Q Consensus 1 Ma~~t~~~-gg~lip~~~~~~i~~~~~~~~~l-----~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~--~~~~~~~~ 72 (305) -+..+ ++ ....++..-. 
+++...+ ....++..+.++.+++-+-.++..|.-+..+.+ .-+...++ T Consensus 69 ta~~~-a~~T~i~V~~~~~------f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eE 141 (418) T protein:vir:96 69 TAEAL-ADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEE 141 (418) T ss_pred EEEEe-cCceEEEecCCcc------cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCccc Confidence 11111 12 1233333221 2222222 123345556677777777655544444444432 11112333 Q ss_pred ccccceeEEeeeeeEEEeehhhHHHhhcCHHH-----------HHHHHHHHHHHHHHHHHHHHHHcCc---ccCcCccc- Q lcl|Aclame:pro 73 SKVTWANRTLVAEEIAVIIPVHENVIDDATVA-----------VLTEVAELGGQAIGKKLDQAVIFGT---DKPASWVS- 137 (305) Q Consensus 73 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~-----------~~~~v~~~la~~~a~~~d~a~l~G~---g~~~~~~~- 137 (305) .+........+...+.-+..|-+|..+-|.-. +.....+.|.+. ...+|.+++.|. |..++ .+ T Consensus 142 Gsd~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng-~p~ 219 (418) T protein:vir:96 142 GSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNG-QPL 219 (418) T ss_pred ccccCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCC-ccc Confidence 33334444455555556666666655443321 222223344444 457788899886 22211 11 Q ss_pred ---ccccccccc--cccceeecc-cchhhhHHHHHHHHHHHHhhhccccc----eEEEEchHHHHHHHHhhccCCceeec Q lcl|Aclame:pro 138 ---PALIPAAVT--AGQAVEVVG-GVANESDIVGATNRAAKAVASAGWAP----DTLLSSLALRYEVANIRDANGNPVFR 207 (305) Q Consensus 138 ---~~~~~~~~~--~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~~~~~~l~~~kd~~G~~l~~ 207 (305) .++..+... ..+.++... ...+.+.+.+.+.++...-.+.+... -.++++.+....|.++-. +-+. 
-+ T Consensus 220 ~~t~R~m~gI~~f~~~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~I~~-~~ 297 (418) T protein:vir:96 220 HTTQGIVDAIRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTV-TQ 297 (418) T ss_pred ccccchhHHHHhhccccccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-eeEe-cc Confidence 111111111 122222222 23455666666655543111122221 236889999999987642 2221 11 Q ss_pred ccccC-----------c-cceEecCccccCC-CCceEEEEehhhEEEEee--cCcEEEEeecce----eccCcceeeeee Q lcl|Aclame:pro 208 DDSFA-----------G-FRTFFNRNGAWDA-DAAIEVIADSSRVKIGVR--QDITVKFLDQAT----LGTGENQINLAE 268 (305) Q Consensus 208 ~~~l~-----------G-~pv~~~~~~~~~~-~~~~~~~gdf~~~~~~~~--~~i~v~~~~~~~----~~~~~~~~~~~~ 268 (305) .+... | ++++++.++|.+. ..+.+++-|.+..-+..- +....+...... +......+. .. T Consensus 298 ~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~-~~ 376 (418) T protein:vir:96 298 RETSYGMVFTEWKFFKGRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYG-HG 376 (418) T ss_pred ccceeceEEEEEEeeccEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCcccccccccccc-cc Confidence 12222 2 3667777777654 677788888877654332 233222221110 000000000 00 Q ss_pred cCcEEEEEEEEEccEeecccceEEEeccc-----cccccCCC Q lcl|Aclame:pro 269 RDMVALRLKARFAYVLGVSATAQGANKTP-----VAVVAPAA 305 (305) Q Consensus 269 ~~~~~~r~~~r~~~~v~~p~a~~~~~~t~-----~a~v~~a~ 305 (305) .|.+.-.....+...+.+|.+.+++++-. 
+.+.+||- T Consensus 377 ~D~~~G~l~~Eltle~~N~~a~a~itgl~~~~~~~~~~~~~~ 418 (418) T protein:vir:96 377 VDAQGGSLTSEWALELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred cccccCEEEEEEEEEeecccccEEeecccccccccccCCCCC Confidence 12222223345666779999999998642 22334444 No 217 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=81.64 E-value=0.084 Score=26.48 Aligned_cols=277 Identities=9% Similarity=0.066 Sum_probs=106.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcce-------eecCCCce--EEEE--------EeCCCceeeeecch Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQN-------VNMGTKTT--HLPV--------LATLPEADWVGESA 63 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~-------~~~~~~~~--~~p~--------~~~~~~a~~v~E~~ 63 (305) ++..++...|...-.. ...+.+...... .+..+... .+.. ..+..-+.-.+|.. T Consensus 174 ~~~~~~~~~G~~~~~t---------~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~l 244 (528) T protein:vir:80 174 LAIGTQIEAGDIVHHT---------FAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQ 244 (528) T ss_pred ccccccccccceeccc---------cccccccccccccccccCccccCCcccccccccccccccccccccccchhhhhhh Confidence 1111111111111000 000000000000 00000000 0000 00001111122321 Q ss_pred h----hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc Q lcl|Aclame:pro 64 T----DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW 135 (305) Q Consensus 64 ~----~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~ 135 (305) . ......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.||.=....... 
T Consensus 245 e~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~ 324 (528) T protein:vir:80 245 EGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQV 324 (528) T ss_pred cccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeee Confidence 1 1122357778888999888888888899999999984 4688999999999999999999996422110000 Q ss_pred cccccc----cc--ccccccceeecccc---hhhhHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhh-----c Q lcl|Aclame:pro 136 VSPALI----PA--AVTAGQAVEVVGGV---ANESDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIR-----D 199 (305) Q Consensus 136 ~~~~~~----~~--~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~k-----d 199 (305) .-.+.. .. ...........++- ..+..++-.+......+.. .....+.+++++++...|...- . T Consensus 325 ~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~ 404 (528) T protein:vir:80 325 GKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLA 404 (528) T ss_pred eeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccccccc Confidence 000000 00 00000000000000 1112222223333333333 2224467899999999987531 1 Q ss_pred cCC-ceee---------cccccCccceEecCccccCCCCceEEEEehh-------hEEEEeecCcEEEEeecceeccCcc Q lcl|Aclame:pro 200 ANG-NPVF---------RDDSFAGFRTFFNRNGAWDADAAIEVIADSS-------RVKIGVRQDITVKFLDQATLGTGEN 262 (305) Q Consensus 200 ~~G-~~l~---------~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~-------~~~~~~~~~i~v~~~~~~~~~~~~~ 262 (305) ..| ...+ ..-.-.|++|+++.+.+.+ -+++|--. 
-|+--.-...-+...+..+ T Consensus 405 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s------ 474 (528) T protein:vir:80 405 MQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQD----YFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQS------ 474 (528) T ss_pred ccccccccccCCCCceEEEEecCceEEEecCCCCcc----eEEEEEeCCcccccceeecccccceeeEeeCCcc------ Confidence 111 1111 1112345677777664322 12222110 0110111111122222222 Q ss_pred eeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 263 QINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 263 ~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) |+- .+=...|++. ..+|-+ ...+.++.+-+.-+. T Consensus 475 ----fqP---~~g~~tRY~l-~~NP~~-~~~~~~~~~r~~~g~ 508 (528) T protein:vir:80 475 ----FHP---VLGFKTRYGI-GINPFA-DSKSQAPSARITSGM 508 (528) T ss_pred ----ccc---eeeeeeeece-eecCcc-cccCCcccccccccc Confidence 221 1223345554 345511 122222222222222 No 218 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=81.55 E-value=0.085 Score=26.46 Aligned_cols=275 Identities=9% Similarity=0.079 Sum_probs=128.2 Q ss_pred CCC------ccCCccceEccHHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MAD------ISRAEVASLIQEAYSDTLLAAAKQGST--VLSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~------~t~~~gg~lip~~~~~~i~~~~~~~~~--l~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~ 69 (305) |.+ .+-.++|++=-+.+.++|......... +.+-..+.+..+--.++-. ..+...+.+++|+.. 
T Consensus 26 ~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~----- 100 (462) T protein:vir:96 26 YQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGV----- 100 (462) T ss_pred HhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccc----- Confidence 332 333345555556666666555444332 3333334444443222222 223366777888753 Q ss_pred cccccccceeEEeeeeeEEEeehhhHHHh-hcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc------CcCcccccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEIAVIIPVHENVI-DDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK------PASWVSPALIP 142 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~------~~~~~~~~~~~ 142 (305) .+.+++.+.+.....+=++..-.+|.-.- ..+..+.+....+.-...+++.+|.+.|.|+-+ +.+.+-.|+ . T Consensus 101 ~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl-~ 179 (462) T protein:vir:96 101 APVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGL-A 179 (462) T ss_pred cccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhh-h Confidence 67778889999988888888777776432 345667888888888899999999999999753 222333344 3 Q ss_pred cccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc---ccCccceE-- Q lcl|Aclame:pro 143 AAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD---SFAGFRTF-- 217 (305) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~---~l~G~pv~-- 217 (305) ......+++..-+..... +.+..+...+...+..++-++|+..+.+.|..---..-|.+.++. 
...|+|+- T Consensus 180 ~lI~~~NViDarG~~Ls~----~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f 255 (462) T protein:vir:96 180 KLIDKDNVIDAKGESLTE----TLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGF 255 (462) T ss_pred hhcCCCceeecCCCCccH----HHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccce Confidence 333444555544444432 233334444455666777788899888888743322222232221 23344331 Q ss_pred ecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc--cEeeccc------- Q lcl|Aclame:pro 218 FNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA--YVLGVSA------- 288 (305) Q Consensus 218 ~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~--~~v~~p~------- 288 (305) ++.. +.| ++....+.+...+ +..+... .+ ..-....+.++..-+ +...++. T Consensus 256 ~s~~-------G~I---~L~~s~~m~~~~i-~~~~~~~------~p---~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y 315 (462) T protein:vir:96 256 YSSR-------GFI---KLHGSTVMENELI-LDESLQP------LP---NAPQPATVKATVETGKKGLFTDEHDRAELTY 315 (462) T ss_pred eeee-------eee---eeCCceecCcccc-ccccccc------CC---CCCCCCceeEEEEeCCCCCCCCccCceeEEE Confidence 1100 000 1111111110000 0000000 00 000112223332222 1122221 Q ss_pred ceEEEecc----ccc--cccCCC Q lcl|Aclame:pro 289 TAQGANKT----PVA--VVAPAA 305 (305) Q Consensus 289 a~~~~~~t----~~a--~v~~a~ 305 (305) +++....- |.. 
.+|.|+ T Consensus 316 ~V~avs~dgeS~PS~~VtaTva~ 338 (462) T protein:vir:96 316 KVVVNSDDAQSAPSEAVTATVNN 338 (462) T ss_pred EEEEECCCCccccceeeEeeeec Confidence 11111111 100 111111 No 219 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=81.46 E-value=0.085 Score=26.44 Aligned_cols=283 Identities=9% Similarity=0.082 Sum_probs=130.4 Q ss_pred CCC------ccCCccceEccHHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MAD------ISRAEVASLIQEAYSDTLLAAAKQGST--VLSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~------~t~~~gg~lip~~~~~~i~~~~~~~~~--l~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~ 69 (305) |.+ .+-.+|+++=-+.+.++|......... +.+-..+.+..+--.++-. ..+...+.+++|+.. T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~----- 100 (463) T protein:vir:99 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGV----- 100 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccc----- Confidence 332 333445566566666666555444332 3333334444443222222 223366777888753 Q ss_pred cccccccceeEEeeeeeEEEeehhhHHH-hhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc------CcCcccccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEIAVIIPVHENV-IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK------PASWVSPALIP 142 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~------~~~~~~~~~~~ 142 (305) .+++++.+.......+=++....+|.-+ +.++..+.+..+.+.-...++..+|.+.|.|+-+ +.+.+-.|+. 
T Consensus 101 ~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~- 179 (463) T protein:vir:99 101 APVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLA- 179 (463) T ss_pred cccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhh- Confidence 6677888999888888888887777733 4566778889999999999999999999999753 2333334443 Q ss_pred cccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---cccCccceE-- Q lcl|Aclame:pro 143 AAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTF-- 217 (305) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---~~l~G~pv~-- 217 (305) ......++...-+...... .+..+...+...+..++-++|+..+.+.|..---..-|.+.++ ....|+|+- T Consensus 180 ~lId~enviDarG~~Ls~~----~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f 255 (463) T protein:vir:99 180 KLIDKNNVINAKGNQLTEK----HLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGF 255 (463) T ss_pred hhcCCCCeeecCCCcccHH----HHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccce Confidence 3333444444433333322 2444444555566677788889999988874332222222221 123344431 Q ss_pred ecCccccCCC-----CceEEEEehhhEEEEe--ecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccce Q lcl|Aclame:pro 218 FNRNGAWDAD-----AAIEVIADSSRVKIGV--RQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATA 290 (305) Q Consensus 218 ~~~~~~~~~~-----~~~~~~gdf~~~~~~~--~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~ 290 (305) ++..-..... +.+.+++--.+..-.- .--++..+. 
-.......+.-......+++...-+.+=..|..+ T Consensus 256 ~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~----~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~i 331 (463) T protein:vir:99 256 YSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVE----TKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEE 331 (463) T ss_pred eeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEe----eccCCCCCCcccccceEEEEEEECCCCCcccchh Confidence 1110000000 0000000000000000 000011110 0001111111111122222222212111112222 Q ss_pred EEEeccccccccCCC Q lcl|Aclame:pro 291 QGANKTPVAVVAPAA 305 (305) Q Consensus 291 ~~~~~t~~a~v~~a~ 305 (305) + .+|.|+ T Consensus 332 --v------taT~a~ 338 (463) T protein:vir:99 332 --V------TATVSN 338 (463) T ss_pred --e------eeeeee Confidence 1 112222 No 220 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=81.46 E-value=0.085 Score=26.44 Aligned_cols=283 Identities=9% Similarity=0.082 Sum_probs=130.4 Q ss_pred CCC------ccCCccceEccHHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MAD------ISRAEVASLIQEAYSDTLLAAAKQGST--VLSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma~------~t~~~gg~lip~~~~~~i~~~~~~~~~--l~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~ 69 (305) |.+ .+-.+|+++=-+.+.++|......... +.+-..+.+..+--.++-. ..+...+.+++|+.. 
T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~----- 100 (463) T protein:vir:95 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGV----- 100 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccc----- Confidence 332 333445566566666666555444332 3333334444443222222 223366777888753 Q ss_pred cccccccceeEEeeeeeEEEeehhhHHH-hhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc------CcCcccccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEIAVIIPVHENV-IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK------PASWVSPALIP 142 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~------~~~~~~~~~~~ 142 (305) .+++++.+.......+=++....+|.-+ +.++..+.+..+.+.-...++..+|.+.|.|+-+ +.+.+-.|+. T Consensus 101 ~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~- 179 (463) T protein:vir:95 101 APVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLA- 179 (463) T ss_pred cccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhh- Confidence 6677888999888888888887777733 4566778889999999999999999999999753 2333334443 Q ss_pred cccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecc---cccCccceE-- Q lcl|Aclame:pro 143 AAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTF-- 217 (305) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~---~~l~G~pv~-- 217 (305) ......++...-+...... 
.+..+...+...+..++-++|+..+.+.|..---..-|.+.++ ....|+|+- T Consensus 180 ~lId~enviDarG~~Ls~~----~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f 255 (463) T protein:vir:95 180 KLIDKNNVINAKGNQLTEK----HLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGF 255 (463) T ss_pred hhcCCCCeeecCCCcccHH----HHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccce Confidence 3333444444433333322 2444444555566677788889999988874332222222221 123344431 Q ss_pred ecCccccCCC-----CceEEEEehhhEEEEe--ecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccce Q lcl|Aclame:pro 218 FNRNGAWDAD-----AAIEVIADSSRVKIGV--RQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATA 290 (305) Q Consensus 218 ~~~~~~~~~~-----~~~~~~gdf~~~~~~~--~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~ 290 (305) ++..-..... +.+.+++--.+..-.- .--++..+. -.......+.-......+++...-+.+=..|..+ T Consensus 256 ~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~----~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~i 331 (463) T protein:vir:95 256 YSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVE----TKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEE 331 (463) T ss_pred eeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEe----eccCCCCCCcccccceEEEEEEECCCCCcccchh Confidence 1110000000 0000000000000000 000011110 0001111111111122222222212111112222 Q ss_pred EEEeccccccccCCC Q lcl|Aclame:pro 291 QGANKTPVAVVAPAA 305 (305) Q Consensus 291 ~~~~~t~~a~v~~a~ 305 (305) + .+|.|+ T Consensus 332 --v------taT~a~ 338 (463) T protein:vir:95 332 --V------TATVSN 338 (463) T ss_pred --e------eeeeee Confidence 1 112222 No 221 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=79.75 E-value=0.1 Score=26.03 Aligned_cols=268 Identities=10% Similarity=0.036 Sum_probs=113.8 Q ss_pred CCC----ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MAD----ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) Q 
Consensus 1 Ma~----~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 76 (305) +.. ...++.+..-+ ... .......+ .....+ ............ ..........++...+ T Consensus 219 l~gEA~t~~sTd~at~~~-Gtt-----~t~~~~~l------yt~~~g-~~t~~~~~~~~~----~~~~~~~~~~~eM~Fs 281 (523) T protein:vir:59 219 LYARLFFVTGSDFATVAG-GTP-----STQDLDLV------YYIDAR-NDFEDQSTDPDY----PDPGFQSLDIPEINLE 281 (523) T ss_pred ccccccccccccccccCC-Ccc-----cccccccc------cccccc-cchhhccccccc----cccccccccccceeeE Confidence 110 01111110000 000 00000000 011100 000000000000 0001112346777888 Q ss_pred ceeEEeeeeeEEEeehhhHHHhhcC-----HHHHHHHHHHHHHHHHHHHHHHHHHcCcc------cCcCccccccccccc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENVIDDA-----TVAVLTEVAELGGQAIGKKLDQAVIFGTD------KPASWVSPALIPAAV 145 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~ell~ds-----~~~~~~~v~~~la~~~a~~~d~a~l~G~g------~~~~~~~~~~~~~~~ 145 (305) ++++++..+..+-...+|-||.+|- ..|.+..|.+-|+..|...+++.||.--- +-.+..+.|+..-.. T Consensus 282 IeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~ 361 (523) T protein:vir:59 282 LRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYD 361 (523) T ss_pred EEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecc Confidence 9999999888888899999999983 45689999999999999999999886321 111111111111000 Q ss_pred ccccceeec----ccchhhhHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhhccCC----c-----eeecccc Q lcl|Aclame:pro 146 TAGQAVEVV----GGVANESDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIRDANG----N-----PVFRDDS 210 (305) Q Consensus 146 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~kd~~G----~-----~l~~~~~ 210 (305) ......... ........++-.+......+.. .....+-+++++++...|...---++ + ..+..-. 
T Consensus 362 ~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l 441 (523) T protein:vir:59 362 ETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMV 441 (523) T ss_pred cccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEe Confidence 000000000 0000111121222222222332 22246778999999998874211110 0 1111112 Q ss_pred cCccceEecCccccCCCCceEEEEehhhEEEEeecCc-----EEEEeecceeccCcc--eeeeeecCcEEEEEEEEEccE Q lcl|Aclame:pro 211 FAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-----TVKFLDQATLGTGEN--QINLAERDMVALRLKARFAYV 283 (305) Q Consensus 211 l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i-----~v~~~~~~~~~~~~~--~~~~~~~~~~~~r~~~r~~~~ 283 (305) -.|++|+++.+.+.+ -+++ |.++.. .+....+.-+..-.. .=..|+- .+=...|++.. T Consensus 442 ~~~~~vy~d~~~~~d----y~~~--------g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp---~~~~~tRY~l~ 506 (523) T protein:vir:59 442 QGRYRLYKNIYQNQP----VIIM--------GNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSY---RRGLMTRYALE 506 (523) T ss_pred cCceEEEecCCCCcc----eEEE--------EecccCCcccccceecccchhhcccccccCCcccc---eeeeeeehhhe Confidence 345677777664321 1222 222211 111111111100000 0011322 33456799998 Q ss_pred eecccceEEEeccccccccC Q lcl|Aclame:pro 284 LGVSATAQGANKTPVAVVAP 303 (305) Q Consensus 284 v~~p~a~~~~~~t~~a~v~~ 303 (305) |.||-+...+-.+ ..+| T Consensus 507 v~nP~~~~~~~~~---~~~~ 523 (523) T protein:vir:59 507 VVRPEFYGLLYVK---LLQP 523 (523) T ss_pred ecchhHhhhhhhh---hcCC Confidence 9999888777765 3455 No 222 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=78.73 E-value=0.11 Score=25.81 Aligned_cols=287 Identities=12% Similarity=0.109 Sum_probs=120.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe----CC---------------Cceeeeec Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA----TL---------------PEADWVGE 61 (305) Q Consensus 
1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~----~~---------------~~a~~v~E 61 (305) .++.+++..=.-.-+.+. .+..++..+-+..+++.+-||++++.-|.-.. .. +++.|-+. T Consensus 79 i~es~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~ 157 (521) T protein:vir:10 79 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQ 157 (521) T ss_pred ccccccccccccCCchhh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccc Confidence 343333222112222222 23344455555667888888876542221110 00 11111000 Q ss_pred ---------------------------------------------------------------------chh-------- Q lcl|Aclame:pro 62 ---------------------------------------------------------------------SAT-------- 64 (305) Q Consensus 62 ---------------------------------------------------------------------~~~-------- 64 (305) +-. T Consensus 158 ~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~ 237 (521) T protein:vir:10 158 GAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQE 237 (521) T ss_pred ccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhc Confidence 000 Q ss_pred ----hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc Q lcl|Aclame:pro 65 ----DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV 136 (305) Q Consensus 65 ----~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~ 136 (305) ......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.+|.=.-...-.. 
T Consensus 238 ~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~ 317 (521) T protein:vir:10 238 SFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVG 317 (521) T ss_pred cCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeee Confidence 0011245666777888877777777888999999984 45789999999999999999999984211000000 Q ss_pred cccccc------cccccccceeecccchhhhH---HHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhh------- Q lcl|Aclame:pro 137 SPALIP------AAVTAGQAVEVVGGVANESD---IVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIR------- 198 (305) Q Consensus 137 ~~~~~~------~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~k------- 198 (305) -.+... +.............-...+. ++--+......+.. .....+-+++++++...|...- T Consensus 318 ~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~ 397 (521) T protein:vir:10 318 KSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAA 397 (521) T ss_pred eeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccc Confidence 011100 00000000000001111111 11122222223322 2245567899999999988531 Q ss_pred --ccCC------ceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccC-cceee Q lcl|Aclame:pro 199 --DANG------NPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTG-ENQIN 265 (305) Q Consensus 199 --d~~G------~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~-~~~~~ 265 (305) +..| ..++-.-.-.|++|+++.+.+.+ -++ +|.++..++ ....+.-+..- ...-. 
T Consensus 398 ~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~--------vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 465 (521) T protein:vir:10 398 QGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQD----YFT--------VGYKGPNEMDAGIYYAPYVALTPLRGSDPK 465 (521) T ss_pred ccccccccccCCCceEEEEecCceEEEecCCCCcc----eEE--------EEEeCCcccccceeeccccccccccccCCc Confidence 0111 01111122345677777664321 122 222222111 11111000000 00000 Q ss_pred eeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 266 LAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 266 ~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .|+- .+=...|++. ..+| -....+..++..+...- T Consensus 466 sfqP---~~g~~tRY~l-~~NP-~~~~~~~~~~~~i~~~~ 500 (521) T protein:vir:10 466 NFQP---VMGFKTRYGI-GINP-FAESAAQAPASRIQSGM 500 (521) T ss_pred cccc---eeeeeeeece-eecC-cccccCCccceeecccc Confidence 1222 2223456664 3455 22233333332322221 No 223 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=77.10 E-value=0.13 Score=25.47 Aligned_cols=268 Identities=10% Similarity=-0.052 Sum_probs=114.8 Q ss_pred CC------------------CccCCccceEccHHHHHHHHHHHHhhhhhhh-h-cc--eeecCCCceEEEEEeCCCceee Q lcl|Aclame:pro 1 MA------------------DISRAEVASLIQEAYSDTLLAAAKQGSTVLS-A-FQ--NVNMGTKTTHLPVLATLPEADW 58 (305) Q Consensus 1 Ma------------------~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~-l-~~--~~~~~~~~~~~p~~~~~~~a~~ 58 (305) |. --+-..+...+-+-+. .+++.+.....+.. 
+ ++ .....+++++||+.+...-..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~-~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHH-HHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 32 2222222223333333 33444444333322 1 22 3445678999999875432222 Q ss_pred eecchhhcccccccccccceeEEeeeeeEEEeehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc Q lcl|Aclame:pro 59 VGESATDPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV 136 (305) Q Consensus 59 v~E~~~~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~ 136 (305) .-......+++ +.+....+++..|.-.+. |.+-=..++.. .+...+.+...+.++-.+|.-.+.-.-+..+. T Consensus 80 -~R~~g~~~g~v---t~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~- 153 (319) T protein:vir:97 80 -KRNATNEFDHP---KIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK- 153 (319) T ss_pred -cCCCCcccCCc---ccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc- Confidence 11111111222 223444555554432221 11100122222 23444555566666666775544221100000 Q ss_pred cccccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc-ceEEEEchHHHHHHHHhhc----c-CC-ceee--c Q lcl|Aclame:pro 137 SPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA-PDTLLSSLALRYEVANIRD----A-NG-NPVF--R 207 (305) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~~kd----~-~G-~~l~--~ 207 (305) ......+.+..++.+.++...+...+.. .-.++++|.++..|.+-.. . .+ ..+. 
+ T Consensus 154 ----------------~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) T protein:vir:97 154 ----------------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) T ss_pred ----------------ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeee Confidence 0011123344566666666666554322 2357889999988865321 1 11 1111 2 Q ss_pred ccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecc Q lcl|Aclame:pro 208 DDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVS 287 (305) Q Consensus 208 ~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p 287 (305) -..+.|++|+..... ...+..+++|..+....... =-.+++.+. ... +.--.++...+.|.-|.+| T Consensus 218 Vg~idG~~Vi~vps~--~~k~in~i~~h~~A~~~~~k-~~~~~~~~p-----~~~------~~a~~v~gr~y~d~~V~~~ 283 (319) T protein:vir:97 218 QGELDGFVIVKVPTK--LLQGLQAIAVVGEVLASPIQ-ADLAKTNSN-----IPG------MFGTLAEQLLYTGAFVPEH 283 (319) T ss_pred ceeecCeEEEEeccc--ccccceEEEEcCCeeeeeee-eeeeeccCC-----Ccc------ccceeeeeeeeeeeEEecc Confidence 257889998854221 11222355554443322221 011221110 000 1123577888999999998 Q ss_pred cceEEEeccccccccCCC Q lcl|Aclame:pro 288 ATAQGANKTPVAVVAPAA 305 (305) Q Consensus 288 ~a~~~~~~t~~a~v~~a~ 305 (305) +.........+++.+-.. 
T Consensus 284 k~~~Iy~~~~~~~~~~~~ 301 (319) T protein:vir:97 284 LQKYIFTIGGTEVATKRD 301 (319) T ss_pred ccceEEEeecCCcccCCC Confidence 855555433332222222 No 224 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=77.10 E-value=0.13 Score=25.47 Aligned_cols=268 Identities=10% Similarity=-0.052 Sum_probs=114.8 Q ss_pred CC------------------CccCCccceEccHHHHHHHHHHHHhhhhhhh-h-cc--eeecCCCceEEEEEeCCCceee Q lcl|Aclame:pro 1 MA------------------DISRAEVASLIQEAYSDTLLAAAKQGSTVLS-A-FQ--NVNMGTKTTHLPVLATLPEADW 58 (305) Q Consensus 1 Ma------------------~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~-l-~~--~~~~~~~~~~~p~~~~~~~a~~ 58 (305) |. --+-..+...+-+-+. .+++.+.....+.. + ++ .....+++++||+.+...-..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~-~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHH-HHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 32 2222222223333333 33444444333322 1 22 3445678999999875432222 Q ss_pred eecchhhcccccccccccceeEEeeeeeEEEeehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc Q lcl|Aclame:pro 59 VGESATDPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV 136 (305) Q Consensus 59 v~E~~~~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~ 136 (305) .-......+++ +.+....+++..|.-.+. |.+-=..++.. .+...+.+...+.++-.+|.-.+.-.-+..+. 
T Consensus 80 -~R~~g~~~g~v---t~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~- 153 (319) T protein:vir:94 80 -KRNATNEFDHP---KIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK- 153 (319) T ss_pred -cCCCCcccCCc---ccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc- Confidence 11111111222 223444555554432221 11100122222 23444555566666666775544221100000 Q ss_pred cccccccccccccceeecccchhhhHHHHHHHHHHHHhhhcccc-ceEEEEchHHHHHHHHhhc----c-CC-ceee--c Q lcl|Aclame:pro 137 SPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWA-PDTLLSSLALRYEVANIRD----A-NG-NPVF--R 207 (305) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~~kd----~-~G-~~l~--~ 207 (305) ......+.+..++.+.++...+...+.. .-.++++|.++..|.+-.. . .+ ..+. + T Consensus 154 ----------------~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) T protein:vir:94 154 ----------------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) T ss_pred ----------------ccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeee Confidence 0011123344566666666666554322 2357889999988865321 1 11 1111 2 Q ss_pred ccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecc Q lcl|Aclame:pro 208 DDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVS 287 (305) Q Consensus 208 ~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p 287 (305) -..+.|++|+..... ...+..+++|..+....... =-.+++.+. ... 
+.--.++...+.|.-|.+| T Consensus 218 Vg~idG~~Vi~vps~--~~k~in~i~~h~~A~~~~~k-~~~~~~~~p-----~~~------~~a~~v~gr~y~d~~V~~~ 283 (319) T protein:vir:94 218 QGELDGFVIVKVPTK--LLQGLQAIAVVGEVLASPIQ-ADLAKTNSN-----IPG------MFGTLAEQLLYTGAFVPEH 283 (319) T ss_pred ceeecCeEEEEeccc--ccccceEEEEcCCeeeeeee-eeeeeccCC-----Ccc------ccceeeeeeeeeeeEEecc Confidence 257889998854221 11222355554443322221 011221110 000 1123577888999999998 Q ss_pred cceEEEeccccccccCCC Q lcl|Aclame:pro 288 ATAQGANKTPVAVVAPAA 305 (305) Q Consensus 288 ~a~~~~~~t~~a~v~~a~ 305 (305) +.........+++.+-.. T Consensus 284 k~~~Iy~~~~~~~~~~~~ 301 (319) T protein:vir:94 284 LQKYIFTIGGTEVATKRD 301 (319) T ss_pred ccceEEEeecCCcccCCC Confidence 855555433332222222 No 225 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=76.92 E-value=0.13 Score=25.44 Aligned_cols=286 Identities=10% Similarity=0.096 Sum_probs=121.8 Q ss_pred CCCccCCcc-ceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE----eC---------------CCceeeee Q lcl|Aclame:pro 1 MADISRAEV-ASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL----AT---------------LPEADWVG 60 (305) Q Consensus 1 Ma~~t~~~g-g~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~----~~---------------~~~a~~v~ 60 (305) .++.+++.. ...=|.-+ .+..++..+-+..+++.+-||++++.-|.-. .. 
.+++.|-+ T Consensus 80 i~es~~t~~v~~~~P~li--~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG 157 (522) T protein:vir:69 80 IAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSG 157 (522) T ss_pred ccccccccccccccchHH--HHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCcccccccccccccccccc Confidence 333333221 22223222 2334445555566788888887654211110 00 00010000 Q ss_pred c---------------------------------------------------------------------------chh- Q lcl|Aclame:pro 61 E---------------------------------------------------------------------------SAT- 64 (305) Q Consensus 61 E---------------------------------------------------------------------------~~~- 64 (305) . ++. T Consensus 158 ~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal 237 (522) T protein:vir:69 158 QGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQ 237 (522) T ss_pred ccccccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhc Confidence 0 000 Q ss_pred -----hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCc Q lcl|Aclame:pro 65 -----DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASW 135 (305) Q Consensus 65 -----~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~ 135 (305) ......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.||.=.-..... 
T Consensus 238 ~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 317 (522) T protein:vir:69 238 EGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQV 317 (522) T ss_pred ccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhee Confidence 0011256677788888888887777889999999984 4578999999999999999999998422111100 Q ss_pred ccccccc------cccccccceeecccc---hhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHHhh------ Q lcl|Aclame:pro 136 VSPALIP------AAVTAGQAVEVVGGV---ANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVANIR------ 198 (305) Q Consensus 136 ~~~~~~~------~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~~k------ 198 (305) .-.+... +............+- ..+..++--+......+... ....+-+++++++...|...- T Consensus 318 ~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~ 397 (522) T protein:vir:69 318 GKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYA 397 (522) T ss_pred eccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccc Confidence 0111110 000000000000000 01111222222333333332 224677899999999997531 Q ss_pred ----------ccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccCc-ce Q lcl|Aclame:pro 199 ----------DANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTGE-NQ 263 (305) Q Consensus 199 ----------d~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~~-~~ 263 (305) |.++. ++-.-.-.|++|+++.+.+.+ -++ +|.++..++ ....+..++.-. .. 
T Consensus 398 ~~~~~~g~~~d~~~~-~~~G~l~~~~~vy~D~y~~~d----y~~--------vG~KG~~~~~~glfyaPYv~l~~~~~~d 464 (522) T protein:vir:69 398 AQGLASGFNTDTTKS-VFAGVLGGKYRVYIDQYAKQD----YFT--------VGYKGANEMDAGIYYAPYVALTPLRGSD 464 (522) T ss_pred cccccccccccCCCc-eEEEEecCceEEEecCCCCcc----eEE--------EEEeCCcccccceeeccccccccccccC Confidence 11111 111222345677777664321 122 222222111 111111000000 00 Q ss_pred eeeeecCcEEEEEEEEEccEeecccc--------eEEEeccccccccCCC Q lcl|Aclame:pro 264 INLAERDMVALRLKARFAYVLGVSAT--------AQGANKTPVAVVAPAA 305 (305) Q Consensus 264 ~~~~~~~~~~~r~~~r~~~~v~~p~a--------~~~~~~t~~a~v~~a~ 305 (305) -..|+- .+=...|++.. .+|-+ ...++++|...-.... T Consensus 465 p~sfqP---~~g~~tRY~l~-vNP~~~~~~~~~~~ri~~g~p~~~~~~~~ 510 (522) T protein:vir:69 465 PKNFQP---VMGFKTRYGIG-VNPFAESSLQAPGARIQSGMPSILNSLGK 510 (522) T ss_pred Cccccc---eeeeeeeecee-ecCcccccCCcccceeecccchhhcccCC Confidence 011222 22334566643 34411 1223343332222222 No 226 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=73.71 E-value=0.17 Score=24.84 Aligned_cols=273 Identities=13% Similarity=0.106 Sum_probs=125.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcc-eeecC-CCceEEEEEeCCCceeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQ-NVNMG-TKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~-~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) |-.+ +..-...+.++++.+|...+.+.--=-.+.+ +...+ +..+.||.. +.+...--.|-. 
+-.......+ T Consensus 1 ~~~T-SNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~-----~~~~~~i~TG 73 (313) T protein:vir:95 1 MQLT-SNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDT-----PLIYNPIETG 73 (313) T ss_pred Cccc-ccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCC-----Ceeecccccc Confidence 6543 2223344456666666555554421112233 33333 446777754 333322122222 2223334556 Q ss_pred eEEeeeeeEEEe-ehhhHHHhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHc-Ccc----cCcCcccccccccccccccc Q lcl|Aclame:pro 79 NRTLVAEEIAVI-IPVHENVIDDATV--AVLTEVAELGGQAIGKKLDQAVIF-GTD----KPASWVSPALIPAAVTAGQA 150 (305) Q Consensus 79 ~v~~~~~k~~~~-~~is~ell~ds~~--~~~~~v~~~la~~~a~~~d~a~l~-G~g----~~~~~~~~~~~~~~~~~~~~ 150 (305) ++++....+.+- +.||++|-+|+-. .+...+.-+-+++|-...+.-++. |.- .+.+....|.-. ..+ T Consensus 74 EIt~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH-----~~V 148 (313) T protein:vir:95 74 EITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPH-----VIV 148 (313) T ss_pred eEEEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccc-----eEE Confidence 777777776554 5799999999742 344444445556666666655552 211 111111111100 011 Q ss_pred eeecccchhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHHhhc------cCCceeeccc---------ccCc Q lcl|Aclame:pro 151 VEVVGGVANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVANIRD------ANGNPVFRDD---------SFAG 213 (305) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~~kd------~~G~~l~~~~---------~l~G 213 (305) .+.+.+.-. ++.+..+....... ....-.++..|.....|..+.. 
.+|+.+...+ .+.| T Consensus 149 ~~~T~~~~~----~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG 224 (313) T protein:vir:95 149 SAETNGVFA----LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYG 224 (313) T ss_pred eccCCceeh----hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhh Confidence 111222222 22333332222221 2233468999999999988753 3456665432 4677 Q ss_pred cceEecCcccc-CCC----CceEEEEeh----hh----EEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEE Q lcl|Aclame:pro 214 FRTFFNRNGAW-DAD----AAIEVIADS----SR----VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARF 280 (305) Q Consensus 214 ~pv~~~~~~~~-~~~----~~~~~~gdf----~~----~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~ 280 (305) +-+.+++.+.. +-+ .+..+.|++ ++ =+++-|+.+.-. ....+.+ -.++... ..+|. T Consensus 225 ~Di~~SN~L~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s-------~~~~~~~--~~~~~~~--~~~R~ 293 (313) T protein:vir:95 225 WDILTSNRLHVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKS-------EGERNKD--RARDEHV--VRCRY 293 (313) T ss_pred hhhhhhhhhhhccccccccccCceeeeeeeeeecccccceeeeecccccc-------ccccccc--cccccce--eeeee Confidence 77776664431 111 111233332 11 123334333211 1111111 1222333 45688 Q ss_pred ccEeecccceEEEecccccc Q lcl|Aclame:pro 281 AYVLGVSATAQGANKTPVAV 300 (305) Q Consensus 281 ~~~v~~p~a~~~~~~t~~a~ 300 (305) |+++.|.+..+.+.....+- T Consensus 294 G~Gi~R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 294 GFGIQRLDTLGLLATSATAY 313 (313) T ss_pred cccceeecceeEEEeccccC Confidence 99988877776665544433 No 227 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=69.06 E-value=0.23 Score=24.10 Aligned_cols=282 Identities=11% Similarity=0.074 Sum_probs=121.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-----CC-C-------ceeeeecchh--- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA-----TL-P-------EADWVGESAT--- 64 
(305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~-----~~-~-------~a~~v~E~~~--- 64 (305) .+..+++..=.-.-+.+.. +..++..+-+..+++.+-||++.+.-|.-.. .. . ...|-+.+.. T Consensus 69 i~~st~t~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~ 147 (470) T protein:vir:10 69 SADATAAGPVAGFDPVLIS-LIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDD 147 (470) T ss_pred cccccccccccccCchhhh-hHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCcccccccc Confidence 2332222211112222322 4444555666678888999987654443211 00 0 0011100000 Q ss_pred -------------------------------------------------------hcccccccccccceeEEeeeeeEEE Q lcl|Aclame:pro 65 -------------------------------------------------------DPKGVKPTSKVTWANRTLVAEEIAV 89 (305) Q Consensus 65 -------------------------------------------------------~~~~~~~~~~~~f~~v~~~~~k~~~ 89 (305) ......++...+++++++..+...- T Consensus 148 ~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaL 227 (470) T protein:vir:10 148 TSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRAL 227 (470) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccce Confidence 0011245566677777777777667 Q ss_pred eehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCccc------CcCcccccccccccccccceeecccc-- Q lcl|Aclame:pro 90 IIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDK------PASWVSPALIPAAVTAGQAVEVVGGV-- 157 (305) Q Consensus 90 ~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~------~~~~~~~~~~~~~~~~~~~~~~~~~~-- 157 (305) ...+|-||.+|- ..|.++.|.+-|+..|...+++.||.---. -.+....|+.. -..... 
T Consensus 228 KAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~D---------l~~~~~gr 298 (470) T protein:vir:10 228 KAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFD---------LDTDSNGR 298 (470) T ss_pred eccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEE---------eecccchh Confidence 788999999984 457899999999999999999988853211 11111111110 001111 Q ss_pred hhhhHHHHHHHHHH---HH--hhhccccceEEEEchHHHHHHHHh--------------hccCCceeecccccCccceEe Q lcl|Aclame:pro 158 ANESDIVGATNRAA---KA--VASAGWAPDTLLSSLALRYEVANI--------------RDANGNPVFRDDSFAGFRTFF 218 (305) Q Consensus 158 ~~~~~~~~~~~~~~---~~--~~~~~~~~~~~v~~~~~~~~l~~~--------------kd~~G~~l~~~~~l~G~pv~~ 218 (305) ...+....++..+. .. ........+-+++++.....|... +|.+|. .+..-.-.|++|++ T Consensus 299 ~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~~~D~t~~-~~~G~l~~~~~vy~ 377 (470) T protein:vir:10 299 WSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGN-TFAGILQGKYRVYI 377 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccccccccccCCCCc-eEEEEecCceEEEe Confidence 11111112222221 11 122334556789999999888431 122221 11112234567777 Q ss_pred cCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccC-cceeeeeecCcEEEEEEEEEccEeecccceEEE Q lcl|Aclame:pro 219 NRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTG-ENQINLAERDMVALRLKARFAYVLGVSATAQGA 293 (305) Q Consensus 219 ~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~ 293 (305) +.++..+... .+..+.+|.++...+ ....+.-++.- ...-..|+- .+=...|++. 
..+|-....- T Consensus 378 d~y~~~~~~a------~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP---~~g~~tRY~l-~~NP~~~~~~ 447 (470) T protein:vir:10 378 DPFSASGGAA------ATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQP---KIGFKTRYGL-VENPFSQGTT 447 (470) T ss_pred eccccccCcc------cccEEEEEEecCcceecceeeccccccccCCCCCCccccc---eeeeeeeece-eecCcccCCC Confidence 6554321100 111222333322221 11111111110 000011322 2233456664 3455432211 Q ss_pred eccccccccCCC Q lcl|Aclame:pro 294 NKTPVAVVAPAA 305 (305) Q Consensus 294 ~~t~~a~v~~a~ 305 (305) . +.+.+..+. T Consensus 448 ~--~~~~i~~~~ 457 (470) T protein:vir:10 448 Q--GLGTLTRNS 457 (470) T ss_pred c--ccccccCCC Confidence 1 223344444 No 228 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=64.42 E-value=0.3 Score=23.45 Aligned_cols=281 Identities=11% Similarity=0.079 Sum_probs=117.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhh--hhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTV--LSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGVKPTSKV 75 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l--~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~~~~~~~ 75 (305) -...+-.+|+++=-+.+.++|.........+ .+-..+.+..+--.++-. ..+...+.+++|+.. 
.+.+++ T Consensus 31 ~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~-----~~~~~~ 105 (467) T protein:vir:80 31 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-----APVSDP 105 (467) T ss_pred cCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhheeeeccCccccccccccccc-----cccCCC Confidence 2222333455565666666665555444333 222223333332222222 223356777888753 667788 Q ss_pred cceeEEeeeeeEEEeehhhHHHh-hcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc-------CcCccccccccccccc Q lcl|Aclame:pro 76 TWANRTLVAEEIAVIIPVHENVI-DDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK-------PASWVSPALIPAAVTA 147 (305) Q Consensus 76 ~f~~v~~~~~k~~~~~~is~ell-~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~-------~~~~~~~~~~~~~~~~ 147 (305) .+.+...+.+=++....+|.-+- ..+..+.+....+.-...++..+|.+.|.|+-. +.+++.-|+..- ... T Consensus 106 ~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~l-i~~ 184 (467) T protein:vir:80 106 NIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKL-INQ 184 (467) T ss_pred ceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEE-ecC Confidence 89999998888888777776432 334567788888888889999999999999753 223333344422 233 Q ss_pred ccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHH-HHhhccCCceeec--ccccCccceEecCcccc Q lcl|Aclame:pro 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEV-ANIRDANGNPVFR--DDSFAGFRTFFNRNGAW 224 (305) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~ 224 (305) .+..+.-+..... +++..+.......+..+.-++|+..+...| ...-..+-+.... .....|.|+ ...+.. 
T Consensus 185 enviDa~G~~ls~----~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v--~g~~sa 258 (467) T protein:vir:80 185 DNVHDARGASLTE----SLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNI--QGFHSA 258 (467) T ss_pred CceeccCCCccCH----HHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEEcCCCCceeeeecc--cceecc Confidence 3444433333332 222223333333445556677777777666 3222222111110 112334333 111111 Q ss_pred CC---CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecccceEEEecccccc Q lcl|Aclame:pro 225 DA---DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVSATAQGANKTPVAV 300 (305) Q Consensus 225 ~~---~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p~a~~~~~~t~~a~ 300 (305) .. -.+..+++|.... .-++..-...-... ... ....... . ..+-+.. ..-.-+|+....-..++ T Consensus 259 ~G~I~l~gs~il~~~~~l--------~~~~~~~~~Apsp~-~vs-aT~~~~~-~-g~~~~~~~a~y~Y~v~~vs~~GES~ 326 (467) T protein:vir:80 259 RGFIKLHGSTVMENEQIL--------DERILALPTAPQPA-KVT-ATQEAGK-K-GQFRAEDLAAHEYKVVVSSDDAESI 326 (467) T ss_pred eeeeeecCceeeccccCC--------CcccccccccccCC-ccc-eeeeccc-C-CcccCCCcceEEEEEEEECCCCccc Confidence 00 0001111111111 00000000000000 000 0000000 0 0000000 00111222222222111 Q ss_pred ccCCC Q lcl|Aclame:pro 301 VAPAA 305 (305) Q Consensus 301 v~~a~ 305 (305) ..++. 
T Consensus 327 pS~~v 331 (467) T protein:vir:80 327 ASEVA 331 (467) T ss_pred cccce Confidence 11111 No 229 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=63.65 E-value=0.31 Score=23.35 Aligned_cols=287 Identities=11% Similarity=0.076 Sum_probs=116.4 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE-------EEeCC-------------------- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLP-------VLATL-------------------- 53 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p-------~~~~~-------------------- 53 (305) .+..+++..=.-.-+.+. .+..++..+-+..+++.+-||++++.-|. ..... T Consensus 79 ia~s~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~ 157 (529) T protein:vir:10 79 IAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGL 157 (529) T ss_pred ccccccccccccccchhh-hhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccc Confidence 233222221111122221 23333444455566777777765432110 00000 Q ss_pred ---------------------------Cceee-------------------------------------------eecch Q lcl|Aclame:pro 54 ---------------------------PEADW-------------------------------------------VGESA 63 (305) Q Consensus 54 ---------------------------~~a~~-------------------------------------------v~E~~ 63 (305) ..-.| .+++- T Consensus 158 ~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gm 237 (529) T protein:vir:10 158 AAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGM 237 (529) T ss_pred ccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCcccccccccccccccccccccc Confidence 00000 00000 Q ss_pred h------------hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|Aclame:pro 64 
T------------DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIF 127 (305) Q Consensus 64 ~------------~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~ 127 (305) . ......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.||. T Consensus 238 sTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~ 317 (529) T protein:vir:10 238 ATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVID 317 (529) T ss_pred chhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 0 0011246667778888888777777889999999984 45789999999999999999999996 Q ss_pred CcccCcCccccccc------ccccccccceeecccchhh---hHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHH Q lcl|Aclame:pro 128 GTDKPASWVSPALI------PAAVTAGQAVEVVGGVANE---SDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVAN 196 (305) Q Consensus 128 G~g~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~ 196 (305) =.-...-..-.+.. .+.............-... ..++-.+......+.. .....+.+++++.+...|.. T Consensus 318 ~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~ 397 (529) T protein:vir:10 318 WINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL 397 (529) T ss_pred HhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhh Confidence 11100000000000 0000000000000001111 1222223333333333 22346778999999999974 Q ss_pred h--hcc-------CC------ceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEE----Eeeccee Q lcl|Aclame:pro 197 I--RDA-------NG------NPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVK----FLDQATL 257 (305) Q Consensus 197 ~--kd~-------~G------~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~----~~~~~~~ 257 (305) . ++. 
.| ..++-.-.-.|++|+++.+.+.+ -+ .+|.++...++ ...+.-+ T Consensus 398 ~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~--------~vG~KG~~~~~~glfy~PYv~l 465 (529) T protein:vir:10 398 VDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQD----YF--------TMGYRGANNLDAGIYYCPYVAL 465 (529) T ss_pred hccccccccccccccceeecCCceEEEEecCceEEEecCCCCcc----eE--------EEEEeCCcccccceeecccccc Confidence 2 111 11 01111222345677777664321 12 22222221111 1111100 Q ss_pred ccC-cceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccc------cCCC Q lcl|Aclame:pro 258 GTG-ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVV------APAA 305 (305) Q Consensus 258 ~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v------~~a~ 305 (305) +.- ...-..|+- .+=...|++. ..+|-+ ...+.++.+-+ +-.| T Consensus 466 ~~~~~~dp~sfqP---~~g~~tRY~l-~~NP~~-~~~~~~~~~r~~~g~~~~~~a 515 (529) T protein:vir:10 466 TPLRGSDPKNFQP---VMGFKTRYAI-GVNPFA-ESRTQAPTSRISNGMPGAHSV 515 (529) T ss_pred ccccccCCCcccc---eeeeeeeece-eecCcc-ccccccccccccCCcchhhhc Confidence 000 000001222 2223456654 345511 11112211111 1111 No 230 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=63.49 E-value=0.32 Score=23.32 Aligned_cols=281 Identities=10% Similarity=0.119 Sum_probs=107.8 Q ss_pred CC-----CccCCccc-eEccHHHHHHHHHHHHhhhhhhhhcce---e---------------ecC-CCceEEEEEeCCCc Q lcl|Aclame:pro 1 MA-----DISRAEVA-SLIQEAYSDTLLAAAKQGSTVLSAFQN---V---------------NMG-TKTTHLPVLATLPE 55 (305) Q Consensus 1 Ma-----~~t~~~gg-~lip~~~~~~i~~~~~~~~~l~~l~~~---~---------------~~~-~~~~~~p~~~~~~~ 55 (305) .. ..+...++ ...-...........- .......+. . ... +..+.+. .+-.. 
T Consensus 159 SG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g--~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~--~GmsT 234 (524) T protein:vir:98 159 SGEGAHTAFAKITTGTAIATGAIVYHIFQETG--IAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEIS--VGMAT 234 (524) T ss_pred CCcccccccccccccccccccccccccccccc--ceeccccccCcccccccccccccccccccccccceeecc--cccch Confidence 00 00000000 0000000000000000 000000000 0 000 0001111 01011 Q ss_pred eeeeecch----hhcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|Aclame:pro 56 ADWVGESA----TDPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIF 127 (305) Q Consensus 56 a~~v~E~~----~~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~ 127 (305) + .+|.. .......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.||. T Consensus 235 A--~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~ 312 (524) T protein:vir:98 235 S--VAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVD 312 (524) T ss_pred h--hhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 1 12311 11123367778889998888888888889999999984 45789999999999999999999984 Q ss_pred CcccCcCcccccccccccc-cc-----cceeecccc---hhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHHH Q lcl|Aclame:pro 128 GTDKPASWVSPALIPAAVT-AG-----QAVEVVGGV---ANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVAN 196 (305) Q Consensus 128 G~g~~~~~~~~~~~~~~~~-~~-----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~ 196 (305) =.-...-..-.+..+.... .+ .......+- ..+..++-.+......+... ....+-+++++++...|.. 
T Consensus 313 ~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~ 392 (524) T protein:vir:98 313 LINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAR 392 (524) T ss_pred HHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhh Confidence 2111111111111111000 00 000000010 11122222233333333332 2246778999999988875 Q ss_pred h----h------------ccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecce Q lcl|Aclame:pro 197 I----R------------DANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQAT 256 (305) Q Consensus 197 ~----k------------d~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~ 256 (305) + . |.++ .++....-.|++|+++.+.+.+ -++ +|.++..++ ....+.- T Consensus 393 ~~~g~~~~s~~~~~~~~~d~~~-~~~~G~l~~~~~vy~D~y~~~d----y~~--------vG~KG~~~~~~glfyaPYv~ 459 (524) T protein:vir:98 393 IDSGITPASQGLQKTLNVDTTK-AVFAGVLGGTYKVYIDQYARQD----YFT--------VGFKGDNEMDAGIYYAPYVA 459 (524) T ss_pred hhcccccccchhhcccccCCcc-ceEEEEecCceEEEecCCCCcc----eEE--------EEeeCCcccccceeeccccc Confidence 3 0 1111 1222223346677777664321 122 222222111 1111100 Q ss_pred eccC-cceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 257 LGTG-ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 257 ~~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) +..- ...-..|+- .+=...|++. ..+|-+ ...+.++..-+.-+. 
T Consensus 460 l~~~~~~dp~sfqP---~~g~~tRY~l-~~NP~~-~~~~~~~~~ri~~g~ 504 (524) T protein:vir:98 460 LTPLRGSDPKNFQP---VMGFKTRYGI-GINPFA-NSRSQAPADRITSGM 504 (524) T ss_pred cccccccCCccccc---eeeeeeeece-eecCcc-cccCCccccccccCc Confidence 0000 000001222 2223456664 345522 122333333333333 No 231 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=62.94 E-value=0.33 Score=23.25 Aligned_cols=281 Identities=11% Similarity=0.079 Sum_probs=116.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhh--hhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTV--LSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGVKPTSKV 75 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l--~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~~~~~~~ 75 (305) -...+-.+|+++=-+.+..+|.........+ .+-..+.+..+--.++-. ..+...+.+++|+.. .+++++ T Consensus 32 ~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~-----~~~~~~ 106 (468) T protein:vir:63 32 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGV-----APVSDP 106 (468) T ss_pred cCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhheeeeccCccccccccccccc-----cccCCC Confidence 2222233355555666666665554444333 222223333332222222 223356777888753 667788 Q ss_pred cceeEEeeeeeEEEeehhhHHHh-hcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc-------CcCccccccccccccc Q lcl|Aclame:pro 76 TWANRTLVAEEIAVIIPVHENVI-DDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK-------PASWVSPALIPAAVTA 147 (305) Q Consensus 76 ~f~~v~~~~~k~~~~~~is~ell-~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~-------~~~~~~~~~~~~~~~~ 147 (305) .+.+...+.+=++....+|.-+- ..+..+.+....+.-...++..+|.+.|.|+-. +.+++.-|+..- ... 
T Consensus 107 ~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~l-i~~ 185 (468) T protein:vir:63 107 NIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKL-INQ 185 (468) T ss_pred ceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEE-ecC Confidence 89999998888888777776432 334567788888888889999999999999753 223333344422 233 Q ss_pred ccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHH-HHhhccCCceeec--ccccCccceEecCcccc Q lcl|Aclame:pro 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEV-ANIRDANGNPVFR--DDSFAGFRTFFNRNGAW 224 (305) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~~kd~~G~~l~~--~~~l~G~pv~~~~~~~~ 224 (305) .+..+.-+..... +++..+.......+..+.-++|+..+...| ...-..+-+.... .....|.|+ ...+.. T Consensus 186 enviDa~G~~ls~----~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v--~g~~sa 259 (468) T protein:vir:63 186 DNVHDARGASLTE----SLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNI--QGFHSA 259 (468) T ss_pred CceeccCCCccCH----HHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEEcCCCCceeeeecc--cceecc Confidence 3444433333332 222223333333445556677777777666 3222222111110 112334333 111111 Q ss_pred CC---CCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccE-eecccceEEEecccccc Q lcl|Aclame:pro 225 DA---DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV-LGVSATAQGANKTPVAV 300 (305) Q Consensus 225 ~~---~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~-v~~p~a~~~~~~t~~a~ 300 (305) .. -.+..+++|.... .-++..-...-... ... ....... . ..+-+.. 
..-.-+|+....-..++ T Consensus 260 ~G~I~l~gs~il~~~~~l--------~~~~~~~~~Apsp~-~vs-aT~~~~~-~-g~~~~~~~a~y~Y~v~~vs~~GES~ 327 (468) T protein:vir:63 260 RGFIKLHGSTVMENEQIL--------DERILALPTAPQPA-KVT-ATQEAGK-K-GQFRAEDLAAHEYKVVVSSDDAESI 327 (468) T ss_pred eeeeeecCceeeccccCC--------CcccccccccccCC-ccc-eeeeccc-C-CcccCCCcceEEEEEEEECCCCccc Confidence 00 0001111111111 00000000000000 000 0000000 0 0000000 00111222222222111 Q ss_pred ccCCC Q lcl|Aclame:pro 301 VAPAA 305 (305) Q Consensus 301 v~~a~ 305 (305) ..++. T Consensus 328 pS~~v 332 (468) T protein:vir:63 328 ASEVA 332 (468) T ss_pred cccce Confidence 11111 No 232 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=62.02 E-value=0.34 Score=23.13 Aligned_cols=270 Identities=14% Similarity=0.130 Sum_probs=107.9 Q ss_pred CCC--ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE-EeCCCceeeeecchhhcccccccccccc Q lcl|Aclame:pro 1 MAD--ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPV-LATLPEADWVGESATDPKGVKPTSKVTW 77 (305) Q Consensus 1 Ma~--~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~-~~~~~~a~~v~E~~~~~~~~~~~~~~~f 77 (305) +|. .|-++.-+-+|..++..|-..+....++.+...+.+++.- -+.+ ++...++.....|+. 
+.+...++ T Consensus 35 laengvtitdttfqlprklvesintallntnpvfkvfhvtnvgal--lvsrsfdssneaqvhkdgqt-----kteqaatl 107 (318) T protein:vir:94 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGAL--LVSRSFDSSNEAQVHKDGQT-----KTEQAATL 107 (318) T ss_pred hhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhhe--eeeccccccchhhhhccccc-----ccccceee Confidence 442 4445556678999999999999999888887777776542 2222 233333433333332 22222223 Q ss_pred eeEEeeeeeEEEeehhhHH--HhhcCHHHHHHHHHHHHHHHHHHH-HHHHHHcCcccCcCcccccccccccccc-cceee Q lcl|Aclame:pro 78 ANRTLVAEEIAVIIPVHEN--VIDDATVAVLTEVAELGGQAIGKK-LDQAVIFGTDKPASWVSPALIPAAVTAG-QAVEV 153 (305) Q Consensus 78 ~~v~~~~~k~~~~~~is~e--ll~ds~~~~~~~v~~~la~~~a~~-~d~a~l~G~g~~~~~~~~~~~~~~~~~~-~~~~~ 153 (305) .--++.+.-+..+..+... -+++|...+...|..++..++..+ .|.+++.|+|... + ..+...+.... .-++. T Consensus 108 tidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalvegdgtng-f--ksidkeadvkkikkitt 184 (318) T protein:vir:94 108 TIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-F--KSIDKEADVKKIKKITT 184 (318) T ss_pred eecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeeeecCCcch-h--hhhchhhhHHHHHHhhh Confidence 2223333333344444432 356788888999999999988865 5778888988531 1 11111111100 00001 Q ss_pred cccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhcc--CCc-eeecccccCccceEecCccccCCCC-- Q lcl|Aclame:pro 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDA--NGN-PVFRDDSFAGFRTFFNRNGAWDADA-- 228 (305) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~--~G~-~l~~~~~l~G~pv~~~~~~~~~~~~-- 228 (305) ....+....+.+.+..+..-+.+...+.--++-.......|..++-+ +.+ .+-.+++--...+-+.+.+.....+ T Consensus 185 kaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddteiasevgvdeiivytgskav 264 (318) T protein:vir:94 185 KAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAV 264 (318) T ss_pred hhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhcccceEEeccchhhhhhcCcceeEEeeccccc Confidence 11111111223333333333322211111222222222333333322 211 2222222111111111111111111 Q ss_pred 
ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeee--eecCcEEEEEEEEEccEeecccceEEEecc Q lcl|Aclame:pro 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINL--AERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) Q Consensus 229 ~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~--~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t 296 (305) .+-++.|.++ .+. -.++ ..++. |..|.-.+.++.--.+-|.--.+-+.++.. T Consensus 265 kptvlvdqky-hid-mqdl--------------tkvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 265 KPTVLVDQKY-HID-MQDL--------------TKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred cceeEeccce-ecc-hhhh--------------hhhhceeeccCCceEEEEecccCcceeecCceeEEeC Confidence 1122233221 110 0010 01112 222333333443333333222222222222 No 233 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=59.26 E-value=0.4 Score=22.79 Aligned_cols=287 Identities=12% Similarity=0.066 Sum_probs=116.1 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe----CCC---------------------- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA----TLP---------------------- 54 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~----~~~---------------------- 54 (305) .+..+++..=.-.-+.+. .+..++..+-+..+++.+-||++++.-|.-.. 
..+ T Consensus 79 i~~st~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga 157 (529) T protein:vir:10 79 IAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSL 157 (529) T ss_pred cccccccccccccCchhh-hhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccc Confidence 233232221111122221 23333444555567777777764421111000 000 Q ss_pred -------------------------------------------------c----------------------eeeeecch Q lcl|Aclame:pro 55 -------------------------------------------------E----------------------ADWVGESA 63 (305) Q Consensus 55 -------------------------------------------------~----------------------a~~v~E~~ 63 (305) . ..-++++- T Consensus 158 ~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~Gm 237 (529) T protein:vir:10 158 ATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGM 237 (529) T ss_pred ccccccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccch Confidence 0 00000000 Q ss_pred h------------hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|Aclame:pro 64 T------------DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIF 127 (305) Q Consensus 64 ~------------~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~ 127 (305) . ......++...+++++++..+..+-...+|-||.+|- ..|.+..|.+-|+..|...+++.||. 
T Consensus 238 sTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~ 317 (529) T protein:vir:10 238 ATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVID 317 (529) T ss_pred hhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 0 0011245666778888887777777888999999984 45788999999999999999988885 Q ss_pred Ccc------cCcCcccccccccccccccceeecccchhh---hHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHH Q lcl|Aclame:pro 128 GTD------KPASWVSPALIPAAVTAGQAVEVVGGVANE---SDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVAN 196 (305) Q Consensus 128 G~g------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~ 196 (305) --- +..+....+...+.............-... ..++-.+......+.. .....+.+++++.+...|.. T Consensus 318 ~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~ 397 (529) T protein:vir:10 318 WINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL 397 (529) T ss_pred HHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHh Confidence 321 111111111101111110000000111111 1222223333333332 22346778999999999874 Q ss_pred h----------------hccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEE----Eeecce Q lcl|Aclame:pro 197 I----------------RDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVK----FLDQAT 256 (305) Q Consensus 197 ~----------------kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~----~~~~~~ 256 (305) . .|.++ .++..-.-.|++|+++.+.+.+ .+.+|.++...++ ...+.. 
T Consensus 398 ~~~~~~~~~~~~~sg~~~d~~~-~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~~~glfy~PYv~ 464 (529) T protein:vir:10 398 IDTNISPAAQGMASGLNADTTK-GVFAGILGGRYKVYIDQYARQD------------YFTMGYRGANNLDAGIYYCPYVA 464 (529) T ss_pred hcccccccccccccccccccCC-ceEEEEecCceEEEecCCCCcc------------eEEEEEeCCcccccceeeccccc Confidence 2 11111 1122223445677776664321 1222332222111 111100 Q ss_pred eccCc-ceeeeeecCcEEEEEEEEEccEeecccce--------EEEeccccccccCCC Q lcl|Aclame:pro 257 LGTGE-NQINLAERDMVALRLKARFAYVLGVSATA--------QGANKTPVAVVAPAA 305 (305) Q Consensus 257 ~~~~~-~~~~~~~~~~~~~r~~~r~~~~v~~p~a~--------~~~~~t~~a~v~~a~ 305 (305) ++.-. ..-..|+- .+=...|++.. .+|-+- .++++.+..--..-+ T Consensus 465 l~~~~~~dp~sfqP---~~g~~tRY~l~-~NP~~~~~~~~~~~r~~~g~~~~~~ag~n 518 (529) T protein:vir:10 465 LTPLRGFDPKNFQP---VMGFKTRYAIG-VNPFAESRTQAPQGRITSGMPGVNSVGKN 518 (529) T ss_pred cccccccCCCcccc---eeeeeeeecee-ecCccccccccccccccCCcchhhhcCcc Confidence 00000 00001221 22234455542 344221 122222211111111 No 234 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=54.72 E-value=0.5 Score=22.25 Aligned_cols=284 Identities=9% Similarity=0.050 Sum_probs=117.9 Q ss_pred CCC-ccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCc-------eEE------------------------- Q lcl|Aclame:pro 1 MAD-ISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-------THL------------------------- 47 (305) Q Consensus 1 Ma~-~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~-------~~~------------------------- 47 (305) .+. 
.++++-...=|.-+ .+..++..+-+..+++.+-||++++ .++ T Consensus 78 i~es~~t~~v~~~~P~Li--~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fse 155 (528) T protein:vir:66 78 IAAGQTTGAITNVGPAVI--GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSS 155 (528) T ss_pred ccccccccccccCchhHH--HHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcccccccccccccccccccccc Confidence 222 22222221222221 2233444455556777788876621 000 Q ss_pred ----------------EEEe------CCCc-------------------------------------------eeee--- Q lcl|Aclame:pro 48 ----------------PVLA------TLPE-------------------------------------------ADWV--- 59 (305) Q Consensus 48 ----------------p~~~------~~~~-------------------------------------------a~~v--- 59 (305) .... .... +.-+ T Consensus 156 a~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~G 235 (528) T protein:vir:66 156 LAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFG 235 (528) T ss_pred cccccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccc Confidence 0000 0000 0000 Q ss_pred -----ecchh----hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 60 -----GESAT----DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVI 126 (305) Q Consensus 60 -----~E~~~----~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l 126 (305) +|... 
......++...+++++++..+..+-...+|-||.+|- ..|.+..|.+-|+..|...+++.|| T Consensus 236 m~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii 315 (528) T protein:vir:66 236 MATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIV 315 (528) T ss_pred cchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 01000 0011246677788888888888777889999999984 4688999999999999999999996 Q ss_pred cCcccCcCccccccc----cc--ccccccceeecccc---hhhhHHHHHHHHHHHHhhhc--cccceEEEEchHHHHHHH Q lcl|Aclame:pro 127 FGTDKPASWVSPALI----PA--AVTAGQAVEVVGGV---ANESDIVGATNRAAKAVASA--GWAPDTLLSSLALRYEVA 195 (305) Q Consensus 127 ~G~g~~~~~~~~~~~----~~--~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~~~~~l~ 195 (305) .=........-.+.. .. ...........+.- ..+..++-.+......+... ....+.+++++++...|. T Consensus 316 ~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~ 395 (528) T protein:vir:66 316 DVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILA 395 (528) T ss_pred hhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHh Confidence 422111000000000 00 00000000001100 11122222233333343332 224467899999999987 Q ss_pred Hhh-----ccC-Cceee---------cccccCccceEecCccccCCCCceEEEEehh-------hEEEEeecCcEEEEee Q lcl|Aclame:pro 196 NIR-----DAN-GNPVF---------RDDSFAGFRTFFNRNGAWDADAAIEVIADSS-------RVKIGVRQDITVKFLD 253 (305) Q Consensus 196 ~~k-----d~~-G~~l~---------~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~-------~~~~~~~~~i~v~~~~ 253 (305) ..- +.. ....+ ..-.-.|++|+++.+.+.+ -+++|--. 
-|+--.-.+.-+...+ T Consensus 396 ~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~vG~KG~~~~~~glfyaPYv~l~~~~~~d 471 (528) T protein:vir:66 396 SADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQD----YFTVGYKGDNEMDAGIYYAPYVALTPLRATD 471 (528) T ss_pred hccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcc----eEEEEEeCCcccccceeecccccceeeEeeC Confidence 531 111 11111 1112346677777664322 12222110 0110111111122222 Q ss_pred cceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 254 QATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ..+ |+- .+=...|++. ..+|-+- ..+..|.+-+.-+. T Consensus 472 p~s----------fqP---~~g~~tRY~l-~vNP~~~-~~~~~~~~ri~~g~ 508 (528) T protein:vir:66 472 PQS----------FHP---VLGFKTRYGI-GINPFAD-SKSQEPSARITSGM 508 (528) T ss_pred Ccc----------ccc---eeeeeeeece-eecCccc-ccCccccccccccc Confidence 222 221 1223345554 3355221 11122222222222 No 235 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=54.15 E-value=0.51 Score=22.18 Aligned_cols=293 Identities=11% Similarity=0.061 Sum_probs=121.8 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhh-----hhhcceeecCCCceEEEEEeCCCceeeeecchhh--cccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTV-----LSAFQNVNMGTKTTHLPVLATLPEADWVGESATD--PKGVKPTS 73 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l-----~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~--~~~~~~~~ 73 (305) -+..+.+++...++..-. +.+...+ ....++..+.++++++-|-.++..|.-+.++.+. -+...++. 
T Consensus 69 ta~a~a~~T~l~ve~~~~------f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEG 142 (418) T protein:vir:10 69 TAEAAADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEG 142 (418) T ss_pred EEEEecCceEEEEcCcce------eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccc Confidence 111111112222322211 2222221 1133445556777787776666555544444421 11112333 Q ss_pred cccceeEEeeeeeEEEeehhhHHHhhcCH-----------HH-HHHHHHHHHHHHHHHHHHHHHHcCc----ccCcC--c Q lcl|Aclame:pro 74 KVTWANRTLVAEEIAVIIPVHENVIDDAT-----------VA-VLTEVAELGGQAIGKKLDQAVIFGT----DKPAS--W 135 (305) Q Consensus 74 ~~~f~~v~~~~~k~~~~~~is~ell~ds~-----------~~-~~~~v~~~la~~~a~~~d~a~l~G~----g~~~~--~ 135 (305) +...+....+...+.-+..|=++..+-|. .+ +++....++-+ +..+|+++|+|. ++..+ - T Consensus 143 sd~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~--av~iEkalI~G~~~~~~~~~g~~R 220 (418) T protein:vir:10 143 SQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH--ATEQETAIFFGQAFMGTYNGQPLH 220 (418) T ss_pred cccCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHHH--HHHHHHHHhcccccCCCcCCcchh Confidence 33333333333333333333333322211 12 23333333333 358899999995 22222 2 Q ss_pred ccccccccccc--cccceeecc-cchhhhHHHHHHHHHHHHhhhccccc----eEEEEchHHHHHHHHhhccCCceee-c Q lcl|Aclame:pro 136 VSPALIPAAVT--AGQAVEVVG-GVANESDIVGATNRAAKAVASAGWAP----DTLLSSLALRYEVANIRDANGNPVF-R 207 (305) Q Consensus 136 ~~~~~~~~~~~--~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~~~~~~l~~~kd~~G~~l~-~ 207 (305) ...|++..... .++.++... +..+.+.+.+.+.++...-.+.+... =..+++++....+.++- +.... 
+ T Consensus 221 ~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~---~~I~~~~ 297 (418) T protein:vir:10 221 TTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF---GEVTVTQ 297 (418) T ss_pred hHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh---hheeecc Confidence 33455433321 123333222 24566777777666543111122111 24677888888887763 22111 1 Q ss_pred ccccCcc---------ceE-ecCcc---ccCCCCceEEEEehhhEEEEee--cCcEEEEeecce----eccCcceeeeee Q lcl|Aclame:pro 208 DDSFAGF---------RTF-FNRNG---AWDADAAIEVIADSSRVKIGVR--QDITVKFLDQAT----LGTGENQINLAE 268 (305) Q Consensus 208 ~~~l~G~---------pv~-~~~~~---~~~~~~~~~~~gdf~~~~~~~~--~~i~v~~~~~~~----~~~~~~~~~~~~ 268 (305) .....|+ -.+ +..+. .....++.+++.|..+..+..- +.+..+...... +......+. .. T Consensus 298 ~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~-~~ 376 (418) T protein:vir:10 298 RETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYG-HG 376 (418) T ss_pred cceeeeEEEEEEEcceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCcccccccccccc-cc Confidence 1111121 111 11110 0123466788888877654432 333333321111 000000000 00 Q ss_pred cCcEEEEEEEEEccEeecccceEEEecc----ccccccCCC Q lcl|Aclame:pro 269 RDMVALRLKARFAYVLGVSATAQGANKT----PVAVVAPAA 305 (305) Q Consensus 269 ~~~~~~r~~~r~~~~v~~p~a~~~~~~t----~~a~v~~a~ 305 (305) .|.+.-.....+...+.+|.+.+++++- |..+.||-| T Consensus 377 ~D~~kG~iv~E~tLe~~N~~a~avitgl~~~~~~~~~t~p~ 417 (418) T protein:vir:10 377 VDAQGGSLTSEWALELLNPQGCAVITGLQKAKERVYLTAPA 417 (418) T ss_pred cccccceEEEEeeeeeecccceEEeeccceecccccCCCCC Confidence 1222222334566677999999999864 333333333 No 236 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=52.17 E-value=0.56 Score=21.96 Aligned_cols=287 Identities=11% Similarity=0.050 Sum_probs=116.3 Q ss_pred 
CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE-------EeCC-------------------- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPV-------LATL-------------------- 53 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~-------~~~~-------------------- 53 (305) .+..+++..=.-.-+.+. .+..++..+-+..+++.+-||++.+.-|.- .... T Consensus 79 i~est~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga 157 (529) T protein:vir:10 79 IAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSL 157 (529) T ss_pred cccccccccccccCchhh-hhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccc Confidence 233332221111122221 223334444455567777777543211000 0000 Q ss_pred ----------------------------------------------------------------------Cceeeeecch Q lcl|Aclame:pro 54 ----------------------------------------------------------------------PEADWVGESA 63 (305) Q Consensus 54 ----------------------------------------------------------------------~~a~~v~E~~ 63 (305) ....-++++- T Consensus 158 ~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm 237 (529) T protein:vir:10 158 ATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGM 237 (529) T ss_pred cccccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCccccccccccccccccccccccc Confidence 0000000010 Q ss_pred h------------hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|Aclame:pro 64 T------------DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIF 127 (305) Q Consensus 64 ~------------~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~ 127 (305) . ......++...+++++++..+..+-...+|-||.+|- ..|.+..|.+-|+..|...+++.||. 
T Consensus 238 ~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~ 317 (529) T protein:vir:10 238 ATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVID 317 (529) T ss_pred chhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 0 0011246667778888888887777889999999984 45789999999999999999988885 Q ss_pred Ccc------cCcCcccccccccccccccceeecccchhh---hHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHH Q lcl|Aclame:pro 128 GTD------KPASWVSPALIPAAVTAGQAVEVVGGVANE---SDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVAN 196 (305) Q Consensus 128 G~g------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~ 196 (305) --- +..+....+...+.............-... ..++-.+......+.. .....+.+++++.+...|.. T Consensus 318 ~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~ 397 (529) T protein:vir:10 318 WINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL 397 (529) T ss_pred hHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHh Confidence 321 111111111111111110000000111111 1222223333333332 22346778999999999874 Q ss_pred h----------------hccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEE----Eeecce Q lcl|Aclame:pro 197 I----------------RDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVK----FLDQAT 256 (305) Q Consensus 197 ~----------------kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~----~~~~~~ 256 (305) . 
.|.++ .++..-.-.|++|+++.+.+.+ -+ .+|.++...++ ...+.- T Consensus 398 ~~~~~~~~~~~~~sg~~~d~~~-~~~~G~l~~~~~vy~D~y~~~d----y~--------~vG~KG~~~~~~glfy~PYv~ 464 (529) T protein:vir:10 398 IDTNISPAAQGMASGLNADTTK-GVFAGILGGRYKVYIDQYARQD----YF--------TMGYRGANNLDAGIYYCPYVA 464 (529) T ss_pred hhhhccccccccccccccccCC-ceEEEEecCceEEEecCCCCcc----eE--------EEEEeCCcccccceeeccccc Confidence 2 11111 1122223445677777664321 12 22222221111 111110 Q ss_pred eccC-cceeeeeecCcEEEEEEEEEccEeecccce--------EEEeccccccccCCC Q lcl|Aclame:pro 257 LGTG-ENQINLAERDMVALRLKARFAYVLGVSATA--------QGANKTPVAVVAPAA 305 (305) Q Consensus 257 ~~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~--------~~~~~t~~a~v~~a~ 305 (305) ++.- ...-..|+- .+=...|++.. .+|-+- .++++.+..--..-+ T Consensus 465 l~~~~~~dp~sfqP---~~g~~tRY~l~-~NP~~~~~~~~~~~r~~~g~~~~~~ag~n 518 (529) T protein:vir:10 465 LTPLRGSDPKNFQP---VMGFKTRYAIG-VNPFAESRTQAPQGRITSGMPGVNSVGKN 518 (529) T ss_pred cccccccCCCcccc---eeeeeeeecee-ecCccccccccccccccCCcchhhhcCcc Confidence 0000 000001221 12234455543 344221 122222211111111 No 237 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=50.96 E-value=0.59 Score=21.82 Aligned_cols=265 Identities=11% Similarity=0.043 Sum_probs=112.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcc------eeecCCCceEEEEEeCCCceeeeecchhhccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQ------NVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~------~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~ 74 (305) ||... .++.+...+.+.+...+....|+. 
+...++++++||+.+...-...--.+.-...+ .-+ T Consensus 1 MA~~n-------~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g---~~~ 70 (299) T protein:vir:79 1 MAALN-------YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQR---NYD 70 (299) T ss_pred Cccch-------hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCccccc---ccC Confidence 88533 247788888888888877655532 22345678999998654322221110001111 122 Q ss_pred ccceeEEeeeeeEEEeehhhHHHhhcCH--HHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCccccccccccccccccee Q lcl|Aclame:pro 75 VTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE 152 (305) Q Consensus 75 ~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 152 (305) .+....+++..|.-.+ .|..-=.+.+. ..+...+.+...+.++-.+|.-.+..--+. +...+. . T Consensus 71 ~~~~t~~ldqdr~~~f-~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~-----------a~~~g~--~ 136 (299) T protein:vir:79 71 NAWEPKVLTNQRKWST-LVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYAD-----------WTALGN--T 136 (299) T ss_pred cceeEEEeecccccee-ccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHh-----------hhhcCC--c Confidence 3455566666664333 22210011111 123344444455555555665444321000 000000 0 Q ss_pred ecccchhhhHHHHHHHHHHHHhhhccc--cceEEEEchHHHHHHHHhh------ccC-Cceee--cccccCccceEe--c Q lcl|Aclame:pro 153 VVGGVANESDIVGATNRAAKAVASAGW--APDTLLSSLALRYEVANIR------DAN-GNPVF--RDDSFAGFRTFF--N 219 (305) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~~k------d~~-G~~l~--~~~~l~G~pv~~--~ 219 (305) ......+.+++++.+.++...+..... ..-.++++|.++..|.+.. +.. ++... +-..+.|+||+. 
+ T Consensus 137 ~~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps 216 (299) T protein:vir:79 137 ADTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPS 216 (299) T ss_pred ccccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEech Confidence 112223445667777777777775543 2346788999999887532 111 11111 124688999874 2 Q ss_pred CccccC----CC------C--ceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeec- Q lcl|Aclame:pro 220 RNGAWD----AD------A--AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGV- 286 (305) Q Consensus 220 ~~~~~~----~~------~--~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~- 286 (305) +.+... .+ . --++++.. ...+...+--.+++. ..+.+ +.....+.-..+.|.=|.+ T Consensus 217 ~r~~t~~~~~~G~~~~~~ak~in~ii~~~-~a~~~~~K~~~~~~~-----~P~~~-----~~~~~~~~~r~y~d~~v~~n 285 (299) T protein:vir:79 217 NLMKTAYDFTTGWKVGAGAKQIFMSLVHP-SAIITPVSYQFSKLD-----EPTAV-----TEGKYFYFEESFEDVFILNK 285 (299) T ss_pred hhcCccceeccCccccCcccccceEEEcC-CeeeeeEeeeeEEee-----cCCCC-----Cccceeeeeeeeeeeeeecc Confidence 223211 01 0 11233322 222222111122221 11111 1111111112233333333 Q ss_pred ccceEEEecccccc Q lcl|Aclame:pro 287 SATAQGANKTPVAV 300 (305) Q Consensus 287 p~a~~~~~~t~~a~ 300 (305) ...-+.+..+.+.. 
T Consensus 286 k~~~i~~~~~~a~~ 299 (299) T protein:vir:79 286 KADAIQFVVEGAGA 299 (299) T ss_pred ccCeEEEEeeecCC Confidence 22233333333322 No 238 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=47.67 E-value=0.69 Score=21.45 Aligned_cols=255 Identities=11% Similarity=0.091 Sum_probs=113.0 Q ss_pred CC-----CccCCc-cceEccHHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEE---EeCCCceeeeecchhhcccc Q lcl|Aclame:pro 1 MA-----DISRAE-VASLIQEAYSDTLLAAAKQGST--VLSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGV 69 (305) Q Consensus 1 Ma-----~~t~~~-gg~lip~~~~~~i~~~~~~~~~--l~~l~~~~~~~~~~~~~p~---~~~~~~a~~v~E~~~~~~~~ 69 (305) |. +.++.. |+++=-+.+.+++......+.. +.+-..+.+..+--.+|-. ..+...+.+++|+.- T Consensus 45 ~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi----- 119 (514) T protein:vir:10 45 FTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGI----- 119 (514) T ss_pred hccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCccccccccccccc----- Confidence 21 122222 3333334444444333222222 2222233333332212211 223346667788753 Q ss_pred cccccccceeEEeeeeeEEEeehhhH--HHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCccc------CcCccccccc Q lcl|Aclame:pro 70 KPTSKVTWANRTLVAEEIAVIIPVHE--NVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK------PASWVSPALI 141 (305) Q Consensus 70 ~~~~~~~f~~v~~~~~k~~~~~~is~--ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g~------~~~~~~~~~~ 141 (305) .+.+++.+....+..+-++....+|. ++ .++..+......+.-...++..+|.+.|+|+.. +.+.+--|+. 
T Consensus 120 ~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l-~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~ 198 (514) T protein:vir:10 120 GDVNNPNERQRTINIKYIVDTHVTSIALQR-ANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLF 198 (514) T ss_pred CcCCCcceEEEEEeeeeeeeeeeeeehhhh-ccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHH Confidence 45667778877777776666655554 44 457778888888998999999999999998753 2334444444 Q ss_pred ccccccccceeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeecccccCccceEecCc Q lcl|Aclame:pro 142 PAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDDSFAGFRTFFNRN 221 (305) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~~l~G~pv~~~~~ 221 (305) +-+ ...+.+..-+.... .+.+..+...+...+..++-++|+..+.+.+..- .+.+.+|++..+ T Consensus 199 ~lI-~~~NvIDarG~~Ls----~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~------------~~~~qRV~~~~n 261 (514) T protein:vir:10 199 KLI-APENHIDLRGGRLS----PAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQ------------HLNGQRVMLPGQ 261 (514) T ss_pred Hhh-cCCCeEecCCCCcc----HHHHhhhhhhhhcccCChhheeCchHHHHHHhhc------------ccCcceEEeecC Confidence 443 23344443333333 2233333344444555666677787777766532 233444444433 Q ss_pred cccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEee-cccceEEEecccccc Q lcl|Aclame:pro 222 GAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLG-VSATAQGANKTPVAV 300 (305) Q Consensus 222 ~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~-~p~a~~~~~~t~~a~ 300 (305) .-. --.-.+.+.+ +..++.+.+.- .+ + ..-+.++++... .|.|-..-+ -++. 
T Consensus 262 ~~~-----~~~G~~v~~f-~s~~G~I~L~g---s~----------i------m~~~n~L~~~~~~~~~Ap~~~~--va~s 314 (514) T protein:vir:10 262 TGG-----MTTGLDIDKF-LSAHGSIRIQG---ST----------I------MDSDNKLDFDRPVSPTAPTAPQ--LSAT 314 (514) T ss_pred ccc-----eeeeeeccce-eEeccceeecC---Ce----------e------ecccccCccCCccCCcCCCCCc--ceEE Confidence 210 0000111222 22223332210 00 1 111112211111 111111000 0111 Q ss_pred ccCCC Q lcl|Aclame:pro 301 VAPAA 305 (305) Q Consensus 301 v~~a~ 305 (305) |||-+ T Consensus 315 vT~~~ 319 (514) T protein:vir:10 315 VTPDG 319 (514) T ss_pred EecCc Confidence 22221 No 239 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=43.98 E-value=0.82 Score=21.05 Aligned_cols=283 Identities=14% Similarity=0.136 Sum_probs=121.5 Q ss_pred CC-------CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe----C--C-------Cceeeee Q lcl|Aclame:pro 1 MA-------DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA----T--L-------PEADWVG 60 (305) Q Consensus 1 Ma-------~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~----~--~-------~~a~~v~ 60 (305) |+ +.++++-.. .-+.+. .+..++...-+..+++.+-||++++.-|.-.. . + ++..|-+ T Consensus 63 ~~~~n~~~~~~~t~~v~~-~~P~Li-~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fSg 140 (468) T protein:vir:10 63 IAPAGSALGSANTGGLAG-FDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTG 140 (468) T ss_pred cchhhhhhhhcccccccc-cCchhh-hhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceeccccccccc Confidence 22 122222111 223332 23344445556677888888876643332211 0 0 0001100 Q ss_pred ----------------------------------------------cchhh--cccccccccccceeEEeeeeeEEEeeh Q lcl|Aclame:pro 61 ----------------------------------------------ESATD--PKGVKPTSKVTWANRTLVAEEIAVIIP 92 (305) Q Consensus 61 ----------------------------------------------E~~~~--~~~~~~~~~~~f~~v~~~~~k~~~~~~ 92 (305) +++.. 
.....++...+++++++..+..+-... T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAe 220 (468) T protein:vir:10 141 GYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAE 220 (468) T ss_pred cccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceecc Confidence 00000 012245566677787777777777888 Q ss_pred hhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcc------cCcCcccccccccccccccceeecccchhhhH Q lcl|Aclame:pro 93 VHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTD------KPASWVSPALIPAAVTAGQAVEVVGGVANESD 162 (305) Q Consensus 93 is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (305) +|-||.+|- ..|.++.|.+-|+..|...+++.+|.--- +-.+....|+..-.. ...+...-+. T Consensus 221 YTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~-------~~~~rw~~e~ 293 (468) T protein:vir:10 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDV-------DSNGRWSVEK 293 (468) T ss_pred ccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccc-------cccchhHHHH Confidence 999999984 45788999999999999999988885321 111111111111000 0000001111 Q ss_pred HHHHHHHH---HHHh--hhccccceEEEEchHHHHHHHH---hh----------------ccCCceeecccccCccceEe Q lcl|Aclame:pro 163 IVGATNRA---AKAV--ASAGWAPDTLLSSLALRYEVAN---IR----------------DANGNPVFRDDSFAGFRTFF 218 (305) Q Consensus 163 ~~~~~~~~---~~~~--~~~~~~~~~~v~~~~~~~~l~~---~k----------------d~~G~~l~~~~~l~G~pv~~ 218 (305) ...++..+ +..+ .......+-+++++.+...|.. +. |.+|. 
++..-.-.|++|++ T Consensus 294 ~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~-~~~G~l~~r~~vy~ 372 (468) T protein:vir:10 294 FKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGN-LAVGTINGRIKVFV 372 (468) T ss_pred HHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcc-eEEEEecCceEEEE Confidence 11111221 1222 2233556779999999999985 22 11111 11111234667777 Q ss_pred cCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccC-cceeeeeecCcEEEEEEEEEccEeecccceEEE Q lcl|Aclame:pro 219 NRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTG-ENQINLAERDMVALRLKARFAYVLGVSATAQGA 293 (305) Q Consensus 219 ~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~ 293 (305) +.+.....+..-+++| .++..++ ....+.-++.. ...-..|+- .+=...|++. ..+|-+...- T Consensus 373 D~Ya~~~s~~dY~~vG--------~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP---~~g~~tRY~l-~~NP~~~~~~ 440 (468) T protein:vir:10 373 DPYAANLSDKHYYVIG--------YKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQP---KIGFKTRYGM-VSNPFVTTNG 440 (468) T ss_pred ccccccCCccceEEEE--------EecCcceeceeeeccccccccccccCCCcccc---eeeeeeeece-eecccceecc Confidence 7654433322333333 3222211 11111100000 000011222 2233456664 3456432221 Q ss_pred --eccccc-cccCCC Q lcl|Aclame:pro 294 --NKTPVA-VVAPAA 305 (305) Q Consensus 294 --~~t~~a-~v~~a~ 305 (305) ...|.+ ..++.+ T Consensus 441 ~~~g~~~~~~~~~~~ 455 (468) T protein:vir:10 441 LYNGTPDGEALTPNA 455 (468) T ss_pred ccCCCcccccccccc Confidence 111111 122233 No 240 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=42.12 E-value=0.9 Score=20.84 Aligned_cols=269 Identities=7% Similarity=-0.017 Sum_probs=96.2 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhh-hh-hhcceeecCCCceEEEEE-eCCCc-eeeeecchhhccccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGST-VL-SAFQNVNMGTKTTHLPVL-ATLPE-ADWVGESATDPKGVKPTSKVT 76 (305) Q Consensus 1 
Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~-l~-~l~~~~~~~~~~~~~p~~-~~~~~-a~~v~E~~~~~~~~~~~~~~~ 76 (305) |+.+.. .+-+.++..-|.+.-..... +. .+.+..++.+-...+... .+... +.++..+.+.. ...... T Consensus 1 M~~i~d----~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~----~~~r~~ 72 (348) T protein:vir:96 1 MGLIYD----KVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVT----IRDRVS 72 (348) T ss_pred Ccchhh----ccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcc----eecccc Confidence 876532 33345554433333222222 32 345555544333333221 22222 55666554321 122334 Q ss_pred ceeEEeeeeeEEEeehhhHHH------hhcC-HH----HHHHHHH---HHHHHHHHHHHHHHHH----cCcc--cCcCcc Q lcl|Aclame:pro 77 WANRTLVAEEIAVIIPVHENV------IDDA-TV----AVLTEVA---ELGGQAIGKKLDQAVI----FGTD--KPASWV 136 (305) Q Consensus 77 f~~v~~~~~k~~~~~~is~el------l~ds-~~----~~~~~v~---~~la~~~a~~~d~a~l----~G~g--~~~~~~ 136 (305) ++...+.+-.++....++.+- +.++ .. .+...+. ..+.+.+.+.+|..+. +|-= .+.+. T Consensus 73 ~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~- 151 (348) T protein:vir:96 73 AEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGV- 151 (348) T ss_pred eeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCe- Confidence 555555555555544444321 1111 11 1222222 2233455656663333 3310 01000 Q ss_pred ccccccccccccccee-ecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHH---hhcc----CCce-eec Q lcl|Aclame:pro 137 SPALIPAAVTAGQAVE-VVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN---IRDA----NGNP-VFR 207 (305) Q Consensus 137 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---~kd~----~G~~-l~~ 207 (305) ...+. -.....+.++ .........+.+.++..+...+...+..+..++|++..+..|++ +++. ++.. ... 
T Consensus 152 ~~~vd-fg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~ 230 (348) T protein:vir:96 152 NKDID-YGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVT 230 (348) T ss_pred eEEEe-ccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCcccccc Confidence 00000 0000111111 11222334566677777776666778888999999999999863 2221 1110 000 Q ss_pred ccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEcc----- Q lcl|Aclame:pro 208 DDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAY----- 282 (305) Q Consensus 208 ~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~----- 282 (305) +. ++.++ ++...|+.+.+.++.+...++.....|..|.+.+-.....|. T Consensus 231 ~~-------------------------~~~~~-~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~ 284 (348) T protein:vir:96 231 KA-------------------------ELQNY-VADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGT 284 (348) T ss_pred HH-------------------------HHHHH-HhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEecc Confidence 00 00000 001112222222222221122222233333332211111110 Q ss_pred ---------------------------Ee--ecccceEEEecc-ccc-cccCCC Q lcl|Aclame:pro 283 ---------------------------VL--GVSATAQGANKT-PVA-VVAPAA 305 (305) Q Consensus 283 ---------------------------~v--~~p~a~~~~~~t-~~a-~v~~a~ 305 (305) .+ .||......... |-. 
+..|-+ T Consensus 285 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~plPv~~~~~~ 338 (348) T protein:vir:96 285 TPEESDLFADNTVNADVEIVDSGIAVTTTKTTDPVNVQTKVSMVALPSFERLGD 338 (348) T ss_pred ChhhhhhhhcccccccceecCCeeEEEeeecCCCceEEEEEeeeeeccccCCCc Confidence 01 122221111111 000 111222 No 241 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=40.66 E-value=0.96 Score=20.68 Aligned_cols=281 Identities=11% Similarity=0.058 Sum_probs=120.3 Q ss_pred CC-CccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe----CC--------Cceeeeec------ Q lcl|Aclame:pro 1 MA-DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA----TL--------PEADWVGE------ 61 (305) Q Consensus 1 Ma-~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~----~~--------~~a~~v~E------ 61 (305) -. ++++++-... -+.+.. +..++...-+..+++.+-||++++.-|.-.. .. .++ +..| T Consensus 59 ~~~s~~t~~v~~~-~P~Li~-l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EA-l~nEadt~fS 135 (457) T protein:vir:10 59 TGGDTVTGPVAGF-DPVLIS-LIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEA-FFNEPNAGFS 135 (457) T ss_pred Ccccccccccccc-cchhhh-hhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccce-eeeccCcccC Confidence 11 1111111111 222222 3444555556677888888876543332211 00 000 0011 Q ss_pred ----------------------------------------------chhhcc----cccccccccceeEEeeeeeEEEee Q lcl|Aclame:pro 62 ----------------------------------------------SATDPK----GVKPTSKVTWANRTLVAEEIAVII 91 (305) Q Consensus 62 ----------------------------------------------~~~~~~----~~~~~~~~~f~~v~~~~~k~~~~~ 91 (305) ++...+ ...++...+++++++..+..+-.. 
T Consensus 136 g~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKA 215 (457) T protein:vir:10 136 GGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKA 215 (457) T ss_pred cccccccccccccccccccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceec Confidence 000000 113455556678777777777788 Q ss_pred hhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCc------ccCcCcccccccccccccccceeecccchhhh Q lcl|Aclame:pro 92 PVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGT------DKPASWVSPALIPAAVTAGQAVEVVGGVANES 161 (305) Q Consensus 92 ~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~------g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 161 (305) .+|-||.+|- ..|.++.|.+-|+..|...+++.||.-- |+..+....|+..-. ....+....+ T Consensus 216 EYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~-------~~~~g~~~~e 288 (457) T protein:vir:10 216 EYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLD-------VDSNGRWSVE 288 (457) T ss_pred cccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeee-------ccccchhhHH Confidence 8999999984 4578999999999999999999888631 111111111111110 0001111111 Q ss_pred HHHHHHHHH----HHH-hhhccccceEEEEchHHHHHHHH--hh---------------ccCCceeecccccCccceEec Q lcl|Aclame:pro 162 DIVGATNRA----AKA-VASAGWAPDTLLSSLALRYEVAN--IR---------------DANGNPVFRDDSFAGFRTFFN 219 (305) Q Consensus 162 ~~~~~~~~~----~~~-~~~~~~~~~~~v~~~~~~~~l~~--~k---------------d~~G~~l~~~~~l~G~pv~~~ 219 (305) ....++..+ ... ........+-+++++.+...|.. .- |..|. 
.+..-.-.|++|+++ T Consensus 289 ~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~-~~~G~l~~r~~vy~D 367 (457) T protein:vir:10 289 KFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSS-TLVGTLNGRIKVYVD 367 (457) T ss_pred HHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhccccccccccccc-eeEEEecCCeEEEEe Confidence 111111222 111 12334556778999999988875 21 11111 011112345677777 Q ss_pred CccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccC-cceeeeeecCcEEEEEEEEEccEeecccceEEEe Q lcl|Aclame:pro 220 RNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTG-ENQINLAERDMVALRLKARFAYVLGVSATAQGAN 294 (305) Q Consensus 220 ~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~-~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~ 294 (305) .+...+....-+++| .++...+ ....+.-+..- ...-..|+- .+=...|++. +.+|-.. .++ T Consensus 368 ~Ya~~ns~~dy~~vG--------~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP---~~g~~tRY~l-~~NP~~~-~~~ 434 (457) T protein:vir:10 368 PYSANVADKHFYVAG--------YKGTSPYDAGLFYCPYVPLQQVRAINPDTFQP---KIGFKTRYGM-VSNPFAG-GLT 434 (457) T ss_pred cccccCCccceEEEE--------EeCCcceecceeecccccccccCccCCccccc---eeeeeeeeee-eeccccc-ccc Confidence 554332222233333 2222211 11111111000 000011222 2334457766 5666533 222 Q ss_pred ccccccccCCC Q lcl|Aclame:pro 295 KTPVAVVAPAA 305 (305) Q Consensus 295 ~t~~a~v~~a~ 305 (305) ..+...+...= T Consensus 435 ~~~~~~~~~~n 445 (457) T protein:vir:10 435 QGSGALTVNAN 445 (457) T ss_pred cccccccccch Confidence 22222211111 No 242 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=40.36 E-value=0.97 Score=20.64 Aligned_cols=288 Identities=11% Similarity=0.086 Sum_probs=115.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE----eCC---------------Cceeee-- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL----ATL---------------PEADWV-- 59 (305) Q Consensus 1 
Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~----~~~---------------~~a~~v-- 59 (305) .+..+++..=.-.-+.+. .+..++..+-+..+++.+-||++++.-|.-. ... +++.|- T Consensus 79 iaes~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~ 157 (521) T protein:vir:72 79 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQ 157 (521) T ss_pred ccccccccccccCCchhh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccc Confidence 333333222112222222 2334444555566778888877553221110 000 000000 Q ss_pred -------------------------------------------------------------------ecc------hh-- Q lcl|Aclame:pro 60 -------------------------------------------------------------------GES------AT-- 64 (305) Q Consensus 60 -------------------------------------------------------------------~E~------~~-- 64 (305) +++ +. T Consensus 158 ~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~ 237 (521) T protein:vir:72 158 GAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQE 237 (521) T ss_pred cccccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhc Confidence 000 00 Q ss_pred ----hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc Q lcl|Aclame:pro 65 ----DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV 136 (305) Q Consensus 65 ----~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~ 136 (305) ......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.+|.=.-...-.. 
T Consensus 238 ~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g 317 (521) T protein:vir:72 238 GFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVG 317 (521) T ss_pred ccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeee Confidence 0011245556666887777777777888999999984 45789999999999999999999984211000000 Q ss_pred cccccc------cccccccceeecccchhhhH---HHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhh--c---- Q lcl|Aclame:pro 137 SPALIP------AAVTAGQAVEVVGGVANESD---IVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIR--D---- 199 (305) Q Consensus 137 ~~~~~~------~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~k--d---- 199 (305) -.+... +.............-...+. ++--+......+.. .....+-+++++++...|...- | T Consensus 318 ~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~ 397 (521) T protein:vir:72 318 KSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAA 397 (521) T ss_pred eeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccc Confidence 001100 00000000000001111111 11122222223322 2245567899999999998531 1 Q ss_pred ---cCC------ceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEE----EEeecceeccC-cceee Q lcl|Aclame:pro 200 ---ANG------NPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITV----KFLDQATLGTG-ENQIN 265 (305) Q Consensus 200 ---~~G------~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v----~~~~~~~~~~~-~~~~~ 265 (305) ..| ..++..-.-.|++|+++.+.+.+ -++ +|.++...+ ....+.-+..- ...-. 
T Consensus 398 ~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----y~~--------vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 465 (521) T protein:vir:72 398 QGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQD----YFT--------VGYKGPNEMDAGIYYAPYVALTPLRGSDPK 465 (521) T ss_pred ccccccccccCCCceEEEEccCceEEEecCCCCcc----eEE--------EEEeCCcccccceeeccccccccccccCCc Confidence 111 01111122345677777664321 122 222222111 01110000000 00000 Q ss_pred eeecCcEEEEEEEEEccEeecccc-------eEEEeccccccccCCC Q lcl|Aclame:pro 266 LAERDMVALRLKARFAYVLGVSAT-------AQGANKTPVAVVAPAA 305 (305) Q Consensus 266 ~~~~~~~~~r~~~r~~~~v~~p~a-------~~~~~~t~~a~v~~a~ 305 (305) .|+- .+=...|++.. .+|-+ ..+++...-..-...+ T Consensus 466 sfqP---~~g~~tRY~l~-~NP~~~~~~~~~a~~i~~~~~~~~a~~~ 508 (521) T protein:vir:72 466 NFQP---VMGFKTRYGIG-INPFAESAAQAPASRIQSGMPSILNSLG 508 (521) T ss_pred cccc---eeeeeeeecee-ecCcccccCcccceeecCcChhhhcCcc Confidence 1221 12234455543 34421 2222222100000001 No 243 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=36.48 E-value=1.2 Score=20.21 Aligned_cols=286 Identities=11% Similarity=0.083 Sum_probs=124.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE-----EEeC--------------CCceeee-- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLP-----VLAT--------------LPEADWV-- 59 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p-----~~~~--------------~~~a~~v-- 59 (305) .+..+++.+=.-+-+.+. .+..++....+..+++.+-||++.+.-|. .... 
.+++.|- T Consensus 77 i~~~~~t~~v~~~~P~l~-~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~ 155 (519) T protein:vir:10 77 IAAGQTSGAVTQIGPAVM-GMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQ 155 (519) T ss_pred cccccccccccccchhHH-HHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCcc Confidence 333333332222222222 23344455556677888888876532221 1000 0000110 Q ss_pred -------------------------------------------------------------------ecchh-------- Q lcl|Aclame:pro 60 -------------------------------------------------------------------GESAT-------- 64 (305) Q Consensus 60 -------------------------------------------------------------------~E~~~-------- 64 (305) +++.. T Consensus 156 ~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~ 235 (519) T protein:vir:10 156 GAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQE 235 (519) T ss_pred ccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccc Confidence 00000 Q ss_pred ----hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcc Q lcl|Aclame:pro 65 ----DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWV 136 (305) Q Consensus 65 ----~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~ 136 (305) ......++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.+|.=.....-.. 
T Consensus 236 ~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~ 315 (519) T protein:vir:10 236 GFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVG 315 (519) T ss_pred cCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcc Confidence 0011245667778888888877777889999999984 45789999999999999999999995221111111 Q ss_pred ccccccc------ccccccceeecccch---hhhHHHHHHHHHHHHhhh--ccccceEEEEchHHHHHHHHhh------- Q lcl|Aclame:pro 137 SPALIPA------AVTAGQAVEVVGGVA---NESDIVGATNRAAKAVAS--AGWAPDTLLSSLALRYEVANIR------- 198 (305) Q Consensus 137 ~~~~~~~------~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~~k------- 198 (305) -.+..+. ............+-. .+..++-.+......+.. .....+-+++++++...|...- T Consensus 316 ~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~ 395 (519) T protein:vir:10 316 KSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAA 395 (519) T ss_pred eeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhcccc Confidence 1111110 000000000000000 111222223333333333 2233467899999999887542 Q ss_pred ---------ccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEE----EeecceeccC-ccee Q lcl|Aclame:pro 199 ---------DANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVK----FLDQATLGTG-ENQI 264 (305) Q Consensus 199 ---------d~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~----~~~~~~~~~~-~~~~ 264 (305) |.++ .++..-.-.|++|+++.+.+. 
..+.+|.++..+++ ...+.-+..- ...- T Consensus 396 ~~~~~~~~~d~~~-~~~~G~l~~~~~vy~D~y~~~------------dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp 462 (519) T protein:vir:10 396 QGLGQGFNVDTTK-AVFAGVLGGKYRVYIDQYARS------------DYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDP 462 (519) T ss_pred ccccccccccCCC-ceEEEEecCceEEEecCCCCc------------ceEEEEEecCcccccceeeccccccccccccCC Confidence 1111 111112234567776666442 12233443332221 1111111100 0001 Q ss_pred eeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 265 NLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 265 ~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ..||- .+=...|++. ..+|-+- ..+..+.+.+.=+- T Consensus 463 ~sfqP---~~g~~tRY~l-~~NP~~~-~~~~~~~~~i~~g~ 498 (519) T protein:vir:10 463 KNFQP---VMGFKTRYGI-GINPFAD-PAAQAPTKRIQNGM 498 (519) T ss_pred ccccc---eeeeeeeece-eecCccc-ccccCccceeccCc Confidence 11322 2233455554 3455221 22333333332221 No 244 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=36.28 E-value=1.2 Score=20.19 Aligned_cols=288 Identities=9% Similarity=0.019 Sum_probs=124.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhh-hcce------ee---c---CCCceEEEEEeCCCceeeeecchhhcc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLS-AFQN------VN---M---GTKTTHLPVLATLPEADWVGESATDPK 67 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~-l~~~------~~---~---~~~~~~~p~~~~~~~a~~v~E~~~~~~ 67 (305) ||.+....+.......++..+.....+.+.+.. +... .. . .+.++++..... 
-...+|.+.+..++ T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVH-LRGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeee-cccCCcccCceeec Confidence 998777666566667788888777777766554 4321 11 1 123344443221 12333444444333 Q ss_pred cccccccccceeEEeeeeeEEEeehhhHHHh-hcCHHHHHHHHHHHHHHHHHHHHHHHHH-cCcccCcCccccccccccc Q lcl|Aclame:pro 68 GVKPTSKVTWANRTLVAEEIAVIIPVHENVI-DDATVAVLTEVAELGGQAIGKKLDQAVI-FGTDKPASWVSPALIPAAV 145 (305) Q Consensus 68 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~~~~~v~~~la~~~a~~~d~a~l-~G~g~~~~~~~~~~~~~~~ 145 (305) . +..++|.+-++.+..+..-+.....+- +-+..+|...-++.|..-+++..|+.+| +-.|+-.-..+........ T Consensus 80 n---ee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~ 156 (364) T protein:vir:93 80 K---EESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFT 156 (364) T ss_pred c---ccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcc Confidence 2 345667776666666655555433333 2467899999999999999999998666 3333110000000000000 Q ss_pred c-cccce--------------ee-cccchhhhHHHHHHHHHHHHhhhcc----------------ccceEEEEchHHHHH Q lcl|Aclame:pro 146 T-AGQAV--------------EV-VGGVANESDIVGATNRAAKAVASAG----------------WAPDTLLSSLALRYE 193 (305) Q Consensus 146 ~-~~~~~--------------~~-~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~v~~~~~~~~ 193 (305) . ..+.+ +. ..-..++.--++.+.++.......+ ...-.+++||..+.. 
T Consensus 157 ~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) T protein:vir:93 157 GYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATD 236 (364) T ss_pred cccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhh Confidence 0 00000 00 0000111112344444444332211 112267899999988 Q ss_pred HHHhhc--------------cCCceeecc--cccCccceEecCccc----cCCCCce-----EEEEehhhE-EEEeecCc Q lcl|Aclame:pro 194 VANIRD--------------ANGNPVFRD--DSFAGFRTFFNRNGA----WDADAAI-----EVIADSSRV-KIGVRQDI 247 (305) Q Consensus 194 l~~~kd--------------~~G~~l~~~--~~l~G~pv~~~~~~~----~~~~~~~-----~~~gdf~~~-~~~~~~~i 247 (305) |+.-+| ...+|||.. .++.|..+.-...+. ...+... +++|--.-. .++-.+|. T Consensus 237 Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g~ 316 (364) T protein:vir:93 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGL 316 (364) T ss_pred hhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCCC Confidence 874332 123577765 356676554333332 1111111 223322211 12222333 Q ss_pred EEEEeecceeccCcceeeeeec-CcEEEEEEEEEccEeec--ccceEEEeccccccccCCC Q lcl|Aclame:pro 248 TVKFLDQATLGTGENQINLAER-DMVALRLKARFAYVLGV--SATAQGANKTPVAVVAPAA 305 (305) Q Consensus 248 ~v~~~~~~~~~~~~~~~~~~~~-~~~~~r~~~r~~~~v~~--p~a~~~~~~t~~a~v~~a~ 305 (305) .....++. 
|.+ |...|-+..-+|+...+ ..-|-.+..-.+++-- + T Consensus 317 ~~~w~Ee~-----------~D~gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~~--~ 364 (364) T protein:vir:93 317 RFDWEETV-----------KDYGNEPAIAAGFIAGMKKARFNNKDFGVISIDTAAKKH--S 364 (364) T ss_pred Cceeeecc-----------cCCCCchhhhhhhHhhhhhcccCCccceEEEeccccccc--C Confidence 33222221 111 23333333333333322 1111111110000000 0 No 245 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=35.65 E-value=1.2 Score=20.11 Aligned_cols=262 Identities=14% Similarity=0.087 Sum_probs=110.3 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhh--hhhhhcceeecCCCceEEEEEe---CCCceeeeecchhhcccccccccc Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGS--TVLSAFQNVNMGTKTTHLPVLA---TLPEADWVGESATDPKGVKPTSKV 75 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~--~l~~l~~~~~~~~~~~~~p~~~---~~~~a~~v~E~~~~~~~~~~~~~~ 75 (305) =+.+.+ |+++=-+.+.+++......+. .+.+-..+.+..+--.+|-... +...-..+.|++ ..+.+++ T Consensus 18 ~~a~~~--g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~~g~s~~~E~~-----l~~~~d~ 90 (470) T protein:vir:10 18 NAAGQV--AESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGG-----LPRTVEV 90 (470) T ss_pred HHhhhc--chhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhccccccccceeecccc-----cCccCCC Confidence 111111 222212222222211111111 1111122222222211221111 222222345554 3556778 Q ss_pred cceeEEeeeeeEEEeehhhHHH---hhcCHHHHHHHHHHHHHHHHHHHHHHHHHcCcc--------cCcCcccccccccc Q lcl|Aclame:pro 76 TWANRTLVAEEIAVIIPVHENV---IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTD--------KPASWVSPALIPAA 144 (305) Q Consensus 76 ~f~~v~~~~~k~~~~~~is~el---l~ds~~~~~~~v~~~la~~~a~~~d~a~l~G~g--------~~~~~~~~~~~~~~ 144 (305) .+.+..+..+=++....+|.-+ ++....+++..+.+.---.+++.+|.++|.||. 
...+.+--|+.+-+ T Consensus 91 ~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lI 170 (470) T protein:vir:10 91 NVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINII 170 (470) T ss_pred ceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhhc Confidence 8999999888888888888753 233344788888888888899999999999954 33444444554422 Q ss_pred cc--cccceeecccchhhhHHHHHHHHHHHHhh--hccccceEEEEchHHHHHHHHhhccCCceeeccc---ccCccceE Q lcl|Aclame:pro 145 VT--AGQAVEVVGGVANESDIVGATNRAAKAVA--SAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD---SFAGFRTF 217 (305) Q Consensus 145 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~---~l~G~pv~ 217 (305) .. ..++...-+.... .+.+..+...+. ..+..++-++|+..+.+.|..--...-|.+.++. ...|+|+- T Consensus 171 d~~~~~NViDarG~~Ls----~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~~~G~~v~ 246 (470) T protein:vir:10 171 KRGAPQNVLDAGGRPLS----IDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAGLLGADAQ 246 (470) T ss_pred cCCCCccccccCCCCcc----HHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCceeeeeecc Confidence 21 2334333333332 233444444443 3555667788899998888766555555444422 23344321 Q ss_pred ecCccccCCCCceEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEccEeecccceEEEecc- Q lcl|Aclame:pro 218 FNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT- 296 (305) Q Consensus 218 ~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t- 296 (305) ..+ .-++.+.+-.+. .+.+ +. 
.....|....++ .+.-|...+.+..+ T Consensus 247 --~f~-------------------sa~G~I~L~~s~--~m~~-------~~-k~~p~~l~~~v~-~~aAP~~~~tv~~t~ 294 (470) T protein:vir:10 247 --SYI-------------------GVRGEHSLYPSQ--FLGD-------FH-KFNPARFGAEVG-DFAAPSNSWTVSTTD 294 (470) T ss_pred --cee-------------------eeeeeeeecccc--cccc-------hh-hcCcccCCcccC-CcccCceeEEeecCC Confidence 111 111222111000 0000 00 000000000000 00112211111111 Q ss_pred ccccccCCC Q lcl|Aclame:pro 297 PVAVVAPAA 305 (305) Q Consensus 297 ~~a~v~~a~ 305 (305) +.+...+++ T Consensus 295 ~~~a~~~~s 303 (470) T protein:vir:10 295 NFVTLPYNS 303 (470) T ss_pred CceeecccC Confidence 111111111 No 246 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=33.21 E-value=1.4 Score=19.83 Aligned_cols=287 Identities=13% Similarity=0.099 Sum_probs=116.5 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE----eCC-------------Cceeeee--- Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL----ATL-------------PEADWVG--- 60 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~----~~~-------------~~a~~v~--- 60 (305) .++.+++..=.-+-+.+. .+..++..+-+..+++.+-||++++.-|.-. ... +++.|-+ T Consensus 76 ia~s~~t~~v~~~~P~ll-~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~ 154 (514) T protein:vir:56 76 IAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAA 154 (514) T ss_pred cccccccccccccchhHH-HHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCcccccc Confidence 333333221111222221 2334444555566788888887653222110 000 0011100 Q ss_pred -------------------------------------------------------------------cc------hh--- Q lcl|Aclame:pro 61 -------------------------------------------------------------------ES------AT--- 64 (305) Q Consensus 61 -------------------------------------------------------------------E~------~~--- 64 (305) ++ +. 
T Consensus 155 ~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~ 234 (514) T protein:vir:56 155 ASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQEN 234 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhccc Confidence 00 00 Q ss_pred ---hcccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHH---cCccc-Cc Q lcl|Aclame:pro 65 ---DPKGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVI---FGTDK-PA 133 (305) Q Consensus 65 ---~~~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l---~G~g~-~~ 133 (305) ......++...+++++++..+...-...+|-||.+|- ..|.++.|.+-|+..|...+++.|| +-.-. ++ T Consensus 235 lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~ 314 (514) T protein:vir:56 235 FNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGK 314 (514) T ss_pred CCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehh Confidence 0011245666777887777777777888999999984 4578999999999999999999995 21110 00 Q ss_pred Ccccccccccccccccc-eeecccchhhhHHHHHH---HHHHHHhh--hccccceEEEEchHHHHHHHHh---------- Q lcl|Aclame:pro 134 SWVSPALIPAAVTAGQA-VEVVGGVANESDIVGAT---NRAAKAVA--SAGWAPDTLLSSLALRYEVANI---------- 197 (305) Q Consensus 134 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~v~~~~~~~~l~~~---------- 197 (305) .....++.+........ ....+.-...+....+. .+....+. ......+-+++++.+...|... 
T Consensus 315 ~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g 394 (514) T protein:vir:56 315 SGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQG 394 (514) T ss_pred cccccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccC Confidence 00001111100000000 00001111111111111 11122222 1223567789999999998741 Q ss_pred -------hccCCceeecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEE----EeecceeccCc-ceee Q lcl|Aclame:pro 198 -------RDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVK----FLDQATLGTGE-NQIN 265 (305) Q Consensus 198 -------kd~~G~~l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~----~~~~~~~~~~~-~~~~ 265 (305) .|.. .+++..-.-.|++|+++.+.+.+ .+.+|.++..+++ ...+.-++.-. ..-. T Consensus 395 ~~~~~~~~d~~-~~~~aG~l~~~~~vy~D~y~~~d------------y~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 461 (514) T protein:vir:56 395 MQDGSMNTDTN-QTVFAGVLGGRFKVYIDQYAVND------------YFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSK 461 (514) T ss_pred ccccccccccC-cceEEEEecCceEEEecCCCCcc------------eEEEEEecCcceecceeeccccccccccccCCc Confidence 1111 12222223456677777765421 1222332222111 11111110000 0001 Q ss_pred eeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 266 LAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 266 ~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) .|+- .+=...|++. 
..+|-+=..-...+.+--.|-+ T Consensus 462 sfqP---~~g~~tRY~l-~~NPy~~~~~~~~~~~~~~~~~ 497 (514) T protein:vir:56 462 NFQP---VIGFKTRYGV-QVNPFADPTASATKVGNGAPVA 497 (514) T ss_pred cccc---eeeeeeeece-eeCCCCCccccccccCCcchhh Confidence 1221 1223445554 3355210000000000001111 No 247 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=29.89 E-value=1.6 Score=19.43 Aligned_cols=284 Identities=15% Similarity=0.095 Sum_probs=103.6 Q ss_pred CCCccCCccc---eEccHHHHHHHHHH--------------HHhhhhhhhhcceee---cCCCceEEEEEe-CCCceeee Q lcl|Aclame:pro 1 MADISRAEVA---SLIQEAYSDTLLAA--------------AKQGSTVLSAFQNVN---MGTKTTHLPVLA-TLPEADWV 59 (305) Q Consensus 1 Ma~~t~~~gg---~lip~~~~~~i~~~--------------~~~~~~l~~l~~~~~---~~~~~~~~p~~~-~~~~a~~v 59 (305) |-..-..+.. .--.+.+.++--.. ....+.+.......+ .......+-... +..-..-. T Consensus 107 mRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~ 186 (462) T protein:vir:10 107 MRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPAGTYEVTGDATGMATAT 186 (462) T ss_pred eeeeccCCccccccccchhhhccCCcCccccccccccccccccccccccccccccceeecCCCccceecccccccccchh Confidence 1111000000 00001110000000 000000000000000 000000000000 00000001 Q ss_pred ecchhhc--ccccccccccceeEEeeeeeEEEeehhhHHHhhcC----HHHHHHHHHHHHHHHHHHHHHHHHHcCcc--- Q lcl|Aclame:pro 60 GESATDP--KGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDA----TVAVLTEVAELGGQAIGKKLDQAVIFGTD--- 130 (305) Q Consensus 60 ~E~~~~~--~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds----~~~~~~~v~~~la~~~a~~~d~a~l~G~g--- 130 (305) +|.-... 
....++...+++++++..+..+-...+|-||.+|- ..|.++.|.+-|+..|...+++.||.--- T Consensus 187 aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a 266 (462) T protein:vir:10 187 AEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNA 266 (462) T ss_pred ccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhh Confidence 1211110 12357778888998888888888889999999984 45789999999999999999998885321 Q ss_pred ---cCcCcccccccccccccccceeecccchhhhHHHHHHHHHH---HHh--hhccccceEEEEchHHHHHHHHhh--c- Q lcl|Aclame:pro 131 ---KPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAA---KAV--ASAGWAPDTLLSSLALRYEVANIR--D- 199 (305) Q Consensus 131 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~~~~v~~~~~~~~l~~~k--d- 199 (305) +-.+....|+..-. ....+-...+....++..+. ..+ .......+-+++++++...|...- + T Consensus 267 ~~~k~~~~~~~Gv~dl~-------~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~ 339 (462) T protein:vir:10 267 VKGAIANTATDGIFDLD-------VDSNGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDY 339 (462) T ss_pred eeeecccccccceeeec-------cccchHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhc Confidence 11111111111100 00001111111112222221 111 223345677899999999885321 0 Q ss_pred c---CCce----------eecccccCccceEecCccccCCCCceEEEEehhhEEEEeecCcEEE----EeecceeccC-c Q lcl|Aclame:pro 200 A---NGNP----------VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVK----FLDQATLGTG-E 261 (305) Q Consensus 200 ~---~G~~----------l~~~~~l~G~pv~~~~~~~~~~~~~~~~~gdf~~~~~~~~~~i~v~----~~~~~~~~~~-~ 261 (305) . +++. .+..-.-.|++|+++.+...+....-+++ |.++...++ ...+.-+..- . 
T Consensus 340 ~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~v--------G~KG~~~~~~glfy~PYv~l~~~~~ 411 (462) T protein:vir:10 340 APGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVA--------GYKGTSPYDAGLFYCPYVPLQQVRA 411 (462) T ss_pred cccccccccccccccccceeEEEecCceEEEEecccCCCcccceEEE--------EEeCCcccccceeeccccccccccc Confidence 0 1110 11111234567777665432222222333 332222111 1111100000 0 Q ss_pred ceeeeeecCcEEEEEEEEEccEeecccceEEEeccccccccCCC Q lcl|Aclame:pro 262 NQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) Q Consensus 262 ~~~~~~~~~~~~~r~~~r~~~~v~~p~a~~~~~~t~~a~v~~a~ 305 (305) ..-..|+- .+=...|++. ..+|-+- ..+.++.+ +..+. T Consensus 412 ~dp~sfqP---~~g~~tRY~l-~~NP~t~-~~~~~~~~-~~~~~ 449 (462) T protein:vir:10 412 INPNTFQP---KIGFKTRYGM-VSNPFSG-GLTQGSGA-LTANA 449 (462) T ss_pred cCCccccc---eeeeeeeeee-eecCCCC-CcCCcccc-ccccC Confidence 00011222 1223345554 2344311 11111211 12222 No 248 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=29.04 E-value=1.7 Score=19.33 Aligned_cols=281 Identities=12% Similarity=0.002 Sum_probs=112.7 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCCceeeeecchhhcccccccccc--cce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKV--TWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~~~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~~~~--~f~ 78 (305) |+. +...+-+.+.+--+..-...-+-..+++.+|+...+.+|+++... ++--+........+....-+. +-. 
T Consensus 1 ~~~-----~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~-e~F~~~~t~r~~~~~~~~v~~~~~~~ 74 (309) T protein:vir:99 1 MSN-----APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLA-QGFTVPETLVGRKSKPNEVEFSATDE 74 (309) T ss_pred CCC-----CCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechh-hcccccchhhccCCCcceEeecccCc Confidence 443 333333344333233322333334568889998878888887531 111111111111111111111 123 Q ss_pred eEEeeeeeEEEeehhhHHHhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCcCcccccccccccccccceeeccc Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~d--s~~~~~~~v~~~la~~~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) .+.+..|-+. .+|..+-..+ +..+.++.-.+.+.+.+.+..|..+-.---++... +.+ ......+. .-. T Consensus 75 ~~~~~~~~L~--~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y-~~~--~k~~Lsgt----~~w 145 (309) T protein:vir:99 75 TGSTEDHGLD--APVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY-AAG--NKTTLSGA----DQW 145 (309) T ss_pred eeeeccccee--ecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhc-CCC--ceEEecCc----ccc Confidence 3344444444 4444444433 34677788788888888776664332211001000 000 00000000 001 Q ss_pred chhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHH-------hhccCCce-eeccc---ccCcc-ceEecCcccc Q lcl|Aclame:pro 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN-------IRDANGNP-VFRDD---SFAGF-RTFFNRNGAW 224 (305) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~-------~kd~~G~~-l~~~~---~l~G~-pv~~~~~~~~ 224 (305) .....|.+.++......+ +..++..++....|..|+. +|-..+.. +..+. .++|. .|++.+..-. 
T Consensus 146 sd~~SDPi~~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n 222 (309) T protein:vir:99 146 SDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLN 222 (309) T ss_pred CCCCCCcHHHHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceee Confidence 123345555665555443 6788999999999988764 23232222 12221 24444 3444332211 Q ss_pred CC--CCc---eEEEEehhhEEEEeecCcEEE-Eeecceec-----cCcceeeee-ecCcEEEEEEEEEccEeecccceEE Q lcl|Aclame:pro 225 DA--DAA---IEVIADSSRVKIGVRQDITVK-FLDQATLG-----TGENQINLA-ERDMVALRLKARFAYVLGVSATAQG 292 (305) Q Consensus 225 ~~--~~~---~~~~gdf~~~~~~~~~~i~v~-~~~~~~~~-----~~~~~~~~~-~~~~~~~r~~~r~~~~v~~p~a~~~ 292 (305) .. ++. .=+-|+..-+.+.....-+++ .+-..+++ .+......+ +..--.+|+...+.-.+.-+.+-.. T Consensus 223 ~a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~l 302 (309) T protein:vir:99 223 IARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) T ss_pred ccccccccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchh Confidence 10 110 001111111111111110000 00000000 000000000 1112236666666655666666666 Q ss_pred Eeccccc Q lcl|Aclame:pro 293 ANKTPVA 299 (305) Q Consensus 293 ~~~t~~a 299 (305) +....++ T Consensus 303 i~~~va~ 309 (309) T protein:vir:99 303 FENAVAA 309 (309) T ss_pred hhhcccC Confidence 6666555 No 249 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=20.74 E-value=2.7 Score=18.20 Aligned_cols=229 Identities=13% Similarity=0.180 Sum_probs=104.9 Q ss_pred CCCccCCccceEccHHHHHHHHHHHHhh-hhhhhhcceeecCCCceEEEEEeCCCc-eeeeecchhhcccccccccccce Q lcl|Aclame:pro 1 MADISRAEVASLIQEAYSDTLLAAAKQG-STVLSAFQNVNMGTKTTHLPVLATLPE-ADWVGESATDPKGVKPTSKVTWA 78 (305) Q Consensus 1 Ma~~t~~~gg~lip~~~~~~i~~~~~~~-~~l~~l~~~~~~~~~~~~~p~~~~~~~-a~~v~E~~~~~~~~~~~~~~~f~ 78 (305) 
|..+... -. .+-..+...+.+.+... +...+++.++|-++.+-+|.....-|. -.|+||-. ..+++-. T Consensus 1 M~i~~~~-l~-~l~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer~--------i~~l~~~ 70 (305) T protein:vir:19 1 MIVTPAS-IK-ALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKRT--------IQQMEAH 70 (305) T ss_pred CccCHHH-HH-HHHHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhccee--------eeecccc Confidence 4432211 00 01112222222222222 224556777775555556655543333 45776643 2344445 Q ss_pred eEEeeeeeEEEeehhhHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHc----Cccc----CcCcccccccccccccccc Q lcl|Aclame:pro 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDK----PASWVSPALIPAAVTAGQA 150 (305) Q Consensus 79 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~v~~~la~~~a~~~d~a~l~----G~g~----~~~~~~~~~~~~~~~~~~~ 150 (305) .-+++-+++..-+.|.++.++|-..++.+-+.++|+++.+...|+-++. |-.+ |.+++... + T Consensus 71 ~y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtD----------H 140 (305) T protein:vir:19 71 GYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKE----------H 140 (305) T ss_pred ceeEeeccccceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCC----------C Confidence 6677778888999999999999899999999999999999988877663 2110 11111100 0 Q ss_pred eeecccchhhhHHHHHHHHHHHHhhhccccceEEEEchHHHHHHHHhhccCCceeeccc-ccCccceEecCccccCCCCc Q lcl|Aclame:pro 151 VEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD-SFAGFRTFFNRNGAWDADAA 229 (305) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~kd~~G~~l~~~~-~l~G~pv~~~~~~~~~~~~~ 229 (305) .....+...+ +......++..++..|.+.|.-+ ...=.| T Consensus 141 ------------------pv~~~~~~tg--------~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP-------------- 180 (305) T protein:vir:19 141 ------------------PVYPNVDGTG--------SAVNTSNIVEQDSFSGLPFYLLDCSRAVKP-------------- 180 (305) T ss_pred ------------------CcccCCcccc--------cccchhhhhcCCCCCCceeeeeecCCccee-------------- Confidence 0000000000 00000123333444444432110 010112 Q ss_pred 
eEEEEehhhEEEEeecCcEEEEeecceeccCcceeeeeecCcEEEEEEEEEc--cEeec--------------------- Q lcl|Aclame:pro 230 IEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFA--YVLGV--------------------- 286 (305) Q Consensus 230 ~~~~gdf~~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~~--~~v~~--------------------- 286 (305) +++-.|+..++.-.++. +.-+.|.+++..+-+.+|.. |+... T Consensus 181 ---------~I~Q~Rk~~~~~~~~~~------~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~aar~aM 245 (305) T protein:vir:19 181 ---------LIFQERRKPELVARTRI------DDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLWKGWQLM 245 (305) T ss_pred ---------EEEecccccceeeccCC------CchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHHHHHHHH Confidence 34555555554332222 11123555555554444443 22110 Q ss_pred -----ccceEEEeccccccccCCC Q lcl|Aclame:pro 287 -----SATAQGANKTPVAVVAPAA 305 (305) Q Consensus 287 -----p~a~~~~~~t~~a~v~~a~ 305 (305) ..+ ..+...|.=.|-|.+ T Consensus 246 ~~qk~d~G-~pL~I~P~~LvVPp~ 268 (305) T protein:vir:19 246 RSFEGDGG-KKLGLKPTHIVVPVG 268 (305) T ss_pred HhhcCCCC-ceeeeecCeEEeCch Confidence 111 234444544455555 Done!