Query lcl|Aclame:protein:vir:9509|NCBI_annot:hypothetical protein|genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537
Match_columns 381
No_of_seqs 144 out of 598
Neff 8.6
Searched_HMMs 1612
Date Sat Nov 30 15:47:49 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_40 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_40_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:100632 Length: 381 100.0 2E-102 2E-105 577.7 39.3 381 1-381 1-381 (381)
2 protein:vir:9509 Length: 381 # 100.0 8E-101 5E-104 569.3 39.7 381 1-381 1-381 (381)
3 protein:vir:101291 Length: 381 100.0 8E-101 5E-104 569.3 39.7 381 1-381 1-381 (381)
4 protein:vir:78350 Length: 383 100.0 2.8E-93 1.7E-96 528.0 38.1 376 1-376 1-383 (383)
5 protein:vir:9643 Length: 377 # 100.0 1.4E-90 8.5E-94 513.3 39.3 368 1-368 1-377 (377)
6 protein:vir:98635 Length: 377 100.0 1.1E-86 6.8E-90 491.8 37.7 368 1-368 1-377 (377)
7 protein:vir:95963 Length: 395 100.0 1.1E-84 6.7E-88 480.9 37.7 377 1-381 1-389 (395)
8 protein:vir:4092 Length: 390 # 100.0 1.3E-74 8.1E-78 425.6 38.4 370 1-381 1-388 (390)
9 protein:vir:80128 Length: 466 100.0 3E-66 1.9E-69 379.8 32.6 377 1-381 21-461 (466)
10 protein:vir:95376 Length: 425 100.0 6.3E-65 3.9E-68 372.6 35.3 353 1-377 7-425 (425)
11 protein:vir:4456 Length: 401 # 100.0 3.4E-64 2.1E-67 368.5 34.8 353 1-368 1-401 (401)
12 protein:vir:100247 Length: 425 100.0 8.4E-63 5.2E-66 360.9 34.5 354 1-369 21-425 (425)
13 protein:vir:485 Length: 407 # 100.0 4.9E-62 3E-65 356.7 35.9 356 3-375 1-407 (407)
14 protein:vir:1328 Length: 392 # 100.0 8.7E-62 5.4E-65 355.3 31.2 346 1-369 1-392 (392)
15 protein:vir:6242 Length: 390 # 100.0 1.1E-60 6.9E-64 349.3 30.8 344 1-369 1-390 (390)
16 protein:vir:6212 Length: 434 # 100.0 2.8E-60 1.8E-63 347.0 32.3 346 1-373 1-434 (434)
17 protein:vir:4511 Length: 409 # 100.0 2.3E-58 1.4E-61 336.6 33.6 345 1-371 1-409 (409)
18 protein:vir:101650 Length: 497 100.0 2.7E-58 1.7E-61 336.2 33.3 362 3-377 1-497 (497)
19 protein:vir:7855 Length: 497 # 100.0 2.7E-58 1.7E-61 336.2 33.3 362 3-377 1-497 (497)
20 protein:vir:8102 Length: 543 # 100.0 1.3E-57 8.1E-61 332.4 31.4 345 1-369 143-543 (543)
21 protein:vir:105038 Length: 428 100.0 7.1E-57 4.4E-60 328.4 34.2 349 3-368 1-428 (428)
22 protein:vir:1268 Length: 397 # 100.0 1.2E-56 7.2E-60 327.2 34.7 329 1-368 1-397 (397)
23 protein:vir:1433 Length: 435 # 100.0 4.6E-57 2.9E-60 329.4 32.5 350 1-377 1-435 (435)
24 protein:vir:80376 Length: 435 100.0 1.4E-56 8.9E-60 326.7 33.5 350 1-370 1-435 (435)
25 protein:vir:78640 Length: 352 100.0 5E-57 3.1E-60 329.2 30.7 337 3-374 1-352 (352)
26 protein:vir:81160 Length: 371 100.0 3.1E-56 1.9E-59 324.9 34.1 327 1-368 1-371 (371)
27 protein:vir:97053 Length: 390 100.0 8.5E-56 5.3E-59 322.5 34.8 339 1-366 1-390 (390)
28 protein:vir:9361 Length: 402 # 100.0 8.3E-57 5.2E-60 328.0 28.9 339 1-373 16-402 (402)
29 protein:vir:10364 Length: 390 100.0 5.8E-56 3.6E-59 323.4 33.0 339 1-366 1-390 (390)
30 protein:vir:81070 Length: 390 100.0 6.2E-56 3.8E-59 323.3 32.9 339 1-366 1-390 (390)
31 protein:vir:93881 Length: 387 100.0 5.4E-56 3.4E-59 323.6 31.2 339 1-373 1-387 (387)
32 protein:vir:4953 Length: 397 # 100.0 2.1E-55 1.3E-58 320.3 34.2 339 3-380 1-397 (397)
33 protein:vir:94424 Length: 387 100.0 4.3E-56 2.6E-59 324.1 29.5 339 1-373 1-387 (387)
34 protein:vir:96978 Length: 387 100.0 4.3E-56 2.6E-59 324.1 29.5 339 1-373 1-387 (387)
35 protein:vir:2685 Length: 387 # 100.0 4.3E-56 2.6E-59 324.1 29.5 339 1-373 1-387 (387)
36 protein:vir:107593 Length: 392 100.0 3.8E-55 2.4E-58 318.9 34.7 336 1-376 1-392 (392)
37 protein:vir:102873 Length: 392 100.0 3.8E-55 2.4E-58 318.9 34.7 336 1-376 1-392 (392)
38 protein:vir:105004 Length: 392 100.0 3.8E-55 2.4E-58 318.9 34.7 336 1-376 1-392 (392)
39 protein:vir:102082 Length: 392 100.0 3.8E-55 2.4E-58 318.9 34.7 336 1-376 1-392 (392)
40 protein:vir:102119 Length: 404 100.0 4.8E-55 3E-58 318.4 34.5 346 1-372 1-404 (404)
41 protein:vir:1025 Length: 408 # 100.0 1E-54 6.4E-58 316.6 33.9 343 1-381 5-406 (408)
42 protein:vir:100135 Length: 418 100.0 6.6E-55 4.1E-58 317.6 32.5 345 1-376 16-418 (418)
43 protein:vir:4339 Length: 395 # 100.0 1.5E-54 9.4E-58 315.6 34.2 343 1-368 1-395 (395)
44 protein:vir:104256 Length: 458 100.0 2.2E-54 1.4E-57 314.8 34.2 348 1-370 24-458 (458)
45 protein:vir:7771 Length: 330 # 100.0 4.5E-56 2.8E-59 324.0 24.8 290 65-375 1-330 (330)
46 protein:vir:7409 Length: 408 # 100.0 3.5E-54 2.2E-57 313.6 34.3 343 1-381 5-406 (408)
47 protein:vir:81227 Length: 413 100.0 3.5E-54 2.2E-57 313.6 33.9 346 1-374 1-413 (413)
48 protein:vir:4830 Length: 397 # 100.0 4E-54 2.5E-57 313.4 33.9 339 3-380 1-397 (397)
49 protein:vir:4997 Length: 397 # 100.0 3.7E-54 2.3E-57 313.5 33.4 339 3-380 1-397 (397)
50 protein:vir:1886 Length: 385 # 100.0 5.2E-54 3.2E-57 312.7 34.1 340 3-369 1-385 (385)
51 protein:vir:191 Length: 385 # 100.0 5.2E-54 3.2E-57 312.7 34.1 340 3-369 1-385 (385)
52 protein:vir:4600 Length: 415 # 100.0 5E-53 3.1E-56 307.3 36.0 350 3-379 1-415 (415)
53 protein:vir:4700 Length: 415 # 100.0 5E-53 3.1E-56 307.3 36.0 350 3-379 1-415 (415)
54 protein:vir:79987 Length: 415 100.0 5.5E-53 3.4E-56 307.1 35.7 349 3-379 1-415 (415)
55 protein:vir:98339 Length: 415 100.0 5.5E-53 3.4E-56 307.1 35.7 349 3-379 1-415 (415)
56 protein:vir:81100 Length: 415 100.0 5.5E-53 3.4E-56 307.1 35.7 349 3-379 1-415 (415)
57 protein:vir:3845 Length: 395 # 100.0 3.5E-53 2.1E-56 308.2 34.2 341 1-381 1-394 (395)
58 protein:vir:4226 Length: 326 # 100.0 6.7E-55 4.2E-58 317.6 24.7 295 54-371 1-326 (326)
59 protein:vir:3991 Length: 404 # 100.0 2.6E-53 1.6E-56 308.8 33.3 341 1-379 1-404 (404)
60 protein:vir:9410 Length: 415 # 100.0 1E-52 6.4E-56 305.6 35.9 349 3-381 1-414 (415)
61 protein:vir:96762 Length: 632 100.0 2.4E-53 1.5E-56 309.1 29.0 343 1-369 244-632 (632)
62 protein:vir:41 Length: 299 # N 100.0 2.8E-54 1.7E-57 314.2 23.8 272 60-369 1-299 (299)
63 protein:vir:80684 Length: 315 100.0 1.6E-53 1E-56 310.0 27.3 280 76-378 1-315 (315)
64 protein:vir:97148 Length: 324 100.0 1.5E-53 9.3E-57 310.2 26.7 301 36-378 1-324 (324)
65 protein:vir:5739 Length: 366 # 100.0 5.2E-54 3.2E-57 312.7 23.9 336 1-368 1-366 (366)
66 protein:vir:104085 Length: 320 100.0 2.6E-53 1.6E-56 308.9 27.4 289 59-373 1-320 (320)
67 protein:vir:2430 Length: 318 # 100.0 3.9E-53 2.4E-56 307.9 27.9 287 59-376 1-318 (318)
68 protein:vir:3870 Length: 400 # 100.0 1E-51 6.5E-55 300.1 33.4 323 1-369 1-400 (400)
69 protein:vir:94673 Length: 419 100.0 1.4E-51 9E-55 299.3 32.9 348 1-370 1-419 (419)
70 protein:vir:2344 Length: 397 # 100.0 1.1E-52 6.9E-56 305.4 26.5 288 63-381 1-319 (397)
71 protein:vir:1383 Length: 421 # 100.0 3.7E-51 2.3E-54 297.0 34.1 341 1-381 1-402 (421)
72 protein:vir:105905 Length: 304 100.0 8.1E-53 5E-56 306.2 24.8 279 57-369 1-304 (304)
73 protein:vir:94142 Length: 304 100.0 8.1E-53 5E-56 306.2 24.8 279 57-369 1-304 (304)
74 protein:vir:2504 Length: 305 # 100.0 2E-52 1.3E-55 304.0 26.0 283 76-374 1-305 (305)
75 protein:vir:9704 Length: 394 # 100.0 3.5E-51 2.2E-54 297.2 32.4 328 1-375 1-394 (394)
76 protein:vir:8187 Length: 311 # 100.0 2.7E-52 1.7E-55 303.3 26.2 270 78-369 1-311 (311)
77 protein:vir:100884 Length: 389 100.0 6.2E-51 3.8E-54 295.9 33.1 332 3-375 1-389 (389)
78 protein:vir:78223 Length: 333 100.0 2.5E-52 1.6E-55 303.4 25.4 287 57-372 1-333 (333)
79 protein:vir:101607 Length: 379 100.0 1.3E-50 8.2E-54 294.0 33.4 332 1-368 1-379 (379)
80 protein:vir:78830 Length: 324 100.0 5.6E-52 3.5E-55 301.6 25.9 301 36-378 1-324 (324)
81 protein:vir:96392 Length: 324 100.0 5.6E-52 3.5E-55 301.6 25.9 301 36-378 1-324 (324)
82 protein:vir:100172 Length: 394 100.0 3.1E-51 1.9E-54 297.5 29.2 335 3-379 1-394 (394)
83 protein:vir:8420 Length: 477 # 100.0 6E-51 3.7E-54 295.9 30.4 354 1-376 8-477 (477)
84 protein:vir:99749 Length: 324 100.0 3.5E-51 2.1E-54 297.2 28.1 301 36-378 1-324 (324)
85 protein:vir:9309 Length: 324 # 100.0 2.4E-51 1.5E-54 298.1 26.9 301 36-378 1-324 (324)
86 protein:vir:4856 Length: 293 # 100.0 9.3E-52 5.7E-55 300.4 24.6 270 72-380 1-293 (293)
87 protein:vir:103955 Length: 324 100.0 4.9E-51 3.1E-54 296.4 28.0 301 36-378 1-324 (324)
88 protein:vir:95763 Length: 297 100.0 1E-51 6.3E-55 300.1 24.2 275 65-373 1-297 (297)
89 protein:vir:1084 Length: 437 # 100.0 1.5E-50 9.2E-54 293.8 30.4 337 1-378 48-437 (437)
90 protein:vir:93616 Length: 645 100.0 6.4E-50 4E-53 290.3 31.9 345 1-377 193-645 (645)
91 protein:vir:962 Length: 397 # 100.0 3.5E-50 2.2E-53 291.7 28.4 322 1-368 15-397 (397)
92 protein:vir:96223 Length: 324 100.0 3.3E-50 2E-53 291.9 27.9 301 36-378 1-324 (324)
93 protein:vir:9759 Length: 303 # 100.0 7E-50 4.3E-53 290.1 25.6 271 78-368 1-303 (303)
94 protein:vir:78523 Length: 338 100.0 7.1E-50 4.4E-53 290.0 25.2 288 64-372 1-338 (338)
95 protein:vir:9574 Length: 300 # 100.0 2.6E-49 1.6E-52 287.0 24.4 270 76-370 1-300 (300)
96 protein:vir:1638 Length: 298 # 100.0 4.9E-49 3.1E-52 285.4 25.4 268 80-367 1-298 (298)
97 protein:vir:99920 Length: 311 100.0 2.1E-48 1.3E-51 281.9 23.1 271 76-368 1-311 (311)
98 protein:vir:94771 Length: 298 100.0 1.9E-47 1.2E-50 276.7 25.1 265 80-367 1-298 (298)
99 protein:vir:4159 Length: 315 # 100.0 2.9E-45 1.8E-48 264.8 21.6 278 64-367 1-315 (315)
100 protein:vir:4197 Length: 314 # 100.0 9.9E-45 6.1E-48 261.8 22.5 279 66-371 1-314 (314)
101 protein:vir:3158 Length: 321 # 100.0 1.8E-38 1.1E-41 227.5 23.8 296 57-381 1-321 (321)
102 protein:vir:97397 Length: 517 100.0 5.1E-36 3.1E-39 214.1 26.4 340 1-376 136-517 (517)
103 protein:vir:4074 Length: 480 # 100.0 3.5E-31 2.2E-34 187.5 19.0 327 1-371 109-480 (480)
104 protein:vir:3033 Length: 272 # 99.9 4.2E-27 2.6E-30 165.2 20.5 256 76-371 1-272 (272)
105 protein:vir:9820 Length: 272 # 99.9 4.2E-27 2.6E-30 165.2 20.5 256 76-371 1-272 (272)
106 protein:vir:94933 Length: 330 99.7 1.8E-19 1.1E-22 123.3 17.5 287 61-376 1-330 (330)
107 protein:vir:3613 Length: 272 # 99.6 3.5E-17 2.2E-20 110.8 17.0 255 76-368 1-272 (272)
108 protein:vir:93742 Length: 274 99.6 9E-17 5.6E-20 108.6 18.7 260 76-376 1-274 (274)
109 protein:vir:80930 Length: 278 99.5 2.1E-15 1.3E-18 101.1 19.3 261 76-371 1-278 (278)
110 protein:vir:105334 Length: 276 99.5 4.9E-15 3E-18 99.0 18.3 260 76-377 1-276 (276)
111 protein:vir:99424 Length: 360 99.5 1E-14 6.5E-18 97.2 19.4 308 46-372 1-360 (360)
112 protein:vir:96123 Length: 274 99.5 6.7E-15 4.2E-18 98.3 17.9 258 76-374 1-274 (274)
113 protein:vir:96833 Length: 275 99.4 2.4E-14 1.5E-17 95.2 17.6 259 74-377 1-275 (275)
114 protein:vir:94494 Length: 274 99.4 5.6E-14 3.5E-17 93.2 19.0 258 76-376 1-274 (274)
115 protein:vir:97433 Length: 274 99.4 5.6E-14 3.5E-17 93.2 19.0 258 76-376 1-274 (274)
116 protein:vir:97255 Length: 310 99.3 4E-13 2.5E-16 88.5 19.6 272 62-379 1-310 (310)
117 protein:vir:1239 Length: 274 # 99.3 5.3E-13 3.3E-16 87.9 18.1 258 76-376 1-274 (274)
118 protein:vir:95898 Length: 274 99.3 1.7E-12 1E-15 85.2 18.3 258 76-380 1-274 (274)
119 protein:vir:96262 Length: 274 99.3 1.7E-12 1E-15 85.2 18.3 258 76-380 1-274 (274)
120 protein:vir:79928 Length: 393 99.2 7.7E-12 4.8E-15 81.5 21.0 349 1-381 1-392 (393)
121 protein:vir:95107 Length: 270 99.1 1.6E-11 9.6E-15 79.8 18.2 256 76-380 1-270 (270)
122 protein:vir:739 Length: 231 # 99.0 4E-11 2.5E-14 77.6 14.5 217 110-368 1-231 (231)
123 protein:vir:93858 Length: 400 98.9 1.7E-09 1.1E-12 68.6 20.8 340 1-366 1-400 (400)
124 protein:vir:80213 Length: 334 98.7 1.6E-09 9.8E-13 68.8 15.3 291 57-370 1-334 (334)
125 protein:vir:2201 Length: 345 # 98.7 1.6E-09 1E-12 68.8 14.3 287 57-368 1-345 (345)
126 protein:vir:102605 Length: 273 98.7 6.1E-09 3.8E-12 65.6 17.3 253 82-370 1-273 (273)
127 protein:vir:105822 Length: 273 98.7 6.1E-09 3.8E-12 65.6 17.3 253 82-370 1-273 (273)
128 protein:vir:7990 Length: 273 # 98.7 6.1E-09 3.8E-12 65.6 16.8 252 82-370 1-273 (273)
129 protein:vir:3364 Length: 347 # 98.6 7.1E-09 4.4E-12 65.3 15.7 291 57-370 1-347 (347)
130 protein:vir:94576 Length: 347 98.6 4.1E-09 2.5E-12 66.6 14.0 287 57-368 1-347 (347)
131 protein:vir:10450 Length: 344 98.6 4.4E-09 2.7E-12 66.4 13.4 287 57-368 1-344 (344)
132 protein:vir:95318 Length: 328 98.4 5.3E-08 3.3E-11 60.5 15.9 238 57-313 1-328 (328)
133 protein:vir:8324 Length: 410 # 98.4 3.8E-08 2.3E-11 61.3 15.0 330 1-370 29-410 (410)
134 protein:vir:1541 Length: 347 # 98.4 4.2E-08 2.6E-11 61.0 15.1 293 57-370 1-347 (347)
135 protein:vir:78739 Length: 332 98.4 2.5E-08 1.5E-11 62.3 12.6 275 70-366 1-332 (332)
136 protein:vir:7019 Length: 401 # 98.3 2.6E-08 1.6E-11 62.2 12.4 295 59-381 1-349 (401)
137 protein:vir:8885 Length: 347 # 98.3 4.8E-08 2.9E-11 60.7 13.2 287 57-369 1-347 (347)
138 protein:vir:80180 Length: 381 98.3 2.3E-07 1.4E-10 57.0 16.3 293 57-381 1-317 (381)
139 protein:vir:108211 Length: 318 98.2 1.7E-07 1E-10 57.7 14.5 286 65-372 1-318 (318)
140 protein:vir:103759 Length: 330 98.2 2.2E-07 1.4E-10 57.1 14.9 241 57-313 1-330 (330)
141 protein:vir:9927 Length: 295 # 98.2 3.8E-07 2.4E-10 55.8 16.2 260 76-376 1-295 (295)
142 protein:vir:6324 Length: 335 # 98.2 1.7E-07 1.1E-10 57.7 14.0 296 59-378 1-335 (335)
143 protein:vir:78935 Length: 335 98.2 3.1E-07 1.9E-10 56.3 15.0 296 59-378 1-335 (335)
144 protein:vir:100057 Length: 375 98.1 5.5E-07 3.4E-10 54.9 15.2 297 57-375 1-375 (375)
145 protein:vir:94711 Length: 347 98.1 1.1E-07 6.7E-11 58.8 11.3 288 57-369 1-347 (347)
146 protein:vir:103323 Length: 364 98.1 1.4E-06 8.5E-10 52.7 17.2 295 59-381 1-349 (364)
147 protein:vir:97031 Length: 402 98.1 3.5E-07 2.2E-10 56.0 13.1 294 59-381 1-345 (402)
148 protein:vir:107826 Length: 331 98.0 1.1E-06 6.7E-10 53.3 14.2 238 57-313 1-331 (331)
149 protein:vir:98525 Length: 331 98.0 1.1E-06 6.7E-10 53.3 14.2 238 57-313 1-331 (331)
150 protein:vir:107388 Length: 331 98.0 1.1E-06 6.7E-10 53.3 14.2 238 57-313 1-331 (331)
151 protein:vir:94622 Length: 341 97.9 9.9E-07 6.1E-10 53.5 13.7 279 59-372 1-341 (341)
152 protein:vir:7324 Length: 335 # 97.8 2.3E-06 1.4E-09 51.5 13.8 239 57-314 1-335 (335)
153 protein:vir:5974 Length: 324 # 97.8 1.4E-05 8.4E-09 47.3 17.9 270 76-381 1-297 (324)
154 protein:vir:105645 Length: 400 97.8 9.7E-07 6E-10 53.5 11.0 299 59-381 1-346 (400)
155 protein:vir:99675 Length: 324 97.7 3.7E-06 2.3E-09 50.4 12.9 258 109-381 1-309 (324)
156 protein:vir:102944 Length: 330 97.7 1.3E-05 8.1E-09 47.4 16.0 280 76-381 1-303 (330)
157 protein:vir:106647 Length: 303 97.6 8.2E-06 5.1E-09 48.5 13.7 265 65-373 1-303 (303)
158 protein:vir:3136 Length: 322 # 97.5 5.2E-06 3.2E-09 49.6 11.8 280 76-375 1-322 (322)
159 protein:vir:107687 Length: 319 97.5 5.4E-05 3.3E-08 44.0 17.2 296 44-370 1-319 (319)
160 protein:vir:9875 Length: 296 # 97.5 1.6E-06 1E-09 52.3 8.7 267 61-374 1-296 (296)
161 protein:vir:1583 Length: 351 # 97.4 5.5E-05 3.4E-08 43.9 15.8 277 76-381 1-301 (351)
162 protein:vir:103285 Length: 296 97.3 0.00011 6.7E-08 42.3 17.1 273 76-373 1-296 (296)
163 protein:vir:80068 Length: 301 97.0 0.00021 1.3E-07 40.8 20.6 277 78-370 1-301 (301)
164 protein:vir:102655 Length: 322 96.8 0.00032 2E-07 39.8 15.0 278 71-369 1-322 (322)
165 protein:vir:99075 Length: 392 95.4 0.0022 1.4E-06 35.1 15.1 268 82-381 1-328 (392)
166 protein:vir:1829 Length: 355 # 95.1 0.0028 1.7E-06 34.6 18.3 300 65-378 1-355 (355)
167 protein:vir:79642 Length: 329 94.7 0.0036 2.2E-06 34.0 18.6 305 40-369 1-329 (329)
168 protein:vir:8843 Length: 317 # 94.6 0.004 2.5E-06 33.7 14.3 281 78-370 1-317 (317)
169 protein:vir:104342 Length: 314 94.3 0.0047 2.9E-06 33.3 18.2 291 48-373 1-314 (314)
170 protein:vir:1663 Length: 393 # 94.1 0.0053 3.3E-06 33.1 14.3 337 1-366 1-393 (393)
171 protein:vir:98566 Length: 355 94.1 0.0055 3.4E-06 33.0 17.3 300 65-378 1-355 (355)
172 protein:vir:93966 Length: 400 93.0 0.0092 5.7E-06 31.8 14.1 339 1-366 1-400 (400)
173 protein:vir:5694 Length: 357 # 92.4 0.011 7.1E-06 31.2 17.0 298 65-381 1-356 (357)
174 protein:vir:103463 Length: 521 91.6 0.015 9.2E-06 30.6 16.2 352 1-381 1-510 (521)
175 protein:vir:2016 Length: 357 # 89.8 0.024 1.5E-05 29.4 17.0 298 65-381 1-356 (357)
176 protein:vir:6061 Length: 357 # 89.6 0.026 1.6E-05 29.3 16.6 299 65-377 1-357 (357)
177 protein:vir:1153 Length: 338 # 88.3 0.033 2.1E-05 28.7 17.5 288 65-369 1-338 (338)
178 protein:vir:79171 Length: 337 88.1 0.035 2.1E-05 28.6 16.2 288 65-370 1-337 (337)
179 protein:vir:104011 Length: 337 87.5 0.039 2.4E-05 28.3 17.1 288 65-370 1-337 (337)
180 protein:vir:3746 Length: 336 # 86.8 0.043 2.7E-05 28.1 16.0 286 65-376 1-336 (336)
181 protein:vir:98856 Length: 343 86.7 0.044 2.7E-05 28.0 15.9 292 65-379 1-343 (343)
182 protein:vir:3783 Length: 336 # 86.2 0.047 2.9E-05 27.8 16.0 286 65-376 1-336 (336)
183 protein:vir:270 Length: 341 # 85.8 0.05 3.1E-05 27.7 16.3 300 61-377 1-341 (341)
184 protein:vir:78777 Length: 358 85.5 0.053 3.3E-05 27.6 18.9 303 61-381 1-353 (358)
185 protein:vir:861 Length: 318 # 84.7 0.058 3.6E-05 27.4 11.4 297 33-366 1-318 (318)
186 protein:vir:79548 Length: 652 84.0 0.064 4E-05 27.1 16.8 341 1-365 224-652 (652)
187 protein:vir:78186 Length: 337 82.4 0.077 4.8E-05 26.7 16.1 288 65-370 1-337 (337)
188 protein:vir:1781 Length: 221 # 81.3 0.087 5.4E-05 26.4 10.4 189 158-381 1-215 (221)
189 protein:vir:100331 Length: 342 81.1 0.089 5.5E-05 26.4 16.4 291 65-379 1-342 (342)
190 protein:vir:7214 Length: 521 # 80.9 0.09 5.6E-05 26.3 16.5 354 1-381 1-510 (521)
191 protein:vir:79157 Length: 339 78.3 0.12 7.2E-05 25.7 17.9 291 65-373 1-339 (339)
192 protein:vir:80986 Length: 528 74.5 0.16 9.8E-05 25.0 18.3 354 3-381 1-509 (528)
193 protein:vir:95131 Length: 325 70.8 0.2 0.00013 24.4 14.0 268 65-381 1-300 (325)
194 protein:vir:100603 Length: 529 68.7 0.23 0.00014 24.0 16.7 347 1-381 1-513 (529)
195 protein:vir:6901 Length: 522 # 65.1 0.29 0.00018 23.5 15.2 346 1-381 1-511 (522)
196 protein:vir:80446 Length: 367 65.0 0.29 0.00018 23.5 13.5 287 69-381 1-342 (367)
197 protein:vir:94870 Length: 318 60.3 0.37 0.00023 22.9 11.1 301 33-366 1-318 (318)
198 protein:vir:95512 Length: 693 53.1 0.54 0.00033 22.1 18.0 335 1-370 301-693 (693)
199 protein:vir:5255 Length:
304 # 52.4 0.56 0.00034 22.0 15.5 273 81-365 1-304 (304) 200 protein:vir:6601 Length: 528 # 50.9 0.6 0.00037 21.8 19.8 344 3-368 1-528 (528) 201 protein:vir:96792 Length: 315 49.5 0.64 0.0004 21.7 11.3 262 76-381 1-287 (315) 202 protein:vir:108303 Length: 418 44.3 0.81 0.0005 21.1 16.0 262 79-381 1-303 (418) 203 protein:vir:3525 Length: 423 # 41.8 0.91 0.00056 20.8 14.7 268 76-381 1-311 (423) 204 protein:vir:101039 Length: 529 38.9 1 0.00065 20.5 14.7 346 1-381 1-513 (529) 205 protein:vir:98143 Length: 524 37.5 1.1 0.00069 20.3 17.0 354 3-381 1-507 (524) 206 protein:vir:5942 Length: 523 # 34.5 1.3 0.0008 20.0 15.2 349 1-370 1-523 (523) 207 protein:vir:3643 Length: 336 # 30.9 1.5 0.00096 19.6 14.6 302 48-367 1-336 (336) 208 protein:vir:105522 Length: 423 29.5 1.7 0.001 19.4 16.7 263 82-381 1-311 (423) 209 protein:vir:78558 Length: 336 29.1 1.7 0.001 19.3 15.6 301 48-367 1-336 (336) 210 protein:vir:104915 Length: 470 28.9 1.7 0.0011 19.3 16.1 344 1-381 1-460 (470) 211 protein:vir:103886 Length: 302 27.1 1.9 0.0012 19.1 18.7 281 63-374 1-302 (302) 212 protein:vir:94989 Length: 349 25.9 2 0.0012 18.9 15.2 286 76-381 1-323 (349) 213 protein:vir:78387 Length: 349 24.5 2.2 0.0013 18.7 16.7 285 76-381 1-323 (349) 214 protein:vir:348 Length: 321 # 21.7 2.6 0.0016 18.3 14.5 283 63-368 1-321 (321) No 1 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=2.4e-102 Score=577.68 Aligned_cols=381 Identities=98% Similarity=1.367 Sum_probs=365.0 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhccc Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) 
|+||+.++++++++++.+++++.+.++++.+.+++..+.+.++.....+.+++++.+..++.+.++.+|+++++++++++ T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t 80 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINKSV 80 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhhcC Confidence 99999999999999999999888777778888888888888888888888899999999999999999999999999999 Q ss_pred CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceeeeee Q lcl|Aclame:pro 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) Q Consensus 81 ~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~ 160 (381) +++|||+||+++.++|++.|++.||||++|+++++++..++|+.++.+.++|++|.++++++++|+|+++++.+||++++ T Consensus 81 ~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~ 160 (381) T protein:vir:10 81 GYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) T ss_pred CCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCcceEEEeecCCcceEEeecccccccccCccceeEeecceeEEee Confidence 99999999999999999999999999999999999999999999999999999998888888899999999999999999 Q ss_pred hhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHH Q lcl|Aclame:pro 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 161 ~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) ++||++||+||.+||++||+++++++|+++++.+|++|||++||+||++++.+..+++.++.+++++.+++++.++..++ T Consensus 161 i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~ 240 (381) T protein:vir:10 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATV 240 (381) T ss_pred ccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHH Confidence 
99999999999999999999999999999999999999999999999999888888888888888888899999999999 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCCccEEEEeccceE Q lcl|Aclame:pro 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~ 320 (381) +.+.++++.++....+..+.|+++++|+|||.|++.++++++.++++|+|+|.+|+|++|+++++||+++|+|||||+|+ T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp~g~~vv~~~~~p~~~i~fGDfs~Y~ 320 (381) T protein:vir:10 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) T ss_pred HHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCCCCceeEEcCCCCcCcEEEEEcccEE Confidence 99999999998888888889999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) |++|++++|++|+|.+|.+|+++||+++|+||+|+|++||+|++|++++++|++|.+++|| T Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred EEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=8.1e-101 Score=569.31 Aligned_cols=381 Identities=100% Similarity=1.381 Sum_probs=365.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhccc Q lcl|Aclame:pro 1 
MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) |+||+.++++++++++.+++++.+.++++.+.+.+.++.+.++...+++.+++++....++++.++++|+++++++.+++ T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhccc Confidence 99999999999999999999888777778888888888888888888888999999999999999999999999999999 Q ss_pred CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceeeeee Q lcl|Aclame:pro 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) Q Consensus 81 ~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~ 160 (381) +++|||+||+++.++|++.|++.+|||++|+++++++..+||+.++.+.|.|++|.++++++++|+|+++++.+|||+++ T Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~ 160 (381) T protein:vir:95 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) T ss_pred CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeEEee Confidence 99999999999999999999999999999999999999999999999999999998888878899999999999999999 Q ss_pred hhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHH Q lcl|Aclame:pro 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 161 ~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) ++||++||+||.+||++||+++++++|+++++.+|++|+|++||+||++++....++++++.+++.+.+++++.++..+. 
T Consensus 161 ~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~ 240 (381) T protein:vir:95 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) T ss_pred chhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhH Confidence 99999999999999999999999999999999999999999999999999988888999998888888999999999999 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCCccEEEEeccceE Q lcl|Aclame:pro 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~ 320 (381) +.+.++++.++....+....|+++++|+|||.|++.++++++.++++|+|+|.+|+|++|++|++||+++|+|||||+|+ T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~ 320 (381) T protein:vir:95 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) T ss_pred HHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCCCCceEEecCCCCcCcEEEEecccEE Confidence 99999999999888888889999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) |++|++++|++|+|.+|.+|+++||+++|+||+|+|++||+|++|++.+.+|+++++.+|| T Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred EEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: 
genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=8.1e-101 Score=569.31 Aligned_cols=381 Identities=100% Similarity=1.381 Sum_probs=365.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhccc Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) |+||+.++++++++++.+++++.+.++++.+.+.+.++.+.++...+++.+++++....++++.++++|+++++++.+++ T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhccc Confidence 99999999999999999999888777778888888888888888888888999999999999999999999999999999 Q ss_pred CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceeeeee Q lcl|Aclame:pro 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) Q Consensus 81 ~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~ 160 (381) +++|||+||+++.++|++.|++.+|||++|+++++++..+||+.++.+.|.|++|.++++++++|+|+++++.+|||+++ T Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~ 160 (381) T protein:vir:10 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) T ss_pred CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeEEee Confidence 99999999999999999999999999999999999999999999999999999998888878899999999999999999 Q ss_pred hhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHH Q lcl|Aclame:pro 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 161 
~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) ++||++||+||.+||++||+++++++|+++++.+|++|+|++||+||++++....++++++.+++.+.+++++.++..+. T Consensus 161 ~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~ 240 (381) T protein:vir:10 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) T ss_pred chhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhH Confidence 99999999999999999999999999999999999999999999999999988888999998888888999999999999 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCCccEEEEeccceE Q lcl|Aclame:pro 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~ 320 (381) +.+.++++.++....+....|+++++|+|||.|++.++++++.++++|+|+|.+|+|++|++|++||+++|+|||||+|+ T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~ 320 (381) T protein:vir:10 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) T ss_pred HHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCCCCceEEecCCCCcCcEEEEecccEE Confidence 99999999999888888889999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) |++|++++|++|+|.+|.+|+++||+++|+||+|+|++||+|++|++.+.+|+++++.+|| T Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred EEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 
9999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=2.8e-93 Score=527.98 Aligned_cols=376 Identities=57% Similarity=0.956 Sum_probs=343.2 Q ss_pred CCccHHHHHH---HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhhhhhccccHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFA---NAKNEFINAVNNGEPQERQNELYGDMINQLFEET----KLQAKAEAERVSSLPKSAQSLSANQRSFF 73 (381) Q Consensus 1 m~~~l~~~~~---e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~lt~~e~~~~ 73 (381) |+|+|+++.+ |+++++.+.+++.+.++++.+.+.++.+.+.... +.+.+...++.....++.+.++.+|++++ T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~ 80 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQEIQNKAYVEMVDAMAADIMEQAKKEARQEADAYISASRTDKNITNEEIKFF 80 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHH Confidence 9999988874 6777777777777767777777766665544433 34444455666777888899999999999 Q ss_pred HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceecc Q lcl|Aclame:pro 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAI 153 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~ 153 (381) +++.++++++|||+||++++++|++.|+++||||++|++++++|..+||+.++.+.|.|++|.++++++++|+|++++|. 
T Consensus 81 ~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~ 160 (383) T protein:vir:78 81 NDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLRTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESI 160 (383) T ss_pred HHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCceEEEEEcCCcceEEeecccccccccCcceeeEeec Confidence 99999999999999999999999999999999999999999999999999999999999999888887889999999999 Q ss_pred ceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccc Q lcl|Aclame:pro 154 QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF 233 (381) Q Consensus 154 ~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~ 233 (381) +|||+++++||++||+||.+|+++||+++++++|++++|.+|++|+|++||+||++++....+++++..++.+..+++++ T Consensus 161 ~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (383) T protein:vir:78 161 QNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTF 240 (383) T ss_pred ceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhh Confidence 99999999999999999999999999999999999999999999999999999999988888888888888888888899 Q ss_pred cChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCCccEEE Q lcl|Aclame:pro 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLT 313 (381) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~~~i~~ 313 (381) +++..+.+.+..+.+..+...++....+.++++|+|||.+++.+++.++.++++|+|++.||+|++|+++++||+++|+| T Consensus 241 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~~iif 320 (383) T protein:vir:78 241 ANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEKKAIS 320 (383) T ss_pred hhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCceeeecCCCceEEecCCCCcccEEE Confidence 99999999998888777777777778888999999999999999998888999999999999999999999999999999 Q ss_pred 
EeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 314 YVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 314 gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |||++|+|++|++++|++|+|.+|.+|+++||+++|+||+|+|++||+|++|++.+..+.|+| T Consensus 321 gdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 321 YVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred eeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.4e-90 Score=513.25 Aligned_cols=368 Identities=39% Similarity=0.645 Sum_probs=333.4 Q ss_pred CCccHHH--HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHh- Q lcl|Aclame:pro 1 MTINLSE--TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDIN- 77 (381) Q Consensus 1 m~~~l~~--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~- 77 (381) |||++++ ++.++++++.+++++.+.++++.++++++.+.+..+...+.+.++++.+...+..+.++++|+++++++. 
T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 8887654 7788999999998888777888888888888888887778888888888888888999999999998764 Q ss_pred cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceee Q lcl|Aclame:pro 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl 157 (381) .+++++|||+||+++.++|++.+++.||||++|+++++++..+||+.++.+.|.|++|.++++++++|+|++++|.+||+ T Consensus 81 ~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl 160 (377) T protein:vir:96 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) T ss_pred cCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceeEeecccccccccCccceeEeeeeeeE Confidence 46677899999999999999999999999999999999999999999999999999998888878899999999999999 Q ss_pred eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccc------cccccccchhhhc Q lcl|Aclame:pro 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVT------EGAYPEKEEQGTL 231 (381) Q Consensus 158 ~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~------~~~~~~~~~~~~~ 231 (381) +++++||++||+||.+|+++||+++++++|+++++.+|++|+|++||+||++++....... 
.+.+++....+.+ T Consensus 161 ~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (377) T protein:vir:96 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) T ss_pred EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeecccccccc Confidence 9999999999999999999999999999999999999999999999999998765443322 2234445566677 Q ss_pred cccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCCccE Q lcl|Aclame:pro 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) Q Consensus 232 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~~~i 311 (381) +..++.++++.+.++.+.++.+..+.+..+.++++|+|||.|++.+++.+..++++|+|++.|++|++|++|++||+++| T Consensus 241 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~p~~v~~s~~~p~~~i 320 (377) T protein:vir:96 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) T ss_pred ccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccCCCCCceeccCCCceEEecCCCCcccE Confidence 88899999999999998888887777888899999999999999988888888999999999999999999999999999 Q ss_pred EEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 312 ~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) +||||++|+|++|++++|++|+|++|.+|+++||+++|+||+|+|++||+|++|.+- T Consensus 321 ~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999999999999999999999999999999999999999999999999999985 No 6 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: 
genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.1e-86 Score=491.84 Aligned_cols=368 Identities=39% Similarity=0.646 Sum_probs=321.1 Q ss_pred CCccHHH--HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHH-h Q lcl|Aclame:pro 1 MTINLSE--TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDI-N 77 (381) Q Consensus 1 m~~~l~~--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~-~ 77 (381) |+|++++ +++++++++.+++++....+++.+.++++.+.+.++...+.+.+++++....+..+.++++|+++++++ . T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGATSEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 8886555 778888888888888777778888888888888888888888888888988889999999999999865 5 Q ss_pred cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceee Q lcl|Aclame:pro 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl 157 (381) .+++++|||+||+++.++|++.|++.+|||++|++++++|..++|+.++.+.+.|++|.++++++++|+|+++++.+||+ T Consensus 81 ~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl 160 (377) T protein:vir:98 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) T ss_pred ccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcceEEEEecCCcceeEeecccccCcccCccceeEeecceeE Confidence 57788999999999999999999999999999999999999999999999999999998888878899999999999999 Q ss_pred eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccc------cccccccchhhhc Q lcl|Aclame:pro 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVT------EGAYPEKEEQGTL 231 (381) Q Consensus 158 
~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~------~~~~~~~~~~~~~ 231 (381) +++++||++||+||.+|+++||+++++++|+++++.+|++|+|++||+||++.+...+... .+.+++......+ T Consensus 161 ~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) T protein:vir:98 161 TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) T ss_pred EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhh Confidence 9999999999999999999999999999999999999999999999999998765433322 1222233334444 Q ss_pred cccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCCccE Q lcl|Aclame:pro 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) Q Consensus 232 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~~~i 311 (381) .+.....+......++..++.....+.....++.+|+|||.+++.+++.++.++++|+|++.||+|++|++|++||+++| T Consensus 241 ~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i 320 (377) T protein:vir:98 241 SDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) T ss_pred hhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccE Confidence 44444445555555555555555555667888999999999999999988889999999999999999999999999999 Q ss_pred EEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 312 ~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) +||||++|+|++|++++|++|+|++|.+|+++||+++|+||+|++++||++++|..- T Consensus 321 ~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 999999999999999999999999999999999999999999999999999988884 No 7 >protein:vir:95963 Length: 
395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=1.1e-84 Score=480.91 Aligned_cols=377 Identities=45% Similarity=0.767 Sum_probs=319.1 Q ss_pred CCccHH-----HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhhhhccccHHHH Q lcl|Aclame:pro 1 MTINLS-----ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAE-----RVSSLPKSAQSLSANQR 70 (381) Q Consensus 1 m~~~l~-----~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~lt~~e~ 70 (381) ||.... ++++|+++++.+++++....+++.+++.++++.+..+..++...+.+ +.....++.+.++.+|+ T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~ 80 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASDEEQSKAFGAMFDALSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEER 80 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHH Confidence 776332 22234445555555666666666777776666655444433333322 33445567888999999 Q ss_pred HHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 71 SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 71 ~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) ++++++.++++++|||+||++++++|++.+++.+|||++|++++++|..++|+.++.+.++|++|.++++++++|+|+++ T Consensus 81 ~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 160 (395) T protein:vir:95 81 KFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREE 160 (395) T ss_pred HHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccCccccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999998888877889999999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC--cceeeeeccccccccccccccccchh Q lcl|Aclame:pro 151 
TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD--QPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~--qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ++.+|+++++++||++||+|+.+|+++||+++|+++|++++|.+|++|+|++ ||+||++++..... ........ T Consensus 161 ~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~----~~~~~~~~ 236 (395) T protein:vir:95 161 NFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSG----AVTDKASS 236 (395) T ss_pred eeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeeccccccc----cccccccc Confidence 9999999999999999999999999999999999999999999999999985 79999987644322 12223344 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeeccCCCceEEecCCCCC Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEA 308 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l~~g~~vi~s~~~p~ 308 (381) ++++..+...+...+.+++..++...++....+.+++.|+|||.|++.++..+..++++|+|++.|++|+||+++++||+ T Consensus 237 ~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~ 316 (395) T protein:vir:95 237 GTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLTANGGFVTVLPYNVTIITSEFVPE 316 (395) T ss_pred chhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceeccCCCcceeccCCcceEEEcCCCCC Confidence 55566667777777888888888777778888999999999999999888877778899999999999999999999999 Q ss_pred ccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 309 GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 309 ~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) ++|+||||++|+|++|++++|++++|.+|.+|+++||+++|+||+|+|++||+|++|++....+.+...+-|- T Consensus 317 ~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~ 389 (395) T protein:vir:95 317 
GKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTT 389 (395) T ss_pred CcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCC Confidence 9999999999999999999999999999999999999999999999999999999999998888777776666 No 8 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.3e-74 Score=425.65 Aligned_cols=370 Identities=27% Similarity=0.428 Sum_probs=289.0 Q ss_pred CCc--cHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhhhhhccccHHHHHHH Q lcl|Aclame:pro 1 MTI--NLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEA-----ERVSSLPKSAQSLSANQRSFF 73 (381) Q Consensus 1 m~~--~l~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~lt~~e~~~~ 73 (381) |.= |++++..+.++++.++++.....+++.++++++.+.+..+...+.+.+. +......++.+.++.++|+++ T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~ 80 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYY 80 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHH Confidence 221 3444555666677777777666667777777766666555444443322 233345567788999999998 Q ss_pred HHHh-cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eEEEEecCCcceEEeccccccccccccccccee Q lcl|Aclame:pro 74 MDIN-KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEET 151 (381) Q Consensus 74 ~~~~-~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~-~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~ 151 (381) +++. .+++++||++||++++++|++.+++.++|+++|+++++++. 
..+|+.++.+.+.|++|+++++++++++|++++ T Consensus 81 ~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~ 160 (390) T protein:vir:40 81 NEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQ 160 (390) T ss_pred HHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeE Confidence 7654 45677999999999999999999999999999999998764 779999999999999998888777899999999 Q ss_pred ccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhc Q lcl|Aclame:pro 152 AIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTL 231 (381) Q Consensus 152 l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~ 231 (381) |.+|+++++++||+|||+||.+++++||+++|+++|++++|.+|++|+|+++|.||++.+........ .. .....+ T Consensus 161 l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~---~~-~~~~~~ 236 (390) T protein:vir:40 161 TGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEH---PV-KTATPL 236 (390) T ss_pred eeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccccccc---cc-cccccc Confidence 99999999999999999999999999999999999999999999999999999999986643322211 11 122334 Q ss_pred cccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHH-hhhhccCCCCceeec-cCCCceEEecCCCCCc Q lcl|Aclame:pro 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ-AQYTHLNANGVYVTA-LPFNLNVIESTVQEAG 309 (381) Q Consensus 232 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~-~~~~~~~~~G~~~~~-l~~g~~vi~s~~~p~~ 309 (381) +..+...+...+...+. .....+.++++|+|||+|++... .+...++++|+|+|. 
+++|+||+++++||++ T Consensus 237 t~~~~~~~~~~l~~~~~-------~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~~~g~pvv~~~~~p~~ 309 (390) T protein:vir:40 237 TDLTPATLATKVMLPLT-------DNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGILPVPLEIVQSVAVPVG 309 (390) T ss_pred chhhHHHHHHHHHHHhh-------cchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccccccCCCceeEEEcCCCCCC Confidence 44554444443333221 11223557899999999976543 344567889999986 4589999999999999 Q ss_pred cEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccccc--CCCCC-----CCCC Q lcl|Aclame:pro 310 KVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP--ALEGT-----EETL 381 (381) Q Consensus 310 ~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~--~~~~~-----~~~~ 381 (381) +++||||++|++++|++++|++++|.+|.+|+++||+++|+||++++++||+++.++-.+.++ .|++. ++|- T Consensus 310 ~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 388 (390) T protein:vir:40 310 KAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNNATPSETP 388 (390) T ss_pred cEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCcceeeCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999999998886542 23333 2222 No 9 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=3e-66 Score=379.78 Aligned_cols=377 Identities=12% Similarity=0.136 Sum_probs=251.0 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHH------HHHHHH--------------HHHHHHHHHHHHHHHH----------H Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQE------RQNELY--------------GDMINQLFEETKLQAK----------A 50 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~------~~~~~~--------------~~~~~~~~~~~~~~~~----------~ 50 (381) --++.++++..+.+++..++++....+ ++.+.+ ++.++.+..+...... . 
T Consensus 21 el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e~~~~~~~~~~~~~~ 100 (466) T protein:vir:80 21 ELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQLNNKEPKNNSEPAQ 100 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCchhHH Confidence 001222222222222222221110000 000001 1111111110000000 0 Q ss_pred H---HH----------HHH-----HhhhhhccccHHHHHHHHH----Hhcc-cCCCCceEccHHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 51 E---AE----------RVS-----SLPKSAQSLSANQRSFFMD----INKN-VNYKEEKLLPEETIDRIFEDLTTNHPLL 107 (381) Q Consensus 51 ~---~~----------~~~-----~~~~~~~~lt~~e~~~~~~----~~~~-~~~~gg~lvP~~~~~~Ii~~l~~~~~l~ 107 (381) . .. +.. ...+..+.+..+++.++.. ..+. +.++|+++||+++.++|++.+++++||+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~ 180 (466) T protein:vir:80 101 VSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLI 180 (466) T ss_pred HHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhh Confidence 0 00 000 0001111222233333322 2222 3445678999999999999999999999 Q ss_pred hhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHH Q lcl|Aclame:pro 108 ADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAF 187 (381) Q Consensus 108 ~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~ 187 (381) ++|++.+++|..++|+..+.+.+.|++|+++++ +++|+|++|++.+|+++++++||++||+||.+|+++||+++|+++| T Consensus 181 ~~~~v~~~~g~~~~~~~~~~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~ 259 (466) T protein:vir:80 181 SKVRLRPLKGTARQNIAGAIPEGVWTEAVANLN-ELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAI 259 (466) T ss_pred hheeeeecCceeEeeeecCCcceeecccccccc-cccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999999999999998889999999988775 5689999999999999999999999999999999999999999999 Q ss_pred 
HHHHhhheeeccCCCcceeeeeccccccccccccccc--cchhhhccccChhH----HHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 188 AVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPE--KEEQGTLTFANPRA----TVNELTQVFKYHSTNEKGKSVAV 261 (381) Q Consensus 188 a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~--~~~~~~~t~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~ 261 (381) ++++|.+||+|+|+++|+|||+.+.....+....... .............. ....+.++...+. ....... T Consensus 260 ~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 336 (466) T protein:vir:80 260 GFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLS---KARANYS 336 (466) T ss_pred HHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHH---hhhcccc Confidence 9999999999999999999998765443332211110 00000000000000 0111222211111 1123345 Q ss_pred cCceEEEEchhhHHHHHhhhhccCCCCceeecc-----CCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhh Q lcl|Aclame:pro 262 KGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-----PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETL 336 (381) Q Consensus 262 ~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l-----~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~ 336 (381) +++.+|+||+.++..++......+++|.|++.. 
.+|+||+++++||+++++||||++|+|++|++++|.+|++.+ T Consensus 337 ~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~ 416 (466) T protein:vir:80 337 NGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVR 416 (466) T ss_pred CCceeEEecchhHHHhhcccccccCCccccccCCCcccccccceeecCccCccceeeeccccEEEEeecceEEEechhhh Confidence 667789999999888887776667888887643 469999999999999999999999999999999999999999 Q ss_pred hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) |.+|+++||+++|+||+|++++||+++++...++..+++..+++- T Consensus 417 f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~~~~~ 461 (466) T protein:vir:80 417 FIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFAPDEA 461 (466) T ss_pred hhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeeecCcC Confidence 999999999999999999999999998877776666666666665 No 10 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=6.3e-65 Score=372.56 Aligned_cols=353 Identities=18% Similarity=0.166 Sum_probs=240.8 Q ss_pred CCccHHHHHHHH--------------HHHHHHHHhhhhHHHHHH------HHHHHHHHHHHH---HHHHHH---HHHHHH Q lcl|Aclame:pro 1 MTINLSETFANA--------------KNEFINAVNNGEPQERQN------ELYGDMINQLFE---ETKLQA---KAEAER 54 (381) Q Consensus 1 m~~~l~~~~~e~--------------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~---~~~~~~---~~~~~~ 54 (381) |--+..++...+ .+++..+++....+++.. +.++...+.+.+ ....+. 
..+.++ T Consensus 7 ~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~~~l~~ 86 (425) T protein:vir:95 7 MLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELEQ 86 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 211111111111 111111111111000000 000000000000 000000 000000 Q ss_pred -----------------------------HHHhhhhhccccHHHHHHHHHHh-cccCCCCceEccHHHHHHHHHHHHhhh Q lcl|Aclame:pro 55 -----------------------------VSSLPKSAQSLSANQRSFFMDIN-KNVNYKEEKLLPEETIDRIFEDLTTNH 104 (381) Q Consensus 55 -----------------------------~~~~~~~~~~lt~~e~~~~~~~~-~~~~~~gg~lvP~~~~~~Ii~~l~~~~ 104 (381) ......+....+.+.+++.+.+. .++.++||++||+++.++|++.+++.+ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~ 166 (425) T protein:vir:95 87 INSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYT 166 (425) T ss_pred hhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhh Confidence 00000011112223333433332 345668999999999999999999999 Q ss_pred hhhhhceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHH Q lcl|Aclame:pro 105 PLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIE 184 (381) Q Consensus 105 ~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la 184 (381) +|+++|++++++|+.+||+..+.+.+.|++|+++.+....++|++|++.+|+++++++||+|||+||.++|++||+++++ T Consensus 167 ~i~~~~~~~~~~g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 246 (425) T protein:vir:95 167 TLYPLVDKIRVKGTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIA 246 (425) T ss_pred hHHHhhceeecCceeEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHH Confidence 99999999999999999999999999999999887766668999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhheeeccCC--CcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccccccc Q 
lcl|Aclame:pro 185 EAFAVALETAFLKGTGK--DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK 262 (381) Q Consensus 185 ~a~a~~~d~a~l~G~G~--~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (381) +++++++|.+|++|+|+ ++|+||++.+......+.. .+.. +...+..+...+. ...... T Consensus 247 ~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~-------------~~~~-~~~~~~~~~~~~~-----~~~~~~ 307 (425) T protein:vir:95 247 RAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVE-------------ADNN-LLKNLVKQIGLID-----TGDDSV 307 (425) T ss_pred HHHHHHHHHHhhccCCCCccccceeecccccccccccc-------------cccc-hHHHHHHHHHhhh-----hhcccc Confidence 99999999999999995 4899999865443221110 0011 1112222222111 111234 Q ss_pred CceEEEEchhhHHHH-HhhhhccCCCCceeecc-------CCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehh Q lcl|Aclame:pro 263 GNVTMVVNPSDAFEV-QAQYTHLNANGVYVTAL-------PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKE 334 (381) Q Consensus 263 ~~~~~imn~~~~~~~-~~~~~~~~~~G~~~~~l-------~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~ 334 (381) ++++|+||+.|++.. ..+...++.+|+|+|.+ .+|+||+++++||++.++||||++|++++|++++|.+|+| T Consensus 308 ~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~~~~~~~~~~~i~~~~~ 387 (425) T protein:vir:95 308 GEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGEFEQYTLVERENITIDSSTH 387 (425) T ss_pred CceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcCCCccEEEEecccEEEEeecceEEEeecc Confidence 678999999998753 34444578999999863 3699999999999999999999999999999999999999 Q ss_pred hhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 335 TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 335 ~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) .+|.+|+++||++.|+||++++++||++++++.+. +|. 
T Consensus 388 ~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~-----~g~ 425 (425) T protein:vir:95 388 VKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPV-----QGA 425 (425) T ss_pred cccccCceEEEEEEeeCcEeecccceEEEEecCcC-----CCC Confidence 99999999999999999999999999999877654 444 No 11 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=3.4e-64 Score=368.54 Aligned_cols=353 Identities=12% Similarity=0.125 Sum_probs=242.4 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHH--------HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-hhh----hhccc Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYGDMINQLFEETKL--QAKAEAERVSS-LPK----SAQSL 65 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~----~~~~l 65 (381) |||+++ ++++.++++.++++....+ +++...+....+.+...... ......++... ..+ ..... T Consensus 1 m~~~lk-~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (401) T protein:vir:44 1 MAVDIK-DVEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKV 79 (401) T ss_pred CCccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 999876 3344444444333211100 11111111111111111100 00000000000 000 00111 Q ss_pred cHHHHHHH-----------------HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCC Q lcl|Aclame:pro 66 SANQRSFF-----------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETS 127 (381) Q Consensus 66 t~~e~~~~-----------------~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~ 127 (381) ..++++.+ .++..+++++||++||+++.++|++.+++.++|+++|+++++++ ...+|+..+. 
T Consensus 80 ~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 159 (401) T protein:vir:44 80 AAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGG 159 (401) T ss_pred hHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCC Confidence 11222211 24567788899999999999999999999999999999999865 4789998888 Q ss_pred cceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceee Q lcl|Aclame:pro 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGL 207 (381) Q Consensus 128 ~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gi 207 (381) +.+.|++|+++.+....++|+++++.+||++++++||+|||+||.+||++||.++|+++++++++.+|++|+|+++|.|| T Consensus 160 ~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gi 239 (401) T protein:vir:44 160 TASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGF 239 (401) T ss_pred ccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCcccee Confidence 88999999887776667999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCC Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNAN 287 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~ 287 (381) ++............. 
.......+........+.+.+++..+ ...|+.+++|+||++++..++.+ ++.+ T Consensus 240 l~~~~~~~~~~~~~~--~~~~~~~t~~~~~~~~d~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~L~~l---kd~~ 307 (401) T protein:vir:44 240 LAYESTEESDKARAF--GKLQHIVSGEATAVTADAIIKLIYTL-------RKAHRTGAKFMMNNNSLFAIRLL---KDTE 307 (401) T ss_pred ecccccccccccccc--ccccccccccccccCHHHHHHHHHhc-------chhhhcCCEEEEcHHHHHHHHHh---hccC Confidence 975443221111100 00111111111112223333333322 34678899999999999888875 5678 Q ss_pred Cceeec---------cCCCceEEecCCCCCc-----cEEEEeccc-eEEEecceeeEeeehhhhhhcCceEEEEEEEEcC Q lcl|Aclame:pro 288 GVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYG 352 (381) Q Consensus 288 G~~~~~---------l~~g~~vi~s~~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dg 352 (381) |+|++. ..+|+||+.+++||.. .++||||++ |.+++|.++++.+ +.++.+|+++||+++|+|| T Consensus 308 G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~--~~~~~~~~v~~~a~~r~d~ 385 (401) T protein:vir:44 308 GNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILR--DPYTNKPFVGFYTTKRTGG 385 (401) T ss_pred CceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEee--eccccCCcEEEEEEEEecc Confidence 888874 2479999999999852 278999987 8899999998865 4568899999999999999 Q ss_pred EEecCcceEEEEEEec Q lcl|Aclame:pro 353 KAKDNKVAAVWKLDLK 368 (381) Q Consensus 353 k~~~~~Af~v~~l~~~ 368 (381) ++++++||++++++-+ T Consensus 386 ~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 386 MLVDSQAIKLLKIAAA 401 (401) T ss_pred EEecccceEEEEeecC Confidence 9999999999888776 No 12 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=8.4e-63 Score=360.92 Aligned_cols=354 Identities=14% Similarity=0.121 Sum_probs=240.8 Q ss_pred CCcc---HHHHHHHHHHHHHHHHhhhh--HHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHH---h-- Q lcl|Aclame:pro 1 MTIN---LSETFANAKNEFINAVNNGE--PQERQNELYGDM------------INQLFEETKLQAKAEAERVSS---L-- 
58 (381) Q Consensus 1 m~~~---l~~~~~e~~~~~~~~~~~~~--~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~---~-- 58 (381) |.-. ++.+..++.+++.+.++... -.+++.+.+... .+.+..+... .+...++... . T Consensus 21 ~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~-~~~~~~~~~~~~~~~~ 99 (425) T protein:vir:10 21 VPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEA-LQAAVDEANIKIAAAQ 99 (425) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhh Confidence 3322 22222222223222221110 001111111100 0011101000 0000110000 0 Q ss_pred --hhhhc-cccHHHHHHH----------HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEe Q lcl|Aclame:pro 59 --PKSAQ-SLSANQRSFF----------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKS 124 (381) Q Consensus 59 --~~~~~-~lt~~e~~~~----------~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~ 124 (381) ..... ..+.+.++.| +++..+++++||++||+++.++|++.+++.++|+++|++++++ +..++|+. T Consensus 100 ~~~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~ 179 (425) T protein:vir:10 100 MGANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFN 179 (425) T ss_pred cccccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE Confidence 00111 1122223322 4567788999999999999999999999999999999999986 45899999 Q ss_pred cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcc Q lcl|Aclame:pro 125 ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP 204 (381) Q Consensus 125 ~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP 204 (381) ++.+.+.|++|++..+....++|+++++.+|+++++++||+|||+||.++|++||.+++++++++++|.+|++|+|+++| T Consensus 180 ~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p 259 (425) T protein:vir:10 180 MGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKP 259 (425) T ss_pred 
cCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCc Confidence 88899999999887765556899999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc Q lcl|Aclame:pro 205 IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL 284 (381) Q Consensus 205 ~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~ 284 (381) .||++.................... +........+.+.+++.. ....|+++++|+|||+++..++.+ + T Consensus 260 ~Gil~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~d~l~~l~~~-------l~~~~~~~a~~vmn~~~~~~L~~l---k 327 (425) T protein:vir:10 260 NGLLTYIAGGANAAKHPFGAIEVVN--SGAAADITSDGIIDLVYD-------LPSAFTGNARFAMNRNTQRQVRKL---K 327 (425) T ss_pred ceeeecccccccccccccccccccc--ccccccccHHHHHHHHhh-------hhhhhccCCEEEEchHHHHHHHHh---h Confidence 9999876544332221111000000 111111122223333222 135688999999999999888765 5 Q ss_pred CCCCceeec---------cCCCceEEecCCCCC-----ccEEEEeccc-eEEEecceeeEeeehhhhhhcCceEEEEEEE Q lcl|Aclame:pro 285 NANGVYVTA---------LPFNLNVIESTVQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQF 349 (381) Q Consensus 285 ~~~G~~~~~---------l~~g~~vi~s~~~p~-----~~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r 349 (381) +++|+|+|. ..+|+||+++++||. ..|+||||++ |++++|.++++. .+.|+.+|+++|++..| T Consensus 328 D~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~--~d~~~~~~~~~~~~~~r 405 (425) T protein:vir:10 328 DGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVL--RDPYTAKPYVLFYTTKR 405 (425) T ss_pred cCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEE--ecccccCCcEEEEEEEE Confidence 788998874 347999999999994 2389999998 789999988765 46678999999999999 Q ss_pred EcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 350 AYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 350 ~dgk~~~~~Af~v~~l~~~~ 369 (381) +||++++++||+++.++-+. 
T Consensus 406 ~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 406 VGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred eccEeecccceEEEEeeccC Confidence 99999999999998888877 No 13 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=4.9e-62 Score=356.72 Aligned_cols=356 Identities=12% Similarity=0.077 Sum_probs=241.5 Q ss_pred ccHHHHHHHHHHHHHHHH---hhhhH-----HHHHHHHHHHHHHHHHHH---HHHH---------------------HHH Q lcl|Aclame:pro 3 INLSETFANAKNEFINAV---NNGEP-----QERQNELYGDMINQLFEE---TKLQ---------------------AKA 50 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~---~~~~~-----~~~~~~~~~~~~~~~~~~---~~~~---------------------~~~ 50 (381) |..-+++.+..+++..++ +.... .+++...+....+.+... .... ... T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~~~~~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKRIDAIEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNKVAS 80 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhh Confidence 322223333222222222 11110 000001111111111000 0000 000 Q ss_pred HHHHHHH---hhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecC Q lcl|Aclame:pro 51 EAERVSS---LPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET 126 (381) Q Consensus 51 ~~~~~~~---~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~ 126 (381) ++.++.. .......++..|. 
+++.++++++||++||+++.++|++.++++++|+++|+++++++ ...+|+..+ T Consensus 81 e~~~a~~~~l~~g~~~~~~~~e~---~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 157 (407) T protein:vir:48 81 EHKEAFIGFMRKGREDGLRELER---KALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLG 157 (407) T ss_pred HHHHHHHHHHhccchhhhhHHHH---HhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecC Confidence 1111110 0001112222232 35567888899999999999999999999999999999999865 689999988 Q ss_pred CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCccee Q lcl|Aclame:pro 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~G 206 (381) .+.+.|++|++..+....++|+++++.+||++++++||+|||+||.+++++||.++|++++++++|.+|++|+|++||.| T Consensus 158 ~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~G 237 (407) T protein:vir:48 158 GTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKG 237 (407) T ss_pred CcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccce Confidence 89999999988776666799999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) |++................... .+........+.+.++...+ +..|+++++|+||+.++..++.+ ++. 
T Consensus 238 il~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~d~i~~l~~~l-------~~~~~~~a~~v~n~~~~~~L~~l---kD~ 305 (407) T protein:vir:48 238 FLAYESTDEDDKTRAFGKLQHI--ASGAASGVTADAIIKLIYTL-------RKAHRSGAKFMMNNSSLFAIRLL---KDN 305 (407) T ss_pred eeeccccccccccccccccccc--ccccccccChHHHHHHHHhh-------chhhhcCCEEEEcHHHHHHHHHh---hcc Confidence 9976543222111110000000 01111111223333333322 34688899999999999887765 577 Q ss_pred CCceeec---------cCCCceEEecCCCCCc-----cEEEEeccc-eEEEecceeeEeeehhhhhhcCceEEEEEEEEc Q lcl|Aclame:pro 287 NGVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAY 351 (381) Q Consensus 287 ~G~~~~~---------l~~g~~vi~s~~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~d 351 (381) +|+|+|. ..+|+||+++++||.. .|+||||++ |.+++|.++++.++ .|+.+|+++||+.+|+| T Consensus 306 ~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d--~~~~~~~~~~~~~~r~d 383 (407) T protein:vir:48 306 DGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD--PYTNKPFVGFYTTKRTG 383 (407) T ss_pred CCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEee--ccccCCcEEEEEEEEec Confidence 8998874 2479999999999952 378999987 88999999988764 56889999999999999 Q ss_pred CEEecCcceEEEEEEecccccCCC Q lcl|Aclame:pro 352 GKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 352 gk~~~~~Af~v~~l~~~~~~~~~~ 375 (381) |++++++||+++++.-++...+-- T Consensus 384 ~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 384 GMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred cEEecccceEEEEeeccCCCCCCC Confidence 999999999998887766554444 No 14 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=8.7e-62 Score=355.34 Aligned_cols=346 Identities=16% Similarity=0.089 Sum_probs=242.3 Q ss_pred CCccHHHHHHHHHHHHHHHHhhh-------hHHHHHHHHHH---HHHHHHHHHHHHHHHH----HHHHHHHhhhhh---- Q lcl|Aclame:pro 1 
MTINLSETFANAKNEFINAVNNG-------EPQERQNELYG---DMINQLFEETKLQAKA----EAERVSSLPKSA---- 62 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~~~~~~~~~~~~~----~~~~~~~~~~~~---- 62 (381) |....-.+++|+++++...++.. +..+++.+.+. ...+.+.+........ ............ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSG 80 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc Confidence 77766667777776655444321 11122222222 1222222111110000 000000000000 Q ss_pred -------------ccccHHHHHHH---HHHhcccCCCCceEccHHHHHHHHHHHHhh-hhhhhhceeeecCC--ceEEEE Q lcl|Aclame:pro 63 -------------QSLSANQRSFF---MDINKNVNYKEEKLLPEETIDRIFEDLTTN-HPLLADLGIKNAGL--RLKFLK 123 (381) Q Consensus 63 -------------~~lt~~e~~~~---~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~-~~l~~~~~v~~~~~--~~~ip~ 123 (381) +.....+++.+ .....++.+++|.++|+++.+++|..+... ++++.+++++++++ ...+|+ T Consensus 81 ~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:13 81 AQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTV 160 (392) T ss_pred hhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE Confidence 00001111111 122334556667777888888877665554 56788889888743 478999 Q ss_pred ecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|Aclame:pro 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q 203 (381) .++.+.+.|++|+++.+ +++++|+++++.+||++++++||+|||+||.+++++||+++|++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~ 239 (392) T protein:vir:13 161 ITGRATAGIVGETAEIP-ESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQ 
239 (392) T ss_pred EcCCcceeeeccccccc-ccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc Confidence 99999999999988765 57899999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) |.||++.......... ....+. ... +.+.+++..+ +..|+++++|+||+.++..++.+ T Consensus 240 p~Gil~~~~~~~~~~~-----~~~~~~---~~~----d~l~~~~~~l-------~~~~~~~a~~v~n~~~~~~l~~l--- 297 (392) T protein:vir:13 240 PRGILTDATGANAAFG-----EADADS---KVS----DALIDLFHEV-------PSAYRKNAKFVVNDLRAAQMRKL--- 297 (392) T ss_pred cccccccccccccccc-----cccccc---ccH----HHHHHHHHhh-------hhhhhcCCEEEEcHHHHHHHHHh--- Confidence 9999976533221111 001111 111 1222222211 34578899999999999888765 Q ss_pred cCCCCceeecc---------CCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEE Q lcl|Aclame:pro 284 LNANGVYVTAL---------PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKA 354 (381) Q Consensus 284 ~~~~G~~~~~l---------~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~ 354 (381) ++++|+|+|.. 
.+|+||+.+++||++.|+||||++|++++|++++++++.+.+|.+|+++||++.|+||++ T Consensus 298 kd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~ 377 (392) T protein:vir:13 298 KDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLL 377 (392) T ss_pred hccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeccEE Confidence 67889998742 479999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCcceEEEEEEecc Q lcl|Aclame:pro 355 KDNKVAAVWKLDLKG 369 (381) Q Consensus 355 ~~~~Af~v~~l~~~~ 369 (381) ++++||++++++-++ T Consensus 378 ~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 378 VDARGAKVLTVTPAA 392 (392) T ss_pred ecccceEEEEeeccC Confidence 999999999998877 No 15 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.1e-60 Score=349.27 Aligned_cols=344 Identities=17% Similarity=0.096 Sum_probs=231.7 Q ss_pred CCccHHHHHHHHHHHHHHHHh-------hhhHHHHHHHHHHHH---HHHHHHHHHHHH---HHH-HHHHHHhhhhh---- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVN-------NGEPQERQNELYGDM---INQLFEETKLQA---KAE-AERVSSLPKSA---- 62 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~-------~~~~~~~~~~~~~~~---~~~~~~~~~~~~---~~~-~~~~~~~~~~~---- 62 (381) |..-...++.+++.++.++++ +....+++.+.++++ .+.+.+...... +.. ..+........ 
T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSG 80 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 554333344444444333222 111122222222222 222211111110 000 00000000000 Q ss_pred --cccc-----------HHHHHHHH---HHhcccCCCCceEccHHHHHHHH-HHHHhhhhhhhhceeeecCC--ceEEEE Q lcl|Aclame:pro 63 --QSLS-----------ANQRSFFM---DINKNVNYKEEKLLPEETIDRIF-EDLTTNHPLLADLGIKNAGL--RLKFLK 123 (381) Q Consensus 63 --~~lt-----------~~e~~~~~---~~~~~~~~~gg~lvP~~~~~~Ii-~~l~~~~~l~~~~~v~~~~~--~~~ip~ 123 (381) .... ..+++.+. ....++.+++|.++|+++.+++| +.++..++++++|+++++++ .++||+ T Consensus 81 ~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~ 160 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTV 160 (390) T ss_pred chhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE Confidence 0001 01111110 11223444455555555555555 55666677889999998754 378999 Q ss_pred ecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|Aclame:pro 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q 203 (381) .++.+.+.|++|+++++ +++++|+++++.+|+++++++||+|||+||.+|+++||+++++++|++++|.+|++|+| + T Consensus 161 ~~~~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--~ 237 (390) T protein:vir:62 161 ITGRSSASIVGETAEIP-ESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG--Q 237 (390) T ss_pred EcCCcceeeeccccccc-ccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC--c Confidence 99999999999988775 46899999999999999999999999999999999999999999999999999999988 7 Q ss_pred 
ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) |.||++........... +..+...... +.+++..+ ...|+++++|+||+.++..++.+ T Consensus 238 p~Gi~~~~~~~~~~~~~--------~~~~~~~~~~----l~~~~~~l-------~~~~~~~a~~vmn~~~~~~L~~l--- 295 (390) T protein:vir:62 238 PRGILTDASPATATFLA--------TDTDSKVSDA----LIDLFHEV-------PSAYRANAKYVVNDLRAAQMRKL--- 295 (390) T ss_pred cccccccccccccceec--------ccccccchHH----HHHHHHhh-------hhhhhcCCEEEEchHHHHHHHHh--- Confidence 99999865332211100 0001111222 22222221 24577899999999998888765 Q ss_pred cCCCCceeec---------cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEE Q lcl|Aclame:pro 284 LNANGVYVTA---------LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKA 354 (381) Q Consensus 284 ~~~~G~~~~~---------l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~ 354 (381) ++++|+|+|. 
..+|+||++++++|++.|+||||++|++++|++++++++.+.+|.+|+++||+++|+||++ T Consensus 296 kd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~ 375 (390) T protein:vir:62 296 KDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLL 375 (390) T ss_pred hccCCCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEe Confidence 5788999874 2479999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCcceEEEEEEecc Q lcl|Aclame:pro 355 KDNKVAAVWKLDLKG 369 (381) Q Consensus 355 ~~~~Af~v~~l~~~~ 369 (381) ++++||++++++-++ T Consensus 376 ~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 376 VDARGAKVLTVTPGA 390 (390) T ss_pred echhheEEEEeecCC Confidence 999999999988877 No 16 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2.8e-60 Score=347.04 Aligned_cols=346 Identities=14% Similarity=0.067 Sum_probs=233.8 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhh----hhH-HHH------HHHHHHHHHHHHHHH-------HHHHHHH----------- Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEFINAVNN----GEP-QER------QNELYGDMINQLFEE-------TKLQAKA----------- 50 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~----~~~-~~~------~~~~~~~~~~~~~~~-------~~~~~~~----------- 50 (381) |+| |+.++...+.++....++. .+. .++ +.+.+....+.+.+. ...+... 
T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~ 80 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhc Confidence 666 3333333333332222211 110 010 111111111111110 0000000 Q ss_pred ----------------HHHHHHHh-------hhhh-ccccHHHHHHHHHH------------hcccCCCCceEccHHHHH Q lcl|Aclame:pro 51 ----------------EAERVSSL-------PKSA-QSLSANQRSFFMDI------------NKNVNYKEEKLLPEETID 94 (381) Q Consensus 51 ----------------~~~~~~~~-------~~~~-~~lt~~e~~~~~~~------------~~~~~~~gg~lvP~~~~~ 94 (381) +....... .++. .....++++++..+ ...++++||++||+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~ 160 (434) T protein:vir:62 81 DPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSK 160 (434) T ss_pred chhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHH Confidence 00000000 0000 01112334433221 112345799999999999 Q ss_pred HHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEeccc--ccccccccccccceeccceeeeeehhhhHHHHhcCh Q lcl|Aclame:pro 95 RIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIY--GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP 172 (381) Q Consensus 95 ~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~~e~--~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~ 172 (381) .|++.++++++|+++|++++++++.++|+....+.+.|..+. +...+.++++|+++++.+|+++++++||+|||+||. 
T Consensus 161 ~Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~ 240 (434) T protein:vir:62 161 EIITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTG 240 (434) T ss_pred HHHHhhhhhhhhhhhcceeccCCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcch Confidence 999999999999999999999999999998777777775332 333456799999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhhheeeccCCCcc-eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhh Q lcl|Aclame:pro 173 AWIERFVRVQIEEAFAVALETAFLKGTGKDQP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHS 251 (381) Q Consensus 173 ~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP-~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~ 251 (381) +||++||.++|++++++++|.+||+|+|+++| .|+++....... .... .+.+.+.++...+ T Consensus 241 ~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~----------~~~~-------~~~d~l~~l~~~l- 302 (434) T protein:vir:62 241 LPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFK----------TDEK-------NLYDALVKMKNTP- 302 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccccc----------cccc-------chhhHHHHHHhhc- Confidence 99999999999999999999999999998875 566642211000 0000 1122233332222 Q ss_pred hccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec-----------cCCCceEEecCCCCCcc------EEEE Q lcl|Aclame:pro 252 TNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----------LPFNLNVIESTVQEAGK------VLTY 314 (381) Q Consensus 252 ~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-----------l~~g~~vi~s~~~p~~~------i~~g 314 (381) ...|+++++|+|||.++..++.+ ++++|+|+|. ..+|+||+++++||.+. 
|+|| T Consensus 303 ------~~~~~~~a~~v~n~~~~~~L~~l---kd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~G 373 (434) T protein:vir:62 303 ------VKEVRKKARWVLNTAALTKIETM---KTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFG 373 (434) T ss_pred ------chhhhcCCEEEEcHHHHHHHHHh---hccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEe Confidence 34688899999999999888765 6778999873 24799999999998544 7899 Q ss_pred eccceEEEecc-eeeEeeehhhhhhcCceEEEEEEEEcCEEec-CcceEEEEEEecccccC Q lcl|Aclame:pro 315 VKGLYDGYLAG-GINVQKFKETLALDDMDLYTAKQFAYGKAKD-NKVAAVWKLDLKGHKPA 373 (381) Q Consensus 315 d~s~y~i~~r~-~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~-~~Af~v~~l~~~~~~~~ 373 (381) |||+|+|++|. +++++++++.+|.+|+++||++.|+|||++. +++.+++.+++...+.+ T Consensus 374 dfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 374 DFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred eccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 99999999886 5889999999999999999999999999997 99999999998776666 No 17 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=2.3e-58 Score=336.57 Aligned_cols=345 Identities=13% Similarity=0.107 Sum_probs=239.7 Q ss_pred CCc-cHHHHHHHHHHHH---HHHHhhhhHHHHHHHHHHH---HHHHHHHHHH----------HHHH-------------- Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEF---INAVNNGEPQERQNELYGD---MINQLFEETK----------LQAK-------------- 49 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~---~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~----------~~~~-------------- 49 (381) |+| +|+++..+..+++ .++.++....+++.+.+.+ .++.+..... 
...+ T Consensus 1 M~l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (409) T protein:vir:45 1 MKLHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPEN 80 (409) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCC Confidence 777 4444443333332 2222211111112111111 1111111100 0000 Q ss_pred ----HHH-HHHHH--hhhhhccccHHHHHHHHH---HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc- Q lcl|Aclame:pro 50 ----AEA-ERVSS--LPKSAQSLSANQRSFFMD---INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR- 118 (381) Q Consensus 50 ----~~~-~~~~~--~~~~~~~lt~~e~~~~~~---~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~- 118 (381) .+. .++.. ...+...++.+|++.+.+ +..+++++||++||+++.++|++.+++.+||+++|+++++++. T Consensus 81 ~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 160 (409) T protein:vir:45 81 NSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGR 160 (409) T ss_pred cchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc Confidence 000 01111 112334567777776543 4456778899999999999999999999999999999998654 Q ss_pred -eEEEEecCC-cceEEecccccccccccccccceeccceeee-eehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhe Q lcl|Aclame:pro 119 -LKFLKSETS-GVAVWGKIYGEIKGQLDAAFSEETAIQNKLT-AFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 119 -~~ip~~~~~-~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~-~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~ 195 (381) ..+|...+. ..+.|++|+++. 
++++++|+++++.+||++ ++++||+|||+||.++|++||.++|+++++++++.+| T Consensus 161 ~~~~~~~~~~~~~~~~v~E~~~~-~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~ 239 (409) T protein:vir:45 161 TMEWATADGTSEVGVLLGENEEA-GEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYL 239 (409) T ss_pred eEEEEeeccCccccccccccccc-cccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 455655443 457899997765 467899999999999986 6899999999999999999999999999999999999 Q ss_pred eeccCCC---cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEE--EEc Q lcl|Aclame:pro 196 LKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTM--VVN 270 (381) Q Consensus 196 l~G~G~~---qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--imn 270 (381) |+|+|++ +|+||++...+..... ..+.++.+ .+.++...+ +..|++++.| +|| T Consensus 240 l~G~G~~~~~~p~Gil~~~~~~~~~~--------~~~~~~~d-------~i~~l~~~l-------~~~~~~~a~~~~~~n 297 (409) T protein:vir:45 240 IQGTGAGTPKQPKGLAASVTGTTQTA--------AANAVKWQ-------EILALKHSI-------DPAYRRGPKFRLAFN 297 (409) T ss_pred hccCCCCCccccceeeeccccccccc--------cccccchH-------HHHHHHHhh-------hhhhccCCeEEEEEC Confidence 9999975 8999997654322111 01111111 122222211 3346666655 679 Q ss_pred hhhHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCC-----ccEEEEeccceEEEecceeeEeeehhhh Q lcl|Aclame:pro 271 PSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEA-----GKVLTYVKGLYDGYLAGGINVQKFKETL 336 (381) Q Consensus 271 ~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~-----~~i~~gd~s~y~i~~r~~~~i~~~~~~~ 336 (381) +.++..++.+ ++.+|+|++. ..+|+||+++++||. 
..++||||++|++++++++.++.+++.| T Consensus 298 ~~~~~~l~~l---kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~ 374 (409) T protein:vir:45 298 DNTLKLISEM---EDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERY 374 (409) T ss_pred HHHHHHHHHh---hcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeeccc Confidence 9999888765 5788999874 347999999999985 3478999999999999999999999999 Q ss_pred hhcCceEEEEEEEEcCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~ 371 (381) |.+|+++||+..|+||++++++||++++++-+..- T Consensus 375 ~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 375 AEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 99999999999999999999999999887765533 No 18 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=2.7e-58 Score=336.16 Aligned_cols=362 Identities=13% Similarity=0.072 Sum_probs=227.6 Q ss_pred ccHHHHHHHHHHHHHHHHhhh--------------------------------hHHHHHHHHHH---HHHHHHHHHHH-H Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNG--------------------------------EPQERQNELYG---DMINQLFEETK-L 46 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~--------------------------------~~~~~~~~~~~---~~~~~~~~~~~-~ 46 (381) |+-+..++.+.+++.+.++.. ...++..+.+. ...+.+..... . 
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222211211111111111100 00000000000 00000000000 0 Q ss_pred HH------H-------HHH---HHHHHhhhhh-------ccccH--------HHHH----------HHHHHhcccCCCCc Q lcl|Aclame:pro 47 QA------K-------AEA---ERVSSLPKSA-------QSLSA--------NQRS----------FFMDINKNVNYKEE 85 (381) Q Consensus 47 ~~------~-------~~~---~~~~~~~~~~-------~~lt~--------~e~~----------~~~~~~~~~~~~gg 85 (381) +. . ... .+........ ..... +.+. ....+..+++++|| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 00 0 000 0000000000 00000 0000 11233456778899 Q ss_pred eEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecC-CcceEEecccccccccccccccceeccceeeeeehhh Q lcl|Aclame:pro 86 KLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVL 163 (381) Q Consensus 86 ~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~i 163 (381) ++||+++..+|++.+++.++|+++|+++++++ ..+||+.++ .+.+.|++|++..+ +++++|++|++.+||++++++| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~-~s~~~f~~i~~~~~k~a~~~~i 239 (497) T protein:vir:10 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccc-cccccceeeEeeeeeeEeecHh Confidence 99999999999999999999999999999865 489998765 56899999987765 5789999999999999999999 Q ss_pred hHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhcc----------c Q lcl|Aclame:pro 164 
PKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLT----------F 233 (381) Q Consensus 164 S~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t----------~ 233 (381) |+|||+|++ ++++||.++++++|++++|.+||+|+|+++|.||++....................... . T Consensus 240 S~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) T protein:vir:10 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) T ss_pred HHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccch Confidence 999999986 69999999999999999999999999999999999865443332221111100000000 0 Q ss_pred -cChh--------------------------HHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 234 -ANPR--------------------------ATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 234 -~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) .... +....+..+...+... ....+.....|+|||.|+..++.+ +++ T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~vmn~~~~~~l~~l---kd~ 392 (497) T protein:vir:10 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI---QLTLFQTPNAVVMNPRDWELLRLT---KDA 392 (497) T ss_pred hhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh---hhhcccCCCeEEEchHHHHHHHHh---hcC Confidence 0000 0000111111111100 111233334699999999998876 678 Q ss_pred CCceeecc---------------CCCceEEecCCCCCccEEEEeccc--eEEEecceeeEeeehh--hhhhcCceEEEEE Q lcl|Aclame:pro 287 NGVYVTAL---------------PFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKE--TLALDDMDLYTAK 347 (381) Q Consensus 287 ~G~~~~~l---------------~~g~~vi~s~~~p~~~i~~gd~s~--y~i~~r~~~~i~~~~~--~~~~~d~~~~~~~ 347 (381) +|+|+|.. .+|+||+++++||+++++||||++ |.|++|++++|+++++ .+|.+|+++||+. 
T Consensus 393 ~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~ 472 (497) T protein:vir:10 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) T ss_pred CCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEE Confidence 89998752 368999999999999999999987 5688999999999887 5699999999999 Q ss_pred EEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 348 QFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 348 ~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) .|+||.+++++||++++++-+.. +. T Consensus 473 ~r~~~~v~~p~A~~~l~~~~~~~-----~~ 497 (497) T protein:vir:10 473 ERLGLLVYRPSAFQLIQLKKGAT-----GS 497 (497) T ss_pred EeecceeeccccEEEEEecCCcc-----CC Confidence 99999999999999998865442 22 No 19 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=2.7e-58 Score=336.16 Aligned_cols=362 Identities=13% Similarity=0.072 Sum_probs=227.6 Q ss_pred ccHHHHHHHHHHHHHHHHhhh--------------------------------hHHHHHHHHHH---HHHHHHHHHHH-H Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNG--------------------------------EPQERQNELYG---DMINQLFEETK-L 46 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~--------------------------------~~~~~~~~~~~---~~~~~~~~~~~-~ 46 (381) |+-+..++.+.+++.+.++.. ...++..+.+. ...+.+..... . 
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222211211111111111100 00000000000 00000000000 0 Q ss_pred HH------H-------HHH---HHHHHhhhhh-------ccccH--------HHHH----------HHHHHhcccCCCCc Q lcl|Aclame:pro 47 QA------K-------AEA---ERVSSLPKSA-------QSLSA--------NQRS----------FFMDINKNVNYKEE 85 (381) Q Consensus 47 ~~------~-------~~~---~~~~~~~~~~-------~~lt~--------~e~~----------~~~~~~~~~~~~gg 85 (381) +. . ... .+........ ..... +.+. ....+..+++++|| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 00 0 000 0000000000 00000 0000 11233456778899 Q ss_pred eEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecC-CcceEEecccccccccccccccceeccceeeeeehhh Q lcl|Aclame:pro 86 KLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVL 163 (381) Q Consensus 86 ~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~i 163 (381) ++||+++..+|++.+++.++|+++|+++++++ ..+||+.++ .+.+.|++|++..+ +++++|++|++.+||++++++| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~-~s~~~f~~i~~~~~k~a~~~~i 239 (497) T protein:vir:78 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccc-cccccceeeEeeeeeeEeecHh Confidence 99999999999999999999999999999865 489998765 56899999987765 5789999999999999999999 Q ss_pred hHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhcc----------c Q lcl|Aclame:pro 164 
PKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLT----------F 233 (381) Q Consensus 164 S~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t----------~ 233 (381) |+|||+|++ ++++||.++++++|++++|.+||+|+|+++|.||++....................... . T Consensus 240 S~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) T protein:vir:78 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) T ss_pred HHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccch Confidence 999999986 69999999999999999999999999999999999865443332221111100000000 0 Q ss_pred -cChh--------------------------HHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 234 -ANPR--------------------------ATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 234 -~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) .... +....+..+...+... ....+.....|+|||.|+..++.+ +++ T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~vmn~~~~~~l~~l---kd~ 392 (497) T protein:vir:78 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI---QLTLFQTPNAVVMNPRDWELLRLT---KDA 392 (497) T ss_pred hhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh---hhhcccCCCeEEEchHHHHHHHHh---hcC Confidence 0000 0000111111111100 111233334699999999998876 678 Q ss_pred CCceeecc---------------CCCceEEecCCCCCccEEEEeccc--eEEEecceeeEeeehh--hhhhcCceEEEEE Q lcl|Aclame:pro 287 NGVYVTAL---------------PFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKE--TLALDDMDLYTAK 347 (381) Q Consensus 287 ~G~~~~~l---------------~~g~~vi~s~~~p~~~i~~gd~s~--y~i~~r~~~~i~~~~~--~~~~~d~~~~~~~ 347 (381) +|+|+|.. .+|+||+++++||+++++||||++ |.|++|++++|+++++ .+|.+|+++||+. 
T Consensus 393 ~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~ 472 (497) T protein:vir:78 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) T ss_pred CCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEE Confidence 89998752 368999999999999999999987 5688999999999887 5699999999999 Q ss_pred EEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 348 QFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 348 ~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) .|+||.+++++||++++++-+.. +. T Consensus 473 ~r~~~~v~~p~A~~~l~~~~~~~-----~~ 497 (497) T protein:vir:78 473 ERLGLLVYRPSAFQLIQLKKGAT-----GS 497 (497) T ss_pred EeecceeeccccEEEEEecCCcc-----CC Confidence 99999999999999998865442 22 No 20 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=1.3e-57 Score=332.45 Aligned_cols=345 Identities=11% Similarity=0.014 Sum_probs=233.2 Q ss_pred CCcc-HHHHHHHHHHH---HHHHHhhhhH--HHHHHHHHH---HHHHHHHHHHHHH-HH-H----------------HHH Q lcl|Aclame:pro 1 MTIN-LSETFANAKNE---FINAVNNGEP--QERQNELYG---DMINQLFEETKLQ-AK-A----------------EAE 53 (381) Q Consensus 1 m~~~-l~~~~~e~~~~---~~~~~~~~~~--~~~~~~~~~---~~~~~~~~~~~~~-~~-~----------------~~~ 53 (381) |.++ ++.+......+ +.+.++.... ..+..+.++ ...+.+....... .+ . ... 
T Consensus 143 ~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~ 222 (543) T protein:vir:81 143 DSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYL 222 (543) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhh Confidence 5542 22222222211 1222111100 000111111 1111111000000 00 0 000 Q ss_pred HH---HHhhhhhccccHHHHHHHHHH--hcccCCCCceEccHHHHHHHH-HHHHhhhhhhhhceeeecCCceEEEEecCC Q lcl|Aclame:pro 54 RV---SSLPKSAQSLSANQRSFFMDI--NKNVNYKEEKLLPEETIDRIF-EDLTTNHPLLADLGIKNAGLRLKFLKSETS 127 (381) Q Consensus 54 ~~---~~~~~~~~~lt~~e~~~~~~~--~~~~~~~gg~lvP~~~~~~Ii-~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~ 127 (381) ++ .........++..+++.+... ...++++||++||+++...|| +.++..++|++++++.+++|...+|+.++. T Consensus 223 ~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~ 302 (543) T protein:vir:81 223 RAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVWHGVSSAA 302 (543) T ss_pred hHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceEEEEecCC Confidence 00 001111122344444444332 224567899999999999877 557788999999999999999999999999 Q ss_pred cceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC-ccee Q lcl|Aclame:pro 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-QPIG 206 (381) Q Consensus 128 ~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~-qP~G 206 (381) +.+.|++|++..+ +++++|+++++.+|+++++++||++||+|+ +++++||.+.|+++++++++.+|++|+|++ +|.| T Consensus 303 ~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~G 380 (543) T protein:vir:81 303 VQWSWDAEFEEVS-DDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTG 380 (543) T ss_pred cceeecccCcccc-ccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccc Confidence 9999999987764 579999999999999999999999999998 699999999999999999999999999975 9999 Q ss_pred 
eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) |++........... .....+ ....+..+...+ +..|+.+++|+|||.++..++.+ +++ T Consensus 381 i~~~~~~~~~~~~~-----~~~~~~-------~~~~~~~~~~~l-------~~~~~~~~~~v~n~~~~~~l~~l---kd~ 438 (543) T protein:vir:81 381 IVTALAGTAAEIAP-----VTAETF-------ALADVYAVYEQL-------AARHRRQGAWLANNLIYNKIRQF---DTQ 438 (543) T ss_pred chhhcccccccccc-----cccccc-------cHHHHHHHHHhh-------hccccCCcEEEEcHHHHHHHHHh---hcC Confidence 99754322211000 001111 111222222211 34577888999999999888875 567 Q ss_pred CCceeec--------cCCCceEEecCCCCCcc----------EEEEeccceEEEecceeeEeeehhhh----hhcCceEE Q lcl|Aclame:pro 287 NGVYVTA--------LPFNLNVIESTVQEAGK----------VLTYVKGLYDGYLAGGINVQKFKETL----ALDDMDLY 344 (381) Q Consensus 287 ~G~~~~~--------l~~g~~vi~s~~~p~~~----------i~~gd~s~y~i~~r~~~~i~~~~~~~----~~~d~~~~ 344 (381) +|.|+|. ..+|+||+.+++||.+. 
++||||+.|+|+++++++|.++++.+ |.+++++| T Consensus 439 ~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~ 518 (543) T protein:vir:81 439 GGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGW 518 (543) T ss_pred CCceeccCcCCCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEE Confidence 8888874 24799999999998643 78999999999999999999988764 45679999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~ 369 (381) ++++|+||++.+++||++++++.++ T Consensus 519 ~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 519 FAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred EEEEeeccEeecccceEEEEecccC Confidence 9999999999999999999888877 No 21 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=7.1e-57 Score=328.41 Aligned_cols=349 Identities=11% Similarity=0.043 Sum_probs=234.2 Q ss_pred ccHHHHHHHHHHHHHHHHhh---hhH-----HHHHHHHH---HHHHHHHHHHHHH-HHHHHHHH----HHHh-------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNN---GEP-----QERQNELY---GDMINQLFEETKL-QAKAEAER----VSSL-------- 58 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~---~~~-----~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~----~~~~-------- 58 (381) |+...+++++++++.+.++. ... .+++.+.+ ....+.+..+... +...+..+ .... 
T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 80 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVI 80 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccc Confidence 54455555555555443321 110 11222222 2222222211110 00000000 0000 Q ss_pred -------------hhhhcccc------HHHHHH---------HHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhh- Q lcl|Aclame:pro 59 -------------PKSAQSLS------ANQRSF---------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLAD- 109 (381) Q Consensus 59 -------------~~~~~~lt------~~e~~~---------~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~- 109 (381) .+....+. ....++ ......++.+.||++||+++.++|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~ 160 (428) T protein:vir:10 81 VKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLG 160 (428) T ss_pred cccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhc Confidence 00000000 000000 0011223445789999999999999999999999998 Q ss_pred ceeeec-CCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 110 LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFA 188 (381) Q Consensus 110 ~~v~~~-~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a 188 (381) ++++++ +|.++||+.++.+.+.|++|+++.+ +++++|++|++.+|+++++++||+|||+||.++|++||.++|+++++ T Consensus 161 ~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~ 239 (428) T protein:vir:10 161 ARSIPLPNGNMSLPRLAGGATASYTGENQDAK-VSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAIS 239 (428) T ss_pred ceeeecCCcceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHH Confidence 677775 5679999998889999999987765 57899999999999999999999999999999999999999999999 Q ss_pred HHHhhheeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEE Q 
lcl|Aclame:pro 189 VALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTM 267 (381) Q Consensus 189 ~~~d~a~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (381) +++|.+|++|+|++ +|.||++............ .....+........+ .+... ......+..+++| T Consensus 240 ~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~----~~~~~----~~~~~~~~~~~~~ 306 (428) T protein:vir:10 240 VREDKAFMRDDGTGDTPIGMKARATQWNRLLPWA-----ADAAVNLDTIDTYLD----SIILM----SMDGNSNMISSGW 306 (428) T ss_pred HHHHHHHhccCCCCcccccccccccccccccccc-----ccccccHHHHHHHHH----HHHHh----hhccccccccCEE Confidence 99999999999975 9999997543222111110 001111111111111 11111 1123445667899 Q ss_pred EEchhhHHHHHhhhhccCCCCceeecc-----CCCceEEecCCCCCc--------cEEEEeccceEEEecceeeEeeehh Q lcl|Aclame:pro 268 VVNPSDAFEVQAQYTHLNANGVYVTAL-----PFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGGINVQKFKE 334 (381) Q Consensus 268 imn~~~~~~~~~~~~~~~~~G~~~~~l-----~~g~~vi~s~~~p~~--------~i~~gd~s~y~i~~r~~~~i~~~~~ 334 (381) +||+.++..++.+ ++.+|+|++.. .+|+||+.+++||++ .++|||||+|++++|++++++++++ T Consensus 307 v~n~~~~~~L~~l---kd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~ 383 (428) T protein:vir:10 307 GMSNRTYMKLFGL---RDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKE 383 (428) T ss_pred EEcHHHHHHHHHh---hccCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEEeecc Confidence 9999999888775 57889998742 379999999999864 3899999999999999999999987 Q ss_pred h-----------hhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 335 T-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 335 ~-----------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) . 
+|..|+++||+.+|+|+++.+++||++++-..= T Consensus 384 ~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 384 ASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 4 588999999999999999999999999764443 No 22 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.2e-56 Score=327.24 Aligned_cols=329 Identities=9% Similarity=0.029 Sum_probs=237.2 Q ss_pred CCccHHHHHHHHHHHHHH---HHhhhhHHH--HHHHHHHHHHHHHHHHHH----------HHH----------------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFIN---AVNNGEPQE--RQNELYGDMINQLFEETK----------LQA----------------- 48 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~~----------~~~----------------- 48 (381) |+|++++++.+.+++..+ +++....+. +..+......+.+.++.. .+. T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQ 80 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc Confidence 999888877765555433 222211111 111111111111111000 000 Q ss_pred ----------HHHHHHHHHhhhhhccccHHHHHHH-----HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceee Q lcl|Aclame:pro 49 ----------KAEAERVSSLPKSAQSLSANQRSFF-----MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK 113 (381) Q Consensus 49 ----------~~~~~~~~~~~~~~~~lt~~e~~~~-----~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~ 113 (381) ..++.+........+.+..+++.++ +++..+++++||++||+++...|++.+++.++|+++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~ 160 (397) T protein:vir:12 81 RSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVE 160 (397) T ss_pred 
ccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhccee Confidence 0011111111111233444444433 2345667789999999999999999999999999999998 Q ss_pred ecC---CceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 114 NAG---LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVA 190 (381) Q Consensus 114 ~~~---~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~ 190 (381) +++ +...+|+.++.+.+.|++|+++.+..+.++|+++++.+|+++++++||+||++||.+++++||.+.|+++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~ 240 (397) T protein:vir:12 161 PVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVT 240 (397) T ss_pred eccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 874 45678888888899999998887766789999999999999999999999999999999999999999999999 Q ss_pred HhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEc Q lcl|Aclame:pro 191 LETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVN 270 (381) Q Consensus 191 ~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn 270 (381) +|.+|++|+|+++|.|+++. . .+...+. 
......|+++++|+|| T Consensus 241 ~d~~il~G~g~~~~~g~~~~-----------------------~-------~i~~~~~------~~l~~~~~~~a~~~~n 284 (397) T protein:vir:12 241 RNNLILAAIASLKKVDIDGL-----------------------D-------GIKKALN------VTLDPMVAPGSIVLTN 284 (397) T ss_pred HHHHHHhccccccccccccH-----------------------H-------HHHHHHh------hccchhhhCCCEEEEc Confidence 99999999999999988531 0 0111110 0113457888999999 Q ss_pred hhhHHHHHhhhhccCCCCceeec---------cCCCceEEecCC-CCC-----ccEEEEeccc-eEEEecceeeEeeehh Q lcl|Aclame:pro 271 PSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTV-QEA-----GKVLTYVKGL-YDGYLAGGINVQKFKE 334 (381) Q Consensus 271 ~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~-~p~-----~~i~~gd~s~-y~i~~r~~~~i~~~~~ 334 (381) |.++..++.+ ++++|+|+|. ..+|+||+.+++ +|. ..++||||++ |.+++|++++|+.+++ T Consensus 285 ~~~~~~L~~l---kd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 361 (397) T protein:vir:12 285 QDGYDWLDTL---KDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDT 361 (397) T ss_pred HHHHHHHHHh---hccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEecc Confidence 9998888765 6778999874 247999987665 342 2389999998 5689999999988765 Q ss_pred --hhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 335 --TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 335 --~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) ..|.+|++.||+.+|+||++.+++||+++++... 
T Consensus 362 ~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 362 GAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 4588999999999999999999999999888885 No 23 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=4.6e-57 Score=329.45 Aligned_cols=350 Identities=14% Similarity=0.075 Sum_probs=234.4 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhhh----hHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-------------------- Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEFINAVNNG----EPQERQNE---LYGDMINQLFEETKLQAKAEA-------------------- 52 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~~----~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-------------------- 52 (381) |+| +|+++..++.+++.+.++.. .-.+++.+ .+...++.+..+.......+. T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhcc Confidence 888 55555444444333322111 00111122 222222222211110000000 Q ss_pred -------------H-----HHHHhhh-hhccccHHHHH---------HHHHHhcccCCCCceEccHHHHHHHHHHHHhhh Q lcl|Aclame:pro 53 -------------E-----RVSSLPK-SAQSLSANQRS---------FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNH 104 (381) Q Consensus 53 -------------~-----~~~~~~~-~~~~lt~~e~~---------~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~ 104 (381) . +...... ....+....+. 
..+.+..+++.+||++||+++.++|++.+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~ 160 (435) T protein:vir:14 81 AAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS 160 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhc Confidence 0 0000000 00000000000 112345677788999999999999999999999 Q ss_pred hhhhh-ceeeec-CCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChh--HHHHHHH Q lcl|Aclame:pro 105 PLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVR 180 (381) Q Consensus 105 ~l~~~-~~v~~~-~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~l~~~i~ 180 (381) +++++ ++++++ ++..++|+.++.+.+.|++|++..+ +++++|+++++.+|+++++++||+|||+||.+ +|++||. T Consensus 161 ~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~ 239 (435) T protein:vir:14 161 VVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIP-TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVV 239 (435) T ss_pred hhhhhcceeeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHH Confidence 99997 778775 5679999998889999999987765 67899999999999999999999999999964 5999999 Q ss_pred HHHHHHHHHHHhhheeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 181 VQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV 259 (381) Q Consensus 181 ~~la~a~a~~~d~a~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~ 259 (381) ++|++++++++|.+|++|+|++ +|.||++........+. .. ..........+..++..+.. .. 
T Consensus 240 ~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~--------~~---~~~~~~~~~~~~~l~~~~~~-----~~ 303 (435) T protein:vir:14 240 GDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITA--------SD---ASTLQKIETDLGKVILALEN-----AD 303 (435) T ss_pred HHHHHHHHHHHHHHhhccCCCCccccceeecccccceecc--------cc---ccchhhHHHHHHHHHHHhhh-----cc Confidence 9999999999999999999975 89999864321111000 00 00111111223333222211 11 Q ss_pred cccCceEEEEchhhHHHHHhhhhccCCCCceeec-----cCCCceEEecCCCCCc--------cEEEEeccceEEEecce Q lcl|Aclame:pro 260 AVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGG 326 (381) Q Consensus 260 ~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~p~~--------~i~~gd~s~y~i~~r~~ 326 (381) .+..+++|+|||.++..++.+ ++.+|+|+|. ..+|+||+.+++||++ .++||||++|+|++|++ T Consensus 304 ~~~~~~~~v~n~~~~~~L~~l---kd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~ 380 (435) T protein:vir:14 304 ANLTQPGWIMAPRTFRFLEGL---RDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEET 380 (435) T ss_pred ccccCCEEEEcHHHHHHHHHh---hccCCceeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEEEEecc Confidence 244577899999999888765 5788999873 3479999999999863 58999999999999999 Q ss_pred eeEeeehhh-----------hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 327 INVQKFKET-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 327 ~~i~~~~~~-----------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) +++.++++. +|.+|+++||+.+|+|+++++++||++++=-. -|. 
T Consensus 381 ~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~-------~~~ 435 (435) T protein:vir:14 381 LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA-------WGA 435 (435) T ss_pred cEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC-------CCC Confidence 999999874 48899999999999999999999999864333 233 No 24 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1.4e-56 Score=326.73 Aligned_cols=350 Identities=14% Similarity=0.084 Sum_probs=234.7 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhhhh----HHHHHHH---HHHHHHHHHHHHHHHHHHHHH-------------------- Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEFINAVNNGE----PQERQNE---LYGDMINQLFEETKLQAKAEA-------------------- 52 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~~~----~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-------------------- 52 (381) |+| +|+++..+..+++.+.++... -.+++.+ .+...++.+..+.......+. T Consensus 1 M~l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhcccc Confidence 888 555544444433332221110 0111111 122222222111110000000 Q ss_pred ------------HHHHHhhhhhcc-------ccHHH---------HHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhh Q lcl|Aclame:pro 53 ------------ERVSSLPKSAQS-------LSANQ---------RSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNH 104 (381) Q Consensus 53 ------------~~~~~~~~~~~~-------lt~~e---------~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~ 104 (381) .+.....+..+. ..... 
....+.+..++++.||++||+++.++|++.+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~ 160 (435) T protein:vir:80 81 AAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS 160 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhc Confidence 000000000000 00000 01112345677788999999999999999999999 Q ss_pred hhhhh-ceeeec-CCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChh--HHHHHHH Q lcl|Aclame:pro 105 PLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVR 180 (381) Q Consensus 105 ~l~~~-~~v~~~-~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~l~~~i~ 180 (381) +++++ |+++++ ++..++|+.++.+.+.|++|++..+ +++++|++|++.+|+++++++||+|||+||.+ ++++||. T Consensus 161 ~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~-~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~ 239 (435) T protein:vir:80 161 VVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIP-TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVV 239 (435) T ss_pred hhhhccceeeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHH Confidence 99998 788876 5679999999999999999987655 57899999999999999999999999999954 7999999 Q ss_pred HHHHHHHHHHHhhheeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 181 VQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV 259 (381) Q Consensus 181 ~~la~a~a~~~d~a~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~ 259 (381) ++++++++++++.+|++|+|++ +|.||++........... . ..........+..++..+ .. .. 
T Consensus 240 ~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~-------~----~~~~~~~~~d~~~~~~~~---~~--~~ 303 (435) T protein:vir:80 240 GDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITAS-------D----GSTLQKIETDLGKAILAL---EN--AD 303 (435) T ss_pred HHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecc-------c----ccchhhHHHHHHHHHHHh---hc--cc Confidence 9999999999999999999975 899998754322211110 0 001111111122222111 11 12 Q ss_pred cccCceEEEEchhhHHHHHhhhhccCCCCceeec-----cCCCceEEecCCCCCc--------cEEEEeccceEEEecce Q lcl|Aclame:pro 260 AVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGG 326 (381) Q Consensus 260 ~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~p~~--------~i~~gd~s~y~i~~r~~ 326 (381) .++.+++|+|||.++..++.+ ++.+|+|++. ..+|+||+.+++||.+ .++||||++|+|++|++ T Consensus 304 ~~~~~~~~vmn~~~~~~L~~l---kd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~ 380 (435) T protein:vir:80 304 ANLTQPGWIMAPRTFRFLEGL---RDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEET 380 (435) T ss_pred cccccCEEEEcHHHHHHHHhh---hccCCceeccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecc Confidence 355678999999999888765 5778999863 3479999999999863 48999999999999999 Q ss_pred eeEeeehhh-----------hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 327 INVQKFKET-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 327 ~~i~~~~~~-----------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) ++|+++++. +|.+|+++||+..|+|+++++++||++++=---+. 
T Consensus 381 ~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 381 LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred eEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 999999985 48899999999999999999999999865433332 No 25 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=5e-57 Score=329.23 Aligned_cols=337 Identities=12% Similarity=0.040 Sum_probs=229.3 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHHH-----HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhhhhhccccHHHHHHH Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQE-----RQNELYGDMINQLFEETKL----QAKAEAERVSSLPKSAQSLSANQRSFF 73 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~lt~~e~~~~ 73 (381) ||.-++++++.+.+.++.+..+.+. ++.+............... ....++.+................... T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 80 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLL 80 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHH Confidence 8777777777766655443222111 1111111000000000000 001111111111111111111122334 Q ss_pred HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEE-ecCCcceEEecccccccccccccccceec Q lcl|Aclame:pro 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLK-SETSGVAVWGKIYGEIKGQLDAAFSEETA 152 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~-~~~~~~a~w~~e~~~~~~~~~~~f~~v~l 152 (381) +++..+++++|||+||+++.++|++.++++++||++|+++++++ ..+|+ ..+.+.+.|++|++.. 
++++++|+++++ T Consensus 81 ~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~-~~~p~~~~~~~~a~~v~E~~~~-~~~~~~f~~v~~ 158 (352) T protein:vir:78 81 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG-LEIPRVSYTLDDDDFITDVETA-KELKLKGDTVKF 158 (352) T ss_pred HHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC-ceEEEEecCCCccccccccccc-ccccccceeeee Confidence 57788899999999999999999999999999999999999876 45666 4455789999997765 457899999999 Q ss_pred cceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhh-heeeccCCCcceeeeeccccccccccccccccchhhhc Q lcl|Aclame:pro 153 IQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET-AFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTL 231 (381) Q Consensus 153 ~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~-a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~ 231 (381) .+|+++++++||+|||+||.+|+++||.++|+++++++++. +|.+|+|+++|.|+++...... + T Consensus 159 ~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~---------------~ 223 (352) T protein:vir:78 159 TTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKE---------------V 223 (352) T ss_pred cceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceecccccc---------------c Confidence 99999999999999999999999999999999999998655 7789999999999986422110 0 Q ss_pred cccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec----cCCCceEEecCCCC Q lcl|Aclame:pro 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----LPFNLNVIESTVQE 307 (381) Q Consensus 232 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~----l~~g~~vi~s~~~p 307 (381) +.. ++++.+.+++.. .+..|++|++|+||+.+++.++.+.. + +|.|++. 
..+|+||+++++++ T Consensus 224 t~~---~~~d~i~~~~~~-------l~~~~~~~a~~~mn~~t~~~l~~~~~--~-~~~~~~~~~~~~llG~PV~~~~~~~ 290 (352) T protein:vir:78 224 EGA---NMYDAIINALAD-------LHEDYRDNATIYMRYADYVKIISVLS--N-GTTNFFDTPAEKVFGKPVVFTDAAV 290 (352) T ss_pred ccc---chHHHHHHHHhc-------cChhhhcCCEEEEehHHHHHHHHHHh--c-cCCcccccCCccccccceEEecCCC Confidence 111 112333333322 24568889999999999988877643 2 3444432 23699999999886 Q ss_pred CccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCC Q lcl|Aclame:pro 308 AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 308 ~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~ 374 (381) +++||||++|++. +.++.++++.+. ..++++|++..|+||++++++||++++++-++..... T Consensus 291 --~~~~Gdf~~~~~~-~~~~~~~~~~~~--~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 291 --KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred --ceeEeehhhhhhh-hhhheeeeeccc--cCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 5899999998764 456777777663 4799999999999999999999999877764422211 No 26 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=3.1e-56 Score=324.93 Aligned_cols=327 Identities=13% Similarity=0.064 Sum_probs=237.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh------hccccHHHHH- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQ--ERQNELYGDMINQLFEETKLQAKAEAERVSSLPKS------AQSLSANQRS- 71 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~lt~~e~~- 71 (381) |+-++++ +.|+++++.++++....+ .++.+....+++.+........... +........ 
......++++ T Consensus 1 M~k~l~~-l~e~~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (371) T protein:vir:81 1 MPKELRE-LLEQINNKKEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELY-EEQKQTIEDKEPLKPTVQVKENEVEA 78 (371) T ss_pred CcHHHHH-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhccccccccchhhHHHHHHH Confidence 8876654 444444444444332211 1122222233333322221111100 000000000 0001111222 Q ss_pred --------HHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEecCCcceEEeccccccc Q lcl|Aclame:pro 72 --------FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSETSGVAVWGKIYGEIK 140 (381) Q Consensus 72 --------~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~~~~~~a~w~~e~~~~~ 140 (381) ..+++..+++++||++||+++...|++.+++.++|+++++++++++ ...+++..+.+.+.|++|+++.+ T Consensus 79 ~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~ 158 (371) T protein:vir:81 79 FVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIG 158 (371) T ss_pred HHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccc Confidence 2245677888999999999999999999999999999999998763 34567777778899999988776 Q ss_pred ccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccc Q lcl|Aclame:pro 141 GQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEG 220 (381) Q Consensus 141 ~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~ 220 (381) ..++++|+++++.+||++++++||+|||+||.++|++||.+.+++++++++|.+|++|+|+++|.|+.+. 
T Consensus 159 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~---------- 228 (371) T protein:vir:81 159 EKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADL---------- 228 (371) T ss_pred cccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccH---------- Confidence 6678999999999999999999999999999999999999999999999999999999999999887421 Q ss_pred cccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec------- Q lcl|Aclame:pro 221 AYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA------- 293 (381) Q Consensus 221 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~------- 293 (381) .. +...+. ...+..|+.+++|+|||.++..++.+ ++++|+|+|. T Consensus 229 ----------------~~----i~~~~~------~~l~~~~~~~a~~vmn~~~~~~L~~l---kd~~g~~l~~~~~~~~~ 279 (371) T protein:vir:81 229 ----------------DG----LKQIIN------VQLDPVFRSTSSVIVNQDAFNWLDTL---KDQNGQYLLQPSISSPT 279 (371) T ss_pred ----------------HH----HHHHHH------hhcchhhhcCCEEEEcHHHHHHHHHh---hccCCCeeeecccCCCC Confidence 00 000000 01133577889999999999888775 5678999874 Q ss_pred --cCCCceEEecCCCCCc------------cEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEEEEEEEEcCEEec Q lcl|Aclame:pro 294 --LPFNLNVIESTVQEAG------------KVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKD 356 (381) Q Consensus 294 --l~~g~~vi~s~~~p~~------------~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~~~~~r~dgk~~~ 356 (381) ..+|+||+.+++||.+ .++||||++ |.+++|.+++++++++. 
+|.+|+++||+.+|+||++++ T Consensus 280 ~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~ 359 (371) T protein:vir:81 280 GRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRD 359 (371) T ss_pred CceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEec Confidence 2479999999998732 479999998 67899999999998876 688999999999999999999 Q ss_pred CcceEEEEEEec Q lcl|Aclame:pro 357 NKVAAVWKLDLK 368 (381) Q Consensus 357 ~~Af~v~~l~~~ 368 (381) ++||++++++.+ T Consensus 360 ~~a~~~~~~~~A 371 (371) T protein:vir:81 360 DEAFVFGEVQLA 371 (371) T ss_pred ccceEEEEEecC Confidence 999999998887 No 27 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=8.5e-56 Score=322.50 Aligned_cols=339 Identities=13% Similarity=0.109 Sum_probs=234.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhH--------HHHHHHHHHH---HHHHHHHHHHH-HHH-HHHH-------------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEP--------QERQNELYGD---MINQLFEETKL-QAK-AEAE-------------- 53 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~--------~~~~~~~~~~---~~~~~~~~~~~-~~~-~~~~-------------- 53 (381) |. ++.+++.++++++.+.++.... .++..+.+.+ .++.+...... 
+.+ .+.+ T Consensus 1 m~-~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:97 1 MT-DITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred Ch-HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccch Confidence 22 3444455555554444332111 1111111211 11111111110 000 0000 Q ss_pred ---------HHHH-h-hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEE Q lcl|Aclame:pro 54 ---------RVSS-L-PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKF 121 (381) Q Consensus 54 ---------~~~~-~-~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~i 121 (381) +... . .+.......+.+...+....+++.+||++||++++..|++.+++.++|+++|++.++++ ..++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:97 80 DMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEE Confidence 0000 0 00000111122223344556677889999999999999999999999999999999865 4789 Q ss_pred EEecC-CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|Aclame:pro 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G 200 (381) |+.++ .+.+.|++|+++.+ +++++|+++++.+|+++++++||+|||+|+. 
++++||.+++++++++++|.+|++|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~~~d~a~l~G~g 237 (390) T protein:vir:97 160 VQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCcccc-ccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 98755 46899999987754 6789999999999999999999999999985 899999999999999999999999999 Q ss_pred CC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 201 KD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) ++ +|.||++........... + .......+.+++..+ ...+..+.+|+|||+++..+++ T Consensus 238 ~~~~p~Gi~~~~~~~~~~~~~-----------~---~~~~~d~~~~~~~~~-------~~~~~~~~~~v~n~~~~~~L~~ 296 (390) T protein:vir:97 238 ANDGLLGLIPQATTYAAPTTI-----------A---GATRVDQLRLAMLQA-------SLAEYPASGIVINPIDWAAIEL 296 (390) T ss_pred CCccccceeeccccccccccc-----------c---ccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH Confidence 77 499999754322211110 0 011111222222111 2345567789999999888886 Q ss_pred hhhccCCCCceeecc--------CCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehh-hhhhcCceEEEEEEE Q lcl|Aclame:pro 280 QYTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQF 349 (381) Q Consensus 280 ~~~~~~~~G~~~~~l--------~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~-~~~~~d~~~~~~~~r 349 (381) + ++++|+|++.. 
.+|+||+++++||+++++||||++ |.+++|+++++..+++ .+|.+|+++||+.+| T Consensus 297 l---kd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r 373 (390) T protein:vir:97 297 A---KDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEER 373 (390) T ss_pred h---hcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEe Confidence 5 57789888742 379999999999999999999997 8889999999999875 689999999999999 Q ss_pred EcCEEecCcceEEEEEE Q lcl|Aclame:pro 350 AYGKAKDNKVAAVWKLD 366 (381) Q Consensus 350 ~dgk~~~~~Af~v~~l~ 366 (381) +||++++++||+++++- T Consensus 374 ~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 374 LALVVYRPEALITGSFA 390 (390) T ss_pred eccEEeccccEEEEEeC Confidence 99999999999998777 No 28 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=8.3e-57 Score=328.03 Aligned_cols=339 Identities=12% Similarity=0.081 Sum_probs=222.2 Q ss_pred CC--ccHHHHHHHHHHHHHH---HHhhh----hHHHHHHHHHHHH-------HHHHHHHHH---HHHHH----------- Q lcl|Aclame:pro 1 MT--INLSETFANAKNEFIN---AVNNG----EPQERQNELYGDM-------INQLFEETK---LQAKA----------- 50 (381) Q Consensus 1 m~--~~l~~~~~e~~~~~~~---~~~~~----~~~~~~~~~~~~~-------~~~~~~~~~---~~~~~----------- 50 (381) |. ++|++++.+..+++.+ ++++. ....++.+.+.+. .+.+..... .+.+. 
T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 95 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 95 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 76 3666555554433322 22111 0000111111111 111111100 00000 Q ss_pred ------------HHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc Q lcl|Aclame:pro 51 ------------EAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 51 ------------~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~ 118 (381) ++.+................+..+++.++++++|||+||+++.++|++.++++++||++|+++++++ T Consensus 96 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~- 174 (402) T protein:vir:93 96 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 174 (402) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC- Confidence 0000000000000000111223356677889999999999999999999999999999999999875 Q ss_pred eEEEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh-hhee Q lcl|Aclame:pro 119 LKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d-~a~l 196 (381) ..+|+. .+.+++.|++|++.. ++++|+|+++++.+|+++++++||+|||+||.+|+++||.++|+++++++++ .+|. 
T Consensus 175 ~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 253 (402) T protein:vir:93 175 LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 253 (402) T ss_pred ceeeeeeccCCccccccccccc-cccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 567764 456779999997765 4578999999999999999999999999999999999999999999999975 5678 Q ss_pred eccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) +|+|+++|.|+++...... + + ...+.+.+.+++.. .+..|+.|++|+||+.+++. T Consensus 254 ~g~g~g~p~g~~~~~~~~~-~--------------~---~~~~~d~l~~~~~~-------l~~~y~~na~~imn~~t~~~ 308 (402) T protein:vir:93 254 VSPKSGLEHMSFYNGSVKE-V--------------E---GADMYDAIINALAD-------LHEDYRDNATIYMRYADYVK 308 (402) T ss_pred cCCCccccceeeecccccc-c--------------c---ccchHHHHHHHHhc-------cChhhhcCCEEEEechHHHH Confidence 9999999999986422111 0 0 01112223333321 24568889999999999988 Q ss_pred HHhhhhccCCCCceeec---cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk 353 (381) ++.... +.+|.+.+. ..+|+||+++++++ +++||||++|++. +.++.++.+++. 
..++++|++..|+||+ T Consensus 309 ~~~~~~--d~~~~~~~~~~~~llG~PV~~t~~~~--~i~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 381 (402) T protein:vir:93 309 IISVLS--NGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 381 (402) T ss_pred HHHHHh--cCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhhhhhhcc--cCCceEEEEEEEeCcE Confidence 776543 333433322 23799999999886 5899999996653 334566666654 3599999999999999 Q ss_pred EecCcceEEEEEEec-ccccC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLK-GHKPA 373 (381) Q Consensus 354 ~~~~~Af~v~~l~~~-~~~~~ 373 (381) +++++||++++++-. +++|+ T Consensus 382 v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 382 RTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred EechhheEEEEeecCCCCCCC Confidence 999999999988653 34444 No 29 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=5.8e-56 Score=323.39 Aligned_cols=339 Identities=12% Similarity=0.100 Sum_probs=230.1 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhh--------HHHHHHHHHHHH---HHHHHHHHHH-HHHHHHHH-------------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGE--------PQERQNELYGDM---INQLFEETKL-QAKAEAER-------------- 54 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~--------~~~~~~~~~~~~---~~~~~~~~~~-~~~~~~~~-------------- 54 (381) |+ |+.++++++++++.+.++... ..++..+.+..+ .+.+...... +.+.+... 
T Consensus 1 m~-e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:10 1 MT-DITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred Ch-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchh Confidence 43 333344444444443332111 111111222211 1111111110 00000000 Q ss_pred ----------HHHhh-hhh-ccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eEE Q lcl|Aclame:pro 55 ----------VSSLP-KSA-QSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) Q Consensus 55 ----------~~~~~-~~~-~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~-~~i 121 (381) ..... ... .....+.+.+.+....++...+|.++|+++..+|++.+++.++|+++|+++++++. .++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:10 80 DLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 00000 000 00011111122223344555667788888999999999999999999999998664 789 Q ss_pred EEecC-CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|Aclame:pro 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G 200 (381) |+.++ .+.+.|++|+++. ++++++|+++++.+|+++++++||++||+|+. 
++++||.++|++++++++|.+||+|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G 237 (390) T protein:vir:10 160 VQETGFVNNAAIVAEGALK-PESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCccc-cccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 98765 4679999998775 46789999999999999999999999999986 899999999999999999999999999 Q ss_pred CC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 201 KD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) ++ +|.||++.......... ... ......+.+++..+ ...+..+.+|+|||+++..++. T Consensus 238 ~~~~p~Gi~~~~~~~~~~~~-----------~~~---~~~~~~~~~~~~~l-------~~~~~~~~~~v~n~~~~~~L~~ 296 (390) T protein:vir:10 238 ANDGLLGLIPQATTYAAPTT-----------IAG---ATRVDQLRLAMLQA-------SLAEYPASGIVINPIDWAAIEL 296 (390) T ss_pred CCcccccccccccccccccc-----------ccc---cchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH Confidence 77 59999975322111110 000 11112222222211 2345677899999999988876 Q ss_pred hhhccCCCCceeecc--------CCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehh-hhhhcCceEEEEEEE Q lcl|Aclame:pro 280 QYTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQF 349 (381) Q Consensus 280 ~~~~~~~~G~~~~~l--------~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~-~~~~~d~~~~~~~~r 349 (381) + ++++|+|+|.. 
.+|+||+++++||+++++||||++ |.+++|++++++.+++ .+|.+|++.||+.+| T Consensus 297 l---kd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r 373 (390) T protein:vir:10 297 A---KDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEER 373 (390) T ss_pred h---hcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEe Confidence 5 57789998752 379999999999999999999998 7789999999999886 689999999999999 Q ss_pred EcCEEecCcceEEEEEE Q lcl|Aclame:pro 350 AYGKAKDNKVAAVWKLD 366 (381) Q Consensus 350 ~dgk~~~~~Af~v~~l~ 366 (381) +||++++++||+++++- T Consensus 374 ~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 374 LALVVYRPEALISGSFA 390 (390) T ss_pred eccEEeccccEEEEEeC Confidence 99999999999998776 No 30 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=6.2e-56 Score=323.27 Aligned_cols=339 Identities=13% Similarity=0.106 Sum_probs=234.4 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhh--------HHHHHHHHHHH---HHHHHHHHHHH-HHH-HHHH-------------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGE--------PQERQNELYGD---MINQLFEETKL-QAK-AEAE-------------- 53 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~--------~~~~~~~~~~~---~~~~~~~~~~~-~~~-~~~~-------------- 53 (381) |+ ++.+++.++++++.+.++... ..++..+.+.+ ..+.+...... 
+.+ .+.+ T Consensus 1 m~-~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:81 1 MT-DITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred Ch-HHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccch Confidence 32 344444444444443332211 00111222221 11122111100 000 0000 Q ss_pred ---------HHHHhh--hhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEE Q lcl|Aclame:pro 54 ---------RVSSLP--KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKF 121 (381) Q Consensus 54 ---------~~~~~~--~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~i 121 (381) +..... ........+.+.+.+....++++++|++||+++...|++.+++.++|+++|+++++++ ..++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:81 80 DMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE Confidence 000000 0000111222333344455667789999999999999999999999999999999865 4789 Q ss_pred EEecC-CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|Aclame:pro 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G 200 (381) |+.++ .+.+.|++|+++.+ +++++|+++++.+|+++++++||++||+|+. 
++++||.++|++++++++|.+|++|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~~~~d~a~l~G~g 237 (390) T protein:vir:81 160 VQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCcccc-cccceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 98765 46899999987754 6789999999999999999999999999985 799999999999999999999999999 Q ss_pred CCc-ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 201 KDQ-PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~q-P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) +++ |.||++........... + .......+.+++..+ ...+..+.+|+|||+++..+++ T Consensus 238 ~~~~~~Gi~~~~~~~~~~~~~-----------~---~~~~~~~~~~~~~~~-------~~~~~~~~~~v~~~~~~~~l~~ 296 (390) T protein:vir:81 238 ANDGLLGLIPQATTYAAPTTI-----------A---GATRVDQLRLAMLQA-------SLAEYNPSGIVINPIDWAAIEL 296 (390) T ss_pred CCCcccceeeccccccccccc-----------c---cchhHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH Confidence 875 99999753321111100 0 011112222222221 2335566789999999988887 Q ss_pred hhhccCCCCceeecc--------CCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehh-hhhhcCceEEEEEEE Q lcl|Aclame:pro 280 QYTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQF 349 (381) Q Consensus 280 ~~~~~~~~G~~~~~l--------~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~-~~~~~d~~~~~~~~r 349 (381) + ++++|+|+|.. 
.+|+||+.+++||+++++||||++ |.+++|++++|+.+++ .+|.+|++.||+.+| T Consensus 297 l---kd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r 373 (390) T protein:vir:81 297 A---KDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEER 373 (390) T ss_pred h---hcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEEEe Confidence 6 57789998742 379999999999999999999998 8899999999999876 689999999999999 Q ss_pred EcCEEecCcceEEEEEE Q lcl|Aclame:pro 350 AYGKAKDNKVAAVWKLD 366 (381) Q Consensus 350 ~dgk~~~~~Af~v~~l~ 366 (381) +||++++++||+++++- T Consensus 374 ~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 374 LALVVYRPEALISGSFA 390 (390) T ss_pred eccEEecccceEEEEeC Confidence 99999999999998776 No 31 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=5.4e-56 Score=323.57 Aligned_cols=339 Identities=12% Similarity=0.069 Sum_probs=224.8 Q ss_pred CC--ccHHHHHHHHHHHHHH---HHhh----hhHHHHHHH-------HHHHHHHHHHHHHH---HHHHH----------- Q lcl|Aclame:pro 1 MT--INLSETFANAKNEFIN---AVNN----GEPQERQNE-------LYGDMINQLFEETK---LQAKA----------- 50 (381) Q Consensus 1 m~--~~l~~~~~e~~~~~~~---~~~~----~~~~~~~~~-------~~~~~~~~~~~~~~---~~~~~----------- 50 (381) |. +++++++.+.++++.+ .++. .....++.+ .+....+.+..... .+.+. 
T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 66 3555555554443322 2211 110001111 11111111111000 00000 Q ss_pred ------------HHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc Q lcl|Aclame:pro 51 ------------EAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 51 ------------~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~ 118 (381) ++.+...............++..+++.++++++|||+||+++.++|++.++++++|+++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~- 159 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC- Confidence 0000000000000111112223456778889999999999999999999999999999999999875 Q ss_pred eEEEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhh-hee Q lcl|Aclame:pro 119 LKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET-AFL 196 (381) Q Consensus 119 ~~ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~-a~l 196 (381) ..+|+. .+.+.+.|++|++.. ++++|+|+++++.+|+++++++||+|||+||.+|+++||.++++++++++++. +|. 
T Consensus 160 ~~~p~~~~~~~~a~~v~E~~~~-~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:93 160 LEIPRVSYTLDDDDFITDVETA-KELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceEEEEeecCCccccccCcccc-cccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 567764 456779999997765 45789999999999999999999999999999999999999999999999764 678 Q ss_pred eccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) +|+|+++|.|++...... .+ + ...+.+.+.+++.. .+..|+++++|+||+.+++. T Consensus 239 ~g~g~g~p~g~l~~~~~~-~v--------------~---~~~~~d~i~~~~~~-------l~~~~~~~a~~~mn~~t~~~ 293 (387) T protein:vir:93 239 VSPKSGLDHMSFYNGSVK-EV--------------E---GADMYDAIINALAD-------LHEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeeccccc-cc--------------c---ccchHHHHHHHHhc-------cChhhhcCCEEEEechHHHH Confidence 999999999998642111 00 0 01112223333322 24568889999999999887 Q ss_pred HHhhhhccCCCCceeecc---CCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTAL---PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~l---~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk 353 (381) ++..+ .+.+|.|.+.. .+|+||+++++++ +++||||++|++. 
+.++.+.++.+ +..++++|++..|+||+ T Consensus 294 ~~~~~--~d~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~~r~d~~ 366 (387) T protein:vir:93 294 IISVL--SNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKD--VKKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHHH--hcCCCcccccCCccccccceEEecCCC--ceeeeehhhhhee-hhhheeeeccc--ccCCceeEEEEeeeCce Confidence 76654 34455554432 3799999999886 5899999998764 55677776655 56899999999999999 Q ss_pred EecCcceEEEEEEecc-cccC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLKG-HKPA 373 (381) Q Consensus 354 ~~~~~Af~v~~l~~~~-~~~~ 373 (381) +++++||++++++-++ ++|+ T Consensus 367 v~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 367 RTLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 9999999998886543 3333 No 32 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=2.1e-55 Score=320.31 Aligned_cols=339 Identities=11% Similarity=-0.018 Sum_probs=234.3 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHH--HHHH---Hhhhhhc--- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQE-----------RQNELYGDMINQLFEETKLQAKAEA--ERVS---SLPKSAQ--- 63 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~---~~~~~~~--- 63 (381) |+..++++++++++.+.++...++. ++.+.+.+.++.+..+......... +... 
...+..+ T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 5555555555555444443221110 1111111111111111110000000 0000 0000000 Q ss_pred ----cccHHHHHHH------------HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC---CceEEEEe Q lcl|Aclame:pro 64 ----SLSANQRSFF------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKS 124 (381) Q Consensus 64 ----~lt~~e~~~~------------~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~---~~~~ip~~ 124 (381) .....+++.+ ..+..+++++||++||+++...|++.+++.++|+++|++++++ +...+|+. T Consensus 81 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:49 81 KSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred cchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEee Confidence 1112222222 2345577789999999999999999999999999999998874 34566664 Q ss_pred c-CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|Aclame:pro 125 E-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 125 ~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q 203 (381) . 
..+.+.|++|+++.+..++++|+++++.+|+++++++||+|||+||.+++++||.+++++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~ 240 (397) T protein:vir:49 161 TDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALP 240 (397) T ss_pred ccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 4 457799999988877667899999999999999999999999999999999999999999999999999999999887 Q ss_pred ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) |.+.... ... +.++...+ ...|..+++|+|||.++..++.+ T Consensus 241 ~~~~~~~-------------------------~d~----i~~~~~~l-------~~~~~~~a~~vmn~~~~~~l~~l--- 281 (397) T protein:vir:49 241 TKPTLTK-------------------------WDD----IIDLEAKV-------DPAIKQTSFFLTNTSGFTALKKV--- 281 (397) T ss_pred ccccccc-------------------------HHH----HHHHHHhh-------hhhhcCCCEEEEcHHHHHHHHHh--- Confidence 6543210 111 11222111 23467789999999999888876 Q ss_pred cCCCCceeec---------cCCCceEEecC--CCCC-----ccEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEE Q lcl|Aclame:pro 284 LNANGVYVTA---------LPFNLNVIEST--VQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLY 344 (381) Q Consensus 284 ~~~~G~~~~~---------l~~g~~vi~s~--~~p~-----~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~ 344 (381) ++++|+|++. ..+|+||++++ .+|. ..++||||++ |.+++|++++++++++. 
+|.+|++.| T Consensus 282 kd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 282 KNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred hcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeE Confidence 5678999874 34799998743 3443 3489999997 77899999999998864 799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCCCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~ 380 (381) |+..|+||++.+++||++++++-++.++.+.+.+-- T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 362 RVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred EEEeeeCcEEecccceEEEEeecccCCCCCcccccC Confidence 999999999999999999988887766665555544 No 33 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=4.3e-56 Score=324.15 Aligned_cols=339 Identities=12% Similarity=0.073 Sum_probs=222.4 Q ss_pred CC--ccHHHHHHHHHHHHHHH---Hhhh----hHHHHHHHHHHH-------HHHHHHHHHHH---HHHH----------- Q lcl|Aclame:pro 1 MT--INLSETFANAKNEFINA---VNNG----EPQERQNELYGD-------MINQLFEETKL---QAKA----------- 50 (381) Q Consensus 1 m~--~~l~~~~~e~~~~~~~~---~~~~----~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~----------- 50 (381) |+ .++++++.+..+++.+. +.+. ....++.+...+ ..+.+...... +.+. 
T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 66 25555555544443222 1110 000011111111 11111111000 0000 Q ss_pred ------------HHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc Q lcl|Aclame:pro 51 ------------EAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 51 ------------~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~ 118 (381) ++.+................+..+++.++++++|||+||+++.++|++.++++++||++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~- 159 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC- Confidence 0000000000000001111233456677889999999999999999999999999999999999875 Q ss_pred eEEEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh-hhee Q lcl|Aclame:pro 119 LKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d-~a~l 196 (381) ..+|+. .+.+++.|++|++.. ++++|+|+++++.+|+++++++||+|||+||.+|+++||+++|+++++++++ .+|. 
T Consensus 160 ~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:94 160 LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceeeeeeccCCccccccccccc-cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 567764 455779999997765 4578999999999999999999999999999999999999999999999965 5678 Q ss_pred eccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) +|+|+++|.|++....... + + .....+.+.+++.. .+..|++|++|+||+.+++. T Consensus 239 ~g~g~g~~~g~~~~~~~~~-~--------------~---~~~~~d~i~~~~~~-------l~~~y~~na~~imn~~t~~~ 293 (387) T protein:vir:94 239 VSPKSGLEHMSFYNGSVKE-V--------------E---GADMYDAIINALAD-------LHEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeecccccc-c--------------c---ccchHHHHHHHHhc-------cChhhhcCCEEEEechHHHH Confidence 9999999999986421110 0 0 00112223333221 24568889999999999888 Q ss_pred HHhhhhccCCCCceeec---cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk 353 (381) ++..+. +.+|.+.+. ..+|+||+++++++ +++||||++|++. +.++.+.++++. 
..|+++|+++.|+||+ T Consensus 294 ~~~~~~--~~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:94 294 IISVLS--NGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHHHh--cCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEEEEEEEeCcE Confidence 776543 333333221 23799999999886 5899999997664 456777776654 4699999999999999 Q ss_pred EecCcceEEEEEEecc-cccC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLKG-HKPA 373 (381) Q Consensus 354 ~~~~~Af~v~~l~~~~-~~~~ 373 (381) +++++||++++++-.+ ++|+ T Consensus 367 v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 367 RTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 9999999998886533 3333 No 34 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=4.3e-56 Score=324.15 Aligned_cols=339 Identities=12% Similarity=0.073 Sum_probs=222.4 Q ss_pred CC--ccHHHHHHHHHHHHHHH---Hhhh----hHHHHHHHHHHH-------HHHHHHHHHHH---HHHH----------- Q lcl|Aclame:pro 1 MT--INLSETFANAKNEFINA---VNNG----EPQERQNELYGD-------MINQLFEETKL---QAKA----------- 50 (381) Q Consensus 1 m~--~~l~~~~~e~~~~~~~~---~~~~----~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~----------- 50 (381) |+ .++++++.+..+++.+. +.+. ....++.+...+ ..+.+...... +.+. 
T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 66 25555555544443222 1110 000011111111 11111111000 0000 Q ss_pred ------------HHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc Q lcl|Aclame:pro 51 ------------EAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 51 ------------~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~ 118 (381) ++.+................+..+++.++++++|||+||+++.++|++.++++++||++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~- 159 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC- Confidence 0000000000000001111233456677889999999999999999999999999999999999875 Q ss_pred eEEEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh-hhee Q lcl|Aclame:pro 119 LKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d-~a~l 196 (381) ..+|+. .+.+++.|++|++.. ++++|+|+++++.+|+++++++||+|||+||.+|+++||+++|+++++++++ .+|. 
T Consensus 160 ~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:96 160 LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceeeeeeccCCccccccccccc-cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 567764 455779999997765 4578999999999999999999999999999999999999999999999965 5678 Q ss_pred eccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) +|+|+++|.|++....... + + .....+.+.+++.. .+..|++|++|+||+.+++. T Consensus 239 ~g~g~g~~~g~~~~~~~~~-~--------------~---~~~~~d~i~~~~~~-------l~~~y~~na~~imn~~t~~~ 293 (387) T protein:vir:96 239 VSPKSGLEHMSFYNGSVKE-V--------------E---GADMYDAIINALAD-------LHEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeecccccc-c--------------c---ccchHHHHHHHHhc-------cChhhhcCCEEEEechHHHH Confidence 9999999999986421110 0 0 00112223333221 24568889999999999888 Q ss_pred HHhhhhccCCCCceeec---cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk 353 (381) ++..+. +.+|.+.+. ..+|+||+++++++ +++||||++|++. +.++.+.++++. 
..|+++|+++.|+||+ T Consensus 294 ~~~~~~--~~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:96 294 IISVLS--NGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHHHh--cCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEEEEEEEeCcE Confidence 776543 333333221 23799999999886 5899999997664 456777776654 4699999999999999 Q ss_pred EecCcceEEEEEEecc-cccC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLKG-HKPA 373 (381) Q Consensus 354 ~~~~~Af~v~~l~~~~-~~~~ 373 (381) +++++||++++++-.+ ++|+ T Consensus 367 v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 367 RTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 9999999998886533 3333 No 35 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=4.3e-56 Score=324.15 Aligned_cols=339 Identities=12% Similarity=0.073 Sum_probs=222.4 Q ss_pred CC--ccHHHHHHHHHHHHHHH---Hhhh----hHHHHHHHHHHH-------HHHHHHHHHHH---HHHH----------- Q lcl|Aclame:pro 1 MT--INLSETFANAKNEFINA---VNNG----EPQERQNELYGD-------MINQLFEETKL---QAKA----------- 50 (381) Q Consensus 1 m~--~~l~~~~~e~~~~~~~~---~~~~----~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~----------- 50 (381) |+ .++++++.+..+++.+. +.+. ....++.+...+ ..+.+...... +.+. 
T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 66 25555555544443222 1110 000011111111 11111111000 0000 Q ss_pred ------------HHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc Q lcl|Aclame:pro 51 ------------EAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 51 ------------~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~ 118 (381) ++.+................+..+++.++++++|||+||+++.++|++.++++++||++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~- 159 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC- Confidence 0000000000000001111233456677889999999999999999999999999999999999875 Q ss_pred eEEEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh-hhee Q lcl|Aclame:pro 119 LKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d-~a~l 196 (381) ..+|+. .+.+++.|++|++.. ++++|+|+++++.+|+++++++||+|||+||.+|+++||+++|+++++++++ .+|. 
T Consensus 160 ~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:26 160 LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceeeeeeccCCccccccccccc-cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 567764 455779999997765 4578999999999999999999999999999999999999999999999965 5678 Q ss_pred eccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) +|+|+++|.|++....... + + .....+.+.+++.. .+..|++|++|+||+.+++. T Consensus 239 ~g~g~g~~~g~~~~~~~~~-~--------------~---~~~~~d~i~~~~~~-------l~~~y~~na~~imn~~t~~~ 293 (387) T protein:vir:26 239 VSPKSGLEHMSFYNGSVKE-V--------------E---GADMYDAIINALAD-------LHEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeecccccc-c--------------c---ccchHHHHHHHHhc-------cChhhhcCCEEEEechHHHH Confidence 9999999999986421110 0 0 00112223333221 24568889999999999888 Q ss_pred HHhhhhccCCCCceeec---cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk 353 (381) ++..+. +.+|.+.+. ..+|+||+++++++ +++||||++|++. +.++.+.++++. 
..|+++|+++.|+||+ T Consensus 294 ~~~~~~--~~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:26 294 IISVLS--NGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHHHh--cCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEEEEEEEeCcE Confidence 776543 333333221 23799999999886 5899999997664 456777776654 4699999999999999 Q ss_pred EecCcceEEEEEEecc-cccC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLKG-HKPA 373 (381) Q Consensus 354 ~~~~~Af~v~~l~~~~-~~~~ 373 (381) +++++||++++++-.+ ++|+ T Consensus 367 v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 367 RTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 9999999998886533 3333 No 36 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=3.8e-55 Score=318.92 Aligned_cols=336 Identities=14% Similarity=0.062 Sum_probs=237.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHH--HHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQE--RQNELYGDMINQLFEETKLQ-----------------------AKAEAERV 55 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~ 55 (381) |+-+|+ ++.+++.++.++++....++ ++.+....+.+.+..+.... ...++++. 
T Consensus 1 M~k~l~-el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELR-ELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHH-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 886554 23333333333332221111 11111111111111111100 00111111 Q ss_pred HHhhhhhccccHHHHHHHH------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEecC Q lcl|Aclame:pro 56 SSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSET 126 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~~~ 126 (381) .......+.++.+++.+.. .+..+++++||++||+++...|++.+++.++|+++|+++++++ ...+|+.++ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 1111122334444444432 3455677889999999999999999999999999999998753 456788888 Q ss_pred CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCccee Q lcl|Aclame:pro 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~G 206 (381) .+.+.|++|+++.+..+.++|+++++.+|+++++++||+|||+||.++|++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88899999988876656799999999999999999999999999999999999999999999999999999999887655 Q ss_pred 
eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) ..+ ... +.+++.. .....|+++++|+|||+++..++++ +++ T Consensus 240 ~~~--------------------------~d~----i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~l---kd~ 280 (392) T protein:vir:10 240 IKS--------------------------LDD----IKDVLNV------KLDPAISPNAILLTNQDGFNYLDKL---KDK 280 (392) T ss_pred ccC--------------------------HHH----HHHHHHH------hhhhhhccCCEEEEcHHHHHHHHHh---hcc Confidence 421 001 1111100 1134577889999999999888876 677 Q ss_pred CCceeec---------cCCCceEEe-cCCC-C--------CccEEEEeccc-eEEEecceeeEeeehh--hhhhcCceEE Q lcl|Aclame:pro 287 NGVYVTA---------LPFNLNVIE-STVQ-E--------AGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLY 344 (381) Q Consensus 287 ~G~~~~~---------l~~g~~vi~-s~~~-p--------~~~i~~gd~s~-y~i~~r~~~~i~~~~~--~~~~~d~~~~ 344 (381) +|+|+|. ..+|+|++. 
++.+ | +..++||||++ |.+++|++++++++++ .+|.+|+++| T Consensus 281 ~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 281 DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 8999874 236776554 2222 1 12378999998 7789999999999875 4799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.+|+||++++++||+.++++.++++.+|+| T Consensus 361 r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 361 RAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEeeccEEecccceEEEEecccccccCCCC Confidence 99999999999999999999999999999999 No 37 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=3.8e-55 Score=318.92 Aligned_cols=336 Identities=14% Similarity=0.062 Sum_probs=237.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHH--HHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQE--RQNELYGDMINQLFEETKLQ-----------------------AKAEAERV 55 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~ 55 (381) |+-+|+ ++.+++.++.++++....++ ++.+....+.+.+..+.... ...++++. 
T Consensus 1 M~k~l~-el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELR-ELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHH-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 886554 23333333333332221111 11111111111111111100 00111111 Q ss_pred HHhhhhhccccHHHHHHHH------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEecC Q lcl|Aclame:pro 56 SSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSET 126 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~~~ 126 (381) .......+.++.+++.+.. .+..+++++||++||+++...|++.+++.++|+++|+++++++ ...+|+.++ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 1111122334444444432 3455677889999999999999999999999999999998753 456788888 Q ss_pred CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCccee Q lcl|Aclame:pro 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~G 206 (381) .+.+.|++|+++.+..+.++|+++++.+|+++++++||+|||+||.++|++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88899999988876656799999999999999999999999999999999999999999999999999999999887655 Q ss_pred 
eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) ..+ ... +.+++.. .....|+++++|+|||+++..++++ +++ T Consensus 240 ~~~--------------------------~d~----i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~l---kd~ 280 (392) T protein:vir:10 240 IKS--------------------------LDD----IKDVLNV------KLDPAISPNAILLTNQDGFNYLDKL---KDK 280 (392) T ss_pred ccC--------------------------HHH----HHHHHHH------hhhhhhccCCEEEEcHHHHHHHHHh---hcc Confidence 421 001 1111100 1134577889999999999888876 677 Q ss_pred CCceeec---------cCCCceEEe-cCCC-C--------CccEEEEeccc-eEEEecceeeEeeehh--hhhhcCceEE Q lcl|Aclame:pro 287 NGVYVTA---------LPFNLNVIE-STVQ-E--------AGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLY 344 (381) Q Consensus 287 ~G~~~~~---------l~~g~~vi~-s~~~-p--------~~~i~~gd~s~-y~i~~r~~~~i~~~~~--~~~~~d~~~~ 344 (381) +|+|+|. ..+|+|++. 
++.+ | +..++||||++ |.+++|++++++++++ .+|.+|+++| T Consensus 281 ~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 281 DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 8999874 236776554 2222 1 12378999998 7789999999999875 4799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.+|+||++++++||+.++++.++++.+|+| T Consensus 361 r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 361 RAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEeeccEEecccceEEEEecccccccCCCC Confidence 99999999999999999999999999999999 No 38 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=3.8e-55 Score=318.92 Aligned_cols=336 Identities=14% Similarity=0.062 Sum_probs=237.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHH--HHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQE--RQNELYGDMINQLFEETKLQ-----------------------AKAEAERV 55 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~ 55 (381) |+-+|+ ++.+++.++.++++....++ ++.+....+.+.+..+.... ...++++. 
T Consensus 1 M~k~l~-el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELR-ELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHH-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 886554 23333333333332221111 11111111111111111100 00111111 Q ss_pred HHhhhhhccccHHHHHHHH------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEecC Q lcl|Aclame:pro 56 SSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSET 126 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~~~ 126 (381) .......+.++.+++.+.. .+..+++++||++||+++...|++.+++.++|+++|+++++++ ...+|+.++ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 1111122334444444432 3455677889999999999999999999999999999998753 456788888 Q ss_pred CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCccee Q lcl|Aclame:pro 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~G 206 (381) .+.+.|++|+++.+..+.++|+++++.+|+++++++||+|||+||.++|++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88899999988876656799999999999999999999999999999999999999999999999999999999887655 Q ss_pred 
eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) ..+ ... +.+++.. .....|+++++|+|||+++..++++ +++ T Consensus 240 ~~~--------------------------~d~----i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~l---kd~ 280 (392) T protein:vir:10 240 IKS--------------------------LDD----IKDVLNV------KLDPAISPNAILLTNQDGFNYLDKL---KDK 280 (392) T ss_pred ccC--------------------------HHH----HHHHHHH------hhhhhhccCCEEEEcHHHHHHHHHh---hcc Confidence 421 001 1111100 1134577889999999999888876 677 Q ss_pred CCceeec---------cCCCceEEe-cCCC-C--------CccEEEEeccc-eEEEecceeeEeeehh--hhhhcCceEE Q lcl|Aclame:pro 287 NGVYVTA---------LPFNLNVIE-STVQ-E--------AGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLY 344 (381) Q Consensus 287 ~G~~~~~---------l~~g~~vi~-s~~~-p--------~~~i~~gd~s~-y~i~~r~~~~i~~~~~--~~~~~d~~~~ 344 (381) +|+|+|. ..+|+|++. 
++.+ | +..++||||++ |.+++|++++++++++ .+|.+|+++| T Consensus 281 ~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 281 DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 8999874 236776554 2222 1 12378999998 7789999999999875 4799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.+|+||++++++||+.++++.++++.+|+| T Consensus 361 r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 361 RAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEeeccEEecccceEEEEecccccccCCCC Confidence 99999999999999999999999999999999 No 39 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=3.8e-55 Score=318.92 Aligned_cols=336 Identities=14% Similarity=0.062 Sum_probs=237.7 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHH--HHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQE--RQNELYGDMINQLFEETKLQ-----------------------AKAEAERV 55 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~ 55 (381) |+-+|+ ++.+++.++.++++....++ ++.+....+.+.+..+.... ...++++. 
T Consensus 1 M~k~l~-el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELR-ELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHH-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 886554 23333333333332221111 11111111111111111100 00111111 Q ss_pred HHhhhhhccccHHHHHHHH------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEecC Q lcl|Aclame:pro 56 SSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSET 126 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~~~ 126 (381) .......+.++.+++.+.. .+..+++++||++||+++...|++.+++.++|+++|+++++++ ...+|+.++ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 1111122334444444432 3455677889999999999999999999999999999998753 456788888 Q ss_pred CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCccee Q lcl|Aclame:pro 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~G 206 (381) .+.+.|++|+++.+..+.++|+++++.+|+++++++||+|||+||.++|++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88899999988876656799999999999999999999999999999999999999999999999999999999887655 Q ss_pred 
eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) ..+ ... +.+++.. .....|+++++|+|||+++..++++ +++ T Consensus 240 ~~~--------------------------~d~----i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~l---kd~ 280 (392) T protein:vir:10 240 IKS--------------------------LDD----IKDVLNV------KLDPAISPNAILLTNQDGFNYLDKL---KDK 280 (392) T ss_pred ccC--------------------------HHH----HHHHHHH------hhhhhhccCCEEEEcHHHHHHHHHh---hcc Confidence 421 001 1111100 1134577889999999999888876 677 Q ss_pred CCceeec---------cCCCceEEe-cCCC-C--------CccEEEEeccc-eEEEecceeeEeeehh--hhhhcCceEE Q lcl|Aclame:pro 287 NGVYVTA---------LPFNLNVIE-STVQ-E--------AGKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLY 344 (381) Q Consensus 287 ~G~~~~~---------l~~g~~vi~-s~~~-p--------~~~i~~gd~s~-y~i~~r~~~~i~~~~~--~~~~~d~~~~ 344 (381) +|+|+|. ..+|+|++. 
++.+ | +..++||||++ |.+++|++++++++++ .+|.+|+++| T Consensus 281 ~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~ 360 (392) T protein:vir:10 281 DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDL 360 (392) T ss_pred CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEE Confidence 8999874 236776554 2222 1 12378999998 7789999999999875 4799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.+|+||++++++||+.++++.++++.+|+| T Consensus 361 r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 361 RAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEeeccEEecccceEEEEecccccccCCCC Confidence 99999999999999999999999999999999 No 40 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=4.8e-55 Score=318.35 Aligned_cols=346 Identities=13% Similarity=0.082 Sum_probs=235.6 Q ss_pred CCccHHHHHHHHHHHHHHH----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------HH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINA----VNNGEPQERQNELYGDMINQLFEETKLQAKAE-----------------------AE 53 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~ 53 (381) |+-+++ ++.++++++.++ +++.....++.++..+..+.+........... .. 
T Consensus 1 M~k~l~-el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (404) T protein:vir:10 1 MSKELR-ELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYN 79 (404) T ss_pred CcHHHH-HHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHH Confidence 886554 233333333332 22211111111112222222211111000000 00 Q ss_pred -HHHHhhhhhccccHHHH-------HHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC---CceEEE Q lcl|Aclame:pro 54 -RVSSLPKSAQSLSANQR-------SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFL 122 (381) Q Consensus 54 -~~~~~~~~~~~lt~~e~-------~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~---~~~~ip 122 (381) ...........+...++ ....++..+++++||++||+++.++|++.+++.++|+++|++.+++ +...+| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~ 159 (404) T protein:vir:10 80 GALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYE 159 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEE Confidence 00000000000111111 1123566788899999999999999999999999999999998864 457889 Q ss_pred EecCCcceEEeccccccccc-ccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCC Q lcl|Aclame:pro 123 KSETSGVAVWGKIYGEIKGQ-LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK 201 (381) Q Consensus 123 ~~~~~~~a~w~~e~~~~~~~-~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~ 201 (381) +.++.+.+.|++|+++.+.+ .+++|+++++.+|+++++++||+|||+|+.++|++||++.+++++++++|.+|++|+|+ T Consensus 160 ~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~ 239 (404) T protein:vir:10 160 KRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGG 239 (404) T ss_pred EecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 98888899999998776543 47999999999999999999999999999999999999999999999999999999998 Q ss_pred 
Cc-ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh Q lcl|Aclame:pro 202 DQ-PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 202 ~q-P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~ 280 (381) ++ |.||++......... + + .. ....+...+.. .....|+.+++|+|||.++..++++ T Consensus 240 ~~~~~gi~~~~~~~~~~~-~--------~---~~----~~~~~~~~~~~------~l~~~~~~~~~~v~n~~~~~~L~~l 297 (404) T protein:vir:10 240 DEHATGIMTANKFKKITL-P--------K---SP----ALKDFKKCKNV------ELLNVFKATSSWIVNQDGFNYLDSL 297 (404) T ss_pred CCcccceeeccccceeec-c--------c---cc----cHHHHHHHHHh------hhhccccCCCEEEEcHHHHHHHHHh Confidence 75 678875432211110 0 0 00 11122222111 1134577889999999999888876 Q ss_pred hhccCCCCceeec---------cCCCceEEe-cCCCCCc-----cEEEEeccc-eEEEecceeeEeeehhh--hhhcCce Q lcl|Aclame:pro 281 YTHLNANGVYVTA---------LPFNLNVIE-STVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMD 342 (381) Q Consensus 281 ~~~~~~~G~~~~~---------l~~g~~vi~-s~~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~ 342 (381) ++.+|+|++. ..+|+||+. ++.+|.+ .++||||++ |.+++|++++|.++++. 
.|.+|++ T Consensus 298 ---kd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 374 (404) T protein:vir:10 298 ---EDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTT 374 (404) T ss_pred ---hccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCce Confidence 5678888874 247999875 4445532 389999997 78899999999998864 4889999 Q ss_pred EEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 343 ~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) .||+.+|+|+++.+++||++++++.++.++ T Consensus 375 ~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 375 KARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred EEEEEEeeccEEecccceEEEEeecccCCC Confidence 999999999999999999999999877555 No 41 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1e-54 Score=316.57 Aligned_cols=343 Identities=10% Similarity=0.009 Sum_probs=226.5 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhhhhH-------HHHHHHHHHHHHHHHHHHH---HHHHHH-HHHHHHHhhh-hhcc--- Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEFINAVNNGEP-------QERQNELYGDMINQLFEET---KLQAKA-EAERVSSLPK-SAQS--- 64 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~~-~~~~--- 64 (381) |++ ||++++.+..+++.+..+.... ..++.+.+...++.+..+. ..+... +......... .... 
T Consensus 5 m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) T protein:vir:10 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Confidence 444 3444444444443332211100 0111111111111111110 000000 0000000000 0000 Q ss_pred ----ccHHHHH----------------HHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEE Q lcl|Aclame:pro 65 ----LSANQRS----------------FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKF 121 (381) Q Consensus 65 ----lt~~e~~----------------~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~i 121 (381) ......+ -..++..+++++||++||++++++|++.+++.++|+++|+++++++ ...+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 164 (408) T protein:vir:10 85 SENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVY 164 (408) T ss_pred chhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEE Confidence 0001111 1124566788899999999999999999999999999999999753 3445 Q ss_pred EEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|Aclame:pro 122 LKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G 200 (381) |+. 
+..+.+.|++|+++.+..+.|+|++|++.+|+++++++||+|||+||.++|++||.++|+++++++++.+|++|+| T Consensus 165 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g 244 (408) T protein:vir:10 165 EKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) T ss_pred eeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 544 4457789999988877666799999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh Q lcl|Aclame:pro 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 201 ~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~ 280 (381) +++|.+-.. .... +.+.+.. .....|+++++|+|||.++..++.+ T Consensus 245 ~~~~~~~~~-------------------------~~~~----l~~~~~~------~~~~~~~~~a~~v~n~~~~~~l~~l 289 (408) T protein:vir:10 245 AAPKKPTIA-------------------------KFDD----VITMINT------AVDPAIIATSSLLTNQSGLNKLALV 289 (408) T ss_pred ccccccccc-------------------------cHHH----HHHHHHH------hhhhhhccCCEEEEcHHHHHHHHHh Confidence 887642210 0111 1111110 1134678889999999999888875 Q ss_pred hhccCCCCceeec---------cCCCceEEecC--CCCCcc-----EEEEeccc-eEEEecceeeEeeehhhh--hhcCc Q lcl|Aclame:pro 281 YTHLNANGVYVTA---------LPFNLNVIEST--VQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETL--ALDDM 341 (381) Q Consensus 281 ~~~~~~~G~~~~~---------l~~g~~vi~s~--~~p~~~-----i~~gd~s~-y~i~~r~~~~i~~~~~~~--~~~d~ 341 (381) ++++|+|+|. ..+|+||++++ .+|... 
++||||++ |.+++|+++++..+++.+ |.+|+ T Consensus 290 ---kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~ 366 (408) T protein:vir:10 290 ---KTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDT 366 (408) T ss_pred ---hccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCc Confidence 5778999874 23799998854 466432 79999998 779999999999998754 88999 Q ss_pred eEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 342 DLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 342 ~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) +.||+.+|+||++++++||++++++-+++.....+++=+- T Consensus 367 ~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~ 406 (408) T protein:vir:10 367 TKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTST 406 (408) T ss_pred eEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCcc Confidence 9999999999999999999988777654332222222222 No 42 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=6.6e-55 Score=317.61 Aligned_cols=345 Identities=12% Similarity=0.085 Sum_probs=229.9 Q ss_pred CCccHHHHHHHHHHHHHH---HHhhh----hHH--------HHHHHHHHHHHH---HHHHHHH----HHHH--------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFIN---AVNNG----EPQ--------ERQNELYGDMIN---QLFEETK----LQAK--------- 49 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~---~~~~~----~~~--------~~~~~~~~~~~~---~~~~~~~----~~~~--------- 49 (381) ..-++.+++++.++++.+ .++.. ..+ ++..+.+++... .+..... 
...+ T Consensus 16 ~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 95 (418) T protein:vir:10 16 GDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELET 95 (418) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch Confidence 443444444433333222 22111 000 011111111111 1110000 0000 Q ss_pred -----HHH-----HH-HHHhhhhhccccHHHHHH--HHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC Q lcl|Aclame:pro 50 -----AEA-----ER-VSSLPKSAQSLSANQRSF--FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) Q Consensus 50 -----~~~-----~~-~~~~~~~~~~lt~~e~~~--~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~ 116 (381) ... .+ .....+.......+.+.. ......++.++||++||++++..|++.+++.++|+++|++++++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~ 175 (418) T protein:vir:10 96 PKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS 175 (418) T ss_pred hhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeecc Confidence 000 00 000000000000111111 11223345667899999999999999999999999999999987 Q ss_pred Cc-eEEEEecC-CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 117 LR-LKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 117 ~~-~~ip~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) +. .++|+.++ .+.+.|++|+++.+ +++++|+++++.+|+++++++||++||+|+. 
++++||++.+++++++++|.+ T Consensus 176 ~~~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d~a 253 (418) T protein:vir:10 176 SSSIEYTVETGFTNNAAAVAEGAQKP-TSDLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEEGQ 253 (418) T ss_pred CCceeEEEEecCCCceeeeccCcccc-ccccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHH Confidence 64 78998765 57899999987754 5789999999999999999999999999985 899999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) |++|+|++ +|.||++.......... .. +.. .+..+..++..+ ...+..+.+|+|||.+ T Consensus 254 ~l~G~g~~~~p~Gi~~~~~~~~~~~~-------~~---~~~----~~~~i~~~~~~~-------~~~~~~~~~~v~n~~~ 312 (418) T protein:vir:10 254 ILKGDGTGANILGILPQASAFMPSIT-------LA---NAT----PIDKIRLALLQA-------VLAEFPATGIVLNPID 312 (418) T ss_pred HhccCCCCcccccccccccccccccc-------cc---ccc----cHHHHHHHHHhh-------ccccCCCCEEEEcHHH Confidence 99999987 59999975432211100 00 001 111122222111 2345667789999999 Q ss_pred HHHHHhhhhccCCCCceeec--------cCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehhh--hhhcCce Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMD 342 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~--------l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~ 342 (381) +..++.+ ++.+|+|++. ..+|+||+.+++||+++++||||++ |+++++++++|.++++. 
+|.+|++ T Consensus 313 ~~~L~~l---kd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~ 389 (418) T protein:vir:10 313 WASIELT---KDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMV 389 (418) T ss_pred HHHHHHh---hcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchhhhcCce Confidence 8887765 5778888863 3479999999999999999999998 88999999999988865 5999999 Q ss_pred EEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 343 ~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) .||+.+|+||++++++||++++++-++ .| T Consensus 390 ~~r~~~~~d~~~~~~~a~~~~~~~~~~-----~g 418 (418) T protein:vir:10 390 SIRAEERLALAVYRPESFVTGALVEQA-----GG 418 (418) T ss_pred EEEEEEeeccEEecccceEEEEeccCC-----CC Confidence 999999999999999999998776444 45 No 43 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.5e-54 Score=315.64 Aligned_cols=343 Identities=12% Similarity=0.095 Sum_probs=230.1 Q ss_pred CCccHHHHHHHHHHHHHHHHh---hhhHH-HHHHHHHHH---HH----HHHHHHHHHH-HH-HHHHHHHHhh--hhh-c- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVN---NGEPQ-ERQNELYGD---MI----NQLFEETKLQ-AK-AEAERVSSLP--KSA-Q- 63 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~---~~~~~-~~~~~~~~~---~~----~~~~~~~~~~-~~-~~~~~~~~~~--~~~-~- 63 (381) |+ ++.++++|.++++.+..+ ....+ .++.+.... .+ +++..+.... .+ .+.+...... +.. . 
T Consensus 1 m~-~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (395) T protein:vir:43 1 MS-DFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEE 79 (395) T ss_pred Ch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Confidence 66 444444444433332221 11000 000000100 00 1111000000 00 0000000000 000 0 Q ss_pred -c-------ccHH-HHHHH------------HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eEE Q lcl|Aclame:pro 64 -S-------LSAN-QRSFF------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) Q Consensus 64 -~-------lt~~-e~~~~------------~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~-~~i 121 (381) . .... .+.+. +....+++..+|++||++++.+|++.+++.++|+++|+++++++. .++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~ 159 (395) T protein:vir:43 80 APKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEY 159 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEE Confidence 0 0000 01111 122335667889999999999999999999999999999998764 789 Q ss_pred EEecC-CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|Aclame:pro 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G 200 (381) |+.++ .+.+.|++|+++. ++++++|+++++.+|+++++++||++||+|+. 
++++||.+.|++++++++|.+|++|+| T Consensus 160 ~~~~~~~~~a~~v~E~~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g 237 (395) T protein:vir:43 160 VRETGFVNNAAPVSEGTQK-PYSDLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNG 237 (395) T ss_pred EEEecCCCceeeecCCccc-cccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99755 4789999997765 46789999999999999999999999999975 799999999999999999999999999 Q ss_pred CCc-ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 201 KDQ-PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~q-P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) +++ |.||++............ .........+.+++..+ ...|..+++|+|||.++..++. T Consensus 238 ~~~~~~Gi~~~~~~~~~~~~~~------------~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~ 298 (395) T protein:vir:43 238 TGANLHGIIPQAQAYAPPSGVV------------VTAEQRIDRIRLAILQA-------QLAEFPASGIVLNPIDWALIEL 298 (395) T ss_pred CCCccccccccccccccccccc------------cccchhHHHHHHHHHhh-------ccccCCCcEEEEcHHHHHHHHH Confidence 876 589987643322111111 00111122222222211 3346667899999999888876 Q ss_pred hhhccCCCCceeec--------cCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEEEEEE Q lcl|Aclame:pro 280 QYTHLNANGVYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQ 348 (381) Q Consensus 280 ~~~~~~~~G~~~~~--------l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~~~~~ 348 (381) + ++++|+|++. ..+|+||+.+++||+++++||||++ |.+++|++++|+.+++. 
.|.+|+++||+.+ T Consensus 299 l---kd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~ 375 (395) T protein:vir:43 299 N---KDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEE 375 (395) T ss_pred h---hccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEE Confidence 5 5778988874 2379999999999999999999998 77899999999988765 5899999999999 Q ss_pred EEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 349 FAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 349 r~dgk~~~~~Af~v~~l~~~ 368 (381) |+||++.+++||+++++.-+ T Consensus 376 r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 376 RLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred eeccEEecccceEEEEeccC Confidence 99999999999999876665 No 44 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=2.2e-54 Score=314.76 Aligned_cols=348 Identities=15% Similarity=0.065 Sum_probs=221.5 Q ss_pred CCccHHH-HHHHHHHHH--------HHHHhhh--------hHHHHHHHHHHHHHHH-----------HHHH---HHHHHH Q lcl|Aclame:pro 1 MTINLSE-TFANAKNEF--------INAVNNG--------EPQERQNELYGDMINQ-----------LFEE---TKLQAK 49 (381) Q Consensus 1 m~~~l~~-~~~e~~~~~--------~~~~~~~--------~~~~~~~~~~~~~~~~-----------~~~~---~~~~~~ 49 (381) |++..+. +..+.+++. .+..... +...++.+.+.+..+. ..+. ...+.. 
T Consensus 24 ~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~ 103 (458) T protein:vir:10 24 LTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNELFAQTVEKQQETIVGLQDEIK 103 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222111 111110000 0000000 0000000000000000 0000 000000 Q ss_pred HHH---H-HHHHh------------------------hhhhc-cccHHHHHH--HHHHh-cccCCCCceEccHHHHHHHH Q lcl|Aclame:pro 50 AEA---E-RVSSL------------------------PKSAQ-SLSANQRSF--FMDIN-KNVNYKEEKLLPEETIDRIF 97 (381) Q Consensus 50 ~~~---~-~~~~~------------------------~~~~~-~lt~~e~~~--~~~~~-~~~~~~gg~lvP~~~~~~Ii 97 (381) ... + +.... ....+ ....+++.+ ..+.. .++.++||++||+++.+.|+ T Consensus 104 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii 183 (458) T protein:vir:10 104 SLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRII 183 (458) T ss_pred HHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHH Confidence 000 0 00000 00000 000111111 11111 23456799999999999999 Q ss_pred HHHHhhhhhhhhceeeecCCc-eEEEEecCCcceEEecccccccc-----cccccccceeccceeeeeehhhhHHHHhcC Q lcl|Aclame:pro 98 EDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKG-----QLDAAFSEETAIQNKLTAFVVLPKDLNDFG 171 (381) Q Consensus 98 ~~l~~~~~l~~~~~v~~~~~~-~~ip~~~~~~~a~w~~e~~~~~~-----~~~~~f~~v~l~~~kl~~~~~iS~ell~ds 171 (381) +.+++.++|+++|++++++++ ..+|+.++.+.+.|++|++..+. 
.++++|+++++.+||++++++||++||+|| T Consensus 184 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds 263 (458) T protein:vir:10 184 RDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDA 263 (458) T ss_pred HHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcc Confidence 999999999999999998765 67899888899999999876543 346889999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhh Q lcl|Aclame:pro 172 PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHS 251 (381) Q Consensus 172 ~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~ 251 (381) .+++++||.++|+++|++++|.+|++|+|+++|+||++.......... ........+......+ .+++..+ T Consensus 264 ~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~----~~~~~~~~~~~~~~~i----~~~~~~l- 334 (458) T protein:vir:10 264 IFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVV----TEAKADGSVLVTAKTI----SKLRRKL- 334 (458) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeeccccccccee----ecccccccccccHHHH----HHHHHhh- Confidence 999999999999999999999999999999999999986543221110 0001111111122222 2222211 Q ss_pred hccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec-------------cCCCceEEecCCCCCc----cEEEE Q lcl|Aclame:pro 252 TNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-------------LPFNLNVIESTVQEAG----KVLTY 314 (381) Q Consensus 252 ~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-------------l~~g~~vi~s~~~p~~----~i~~g 314 (381) ...|+++++|+|||.++..++.+ ++++|+|++. 
..+|+||+++++||++ .++|| T Consensus 335 ------~~~~~~~~~~v~~~~~~~~l~~l---kd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~ 405 (458) T protein:vir:10 335 ------GRHGLKLSKLVLIVSMDAYYDLL---EDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVI 405 (458) T ss_pred ------hhhhcCCCEEEEcHHHHHHHHhh---cccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEE Confidence 34567889999999998877765 6778888763 2469999999999964 58999 Q ss_pred eccc-eEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 315 VKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 315 d~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) ||+. |.+++|++++|.+ +.|+.++++.|++..|+|+.+.+++||++.+ +++. T Consensus 406 ~f~~~~~~~~~~~~~v~~--d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~--~aa~ 458 (458) T protein:vir:10 406 VYKDNFVMPRQRAVTVER--ERQAGKQRDAYYVTQRVNLQRYFANGVVSGT--YAAS 458 (458) T ss_pred EecccEEEEEeeceEEEe--ecccCCCceEEEEEEEecceEecccceEEEe--eccC Confidence 9975 8899999999876 4568899999999999999999999998754 3333 No 45 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=4.5e-56 Score=324.03 Aligned_cols=290 Identities=11% Similarity=0.045 Sum_probs=229.5 Q ss_pred ccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecccccccccc Q lcl|Aclame:pro 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQL 143 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~ 143 (381) +..++.+. 
....++.++|.+||++++++|++.+++.++|+++++++++++ ..++|+.++.+.+.|++|+++++ ++ T Consensus 1 m~~~~~~a---~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~ 76 (330) T protein:vir:77 1 MAGSTVPS---TQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKP-IT 76 (330) T ss_pred Ccccccch---hhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccc-cc Confidence 44555443 233445566778888899999999999999999999999865 58999998889999999987765 57 Q ss_pred cccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc-ceeeeeccccccccccccc Q lcl|Aclame:pro 144 DAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ-PIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 144 ~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q-P~Gil~~~~~~~~~~~~~~ 222 (381) +++|+++++.+||++++++||+|||+|+.+++++||.++|++++++++|.+|++|+|+++ |.||++............. T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL 156 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc Confidence 899999999999999999999999999999999999999999999999999999999875 6799876544333222111 Q ss_pred cccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeecc-------- Q lcl|Aclame:pro 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-------- 294 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l-------- 294 (381) ... ... .......+..++..+ ...+..+.+|+|||.++..++.+ ++.+|+|+|.. 
T Consensus 157 ~~~------~~~-~~~~~~~l~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~ 219 (330) T protein:vir:77 157 TTA------SGP-QGNAYLAVNNALSLL-------VNSGKKWTGTLLDNVTEPILNTA---VDGNGRPLFVESTYTEQVG 219 (330) T ss_pred ccc------ccc-cchhHHHHHHHHHhh-------hhcCCCccEEEEcHHHHHHHHHH---hccCCceeecCcccccccc Confidence 110 001 111112222222211 23355677899999999888875 56788888742 Q ss_pred ------CCCceEEecCCCCCcc------EEEEeccceEEEecceeeEeeehhhh------------------hhcCceEE Q lcl|Aclame:pro 295 ------PFNLNVIESTVQEAGK------VLTYVKGLYDGYLAGGINVQKFKETL------------------ALDDMDLY 344 (381) Q Consensus 295 ------~~g~~vi~s~~~p~~~------i~~gd~s~y~i~~r~~~~i~~~~~~~------------------~~~d~~~~ 344 (381) .+|+||+.+++||++. ++||||++|+++++++++|++++|.+ |.+|++.| T Consensus 220 ~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~ 299 (330) T protein:vir:77 220 AIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAV 299 (330) T ss_pred ccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEE Confidence 3699999999999754 78999999999999999999999875 78899999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~ 375 (381) |+.+|+|+++++++||++++.+..+.+|--| T Consensus 300 r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 300 RCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred EEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 9999999999999999999888877777666 No 46 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=3.5e-54 Score=313.62 Aligned_cols=343 Identities=9% Similarity=-0.007 Sum_probs=227.1 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhhhhHH-------HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHh--hhhhc---- Q lcl|Aclame:pro 1 
MTI-NLSETFANAKNEFINAVNNGEPQ-------ERQNELYGDMINQLFEET---KLQAKAEAERVSSL--PKSAQ---- 63 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~---- 63 (381) |++ +|++++.+..+++.+..+..... .++.+......+.+..+. ..+......+.... ..... T Consensus 5 m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) T protein:vir:74 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 555 34444444433333222111100 000000111111111110 00000000000000 00000 Q ss_pred ---cccHHHHHH----------------HHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEE Q lcl|Aclame:pro 64 ---SLSANQRSF----------------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKF 121 (381) Q Consensus 64 ---~lt~~e~~~----------------~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~i 121 (381) .....+.+. ..++..+++++||++||+++.+.|++.+++.++|+++|+++++++ .+.+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 164 (408) T protein:vir:74 85 SENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVY 164 (408) T ss_pred hhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEE Confidence 000111111 123456778889999999999999999999999999999998753 3456 Q ss_pred EEecC-CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|Aclame:pro 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G 200 (381) ++..+ .+.++|++|+++.+..++++|+++++.+|+++++++||+|||+||.++|++||.++|++++++++|.+|++|+| T Consensus 165 ~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G 244 (408) T protein:vir:74 165 EKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMG 244 
(408) T ss_pred EeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 65443 45678999988877667899999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh Q lcl|Aclame:pro 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 201 ~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~ 280 (381) +++|.|.... .......+. . ..+..|+++++|+|||.++..++.+ T Consensus 245 ~~~~~~~~~~-------------------------~~~i~~~~~---~-------~l~~~~~~~a~~v~n~~~~~~l~~l 289 (408) T protein:vir:74 245 TVPKKPTIAN-------------------------FDDVITMIN---T-------SVDPAIIATSSLLTNQSGLNKLALV 289 (408) T ss_pred cccccccccc-------------------------HHHHHHHHH---H-------hhhhhhcCCCEEEEcHHHHHHHHHh Confidence 9987653210 011111111 0 1134577889999999998888765 Q ss_pred hhccCCCCceeec---------cCCCceEEecC--CCCC-----ccEEEEeccc-eEEEecceeeEeeehhh--hhhcCc Q lcl|Aclame:pro 281 YTHLNANGVYVTA---------LPFNLNVIEST--VQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDM 341 (381) Q Consensus 281 ~~~~~~~G~~~~~---------l~~g~~vi~s~--~~p~-----~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~ 341 (381) ++++|+|+|. ..+|+||+.++ .+|. ..++||||++ |.+++|++++++++++. 
.|.+|+ T Consensus 290 ---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~ 366 (408) T protein:vir:74 290 ---KTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDT 366 (408) T ss_pred ---hcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcce Confidence 5778999874 24799998765 4663 3489999997 77899999999998874 589999 Q ss_pred eEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 342 DLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 342 ~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) +.||+.+|+||++++++||++++++-++..+...+++=+- T Consensus 367 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 406 (408) T protein:vir:74 367 TKIRVIDRFDVKATDSEALVAGSFTAIADQVGNFKTTTST 406 (408) T ss_pred eeEEEEEeeCcEEecccceEEEEeecccCCCCCCCCCccc Confidence 9999999999999999999999886554433333322222 No 47 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=3.5e-54 Score=313.62 Aligned_cols=346 Identities=14% Similarity=0.081 Sum_probs=229.1 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHH--------HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHhh------ Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYGDMINQLFE-------ETKLQAKAEAERVSSLP------ 59 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~------ 59 (381) |=-|.++...++.++..++++....+ ++..+......+.+.. ....+......+..... 
T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEF 80 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhh Confidence 55566655555444433333221110 0000001111111000 00000000000000000 Q ss_pred ------------h-------hhccccHHHHHHHH--HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc Q lcl|Aclame:pro 60 ------------K-------SAQSLSANQRSFFM--DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 60 ------------~-------~~~~lt~~e~~~~~--~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~ 118 (381) + ....+...+.+.+. ....+++++||++||+++..+|++.+++.++|+++|+++++++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 160 (413) T protein:vir:81 81 FAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNT 160 (413) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCC Confidence 0 00001111111111 12335667899999999999999999999999999999998764 Q ss_pred -eEEEEecCC----cceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 119 -LKFLKSETS----GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET 193 (381) Q Consensus 119 -~~ip~~~~~----~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~ 193 (381) .++|+.... ..+.|++|+++.+....++|+++++.+|+++++++||++||+|+. +|++||+++|++++++++|. 
T Consensus 161 ~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~ 239 (413) T protein:vir:81 161 TIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEER 239 (413) T ss_pred ceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 788876533 457899998876654458999999999999999999999999996 49999999999999999999 Q ss_pred heeeccCCCcc-eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|Aclame:pro 194 AFLKGTGKDQP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 194 a~l~G~G~~qP-~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~ 272 (381) +|++|+|+++| .||++.......... ........+...+.... ....++.++ |+|||+ T Consensus 240 ~~l~G~G~~~~~~Gi~~~~~~~~~~~~---------------~~~~~~~~i~~~~~~~~-----~~~~~~~~~-~vmn~~ 298 (413) T protein:vir:81 240 QLLLGDGTGNNLTGLLKRDGIQTLAVS---------------NKDELADSIYKAMTNIS-----LATPFQADA-LVINPL 298 (413) T ss_pred HHhccCCCCCccccccccccccccccc---------------ccchhHHHHHHHHHHhh-----hhccCCCcE-EEEcHH Confidence 99999998865 799865322211110 11111122222211111 122344454 999999 Q ss_pred hHHHHHhhhhccCCCCceeec----------------cCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehhh Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVYVTA----------------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET 335 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~~~~----------------l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~~ 335 (381) ++..++++ ++++|+|+|. ..+|+||+++++||+++++||||++ |.+++|++++++.+++. 
T Consensus 299 ~~~~l~~l---kd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 375 (413) T protein:vir:81 299 DYQELRLA---KDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTN 375 (413) T ss_pred HHHHHHHh---hccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecceEEEEeccc Confidence 99988876 5678888763 2479999999999999999999997 78999999999988875 Q ss_pred --hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCC Q lcl|Aclame:pro 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 336 --~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~ 374 (381) +|.+|++.||+.+|+||++++++||++++++-+. +| T Consensus 376 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~---~p 413 (413) T protein:vir:81 376 VDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEVV---TP 413 (413) T ss_pred cchhhcCcEEEEEEEeeccEEecccceEEEEecCCC---CC Confidence 6999999999999999999999999998775432 22 No 48 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=4e-54 Score=313.35 Aligned_cols=339 Identities=12% Similarity=0.016 Sum_probs=234.4 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhH-----------HHHHHHHHHHHHHHHHHHHHHH------HHHH-HHHHHHhhh---- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEP-----------QERQNELYGDMINQLFEETKLQ------AKAE-AERVSSLPK---- 60 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~------~~~~-~~~~~~~~~---- 60 (381) |+..+++++.++++.+.++.... ..++.+.+...++.+..+.... .+.. 
........+ T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 65555555544444333322111 1111111111121111111100 0000 000000000 Q ss_pred -hhccccHHHH------------HHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eE--EEE- Q lcl|Aclame:pro 61 -SAQSLSANQR------------SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LK--FLK- 123 (381) Q Consensus 61 -~~~~lt~~e~------------~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~-~~--ip~- 123 (381) .......+++ ...+.+..+++++||++||++++++|++.+++.++|+++|+++++++. .. ++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred chhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEee Confidence 0000111112 222345567778899999999999999999999999999999987542 33 333 Q ss_pred ecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|Aclame:pro 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q 203 (381) .+..+.+.|++|+++.++.++++|++|++.+++++++++||++||+||.+++++||+++|++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~ 240 (397) T protein:vir:48 161 ADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLP 240 (397) T ss_pred cCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 35556799999988877666799999999999999999999999999999999999999999999999999999999887 Q ss_pred 
ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) |.|.+.. .. .+.++...+ ...|..+++|+|||.++..++++ T Consensus 241 ~~~~~~~-------------------------~d----~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~L~~l--- 281 (397) T protein:vir:48 241 TKPTLTK-------------------------WD----DIIDLQAKV-------DPAIKQTSFFLTNTSGFTALKKV--- 281 (397) T ss_pred ccccccc-------------------------HH----HHHHHHHHh-------hhhhcCCCEEEECHHHHHHHHHh--- Confidence 6543210 11 122222111 23467789999999999888875 Q ss_pred cCCCCceeec---------cCCCceEEecC--CCC-----CccEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEE Q lcl|Aclame:pro 284 LNANGVYVTA---------LPFNLNVIEST--VQE-----AGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLY 344 (381) Q Consensus 284 ~~~~G~~~~~---------l~~g~~vi~s~--~~p-----~~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~ 344 (381) ++++|+|++. ..+|+||+.++ .+| +..++||||++ |.+++|++++++.+++. 
+|.+|++.| T Consensus 282 kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:48 282 KNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKI 361 (397) T ss_pred hcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeE Confidence 5778888864 23799987644 344 34589999997 67899999999988864 799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCCCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~ 380 (381) |+.+|+||++++++||+.++++-++.+|...+++.- T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 362 RVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred EEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 999999999999999999999988877777777776 No 49 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=3.7e-54 Score=313.48 Aligned_cols=339 Identities=12% Similarity=-0.000 Sum_probs=229.2 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhH-----------HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH---hhhhh---- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEP-----------QERQNELYGDMINQLFEETKLQAK--AEAERVSS---LPKSA---- 62 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~---- 62 (381) |+..+++++++.++.+.+++... +.++.+.+....+.+..+...... .+.+.... ..... 
T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 55455544444443333322111 111111111111111111100000 00000000 00000 Q ss_pred ---ccccHHHH------------HHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEe Q lcl|Aclame:pro 63 ---QSLSANQR------------SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKS 124 (381) Q Consensus 63 ---~~lt~~e~------------~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~ 124 (381) .....+++ ..++.+..+++++||++||+++.+.|++.+++.++|+++|+++++++ ...+|+. T Consensus 81 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred chhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEee Confidence 00111222 22334566778899999999999999999999999999999988753 3556654 Q ss_pred c-CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|Aclame:pro 125 E-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 125 ~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q 203 (381) . 
..+.+.|++|+++.+....++|++|++.+|+++++++||++||+|+.+||++||.+++++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~ 240 (397) T protein:vir:49 161 ADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP 240 (397) T ss_pred ccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 3 457799999988876666799999999999999999999999999999999999999999999999999999999987 Q ss_pred ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) |.+... + .+. +.++...+ ...|..+++|+|||.++..++++ T Consensus 241 ~~~~~~----------------------~---~d~----i~~~~~~l-------~~~~~~~a~~v~n~~~~~~l~~l--- 281 (397) T protein:vir:49 241 NKPTLA----------------------K---WDD----IIDLQAKV-------DPAIKQTSLFLTNTSGFTALKKV--- 281 (397) T ss_pred cccccc----------------------C---HHH----HHHHHHhh-------hhhhcCCCEEEEcHHHHHHHHHh--- Confidence 753211 0 111 11221111 23567789999999999888876 Q ss_pred cCCCCceeec---------cCCCceEEecC--CCCC-----ccEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEE Q lcl|Aclame:pro 284 LNANGVYVTA---------LPFNLNVIEST--VQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLY 344 (381) Q Consensus 284 ~~~~G~~~~~---------l~~g~~vi~s~--~~p~-----~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~ 344 (381) ++.+|+|++. ..+|+||+.++ .+|. ..++||||++ |++++|++++|+++++. 
+|.+|++.| T Consensus 282 kd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 282 KNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred hccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeE Confidence 5778988863 34789987644 4553 3589999997 77899999999998865 799999999 Q ss_pred EEEEEEcCEEecCcceEEEEEEecccccCCCCCCCC Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~ 380 (381) |+.+|+||++++++||++++++-.+.+++....+=- T Consensus 362 ~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 362 RVIDRFDVVSTDTEAFVPASFKAIADQKAKLSTAGA 397 (397) T ss_pred EEEEeeccEEecccceEEEEecccccccCcccccCC Confidence 999999999999999999988776554433322222 No 50 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=5.2e-54 Score=312.69 Aligned_cols=340 Identities=14% Similarity=0.100 Sum_probs=231.4 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHH-------HHHHH-H-HHHHHHHHhhhhh--------cc Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEP-QERQNELYGDMINQLFEE-------TKLQA-K-AEAERVSSLPKSA--------QS 64 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-------~~~~~-~-~~~~~~~~~~~~~--------~~ 64 (381) |+..++++++++++.++++.... .+++.+......+.+... ..... + .+.++........ +. 
T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSER 80 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHH Confidence 44444555555554444432211 111111111111111111 10000 0 0000000000000 00 Q ss_pred ccHHHHH-------------HHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecC-Ccc Q lcl|Aclame:pro 65 LSANQRS-------------FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-SGV 129 (381) Q Consensus 65 lt~~e~~-------------~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~-~~~ 129 (381) ...+.++ ..+.+ ..+...+|.+||+++...|++.+++.++|+++|+++++++ ..++|+.++ .+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (385) T protein:vir:18 81 AAEELIKSWDGKQGTFGAKTFNKSL-GSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNN 159 (385) T ss_pred HHHHHHHHHHHhhccchhhHHHhhh-ccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcc Confidence 0011111 11122 2344556778899999999999999999999999999865 478998764 578 Q ss_pred eEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc-ceeee Q lcl|Aclame:pro 130 AVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ-PIGLN 208 (381) Q Consensus 130 a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q-P~Gil 208 (381) +.|++|+++. ++++++|+++++.+|+++++++||++||+|+. 
++++||.++|++++++++|.+|++|+|+++ |.||+ T Consensus 160 a~~v~E~~~~-~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~ 237 (385) T protein:vir:18 160 ADVVAEKALK-PESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLN 237 (385) T ss_pred eeeeccCccc-cccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccc Confidence 8999997765 46789999999999999999999999999875 699999999999999999999999999886 57998 Q ss_pred eccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCC Q lcl|Aclame:pro 209 RQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANG 288 (381) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G 288 (381) +......... ... .....+.+.+++..+ ...+..+.+|+|||.++..++++ ++.+| T Consensus 238 ~~~~~~~~~~-------~~~-------~~~~~d~i~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~l---kd~~G 293 (385) T protein:vir:18 238 KVATAYDTSL-------NAT-------GDTRADIIAHAIYQV-------TESEFSASGIVLNPRDWHNIALL---KDNEG 293 (385) T ss_pred cccccccccc-------ccc-------ccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHHh---hcCCC Confidence 7542211110 000 111222333332222 23456678999999999888876 56788 Q ss_pred ceeec--------cCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEEEEEEEEcCEEecC Q lcl|Aclame:pro 289 VYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDN 357 (381) Q Consensus 289 ~~~~~--------l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~~~~~r~dgk~~~~ 357 (381) +|++. ..+|+||+.+++||+++++||||++ |.++++++++|+.+++. 
+|.+|++.||+.+|+||++.++ T Consensus 294 ~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~ 373 (385) T protein:vir:18 294 RYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRP 373 (385) T ss_pred ceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 88863 2379999999999999999999997 88999999999988765 5899999999999999999999 Q ss_pred cceEEEEEEecc Q lcl|Aclame:pro 358 KVAAVWKLDLKG 369 (381) Q Consensus 358 ~Af~v~~l~~~~ 369 (381) +||++++++-++ T Consensus 374 ~a~~~~~~~aa~ 385 (385) T protein:vir:18 374 TAIIKGTFSSGS 385 (385) T ss_pred cceEEEEeccCC Confidence 999998877765 No 51 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=5.2e-54 Score=312.69 Aligned_cols=340 Identities=14% Similarity=0.100 Sum_probs=231.4 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHH-------HHHHH-H-HHHHHHHHhhhhh--------cc Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEP-QERQNELYGDMINQLFEE-------TKLQA-K-AEAERVSSLPKSA--------QS 64 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-------~~~~~-~-~~~~~~~~~~~~~--------~~ 64 (381) |+..++++++++++.++++.... .+++.+......+.+... ..... + .+.++........ +. 
T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSER 80 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHH Confidence 44444555555554444432211 111111111111111111 10000 0 0000000000000 00 Q ss_pred ccHHHHH-------------HHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecC-Ccc Q lcl|Aclame:pro 65 LSANQRS-------------FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-SGV 129 (381) Q Consensus 65 lt~~e~~-------------~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~-~~~ 129 (381) ...+.++ ..+.+ ..+...+|.+||+++...|++.+++.++|+++|+++++++ ..++|+.++ .+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (385) T protein:vir:19 81 AAEELIKSWDGKQGTFGAKTFNKSL-GSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNN 159 (385) T ss_pred HHHHHHHHHHHhhccchhhHHHhhh-ccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcc Confidence 0011111 11122 2344556778899999999999999999999999999865 478998764 578 Q ss_pred eEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc-ceeee Q lcl|Aclame:pro 130 AVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ-PIGLN 208 (381) Q Consensus 130 a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q-P~Gil 208 (381) +.|++|+++. ++++++|+++++.+|+++++++||++||+|+. 
++++||.++|++++++++|.+|++|+|+++ |.||+ T Consensus 160 a~~v~E~~~~-~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~ 237 (385) T protein:vir:19 160 ADVVAEKALK-PESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLN 237 (385) T ss_pred eeeeccCccc-cccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccc Confidence 8999997765 46789999999999999999999999999875 699999999999999999999999999886 57998 Q ss_pred eccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCC Q lcl|Aclame:pro 209 RQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANG 288 (381) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G 288 (381) +......... ... .....+.+.+++..+ ...+..+.+|+|||.++..++++ ++.+| T Consensus 238 ~~~~~~~~~~-------~~~-------~~~~~d~i~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~l---kd~~G 293 (385) T protein:vir:19 238 KVATAYDTSL-------NAT-------GDTRADIIAHAIYQV-------TESEFSASGIVLNPRDWHNIALL---KDNEG 293 (385) T ss_pred cccccccccc-------ccc-------ccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHHh---hcCCC Confidence 7542211110 000 111222333332222 23456678999999999888876 56788 Q ss_pred ceeec--------cCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehhh--hhhcCceEEEEEEEEcCEEecC Q lcl|Aclame:pro 289 VYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDN 357 (381) Q Consensus 289 ~~~~~--------l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~~~d~~~~~~~~r~dgk~~~~ 357 (381) +|++. ..+|+||+.+++||+++++||||++ |.++++++++|+.+++. 
+|.+|++.||+.+|+||++.++ T Consensus 294 ~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~ 373 (385) T protein:vir:19 294 RYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRP 373 (385) T ss_pred ceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 88863 2379999999999999999999997 88999999999988765 5899999999999999999999 Q ss_pred cceEEEEEEecc Q lcl|Aclame:pro 358 KVAAVWKLDLKG 369 (381) Q Consensus 358 ~Af~v~~l~~~~ 369 (381) +||++++++-++ T Consensus 374 ~a~~~~~~~aa~ 385 (385) T protein:vir:19 374 TAIIKGTFSSGS 385 (385) T ss_pred cceEEEEeccCC Confidence 999998877765 No 52 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=5e-53 Score=307.30 Aligned_cols=350 Identities=9% Similarity=0.013 Sum_probs=231.4 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHHH---------HHHHHHHHHHHHHHHHHHHHHHH----H--HHH------------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQE---------RQNELYGDMINQLFEETKLQAKA----E--AER------------- 54 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~----~--~~~------------- 54 (381) ||.++++.+++.++.+++.....+. ++.+....+.+.+..+....... + ... 
T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 6666665555555544332221110 01111111222221111100000 0 000 Q ss_pred ----------HHHhhhhhccccHHHHHHHHHH---------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 55 ----------VSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 55 ----------~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) ..............+++.+... ...++++||++||+++.++|++.+++.++|+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec Confidence 0000000011223333333221 112455789999999999999999999999999999997 Q ss_pred CC-ceEEE--EecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 116 GL-RLKFL--KSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~~-~~~ip--~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d 192 (381) ++ ..++| +.++.+.+.|++|+++.+..+.++|+++++.+|+++++++||++||+||.++|++||++++++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 240 (415) T protein:vir:46 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 53 34555 4566778999999888776678999999999999999999999999999999999999999999999999 Q ss_pred 
hheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|Aclame:pro 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~ 272 (381) .+|++|+|+++|.++............ ..+. .. .+.+.+++..+ ...|..+++|+|||. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~-------~~~~---~~----~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:46 241 KAIIDVITKGSTGSTSSGFEKEGKKLE-------VKKA---KS----LDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCCccccccccccccceec-------cccc---cc----hHHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 999999999988776543221111110 0111 11 11222222221 234566789999999 Q ss_pred hHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCc-----cEEEEeccc-eEEEecceeeEeeehhhhh Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~~~~ 337 (381) ++..++.+ ++++|+|++. 
..+|+||+.++++|.+ .++||||++ |++++|++++++.++ | T Consensus 300 ~~~~L~~l---kd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~---~ 373 (415) T protein:vir:46 300 MFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD---Y 373 (415) T ss_pred HHHHHHHh---hccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec---c Confidence 98887764 6788999874 2479999999999853 379999998 778999999999876 5 Q ss_pred hcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) .+++++||+.+|+||++.+++||++++++-++..+---|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCCccCCC Confidence 667889999999999999999999987776553222211111 No 53 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=5e-53 Score=307.30 Aligned_cols=350 Identities=9% Similarity=0.013 Sum_probs=231.4 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHHH---------HHHHHHHHHHHHHHHHHHHHHHH----H--HHH------------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQE---------RQNELYGDMINQLFEETKLQAKA----E--AER------------- 54 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~----~--~~~------------- 54 (381) ||.++++.+++.++.+++.....+. ++.+....+.+.+..+....... + ... 
T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 6666665555555544332221110 01111111222221111100000 0 000 Q ss_pred ----------HHHhhhhhccccHHHHHHHHHH---------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 55 ----------VSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 55 ----------~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) ..............+++.+... ...++++||++||+++.++|++.+++.++|+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec Confidence 0000000011223333333221 112455789999999999999999999999999999997 Q ss_pred CC-ceEEE--EecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 116 GL-RLKFL--KSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~~-~~~ip--~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d 192 (381) ++ ..++| +.++.+.+.|++|+++.+..+.++|+++++.+|+++++++||++||+||.++|++||++++++++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 240 (415) T protein:vir:47 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 53 34555 4566778999999888776678999999999999999999999999999999999999999999999999 Q ss_pred 
hheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|Aclame:pro 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~ 272 (381) .+|++|+|+++|.++............ ..+. .. .+.+.+++..+ ...|..+++|+|||. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~-------~~~~---~~----~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:47 241 KAIIDVITKGSTGSTSSGFEKEGKKLE-------VKKA---KS----LDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCCccccccccccccceec-------cccc---cc----hHHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 999999999988776543221111110 0111 11 11222222221 234566789999999 Q ss_pred hHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCc-----cEEEEeccc-eEEEecceeeEeeehhhhh Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~~~~ 337 (381) ++..++.+ ++++|+|++. 
..+|+||+.++++|.+ .++||||++ |++++|++++++.++ | T Consensus 300 ~~~~L~~l---kd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~---~ 373 (415) T protein:vir:47 300 MFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD---Y 373 (415) T ss_pred HHHHHHHh---hccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec---c Confidence 98887764 6788999874 2479999999999853 379999998 778999999999876 5 Q ss_pred hcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) .+++++||+.+|+||++.+++||++++++-++..+---|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCCccCCC Confidence 667889999999999999999999987776553222211111 No 54 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=5.5e-53 Score=307.08 Aligned_cols=349 Identities=9% Similarity=0.020 Sum_probs=232.9 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHH------HHHH---HHHHHHHHHHHHHHHHHHHHHHHH------------------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQ------ERQN---ELYGDMINQLFEETKLQAKAEAER------------------- 54 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~------------------- 54 (381) |+..+++.++++++.+.++....+ ++.. +.....++.+....... 
....++ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEK-QEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhcccccccch Confidence 766666666666655544322211 1111 11111111111111000 000000 Q ss_pred -----------HHHhhhhhccccHHHHHHHHHH---------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee Q lcl|Aclame:pro 55 -----------VSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN 114 (381) Q Consensus 55 -----------~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~ 114 (381) ..........+...+++.+... ...++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:79 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee Confidence 0000001111233333333211 11245578999999999999999999999999999999 Q ss_pred cCC---ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 AGL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVAL 191 (381) Q Consensus 115 ~~~---~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~ 191 (381) +++ ...+|+.++...+.|++|+++.++.+.++|+++++.+|+++++++||++||+||.++|++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:79 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 753 34556667778899999988887667899999999999999999999999999999999999999999999999 Q ss_pred 
hhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEch Q lcl|Aclame:pro 192 ETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNP 271 (381) Q Consensus 192 d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~ 271 (381) +.+|++|+|+++|.+............ +..+.. ..+.+.+++..+ ...+..+++|+||| T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~-------~~~~~~-------~~~~i~~~~~~~-------~~~~~~~~~~v~n~ 298 (415) T protein:vir:79 240 NKAIIDVITKGSTGSTSSGFEKEGKKL-------EVKKAK-------SLDDIKDAINLN-------VKPNYEHNVAIVSQ 298 (415) T ss_pred HHHHhhccccCcccccccccccccccc-------cccccc-------chhHHHHHHHhh-------hhhccCCCEEEEcH Confidence 999999999998876653321111110 001111 112222222221 22455678999999 Q ss_pred hhHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCcc-----EEEEeccc-eEEEecceeeEeeehhhh Q lcl|Aclame:pro 272 SDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETL 336 (381) Q Consensus 272 ~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~~-----i~~gd~s~-y~i~~r~~~~i~~~~~~~ 336 (381) .++..++.+ ++++|+|++. ..+|+||+.++++|.+. 
++||||++ |++++|+++++..+++ T Consensus 299 ~~~~~l~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~-- 373 (415) T protein:vir:79 299 TMFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-- 373 (415) T ss_pred HHHHHHHHh---hccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc-- Confidence 998888765 6788998874 24799999999998543 89999998 7789999999998774 Q ss_pred hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) ..++++||+.+|+||++.+++||++++++-+...+---|-.- T Consensus 374 -~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 374 -MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred -ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 566789999999999999999999988876553222211111 No 55 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=5.5e-53 Score=307.08 Aligned_cols=349 Identities=9% Similarity=0.020 Sum_probs=232.9 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHH------HHHH---HHHHHHHHHHHHHHHHHHHHHHHH------------------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQ------ERQN---ELYGDMINQLFEETKLQAKAEAER------------------- 54 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~------------------- 54 (381) |+..+++.++++++.+.++....+ ++.. +.....++.+....... 
....++ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEK-QEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhcccccccch Confidence 766666666666655544322211 1111 11111111111111000 000000 Q ss_pred -----------HHHhhhhhccccHHHHHHHHHH---------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee Q lcl|Aclame:pro 55 -----------VSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN 114 (381) Q Consensus 55 -----------~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~ 114 (381) ..........+...+++.+... ...++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:98 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee Confidence 0000001111233333333211 11245578999999999999999999999999999999 Q ss_pred cCC---ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 AGL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVAL 191 (381) Q Consensus 115 ~~~---~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~ 191 (381) +++ ...+|+.++...+.|++|+++.++.+.++|+++++.+|+++++++||++||+||.++|++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:98 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 753 34556667778899999988887667899999999999999999999999999999999999999999999999 Q ss_pred 
hhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEch Q lcl|Aclame:pro 192 ETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNP 271 (381) Q Consensus 192 d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~ 271 (381) +.+|++|+|+++|.+............ +..+.. ..+.+.+++..+ ...+..+++|+||| T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~-------~~~~~~-------~~~~i~~~~~~~-------~~~~~~~~~~v~n~ 298 (415) T protein:vir:98 240 NKAIIDVITKGSTGSTSSGFEKEGKKL-------EVKKAK-------SLDDIKDAINLN-------VKPNYEHNVAIVSQ 298 (415) T ss_pred HHHHhhccccCcccccccccccccccc-------cccccc-------chhHHHHHHHhh-------hhhccCCCEEEEcH Confidence 999999999998876653321111110 001111 112222222221 22455678999999 Q ss_pred hhHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCcc-----EEEEeccc-eEEEecceeeEeeehhhh Q lcl|Aclame:pro 272 SDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETL 336 (381) Q Consensus 272 ~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~~-----i~~gd~s~-y~i~~r~~~~i~~~~~~~ 336 (381) .++..++.+ ++++|+|++. ..+|+||+.++++|.+. 
++||||++ |++++|+++++..+++ T Consensus 299 ~~~~~l~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~-- 373 (415) T protein:vir:98 299 TMFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-- 373 (415) T ss_pred HHHHHHHHh---hccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc-- Confidence 998888765 6788998874 24799999999998543 89999998 7789999999998774 Q ss_pred hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) ..++++||+.+|+||++.+++||++++++-+...+---|-.- T Consensus 374 -~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 374 -MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred -ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 566789999999999999999999988876553222211111 No 56 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=5.5e-53 Score=307.08 Aligned_cols=349 Identities=9% Similarity=0.020 Sum_probs=232.9 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHH------HHHH---HHHHHHHHHHHHHHHHHHHHHHHH------------------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQ------ERQN---ELYGDMINQLFEETKLQAKAEAER------------------- 54 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~------------------- 54 (381) |+..+++.++++++.+.++....+ ++.. +.....++.+....... 
....++ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEK-QEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhcccccccch Confidence 766666666666655544322211 1111 11111111111111000 000000 Q ss_pred -----------HHHhhhhhccccHHHHHHHHHH---------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee Q lcl|Aclame:pro 55 -----------VSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN 114 (381) Q Consensus 55 -----------~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~ 114 (381) ..........+...+++.+... ...++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:81 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee Confidence 0000001111233333333211 11245578999999999999999999999999999999 Q ss_pred cCC---ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 AGL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVAL 191 (381) Q Consensus 115 ~~~---~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~ 191 (381) +++ ...+|+.++...+.|++|+++.++.+.++|+++++.+|+++++++||++||+||.++|++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:81 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 753 34556667778899999988887667899999999999999999999999999999999999999999999999 Q ss_pred 
hhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEch Q lcl|Aclame:pro 192 ETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNP 271 (381) Q Consensus 192 d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~ 271 (381) +.+|++|+|+++|.+............ +..+.. ..+.+.+++..+ ...+..+++|+||| T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~-------~~~~~~-------~~~~i~~~~~~~-------~~~~~~~~~~v~n~ 298 (415) T protein:vir:81 240 NKAIIDVITKGSTGSTSSGFEKEGKKL-------EVKKAK-------SLDDIKDAINLN-------VKPNYEHNVAIVSQ 298 (415) T ss_pred HHHHhhccccCcccccccccccccccc-------cccccc-------chhHHHHHHHhh-------hhhccCCCEEEEcH Confidence 999999999998876653321111110 001111 112222222221 22455678999999 Q ss_pred hhHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCcc-----EEEEeccc-eEEEecceeeEeeehhhh Q lcl|Aclame:pro 272 SDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETL 336 (381) Q Consensus 272 ~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~~-----i~~gd~s~-y~i~~r~~~~i~~~~~~~ 336 (381) .++..++.+ ++++|+|++. ..+|+||+.++++|.+. 
++||||++ |++++|+++++..+++ T Consensus 299 ~~~~~l~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~-- 373 (415) T protein:vir:81 299 TMFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY-- 373 (415) T ss_pred HHHHHHHHh---hccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc-- Confidence 998888765 6788998874 24799999999998543 89999998 7789999999998774 Q ss_pred hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) ..++++||+.+|+||++.+++||++++++-+...+---|-.- T Consensus 374 -~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 374 -MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred -ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 566789999999999999999999988876553222211111 No 57 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=3.5e-53 Score=308.20 Aligned_cols=341 Identities=12% Similarity=0.016 Sum_probs=228.4 Q ss_pred CCcc-HHHHHHHHHHHHHHH---HhhhhHHHH---H------HHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhc--- Q lcl|Aclame:pro 1 MTIN-LSETFANAKNEFINA---VNNGEPQER---Q------NELYGDMINQLFEETKL-QAKAEAERVSSLPKSAQ--- 63 (381) Q Consensus 1 m~~~-l~~~~~e~~~~~~~~---~~~~~~~~~---~------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~--- 63 (381) |+++ |++++.+..+++.+. ++....++. . .+...+.++.+...... 
+...+..+........+ T Consensus 1 M~~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:38 1 MNINQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKP 80 (395) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 9994 777666555443332 221111110 0 01111111111111000 00000000000000000 Q ss_pred ----cccHHHHHH--------HHHHh--cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC---ceEEEEe-c Q lcl|Aclame:pro 64 ----SLSANQRSF--------FMDIN--KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKS-E 125 (381) Q Consensus 64 ----~lt~~e~~~--------~~~~~--~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~---~~~ip~~-~ 125 (381) ....+.+.+ .+.+. ..+.++||++||+++.+.|++.+++.++|+++|+++++++ ...++.. + T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 160 (395) T protein:vir:38 81 LPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLAD 160 (395) T ss_pred cchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeecc Confidence 000011111 11122 2345579999999999999999999999999999998743 3445544 4 Q ss_pred CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|Aclame:pro 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~ 205 (381) ..+.+.|++|+++.+..+.++|+++++.+||++++++||++||+|+.++|++||.++|++++++++|.+|++|+|++.|. 
T Consensus 161 ~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~ 240 (395) T protein:vir:38 161 ITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKK 240 (395) T ss_pred CCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 45678999998877666679999999999999999999999999999999999999999999999999999999988764 Q ss_pred eeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC Q lcl|Aclame:pro 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~ 285 (381) +.... +. . +..++.. .....|+.+++|+|||.++..++.+ ++ T Consensus 241 ~~~~~----------------------~~---~----i~~~~~~------~l~~~~~~~a~~v~n~~~~~~L~~l---kd 282 (395) T protein:vir:38 241 PTISQ----------------------FD---N----IKDLENN------TLDPAIESTSSFITNQSGYNILSKV---KD 282 (395) T ss_pred ccccc----------------------HH---H----HHHHHHH------hhhhhhcCCCEEEEcHHHHHHHHHh---hc Confidence 32110 00 1 1111110 1134678899999999998888765 67 Q ss_pred CCCceeec---------cCCCceEEecCCCCC------ccEEEEeccc-eEEEecceeeEeeehh--hhhhcCceEEEEE Q lcl|Aclame:pro 286 ANGVYVTA---------LPFNLNVIESTVQEA------GKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAK 347 (381) Q Consensus 286 ~~G~~~~~---------l~~g~~vi~s~~~p~------~~i~~gd~s~-y~i~~r~~~~i~~~~~--~~~~~d~~~~~~~ 347 (381) ++|+|+|. ..+|+||+.+++++. ..++||||++ |++++|++++|+.+++ .+|.+|++.||+. 
T Consensus 283 ~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~ 362 (395) T protein:vir:38 283 ADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFI 362 (395) T ss_pred cCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEE Confidence 78999874 247999999886542 2389999997 7899999999999885 5699999999999 Q ss_pred EEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 348 QFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 348 ~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) .|+|+++.+++||++++++-++..|+ ++..|= T Consensus 363 ~r~d~~~~~~~a~~~~~~~~~~~~~~--~~~~~~ 394 (395) T protein:vir:38 363 DRFDVQLIDDGAFAAASFKTVANQAQ--GTAGTG 394 (395) T ss_pred EeeccEEecccceEEEEeecccCCCC--CccCCC Confidence 99999999999999998876554333 333333 No 58 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=6.7e-55 Score=317.57 Aligned_cols=295 Identities=13% Similarity=0.026 Sum_probs=225.3 Q ss_pred HHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEE Q lcl|Aclame:pro 54 RVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVW 132 (381) Q Consensus 54 ~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w 132 (381) -++...|....+..+|++ ++.+++++.|| +||++++++|++.+++.++|+++|+++++++ ..++|+.++.+.+.| T Consensus 1 ~~~~~~r~~~~~~~~e~~---a~~~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~ 76 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPK---VAQTGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASW 76 (326) T ss_pred CCCCccchhhhcCcchhh---heeccccCCcc-eechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEE Confidence 000011122223344444 34455555555 6999999999999999999999999999865 589999999999999 Q ss_pred 
ecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccc Q lcl|Aclame:pro 133 GKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQ 212 (381) Q Consensus 133 ~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~ 212 (381) ++|+++++ +++++|+++++.+||++++++||+|||+||.+++++||.++|++++++++|.+|++|+|+++|.||++... T Consensus 77 v~Eg~~~~-~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~ 155 (326) T protein:vir:42 77 IGEGDMKP-ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTK 155 (326) T ss_pred ecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc Confidence 99987765 56899999999999999999999999999999999999999999999999999999999999999986543 Q ss_pred cccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceee Q lcl|Aclame:pro 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT 292 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~ 292 (381) .......... ......+..+ .. +...... ....+..+++|+|||+++..++++ ++++|+|++ T Consensus 156 ~~~~~~~~~~---~~~~~~~~~~--~~---~~~~~~~-------~~~~~~~~a~~v~n~~~~~~L~~l---kd~~G~~l~ 217 (326) T protein:vir:42 156 EVSLVDPDGT---GSNADLTVYD--AV---AVNALSL-------LVNAGKKWTHTLLDDITEPILNGA---KDKSGRPLF 217 (326) T ss_pred ccceeecccc---cccccchhHH--HH---HHHHHhh-------hhhhccCccEEEEeHHHHHHHHHh---hccCCceee Confidence 2221111100 0000000000 00 1111110 123456678999999999888875 567788876 Q ss_pred cc--------------CCCceEEecCCCCCccE--EEEeccceEEEecceeeEeeehhhh--------------hhcCce Q lcl|Aclame:pro 293 AL--------------PFNLNVIESTVQEAGKV--LTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMD 342 (381) Q Consensus 293 ~l--------------~~g~~vi~s~~~p~~~i--~~gd~s~y~i~~r~~~~i~~~~~~~--------------~~~d~~ 342 (381) .. 
.+|+||+.++++|++++ +||||++|+++++++++|+++++.+ |.+|++ T Consensus 218 ~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~ 297 (326) T protein:vir:42 218 IESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLV 297 (326) T ss_pred ccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcE Confidence 42 46999999999999874 6899999999999999999999876 888999 Q ss_pred EEEEEEEEcCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 343 ~~~~~~r~dgk~~~~~Af~v~~l~~~~~~ 371 (381) +||+.+|+|+++++++||+.++.+.++.. T Consensus 298 ~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 298 AVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred EEEEEEEeccEEecccceEEEeeccccCC Confidence 99999999999999999999877776655 No 59 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=2.6e-53 Score=308.83 Aligned_cols=341 Identities=9% Similarity=-0.012 Sum_probs=228.4 Q ss_pred CCcc-----HHHHHHHHHHHHHHHHhhh-------hHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHhhhh--h- Q lcl|Aclame:pro 1 MTIN-----LSETFANAKNEFINAVNNG-------EPQERQNELYGDMINQLFEETK---LQAKAEAERVSSLPKS--A- 62 (381) Q Consensus 1 m~~~-----l~~~~~e~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~- 62 (381) |++| |++++.+.++++.+..+.. +..+++.+.+.+..+.+..+.. .+......+.....+. . 
T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 5543 4544444444332222110 0001111111111111111100 0000000000000000 0 Q ss_pred ------ccccHHHHHH----------------HHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ce Q lcl|Aclame:pro 63 ------QSLSANQRSF----------------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RL 119 (381) Q Consensus 63 ------~~lt~~e~~~----------------~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~ 119 (381) ......+++. .+++..+++++||++||+++...|++.+++.++|+++|+++++++ .. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 160 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (404) T ss_pred ccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcc Confidence 0001111111 123456778899999999999999999999999999999998753 33 Q ss_pred E--EEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee Q lcl|Aclame:pro 120 K--FLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL 196 (381) Q Consensus 120 ~--ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l 196 (381) . +++. 
+..+.+.|++|+++.+..++++|+++++.+|+++++++||++||+||.+++++||.++|++++++++|.+|+ T Consensus 161 ~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il 240 (404) T protein:vir:39 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (404) T ss_pred eEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 4443 455778999998887666789999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) +|+|+++|.|.... ... +..++.. .....|+.+++|+|||+++.. T Consensus 241 ~g~g~~~~~~~~~~-------------------------~~~----i~~~~~~------~~~~~~~~~a~~v~n~~~~~~ 285 (404) T protein:vir:39 241 AAMGTVPKKPTIAK-------------------------FDD----VITMINT------SVDPAIIATSSLLTNQSGLNK 285 (404) T ss_pred hccccccccccccc-------------------------HHH----HHHHHHH------hhhhhhccCCEEEEcHHHHHH Confidence 99999887654321 001 1111110 113456778999999999988 Q ss_pred HHhhhhccCCCCceeec---------cCCCceEEecCC--CCC-----ccEEEEeccc-eEEEecceeeEeeehhh--hh Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTA---------LPFNLNVIESTV--QEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LA 337 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~--~p~-----~~i~~gd~s~-y~i~~r~~~~i~~~~~~--~~ 337 (381) ++.+ ++.+|+|++. ..+|+||+.+++ +|. ..++||||++ |.+++|+++++..+++. 
+| T Consensus 286 L~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 362 (404) T protein:vir:39 286 LALV---KTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAF 362 (404) T ss_pred HHHh---hccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhh Confidence 8865 5678888864 237999988654 453 2489999997 77899999999998875 79 Q ss_pred hcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) .+|++.||+.+|+||++.+++||++++++-++......++-- T Consensus 363 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 363 ETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred hhceeeEEEEeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 999999999999999999999999998887765444333222 No 60 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1e-52 Score=305.58 Aligned_cols=349 Identities=9% Similarity=0.016 Sum_probs=234.8 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHH-------H--HHHHHHHHHHHHHHHHHHHHHHH--HH------------------- Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQ-------E--RQNELYGDMINQLFEETKLQAKA--EA------------------- 52 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~-------~--~~~~~~~~~~~~~~~~~~~~~~~--~~------------------- 52 (381) ||..+++.++++++.+.++....+ + ++.+....+++.+.......... .. 
T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 776666666666655444322111 0 11111111122211111100000 00 Q ss_pred --------HHHHHhhhhhccccHHHHHHHHH-H--------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 53 --------ERVSSLPKSAQSLSANQRSFFMD-I--------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 53 --------~~~~~~~~~~~~lt~~e~~~~~~-~--------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) ..........+.+...|++.+.. + ...+.++||++||+++..+|++.+++.++|+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeec Confidence 00000001112233444444321 1 113456799999999999999999999999999999997 Q ss_pred CC---ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 116 GL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~~---~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d 192 (381) ++ ...+|+.++.+.+.|++|+++.+..+.++|+++++.+|+++++++||+|||+||.++|++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:94 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 53 345566677788999999888776678999999999999999999999999999999999999999999999999 Q ss_pred 
hheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|Aclame:pro 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~ 272 (381) .+|++|+|+++|.++............ ..+. . .++.+.+++..+ ...+..+++|+|||+ T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~~~-------~~~~---~----~~~~i~~~~~~~-------~~~~~~~~~~vmn~~ 299 (415) T protein:vir:94 241 KAIIDVITKGSTGSTSSGFEKEGKKLE-------VKKA---K----SLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCccccccccccccccccc-------cccc---c----chHHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 999999999988776543221111110 0011 1 111222222211 123456788999999 Q ss_pred hHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCcc-----EEEEeccc-eEEEecceeeEeeehhhhh Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~~-----i~~gd~s~-y~i~~r~~~~i~~~~~~~~ 337 (381) ++..++.+ ++++|+|++. ..+|+||+.++++|.+. ++||||++ |++++|++++++.++ | T Consensus 300 ~~~~l~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~---~ 373 (415) T protein:vir:94 300 MFAKLDKM---KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD---Y 373 (415) T ss_pred HHHHHHHh---hccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec---c Confidence 98888765 6788998873 24799999999998554 89999998 778999999999887 4 Q ss_pred hcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 338 ~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) ..++++||+.+|+||++++++||++++++-+. 
+++|..--- T Consensus 374 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~---~~~~~~~~~ 414 (415) T protein:vir:94 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSE---RGEGDLGLE 414 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccC---CCCCccccC Confidence 56788999999999999999999999876544 233322111 No 61 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.4e-53 Score=309.06 Aligned_cols=343 Identities=14% Similarity=0.093 Sum_probs=227.4 Q ss_pred CCccHHHH-------HHHHHHHHHHHHhhhhHHHHHH-----------------------H-HHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 1 MTINLSET-------FANAKNEFINAVNNGEPQERQN-----------------------E-LYGDMINQLFEETKL--- 46 (381) Q Consensus 1 m~~~l~~~-------~~e~~~~~~~~~~~~~~~~~~~-----------------------~-~~~~~~~~~~~~~~~--- 46 (381) +. ++.++ +.+.++++.+.+...+....+. + ....+.+.+...... T Consensus 244 ~~-~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~ 322 (632) T protein:vir:96 244 QR-SLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWS 322 (632) T ss_pred hh-hhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchh Confidence 10 11100 1111122222221111100000 0 000000100000000 Q ss_pred HHHH--HHHHHHHhhhhhc---c-ccHHHHHHHHHHhcccCCCCceEccHHH-HHHHHHHHHhhhhhhhh-ceeeec-CC Q lcl|Aclame:pro 47 QAKA--EAERVSSLPKSAQ---S-LSANQRSFFMDINKNVNYKEEKLLPEET-IDRIFEDLTTNHPLLAD-LGIKNA-GL 117 (381) Q Consensus 47 ~~~~--~~~~~~~~~~~~~---~-lt~~e~~~~~~~~~~~~~~gg~lvP~~~-~~~Ii~~l~~~~~l~~~-~~v~~~-~~ 117 (381) .... +..... ..+.++ . ..+.+.-..+++.++++++||+|||+++ ...||+.|+..++++++ +++++. 
+| T Consensus 323 ~a~~~~e~a~~~-a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g 401 (632) T protein:vir:96 323 KAGFEREVSLAI-ADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG 401 (632) T ss_pred hhhhhhHHHHHH-HHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCc Confidence 0000 000000 001110 0 0111111224667788889999999986 57899999999999998 677775 56 Q ss_pred ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee Q lcl|Aclame:pro 118 RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK 197 (381) Q Consensus 118 ~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~ 197 (381) +++||+.++.+.++|++|+++.+ +++++|+++++.+|+++++++||++||+||.+++++||++.|++++++++|.+||+ T Consensus 402 ~~~ip~~~~~~~a~wv~E~~~~~-~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~ 480 (632) T protein:vir:96 402 DVDIPKKTSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT 480 (632) T ss_pred ceEEEEEeCCceeEeecCCcccc-ccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhc Confidence 89999999999999999988765 57899999999999999999999999999999999999999999999999999999 Q ss_pred ccCC-CcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|Aclame:pro 198 GTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 198 G~G~-~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~ 276 (381) |+|+ ++|.||++......... ..+..+ ...+.++...+.. .....+++.|+||+.+... 
T Consensus 481 G~G~~~~p~Gi~~~~~~~~~~~--------~~~~~~-------~~~i~~~~~~i~~-----~~~~~~~~~~~~~~~~~~~ 540 (632) T protein:vir:96 481 GTGLANDPVGLLNMTGVPALTY--------PAGGVD-------WASVVDMETKIST-----FNADAGRLAYLTSVTQRGA 540 (632) T ss_pred ccCCCCccceeeecccccceec--------ccccCC-------HHHHHHHHHHHhh-----cccccCccEEEEchhHHHH Confidence 9995 68999987432111100 001111 1122222222211 1112456789999987765 Q ss_pred HHhhhhccCCCCceeec--cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEE Q lcl|Aclame:pro 277 VQAQYTHLNANGVYVTA--LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKA 354 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~--l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~ 354 (381) ++.. ...+.+|+|+|. ..+|+||+.++.||+++++||||++|+++++++++|.++++.+|.+|++.|++++|+|+++ T Consensus 541 l~~~-~l~d~~G~~i~~~~~l~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v 619 (632) T protein:vir:96 541 AKKA-QVFDNTGERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGV 619 (632) T ss_pred HHHH-hccCCCCceeecCCeecccceEeccccccCcEEEeecceEEEEEecceEEEEccccccccCceEEEEEeecCcee Confidence 5542 235778999985 3479999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCcceEEEEEEecc Q lcl|Aclame:pro 355 KDNKVAAVWKLDLKG 369 (381) Q Consensus 355 ~~~~Af~v~~l~~~~ 369 (381) ++++||+++. 
..+ T Consensus 620 ~~~~af~~~k--~~A 632 (632) T protein:vir:96 620 RRKEAFCIAK--KGA 632 (632) T ss_pred echhhhhhee--ecC Confidence 9999999853 333 No 62 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=2.8e-54 Score=314.19 Aligned_cols=272 Identities=11% Similarity=0.061 Sum_probs=222.5 Q ss_pred hhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eEEEEecCCcceEEeccccc Q lcl|Aclame:pro 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGE 138 (381) Q Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~-~~ip~~~~~~~a~w~~e~~~ 138 (381) .+ +|+....+.++||++||++++++|++.+++.++|+++|+++++++. ..+|+.++ +.+.|++|+++ T Consensus 1 ~g-----------~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~ 68 (299) T protein:vir:41 1 MG-----------FNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG-VGAFWVDEAER 68 (299) T ss_pred CC-----------cCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC-CceeeeecCcc Confidence 11 1344555667889999999999999999999999999999998765 77887765 67999999777 Q ss_pred ccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccc Q lcl|Aclame:pro 139 IKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVT 218 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~ 218 (381) .+ +++++|+++++.+||++++++||+|+|+||.+++++||.+.|++++++++|.+|++|+|+++|.||++......+.. 
T Consensus 69 ~~-~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~ 147 (299) T protein:vir:41 69 IQ-TSKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLV 147 (299) T ss_pred cc-ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceee Confidence 65 56899999999999999999999999999999999999999999999999999999999999999997544332221 Q ss_pred cccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec----- Q lcl|Aclame:pro 219 EGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----- 293 (381) Q Consensus 219 ~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~----- 293 (381) .. ++.+ .+.+.+++..+ ...+..+++|+|||.++..++++ ++.+|+|++. T Consensus 148 ~~--------~~~~-------~~~l~~~~~~l-------~~~~~~~~~~v~n~~~~~~L~~l---kd~~G~~l~~~~~~~ 202 (299) T protein:vir:41 148 EE--------TANK-------YDDLNEAIGLI-------EAEDLEPNGIATIRKQRVKYRST---KDGNGMPIFNTATSN 202 (299) T ss_pred cc--------cccc-------HHHHHHHHHhh-------hcccCCcCEEEEcHHHHHHHHHh---hccCCceeecCCcCC Confidence 11 1111 11122222211 22345677899999999888876 5678888764 Q ss_pred ---cCCCceEEecCCCCCcc----EEEEeccceEEEecceeeEeeehhhh--------------hhcCceEEEEEEEEcC Q lcl|Aclame:pro 294 ---LPFNLNVIESTVQEAGK----VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAKQFAYG 352 (381) Q Consensus 294 ---l~~g~~vi~s~~~p~~~----i~~gd~s~y~i~~r~~~~i~~~~~~~--------------~~~d~~~~~~~~r~dg 352 (381) ..+|+||+.+++||.++ ++||||++|++++|++++++++++.+ |.+|++.||+.+|+|+ T Consensus 203 ~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~ 282 (299) T protein:vir:41 203 GVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGF 282 (299) T ss_pred CCceecceeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 23799999999999876 89999999999999999999999865 7899999999999999 Q ss_pred EEecCcceEEEEEEecc Q lcl|Aclame:pro 353 KAKDNKVAAVWKLDLKG 369 (381) Q Consensus 353 
k~~~~~Af~v~~l~~~~ 369 (381) ++.+++||++++.+-+. T Consensus 283 ~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 283 MVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEecccceEEEEeccCC Confidence 99999999999888877 No 63 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.6e-53 Score=310.00 Aligned_cols=280 Identities=11% Similarity=0.021 Sum_probs=217.7 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecccccccccccccccceeccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~ 154 (381) |.+++++.||++||+++..+|++.+++.++||++|+++++++ ..++|+.++.+.|.|++|+++++ +++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCcccc-ccccceeeeEeee Confidence 888999999999999999999999999999999999999864 58999999999999999987665 5789999999999 Q ss_pred eeeeeehhhhHHHHhcChhH----HHHHHHHHHHHHHHHHHhhheeeccCCC--c-ceeeeeccccccccccccccccch Q lcl|Aclame:pro 155 NKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTGKD--Q-PIGLNRQVQKGVSVTEGAYPEKEE 227 (381) Q Consensus 155 ~kl~~~~~iS~ell~ds~~~----l~~~i~~~la~a~a~~~d~a~l~G~G~~--q-P~Gil~~~~~~~~~~~~~~~~~~~ 227 (381) ||++++++||+|||+|+..+ |+++|.+++++++++++|.+|++|+|.. + |.|+.+.+....... . 
T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~--------~ 151 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIV--------D 151 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccccccee--------e Confidence 99999999999999988876 7899999999999999999999998743 2 344443221111100 0 Q ss_pred hhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc--cCCCCceeec--------cCCC Q lcl|Aclame:pro 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LNANGVYVTA--------LPFN 297 (381) Q Consensus 228 ~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~--~~~~G~~~~~--------l~~g 297 (381) . +.. ....+..++..+. ...+..+..|+|||.++..++++.+. ++.+|+|+|. ..+| T Consensus 152 ~---~~~----~~~d~~~~~~~~~------~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G 218 (315) T protein:vir:80 152 A---TDS----ATADLVKAVGLIA------GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRG 218 (315) T ss_pred c---ccc----chHHHHHHHHHHh------hccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecc Confidence 0 000 0111222222221 12233455799999999999887653 3456676652 3479 Q ss_pred ceEEecCCCCCc---------cEEEEeccceEEEecceeeEeeehhh--------hhhcCceEEEEEEEEcCEEecCcce Q lcl|Aclame:pro 298 LNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVA 360 (381) Q Consensus 298 ~~vi~s~~~p~~---------~i~~gd~s~y~i~~r~~~~i~~~~~~--------~~~~d~~~~~~~~r~dgk~~~~~Af 360 (381) +||+.+++||.+ .++||||++|+++.|++++++++++. 
+|.+|+++||+.+|+|+++++++|| T Consensus 219 ~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~ 298 (315) T protein:vir:80 219 LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSF 298 (315) T ss_pred eeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccce Confidence 999999999854 37899999999999999999988763 5899999999999999999999999 Q ss_pred EEEEEEecccccCCCCCC Q lcl|Aclame:pro 361 AVWKLDLKGHKPALEGTE 378 (381) Q Consensus 361 ~v~~l~~~~~~~~~~~~~ 378 (381) ++++-+.+ +++.|..+- T Consensus 299 ~~l~~~~a-~~~~~~~~~ 315 (315) T protein:vir:80 299 AVVKEKAA-PKPNPPAEN 315 (315) T ss_pred EEEeeccC-CCCCCCCCC Confidence 99877664 444444433 No 64 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.5e-53 Score=310.19 Aligned_cols=301 Identities=15% Similarity=0.068 Sum_probs=225.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |.+. +..+. +..+..... 
.+.+.+++.......+||++||+++.++|++.+++.++|+++|+++++ T Consensus 1 ~~~~--~~~~~----~~~~f~~~~--------~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~ 66 (324) T protein:vir:97 1 MEQT--QKLKL----NLQHFASNN--------VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred Cccc--hhHHH----HHHHHHHhh--------hhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeec Confidence 0000 00000 000111000 111223344445567799999999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..++|+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|||+|+.+++++||++.+++++++++|++ T Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a 145 (324) T protein:vir:97 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 75 48999998889999999987765 57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) |++|+|++ +|.||++.......... +..+ .. 
.+.++...+ ...+..+.+|+|||.+ T Consensus 146 ~l~G~g~~~~~~gi~~~~~~~~~~~~---------~~~~---~~----~i~~~~~~l-------~~~~~~~~~~v~n~~~ 202 (324) T protein:vir:97 146 GILNQGNNPFGKSIAQSIEKTNKVIK---------GDFT---QD----NIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred hhccCCCCccCccccccccccceecc---------ccCC---HH----HHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999976 78999865433222111 1111 11 122222222 2245667789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCCCC--CccEEEEeccceEEEecceeeEeeehhhh---------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~p--~~~i~~gd~s~y~i~~r~~~~i~~~~~~~---------- 336 (381) +..++.+ ++++|+|++. ..+|+||+.++.++ ++.++||||++|++++|++++|++++|.. T Consensus 203 ~~~L~~l---kd~~g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:97 203 RSLLRKI---VDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hcCCCceeecCCCCccccceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccccccccccccc Confidence 9888765 5667888754 23799999887654 56799999999999999999999998853 Q ss_pred ----hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 337 ----~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) |.+|++.||+.+|+|+++++++||++++...... 
+++.+.- T Consensus 280 ~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~-~~~~~~~ 324 (324) T protein:vir:97 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT-DSVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCC-CCCCCCC Confidence 8899999999999999999999999988877543 3333333 No 65 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=5.2e-54 Score=312.69 Aligned_cols=336 Identities=12% Similarity=0.030 Sum_probs=225.2 Q ss_pred CCccHHHHHHHHHHHH--HHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHh Q lcl|Aclame:pro 1 MTINLSETFANAKNEF--INA-VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDIN 77 (381) Q Consensus 1 m~~~l~~~~~e~~~~~--~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~ 77 (381) |.. +..+-.+.. ... +...+.+..+-..+.++...+........+. .....+ .+...+ .-.++. T Consensus 1 ~a~----~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a----~~~a~~---~~~~~~--~~~a~~ 67 (366) T protein:vir:57 1 MAA----AVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADA----AKFAAT---ELGDTG--LSMAIS 67 (366) T ss_pred Ccc----cccccccccccccccccccccccccchhHHHHHHHHHhcccchhHH----HHHHHH---hhcchh--hhhhcc Confidence 111 111110000 000 0000000111111222222221111111110 001110 000111 111333 Q ss_pred cccCCCCceEccHHHHHHHHHHHHhhhhhhhh-ceeeec-CCceEEEEecCCcceEEecccccccccccccccceeccce Q lcl|Aclame:pro 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQN 155 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~-~~v~~~-~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~ 155 (381) + +.++||++||+++.++|++.+++.++++++ ++++++ ++.+++|+.++.+.+.|++|+++.+ +++++|+++++.+| T Consensus 68 ~-~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~-~s~~~f~~i~~~~~ 145 (366) T protein:vir:57 68 T-AAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVV-ATGATFDDVKLSAK 145 (366) T ss_pred 
c-cccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeEEEEeeE Confidence 3 345799999999999999999999999998 888886 5679999999999999999987765 57899999999999 Q ss_pred eeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC-cceeeeeccccccccccccccccchhhhcccc Q lcl|Aclame:pro 156 KLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFA 234 (381) Q Consensus 156 kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~ 234 (381) |++++++||+|||+||.+++++||+++|++++++++|.+|++|+|++ +|.||++............ ....+.. T Consensus 146 k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~------~t~~~~~ 219 (366) T protein:vir:57 146 TMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWT------GTAINLT 219 (366) T ss_pred EEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeecc------ccccchh Confidence 99999999999999999999999999999999999999999999974 9999986543221111100 0001111 Q ss_pred ChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec-----cCCCceEEecCCCCCc Q lcl|Aclame:pro 235 NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEAG 309 (381) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~p~~ 309 (381) ......+.+. .. ......+..++.|+|||.++..++++ ++++|+|+|. 
..+|+||+.+++||++ T Consensus 220 ~~~~~~~~~~---~~-----~~~~~~~~~~a~~vmn~~~~~~L~~l---kd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~ 288 (366) T protein:vir:57 220 TIDEYLDSLI---LK-----HMDSNSNMIRCGWGLSNRTYMTLFGL---RDGNGNKVYPEMSQGILKGYPIQRTSAIPAN 288 (366) T ss_pred hHHHHHHHHH---Hh-----hhccccccccCEEEecHHHHHHHHhh---hccCCceeccCCCCCeecceeeEEccccccc Confidence 1111111111 11 11234567789999999999888875 5788999874 2479999999999963 Q ss_pred --------cEEEEeccceEEEecceeeEeeehhh-----------hhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 310 --------KVLTYVKGLYDGYLAGGINVQKFKET-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 310 --------~i~~gd~s~y~i~~r~~~~i~~~~~~-----------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) .++||||++|+|++|++++|+++++. .|.+|+++||+.+|+|+++.+++||++++=-.= T Consensus 289 ~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 289 LGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 38999999999999999999998874 367899999999999999999999998643222 No 66 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=2.6e-53 Score=308.91 Aligned_cols=289 Identities=13% Similarity=0.011 Sum_probs=224.8 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecccc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYG 137 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~~ 137 (381) ...+.....+.+ ++...+++.+|.+||+++.++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++|++ T Consensus 1 ~~~~~~~~~~~~----~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~ 76 (320) T protein:vir:10 1 
MAAGTAFQVDHA----QIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGD 76 (320) T ss_pred CCCCccCCHHHH----HhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCc Confidence 112233333433 3444556677779999999999999999999999999999865 58999998888999999987 Q ss_pred cccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccccc Q lcl|Aclame:pro 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSV 217 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~ 217 (381) +++ +++++|+++++.+||++++++||+|+|+||.+++++||.+.+++++++++|++|++|+|+++|.|++......... T Consensus 77 ~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 155 (320) T protein:vir:10 77 MKP-ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLA 155 (320) T ss_pred ccc-ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccce Confidence 765 5789999999999999999999999999999999999999999999999999999999999998886543332222 Q ss_pred ccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---- Q lcl|Aclame:pro 218 TEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---- 293 (381) Q Consensus 218 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---- 293 (381) ..+ ..+..+.......+.+.... ....++.+++|+|||.++..++.+ ++++|+|++. 
T Consensus 156 ~~~---------~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~v~n~~~~~~L~~l---kd~~G~~l~~~~~~ 216 (320) T protein:vir:10 156 DPG---------GATASDLTAYDAVAVNGLSL-------LVNAKKKWTHTLLDDIVEPILNGA---KDKNGRPLFIESTY 216 (320) T ss_pred ecc---------cccccccccHHHHHHHHHhh-------hhcccCCCcEEEEcHHHHHHHHHh---hccCCceeeccccc Confidence 111 11111111111112222111 134567788999999999888875 5667887763 Q ss_pred ----------cCCCceEEecCCCCCcc--EEEEeccceEEEecceeeEeeehhhh--------------hhcCceEEEEE Q lcl|Aclame:pro 294 ----------LPFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAK 347 (381) Q Consensus 294 ----------l~~g~~vi~s~~~p~~~--i~~gd~s~y~i~~r~~~~i~~~~~~~--------------~~~d~~~~~~~ 347 (381) ..+|+||+.+++||+++ ++||||++|++++|+++++++++|.+ |.+|++.||+. T Consensus 217 ~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 296 (320) T protein:vir:10 217 TDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVE 296 (320) T ss_pred cCccccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEE Confidence 24689999999999887 56899999999999999999998865 88999999999 Q ss_pred EEEcCEEecCcceEEEEEEecccccC Q lcl|Aclame:pro 348 QFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 348 ~r~dgk~~~~~Af~v~~l~~~~~~~~ 373 (381) +|+|+++++++||++++-.. 
+ ++| T Consensus 297 ~~~d~~v~~~~a~~~l~~~~-a-p~~ 320 (320) T protein:vir:10 297 AEYAFHNNDKDAFVKLTNVV-T-PDA 320 (320) T ss_pred EeeccEEecccceEEEEecc-C-CCC Confidence 99999999999999876333 3 333 No 67 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=3.9e-53 Score=307.93 Aligned_cols=287 Identities=12% Similarity=0.020 Sum_probs=227.8 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecccc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYG 137 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~~ 137 (381) .+.+..+..+++. +...+++++|.+||+++.++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++|++ T Consensus 1 ~~~~~~~~~e~~~----~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~ 76 (318) T protein:vir:24 1 MAAGTAFAVDHAQ----IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGD 76 (318) T ss_pred CCCCCCCCHHHHH----hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCc Confidence 2333555666653 344556678889999999999999999999999999999865 48999999999999999987 Q ss_pred cccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccccc Q lcl|Aclame:pro 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSV 217 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~ 217 (381) +++ +++++|+++++.+||+++++++|+|+|+||.+++++||+++|++++++++|.+|++|+|+++|.|++......... 
T Consensus 77 ~~~-~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 155 (318) T protein:vir:24 77 MKP-ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIA 155 (318) T ss_pred ccc-ccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCccccccccccccc Confidence 765 5789999999999999999999999999999999999999999999999999999999999999998643221111 Q ss_pred ccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---- Q lcl|Aclame:pro 218 TEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---- 293 (381) Q Consensus 218 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---- 293 (381) ... ... + .....+...... ....+..+.+|+|||+++..++.+ ++++|+|++. T Consensus 156 ~~~------~~~--~-----~~~~~~~~~~~~-------~~~~~~~~~~~v~n~~~~~~L~~l---kd~~G~~l~~~~~~ 212 (318) T protein:vir:24 156 DTT------GAT--T-----VYDQVAVNGLSL-------LVNDGKKWTHTLLDDITEPILNGA---KDQNGRPLFIESTY 212 (318) T ss_pred ccc------ccc--c-----hHHHHHHHHHHh-------hccccCCCCEEEEcHHHHHHHHHh---hccCCceeecCccc Confidence 000 000 0 001111111111 123466778999999999888765 5678888764 Q ss_pred ----------cCCCceEEecCCCCCcc--EEEEeccceEEEecceeeEeeehhhh--------------hhcCceEEEEE Q lcl|Aclame:pro 294 ----------LPFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAK 347 (381) Q Consensus 294 ----------l~~g~~vi~s~~~p~~~--i~~gd~s~y~i~~r~~~~i~~~~~~~--------------~~~d~~~~~~~ 347 (381) .++|+||+.++++|+++ ++||||++|+++++++++|+.+++.+ |.+|++.||+. 
T Consensus 213 ~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 292 (318) T protein:vir:24 213 GEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVE 292 (318) T ss_pred cCccccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEE Confidence 24688999999999776 57999999999999999999999865 88999999999 Q ss_pred EEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 348 QFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 348 ~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) +|+|+++.+++||++++...++.. || T Consensus 293 ~r~d~~v~~~~a~~~i~~~~a~~~---~~ 318 (318) T protein:vir:24 293 AEYAFHCNDAEAFVALTNVVSGGG---EG 318 (318) T ss_pred EEEccEEecccceEEEEeeccCCC---CC Confidence 999999999999999777665522 33 No 68 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1e-51 Score=300.09 Aligned_cols=323 Identities=13% Similarity=0.025 Sum_probs=217.2 Q ss_pred CCccHHHHHHHHH---HHHHHHHhhhhHH----------HHH-------HHHHHHH---HHHHHHHHHHHHHH-HHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAK---NEFINAVNNGEPQ----------ERQ-------NELYGDM---INQLFEETKLQAKA-EAERVS 56 (381) Q Consensus 1 m~~~l~~~~~e~~---~~~~~~~~~~~~~----------~~~-------~~~~~~~---~~~~~~~~~~~~~~-~~~~~~ 56 (381) |+| .+++++.. +++.+.++....+ ++. .+.+... ++.+.+........ +..+.. 
T Consensus 1 ~~l--~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~ 78 (400) T protein:vir:38 1 MTL--DEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQS 78 (400) T ss_pred CCh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 444 44444333 3332222211110 000 0001111 11110000000000 000000 Q ss_pred Hhhh-----------------------------------hhccccHHHHHHHHHHhcc-cCCCCceEccHHHHHHHHHHH Q lcl|Aclame:pro 57 SLPK-----------------------------------SAQSLSANQRSFFMDINKN-VNYKEEKLLPEETIDRIFEDL 100 (381) Q Consensus 57 ~~~~-----------------------------------~~~~lt~~e~~~~~~~~~~-~~~~gg~lvP~~~~~~Ii~~l 100 (381) .... ........+.++...+..+ ++++||++||+++.+.|++.+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~ 158 (400) T protein:vir:38 79 SGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQREL 158 (400) T ss_pred ccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHH Confidence 0000 0000000011111223333 567899999999999999999 Q ss_pred HhhhhhhhhceeeecCC-ceEEEEec-CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHH Q lcl|Aclame:pro 101 TTNHPLLADLGIKNAGL-RLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERF 178 (381) Q Consensus 101 ~~~~~l~~~~~v~~~~~-~~~ip~~~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~ 178 (381) +++++|+++|+++++++ ..++|+.. 
+.+.+.|+.|+++.+..++|+|++|++.+|+++++++||+|||+||.+++++| T Consensus 159 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~ 238 (400) T protein:vir:38 159 QTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGL 238 (400) T ss_pred HhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHH Confidence 99999999999999864 57888754 45678898888887777799999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 179 VRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKS 258 (381) Q Consensus 179 i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (381) |.++++++++.+++.+|++|+|.++|.|+.+. +.+..++... T Consensus 239 i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~------------------------------~~~~~~~~~~-------- 280 (400) T protein:vir:38 239 IAQNGQQIKVNTTNGAVATLLKGFTAKTISSV------------------------------DDLKHINNVD-------- 280 (400) T ss_pred HHHHHHHHHHHHHHHhhhhccccccccccccH------------------------------HHHHHHHHhh-------- Confidence 99999999999999999999998776655310 0111111100 Q ss_pred ccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCc-----cEEEEeccc-eEEEe Q lcl|Aclame:pro 259 VAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYL 323 (381) Q Consensus 259 ~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~-----~i~~gd~s~-y~i~~ 323 (381) .....+++|+|||.++..++.+ ++++|+|+|. 
..+|+||+.++++|.+ .++||||++ |++++ T Consensus 281 ~~~~~~a~~v~~~~~~~~l~~l---kd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~ 357 (400) T protein:vir:38 281 LDPAYSRVIIASQSFYNFLDTV---KDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFAN 357 (400) T ss_pred hhhhhCcEEEEcHHHHHHHHHh---hccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEe Confidence 0112367899999998888765 6788999984 2479999999998843 279999998 77899 Q ss_pred cceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 324 AGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 324 r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~ 369 (381) |++++++++++.+|. ++||+++|+||++++++||+.++++-++ T Consensus 358 ~~~~~~~~~~~~~~~---~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 358 RADFMVRWVDDQIYG---QFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ecceEEEEecccccc---eeEEEEEEeccEEecccceEEEEeecCC Confidence 999999999987764 5899999999999999999998886655 No 69 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=1.4e-51 Score=299.30 Aligned_cols=348 Identities=12% Similarity=0.079 Sum_probs=215.5 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHH--------HHHHHHHH---HHHHHHHHHHHH-HHHHHHHHHHHhh---h----- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYG---DMINQLFEETKL-QAKAEAERVSSLP---K----- 60 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~--------~~~~~~~~---~~~~~~~~~~~~-~~~~~~~~~~~~~---~----- 60 (381) ||. ..+++|.+.++.+..+..... ++..+... ...+.+..+... +...+........ . 
T Consensus 1 m~~--~~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (419) T protein:vir:94 1 MPP--TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) T ss_pred CCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 554 344444444433322111110 11111111 111111110000 0000000000000 0 Q ss_pred -hhcc----------------------ccHHHHHHHHHH------hccc-CCCCceEccHHHHHHHHHHHHhhhhhhhhc Q lcl|Aclame:pro 61 -SAQS----------------------LSANQRSFFMDI------NKNV-NYKEEKLLPEETIDRIFEDLTTNHPLLADL 110 (381) Q Consensus 61 -~~~~----------------------lt~~e~~~~~~~------~~~~-~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~ 110 (381) ..+. .....+...+.. ..++ ...|++++|+.+...|+...+....++++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~ 158 (419) T protein:vir:94 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) T ss_pred ccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcc Confidence 0000 000000010111 1122 234445666676666666777778899999 Q ss_pred eeeecCC-ceEEEEec--------CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHH Q lcl|Aclame:pro 111 GIKNAGL-RLKFLKSE--------TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV 181 (381) Q Consensus 111 ~v~~~~~-~~~ip~~~--------~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~ 181 (381) ++.++++ ..++|+.+ +.+.+.|++|++.. ++++++|+++++.+|+++++++||+|||+|+. 
++++||.+ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~ 236 (419) T protein:vir:94 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAK-PQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) T ss_pred eeeeccCCceeeeeeccccccccccCcccceecCCccc-cccccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHHH Confidence 9999765 46777643 34557899997765 46789999999999999999999999999975 79999999 Q ss_pred HHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 182 QIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAV 261 (381) Q Consensus 182 ~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 (381) +|++++++++|.+||+|+|+++|+||++............ . ...........+..++..+ ...+ T Consensus 237 ~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~----~-----~~~t~~~~~~~l~~~~~~~-------~~~~ 300 (419) T protein:vir:94 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP----T-----APATDEPPLVDIRRAKTVA-------EIAG 300 (419) T ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccc----c-----cccccchhHHHHHHHHHhh-------hhcc Confidence 9999999999999999999999999997543222211110 0 0111111222222222221 2334 Q ss_pred cCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEee Q lcl|Aclame:pro 262 KGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQK 331 (381) Q Consensus 262 ~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~ 331 (381) .++.+|+|||.++..++.+.+. .+|.|.+. ..+|+||+++++||+++++||||++ |++++|++++++. 
T Consensus 301 ~~~~~~v~n~~~~~~l~~~k~~--~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~ 378 (419) T protein:vir:94 301 FPPDGVVVHPQDWESIELDQAP--GSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLM 378 (419) T ss_pred CCCCEEEEcHHHHHHHHHHhhc--CCCceeecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEEE Confidence 5667899999999888776432 22333321 3479999999999999999999998 7899999999999 Q ss_pred ehhh--hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 332 FKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 332 ~~~~--~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) +++. +|.+|+++||+.+|+||++++++||+++++.=+.. T Consensus 379 ~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 379 TDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred eccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 8875 59999999999999999999999999876554332 No 70 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1.1e-52 Score=305.41 Aligned_cols=288 Identities=13% Similarity=0.027 Sum_probs=221.8 Q ss_pred ccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecccccccc Q lcl|Aclame:pro 63 QSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKG 141 (381) Q Consensus 63 ~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~~~~~~ 141 (381) ....++.+. 
+...+++++|.+||++++++|++.+++.++|+++++++++++ ..++|+.++.+.+.|++|+++++ T Consensus 1 ~g~~~e~~~----~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~- 75 (397) T protein:vir:23 1 MGFSADHSQ----IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKP- 75 (397) T ss_pred CCcCHHHHH----HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcccc- Confidence 222334332 222333444446777789999999999999999999999876 58999999899999999977765 Q ss_pred cccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccccccccc Q lcl|Aclame:pro 142 QLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGA 221 (381) Q Consensus 142 ~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~ 221 (381) +++++|+++++.+||++++++||+|||+|+.+++++||+++|++++++++|++|++|+|+++|.+.+........... T Consensus 76 ~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~-- 153 (397) T protein:vir:23 76 ITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSIS-- 153 (397) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeec-- Confidence 578999999999999999999999999999999999999999999999999999999999876554432221111100 Q ss_pred ccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec-------- Q lcl|Aclame:pro 222 YPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-------- 293 (381) Q Consensus 222 ~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-------- 293 (381) .. .......+.+..+ ...+..++.|+||++++..++++ ++++|+|++. 
T Consensus 154 -------~~---~~~~~~~~~~~~l-----------~~~~~~~a~~vmn~~~~~~L~~l---kd~~G~~i~~~~~~~~~~ 209 (397) T protein:vir:23 154 -------PN---AYQGLGVSGLTKL-----------VTDGKKWTHTLLDDTVEPVLNGS---VDANGRPLFVESTYESLT 209 (397) T ss_pred -------cc---chhHHHHHHHHhh-----------hhcccCCCEEEEcHHHHHHHHHh---hccCCceeeccccccccc Confidence 00 0111111111111 23456678999999999988875 6678888764 Q ss_pred ------cCCCceEEecCCCCCccE--EEEeccceEEEecceeeEeeehhhh--------------hhcCceEEEEEEEEc Q lcl|Aclame:pro 294 ------LPFNLNVIESTVQEAGKV--LTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAKQFAY 351 (381) Q Consensus 294 ------l~~g~~vi~s~~~p~~~i--~~gd~s~y~i~~r~~~~i~~~~~~~--------------~~~d~~~~~~~~r~d 351 (381) ..+|+||+.+++||++++ +||||++|++++++++.+++++|.+ |.+|++.||+.+|+| T Consensus 210 ~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d 289 (397) T protein:vir:23 210 TPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYG 289 (397) T ss_pred ccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeec Confidence 136899999999998874 7899999999999999999998864 889999999999999 Q ss_pred CEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 352 GKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 352 gk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) +++++++||+.++.+..+.+..+..++.|= T Consensus 290 ~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (397) T protein:vir:23 290 LLINDVNAFVKLTFDPVLTTYALDLDGASA 319 (397) T ss_pred cceecccceEEEeeccccceeeecccccCc Confidence 999999999999987766555544443333 No 71 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=3.7e-51 Score=297.04 Aligned_cols=341 Identities=9% Similarity=-0.058 Sum_probs=222.1 Q ss_pred CCccH-HHHHHHHHHHHHHHHhh-------hhHH--HHHHHHHHHHHHHHHHHHHHHHH---HHHHH-----HHHhhhhh Q lcl|Aclame:pro 1 
MTINL-SETFANAKNEFINAVNN-------GEPQ--ERQNELYGDMINQLFEETKLQAK---AEAER-----VSSLPKSA 62 (381) Q Consensus 1 m~~~l-~~~~~e~~~~~~~~~~~-------~~~~--~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-----~~~~~~~~ 62 (381) |+|+. -+++.++++++.+..+. ...+ .++.+......+.+..+...... ..... ........ T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 77643 22333444444333211 1000 01111111122222211111000 00000 00000000 Q ss_pred c----cccHHH-----HHHHH-----------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEE Q lcl|Aclame:pro 63 Q----SLSANQ-----RSFFM-----------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKF 121 (381) Q Consensus 63 ~----~lt~~e-----~~~~~-----------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~i 121 (381) + .....+ ++.+. .....++++||++||+++...|++.+++.++|+++|+++++++ ..++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~ 160 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKM 160 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEE Confidence 0 001111 11110 1122456789999999999999999999999999999999864 5788 Q ss_pred EEecCCcc--eEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeecc Q lcl|Aclame:pro 122 LKSETSGV--AVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT 199 (381) Q Consensus 122 p~~~~~~~--a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~ 199 (381) |+...... +.|++|+++. 
++++++|+++++.+|+++++++||+|||+||.++|++||+++|++++++++|.++ T Consensus 161 ~~~~~~~~~~~~~~~E~~~~-~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i---- 235 (421) T protein:vir:13 161 PVRAGASVDKLANLAKDTEL-VKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEI---- 235 (421) T ss_pred EEeecCCccceeeccccccc-cccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhH---- Confidence 87655544 5567887665 4579999999999999999999999999999999999999999999999887554 Q ss_pred CCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 200 GKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 200 G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) .++|.|+++.... .+ ... +.+++..+ ...|..++.|+|||.++..++. T Consensus 236 -~~~~~g~~~~~~~-----------------~~---~d~----i~~~~~~l-------~~~~~~~a~~v~n~~~~~~l~~ 283 (421) T protein:vir:13 236 -VKQAKAVLAEETI-----------------ND---YAG----LVKTINSL-------VPNARKRAIIVTNSDGRAYLDG 283 (421) T ss_pred -hhhhhhccccccc-----------------cc---hHH----HHHHHHHh-------hhhhcCCCEEEEcHHHHHHHHH Confidence 4678998742110 01 111 12222111 2346678899999999888876 Q ss_pred hhhccCCCCceeec--------cCCCceEEecCCCCCc-----cEEEEeccc-eEEEecceeeEeeehhhhhhcCceEEE Q lcl|Aclame:pro 280 QYTHLNANGVYVTA--------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYT 345 (381) Q Consensus 280 ~~~~~~~~G~~~~~--------l~~g~~vi~s~~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~ 345 (381) + ++++|+|++. 
..+|+||++++++|.+ .++||||++ |++++|++++|+++++.+|.+|+++|| T Consensus 284 l---kd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r 360 (421) T protein:vir:13 284 L---MDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIAR 360 (421) T ss_pred h---hcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeEEE Confidence 5 6789999975 3479999999999854 379999998 779999999999999999999999999 Q ss_pred EEEEEcCEEecCcceEEEEEEecc------cccCCCCCCCCC Q lcl|Aclame:pro 346 AKQFAYGKAKDNKVAAVWKLDLKG------HKPALEGTEETL 381 (381) Q Consensus 346 ~~~r~dgk~~~~~Af~v~~l~~~~------~~~~~~~~~~~~ 381 (381) +.+|+||++++++||+.+...-.+ .+|++..+.++. T Consensus 361 ~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~~~~~ 402 (421) T protein:vir:13 361 IIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPRSGKN 402 (421) T ss_pred EEeeecceeecchhhheeeecccceeeccccccCCCCcCCCC Confidence 999999999999998765443211 223333333433 No 72 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=8.1e-53 Score=306.18 Aligned_cols=279 Identities=13% Similarity=0.043 Sum_probs=220.4 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecc Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e 135 (381) |+ .++ +++.+..++++||++||+++.++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++| T Consensus 1 ma--------~~~---~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:10 1 MA--------TPT---YTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Cc--------ccc---cccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 11 111 
134455667789999999999999999999999999999999865 589999988889999999 Q ss_pred cccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccc Q lcl|Aclame:pro 136 YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGV 215 (381) Q Consensus 136 ~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~ 215 (381) +++.+ +++++|+++++.+||++++++||+|||+||.+++++||+++|++++++++|.+|++|+|+++|.|++....... T Consensus 70 ~~~~~-~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~ 148 (304) T protein:vir:10 70 TERIQ-TSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEG 148 (304) T ss_pred Ccccc-cccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccc Confidence 87765 57899999999999999999999999999999999999999999999999999999999999988753211100 Q ss_pred ccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeecc- Q lcl|Aclame:pro 216 SVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL- 294 (381) Q Consensus 216 ~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l- 294 (381) .... ..+..........+.+++..+ ...+..+++|+|||.++..++++ ++++|+|++.. T Consensus 149 ~~~~----------~~~~~~~~~~~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~l---kd~~G~~l~~~~ 208 (304) T protein:vir:10 149 AEEK----------GNVVTDTNNLYVDLSALMATI-------EDEELDPNGVLTTRSFRSKMRNA---LDANDRPLFDAN 208 (304) T ss_pred cccc----------ccccccccchHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHHh---hccCCcEeecCC Confidence 0000 000111111222233332222 23456678899999999988875 57789998753 Q ss_pred ---CCCceEEecCCCCCc----cEEEEeccceEEEecceeeEeeehhh----------------hhhcCceEEEEEEEEc Q lcl|Aclame:pro 295 ---PFNLNVIESTVQEAG----KVLTYVKGLYDGYLAGGINVQKFKET----------------LALDDMDLYTAKQFAY 351 (381) Q Consensus 295 ---~~g~~vi~s~~~p~~----~i~~gd~s~y~i~~r~~~~i~~~~~~----------------~~~~d~~~~~~~~r~d 351 (381) .+|+||+.+++||.+ .++||||++|++++|+++++++++|. 
+|.+|++.||+.+|+| T Consensus 209 ~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~ 288 (304) T protein:vir:10 209 GNEIMGLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIA 288 (304) T ss_pred CccccceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEec Confidence 379999999999843 48999999999999999999999884 4999999999999999 Q ss_pred CEEecCcceEEEEEEecc Q lcl|Aclame:pro 352 GKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 352 gk~~~~~Af~v~~l~~~~ 369 (381) +++++++||+++ +.++ T Consensus 289 ~~v~~~~a~~~l--~~a~ 304 (304) T protein:vir:10 289 YMNVKPEAFATL--KPTE 304 (304) T ss_pred cEeecccceEEE--EecC Confidence 999999999875 4433 No 73 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=8.1e-53 Score=306.18 Aligned_cols=279 Identities=13% Similarity=0.043 Sum_probs=220.4 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEecc Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e 135 (381) |+ .++ +++.+..++++||++||+++.++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++| T Consensus 1 ma--------~~~---~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:94 1 MA--------TPT---YTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Cc--------ccc---cccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 11 111 134455667789999999999999999999999999999999865 589999988889999999 Q ss_pred cccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccc Q lcl|Aclame:pro 136 YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGV 215 (381) Q Consensus 136 
~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~ 215 (381) +++.+ +++++|+++++.+||++++++||+|||+||.+++++||+++|++++++++|.+|++|+|+++|.|++....... T Consensus 70 ~~~~~-~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~ 148 (304) T protein:vir:94 70 TERIQ-TSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEG 148 (304) T ss_pred Ccccc-cccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccc Confidence 87765 57899999999999999999999999999999999999999999999999999999999999988753211100 Q ss_pred ccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeecc- Q lcl|Aclame:pro 216 SVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL- 294 (381) Q Consensus 216 ~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l- 294 (381) .... ..+..........+.+++..+ ...+..+++|+|||.++..++++ ++++|+|++.. T Consensus 149 ~~~~----------~~~~~~~~~~~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~l---kd~~G~~l~~~~ 208 (304) T protein:vir:94 149 AEEK----------GNVVTDTNNLYVDLSALMATI-------EDEELDPNGVLTTRSFRSKMRNA---LDANDRPLFDAN 208 (304) T ss_pred cccc----------ccccccccchHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHHh---hccCCcEeecCC Confidence 0000 000111111222233332222 23456678899999999988875 57789998753 Q ss_pred ---CCCceEEecCCCCCc----cEEEEeccceEEEecceeeEeeehhh----------------hhhcCceEEEEEEEEc Q lcl|Aclame:pro 295 ---PFNLNVIESTVQEAG----KVLTYVKGLYDGYLAGGINVQKFKET----------------LALDDMDLYTAKQFAY 351 (381) Q Consensus 295 ---~~g~~vi~s~~~p~~----~i~~gd~s~y~i~~r~~~~i~~~~~~----------------~~~~d~~~~~~~~r~d 351 (381) .+|+||+.+++||.+ .++||||++|++++|+++++++++|. 
+|.+|++.||+.+|+| T Consensus 209 ~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~ 288 (304) T protein:vir:94 209 GNEIMGLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIA 288 (304) T ss_pred CccccceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEec Confidence 379999999999843 48999999999999999999999884 4999999999999999 Q ss_pred CEEecCcceEEEEEEecc Q lcl|Aclame:pro 352 GKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 352 gk~~~~~Af~v~~l~~~~ 369 (381) +++++++||+++ +.++ T Consensus 289 ~~v~~~~a~~~l--~~a~ 304 (304) T protein:vir:94 289 YMNVKPEAFATL--KPTE 304 (304) T ss_pred cEeecccceEEE--EecC Confidence 999999999875 4433 No 74 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=2e-52 Score=303.97 Aligned_cols=283 Identities=14% Similarity=0.024 Sum_probs=217.1 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEeccccccc----ccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIK----GQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~~~~~----~~~~~~f~~v 150 (381) |...++++||++||++++++|++.+++.+||+++++++++++ ..++|+.++.+.+.|++|++..+ +.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 888889999999999999999999999999999999999865 58999999899999999976543 2468999999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q 
Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++.+||++++++||+|||+||.+++++||++++++++++++|.+|++|+|++++.+............ ... ...... T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~--~~~-~~~~~~ 157 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAG--QAV-EVVGGV 157 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccc--ccc-cccccc Confidence 99999999999999999999999999999999999999999999999999765544332111111110 000 000111 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec--cCCCceEEecCCCCC Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA--LPFNLNVIESTVQEA 308 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~--l~~g~~vi~s~~~p~ 308 (381) . ...+.+..+........ ...+.. ..|+|||.++..++++ ++++|+|+|. ..+|+||++++++|. T Consensus 158 ---~---~~~~~~~~~~~~~~~~~---~~~~~~-~~~v~~~~~~~~l~~l---kd~~G~~i~~~~~l~G~Pv~~~~~~~~ 224 (305) T protein:vir:25 158 ---A---NESDIVGATNRAAKAVA---SAGWAP-DTLLSSLALRYEVANI---RDANGNPVFRDDSFAGFRTFFNRNGAW 224 (305) T ss_pred ---h---hhhHHHHHHHHHHHhhh---hccccc-ceeEecHHHHHHHHHh---hccCCceeecCCcccccceEEcCccCC Confidence 1 11111111111111111 111222 3499999999988876 6788999985 347999999999874 Q ss_pred ----ccEEEEeccceEEEecceeeEeeehhh----------hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc-ccC Q lcl|Aclame:pro 309 ----GKVLTYVKGLYDGYLAGGINVQKFKET----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH-KPA 373 (381) Q Consensus 309 ----~~i~~gd~s~y~i~~r~~~~i~~~~~~----------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~-~~~ 373 (381) +.++||||++|+++++++++|+++++. .|.+|++.+|+..|+|+.+++++||+.++..-.+. 
+|+ T Consensus 225 ~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa 304 (305) T protein:vir:25 225 DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) T ss_pred CCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCC Confidence 358999999999999999999998875 47889999999999999999999999887654432 444 Q ss_pred C Q lcl|Aclame:pro 374 L 374 (381) Q Consensus 374 ~ 374 (381) . T Consensus 305 ~ 305 (305) T protein:vir:25 305 A 305 (305) T ss_pred C Confidence 4 No 75 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=3.5e-51 Score=297.19 Aligned_cols=328 Identities=11% Similarity=-0.042 Sum_probs=220.4 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHH---------HHHHHHHHHHHHHHHHHHHHHH--HHHHHH-------------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQE---------RQNELYGDMINQLFEETKLQAK--AEAERV-------------- 55 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-------------- 55 (381) |..+.-+++.+++.++.++++....++ ++.+.....++.+.++...... .+.+.. T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~ 80 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 766555555555555544443322111 1111111111111111110000 000000 Q ss_pred -----------HHhhhhhc---------cccHHHHHHH------HH-HhcccCCCCceEccHHHHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 56 -----------SSLPKSAQ---------SLSANQRSFF------MD-INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA 108 (381) Q Consensus 56 -----------~~~~~~~~---------~lt~~e~~~~------~~-~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~ 108 (381) ....+... ....+..... +. 
....+..+||++||+++.+.|++.+++.++|++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~ 160 (394) T protein:vir:97 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhh Confidence 00000000 0000000000 00 112356679999999999999999999999999 Q ss_pred hceeeecCC-ceEEEEec-CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHH Q lcl|Aclame:pro 109 DLGIKNAGL-RLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEA 186 (381) Q Consensus 109 ~~~v~~~~~-~~~ip~~~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a 186 (381) +|+++++++ ...+|+.. +.+.+.|++|+++.+..++++|++|++.+|+++++++||+|||+||.+++++||.+.++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~ 240 (394) T protein:vir:97 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) T ss_pred hceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHH Confidence 999999754 57888754 5567899999888776678999999999999999999999999999999999999999999 Q ss_pred HHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceE Q lcl|Aclame:pro 187 FAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVT 266 (381) Q Consensus 187 ~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (381) ++++++.+|++|.|++.|.|..+ .. .+.+++... ..+..++. 
T Consensus 241 ~~~~~~~~i~~g~~~~~~~~~~~--------------------------~~----~~~~~~~~~--------~~~~~~a~ 282 (394) T protein:vir:97 241 KVNTTNDAIAKVLKSFTTKTVKN--------------------------LD----EIKALLNGG--------FDPAYNVS 282 (394) T ss_pred HHHHHHHHHhhcccccccccccc--------------------------HH----HHHHHHHhh--------hhhhhCCE Confidence 99999999999988766544321 01 111111111 01123578 Q ss_pred EEEchhhHHHHHhhhhccCCCCceeec---------cCCCceEEec--CCCCCccEEEEeccc-eEEEecceeeEeeehh Q lcl|Aclame:pro 267 MVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIES--TVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE 334 (381) Q Consensus 267 ~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s--~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~ 334 (381) |+|||.++..++.+ ++++|+|+|. ..+|+||+++ ..++++.++||||++ |.+++|++++++.+++ T Consensus 283 ~v~n~~~~~~l~~l---kd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~ 359 (394) T protein:vir:97 283 LIVSQSFYQTLDTL---KDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN 359 (394) T ss_pred EEEcHHHHHHHHHh---hccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecc Confidence 99999998887765 6788999984 3478999874 457778899999998 7899999999999887 Q ss_pred hhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCC Q lcl|Aclame:pro 335 TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 335 ~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~ 375 (381) .++ .++||+++|+||++++++||+.++++- +|.|. 
T Consensus 360 ~~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~---~~~p~ 394 (394) T protein:vir:97 360 EIY---GQYLQAVLRFGVSKVDDKAGYYVTFTP---EPLPL 394 (394) T ss_pred ccc---ceeEEEEEEEccEEecccceEEEEecc---cccCC Confidence 665 568999999999999999999988853 33444 No 76 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.7e-52 Score=303.26 Aligned_cols=270 Identities=12% Similarity=-0.005 Sum_probs=213.6 Q ss_pred cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEecCCcceEEecccccccccccccccceecccee Q lcl|Aclame:pro 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -.+.+.||++||+++.++|++.+++.++|+++|++++++ +..++|+.++.+.+.|++|+++.+ +++++|+++++.+|| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-cccceeeEEEEeeEE Confidence 455668999999999999999999999999999999986 569999999999999999987765 679999999999999 Q ss_pred eeeehhhhHHHHh---cChhHHHHHHHHHHHHHHHHHHhhheeeccCCC---cceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 157 LTAFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 157 l~~~~~iS~ell~---ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~---qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++++++||+|||+ |+.++|+++|++++++++++++|.+|++|+|.+ .|.||++.+.......... . 
T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~--------~ 151 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT--------T 151 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeec--------c Confidence 9999999999995 677889999999999999999999999998643 4678876543322211110 0 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCceEE Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi 301 (381) ....... ..+..++..+. ...+ ....|+|||.++..++++ ++++|+|+|. ..+|+||+ T Consensus 152 ~~~~~~~---~~i~~~~~~~~------~~~~-~~~~~vmn~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~tl~G~Pv~ 218 (311) T protein:vir:81 152 GTSATPD---LAVEAAVGLVL------GDNL-SPDGVALDNTFSFMLATQ---RDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) T ss_pred cccchHH---HHHHHHHHHhh------hcCC-CceEEEEcHHHHHHHHhh---hccCCCeeecCccccCCCceecceeEE Confidence 0001111 11222221111 0111 223599999999988876 6778999874 24799999 Q ss_pred ecCCCCCc------------------cEEEEeccceEEEecceeeEeeehhh-------hhhcCceEEEEEEEEcCEEec Q lcl|Aclame:pro 302 ESTVQEAG------------------KVLTYVKGLYDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKAKD 356 (381) Q Consensus 302 ~s~~~p~~------------------~i~~gd~s~y~i~~r~~~~i~~~~~~-------~~~~d~~~~~~~~r~dgk~~~ 356 (381) .++.||.+ .++||||++|+++.|++++++++++. 
+|.+|++.||+.+|+|+++++ T Consensus 219 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~ 298 (311) T protein:vir:81 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMS 298 (311) T ss_pred ecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeec Confidence 99998843 36899999999999999999988763 599999999999999999999 Q ss_pred CcceEEEEEEecc Q lcl|Aclame:pro 357 NKVAAVWKLDLKG 369 (381) Q Consensus 357 ~~Af~v~~l~~~~ 369 (381) ++||++++-.+.+ T Consensus 299 ~~a~~~l~~a~~~ 311 (311) T protein:vir:81 299 TDAFAVVRDADES 311 (311) T ss_pred ccceEEEEeeccC Confidence 9999998777766 No 77 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=6.2e-51 Score=295.86 Aligned_cols=332 Identities=11% Similarity=-0.036 Sum_probs=219.3 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHH--------HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHhh------h---hh Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQ--------ERQNELYGDMINQLFEETK---LQAKAEAERVSSLP------K---SA 62 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~------~---~~ 62 (381) |+..++..+++++..+++++.-.+ .+..+...+..+.+..+.. .+............ . .. 
T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKG 80 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 755555554444443333221111 0111111111111111110 00000000000000 0 00 Q ss_pred ccc----cHHHHHHH-----------HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEec- Q lcl|Aclame:pro 63 QSL----SANQRSFF-----------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSE- 125 (381) Q Consensus 63 ~~l----t~~e~~~~-----------~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~- 125 (381) ... ...+++.+ +.+..+++++||++||+++...|++.++++++|+++|+++++++ ..++|+.. T Consensus 81 ~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (389) T protein:vir:10 81 TDLSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 160 (389) T ss_pred cccchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec Confidence 000 11122222 23456778899999999999999999999999999999999864 46777754 Q ss_pred CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|Aclame:pro 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~ 205 (381) +.+.+.|+.|+++.+..++++|+++++.+|+++++++||+|||+||.+++++||.++|+++++++++.+|++|+|++.|. 
T Consensus 161 ~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~ 240 (389) T protein:vir:10 161 ATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAK 240 (389) T ss_pred CCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 34556788887887777899999999999999999999999999999999999999999999999999999999987766 Q ss_pred eeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC Q lcl|Aclame:pro 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~ 285 (381) |..+. .... .+..++... .+..| +++|+|||.++..++.+ ++ T Consensus 241 ~~~~~-----------------------~~~d----~l~~~~~~~------~~~~~--~a~~~~n~~~~~~L~~l---kd 282 (389) T protein:vir:10 241 KTTTD-----------------------TLVD----SLKHILNVD------LDPAY--SRALVVTQSLFNTLDTL---KD 282 (389) T ss_pred ccccc-----------------------ccHH----HHHHHHHhh------hhhhh--CcEEEecHHHHHHHHHh---hc Confidence 54211 0011 111111100 01223 46899999998888775 56 Q ss_pred CCCceeec-------------cCCCceEEecCCC-C-C--c--cEEEEeccc-eEEEecceeeEeeehhhhhhcCceEEE Q lcl|Aclame:pro 286 ANGVYVTA-------------LPFNLNVIESTVQ-E-A--G--KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYT 345 (381) Q Consensus 286 ~~G~~~~~-------------l~~g~~vi~s~~~-p-~--~--~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~ 345 (381) ++|+|+|. ..+|+||++++++ + . 
+ .++||||++ |.+++|++++|.++++.+|.+ +|| T Consensus 283 ~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~~ 359 (389) T protein:vir:10 283 KNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIYGK---YLG 359 (389) T ss_pred cCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccccccc---eEE Confidence 78999873 2479999775543 3 2 2 279999998 789999999999999887764 789 Q ss_pred EEEEEcCEEecCcceEEEEEEecccccCCC Q lcl|Aclame:pro 346 AKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 346 ~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~ 375 (381) +++|+||++++++||+.+++.-+....+.+ T Consensus 360 ~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 360 AAFRFGVQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred EEEEeccEEecccceEEEEeeccCCCCCCC Confidence 999999999999999998776433333333 No 78 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.5e-52 Score=303.45 Aligned_cols=287 Identities=11% Similarity=0.033 Sum_probs=214.2 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCC------ceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcc Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKE------EKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGV 129 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~g------g~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~ 129 (381) ++ +-.|. .....+...+| +.+||+++.++|++.+++.++++++|+++++++ ..++|+.++.+. 
T Consensus 1 ~a-------~l~el---~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~ 70 (333) T protein:vir:78 1 MA-------TLNEL---LPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPE 70 (333) T ss_pred Cc-------hhHHh---hhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCce Confidence 00 00111 12223333333 449999999999999999999999999999865 589999999999 Q ss_pred eEEecccc-------cccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC Q lcl|Aclame:pro 130 AVWGKIYG-------EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD 202 (381) Q Consensus 130 a~w~~e~~-------~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~ 202 (381) +.|++|+. +..+.++++|+++++.+||++++++||+|||+|+.+++++||+++|++++++++|.+|++|+|++ T Consensus 71 a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~ 150 (333) T protein:vir:78 71 VGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPL 150 (333) T ss_pred eEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC Confidence 99987753 23346789999999999999999999999999999999999999999999999999999999987 Q ss_pred cc---eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 203 QP---IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 203 qP---~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) +| .|+++...... .+.. . .. ..........+.+++..+. 
...+.....|+|||.++..+++ T Consensus 151 ~~~~~~g~~~~~~~~~-~~~~---~--~~----~~~~~~~~~~i~~~~~~~~------~~~~~~~~~~vmn~~~~~~L~~ 214 (333) T protein:vir:78 151 TGSALQGIDTDNVIAN-TTNV---D--YL----QETGDPLLDRLLDGYDLVS------ANTDVEFNGWAVDPRFRAHLLR 214 (333) T ss_pred CCcccccccccccccc-cccc---c--cc----ccccchhHHHHHHHHHhhc------cccccCceEEEEcchHHHHHHH Confidence 65 45544221111 0100 0 00 0111112222222222211 1223445579999999988888 Q ss_pred hhhccCCCCceeec---------cCCCceEEecCCCCCc---------cEEEEeccceEEEecceeeEeeehhh------ Q lcl|Aclame:pro 280 QYTHLNANGVYVTA---------LPFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET------ 335 (381) Q Consensus 280 ~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~~---------~i~~gd~s~y~i~~r~~~~i~~~~~~------ 335 (381) ....++.+|.|++. ..+|+||+.+++||.+ .++||||++|++++|++++|+++++. T Consensus 215 ~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~ 294 (333) T protein:vir:78 215 AQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSG 294 (333) T ss_pred HhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccc Confidence 77778899999874 2379999999999864 48999999999999999999998873 Q ss_pred -----hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 336 -----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 336 -----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) .|.+|++.||+.+|+|+++++++||+++ +.. 
..| T Consensus 295 ~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l--~~~-~a~ 333 (333) T protein:vir:78 295 SATVSMWQTNQIAILIEVTFGWLLGDKQAFVKF--VDD-EQP 333 (333) T ss_pred cceeehhhcCcEEEEEEEEEccEEecccceEEE--ecc-CCC Confidence 5889999999999999999999999986 332 222 No 79 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.3e-50 Score=294.03 Aligned_cols=332 Identities=11% Similarity=0.028 Sum_probs=216.8 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHH--------HHHHHHHHHH-HHHHHHHHHHHhhhhhcc----- Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEFINAVNNGEPQE-RQNELYGDM--------INQLFEETKL-QAKAEAERVSSLPKSAQS----- 64 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~~~~~~-~~~~~~~~~--------~~~~~~~~~~-~~~~~~~~~~~~~~~~~~----- 64 (381) |.+ |+++++++..+++.+..+....+. ...+...+. .+++..+... +.+.+......+...... T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 80 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKSDS 80 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccchh Confidence 665 333333333333322211110000 000000000 0111100000 000000000000000000 Q ss_pred ----ccHH------HHHHH----HH-HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecC-- Q lcl|Aclame:pro 65 ----LSAN------QRSFF----MD-INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-- 126 (381) Q Consensus 65 ----lt~~------e~~~~----~~-~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~-- 126 (381) +... .+... .. 
-...++++++.+||+++...|++.+++.++|+++|+++++++ ..++|+.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 160 (379) T protein:vir:10 81 LVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAG 160 (379) T ss_pred HHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCC Confidence 0000 00000 00 011345566778999999999999999999999999999865 489998764 Q ss_pred CcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCccee Q lcl|Aclame:pro 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~G 206 (381) .+.+.|++|++..+ +++++|++|++.+||++++++||+|||+|++ ++++||.++++++++++++.+|+.|+|.+.+.+ T Consensus 161 ~~~~~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~ 238 (379) T protein:vir:10 161 EGAIGAQVEGATKG-QKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANATAS 238 (379) T ss_pred CcccccccCCcccc-ccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 35567899977654 5789999999999999999999999999986 699999999999999999999999998765544 Q ss_pred eeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~ 286 (381) .... ++. ..... 
+.+.+..+ ...+..+..|+|||.++..++.+ +++ T Consensus 239 ~~~~-------~~~-------------~~~d~----i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~l---kd~ 284 (379) T protein:vir:10 239 TEII-------TNK-------------NKVEM----LINEIAKQ-------ENLDFPVTAIVLRPTDYYDILVT---QKS 284 (379) T ss_pred cccc-------cCc-------------ccHHH----HHHHHHhh-------hhccCCCCEEEEcHHHHHHHHHh---hcc Confidence 3211 000 00111 11211111 12344566799999999888876 577 Q ss_pred CCceeec-----------cCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhh--hhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 287 NGVYVTA-----------LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGK 353 (381) Q Consensus 287 ~G~~~~~-----------l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~--~~~~d~~~~~~~~r~dgk 353 (381) +|+|+|. ..+|+||+.|++||+++++||||++|.+.+|.+++|+.+++. +|.+|++.||+.+|+|++ T Consensus 285 ~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~ 364 (379) T protein:vir:10 285 VGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALA 364 (379) T ss_pred CCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEeccE Confidence 8988864 246999999999999999999999999999999999887765 699999999999999999 Q ss_pred EecCcceEEEEEEec Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLK 368 (381) Q Consensus 354 ~~~~~Af~v~~l~~~ 368 (381) +.+++||+.+++.=. 
T Consensus 365 v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 365 VEQPAALIFGDFTAV 379 (379) T ss_pred EecCccEEEEEecCC Confidence 999999988766544 No 80 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=5.6e-52 Score=301.57 Aligned_cols=301 Identities=14% Similarity=0.053 Sum_probs=222.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |.+. +........+ .. ...+.+.+++......++||++||+++.++|++.+++.++|+++++++++ T Consensus 1 ~~~~---~~~~~~~~~~---~~--------~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:78 1 MEQT---QKLKLNLQHF---AS--------NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred CCcc---hhhhHHHHHH---HH--------HhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeec Confidence 0000 0000000000 00 00112233444555677899999999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..++|+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|||+||.+++++||.+.+++++++++|.+ T Consensus 67 ~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a 145 (324) T protein:vir:78 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeEecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 76 58999998889999999977765 
57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) +++|+|++ +|.||++......... .+..+. ..+.++...+ ...+....+|+|||.+ T Consensus 146 ~l~G~g~~~~~~gi~~~~~~~~~~~---------~~~~t~-------~~i~~~~~~l-------~~~~~~~~~~vmn~~~ 202 (324) T protein:vir:78 146 GILNQGNNPFGKSIAQSIEKTNKVI---------KGDFTQ-------DNIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred HhccCCCCCcCccccccccccceec---------cccccH-------HHHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999965 6888876433222111 111111 1122222211 1235566789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCCC--CCccEEEEeccceEEEecceeeEeeehhhh---------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQ--EAGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~--p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~---------- 336 (381) +..++.+ ++.+|+|++. ..+|+||+.++.+ +++.++||||++|++++|++++++.++|.. 
T Consensus 203 ~~~L~~l---~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:78 203 RSLLRKI---VDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hccCCCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccccc Confidence 9888765 5667877653 2368999987764 466799999999999999999999998853 Q ss_pred ----hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 337 ----~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) |.+|++.||+.+|+|+++.+++||++++-.....+ ++-|.- T Consensus 280 ~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~-~~~~~~ 324 (324) T protein:vir:78 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD-SVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEecccccCC-CCCCCC Confidence 89999999999999999999999998765443322 222222 No 81 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=5.6e-52 Score=301.57 Aligned_cols=301 Identities=14% Similarity=0.053 Sum_probs=222.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |.+. +........+ .. 
...+.+.+++......++||++||+++.++|++.+++.++|+++++++++ T Consensus 1 ~~~~---~~~~~~~~~~---~~--------~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQT---QKLKLNLQHF---AS--------NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred CCcc---hhhhHHHHHH---HH--------HhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeec Confidence 0000 0000000000 00 00112233444555677899999999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..++|+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|||+||.+++++||.+.+++++++++|.+ T Consensus 67 ~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a 145 (324) T protein:vir:96 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeEecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 76 58999998889999999977765 57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) +++|+|++ +|.||++......... .+..+. 
..+.++...+ ...+....+|+|||.+ T Consensus 146 ~l~G~g~~~~~~gi~~~~~~~~~~~---------~~~~t~-------~~i~~~~~~l-------~~~~~~~~~~vmn~~~ 202 (324) T protein:vir:96 146 GILNQGNNPFGKSIAQSIEKTNKVI---------KGDFTQ-------DNIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred HhccCCCCCcCccccccccccceec---------cccccH-------HHHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999965 6888876433222111 111111 1122222211 1235566789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCCC--CCccEEEEeccceEEEecceeeEeeehhhh---------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQ--EAGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~--p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~---------- 336 (381) +..++.+ ++.+|+|++. ..+|+||+.++.+ +++.++||||++|++++|++++++.++|.. T Consensus 203 ~~~L~~l---~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 203 RSLLRKI---VDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hccCCCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccccc Confidence 9888765 5667877653 2368999987764 466799999999999999999999998853 Q ss_pred ----hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 337 ----~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) |.+|++.||+.+|+|+++.+++||++++-.....+ ++-|.- T Consensus 280 ~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~-~~~~~~ 324 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD-SVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEecccccCC-CCCCCC Confidence 89999999999999999999999998765443322 222222 No 82 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=3.1e-51 Score=297.49 Aligned_cols=335 Identities=10% Similarity=-0.024 
Sum_probs=220.5 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhHH--------HHHHHHHHHHHHHHHHHH---HHHHHHH-H--------HHHH-Hhhhh Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEPQ--------ERQNELYGDMINQLFEET---KLQAKAE-A--------ERVS-SLPKS 61 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~---~~~~~~~-~--------~~~~-~~~~~ 61 (381) |+..+++.+++++..++++..-.+ .+..+......+.+..+. ..+.... . .+.. ...+. T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 544444444444433332221110 011111111111111100 0000000 0 0000 00000 Q ss_pred hcc----ccHHHHHHHH------------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEe Q lcl|Aclame:pro 62 AQS----LSANQRSFFM------------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKS 124 (381) Q Consensus 62 ~~~----lt~~e~~~~~------------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~ 124 (381) ... ....+++.+. +...+++++||++||+++..+|++.++++++|+++|+++++++ ..++|+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPIL 160 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE Confidence 000 0112222222 2345678889999999999999999999999999999999865 4778875 Q ss_pred c-CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|Aclame:pro 125 E-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 125 ~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~q 203 (381) . 
+.+.+.|+.|+++.+..++++|++|++.+|+++++++||+|||+||.+++++||.++|++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~ 240 (394) T protein:vir:10 161 KRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT 240 (394) T ss_pred ecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 4 446688999888877667899999999999999999999999999999999999999999999999999999999988 Q ss_pred ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) |.++.+.. .. +.+..++... .+..| +++|+|||+++..++.+ T Consensus 241 ~~~~~~~~-----------------------~~----d~l~~~~~~~------~~~~~--~a~~vmn~~~~~~l~~l--- 282 (394) T protein:vir:10 241 AKATTTDT-----------------------LV----DSLKHILNVD------LDPAY--SRALVVTQSLFNTLDTL--- 282 (394) T ss_pred cccccccc-----------------------cH----HHHHHHHHhh------hhhhc--cCEEEecHHHHHHHHHh--- Confidence 87653210 00 1111111100 11223 57899999998888875 Q ss_pred cCCCCceeec-------------cCCCceEEecCCC--CC--c--cEEEEeccc-eEEEecceeeEeeehhhhhhcCceE Q lcl|Aclame:pro 284 LNANGVYVTA-------------LPFNLNVIESTVQ--EA--G--KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDL 343 (381) Q Consensus 284 ~~~~G~~~~~-------------l~~g~~vi~s~~~--p~--~--~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~ 343 (381) ++++|+|++. ..+|+||++++++ |. + .++||||++ |++++|+++++..+++.+|. 
++ T Consensus 283 kd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~---~~ 359 (394) T protein:vir:10 283 KDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYG---RY 359 (394) T ss_pred hccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccccc---ee Confidence 5778988863 2479999876543 32 2 389999998 77899999999999887765 47 Q ss_pred EEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 344 ~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) |++++|+||++++++||++++++-+.. +++-++-- T Consensus 360 ~~~~~r~d~~~~~~~ai~~~~~~~~~~-~~~~~~~~ 394 (394) T protein:vir:10 360 LGAAFRFGVKQADSNAGYFVTNTDAAS-GSTSGTGK 394 (394) T ss_pred EEEEEEeccEEeccccEEEEEeecccC-CCCCCCCC Confidence 999999999999999999988766542 22222222 No 83 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=6e-51 Score=295.93 Aligned_cols=354 Identities=9% Similarity=0.032 Sum_probs=222.5 Q ss_pred CCccHHHHHHHHHHHHHHHHhh----hhHH------HHHHHHHHH-------HHHHH---HHHHHHHHHHHHHH------ Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNN----GEPQ------ERQNELYGD-------MINQL---FEETKLQAKAEAER------ 54 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~----~~~~------~~~~~~~~~-------~~~~~---~~~~~~~~~~~~~~------ 54 (381) |+.++. ++++++.++.+.++. .+.+ +++.+.+.. .++.+ ..+.+.......+. 
T Consensus 8 m~~~i~-eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~~~~~~~ 86 (477) T protein:vir:84 8 LRALRA-AAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERSGKLEAE 86 (477) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 665433 444444444333221 1100 111111111 11111 01000000000000 Q ss_pred ----------------------------HHHhhhhhccccHH---------------HHHHHH-----HHhcccCCCCce Q lcl|Aclame:pro 55 ----------------------------VSSLPKSAQSLSAN---------------QRSFFM-----DINKNVNYKEEK 86 (381) Q Consensus 55 ----------------------------~~~~~~~~~~lt~~---------------e~~~~~-----~~~~~~~~~gg~ 86 (381) .....+........ ++.... ....++++.||+ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~ 166 (477) T protein:vir:84 87 TKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGY 166 (477) T ss_pred hhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcce Confidence 00000000000000 000000 011245667899 Q ss_pred EccHHH-HHHHHHHHHhhhhhhhhceeeecC---CceEEEEec-CCcceEEecccccc----cccccccccceeccceee Q lcl|Aclame:pro 87 LLPEET-IDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKSE-TSGVAVWGKIYGEI----KGQLDAAFSEETAIQNKL 157 (381) Q Consensus 87 lvP~~~-~~~Ii~~l~~~~~l~~~~~v~~~~---~~~~ip~~~-~~~~a~w~~e~~~~----~~~~~~~f~~v~l~~~kl 157 (381) +||+++ .++|++.++..++|+++|++++++ +.+.||+.. +...++|++|++.. 
.++++++|+++++.+||+ T Consensus 167 lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~ 246 (477) T protein:vir:84 167 AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTI 246 (477) T ss_pred eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeE Confidence 998885 678999999999999999988754 357899854 44567889887643 346788999999999999 Q ss_pred eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCC-CcceeeeeccccccccccccccccchhhhccccCh Q lcl|Aclame:pro 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANP 236 (381) Q Consensus 158 ~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~-~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~ 236 (381) +++++||+|||+||.+++++||.++|++++++++|.+|++|+|+ ++|.||++........... ...+..+. T Consensus 247 ~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~--------~~~t~~~~ 318 (477) T protein:vir:84 247 AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATS--------AGSALEKH 318 (477) T ss_pred EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccc--------cccchhhH Confidence 99999999999999999999999999999999999999999996 5999999753221111110 01112222 Q ss_pred hHHHHHHHHHHHHhhhccccccccc-cCceEEEEchhhHHHHHhhhhccCCCCceeec---------------------- Q lcl|Aclame:pro 237 RATVNELTQVFKYHSTNEKGKSVAV-KGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------------------- 293 (381) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------------------- 293 (381) ..+...+.+....+ ...| .+..+|+|||.++..++++ ++.+|+|+|. 
T Consensus 319 ~~~~~~i~~~~~~~-------~~~~~~~~~~~v~~~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~ 388 (477) T protein:vir:84 319 QIIYQKIADAIQRV-------HTSRFLEPEVIVMHPRRWASFHAI---FAGDDRPLIVPSGPGFNNLGVLTEVASQRVVG 388 (477) T ss_pred HHHHHHHHHHHhhc-------cccccCCccEEEEcHHHHHHHHHh---hccCCCeeeecCcccccccccccccccccccc Confidence 22222222222111 1223 3345799999999888876 5678888874 Q ss_pred cCCCceEEecCCCCCc--------cEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEec-CcceEEEE Q lcl|Aclame:pro 294 LPFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKD-NKVAAVWK 364 (381) Q Consensus 294 l~~g~~vi~s~~~p~~--------~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~-~~Af~v~~ 364 (381) ..+|+||++++.||++ .++||||++|+++. .++++.++++.++.++++.|+...+++++++. ++||++++ T Consensus 389 ~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t 467 (477) T protein:vir:84 389 QMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIG 467 (477) T ss_pred hhcccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccceEEee Confidence 2369999999999964 47999999998877 57899999999999999999999888887775 99999754 Q ss_pred EEecccccCCCC Q lcl|Aclame:pro 365 LDLKGHKPALEG 376 (381) Q Consensus 365 l~~~~~~~~~~~ 376 (381) .++.++.++. 
T Consensus 468 --~~~~~~~~~~ 477 (477) T protein:vir:84 468 --GTALTAPTFA 477 (477) T ss_pred --cccccccccC Confidence 4444444333 No 84 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=3.5e-51 Score=297.23 Aligned_cols=301 Identities=14% Similarity=0.065 Sum_probs=222.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |.+. ++.+.+.+. ....... .+.+++.......++|.+||++++++|++.+++.++|+++|+++++ T Consensus 1 ~~k~--~~~~~~~~~----~~~~~~~--------~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~ 66 (324) T protein:vir:99 1 MEQT--QKLKLNLQH----FASNNVK--------PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPM 66 (324) T ss_pred CCCc--hHhhHHHHH----HHHHhhh--------hhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeec Confidence 1000 000000000 0000000 1111223334455677799999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..++|+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|||+|+.+++++||.+.+++++++++|.+ T Consensus 67 ~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~ 145 (324) T protein:vir:99 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 65 58999998889999999987765 
57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) +++|+|++ +|.|+++.+........ ++.+. ..+.+++..+ ...+..+.+|+|||.+ T Consensus 146 ~l~G~g~~~~~~~~~~~~~~~~~~~~---------~~~~~-------~~i~~~~~~l-------~~~~~~~~~~v~n~~~ 202 (324) T protein:vir:99 146 GILNQGNNPFGKSIAQSIEKTNKVIK---------GDFTQ-------DNIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred hhhcCCCCccCccccccccccceecc---------ccCCH-------HHHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999976 78998865443222211 11111 1222222222 1234566789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCCCC--CccEEEEeccceEEEecceeeEeeehhhh---------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~p--~~~i~~gd~s~y~i~~r~~~~i~~~~~~~---------- 336 (381) +..++.+ ++++|+|++. ..+|+||+.++.++ ++.+++|||++|+++++++++|++++|.. 
T Consensus 203 ~~~L~~l---~d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:99 203 RSLLRKI---VDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hcCCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccccccccccccc Confidence 9888765 5667777643 24799999988766 45699999999999999999999998853 Q ss_pred ----hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 337 ----~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) |.+|++.||+.+|+|+++.+++||++++......++ +.+.- T Consensus 280 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~-~~~~~ 324 (324) T protein:vir:99 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS-VPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC-CCCCC Confidence 889999999999999999999999998887655442 22222 No 85 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.4e-51 Score=298.14 Aligned_cols=301 Identities=14% Similarity=0.058 Sum_probs=222.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |++ ..+.+.++...... 
..+.+.+++........+|++||+++.++|++.+++.++|+++|+++++ T Consensus 1 ~~~------~~~~~~~~~~f~~~--------~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:93 1 MEQ------TQKLKLNLQHFASN--------NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred Cch------hHHHHHHHHHHHHh--------hhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeec Confidence 100 00011111111110 0111222344445556778899999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..+||+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|||+||.+++++||++++++++++++|.+ T Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a 145 (324) T protein:vir:93 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeeecCCcccc-ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 65 48999998899999999987765 57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) +|+|+|++ +|.|+++.......... ++.+. 
..+.+++..+ ...+.....|+|||++ T Consensus 146 ~l~G~g~~~~~~~~~~~~~~~~~~~~---------~~~~~-------~~i~~~~~~l-------~~~~~~~~~~v~n~~~ 202 (324) T protein:vir:93 146 GILNQGNNPFGKSIAQSIEKTNKVIK---------GDFTQ-------DNIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred HhcCCCCCCcCccccccccccceecc---------ccccH-------HHHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999965 78999865433222111 11111 1222222222 1234556789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCC--CCCccEEEEeccceEEEecceeeEeeehhhh---------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTV--QEAGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~--~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~---------- 336 (381) +..++.+ ++++|+|+.. ..+|+||+.+.. ++++.+++|||++|++++|++++|+.++|.. T Consensus 203 ~~~L~~l---~d~~G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:93 203 RSLLRKI---VDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hCCCCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccccc Confidence 8888765 5678888753 236899998765 4566799999999999999999999998853 Q ss_pred ----hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 337 ----~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) |.+|++.||+.+|+|+++++++||++++......++ +-|.- T Consensus 280 ~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~-~~~~~ 324 (324) T protein:vir:93 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS-VPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCC-CCCCC Confidence 889999999999999999999999998655544333 22332 No 86 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=9.3e-52 Score=300.37 Aligned_cols=270 Identities=12% Similarity=0.010 Sum_probs=223.0 Q 
ss_pred HHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC---CceEEEEec-CCcceEEecccccccccccccc Q lcl|Aclame:pro 72 FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKSE-TSGVAVWGKIYGEIKGQLDAAF 147 (381) Q Consensus 72 ~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~---~~~~ip~~~-~~~~a~w~~e~~~~~~~~~~~f 147 (381) +++++..+++++||++||+++.++|++.++++++|+++|++++++ +...+|+.. ..+.+.|++|+++.++.++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 677888899999999999999999999999999999999998864 346677654 5577999999888776678999 Q ss_pred cceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccch Q lcl|Aclame:pro 148 SEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) Q Consensus 148 ~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~ 227 (381) +++++.+||++++++||+|||+|+.+++++||++++++++++++|.+|++|+|...+.+ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~--------------------- 139 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKP--------------------- 139 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccc--------------------- Confidence 99999999999999999999999999999999999999999999999999987543210 Q ss_pred hhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCc Q lcl|Aclame:pro 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNL 298 (381) Q Consensus 228 ~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~ 298 (381) +..+. +.+.++...+ ...|+.+++|+||+.++..++.+ ++.+|+|+|. 
..+|+ T Consensus 140 -~~~~~-------d~i~~~~~~l-------~~~~~~~a~~vmn~~~~~~L~~l---kd~~g~~l~~~~~~~~~~~~l~G~ 201 (293) T protein:vir:48 140 -TLTKW-------DDIIDLEAKV-------DPAIKQTSFFLTNTSGFTALKKV---KNALGDYLMERDVKSPTGYSIAGF 201 (293) T ss_pred -cccCH-------HHHHHHHHhh-------hhhhcCCCEEEEcHHHHHHHHHh---hccCCceEeecCcCCCCCceecce Confidence 00011 1122222222 23577889999999999888775 5678998874 24799 Q ss_pred eEEecC--CCCCc-----cEEEEeccc-eEEEecceeeEeeehh--hhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 299 NVIEST--VQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 299 ~vi~s~--~~p~~-----~i~~gd~s~-y~i~~r~~~~i~~~~~--~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) ||+.++ .+|.. .++||||++ |.+++|++++++++++ .+|.+|+++||+.+|+||++++++||++++++-+ T Consensus 202 Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 202 AVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred eeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 987644 44432 379999998 6789999999999875 5799999999999999999999999999999999 Q ss_pred ccccCCCCCCCC Q lcl|Aclame:pro 369 GHKPALEGTEET 380 (381) Q Consensus 369 ~~~~~~~~~~~~ 380 (381) +.+|++.+++.- T Consensus 282 ~~~~~~~~~~~~ 293 (293) T protein:vir:48 282 ADQKGNIGSTAV 293 (293) T ss_pred ccCCccccccCC Confidence 988888888888 No 87 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=4.9e-51 Score=296.39 Aligned_cols=301 Identities=15% Similarity=0.070 Sum_probs=222.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |.+. ++.+. +..+........+. +++.......++|.+||++++++|++.+++.++|+++|+++++ T Consensus 1 ~~~~--~~~~~----~~~~f~~~~~~~~~--------~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~ 66 (324) T protein:vir:10 1 MEQT--QKLKL----NLQHFASNNVKPQV--------FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred CCCc--hHHHH----HHHHHHHHhhccce--------ecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeec Confidence 0000 00000 01111111011111 1222334455677899999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..++|+.++.+.+.|++|+++.+ +++++|+++++.+||++++++||+|||+|+.+++++||.+.+++++++++|.+ T Consensus 67 ~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a 145 (324) T protein:vir:10 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEeCCcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 65 58999998889999999987765 57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) +++|+|++ +|.|+++.+........ ++.+. 
..+.+++..+ ...+..+.+|+|||.+ T Consensus 146 ~l~G~g~~~~~~~i~~~~~~~~~~~~---------~~~t~-------~~i~~~~~~l-------~~~~~~~~~~v~n~~~ 202 (324) T protein:vir:10 146 GILNQGNNPFGKSIAQSIEKTNKVIK---------GDFTQ-------DNIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred hhhcCCCCccCccccccccccceecc---------ccCCH-------HHHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999976 79999865443322211 11111 1222222222 2235566789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCCCC--CccEEEEeccceEEEecceeeEeeehhhh---------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~p--~~~i~~gd~s~y~i~~r~~~~i~~~~~~~---------- 336 (381) +..++.+ ++++|+|++. ..+|+||+.++.++ ++.+++|||++|++++|++++|++++|.+ T Consensus 203 ~~~L~~l---~d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:10 203 RSLLRKI---VDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hccCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccccccccccccc Confidence 9888765 5667877653 24799999887755 55699999999999999999999998853 Q ss_pred ----hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 337 ----~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) |.+|++.||+.+|+|+++.+++||++++...... 
+++.+.- T Consensus 280 ~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~-~~~~~~~ 324 (324) T protein:vir:10 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT-DSVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCC-CCCCCCC Confidence 8899999999999999999999999987766443 2232322 No 88 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1e-51 Score=300.14 Aligned_cols=275 Identities=14% Similarity=0.036 Sum_probs=219.7 Q ss_pred ccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEeccccccccc Q lcl|Aclame:pro 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKIYGEIKGQ 142 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~e~~~~~~~ 142 (381) ++.+.. ++++..+++++|.+||++++++|++.+++.++|+++|+++++++ ...+|+..+.+.+.|++|+++.+ + T Consensus 1 m~~~~~---~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~ 76 (297) T protein:vir:95 1 MTVQTF---NPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIK-T 76 (297) T ss_pred CCcccc---ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcccc-c Confidence 222222 33444566788889999999999999999999999999999764 36788888888999999987765 5 Q ss_pred ccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccc Q lcl|Aclame:pro 143 LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 143 ~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~ 222 (381) ++++|+++++.+||++++++||+|+|+||.+++++||++++++++++++|.++++|+|+++|.||++.+........ 
T Consensus 77 ~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~--- 153 (297) T protein:vir:95 77 DKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIG--- 153 (297) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecc--- Confidence 68999999999999999999999999999999999999999999999999999999999999999875433222111 Q ss_pred cccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeecc----CCCc Q lcl|Aclame:pro 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----PFNL 298 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~l----~~g~ 298 (381) +..+.+ . +.+++..+ ...+..+.+|+|||.++..++.+ ++.+|+|++.. .+|+ T Consensus 154 ------~~~t~~---~----i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~l---~d~~G~~i~~~~~~~l~G~ 210 (297) T protein:vir:95 154 ------GPINYD---N----ILKLQDAL-------YDADVEPNAFVSKIQNRSALREA---RDGNKVSIYDKAANTIDGI 210 (297) T ss_pred ------cccCHH---H----HHHHHHHh-------hhccCCcCEEEEcHHHHHHHHHh---hccCCceeecCCCCcccce Confidence 111111 1 22222211 12345567899999999888865 56788888753 3688 Q ss_pred eEEecC--CCCCccEEEEeccceEEEecceeeEeeehhhh--------------hhcCceEEEEEEEEcCEEecCcceEE Q lcl|Aclame:pro 299 NVIEST--VQEAGKVLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAKQFAYGKAKDNKVAAV 362 (381) Q Consensus 299 ~vi~s~--~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~--------------~~~d~~~~~~~~r~dgk~~~~~Af~v 362 (381) ||+.+. 
.++++.++||||++|+++++++++++++++.+ |.+|++.||+.+|+|+++.+++||++ T Consensus 211 Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~ 290 (297) T protein:vir:95 211 TTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAK 290 (297) T ss_pred eeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEE Confidence 998754 46678899999999999999999999998864 88999999999999999999999997 Q ss_pred EEEEecccccC Q lcl|Aclame:pro 363 WKLDLKGHKPA 373 (381) Q Consensus 363 ~~l~~~~~~~~ 373 (381) ++ ..||+ T Consensus 291 l~----~at~~ 297 (297) T protein:vir:95 291 LT----PAERV 297 (297) T ss_pred Ee----ecCCC Confidence 52 34555 No 89 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.5e-50 Score=293.75 Aligned_cols=337 Identities=9% Similarity=-0.028 Sum_probs=215.2 Q ss_pred CCccHHHHHHHHHHHHHHH----Hhh---hhHH---HHHHHHH---HHHHHH----HHHH---HHHHHHHHHHHHHH--- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINA----VNN---GEPQ---ERQNELY---GDMINQ----LFEE---TKLQAKAEAERVSS--- 57 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~----~~~---~~~~---~~~~~~~---~~~~~~----~~~~---~~~~~~~~~~~~~~--- 57 (381) |.=++ +++.++.+++.+. ++. .... +...... ....+. .... ..........+... 
T Consensus 48 ~~~ei-~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 126 (437) T protein:vir:10 48 KEDEI-KEIRSNIEVLEQASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQ 126 (437) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHh Confidence 21111 1222222211111 111 0000 0000000 000000 0000 00000000000000 Q ss_pred --hhhhhccccHHHHHH---------HHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEec Q lcl|Aclame:pro 58 --LPKSAQSLSANQRSF---------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSE 125 (381) Q Consensus 58 --~~~~~~~lt~~e~~~---------~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~ 125 (381) ...........+.+. ..+...+++++||++||+++.+.|. .+++.++|+.+|++++++ +...+|+.. T Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 205 (437) T protein:vir:10 127 DMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFN 205 (437) T ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEee Confidence 000000011111111 1234556788999999999988665 578889999999999875 457788754 Q ss_pred -CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcc Q lcl|Aclame:pro 126 -TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP 204 (381) Q Consensus 126 -~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP 204 (381) ..+.+.|++|++..+..++++|++|++.+|+++++++||+|||+||.+||++||+++++++++.+++.+|++|+|+++| T Consensus 206 ~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~ 285 (437) T protein:vir:10 206 NSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIK 285 (437) T ss_pred ccccccccccccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 4567899999888776778999999999999999999999999999999999999999999999999999999998877 Q ss_pred 
eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc Q lcl|Aclame:pro 205 IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL 284 (381) Q Consensus 205 ~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~ 284 (381) .+..+. +..+ +.+++.. ..+..|+.+++|+|||.++..++.+ + T Consensus 286 ~~~~~~---------------------~~~~-------~~~~~~~------~l~~~~~~~~~~~~~~~~~~~l~~l---k 328 (437) T protein:vir:10 286 KTTSTY---------------------LLGD-------LKKVLNV------TLKPQDSAAASIVMSQSAYNLFDMA---T 328 (437) T ss_pred cccccc---------------------chhh-------HHHHHHh------hhhhhhhcCCEEEEcHHHHHHHHHh---h Confidence 543210 0011 1111110 1234578899999999998888775 6 Q ss_pred CCCCceeec---------cCCCceEEecCCC--CCc---c--EEEEeccc-eEEEecceeeEeeehhhhhhcCceEEEEE Q lcl|Aclame:pro 285 NANGVYVTA---------LPFNLNVIESTVQ--EAG---K--VLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAK 347 (381) Q Consensus 285 ~~~G~~~~~---------l~~g~~vi~s~~~--p~~---~--i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~ 347 (381) +++|+|+|. ..+|+||++++++ |.+ + ++||||++ |.+++|+++++..+++ +..+.+++++. 
T Consensus 329 d~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~--~~~~~~~~~~~ 406 (437) T protein:vir:10 329 DAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDT--YDIWYKQLGIF 406 (437) T ss_pred ccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecc--cccccceeeEE Confidence 788999874 2479999987654 532 2 89999997 6789999999987764 56677899999 Q ss_pred EEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 348 QFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 348 ~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) +|+||++++++||++++.++++.+.+.-.+- T Consensus 407 ~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 407 LRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred EEEccEEecccceEEEEeeccccccCCCCCC Confidence 9999999999999999887644322221111 No 90 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=6.4e-50 Score=290.29 Aligned_cols=345 Identities=16% Similarity=0.133 Sum_probs=221.4 Q ss_pred CCcc-HHHHHHHHHHHHHHHHhhhh---HH------HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH---------- Q lcl|Aclame:pro 1 MTIN-LSETFANAKNEFINAVNNGE---PQ------ERQNELY---GDMINQLFEETKLQAKAEAERVSS---------- 57 (381) Q Consensus 1 m~~~-l~~~~~e~~~~~~~~~~~~~---~~------~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~---------- 57 (381) |++. ...++.+++.++.+.++... .+ +++.+.+ ...++.+..........+..++.. 
T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~ 272 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNG 272 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 6662 22344444444433322111 00 1111111 111111111111000000000000 Q ss_pred ----------------------hhh--------hhccccHHH---------HHH----HHHHh----cccCCCCceEccH Q lcl|Aclame:pro 58 ----------------------LPK--------SAQSLSANQ---------RSF----FMDIN----KNVNYKEEKLLPE 90 (381) Q Consensus 58 ----------------------~~~--------~~~~lt~~e---------~~~----~~~~~----~~~~~~gg~lvP~ 90 (381) ..+ .++...+.+ .+. ..++. .++.+.||++||+ T Consensus 273 ~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~ 352 (645) T protein:vir:93 273 NVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQ 352 (645) T ss_pred ccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCch Confidence 000 000000000 000 01111 2334469999999 Q ss_pred HHHHHHHHHHHhhhhhhhhceee-e----cCCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhH Q lcl|Aclame:pro 91 ETIDRIFEDLTTNHPLLADLGIK-N----AGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPK 165 (381) Q Consensus 91 ~~~~~Ii~~l~~~~~l~~~~~v~-~----~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ 165 (381) ++..+||+.|+..+++++++... + ..+++++|+.++.++++|++|+++. 
++++++|+++++.+||++++++||+ T Consensus 353 ~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~-~~s~~~f~~v~l~~~kla~~~~iS~ 431 (645) T protein:vir:93 353 EYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTK-PLTKFDFESITFSHAKVSAIAVLTE 431 (645) T ss_pred hhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccc-cccccceeEEEEeeEEEEEeehhHH Confidence 99999999999999999886542 2 2457899999988999999997765 4679999999999999999999999 Q ss_pred HHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC----cceeeeeccccccccccccccccchhhhccccChhHHHH Q lcl|Aclame:pro 166 DLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD----QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVN 241 (381) Q Consensus 166 ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~----qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~ 241 (381) |||+||.+++++||++++++++++++|.+||+|+|++ +|.|++..+.... +. +. +.. T Consensus 432 ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~--~~---------~~-~~~------- 492 (645) T protein:vir:93 432 ELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTA--SS---------GN-PDA------- 492 (645) T ss_pred HHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccc--cc---------cc-hHH------- Confidence 9999999999999999999999999999999998754 6899865321110 00 00 001 Q ss_pred HHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec-------cCCCceEEecCCCCCccEEEE Q lcl|Aclame:pro 242 ELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-------LPFNLNVIESTVQEAGKVLTY 314 (381) Q Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~-------l~~g~~vi~s~~~p~~~i~~g 314 (381) .+..++..+.. ......+++|+|||.++..++.+ ++++|+|++. 
..+|+||+.|++||+ .++|| T Consensus 493 d~~~~~~~~~~-----a~~~~~~a~~vmn~~~~~~L~~l---kd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~-~~~~g 563 (645) T protein:vir:93 493 DAEAAFGQFVA-----ANLQPTGAVWLMSSTNALALSMR---KNALGQKEYPDMTLLGGSFQGLPVIVSQYVGD-QLVLV 563 (645) T ss_pred HHHHHHHHHHh-----cCCCccccEEEEcHHHHHHHHhc---cccCCceeecCCCCCCceeeceeeEEeccCCc-ceeEe Confidence 11222221110 11234578999999998888765 6778887652 237999999999996 57899 Q ss_pred eccceEEEecceeeEeeehhhh----------------------hhcCceEEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 315 VKGLYDGYLAGGINVQKFKETL----------------------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 315 d~s~y~i~~r~~~~i~~~~~~~----------------------~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) ||++|++++++++.|..+++.. |.+|+++||+.+|+|+++.+++||++++=-.=+ T Consensus 564 d~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g--- 640 (645) T protein:vir:93 564 NAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYG--- 640 (645) T ss_pred ccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCC--- Confidence 9999999999999998876632 889999999999999999999999987522211 Q ss_pred CCCCC Q lcl|Aclame:pro 373 ALEGT 377 (381) Q Consensus 373 ~~~~~ 377 (381) +-.|+ T Consensus 641 ~~~~~ 645 (645) T protein:vir:93 641 SASGG 645 (645) T ss_pred cccCC Confidence 22333 No 91 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=3.5e-50 Score=291.73 Aligned_cols=322 Identities=11% Similarity=0.038 Sum_probs=210.2 Q ss_pred CCccHHHHHHHHHHHHHH-------HHhhhhHHHH------HHHHHHHHHHHHHHHHH---HHHHHHHH----------- Q lcl|Aclame:pro 1 MTINLSETFANAKNEFIN-------AVNNGEPQER------QNELYGDMINQLFEETK---LQAKAEAE----------- 53 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~-------~~~~~~~~~~------~~~~~~~~~~~~~~~~~---~~~~~~~~----------- 53 (381) |.-++ +++.++++++.+ .++.....++ 
+.+.+...++.+..... .+.+...+ T Consensus 15 l~~~l-~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~~~~~ 93 (397) T protein:vir:96 15 RSSEI-DKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAKAADPTDQ 93 (397) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 21111 122222222222 2221110110 01111111111111100 00000000 Q ss_pred -------HHHHhhhhhccccHHHHHHHH---------HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC Q lcl|Aclame:pro 54 -------RVSSLPKSAQSLSANQRSFFM---------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL 117 (381) Q Consensus 54 -------~~~~~~~~~~~lt~~e~~~~~---------~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~ 117 (381) +.............+++..+. .....+..+||++||+++.+.|++ +++..++++.|+++++++ T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~ 172 (397) T protein:vir:96 94 KPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNS 172 (397) T ss_pred hhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccc Confidence 000000000011111122211 123356778999999999999987 678889999999988754 Q ss_pred -ceEEEEec-CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhe Q lcl|Aclame:pro 118 -RLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 118 -~~~ip~~~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~ 195 (381) ...+|+.. 
+.+.+.|+.|+++.+..++++|+++++.+|++++++++|++||+||.+++++||.+.++++++.+++.+| T Consensus 173 ~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i 252 (397) T protein:vir:96 173 ASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADI 252 (397) T ss_pred cceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45666543 3456778888787776789999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHH Q lcl|Aclame:pro 196 LKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAF 275 (381) Q Consensus 196 l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~ 275 (381) ++|+|.++|.|+.+. .+ +.+++... .+ .+ .+++|+|||.++. T Consensus 253 ~~g~g~~~~~~~~~~-----------------------d~-------~~~~~~~~------~~-~~-~~a~~v~n~~~~~ 294 (397) T protein:vir:96 253 AAVLKTATAKSVVGV-----------------------DG-------LKDLINKE------IK-KV-YDVKLFISASMYS 294 (397) T ss_pred hhcccccccccccch-----------------------HH-------HHHHHHHh------hh-hh-cCcEEEEcHHHHH Confidence 999999988876420 00 11111110 01 12 2578999999998 Q ss_pred HHHhhhhccCCCCceeec---------cCCCceEEecCCCCC------ccEEEEeccc-eEEEecceeeEeeehhhhhhc Q lcl|Aclame:pro 276 EVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEA------GKVLTYVKGL-YDGYLAGGINVQKFKETLALD 339 (381) Q Consensus 276 ~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~~~p~------~~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~ 339 (381) .++.+ ++++|+|+|. ..+|+||+.++++.. 
..++|||||+ |++++|+++++..+++.+| T Consensus 295 ~l~~l---kd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~-- 369 (397) T protein:vir:96 295 ELDKL---KDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIY-- 369 (397) T ss_pred HHHHh---hccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEeccccc-- Confidence 88876 5788999974 247999987654332 2389999998 6789999999999998765 Q ss_pred CceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 340 d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) +++||+++|+||++++++||++++++.+ T Consensus 370 -~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 370 -GQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred -ceeEEEEEEEccEEecccceEEEEeecC Confidence 5689999999999999999999999997 No 92 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=3.3e-50 Score=291.87 Aligned_cols=301 Identities=15% Similarity=0.066 Sum_probs=218.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec Q lcl|Aclame:pro 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~ 115 (381) |.+ ..+ .+....+........ 
+.+++........+|++||++++++|++.+++.++|+++++++++ T Consensus 1 ~~~----~~~--~~~~~~~f~~~~~~~--------~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQ----TQK--LKLNLQHFASNNVKP--------QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred CCc----chh--hhHHHHHHHHhhhhh--------hhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeec Confidence 000 000 000000000000000 011222233345677899999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a 194 (381) ++ ..++|+.++.+.+.|++|+++.+ +++++|+++++.+||++++++||+|||+||.+++++||.+.+++++++++|.+ T Consensus 67 ~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~ 145 (324) T protein:vir:96 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeeecCCcccc-ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 76 48999998888999999987765 57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~l~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) +++|+|++ .|.|++........... 
+..+.+ .+.++...+ ...+..+.+|+|||++ T Consensus 146 ~l~G~g~~~~~~~~~~~~~~~~~~~~---------~~~~~~-------~i~~~~~~i-------~~~~~~~~~~i~n~~~ 202 (324) T protein:vir:96 146 GILNQGNNPFGKSIAQSIKKTNKVIK---------GDFTQD-------NIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred hhhcCCCCCcCccccccccccceecc---------cccchH-------HHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999976 68888764332221111 111111 122222211 1234556789999999 Q ss_pred HHHHHhhhhccCCCCceeec-----cCCCceEEecCCC--CCccEEEEeccceEEEecceeeEeeehhh----------- Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQ--EAGKVLTYVKGLYDGYLAGGINVQKFKET----------- 335 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~g~~vi~s~~~--p~~~i~~gd~s~y~i~~r~~~~i~~~~~~----------- 335 (381) +..++.+ ++++|+|+.. ..+|+||+.+... +++.++||||++|+++++++++|+.+++. T Consensus 203 ~~~L~~l---kd~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 203 RSLLRKI---VDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHh---hCCCCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccccc Confidence 8887765 5677887643 2368999887654 45679999999999999999999999875 Q ss_pred ---hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 336 ---LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 336 ---~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) +|.+|++.||+.+|+|+++++++||++++......+. 
+-|.- T Consensus 280 ~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~-~~~~~ 324 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS-VPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCC-CCCCC Confidence 4889999999999999999999999987765544333 22222 No 93 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=7e-50 Score=290.08 Aligned_cols=271 Identities=10% Similarity=-0.058 Sum_probs=205.2 Q ss_pred cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEecCCcceEEecccccccccccccccceecccee Q lcl|Aclame:pro 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -++.+.||++||++++++|++.++..++|+++|++++++ +..++|+.++.+.+.|++|+++.+ +++++|+++++.+|| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~-~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKT-HGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCcccc-ccccceeeEEeeeEE Confidence 456778999999999999999999999999999999986 568999998889999999987764 678999999999999 Q ss_pred eeeehhhhHHHH---hcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeee-eccccccccccccccccchhhhcc Q lcl|Aclame:pro 157 LTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLN-RQVQKGVSVTEGAYPEKEEQGTLT 232 (381) Q Consensus 157 l~~~~~iS~ell---~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil-~~~~~~~~~~~~~~~~~~~~~~~t 232 (381) +++++++|+||| .|+.+++++||++++++++++++|.+|++|+|+..+.+.. ............. . . 
T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~-------~--~ 150 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQV-------V--K 150 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccc-------c--c Confidence 999999999999 4788999999999999999999999999997644332221 1000000000000 0 0 Q ss_pred ccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec----------cCCCceEEe Q lcl|Aclame:pro 233 FANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----------LPFNLNVIE 302 (381) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~----------l~~g~~vi~ 302 (381) ..........+.+++..+ ...+..+..|+|||.++..++.+ ++++|.|++. ..+|+||+. T Consensus 151 ~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~L~~l---kd~~g~~~~~~~~~~~~~~~~l~G~Pv~~ 220 (303) T protein:vir:97 151 FTESEDADANIEAAVNLI-------QGAEGVVTGLAMDTEFSTALAKV---TNGEMGPKMYPELAWGANPDSINGLKSSV 220 (303) T ss_pred cccccchHHHHHHHHHHH-------hhcCCCccEEEEcHHHHHHHHHh---hccCCCeEEecCccCCCCCceecceeeEE Confidence 000011112222222222 11234456799999999888865 5778887763 247999999 Q ss_pred cCCCCCc--------cEEEEeccc-eEEEecceeeEeeehhh--------hhhcCceEEEEEEEEcCEEecCcceEEEEE Q lcl|Aclame:pro 303 STVQEAG--------KVLTYVKGL-YDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAVWKL 365 (381) Q Consensus 303 s~~~p~~--------~i~~gd~s~-y~i~~r~~~~i~~~~~~--------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l 365 (381) |++||.+ .++||||+. |.++.|++++++.+++. +|.+|+++||+.+|+|+++++++||+.++- T Consensus 221 s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~ 300 (303) T protein:vir:97 221 NTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTK 300 (303) T ss_pred ecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeC Confidence 9999853 278999965 88999999999887653 599999999999999999999999987532 Q ss_pred Eec Q lcl|Aclame:pro 366 DLK 368 (381) Q Consensus 366 ~~~ 368 (381) --. 
T Consensus 301 ~~~ 303 (303) T protein:vir:97 301 GEV 303 (303) T ss_pred CCC Confidence 111 No 94 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=7.1e-50 Score=290.04 Aligned_cols=288 Identities=11% Similarity=0.018 Sum_probs=210.4 Q ss_pred cccHHHHHHHHHHhcccCC------CCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcce------ Q lcl|Aclame:pro 64 SLSANQRSFFMDINKNVNY------KEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVA------ 130 (381) Q Consensus 64 ~lt~~e~~~~~~~~~~~~~------~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a------ 130 (381) .-+-.|. ..+..+.+. .++.+||++++++|++.+++.++|+++|+++++++ ..++|+.+..+.+ T Consensus 1 ~~~~~e~---~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~ 77 (338) T protein:vir:78 1 MATLNEL---APNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVG 77 (338) T ss_pred CcchHHh---hhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccc Confidence 0011111 122223322 34558999999999999999999999999999876 4899997765544 Q ss_pred --EEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC---cce Q lcl|Aclame:pro 131 --VWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPI 205 (381) Q Consensus 131 --~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~---qP~ 205 (381) .|++|+++. ++++++|+++++.+||++++++||+|||+||.+++++||++.+++++++++|.+|++|+|++ +|. 
T Consensus 78 ~~~~~~Eg~~~-~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~ 156 (338) T protein:vir:78 78 TSNEQREGGTK-PLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQ 156 (338) T ss_pred ccccccccccc-cccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccc Confidence 455665554 45789999999999999999999999999999999999999999999999999999999975 577 Q ss_pred eeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC Q lcl|Aclame:pro 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~ 285 (381) ||++......... .+. ...........+.+....+.. .......+|+|||.++..++.....++ T Consensus 157 gi~~~~~~~~~~~----~~~------~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~m~~~~~~~L~~~~~l~d 220 (338) T protein:vir:78 157 GIDTNNVIVNTTN----VDY------LQTGTTPLLDRFLDGYDLVSA------NTDVDFNGWAADPRYRARLLRSQAYRD 220 (338) T ss_pred ccccccccccccc----ccc------ccccchhhHHHHHHHHHHhhh------hccccceEEEEchHHHHHHHHHhhhcc Confidence 7765432211100 000 011111222233333222211 112234579999999888877666688 Q ss_pred CCCceeec---------cCCCceEEecCCCCCc---------cEEEEeccceEEEecceeeEeeehhh------------ Q lcl|Aclame:pro 286 ANGVYVTA---------LPFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) Q Consensus 286 ~~G~~~~~---------l~~g~~vi~s~~~p~~---------~i~~gd~s~y~i~~r~~~~i~~~~~~------------ 335 (381) .+|+|++. ..+|+||+.+++||++ .++||||++|++++|++++|+++++. 
T Consensus 221 ~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 300 (338) T protein:vir:78 221 ANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQT 300 (338) T ss_pred CCCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccc Confidence 89999863 2379999999999852 37899999999999999999998874 Q ss_pred --hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 336 --~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) +|.+|+++||+.+|+|+++++++||++++-.. ++.+ T Consensus 301 ~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~-~~~~ 338 (338) T protein:vir:78 301 VSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDE-DPDA 338 (338) T ss_pred hhhhhcCcEEEEEEEEeccEeecccceEEEeccc-CCCC Confidence 48899999999999999999999998864322 2222 No 95 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=2.6e-49 Score=286.97 Aligned_cols=270 Identities=8% Similarity=-0.045 Sum_probs=204.9 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEecCCcceEEecccccccccccccccceeccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~ 154 (381) |..+++ .+|++||++++.+|++.+++.++++++|++++++ +..++|+.++.+.|.|++|+++.+ +++++|+++++.+ T Consensus 1 ma~~t~-~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQL-SKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKT-HGGVSLDPVTIVP 78 (300) T ss_pred Cccccc-CCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccc-cccccceeeEeee Confidence 555554 5677899999999999999999999999999975 568999988889999999977654 6789999999999 Q ss_pred 
eeeeeehhhhHHHH---hcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC--cceeeeeccccccccccccccccchhh Q lcl|Aclame:pro 155 NKLTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD--QPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) Q Consensus 155 ~kl~~~~~iS~ell---~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~--qP~Gil~~~~~~~~~~~~~~~~~~~~~ 229 (381) ||++++++||+||| +|+.+++++||++++++++++++|.+|++|++.+ ++.++..... ..+.........+ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~----~~~~~~~~~~~~~ 154 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNC----FDKKVTQTVPFKD 154 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccc----cccccceeecccc Confidence 99999999999999 5778999999999999999999999999997543 3332321110 0000000000000 Q ss_pred hccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCceE Q lcl|Aclame:pro 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNV 300 (381) Q Consensus 230 ~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~v 300 (381) ......+..+...+ ...+..+.+|+|||.++..++++ ++++|+|+|. ..+|+|| T Consensus 155 -------~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~L~~l---kd~~G~~i~~~~~~~~~~~~l~G~Pv 217 (300) T protein:vir:95 155 -------TNPDESMEDAVGMI-------DGSERDITGAILDPIFTTALSKM---KNAEGGKLYPELAWGGVPDAINGLAV 217 (300) T ss_pred -------cchHHHHHHHHHHh-------hhcCCCccEEEECHHHHHHHHHh---hccCCCeeccCccccCCCceecceee Confidence 11112233333222 11234456799999999888776 6778988873 2379999 Q ss_pred EecCCCCCcc------EEEEeccce-EEEecceeeEeeehhh--------hhhcCceEEEEEEEEcCEEecCcceEEEEE Q lcl|Aclame:pro 301 IESTVQEAGK------VLTYVKGLY-DGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAVWKL 365 (381) Q Consensus 301 i~s~~~p~~~------i~~gd~s~y-~i~~r~~~~i~~~~~~--------~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l 365 (381) +.+++||.+. +++|||+++ .++.|++++++++++. 
+|.+|+++||+.+|+|+++++++||+.+ T Consensus 218 ~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l-- 295 (300) T protein:vir:95 218 DKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARI-- 295 (300) T ss_pred EEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEE-- Confidence 9999998643 678999985 4889999999987653 5999999999999999999999999985 Q ss_pred Eeccc Q lcl|Aclame:pro 366 DLKGH 370 (381) Q Consensus 366 ~~~~~ 370 (381) +.++. T Consensus 296 ~~~~g 300 (300) T protein:vir:95 296 VKTGG 300 (300) T ss_pred ecCCC Confidence 44343 No 96 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=4.9e-49 Score=285.42 Aligned_cols=268 Identities=9% Similarity=-0.060 Sum_probs=205.6 Q ss_pred cCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEecCCcceEEecccccccccccccccceeccceeee Q lcl|Aclame:pro 80 VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) Q Consensus 80 ~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~ 158 (381) -..+||++||++++++|++.++++++++++|++++++ +..++|+.++.+.|.|++|+++++ +++++|+++++.+||++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCcccc-ccccceeEEEEeeeeEE Confidence 3457899999999999999999999999999999986 569999999999999999987665 67899999999999999 Q ss_pred eehhhhHHHHh---cChhHHHHHHHHHHHHHHHHHHhhheeeccC--CCcceeeeeccccccccccccccccchhhhccc Q lcl|Aclame:pro 159 AFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF 233 (381) Q Consensus 159 
~~~~iS~ell~---ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G--~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~ 233 (381) ++++||+|||+ |+.++|++||++++++++++++|.+|++|+| +++|.++.............. ... T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-----~~~---- 150 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV-----EAP---- 150 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccc-----ccc---- Confidence 99999999994 6678999999999999999999999999964 555555432111111110000 000 Q ss_pred cChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCceEEecC Q lcl|Aclame:pro 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIEST 304 (381) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi~s~ 304 (381) .........+.+++..+ ...+..+..|+|||.++..++.+ ++.+|+|+|. ..+|+||+.++ T Consensus 151 ~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~l---kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~ 220 (298) T protein:vir:16 151 RGIADPNGAIENAVELL-------TGVDADVTGIAINPSFRSALAKQ---KDLQDNALFPELKWGATPDTINGLPVDVNK 220 (298) T ss_pred cccccHHHHHHHHHHHh-------hhcCCCccEEEEcHHHHHHHHHh---hccCCCeeecCcccCCCCceecceeeEEec Confidence 00111111222222222 12344566799999999888775 5778999874 24799999999 Q ss_pred CCCCc------cEEEEeccce-EEEecceeeEeeehh--------hhhhcCceEEEEEEEEcCEEecCcceEEEEEEe Q lcl|Aclame:pro 305 VQEAG------KVLTYVKGLY-DGYLAGGINVQKFKE--------TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDL 367 (381) Q Consensus 305 ~~p~~------~i~~gd~s~y-~i~~r~~~~i~~~~~--------~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~ 367 (381) +||++ .++||||+++ .++.|++++++.+++ .+|.+|+++||+.+|+|+++++++||+.++--. 
T Consensus 221 ~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 221 TVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 99853 4789999984 588999999988765 269999999999999999999999999863222 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=2.1e-48 Score=281.91 Aligned_cols=271 Identities=11% Similarity=-0.024 Sum_probs=203.3 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEecCCcceEEecccccccccccccccceeccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~ 154 (381) |. +.+++||++||++++++|++.+++.++|+++|++++++ +..++|+.++.+.+.|++|+++++ +++++|+++++.+ T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKS-STTGEFDFVTSTP 78 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccc-cccceeeEEEEee Confidence 54 44568899999999999999999999999999999987 468999999999999999987765 5789999999999 Q ss_pred eeeeeehhhhHHHH---hcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeee---eccccccccccccccccchh Q lcl|Aclame:pro 155 NKLTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLN---RQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 155 ~kl~~~~~iS~ell---~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil---~~~~~~~~~~~~~~~~~~~~ 228 (381) ||++++++||+||| .|+.++|++||++++++++++++|++|++|+|++++.|+. +......... +. 
T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~-------~~- 150 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRV-------EL- 150 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccccccee-------ec- Confidence 99999999999999 4788999999999999999999999999999987665542 2111111000 00 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCce Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLN 299 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~ 299 (381) +..........+..++..+.. ....+..+ .|+|||.++..++.+ ++.+|+|+|. ..+|+| T Consensus 151 ---~~~~~~~~~~~i~~~~~~~~~----~~~~~~~~-~~vmn~~~~~~L~~l---kd~~G~~l~~~~~~~~~~~~l~G~P 219 (311) T protein:vir:99 151 ---TADTIANPDLAIEAAVGLLVA----NGHPTPVN-GLALHPSIAWGLSTA---RYTDGRKKFPELGLGIGVSSFEGID 219 (311) T ss_pred ---cccccchhHHHHHHHHHHHhh----hccCCCcc-EEEEcHHHHHHHHhh---hccCCCeeecCcccCCCCceeccee Confidence 000001111112222221111 11122333 399999999888775 6778999874 237999 Q ss_pred EEecCCCCCc----------------cEEEEeccc-eEEEecceeeEeeehhh-------hhhcCceEEEEEEEEcCEEe Q lcl|Aclame:pro 300 VIESTVQEAG----------------KVLTYVKGL-YDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKAK 355 (381) Q Consensus 300 vi~s~~~p~~----------------~i~~gd~s~-y~i~~r~~~~i~~~~~~-------~~~~d~~~~~~~~r~dgk~~ 355 (381) |+.++.+|.+ .+++|||++ +.+++|.+++++.+++. .|.+|+++||+.+|+|+++. 
T Consensus 220 v~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~ 299 (311) T protein:vir:99 220 ASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVF 299 (311) T ss_pred eEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceec Confidence 9999888632 257899998 45889999999887653 48999999999999999998 Q ss_pred cCcceEEEEEEec Q lcl|Aclame:pro 356 DNKVAAVWKLDLK 368 (381) Q Consensus 356 ~~~Af~v~~l~~~ 368 (381) ++ +|+++.-..+ T Consensus 300 ~~-~~v~~~~~~A 311 (311) T protein:vir:99 300 TD-RFVVIENAVA 311 (311) T ss_pred Ch-hHeeeecccC Confidence 85 6776544443 No 98 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.9e-47 Score=276.72 Aligned_cols=265 Identities=11% Similarity=-0.026 Sum_probs=202.6 Q ss_pred cCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-CceEEEEecCCcceEEecccccccccccccccceeccceeee Q lcl|Aclame:pro 80 VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) Q Consensus 80 ~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~ 158 (381) -..+||++||+++.++|++.++++++++++|++++++ +..++|+.++.+.+.|++|+++.+ +++++|+++++.+||++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCcccc-ccccceeEEEEeeeEEE Confidence 2347899999999999999999999999999999986 468999998889999999987765 67999999999999999 Q ss_pred eehhhhHHHHh---cChhHHHHHHHHHHHHHHHHHHhhheeeccC--CCc---ceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 159 AFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQ---PIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 159 
~~~~iS~ell~---ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G--~~q---P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++++||+|||+ |+..+++++|++++++++++++|.+|++|++ +++ +.|+........+.. ... T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~--------~~~- 150 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV--------EAP- 150 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccc--------ccc- Confidence 99999999995 5678999999999999999999999999953 222 233211111000000 000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceeec---------cCCCceEE Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~---------l~~g~~vi 301 (381) .........+.+++..+ ...+..+.+|+|||+++..++++ ++.+|+|+|. ..+|+||+ T Consensus 151 ---~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~tl~G~PV~ 217 (298) T protein:vir:94 151 ---RGIADPNGAIENAVELL-------TGVDADVTGIAINPSFRSALAKQ---KDLQGNALFPELKWGATPDTINGLPVD 217 (298) T ss_pred ---cccccHHHHHHHHHHhh-------hhcCCCccEEEEcHHHHHHHHHh---hccCCCeeecCcccCCCCceecceeeE Confidence 00111112223332222 22344567899999999988876 5678988873 24799999 Q ss_pred ecCCCCCc------cEEEEeccc-eEEEecceeeEeeehh--------hhhhcCceEEEEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 302 ESTVQEAG------KVLTYVKGL-YDGYLAGGINVQKFKE--------TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 302 ~s~~~p~~------~i~~gd~s~-y~i~~r~~~~i~~~~~--------~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~ 366 (381) .++.+|.+ .++||||++ |.++.|++++++++++ .+|.+|+++||+.+|+|+++.+++||++++-- T Consensus 218 ~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 218 VNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred EecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEec 
Confidence 99999853 478999999 5588999999987764 26999999999999999999999999986322 Q ss_pred e Q lcl|Aclame:pro 367 L 367 (381) Q Consensus 367 ~ 367 (381) . T Consensus 298 t 298 (298) T protein:vir:94 298 N 298 (298) T ss_pred C Confidence 2 No 99 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=2.9e-45 Score=264.79 Aligned_cols=278 Identities=12% Similarity=0.008 Sum_probs=203.3 Q ss_pred cccHHH------HHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc--eEEEEec-C---CcceE Q lcl|Aclame:pro 64 SLSANQ------RSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR--LKFLKSE-T---SGVAV 131 (381) Q Consensus 64 ~lt~~e------~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~--~~ip~~~-~---~~~a~ 131 (381) .|+.+. .++.+++. .++.+||||+|++. +++++.+.+.||+|++|++++..+. ..++... + ..... T Consensus 1 ~~~~~~~~~~~~~~~~k~~t-~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~ 78 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKID-VPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRD 78 (315) T ss_pred CcccchhhcCChhhhhhhcC-CcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccc Confidence 222211 12223333 34568999999886 5699999999999999998764332 3344321 1 12245 Q ss_pred EecccccccccccccccceeccceeeeeehhhhHHHHhcChh--HHHHHHHHHHHHHHHHHHhhheeeccCC------Cc Q lcl|Aclame:pro 132 WGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVRVQIEEAFAVALETAFLKGTGK------DQ 203 (381) Q Consensus 132 w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~l~~~i~~~la~a~a~~~d~a~l~G~G~------~q 203 (381) |.++..+ .++++|+|+++.+.+|++++.+.||+++|+|+.+ |||+||...++++|++.++.+|++|+|+ ++ T Consensus 79 ~~~~~~~-~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~ 157 (315) T protein:vir:41 79 ETGQKLA-PPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRM 157 (315) T ss_pred 
cccCcCC-CCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccc Confidence 6666443 4567899999999999999999999999999975 9999999999999999999999999985 47 Q ss_pred ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccccccc---CceEEEEchhhHHHHHhh Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK---GNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~imn~~~~~~~~~~ 280 (381) |.||++.+.......... ........+.+.++++.+ +..|+ ++++|+||+.++.+++++ T Consensus 158 ~~G~l~~a~~~~~~~~~~-----------~~a~~~~~d~l~~l~~sl-------~~~yr~~~~~~~~imn~~t~~~~rkl 219 (315) T protein:vir:41 158 SDGWLKLASEKLTESDVD-----------PEAEDWPMNLFDTMIESL-------PTPYRNNLPNMKFYVTWDIYRAYRDA 219 (315) T ss_pred cccceecccccccccccc-----------cccccccHHHHHHHHHhc-------ChHHhhcCCceEEEEcHHHHHHHHHH Confidence 789998543322211110 001111223344444433 34455 578999999999999887 Q ss_pred hhccCCCCceeec---------cCCCceEEecCCCC-----CccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEE Q lcl|Aclame:pro 281 YTHLNANGVYVTA---------LPFNLNVIESTVQE-----AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTA 346 (381) Q Consensus 281 ~~~~~~~G~~~~~---------l~~g~~vi~s~~~p-----~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~ 346 (381) + +++|+|.|. ..+|+||+.+++|| ++.|+||||+.|+++++.+++++++. ++.++.+.|.. 
T Consensus 220 k---~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~--~a~~~~~~~~~ 294 (315) T protein:vir:41 220 L---KGRETGLGDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDY--DAEMRLTKYVA 294 (315) T ss_pred h---ccCCCccccchhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeee--cCCCCceEEEE Confidence 4 556777763 23599999999885 56799999999999999999888754 45678889999 Q ss_pred EEEEcCEEecCcceEEEEEEe Q lcl|Aclame:pro 347 KQFAYGKAKDNKVAAVWKLDL 367 (381) Q Consensus 347 ~~r~dgk~~~~~Af~v~~l~~ 367 (381) ..|+|+..++.++.++..+++ T Consensus 295 ~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 295 SLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EEEeceeEEeccceeEeeeeC Confidence 999999999999999999999 No 100 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=9.9e-45 Score=261.84 Aligned_cols=279 Identities=11% Similarity=-0.001 Sum_probs=205.4 Q ss_pred cHHHHHHHHHHh--cccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec-C-CceEEEEec-CC---cceEEecccc Q lcl|Aclame:pro 66 SANQRSFFMDIN--KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA-G-LRLKFLKSE-TS---GVAVWGKIYG 137 (381) Q Consensus 66 t~~e~~~~~~~~--~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~-~-~~~~ip~~~-~~---~~a~w~~e~~ 137 (381) -++.++.++... ..++.+||||+|+++ +++++.+++.+++|++++++++ + ....||+-. +. +.+.|.++.. 
T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~ 79 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKV 79 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCc Confidence 223344443321 234567999999997 5799999999999999999875 3 357777643 22 2234555433 Q ss_pred cccccccccccceeccceeeeeehhhhHHHHhcChh--HHHHHHHHHHHHHHHHHHhhheeeccCC--------Ccceee Q lcl|Aclame:pro 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVRVQIEEAFAVALETAFLKGTGK--------DQPIGL 207 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~l~~~i~~~la~a~a~~~d~a~l~G~G~--------~qP~Gi 207 (381) + .++++|+|++++|.+||+...++||+|+|+|+.+ |||+||...++++|++.++.+|++|+|+ ++|.|| T Consensus 80 ~-~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 80 A-PTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred c-CCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 3 3467999999999999999999999999999986 9999999999999999999999999995 478999 Q ss_pred eeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccccccc---CceEEEEchhhHHHHHhhhhcc Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK---GNVTMVVNPSDAFEVQAQYTHL 284 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~imn~~~~~~~~~~~~~~ 284 (381) ++.......... 
........+.+.++++.+ +..|+ ++++|+||+.++.++++++..+ T Consensus 159 l~~a~~~~~~~~-------------~~~~~~~~~~~~~l~~sl-------~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~ 218 (314) T protein:vir:41 159 MKLAGNQYTDAE-------------PEDENWPLNLFDGMMDEL-------DTRYLQLKPRMKFYVSNEIYNGYRKQLLVR 218 (314) T ss_pred hhhcccceeecC-------------ccccccHHHHHHHHHHhc-------CchhhcCCCceEEEecHHHHHHHHHHHhcc Confidence 975432211110 011111223344444433 34454 4778999999998888775433 Q ss_pred CCCCceeec---------cCCCceEEecCCCC-----CccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEE Q lcl|Aclame:pro 285 NANGVYVTA---------LPFNLNVIESTVQE-----AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFA 350 (381) Q Consensus 285 ~~~G~~~~~---------l~~g~~vi~s~~~p-----~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~ 350 (381) |.+.|. ..+|+||+.+++|| ++.|+||||+.|+++++..+++. .++++.++++.|.+..|+ T Consensus 219 ---~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~--~~~~a~~~~~~~~~~~r~ 293 (314) T protein:vir:41 219 ---ETGLGDSALIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIE--PKRDAAMRRTEYIASLRA 293 (314) T ss_pred ---CCcccchhhhCCCCceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEe--ecccCcCCeEEEEEEEEe Confidence 333321 23599999998875 56799999999988887766655 467788999999999999 Q ss_pred cCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 351 YGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 351 dgk~~~~~Af~v~~l~~~~~~ 371 (381) |+...+.+|.++..++.+..- T Consensus 294 d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 294 DCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred ceEEEEcCcEEEEEeeccCCC Confidence 999999999999999887744 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.8e-38 Score=227.49 Aligned_cols=296 Identities=11% Similarity=0.081 Sum_probs=197.3 Q ss_pred HhhhhhccccHHHHHHHH-HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEec Q lcl|Aclame:pro 
57 SLPKSAQSLSANQRSFFM-DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~-~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~ 134 (381) +..+..+ ..-+++.. ......+.++||+||+++..+|++.+++.++++++++++++.. ...+|.....+.+.|+. T Consensus 1 ~~~k~~~---~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~ 77 (321) T protein:vir:31 1 MASRTIN---NDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQ 77 (321) T ss_pred CchHHHH---HHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccc Confidence 1111111 11111111 1122345688999999999999999999999999999999864 56788765555566654 Q ss_pred -ccccccccccccccceeccceeeeeehhhhHHHHhcCh--hHHHHHHHHHHHHHHHHHHhhheeeccCCCcc------e Q lcl|Aclame:pro 135 -IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGKDQP------I 205 (381) Q Consensus 135 -e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP------~ 205 (381) ++......++|+|+++++.+|++.+.++||+++|+|+. +||++||.+.++++|++.++.++++|+|+.+| . T Consensus 78 ~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~ 157 (321) T protein:vir:31 78 DEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQND 157 (321) T ss_pred cccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccch Confidence 33333446789999999999999999999999999985 59999999999999999999999999998765 6 Q ss_pred eeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~~~~~~~~~~~~~ 283 (381) ||++.+......... ... ....+.+.++...+ +..|+ ++++|+||+.++..++..+.. 
T Consensus 158 G~l~~a~~~~~~~~~------~~~-------~~~~d~l~~l~~~l-------~~~yr~~~~~v~im~~~~~~~~~~~l~~ 217 (321) T protein:vir:31 158 GFITVAEGDVETIDA------ADD-------ILDNDLVIRTIAGL-------DSKYRARMNPALIVSEDQLLSYHYTLTD 217 (321) T ss_pred hhhhhhccccccccc------ccc-------ccCHHHHHHHHHhc-------cHhHhcCCCeEEEechHHHHHHHHHHhc Confidence 887643322111110 000 01112233333322 33344 478999999998776654322 Q ss_pred cCCCCceee---------ccCCCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhc-CceEEE--EEEEEc Q lcl|Aclame:pro 284 LNANGVYVT---------ALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALD-DMDLYT--AKQFAY 351 (381) Q Consensus 284 ~~~~G~~~~---------~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~-d~~~~~--~~~r~d 351 (381) .+ + +.| ...+|+||+.+++||++.|+|+||+.++++.+.+++++++.+..... ....|+ ...++| T Consensus 218 ~~--~-~~~~~~l~~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (321) T protein:vir:31 218 RD--T-PLGDNVIMGEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDD 294 (321) T ss_pred CC--C-ccccchhhccccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecc Confidence 21 1 111 12469999999999999999999999999999999998776654332 234444 455688 Q ss_pred CEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 352 GKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 352 gk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) +.+-+.+|+++++ .+..+--.+ +++|- T Consensus 295 ~~ve~~~a~a~~~-~i~~~~~~~--~~~~~ 321 (321) T protein:vir:31 295 FAIENTEAVVLAE-GLGDPLEHL--EEETS 321 (321) T ss_pred eeEeccccEEEEe-cCCcchhcc--cCCCC Confidence 8888888888876 221111111 11111 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=5.1e-36 Score=214.09 Aligned_cols=340 Identities=13% Similarity=0.080 Sum_probs=199.0 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhhhhhcc----------c Q lcl|Aclame:pro 1 
MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----AKAEAERVSSLPKSAQS----------L 65 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~----------l 65 (381) |..+...+..+....+.++.+...+..+..+.+.+..+.+.+..... .+.+..+.......... + T Consensus 136 ~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (517) T protein:vir:97 136 EKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFL 215 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccccccccchhhHHH Confidence 22222111111111111111111111111111111111111100000 00000000000000000 0 Q ss_pred cHHHHHHHHHHhc-------------ccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEE Q lcl|Aclame:pro 66 SANQRSFFMDINK-------------NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVW 132 (381) Q Consensus 66 t~~e~~~~~~~~~-------------~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w 132 (381) ...+......... ....-||+++|+.+...|...+...++++..+++.+.+ ...+|...+...+.| T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~-~~~~~~~~~~~~a~~ 294 (517) T protein:vir:97 216 KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP-TLVVGGDNALTQGTG 294 (517) T ss_pred HHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeecccc-ceeeecccccceeee Confidence 0000010001110 11234789999999999999999999999888876654 356777777777888 Q ss_pred ecccccccccccccccceeccceeeeeehhhhHHHHhcChhH----HHHHHHHHHHHHHHHHHhhheeeccCCC-cceee Q lcl|Aclame:pro 133 GKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTGKD-QPIGL 207 (381) Q Consensus 133 ~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~----l~~~i~~~la~a~a~~~d~a~l~G~G~~-qP~Gi 207 (381) +.|++ .+++++++|+++++.+|++++++++|++||+|+.+| |++||.++++++++++++.+||+|+|++ .+.|+ T Consensus 295 ~~eG~-~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi 373 (517) T protein:vir:97 295 
HTTGT-DKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQI 373 (517) T ss_pred eecCC-cccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccc Confidence 88865 456789999999999999999999999999999988 9999999999999999999999999987 46688 Q ss_pred eeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCC Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNAN 287 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~ 287 (381) ++........ ...+ .....+.+..+...+ ..+ .++.|+|||.|+..++++ ++++ T Consensus 374 ~~~a~~~~~~--------~~~~------~~~~~d~i~~l~~a~--------~~a-~~a~~vmn~~t~~~I~kl---KD~~ 427 (517) T protein:vir:97 374 YPVVGDAWAT--------NVTG------TTNIQELLEKLSVAT--------PKA-ADSTLVIHRNDLAAIRFL---KDKN 427 (517) T ss_pred cccccccccc--------cccc------cchHHHHHHHHHHHh--------hhc-cCCEEEECHHHHHHHHHh---hcCC Confidence 7532211110 0011 111112222221111 111 257899999999999987 6789 Q ss_pred CceeeccC---------CCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCc Q lcl|Aclame:pro 288 GVYVTALP---------FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNK 358 (381) Q Consensus 288 G~~~~~l~---------~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~ 358 (381) |+|+|... +|+.-+ -+.++.+...++.++.|.++++.++++.+. ....+++..|+.-+|++|.+..++ T Consensus 428 G~Yl~~~~~~~~~~~~l~G~~~~-~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~--fd~~~n~~~f~~~~~~~g~i~~~~ 504 (517) T protein:vir:97 428 GNYVFPVGVSNQTIATHFGFNRL-VQSVAVDEKTAVSLSGYVTNGSRGMEFEQG--TILVENNKEYLFEMPISGSLEYKG 504 (517) T ss_pred CCeeccCcCCcccccccCCcccc-ccccccCceeEeeccccEEEeecceeeeee--eecccCceeEeeeeeecccccccc Confidence 99998532 231101 122334555566678899999988876543 224578999999999999999999 Q ss_pred ceEEEEEEecccccCCCC Q lcl|Aclame:pro 359 VAAVWKLDLKGHKPALEG 376 (381) Q Consensus 359 Af~v~~l~~~~~~~~~~~ 376 (381) +|++.. 
-+|.+-| T Consensus 505 r~a~~~-----~~p~~~~ 517 (517) T protein:vir:97 505 TTAYGT-----YTPPVAG 517 (517) T ss_pred ceEEEE-----EcCCCCC Confidence 999753 3444555 No 103 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.96 E-value=3.5e-31 Score=187.54 Aligned_cols=327 Identities=10% Similarity=0.040 Sum_probs=168.9 Q ss_pred CCccHHHHH-------------------HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-H---HHHHHHH Q lcl|Aclame:pro 1 MTINLSETF-------------------ANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAK-A---EAERVSS 57 (381) Q Consensus 1 m~~~l~~~~-------------------~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~ 57 (381) ++......+ .+...+..+........+++.+.+.+..+........... . ....... T Consensus 109 ~pa~~~a~v~~vks~~~~~e~~~~~~e~~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~~~~~~~~~~~e~ 188 (480) T protein:vir:40 109 LPSNKGAKVTKVREENKGEQEQMGANETQEIMKQAIEAGVKVRELEAKVEELNKEREELKKEREASIPSEKPEDAERKFM 188 (480) T ss_pred cccchhhhhhhhhhhhhhhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhccccchhhhhhHHH Confidence 111111111 1111111110000000000000000000000000000000 0 0000000 Q ss_pred hhhhhccccHHHHHHH----HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEEEecCCcceEEe Q lcl|Aclame:pro 58 LPKSAQSLSANQRSFF----MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWG 133 (381) Q Consensus 58 ~~~~~~~lt~~e~~~~----~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip~~~~~~~a~w~ 133 (381) .......-...+..++ +....+...++|+ +|+.+...+.......+++...+++...++. 
.+.|+ T Consensus 189 r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~----------~~~~~ 257 (480) T protein:vir:40 189 RELGSKMAEMPEQGFLREFANGADLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAEDGVD----------DTFIS 257 (480) T ss_pred HHHHHHhccchhhhhhhhhhhhccccccccccc-cccchhhheeechhhhhhhhhcceeeecccc----------ceeee Confidence 0000000000111111 1112223334444 5555566555556666666666655444332 23454 Q ss_pred ccccccccc-ccccccceecc---ceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeec--cCCCcceee Q lcl|Aclame:pro 134 KIYGEIKGQ-LDAAFSEETAI---QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG--TGKDQPIGL 207 (381) Q Consensus 134 ~e~~~~~~~-~~~~f~~v~l~---~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G--~G~~qP~Gi 207 (381) ++..+...+ +..++.+..+. .|++++++++|+++|+|+. +|++||.++++++|+++++.+||+| +|.++|.|| T Consensus 258 ~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~ 336 (480) T protein:vir:40 258 GTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGL 336 (480) T ss_pred eeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccc Confidence 443221111 12233444443 5899999999999999987 8999999999999999999999999 455678888 Q ss_pred eeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCce-EEEEchhhHHHHHhhhhccCC Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV-TMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~imn~~~~~~~~~~~~~~~~ 286 (381) .+..... + .. .+ .. 
+.+..++..+ +.+|+.++ .|+|||.|...++++ +++ T Consensus 337 ~~~~~~~---~---------~~-~~---~~---d~id~L~~al-------~~~y~~~a~~~vmn~~t~~~I~kl---KD~ 387 (480) T protein:vir:40 337 KTATDGW---T---------KQ-IE---YT---DLFEGITDAV-------AECSISDAITIVMSPQTFAELRKA---KGT 387 (480) T ss_pred eeecccc---c---------cc-ch---hH---HHHHHHHHhh-------hHHhhCCCCEEEECHHHHHHHHHh---hcC Confidence 6432110 0 00 00 11 1222222222 35676677 699999999999987 678 Q ss_pred CCceeecc---------CCCceEEec-CCCCCccEEEEeccc-eEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEe Q lcl|Aclame:pro 287 NGVYVTAL---------PFNLNVIES-TVQEAGKVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAK 355 (381) Q Consensus 287 ~G~~~~~l---------~~g~~vi~s-~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~ 355 (381) +|+|+|+. .+|+||+.+ ..+|++.-.+|.++. |.++||+ ++. .+...+..++..|....|++|.+. T Consensus 388 ~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~--~~~~~~~~~~~~~~~e~~v~g~~~ 464 (480) T protein:vir:40 388 DGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDLN-VEN--YNDFDLRYNVEQWLSETLVGGSIR 464 (480) T ss_pred CCCeeccCcccccCcceecccceeeeeccccCCcceeeeCCccEEEEecc-cce--ecccccccchhhhhhhhhhceeeE Confidence 99999963 378897764 567777655555555 5678875 333 233344567778999999999999 Q ss_pred cCcceEEEEEEecccc Q lcl|Aclame:pro 356 DNKVAAVWKLDLKGHK 371 (381) Q Consensus 356 ~~~Af~v~~l~~~~~~ 371 (381) .++||.++++|-+=.- T Consensus 465 ~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 465 GKNRSAYLKKKGSLGV 480 (480) T ss_pred ccccEEEEEeccCcCC Confidence 9999999776643211 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.92 E-value=4.2e-27 Score=165.21 Aligned_cols=256 Identities=17% Similarity=0.102 Sum_probs=192.5 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 
(381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+++..+-.++|+.+.+.|++.+.+...+.+++++.. .+| .++||+....+.+.|+.|+++++ .++++|+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~-~~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP-MTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccc-ccccccceE Confidence 655556667789999999999999999888878776532 234 38899988888999999987665 568999999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++.+|+++..+++|+++..++..|+.+++.+.+++++++.+|..++..- +. ......+ . T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~---------~~---a~~~~~~---------~ 138 (272) T protein:vir:30 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDAL---------SK---STQTVEA---------T 138 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---ccccccc---------c Confidence 9999999999999999999999999999999999999999999987521 10 0000000 0 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc--cC----C-----CCceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LN----A-----NGVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~--~~----~-----~G~~~~~l~~g~~ 299 (381) .. .+.+.+....+.. .+....+|+|||.++..+++.... .. . +|... 
...|+| T Consensus 139 ---~t----~d~i~da~~~l~~-------~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig--~i~G~~ 202 (272) T protein:vir:30 139 ---AT----VDGVSKALDIFND-------EDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG--EVLGVQ 202 (272) T ss_pred ---cC----HHHHHHHHHHHhc-------cCCCccEEEEcHHHHHHHHHhccccccccccccccccccccch--hhcCee Confidence 01 1112222222211 122345799999999888764211 11 1 12211 235899 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~ 371 (381) |+.|++||++++++++.+.+.++.+.+++++.+++ ...+...+++.+|++.++++++++++++++-++++ T Consensus 203 Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 203 IVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRD--ITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEEcCCCCcceEEEEcCCeEEEEecCCceeeeccc--cccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 99999999999999988888888899988886554 46688999999999999999999999999998888 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.92 E-value=4.2e-27 Score=165.21 Aligned_cols=256 Identities=17% Similarity=0.102 Sum_probs=192.5 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+++..+-.++|+.+.+.|++.+.+...+.+++++.. 
.+| .++||+....+.+.|+.|+++++ .++++|+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~-~~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP-MTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccc-ccccccceE Confidence 655556667789999999999999999888878776532 234 38899988888999999987665 568999999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++.+|+++..+++|+++..++..|+.+++.+.+++++++.+|..++..- +. ......+ . T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~---------~~---a~~~~~~---------~ 138 (272) T protein:vir:98 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDAL---------SK---STQTVEA---------T 138 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---ccccccc---------c Confidence 9999999999999999999999999999999999999999999987521 10 0000000 0 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc--cC----C-----CCceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LN----A-----NGVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~--~~----~-----~G~~~~~l~~g~~ 299 (381) .. .+.+.+....+.. .+....+|+|||.++..+++.... .. . +|... 
...|+| T Consensus 139 ---~t----~d~i~da~~~l~~-------~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig--~i~G~~ 202 (272) T protein:vir:98 139 ---AT----VDGVSKALDIFND-------EDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG--EVLGVQ 202 (272) T ss_pred ---cC----HHHHHHHHHHHhc-------cCCCccEEEEcHHHHHHHHHhccccccccccccccccccccch--hhcCee Confidence 01 1112222222211 122345799999999888764211 11 1 12211 235899 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~ 371 (381) |+.|++||++++++++.+.+.++.+.+++++.+++ ...+...+++.+|++.++++++++++++++-++++ T Consensus 203 Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~--~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 203 IVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRD--ITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEEcCCCCcceEEEEcCCeEEEEecCCceeeeccc--cccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 99999999999999988888888899988886554 46688999999999999999999999999998888 No 106 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.73 E-value=1.8e-19 Score=123.34 Aligned_cols=287 Identities=14% Similarity=0.098 Sum_probs=192.2 Q ss_pred hhccccHHHH----H----HH-HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcce Q lcl|Aclame:pro 61 SAQSLSANQR----S----FF-MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVA 130 (381) Q Consensus 61 ~~~~lt~~e~----~----~~-~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a 130 (381) ..+.-++.-+ . 
|- -+|..-+-++.+.+.|......||+.+.+.++|++.+.+.++.+ ...+++....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a 80 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDV 80 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcc Confidence 1221111110 0 00 13344455667899999999999999999999999999887744 5788998889999 Q ss_pred EEecccccccccccccccceeccceeeeeehhhhHHHH--hcChhHHHHHHHHHHHHHHHHHHhhheeeccCC-Ccceee Q lcl|Aclame:pro 131 VWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLN--DFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-DQPIGL 207 (381) Q Consensus 131 ~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell--~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~-~qP~Gi 207 (381) .|...+...+++...+|.+++...+.+.+.+.|...+. ..+..|...+-.+...++++..++..||||++. +++.|| T Consensus 81 ~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL 160 (330) T protein:vir:94 81 QFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGM 160 (330) T ss_pred eeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccch Confidence 99998777665555689999999999999999999994 556778888999999999999999999999975 578899 Q ss_pred eeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccccccc-CceEEEEchhhHHHHHhhhhccC- Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK-GNVTMVVNPSDAFEVQAQYTHLN- 285 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~imn~~~~~~~~~~~~~~~- 285 (381) ++.+.+......+. ..+.++. +.+..++... .... 
...+|+||+.+..+++++..-.+ T Consensus 161 ~~~~~~~q~i~tg~-----~gg~~T~-------d~LDeLl~~v--------~~~~g~~~~~l~n~a~~r~I~a~~R~~~~ 220 (330) T protein:vir:94 161 MGLVAASQTISAGA-----NGGTLTF-------ELLDQLLDLV--------KDKDGQVDYLMSSFAMRRKYFSLLRALGG 220 (330) T ss_pred hhcCCcccEEecCC-----CCCCCCH-------HHHHHHHHHh--------cCCCCCCcEEEechhHHHHHHHHHHhccC Confidence 88665433322111 0111221 2223332211 0011 24578999998888877654222 Q ss_pred ---------CCCceeeccCCCceEEecCCCCCc----------cEEE---Eec--cceEEEec----ceeeEeeehhhhh Q lcl|Aclame:pro 286 ---------ANGVYVTALPFNLNVIESTVQEAG----------KVLT---YVK--GLYDGYLA----GGINVQKFKETLA 337 (381) Q Consensus 286 ---------~~G~~~~~l~~g~~vi~s~~~p~~----------~i~~---gd~--s~y~i~~r----~~~~i~~~~~~~~ 337 (381) ..|.++..- .|+||+.++.+|.+ +|++ |+- .+.+.+.. .|++++.-. .-- T Consensus 221 ~~v~~~~~~~~G~~v~~~-~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G-~~~ 298 (330) T protein:vir:94 221 AAIGEVMTLPSGRQIPTY-RGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVG-AKE 298 (330) T ss_pred CCCCCcccccCCCEEeee-CCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCC-Ccc Confidence 334444221 38899999988853 3544 433 24566653 366664311 112 Q ss_pred hcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 338 ~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) .++..-|+..+|+...+..++|++++. ..+. 
| T Consensus 299 ~k~v~~~~v~~y~~~av~~~~a~~~L~--~V~~-----g 330 (330) T protein:vir:94 299 NADETITRVKMYCGFANFSQLGLAAIK--GLIP-----G 330 (330) T ss_pred ccceeeEEEEEeeeeEEechhheeeec--cccC-----C Confidence 345678999999999999999998853 3331 1 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.61 E-value=3.5e-17 Score=110.82 Aligned_cols=255 Identities=14% Similarity=0.075 Sum_probs=170.3 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-.++|+.+...+.+.+.....+.+++.+-+. +| .+.+|+....+.+.+..|+.+++ ....+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~-~~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEIS-LDKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccC-hhhcCCcce Confidence 4433444455678999999998888877767777765432 24 47899987778888888877765 456778999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+...++..|+.+.+.+.++.++++.+|..++..- .+..... .. 
T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l------------~~~~~~~---------~~- 137 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAA------------KTTSQTV---------ST- 137 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh------------ccccccc---------cc- Confidence 9999999999999999999999999999999999999999998876321 0000000 00 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc---CCCCce--ee---ccCCCceEEe Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL---NANGVY--VT---ALPFNLNVIE 302 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~---~~~G~~--~~---~l~~g~~vi~ 302 (381) .. ..+.+.+....+... ... ..+++|||.+++.+++..... +..|.. .+ ....|++|+. T Consensus 138 --~~----~~d~i~~A~~~lgd~------~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~ 204 (272) T protein:vir:36 138 --KA----NVDGVQAALDIFNDE------DAQ-AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVR 204 (272) T ss_pred --cc----cHHHHHHHHHHhhhc------CCC-ceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEE Confidence 01 111222222222111 111 236899999999887754321 122221 11 1225899999 Q ss_pred cCCCCCccEEEEe--c--cceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 303 STVQEAGKVLTYV--K--GLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 303 s~~~p~~~i~~gd--~--s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) |+.||.++..+.. | ..+.++..+++.++.. 
+........+++.++++.++++++++++++.+-+ T Consensus 205 s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~--R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 205 SKKLAEGSALMFKIVSNSPALKLVLKRGVQVETD--RDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eCCCCCCceeEEEEEecccceeeeecCCcccccc--cchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999988754322 1 2233445566676643 3444566789999999999999999998777665 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.61 E-value=9e-17 Score=108.55 Aligned_cols=260 Identities=13% Similarity=0.043 Sum_probs=178.9 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |....+.-+-.++|+.+...+.+.+.....+.+++++... +| .++||+....+.+.|..++.+++ ..+.++++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~-~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccc-cccccccee Confidence 4444445566789999999999999887666677765432 24 48899987777888888877765 457889999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+....++.+...++..|+.+.+.+.+++++++.+|..++..-.... ..... . 
T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~------------~~~~~--------~- 138 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNA--------D- 138 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccccc--------c- Confidence 99999999899999999999999999999999999999999988875321110 00000 0 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhh--hc-cC-CCCcee-e----ccCCCceEE Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--TH-LN-ANGVYV-T----ALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~--~~-~~-~~G~~~-~----~l~~g~~vi 301 (381) ..+.. .+.+....+.. ..+. ..+++|||..+..+++.. .. .. ..|..+ . ....|++|+ T Consensus 139 --~~~~d----~i~dA~~~l~d------~~~~-~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:93 139 --ITKLN----GLQSAIDKFND------EDLE-PMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred --ccCHH----HHHHHHHHhhh------ccCC-ccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEE Confidence 00111 12222222211 1122 346899999998887642 11 11 112111 0 112489999 Q ss_pred ecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) .|+.+|.++++++....+.++.+.++.++..++ .......+++..+++.++++++++++++ . 
...+.|- T Consensus 206 ~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~~v~~t--~--~~~s~~~ 274 (274) T protein:vir:93 206 RTNKLEAGTAILAKKGAVKLILKRDFFLEVARD--ASTKTTALYSDKHYVAYLYDESKAVKIT--K--GSGSLEM 274 (274) T ss_pred EcCCCCcceEEEEeCCeEEEEecCCcccccccc--hhhcccEEEEEEEEEEEEEcCCceEEEe--e--CccccCC Confidence 999999999988888887777777777765444 4456789999999999999999987754 3 3445554 No 109 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.53 E-value=2.1e-15 Score=101.06 Aligned_cols=261 Identities=12% Similarity=-0.003 Sum_probs=173.4 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |...++.-+-.++|+.+...+.+.+.....+.+++.+-. .+| .+.||+....+.+.+..++..++. .+.++++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDY-SALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcc-ccccccee Confidence 554445556788999999999999987766666665432 124 478999877777888787766653 47788999 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeecc-CCCcceeeeeccccccccccccccccchhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT-GKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~-G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~ 229 (381) .+..++.+..+.++++....+..|+.+.+.+.++.++++..|..++..- |.. . ...+. 
T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~------~------~~~~~--------- 138 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTT------L------EVKGA--------- 138 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccc------c------ccccc--------- Confidence 9999998888999999999999999999999999999999998876531 110 0 00000 Q ss_pred hccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh--cc-C---CCC-----ceeeccCCCc Q lcl|Aclame:pro 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--HL-N---ANG-----VYVTALPFNL 298 (381) Q Consensus 230 ~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~--~~-~---~~G-----~~~~~l~~g~ 298 (381) .+..........+.+....+. . ..... ..+++|||..++.+++... .. . ++| ...+ ..|+ T Consensus 139 -~t~~~~~~~~~~~~da~~~l~----~-~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~--~~G~ 209 (278) T protein:vir:80 139 -INIGLIDKIENTFTDAPDAIE----D-ESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGE--LLGW 209 (278) T ss_pred -cccchhhhHHHHHHHHHHhhc----c-cCCCc-ccEEEECHHHHHHHHhhhhhhccccccccccceeecccee--ecce Confidence 000111111122222222221 1 11112 2357899999888876431 11 1 122 1222 2489 Q ss_pred eEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 299 NVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 299 ~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~ 371 (381) +|+.|+.+|.++.++..-..+.++..+++.++..+ ........+++.++++.++++++++++++ ..+.+ T Consensus 210 ~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~~R--d~~~~~d~i~~~~~yg~~v~~~~~~v~it--~~a~~ 278 (278) T protein:vir:80 210 EIVRTKKLADGNALAVKAGALKTFLKRNLLAESGR--DMDHKLTKFNADQHYAVALVDETKAVKVV--PVAGN 278 (278) T ss_pred eEEEcCCCCcceEEEEeccceeeeecCCccccccc--chhhccceeeeeeEEEEEEEcCcceEEEe--eccCC Confidence 99999999999887776666666667777776543 34456789999999999999999977753 33333 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: 
family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.49 E-value=4.9e-15 Score=99.03 Aligned_cols=260 Identities=13% Similarity=0.066 Sum_probs=176.2 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-.++|+.+...+.+.+.....+.+++.+-+ .+| .+.+|+....+.+.+..|+.+++ ....++++. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~-~~~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIP-VDKIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccC-cccccccee Confidence 443344455678899999999999988877777776543 234 37899887778888888877765 456788899 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ....++.+..+.++.+....+..|+.+.+.+.++.++++.++..++. .++.... +.. ... 
T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~---------~l~~~~~--~~~---------~~~ 139 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLE---------ALRGTKL--TVS---------ADI 139 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHH---------HHhcccc--ccc---------ccc Confidence 99999999999999999999999999999999999999999987753 1111000 000 000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc--c-CC--------CCceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--L-NA--------NGVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~--~-~~--------~G~~~~~l~~g~~ 299 (381) .+ .+.+.+.+..+.. ..+. ..+++|||..+..+++.... . .+ +|.+.+ ..|++ T Consensus 140 ~t-------~d~i~~A~~~lgd------~~~~-~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~--~~G~~ 203 (276) T protein:vir:10 140 GT-------LAGLEAAIDTFDD------EDLE-PMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGE--ALGAV 203 (276) T ss_pred cC-------HHHHHHHHHHhcc------ccCc-ccEEEEcHHHHHHHHHhccccccccccccccceeccccce--eccee Confidence 11 1222222222211 1122 23578999999998875321 1 11 122222 35899 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) |+.|+.+|.+++++..-....++...++.++..++ .......+++.+++..+++++...++++ ..+ ...|.|. 
T Consensus 204 Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~dRd--~~~~~d~i~~~~~y~~~~~~~~~vv~~t--~~~-~~~~~~~ 276 (276) T protein:vir:10 204 IVRSKKLDEGEAILAKRGAVKLITKRDFFLETDRD--PSTKTTALYSDKHYVAYLYDESKAVKVT--KGA-GTTDSGA 276 (276) T ss_pred EEEcCCCCcceEEEEeccceeeeecCCceeecccc--hhhcccEEEEeeEEEEEEEcCcceEEEe--cCC-cCCcCCC Confidence 99999999999877666666666677777776544 3345788999999999999999877764 322 3334444 No 111 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.48 E-value=1e-14 Score=97.22 Aligned_cols=308 Identities=10% Similarity=0.065 Sum_probs=168.5 Q ss_pred HHHHHHHHHHHHhhhhhccccH-HHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEE Q lcl|Aclame:pro 46 LQAKAEAERVSSLPKSAQSLSA-NQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLK 123 (381) Q Consensus 46 ~~~~~~~~~~~~~~~~~~~lt~-~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~ 123 (381) ...+.- ...+.. ...++...-.+-.+- ||.+++++...++++.+.+.+++++.++++++.. ...|++ T Consensus 1 ~~~~~~----------~~~~~n~~~~~i~k~~it~~~l-~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~k 69 (360) T protein:vir:99 1 MSSNST----------IDSVRNQNMNSLSQKDIGLAEL-DGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQ 69 (360) T ss_pred CcchhH----------HHHHhhhHHHHHHhhhcccccc-CceeecHHHHHHHHHHHhhccchhhhcceeecccccccccc Confidence 000000 000000 111111111111122 4678899999999999999999999999988654 344443 Q ss_pred ec-CCcceEEecccccccccccccccceecc-ceeeeeehhhhHHHHhcC----hhHHHHHHHHHHHHHHHHHHhhheee Q lcl|Aclame:pro 124 SE-TSGVAVWGKIYGEIKGQLDAAFSEETAI-QNKLTAFVVLPKDLNDFG----PAWIERFVRVQIEEAFAVALETAFLK 197 (381) Q Consensus 124 ~~-~~~~a~w~~e~~~~~~~~~~~f~~v~l~-~~kl~~~~~iS~ell~ds----~~~l~~~i~~~la~a~a~~~d~a~l~ 197 (381) -. +.-.-.-..|++......+++...+.+. ..++.....+..+-+++. 
...+++.|++.++++++.-++.-.++ T Consensus 70 ig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~ 149 (360) T protein:vir:99 70 FGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIR 149 (360) T ss_pred cccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhh Confidence 21 1100000122222222234555666664 345666667777776664 23567999999999999999988888 Q ss_pred ccCC---------Ccc-----eeeeeccccccccc--cc-----cccccchhhhcc---cc------Chh-HHHHHHHHH Q lcl|Aclame:pro 198 GTGK---------DQP-----IGLNRQVQKGVSVT--EG-----AYPEKEEQGTLT---FA------NPR-ATVNELTQV 246 (381) Q Consensus 198 G~G~---------~qP-----~Gil~~~~~~~~~~--~~-----~~~~~~~~~~~t---~~------~~~-~~~~~~~~~ 246 (381) |+.. ..| .||++.+.....-. ++ ...++....+.+ .. ++. .....+..+ T Consensus 150 g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~ 229 (360) T protein:vir:99 150 AGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNET 229 (360) T ss_pred ccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHH Confidence 8743 233 58887653221100 00 000000000000 00 001 122334455 Q ss_pred HHHhhhccccccccccC----ceEEEEchhhHHHHHhhhhccCC-CCc--ee---eccCCCceEEecCCCCCccEEEEec Q lcl|Aclame:pro 247 FKYHSTNEKGKSVAVKG----NVTMVVNPSDAFEVQAQYTHLNA-NGV--YV---TALPFNLNVIESTVQEAGKVLTYVK 316 (381) Q Consensus 247 ~~~~~~~~~~~~~~~~~----~~~~imn~~~~~~~~~~~~~~~~-~G~--~~---~~l~~g~~vi~s~~~p~~~i~~gd~ 316 (381) ++.+ |..|++ +.+|+|++.+....+..++.+.. -|. .. 
...++|+||+..+.+|++.++|-++ T Consensus 230 ~~~L-------p~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p 302 (360) T protein:vir:99 230 IQTL-------DSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFTDP 302 (360) T ss_pred HHhc-------chhhhcCcccceEEEccCchHHHHHHHHhccCcccchhheecccccccceeeeEEcCCCCCCceEEecc Confidence 5444 344655 44899999987766665543331 111 11 1234699999999999999999999 Q ss_pred cceEEEecceeeEeeehhh--hhhcC-ceEEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 317 GLYDGYLAGGINVQKFKET--LALDD-MDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 317 s~y~i~~r~~~~i~~~~~~--~~~~d-~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) +..+++....++++.+.+. +..+. .+.|.....+|...-+.+|.+++ -.+..+++ T Consensus 303 ~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~v-t~~~~~~~ 360 (360) T protein:vir:99 303 NNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLV-TDLETPTA 360 (360) T ss_pred CceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEE-ecCCCCCC Confidence 9999999888998764432 22222 13333445577777777776553 44444444 No 112 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.47 E-value=6.7e-15 Score=98.27 Aligned_cols=258 Identities=14% Similarity=0.051 Sum_probs=171.2 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |....+.-...++|+.+...+.+.+....-+.+++++-+. +| ...+|+....+.+....++.+++ ..+.++++. 
T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~-~~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP-VDQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCc-hhhccccee Confidence 4444444567789999999998888776656666655331 24 47899876666777667766664 346788899 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+....+..|+.+.+.+.++.++++.+|..++.-- + +..... .... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l---------~---~a~~~~--------~~~~ 139 (274) T protein:vir:96 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEAL---------K---GATLTV--------EADI 139 (274) T ss_pred EEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHH---------h---cCCCCc--------Cccc Confidence 9999998888999999999999999999999999999999998776421 0 000000 0000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh--c-cC---CC-----CceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--H-LN---AN-----GVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~--~-~~---~~-----G~~~~~l~~g~~ 299 (381) . ..+.+.+....+.. ..+. ..+++|||..+..+++... . .. ++ |...+. 
.|++ T Consensus 140 ---~----~~d~i~dA~~~l~d------~~~~-~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~--~G~~ 203 (274) T protein:vir:96 140 ---T----KLDGLQTAIDKFND------EDLE-PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA--LGAV 203 (274) T ss_pred ---c----cHHHHHHHHHHhcc------cCCC-ceEEEeCHHHHHHHHhcccccccccccccccceeeccccee--cCee Confidence 0 11122222222211 1122 3468999999988876421 1 11 11 222222 4889 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~ 374 (381) |+.|+.+|.+++++.....+.++...++.++.. +........+++.++++.++++++++++++-.-.. -+- T Consensus 204 Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~--Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~--~~~ 274 (274) T protein:vir:96 204 IVRSNKLNKGEALLAKKGAVKLITKRDFFLEKD--RDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD--EVM 274 (274) T ss_pred EEEcCCCCcceEEEEeCcceeeeecCCcccccc--cchhhcccEEEEeeEEEEEEEcCccEEEEEcCccc--ccC Confidence 999999999998777777766667777777643 33445678999999999999999998876433322 222 No 113 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.42 E-value=2.4e-14 Score=95.25 Aligned_cols=259 Identities=14% Similarity=0.047 Sum_probs=171.6 Q ss_pred HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEeccccccccccccccc Q lcl|Aclame:pro 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFS 148 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~ 148 (381) .+|.. .+.-.-.++|+.+...+.+.+....-+.+++.+-+. +| .++||+....+.+.+..++.+++. 
...+++ T Consensus 1 ~~~~~-~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~ 78 (275) T protein:vir:96 1 MALEN-MTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPI-DLIETK 78 (275) T ss_pred CCCcc-cchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcch-hhcccc Confidence 12111 122233678999999999999888777788766542 24 478998777777888888777654 467888 Q ss_pred ceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchh Q lcl|Aclame:pro 149 EETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 149 ~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) +.....++.+..+.++.+....+..|+.+.+.+.++.++++.+|..++.--+ +..... .. T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~------------~a~~~~--------~~ 138 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQ------------GATLKV--------EA 138 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh------------cccccc--------cc Confidence 9999999999999999999988888999999999999999999988763111 000000 00 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCC--------CCceeeccCCC Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNA--------NGVYVTALPFN 297 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~--------~G~~~~~l~~g 297 (381) .. . ..+.+.+.+..+.. ..+. ..+++|||..+..+++... 
...+ +|...+ ..| T Consensus 139 ~~---~----~~d~i~dA~~~lgd------~~~~-~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~--~~G 202 (275) T protein:vir:96 139 DI---T----KLAGLQTAIDKFND------EDLE-PMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGE--ALG 202 (275) T ss_pred cc---c----CHHHHHHHHHHhcc------ccCC-ccEEEeCHHHHHHHHhcccccccccccccccceeccccce--ecC Confidence 00 0 11222233322321 1112 2368999999988877531 1111 122222 248 Q ss_pred ceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 298 LNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 298 ~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) ++|+.|+.+|.++.++..-....++...++.++..++ .......+++.+++..+++++++.++++. +|+.-|- T Consensus 203 ~~Vi~s~~~p~~t~~i~~~gA~~~~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~vv~~t~-----~~~~~~~ 275 (275) T protein:vir:96 203 AIIVRSNKIKEGEAILAKRGAVKLITKRDFFLETERH--ASHKSTALFSDKHYVAYLYDESKVVKITK-----SASGLGV 275 (275) T ss_pred eeEEEeCCCCcceEEEEeccceeeeecCCcccccccc--hhhcCcEEEEeEEEEEEEEcCccEEEEEe-----cccccCC Confidence 9999999999998776655555566666777665443 44567899999999999999999887644 3343443 No 114 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.41 E-value=5.6e-14 Score=93.22 Aligned_cols=258 Identities=14% Similarity=0.045 Sum_probs=172.8 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-.++|+.+...+.+.++......+++.+-+. +| .+.+|+....+.+....++.+++. ...+.++. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-ccccccee Confidence 4444444566789999999999888766555566655432 34 478998776677777777666653 46778899 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+....++.+-...+..|+.+.+.+.++.++++..|..++.--.. ......+ .. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~------------a~~~~~~--------~~ 139 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG------------AKLTVNA--------DI 139 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhc------------cCccccc--------cc Confidence 999999888899999999999899999999999999999999887642110 0000000 00 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhh--hc-cC-CCCc-------eeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--TH-LN-ANGV-------YVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~--~~-~~-~~G~-------~~~~l~~g~~ 299 (381) .+. +.+.+....+.. ..+ ...+.+|||..+..+++.. .. .. ..|. ..+. 
.|++ T Consensus 140 ---~~~----d~i~dA~~~l~d------~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~--~G~~ 203 (274) T protein:vir:94 140 ---TKL----NGLQSAIDKFND------EDL-EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA--LGAI 203 (274) T ss_pred ---cCH----HHHHHHHHHhhc------cCC-CceEEEeCHHHHHHHHhhhhhhccccCcccccceecccccee--cCee Confidence 011 112222222211 111 2346789999998887642 11 11 1121 2222 4889 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.++.+|.++.++.......++...++.++..++. ......+++..++..+++++.++++++. ...++|- T Consensus 204 Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~----~~~~~~~ 274 (274) T protein:vir:94 204 IVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITK----GSGSLEM 274 (274) T ss_pred EEEcCCCCcceEEEEeCcceEeeecCCceeccccch--hhcccEEEEEEEEEEEEEcCCceEEEec----CcccccC Confidence 999999999998887777777777777787765543 3456789999999999999999887653 3344444 No 115 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.41 E-value=5.6e-14 Score=93.22 Aligned_cols=258 Identities=14% Similarity=0.045 Sum_probs=172.8 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-.++|+.+...+.+.++......+++.+-+. +| .+.+|+....+.+....++.+++. ...+.++. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-ccccccee Confidence 4444444566789999999999888766555566655432 34 478998776677777777666653 46778899 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+....++.+-...+..|+.+.+.+.++.++++..|..++.--.. ......+ .. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~------------a~~~~~~--------~~ 139 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG------------AKLTVNA--------DI 139 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhc------------cCccccc--------cc Confidence 999999888899999999999899999999999999999999887642110 0000000 00 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhh--hc-cC-CCCc-------eeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--TH-LN-ANGV-------YVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~--~~-~~-~~G~-------~~~~l~~g~~ 299 (381) .+. +.+.+....+.. ..+ ...+.+|||..+..+++.. .. .. ..|. ..+. 
.|++ T Consensus 140 ---~~~----d~i~dA~~~l~d------~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~--~G~~ 203 (274) T protein:vir:97 140 ---TKL----NGLQSAIDKFND------EDL-EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA--LGAI 203 (274) T ss_pred ---cCH----HHHHHHHHHhhc------cCC-CceEEEeCHHHHHHHHhhhhhhccccCcccccceecccccee--cCee Confidence 011 112222222211 111 2346789999998887642 11 11 1121 2222 4889 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.++.+|.++.++.......++...++.++..++. ......+++..++..+++++.++++++. ...++|- T Consensus 204 Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~----~~~~~~~ 274 (274) T protein:vir:97 204 IVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITK----GSGSLEM 274 (274) T ss_pred EEEcCCCCcceEEEEeCcceEeeecCCceeccccch--hhcccEEEEEEEEEEEEEcCCceEEEec----CcccccC Confidence 999999999998887777777777777787765543 3456789999999999999999887653 3344444 No 116 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.34 E-value=4e-13 Score=88.54 Aligned_cols=272 Identities=14% Similarity=0.139 Sum_probs=164.3 Q ss_pred hccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceEEeccc---- Q lcl|Aclame:pro 62 AQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIY---- 136 (381) Q Consensus 62 ~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~w~~e~---- 136 (381) -..+|-.| .+.+.+..+...|||.+.+.++|++.+.+.++.| ...+.+....+.+.+.+.+ T Consensus 1 mpaltLae--------------a~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~ 66 (310) T protein:vir:97 1 
MASVTLAE--------------SAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFS 66 (310) T ss_pred CcccchHH--------------HhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccccccccccc Confidence 11233222 2245678889999999999999999999988755 3566665444333322211 Q ss_pred ccccccccccccceeccceeeeeehhhhHHHHhc--C-hhHHHHHHHHHHHHHHHHHHhhheeeccCCCcce-eeeeccc Q lcl|Aclame:pro 137 GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--G-PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI-GLNRQVQ 212 (381) Q Consensus 137 ~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s-~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~-Gil~~~~ 212 (381) .+...++..+|.+++...+-+++.+.|...+.+- + ..|...+=.+...+++....+..||||+.+++|. ||++.+. T Consensus 67 ~~g~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~ 146 (310) T protein:vir:97 67 GAGAGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCA 146 (310) T ss_pred CCCccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCC Confidence 1112356788999999999999999999876653 2 4444445566778999999999999999866554 9988764 Q ss_pred cccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc-------- Q lcl|Aclame:pro 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-------- 284 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~-------- 284 (381) .......+. .-+.++. +.|..++... .. .-....+++|||.+..++.++..-. 
T Consensus 147 ~~q~i~~~~-----~gg~~t~-------d~LDeLl~~v---~~----~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~ 207 (310) T protein:vir:97 147 SGQKATTGA-----TGSAISF-------AILDELMDLV---VD----KDGQVDYLTMHARTLRSYKALLRALGGASINEV 207 (310) T ss_pred ccceeecCC-----CCCCCCH-------HHHHHHHHHH---hc----CCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCc Confidence 332221111 0011111 1222222111 00 0112347999999877676554322 Q ss_pred --CCCCceeeccCCCceEEecCCCCCc----------cEE---EEec--cceEEEe----cceeeEeeehhhhhhcCceE Q lcl|Aclame:pro 285 --NANGVYVTALPFNLNVIESTVQEAG----------KVL---TYVK--GLYDGYL----AGGINVQKFKETLALDDMDL 343 (381) Q Consensus 285 --~~~G~~~~~l~~g~~vi~s~~~p~~----------~i~---~gd~--s~y~i~~----r~~~~i~~~~~~~~~~d~~~ 343 (381) +..|+++... .|+||+.++.+|.+ +|+ ||+. .+.+++. ..|+++...-+. -.++..- T Consensus 208 ~~~~~G~~v~~~-~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~-~~~~v~~ 285 (310) T protein:vir:97 208 VELPSGAEVPAY-SGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGES-EDSDEHI 285 (310) T ss_pred cccCCCCEEeee-CCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcc-cCCccee Confidence 3345554322 38999999999853 244 4543 2345543 235555432111 1235567 Q ss_pred EEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 344 ~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) |+..+++.-.+..++|++++. 
.+ +| T Consensus 286 ~~V~~Y~~~av~~~~A~a~L~-~V----------~~ 310 (310) T protein:vir:97 286 WRVKWYCGLALFSEKGLACAD-GI----------TN 310 (310) T ss_pred EEEEEeeeEEEecccceeeec-cc----------cC Confidence 889999999999999999862 11 11 No 117 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.30 E-value=5.3e-13 Score=87.89 Aligned_cols=258 Identities=13% Similarity=0.054 Sum_probs=168.2 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-.++|+.+...+.+.+....-+.+++.+-. .+| .+.||+....+.+....++.+++. ...+.++. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccch-hhccccee Confidence 433334445568999999999888876655556665532 234 478998776677777777666643 46778888 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+..+.++.+-...+..|+.+.+.+.++.++++..|..++.--.+. ..... ... 
T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a------------~~~~~--------~~a 139 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVN--------ADI 139 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------ccccc--------ccc Confidence 8888898888999998888888889999999999999999998876421100 00000 000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhh--hc-cCCC--------CceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--TH-LNAN--------GVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~--~~-~~~~--------G~~~~~l~~g~~ 299 (381) .+ .+.+.+....+.. ... ...+.+|||..+..+++.. .. ..++ |...+ ..|++ T Consensus 140 ---~~----~d~i~dA~~~lgd------~~~-~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~--~~G~~ 203 (274) T protein:vir:12 140 ---TK----LNGLQSAIDKFND------EDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE--ALGAI 203 (274) T ss_pred ---cC----HHHHHHHHHHhcc------ccc-cccEEEeCHHHHHHHHhhhhhhccccccccccceeccccee--ecCee Confidence 01 1122222222221 111 2346789999998887742 11 1111 22222 24899 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) |+.|+.+|.++.++.-...+.++...++.++..++.. 
.....+++.+++..++++++..++++ ....++|- T Consensus 204 Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~vv~~t----~~~~~~~~ 274 (274) T protein:vir:12 204 IVRSNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS--TKTTALYSDKHYVAYLYDESKAVKIT----KGSGSLEM 274 (274) T ss_pred EEEeCCCCcceEEEEeccceeeeecCCceeccccchh--hcccEEEeeeEEEEEEEcCCceEEEE----cCCccccC Confidence 9999999998876554555555566777777655433 46678999999999999999987754 34455555 No 118 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.25 E-value=1.7e-12 Score=85.16 Aligned_cols=258 Identities=14% Similarity=0.061 Sum_probs=167.0 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-..+|+.+...+.+.+....-+.+++.+-+ .+| .+.||+....+.+....++.+++. ...+.++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 333333445577899999999888877655566665433 234 588998776677777777666643 46777888 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+..+.++.+-...+..|+.+.+.+.++.++++..|..++.--.+ ..... .... 
T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~------------a~~~~--------~~~~ 139 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKS------------AKLTV--------EADI 139 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhc------------ccccc--------cccc Confidence 888888888899999988888889999999999999999999877631110 00000 0000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhh--hc-cCC--------CCceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--TH-LNA--------NGVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~--~~-~~~--------~G~~~~~l~~g~~ 299 (381) . ..+.+.+....+... .. ...+.+|||..+..+++.. .. ..+ +|...+. .|++ T Consensus 140 ---~----~~d~i~~A~~~lgd~------~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~--~G~~ 203 (274) T protein:vir:95 140 ---T----KLTGLQTAIDKFNDE------DL-EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEA--LGAV 203 (274) T ss_pred ---c----CHHHHHHHHHHhccc------cc-cccEEEeCHHHHHHHHhhccccccccccccccceecccccee--cCeE Confidence 0 111222222222211 11 2346789999998888742 11 111 1222222 4899 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) |+.|+.+|.++.++.-...+.++...++.++..++ .......+++.+++..++++++..++++ .. 
.|..| T Consensus 204 Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~~v~~t----k~----~~~~~ 273 (274) T protein:vir:95 204 IVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRD--PSTKTTALYSDKHYVAYLYDESKAVKIT----KG----SGSLE 273 (274) T ss_pred EEEeCCCCCceEEEEeccceeeeecCCcccccccc--cccccCEEEEeEEEEEEEEcCCcEEEEE----cC----Ccccc Confidence 99999999988765555555555667777765444 4457788999999999999999988864 11 22222 Q ss_pred C Q lcl|Aclame:pro 380 T 380 (381) Q Consensus 380 ~ 380 (381) - T Consensus 274 ~ 274 (274) T protein:vir:95 274 M 274 (274) T ss_pred C Confidence 2 No 119 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.25 E-value=1.7e-12 Score=85.16 Aligned_cols=258 Identities=14% Similarity=0.061 Sum_probs=167.0 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee----cCC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~----~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+.+.-.-..+|+.+...+.+.+....-+.+++.+-+ .+| .+.||+....+.+....++.+++. ...+.++. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 333333445577899999999888877655566665433 234 588998776677777777666643 46777888 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+..+.++.+-...+..|+.+.+.+.++.++++..|..++.--.+ ..... .... T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~------------a~~~~--------~~~~ 139 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKS------------AKLTV--------EADI 139 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhc------------ccccc--------cccc Confidence 888888888899999988888889999999999999999999877631110 00000 0000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhh--hc-cCC--------CCceeeccCCCce Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--TH-LNA--------NGVYVTALPFNLN 299 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~--~~-~~~--------~G~~~~~l~~g~~ 299 (381) . ..+.+.+....+... .. ...+.+|||..+..+++.. .. ..+ +|...+. 
.|++ T Consensus 140 ---~----~~d~i~~A~~~lgd~------~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~--~G~~ 203 (274) T protein:vir:96 140 ---T----KLTGLQTAIDKFNDE------DL-EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEA--LGAV 203 (274) T ss_pred ---c----CHHHHHHHHHHhccc------cc-cccEEEeCHHHHHHHHhhccccccccccccccceecccccee--cCeE Confidence 0 111222222222211 11 2346789999998888742 11 111 1222222 4899 Q ss_pred EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 300 vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) |+.|+.+|.++.++.-...+.++...++.++..++ .......+++.+++..++++++..++++ .. .|..| T Consensus 204 Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~~v~~t----k~----~~~~~ 273 (274) T protein:vir:96 204 IVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRD--PSTKTTALYSDKHYVAYLYDESKAVKIT----KG----SGSLE 273 (274) T ss_pred EEEeCCCCCceEEEEeccceeeeecCCcccccccc--cccccCEEEEeEEEEEEEEcCCcEEEEE----cC----Ccccc Confidence 99999999988765555555555667777765444 4457788999999999999999988864 11 22222 Q ss_pred C Q lcl|Aclame:pro 380 T 380 (381) Q Consensus 380 ~ 380 (381) - T Consensus 274 ~ 274 (274) T protein:vir:96 274 M 274 (274) T ss_pred C Confidence 2 No 120 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.23 E-value=7.7e-12 Score=81.51 Aligned_cols=349 Identities=11% Similarity=-0.038 Sum_probs=167.1 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHH--HHHH-HHH----HHHHHHHHHHHHhhhhhccccHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMIN--QLFE-ETK----LQAKAEAERVSSLPKSAQSLSANQRSFF 73 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~----~~~~~~~~~~~~~~~~~~~lt~~e~~~~ 73 (381) |- +=++++.+.-=....+.++.+.-.++.+ 
.+++ +.+ .+..-+..+...+...+.. ...|-+.. T Consensus 1 ~~--------~~~~~~~~~~~~~~~~~e~k~lr~~me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~-p~~eV~~~ 71 (393) T protein:vir:79 1 ME--------NWLKQLKESGFTETQVQEQKSLRTRMERGETLAEADANKLALNEEETQILESFAKMMEGET-PTNEVNLR 71 (393) T ss_pred Cc--------hHHHHHHhccCchhHHHHHHHHHHHhhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCC-chhheehh Confidence 21 1111111110000111111111111110 0000 000 0000011111111111111 11121111 Q ss_pred HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec-CCce-EEEEecCCcceEEeccccccccc--ccccccc Q lcl|Aclame:pro 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLRL-KFLKSETSGVAVWGKIYGEIKGQ--LDAAFSE 149 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~-~~~~-~ip~~~~~~~a~w~~e~~~~~~~--~~~~f~~ 149 (381) .. -++.++-.|||..+++-+.+........-+++.-+.. .|.. .+| .-+.--+.-++|+++.+.. +..+|+. T Consensus 72 e~---mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~-~~g~~Ra~~IgEGgE~~~~sld~~T~ds 147 (393) T protein:vir:79 72 EF---MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFP-SIGIMRAYDVAEGQEIPEDSIDWQTHES 147 (393) T ss_pred hh---hcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceecc-chheeeeccccccccccccchhhhcCCc Confidence 11 2455778999999999888744443333333333333 2221 111 1112234445666766653 3588999 Q ss_pred eeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCC-c--ceeeeeccccccccccccccccc Q lcl|Aclame:pro 150 ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-Q--PIGLNRQVQKGVSVTEGAYPEKE 226 (381) Q Consensus 150 v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~-q--P~Gil~~~~~~~~~~~~~~~~~~ 226 (381) +++...|++..+.+|+|++.||..|+-+++.....+++++..+.-.++|.-++ + .-|+.+...+. +++-.. +.. 
T Consensus 148 v~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ah--ptGr~~-~~~ 224 (393) T protein:vir:79 148 PEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAH--TTGLDK-NGV 224 (393) T ss_pred eeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccce--eecCCc-ccc Confidence 99999999999999999999999999999999999999999999999998644 3 34554322111 111000 001 Q ss_pred hhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh----hhccCCCCcee----------- Q lcl|Aclame:pro 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ----YTHLNANGVYV----------- 291 (381) Q Consensus 227 ~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~----~~~~~~~G~~~----------- 291 (381) -.+++ .+....+.+..+. +.-|. ..+++|||--+.-+-+- .+.+++-|+|. T Consensus 225 qNGTl---SleDllDm~~av~----------~~hyt-~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~alg 290 (393) T protein:vir:79 225 QNDTF---SAEDFLDLIIAVM----------ANEYT-PSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALG 290 (393) T ss_pred ccccc---cHHHHHHHHHHHh----------cccCC-cceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhc Confidence 11222 2233333333322 12233 34689999654322221 11233334432 Q ss_pred -----eccCCCceEEecCCCCCccE--EE----Eeccc-eEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCc- Q lcl|Aclame:pro 292 -----TALPFNLNVIESTVQEAGKV--LT----YVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNK- 358 (381) Q Consensus 292 -----~~l~~g~~vi~s~~~p~~~i--~~----gd~s~-y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~- 358 (381) ..|||+..|+.|+++|=++. =| -|-.. -++-.+-+++.++-++. ..|.+-++-+.|++.-+.+.. 
T Consensus 291 p~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk--~rdiq~iKl~ERYG~gvLn~gk 368 (393) T protein:vir:79 291 PDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK--ARGLQNIKMIERYGIGILNEGK 368 (393) T ss_pred hhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc--cccceeeeeeeeeceeeeeCCc Confidence 13678899999999994332 11 11121 11222335555554442 357888999999987666643 Q ss_pred c-eEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 359 V-AAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 359 A-f~v~~l~~~~~~~~~~~~~~~~ 381 (381) | |+...++++..-+.|----+-- T Consensus 369 aiavakNI~~~k~y~~P~~~~~~~ 392 (393) T protein:vir:79 369 AIAVAKNISMDKSYAEPMLIKNVG 392 (393) T ss_pred eEEEEecceeecccccchhhhccC Confidence 2 3333445444333331100000 No 121 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.13 E-value=1.6e-11 Score=79.84 Aligned_cols=256 Identities=10% Similarity=0.014 Sum_probs=163.6 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeec----CC-ceEEEEecCCcceEEecccccccccccccccce Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~----~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v 150 (381) |..+. -.-.++|+.+..-+.+.+.+..-+.+++.+-+. +| .+.+|.....+.+.-+.|+.+++. 
...++++- T Consensus 1 Ma~T~--~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~-~~lt~~~~ 77 (270) T protein:vir:95 1 MTQTK--KANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDT-TQMSMTTT 77 (270) T ss_pred CCcee--hhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccch-hhcccchh Confidence 33322 233568999999999988887777777766442 34 378998777777776677666653 46778888 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ....++.+.-+.++.+-...+.-|....+.+.++..+++.++..++. . ++......+ T Consensus 78 ~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~-~--------l~~a~~~~~-------------- 134 (270) T protein:vir:95 78 KVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIA-E--------LNKSKQTAT-------------- 134 (270) T ss_pred eeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHH-H--------hcccccccc-------------- Confidence 88899999999999998887777889999999999999999987652 0 110000000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc---cCCC-----CceeeccCCCceEEe Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH---LNAN-----GVYVTALPFNLNVIE 302 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~---~~~~-----G~~~~~l~~g~~vi~ 302 (381) ...+.. .+.+.+..+ .+. . ....+.+|||.++..+++.... +.++ |.+.+. .|++||. 
T Consensus 135 -~~~t~~----~~~dA~~~l---gd~---~-~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~--~G~~Viv 200 (270) T protein:vir:95 135 -VSADAT----GILDAIEVF---NSE---N-DEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEI--VGVSDIV 200 (270) T ss_pred -cccCHH----HHHHHHHHh---ccc---c-CCCcEEEEcHHHHHHHHhhhcccccccccchhccccccee--cceeEEE Confidence 001111 111111111 111 1 1123589999999988864322 1122 223332 4889877 Q ss_pred cCCCC-CccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCC Q lcl|Aclame:pro 303 STVQE-AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 303 s~~~p-~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~ 380 (381) ++.+| +++.++.-.....++...++.++..++ .......+++..++..+++++..+++++++ +.|++|- T Consensus 201 ~s~~~~~~~~~l~~~gAi~~~~~~~~~vEtdRd--~~~~~d~i~~~~~y~v~~~~~skvv~~t~~-------~a~~~~~ 270 (270) T protein:vir:95 201 KSKRVSENTAFLQRYGAMEIVNKKKPEAYTDFD--ILKRTHLLSTNYHYSVNLKDETGVVKVTFK-------PSGSLEM 270 (270) T ss_pred eCCCCCceeEEEEeccceeeeecCCceeeeccc--hhhcccEEEeeeEEEEEEEccceEEEEEec-------CCCCcCC Confidence 66554 556555444445556666777765544 344677899999999999999988877664 2344444 No 122 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.98 E-value=4e-11 Score=77.61 Aligned_cols=217 Identities=16% Similarity=0.089 Sum_probs=144.3 Q ss_pred ceeeecCCceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 110 LGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV 189 (381) Q Consensus 110 ~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~ 189 (381) -+-++.+.++++|.. .+.|.-+.|+.+++. 
...++++.+...++.+.-+.|+.+-.-.+.-|......+.++.+|++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~-~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCCh-hhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 344555667889965 556777888777654 46778899999999999999999998888889999999999999999 Q ss_pred HHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEE Q lcl|Aclame:pro 190 ALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) Q Consensus 190 ~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~im 269 (381) ++|..++. .+.. +..+ .. +..+. +.+.+.+..+. .. .....+.+| T Consensus 78 kvD~di~~------------~~~~-a~l~--------~~---~~~t~----d~i~~A~~~fg---de----~~~~~vivv 122 (231) T protein:vir:73 78 KVDDDLLK------------AAKT-TSQT--------VS---TKANV----DGVQAALDIFN---DE----DAQAYVLIV 122 (231) T ss_pred hhhHHHHH------------hhcc-cccc--------cc---ccccH----HHHHHHHHHhc---cc----cccceEEEE Confidence 99987662 1110 0000 00 11111 22222222221 11 112346899 Q ss_pred chhhHHHHHhhhhccC-----C-----CCceeeccCCCceEEecCCCCCccEEEEe----ccceEEEecceeeEeeehhh Q lcl|Aclame:pro 270 NPSDAFEVQAQYTHLN-----A-----NGVYVTALPFNLNVIESTVQEAGKVLTYV----KGLYDGYLAGGINVQKFKET 335 (381) Q Consensus 270 n~~~~~~~~~~~~~~~-----~-----~G~~~~~l~~g~~vi~s~~~p~~~i~~gd----~s~y~i~~r~~~~i~~~~~~ 335 (381) ||.+++.+|+...... . +|.+... .|++|+.|+.+|.++.++.. .....++...++.++.. 
+ T Consensus 123 ~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i--~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtd--R 198 (231) T protein:vir:73 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADV--LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETD--R 198 (231) T ss_pred cchHHHhhhhccchhhhhhhhccceeeecccceE--cceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecc--c Confidence 9999999988543211 1 1222222 58999999999998765432 12344556667777654 3 Q ss_pred hhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 336 ~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) -.....+.+.+.+++..+++++..+++++++=+ T Consensus 199 d~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 199 DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 455667889999999999999999887666654 No 123 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=98.90 E-value=1.7e-09 Score=68.62 Aligned_cols=340 Identities=14% Similarity=0.125 Sum_probs=166.6 Q ss_pred CCccHHH----HHHHHHHHHHHHHhhhhHHHHH-------------------HHHHH-------HHHHHH---HHHHHHH Q lcl|Aclame:pro 1 MTINLSE----TFANAKNEFINAVNNGEPQERQ-------------------NELYG-------DMINQL---FEETKLQ 47 (381) Q Consensus 1 m~~~l~~----~~~e~~~~~~~~~~~~~~~~~~-------------------~~~~~-------~~~~~~---~~~~~~~ 47 (381) |++-.++ .++|+++.+.+--+++..-..+ .+.+. +.++.+ .+..+.. 
T Consensus 1 ~~~s~~~~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~~Kgk 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhhcccc Confidence 5542221 2333333222111111110000 00000 000000 0000000 Q ss_pred HHH-HHH---HHH-Hh--hhhhccccHHHHHHHH-HHhcc-c-CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC Q lcl|Aclame:pro 48 AKA-EAE---RVS-SL--PKSAQSLSANQRSFFM-DINKN-V-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL 117 (381) Q Consensus 48 ~~~-~~~---~~~-~~--~~~~~~lt~~e~~~~~-~~~~~-~-~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~ 117 (381) .+. ++- ++. .. ......++.+-+.+.. .+.+. . ..+--+.+|.-+...|-..+..+.++++...+.+.++ T Consensus 81 ~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~ 160 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) T ss_pred hhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCc Confidence 000 000 000 00 0011222222222221 12222 2 1344457899999999999999999999999999855 Q ss_pred ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhc--ChhHHHHHHHHHHHHHHHH-HHhhh Q lcl|Aclame:pro 118 RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAV-ALETA 194 (381) Q Consensus 118 ~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~l~~~i~~~la~a~a~-~~d~a 194 (381) -+ +-.......-+||--.|..+.++..+|..-+|.+.-++.+.++..-..++ +.-.|-.||.++|..++-. 
+.++| T Consensus 161 l~-V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~A 239 (400) T protein:vir:93 161 LL-VSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLA 239 (400) T ss_pred ee-eecchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhh Confidence 33 22223333356765556667777888999999988777777664443332 2334789999999999995 68999 Q ss_pred eeeccCCCcceeee--eccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|Aclame:pro 195 FLKGTGKDQPIGLN--RQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 195 ~l~G~G~~qP~Gil--~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~ 272 (381) ++-|+|++...|+- +++..... .+... ...+.. +...+.-.+.++. .+.. ..+...+|.|. T Consensus 240 ii~GdG~Ngf~~~dk~t~Ik~I~~--dt~kt--~~a~~~---~~qdl~E~~~d~~---------~~~a-ad~~~Iv~s~d 302 (400) T protein:vir:93 240 LVEGDGTNGFKSIDKEADVKKIKK--ITTKA--KSAGKT---PFADAIEEAVDFV---------RPTA-GRRYLIVKAED 302 (400) T ss_pred eeecccccccCCCcchhhhhhhhh--hhhhh--hhcCCc---cHHHHHHHHHhhh---------hhcc-CCceeEEeccc Confidence 99999988665551 22111000 00000 000110 1111111111111 1111 22445677887 Q ss_pred hHHHHHhhhhccCCCCceeec---------cCCCc-eEEecCC--CCCccEEEEeccceEEEecceeeEeeehhhhhhcC Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVYVTA---------LPFNL-NVIESTV--QEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDD 340 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~~~~---------l~~g~-~vi~s~~--~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d 340 (381) +.+.++.+ ++++|++.-. ..||+ .++.... +|...++. 
|-..|+ ..++++ ..+..-...+ T Consensus 303 ~~A~L~~l---k~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-Dek~~i--~~~~~~--t~~sf~~~tN 374 (400) T protein:vir:93 303 RKALLDEL---RQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLV-DQKYHI--DMQDLT--KVDAFEWKTN 374 (400) T ss_pred hHHHHHHh---cCCcceeeeeeccccchhhhhcccceeeeeccCCCCCceeee-ehhhhc--cccCce--eccceeeeec Confidence 77777765 5566665431 12343 2333333 34334444 444343 334443 3333434445 Q ss_pred ceEEEEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 341 MDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 341 ~~~~~~~~r~dgk~~~~~Af~v~~l~ 366 (381) +-.+..-.++.|-+.-+++.+++++- T Consensus 375 s~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 375 SNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred cceEEeeeeeccceecccceeeEeeC Confidence 55666677799999999999987766 No 124 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.73 E-value=1.6e-09 Score=68.84 Aligned_cols=291 Identities=11% Similarity=-0.025 Sum_probs=152.3 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCc-eEcc-HHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceEE Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEE-KLLP-EETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVW 132 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg-~lvP-~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~w 132 (381) |..-..+.++ ..+.+..++ +-++ +.+..++...+...+.++++..+.++. |+ ..+|+- +..++.. 
T Consensus 1 m~~~~~~~~t----------~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~ 69 (334) T protein:vir:80 1 MTYPAANTHT----------RPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAG 69 (334) T ss_pred CCCCcCCCcc----------ccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeee Confidence 1111111111 011112222 3444 899999999999999999999888864 43 778875 4444555 Q ss_pred ecccccccccccccccceecccee-eeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceee Q lcl|Aclame:pro 133 GKIYGEIKGQLDAAFSEETAIQNK-LTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGL 207 (381) Q Consensus 133 ~~e~~~~~~~~~~~f~~v~l~~~k-l~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l----~G~G~~qP~Gi 207 (381) ..-+.++.. ...+-++.+|.... ++.-..|..-=--++..|+.+.+.++++.++++..|++++ .|.....|.+. T Consensus 70 ~~~g~~l~~-~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~ 148 (334) T protein:vir:80 70 RKAGEELVV-QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHL 148 (334) T ss_pred ecCCCCCCC-CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 554444433 23445666666555 3344444444444567789999999999999999998764 33333333211 Q ss_pred eeccccccccccccccccchhhh--ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGT--LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~--~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~ 285 (381) -.... .|........++ ....++..+...+......+.. 
...+..-....+.+|+|..++.++......+ T Consensus 149 ~~~~~------~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e--~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n 220 (334) T protein:vir:80 149 KPAFH------DGILLPSTISGLAADAAADADVLVAAHRQGVEAMVF--RDLGDQLMSEGVTLLDPVIFSFLLEHDRLMN 220 (334) T ss_pred ccccc------CCcceeecccccccchhhhHHHHHHHHHHHHHHHHh--cCCCCCcCCceEEEeChHHHHHHhccccccc Confidence 00000 000000000000 0111222232222222222221 1112111124567899999988876422111 Q ss_pred -----C--CCceeec---cCCCceEEecCCCCCcc-----------EEEEeccceE--EEecce--------eeEeee-h Q lcl|Aclame:pro 286 -----A--NGVYVTA---LPFNLNVIESTVQEAGK-----------VLTYVKGLYD--GYLAGG--------INVQKF-K 333 (381) Q Consensus 286 -----~--~G~~~~~---l~~g~~vi~s~~~p~~~-----------i~~gd~s~y~--i~~r~~--------~~i~~~-~ 333 (381) + ++.+.+. ...|++|+.|+.+|... +.-|||+.-. +.-++. +..+.. + T Consensus 221 ~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~ 300 (334) T protein:vir:80 221 VEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEE 300 (334) T ss_pred ceeccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeec Confidence 1 1123221 12489999999999542 3456766632 222221 222221 2 Q ss_pred hhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 334 ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 334 ~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) +.+|.. 
.+.+++-++.++++|+|++++.|+++.+ T Consensus 301 ~~~~~d---~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 301 KKDFGH---YLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred hhhHHH---HHHHHHHcCCceeccceEEEEEEeeecC Confidence 233433 3344455789999999999999999886 No 125 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.69 E-value=1.6e-09 Score=68.79 Aligned_cols=287 Identities=12% Similarity=0.043 Sum_probs=146.0 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCC--CCc--eEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcce Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNY--KEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~--~gg--~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a 130 (381) ++..... .. -.+..+... .|. .|--+.+..++.+.....+.+++++++.++. |+ .++|+- +..++ T Consensus 1 ~~~~~~~----~~----~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~ 71 (345) T protein:vir:22 1 MASMTGG----QQ----MGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQA 71 (345) T ss_pred Ccccccc----hh----cccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEE Confidence 1111000 00 001111111 111 2334889999999999999999999988865 44 678875 33445 Q ss_pred EEecccccccc-cccccccc--eeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccC--- Q lcl|Aclame:pro 131 VWGKIYGEIKG-QLDAAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG--- 200 (381) Q Consensus 131 ~w~~e~~~~~~-~~~~~f~~--v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G--- 200 (381) .....+.++.. ..+++..+ ++++..++... .|..-=--++..|+.+.+.++.+.++++..|++++. +.. 
T Consensus 72 ~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~ 150 (345) T protein:vir:22 72 AYLAPGENLDDKRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVES 150 (345) T ss_pred EeeecCCCCCCCCCCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 54444444322 23466677 55554444433 233222234567899999999999999999987752 111 Q ss_pred --CCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHH Q lcl|Aclame:pro 201 --KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) Q Consensus 201 --~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~ 278 (381) ++.|.|+-+......+..+... +. .........+.+......+ +.+..+-. +.+.+++|..++.++ T Consensus 151 ~~~~~~~~~~~~~~~~~~~~g~~~---t~----~~~~~~~~~~ai~~a~~~L----de~~VP~~-~R~~vv~P~~y~~Ll 218 (345) T protein:vir:22 151 KYNENIEGLGTATVIETTQNKAAL---TD----QVALGKEIIAALTKARAAL----TKNYVPAA-DRVFYCDPDSYSAIL 218 (345) T ss_pred cccccccccccccccccccccccc---cc----cccCHHHHHHHHHHHHHHh----hhcCCCcc-CCEEEeChHHHHHHh Confidence 1233333221111111101000 00 0001112222222222222 22222222 356789999888775 Q ss_pred hhhhccC----CCCceeec---cCCCceEEecCCCCCccE--------------------------------EEEeccce Q lcl|Aclame:pro 279 AQYTHLN----ANGVYVTA---LPFNLNVIESTVQEAGKV--------------------------------LTYVKGLY 319 (381) Q Consensus 279 ~~~~~~~----~~G~~~~~---l~~g~~vi~s~~~p~~~i--------------------------------~~gd~s~y 319 (381) ......+ +++.+.+. ...|++|+.|+.+|...+ +|+..+.. 
T Consensus 219 ~~~~~~~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~ 298 (345) T protein:vir:22 219 AALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAV 298 (345) T ss_pred ccccccccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhhe Confidence 4321111 12222221 125899999998874211 11111111 Q ss_pred EEEecceeeEeee-hhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 320 DGYLAGGINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 320 ~i~~r~~~~i~~~-~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) ..+.-.+++++.. ++.+|.. .+++++-++.++++|+|++++.+|+. T Consensus 299 ~~v~~~~~~~e~~r~~~~~~d---~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 299 GTVKLRDLALERARRANFQAD---QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eeeeeecceeeeeechhHHHH---HHHHHHhcCCcccccceeEEEEEeeC Confidence 1222333444443 3344443 67888889999999999999999996 No 126 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.69 E-value=6.1e-09 Score=65.61 Aligned_cols=253 Identities=10% Similarity=0.020 Sum_probs=134.5 Q ss_pred CCCceEccHHHHHHHHHHHHhhhhhhhhceee----ecCC-ceEEEEecCCcceEEecccccccccccccccceecccee Q lcl|Aclame:pro 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 82 ~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~----~~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -.--.++|+.+..++++.++..+.+.++++.- ...| .+.||+....+.+....+++.+.. 
.+.+-+.+++...+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceEEEEEee Confidence 12234689999999999999888777776431 1223 578888766555555444443322 23344555555433 Q ss_pred e-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccC Q lcl|Aclame:pro 157 L-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFAN 235 (381) Q Consensus 157 l-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~ 235 (381) . +.-+.|+..-...+..+++++ .+..+++++.+.|..++. .+... ......+ +..+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~---------~~~~a--~~~~~~~-----------~~~~ 136 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVDN--GTALTGS-----------APTD 136 (273) T ss_pred eeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHH---------HHhcc--ccccccc-----------cccc Confidence 2 333456653344456678884 556789999999876642 11000 0000000 0011 Q ss_pred hhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh-ccC-----CCCceee---ccCCCceEEecCCC Q lcl|Aclame:pro 236 PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN-----ANGVYVT---ALPFNLNVIESTVQ 306 (381) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~-~~~-----~~G~~~~---~l~~g~~vi~s~~~ 306 (381) +....+.+......+. ....| ..+.+++++|..+..++.... ..+ ..+.+.. 
.-.+|++|+.|+.+ T Consensus 137 ~~~~~~~i~~a~~~ld--~~~vP---~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 137 ADDAFDLIAKALKELT--KANVP---NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred hhHHHHHHHHHHHHhh--hcCCC---cCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEeccc Confidence 1112222222222221 11112 235578999998887754211 111 1111211 11368999999999 Q ss_pred CCcc---EEEEeccceEEEecceeeEe--eehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 307 EAGK---VLTYVKGLYDGYLAGGINVQ--KFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 307 p~~~---i~~gd~s~y~i~~r~~~~i~--~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) |.+. +++|.-+......+ -..++ +.. .+| -..+++.++++.++++++++++ |+.++. T Consensus 212 p~~~~~~~~~~~~~A~~~a~q-~~~~e~~r~~-~~~---~~~v~~~~~yg~~v~~~~~~~~--l~~~g~ 273 (273) T protein:vir:10 212 RDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) T ss_pred ccCCccEEEEEeccceeeeee-eehhhcccCC-Ccc---eeeeeeeeeeeeeEeccceEEE--EeccCC Confidence 9643 45554443332221 11222 222 222 4579999999999999998776 555554 No 127 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.69 E-value=6.1e-09 Score=65.61 Aligned_cols=253 Identities=10% Similarity=0.020 Sum_probs=134.5 Q ss_pred CCCceEccHHHHHHHHHHHHhhhhhhhhceee----ecCC-ceEEEEecCCcceEEecccccccccccccccceecccee Q lcl|Aclame:pro 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 82 ~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~----~~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -.--.++|+.+..++++.++..+.+.++++.- ...| .+.||+....+.+....+++.+.. 
.+.+-+.+++...+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceEEEEEee Confidence 12234689999999999999888777776431 1223 578888766555555444443322 23344555555433 Q ss_pred e-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccC Q lcl|Aclame:pro 157 L-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFAN 235 (381) Q Consensus 157 l-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~ 235 (381) . +.-+.|+..-...+..+++++ .+..+++++.+.|..++. .+... ......+ +..+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~---------~~~~a--~~~~~~~-----------~~~~ 136 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVDN--GTALTGS-----------APTD 136 (273) T ss_pred eeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHH---------HHhcc--ccccccc-----------cccc Confidence 2 333456653344456678884 556789999999876642 11000 0000000 0011 Q ss_pred hhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh-ccC-----CCCceee---ccCCCceEEecCCC Q lcl|Aclame:pro 236 PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN-----ANGVYVT---ALPFNLNVIESTVQ 306 (381) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~-~~~-----~~G~~~~---~l~~g~~vi~s~~~ 306 (381) +....+.+......+. ....| ..+.+++++|..+..++.... ..+ ..+.+.. 
.-.+|++|+.|+.+ T Consensus 137 ~~~~~~~i~~a~~~ld--~~~vP---~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 137 ADDAFDLIAKALKELT--KANVP---NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred hhHHHHHHHHHHHHhh--hcCCC---cCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEeccc Confidence 1112222222222221 11112 235578999998887754211 111 1111211 11368999999999 Q ss_pred CCcc---EEEEeccceEEEecceeeEe--eehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 307 EAGK---VLTYVKGLYDGYLAGGINVQ--KFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 307 p~~~---i~~gd~s~y~i~~r~~~~i~--~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) |.+. +++|.-+......+ -..++ +.. .+| -..+++.++++.++++++++++ |+.++. T Consensus 212 p~~~~~~~~~~~~~A~~~a~q-~~~~e~~r~~-~~~---~~~v~~~~~yg~~v~~~~~~~~--l~~~g~ 273 (273) T protein:vir:10 212 RDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) T ss_pred ccCCccEEEEEeccceeeeee-eehhhcccCC-Ccc---eeeeeeeeeeeeeEeccceEEE--EeccCC Confidence 9643 45554443332221 11222 222 222 4579999999999999998776 555554 No 128 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.67 E-value=6.1e-09 Score=65.61 Aligned_cols=252 Identities=10% Similarity=0.007 Sum_probs=136.2 Q ss_pred CCCceEccHHHHHHHHHHHHhhhhhhhhceee----ecCC-ceEEEEecCCcceEEecccccccccccccccceecccee Q lcl|Aclame:pro 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 82 ~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~----~~~~-~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -.--.++|+.+..++++.++....+.++++.- ...| ++.+|+....+.+....++..+.. 
.+.+.+.+++...+ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCc-cccccceEEEEEee Confidence 11123689999999999999987777776432 2234 588998766555555555444332 34555666666655 Q ss_pred e-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee---ccCCCcceeeeeccccccccccccccccchhhhcc Q lcl|Aclame:pro 157 L-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK---GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLT 232 (381) Q Consensus 157 l-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~---G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t 232 (381) . +.-+.|+..-...+..+++++ .+..+.+++.+.|..++. +.++..+ .+. T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~vD~~i~~~~~~a~~~~~--------------~~~----------- 133 (273) T protein:vir:79 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALT--------------GSA----------- 133 (273) T ss_pred ecccceeeccHHHHhhcccHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccc--------------ccc----------- Confidence 3 344566664444556788875 466889999999876532 1111100 000 Q ss_pred ccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccC---CCCceee---ccCCCceEEec Q lcl|Aclame:pro 233 FANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLN---ANGVYVT---ALPFNLNVIES 303 (381) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~---~~G~~~~---~l~~g~~vi~s 303 (381) ..++....+.+..+...+. ....| ..+.+++++|..+..++.... ..+ .++.+.. 
.-.+|++|+.| T Consensus 134 ~~~~~~~~~~i~~a~~~ld--~~~vP---~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s 208 (273) T protein:vir:79 134 PSDADDAFDLIASALKELT--KANVP---NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES 208 (273) T ss_pred ccchhhHHHHHHHHHHHhh--hccCC---ccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEec Confidence 0011111122222222221 11112 234578999998887754311 111 1111111 11368999999 Q ss_pred CCCCCcc---EEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 304 TVQEAGK---VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 304 ~~~p~~~---i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) +.+|.+. ++.|.-+......+ -..++. ++.-..--+.+++.++++.++++++++++ |+.++. T Consensus 209 ~~lp~~~~~~~~a~~~~A~~~a~~-~~~~e~--~r~~~~~~~~v~~~~~yg~~v~~p~~vv~--~~~~g~ 273 (273) T protein:vir:79 209 NNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEA--LRDQDSFSDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) T ss_pred ccccccCceEEEEEeccceeeeee-hhhhhc--ccCcccceeeeeeeeeeeeEEecCceEEE--EeccCC Confidence 9999653 33343333332221 112221 11112225579999999999999998776 555554 No 129 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.61 E-value=7.1e-09 Score=65.25 Aligned_cols=291 Identities=14% Similarity=0.040 Sum_probs=143.2 Q ss_pred Hh-hhhhccccHHHHHHHHHHhcccC-CCCc-e-EccHHHHHHHHHHHHhhhhhhhhceeeec-CCc-eEEEEecCCcce Q lcl|Aclame:pro 57 SL-PKSAQSLSANQRSFFMDINKNVN-YKEE-K-LLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) Q Consensus 57 ~~-~~~~~~lt~~e~~~~~~~~~~~~-~~gg-~-lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~-~~~-~~ip~~~~~~~a 130 (381) ++ ...++.+.. +.+.+ +.|- + +--+.+..++.......+.+++++++.+. +|+ ..||+-.... 
+ T Consensus 1 ~~~~~~~~~~~t---------~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t-~ 70 (347) T protein:vir:33 1 MANIQGGQQIGT---------NQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTK-A 70 (347) T ss_pred CCCCccCccccc---------ccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecccee-e Confidence 11 111111100 00111 1111 1 22388999999999999999999887764 454 6777754433 3 Q ss_pred EEecccccccc-cccccccceecc--ceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee-----eccCCC Q lcl|Aclame:pro 131 VWGKIYGEIKG-QLDAAFSEETAI--QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL-----KGTGKD 202 (381) Q Consensus 131 ~w~~e~~~~~~-~~~~~f~~v~l~--~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l-----~G~G~~ 202 (381) .....+.++.. ..+++..+.+|. ..++.. ..|..-=-.++..|+.+.+.++.+.++++..|+.++ .+.... T Consensus 71 ~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~-~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~ 149 (347) T protein:vir:33 71 AYLKPGENLDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPD 149 (347) T ss_pred eeecCCCCCCCCCCCCccceEEEEechhhhhh-HHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 33333333322 223455564444 333332 233332233356789999999999999999999885 222222 Q ss_pred cceeeeeccccccccccccccccchhhhccc--cChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh Q lcl|Aclame:pro 203 QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF--ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 203 qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~ 280 (381) .|.+....... ... ........++... .++....+.+......|. ....+- .+.+.+++|..++.+++. 
T Consensus 150 ~~~~~~~~~~~-~~~---~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Ld----e~~VP~-~gR~~vv~P~~y~~Ll~~ 220 (347) T protein:vir:33 150 GSNENIEGLGK-PTV---LTLVKPTTGSLTDPVELGKAIIAQLTIARASLT----KNYVPA-ADRTFYTTPDNYSAILAA 220 (347) T ss_pred ccccccccccc-ccc---ccccccccccccchhhhHHHHHHHHHHHHHHHh----hcCCCc-cCcEEEeCHHHHHHHhcc Confidence 22221110000 000 0000000010000 112222222222222221 112222 245678999888877653 Q ss_pred hhc--cC--CCCceeec---cCCCceEEecCCCCCccE----------------------EEEeccce--E--------E Q lcl|Aclame:pro 281 YTH--LN--ANGVYVTA---LPFNLNVIESTVQEAGKV----------------------LTYVKGLY--D--------G 321 (381) Q Consensus 281 ~~~--~~--~~G~~~~~---l~~g~~vi~s~~~p~~~i----------------------~~gd~s~y--~--------i 321 (381) ... .+ +++.+.+. ...|++|+.|+.+|...+ .-++|+.. . . T Consensus 221 ~~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~ 300 (347) T protein:vir:33 221 LMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGT 300 (347) T ss_pred ccccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhhee Confidence 211 11 12222221 126999999999986432 11222111 1 1 Q ss_pred EecceeeEeeeh-hhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 322 YLAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 322 ~~r~~~~i~~~~-~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) ..-.++++++.. 
+.+| ...+++.+.++.++++|++.+.+.|+-..+ T Consensus 301 v~~~~~~~e~~r~~~~~---~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 301 VKLKDLALERARRANYQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeeeceeeeeccchhhh---hHhhhhhhhcCCceecccceEEEecCCCCC Confidence 122233444322 2233 346888889999999999999988887775 No 130 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.60 E-value=4.1e-09 Score=66.58 Aligned_cols=287 Identities=13% Similarity=0.046 Sum_probs=146.1 Q ss_pred Hhh-hhhccccHHHHHHHHHHhcccC-CCCc--eEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcce Q lcl|Aclame:pro 57 SLP-KSAQSLSANQRSFFMDINKNVN-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) Q Consensus 57 ~~~-~~~~~lt~~e~~~~~~~~~~~~-~~gg--~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a 130 (381) ++. .+++.+. .+.+.+ ++|. .|--+.+..++.+.+...+.+++++++.++. |+ ..+|+-... .+ T Consensus 1 ma~~~~~~~~~---------t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~-~~ 70 (347) T protein:vir:94 1 MANMNGGQQMG---------KDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRT-KA 70 (347) T ss_pred CCccccccccc---------cccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccce-eE Confidence 111 1111110 001111 2222 1234899999999999999999999887754 44 678865443 34 Q ss_pred EEeccccccccc-ccccccceeccceee-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccC---- Q lcl|Aclame:pro 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG---- 200 (381) Q Consensus 131 ~w~~e~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G---- 200 (381) .....+.++... .+++.++.+|....+ +.-..|..-=--++..|+.+.+.++.+.++++..|++++. +.. 
T Consensus 71 ~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:94 71 AYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTA 150 (347) T ss_pred eeeecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 444443333222 356677777655554 3344444333334567899999999999999999988852 211 Q ss_pred CCc-ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 201 KDQ-PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~q-P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) +.+ |.|.... .....+ +..........++...+..+..+...|. ....| ..+.+.+++|..++.+++ T Consensus 151 ~~~~~~g~~~~----~~v~i~---~~~~~~~~~~~~~~~~~d~i~~a~~~Ld--e~dVP---~~~R~~vv~P~~y~~LLk 218 (347) T protein:vir:94 151 NNENIAGLGKA----HVLEVG---DQATLQGDQVKLGQAIIAQLTLARAKLT--GNYVP---SSDRVFYTTPDNYSAILA 218 (347) T ss_pred cccccccCCcc----eeEeee---ccccccccccccHHHHHHHHHHHHHHhh--hcCCC---CCCCEEEeChHHHHHHHH Confidence 111 1111000 000000 0000000011223333333333322221 11122 224456778999888876 Q ss_pred hhhccCCCC----ceee---ccCCCceEEecCCCCCccE-------------------------EEEeccc--eEEEe-- Q lcl|Aclame:pro 280 QYTHLNANG----VYVT---ALPFNLNVIESTVQEAGKV-------------------------LTYVKGL--YDGYL-- 323 (381) Q Consensus 280 ~~~~~~~~G----~~~~---~l~~g~~vi~s~~~p~~~i-------------------------~~gd~s~--y~i~~-- 323 (381) .......+. .+.+ ....|++|+.|+++|...+ +=+||+. 
..++- T Consensus 219 ~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~ 298 (347) T protein:vir:94 219 ALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRS 298 (347) T ss_pred hhcccccccccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechh Confidence 432211111 1111 1225889999999995321 1123333 12222 Q ss_pred ------cceeeEeeeh-hhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 324 ------AGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 324 ------r~~~~i~~~~-~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) -.++.++... +.++.. .+.+++=++-.+.+|++.+++.++-+ T Consensus 299 A~~tv~~~~~~~e~~~~~~~~~~---~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 299 AVGTVKLKDMALERARRANFQAD---QIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhcccceeeeechhhhhh---hhhhhhhhcCcccccceeEEEEecCC Confidence 2244444432 233333 56777778999999999999888876 No 131 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.57 E-value=4.4e-09 Score=66.42 Aligned_cols=287 Identities=13% Similarity=0.042 Sum_probs=140.9 Q ss_pred Hhhh-hhccccHHHHHHHHHHhcccCCCC-c---eEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcc Q lcl|Aclame:pro 57 SLPK-SAQSLSANQRSFFMDINKNVNYKE-E---KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGV 129 (381) Q Consensus 57 ~~~~-~~~~lt~~e~~~~~~~~~~~~~~g-g---~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~ 129 (381) ++.. .++.... ..+...+| | .|--+.+..++.+.....+.+++++++.++. |+ .++|+-.. .. 
T Consensus 1 ma~~~~~~~~n~---------~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~-~~ 70 (344) T protein:vir:10 1 MANMTGGQQLGT---------NQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGR-TQ 70 (344) T ss_pred CccccccccCCc---------ccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeece-eE Confidence 1100 0000000 00000000 0 1223889999999999999999999988865 43 67887533 33 Q ss_pred eEEeccccccccc-ccccccceeccceee-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccCCC- Q lcl|Aclame:pro 130 AVWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKD- 202 (381) Q Consensus 130 a~w~~e~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G~~- 202 (381) +.....+.++... .+++=++++|...++ +.-..|..-=--++..|+.+.+.++.+.++++..|++++. +.... T Consensus 71 ~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~ 150 (344) T protein:vir:10 71 AAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVES 150 (344) T ss_pred EEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 4444444444322 234445544443332 2233333322234667899999999999999999987742 22211 Q ss_pred ----cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHH Q lcl|Aclame:pro 203 ----QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) Q Consensus 203 ----qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~ 278 (381) .|.|.-.......+. .+. +..++......+.+.+.......+....+. 
.+.+.+++|..++.++ T Consensus 151 ~~~~~~~g~~~~~~~~~~~----------~~~-~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll 218 (344) T protein:vir:10 151 QYNENITGLGTATVIETTQ----------DKT-TLTDQVALGKEIIAALTKARAALTKNYVPS-SDRVFYCDPDSYSAIL 218 (344) T ss_pred ccccccccccccceeeccc----------ccc-cccchhhhHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeChHHHHHHh Confidence 222221110000000 000 011111111112221111111112222222 2356688999888765 Q ss_pred hhhhcc----CCCCceeec---cCCCceEEecCCCCCccE---------------------EEEeccceE---------- Q lcl|Aclame:pro 279 AQYTHL----NANGVYVTA---LPFNLNVIESTVQEAGKV---------------------LTYVKGLYD---------- 320 (381) Q Consensus 279 ~~~~~~----~~~G~~~~~---l~~g~~vi~s~~~p~~~i---------------------~~gd~s~y~---------- 320 (381) ...... .+++.+.+. ...|++|+.|+.+|.+.+ ..++|+.-. T Consensus 219 ~~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~ 298 (344) T protein:vir:10 219 AALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVG 298 (344) T ss_pred hcccccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhh Confidence 432111 122223222 125899999999985321 112333311 Q ss_pred EEecceeeEeee-hhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 321 GYLAGGINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 321 i~~r~~~~i~~~-~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) .+.-.+++++.. ++.+|.. .+++++-++.++++|++.+++.|+-. 
T Consensus 299 ~v~~~~~~~e~~r~~~~~~d---~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 299 TVKLRDLALERARRANFQAD---QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hhhhccceeecccchhHHHH---HHHHHhhcccceecccceEEEEeecC Confidence 112233444442 3445543 67788889999999999988877765 No 132 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.42 E-value=5.3e-08 Score=60.47 Aligned_cols=238 Identities=14% Similarity=0.192 Sum_probs=136.7 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-C-ceEEEEecCCcceEEec Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~-~~~ip~~~~~~~a~w~~ 134 (381) +...+...+|-.|- . ..+-|......|||.+.+.++|+..+.+.... + .....+.++.+++.|.. T Consensus 1 m~~~~~~~~TL~e~------A-------kr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~ 67 (328) T protein:vir:95 1 MAVKGLTALTLADW------G-------KRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRL 67 (328) T ss_pred CCccccccccHHHH------H-------hhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeee Confidence 11111112222220 0 11234567789999999999999999998873 3 36788889999999999 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHH---HHHHHHHHHHHhhheeeccCCCcceee---e Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVR---VQIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~---~~la~a~a~~~d~a~l~G~G~~qP~Gi---l 208 (381) .+...+ ++.+++.+++-..+-+.+.+.|.+.+.+... +..+|-. 
....++++..+...|++|+.+..|.++ - T Consensus 68 lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDr~la~~~G-n~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~ 145 (328) T protein:vir:95 68 LNYGVQ-PSKSTTVQVTDSVGMLETYAEVDKSLADLNG-NTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLS 145 (328) T ss_pred cCCccC-cccceeEEEEEEEEEEecceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchh Confidence 877765 5688999999999999999999999998764 5555543 457899999999999999987666544 2 Q ss_pred eccccccc-------ccccc--------------------ccc-------cchhhhccccC------------------- Q lcl|Aclame:pro 209 RQVQKGVS-------VTEGA--------------------YPE-------KEEQGTLTFAN------------------- 235 (381) Q Consensus 209 ~~~~~~~~-------~~~~~--------------------~~~-------~~~~~~~t~~~------------------- 235 (381) +....... -.+++ +|. ....+..+..+ T Consensus 146 ~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl 225 (328) T protein:vir:95 146 SRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGL 225 (328) T ss_pred hhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeee Confidence 22211000 00000 110 00001000000 Q ss_pred --------------------hhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC--------CC Q lcl|Aclame:pro 236 --------------------PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN--------AN 287 (381) Q Consensus 236 --------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~--------~~ 287 (381) .......+.+ .+.......|...+++.+|.||+.-...++++...++ -. 
T Consensus 226 ~i~d~r~vvrI~NId~~~l~~~~~~~~l~~---lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~ 302 (328) T protein:vir:95 226 ALRDWRYVVRIANIDVSNLSEPSSAANIAK---LMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETE 302 (328) T ss_pred EEcCcccEEEEecCcccccccccChhhHHH---HHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccC Confidence 0000111111 1111111123445677889999876666655433221 23 Q ss_pred CceeeccCCCceEEecCCCCCcc-EEE Q lcl|Aclame:pro 288 GVYVTALPFNLNVIESTVQEAGK-VLT 313 (381) Q Consensus 288 G~~~~~l~~g~~vi~s~~~p~~~-i~~ 313 (381) |..++.. +|+||..++++-... .+. T Consensus 303 g~~~t~~-~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 303 GEWWTSF-RGVPIRETDALLETEARVV 328 (328) T ss_pred CcceeEE-CCeEEEEEeeeecCccccC Confidence 4444333 578877777654221 111 No 133 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=98.42 E-value=3.8e-08 Score=61.29 Aligned_cols=330 Identities=12% Similarity=0.093 Sum_probs=151.2 Q ss_pred CCccHHHHHHHHHHHH---HHHHh-hhhHHHHHHHHHHHHHHHHHHHHHHHHH-----------------HHHHHH---H Q lcl|Aclame:pro 1 MTINLSETFANAKNEF---INAVN-NGEPQERQNELYGDMINQLFEETKLQAK-----------------AEAERV---S 56 (381) Q Consensus 1 m~~~l~~~~~e~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~---~ 56 (381) -+-=-..++...+++. ..+.+ ..+..+.+.+. ..+.+.+..+.+.... -+++.+ . 
T Consensus 29 ~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~-~~~~~~~~~E~Rs~~~~i~~~~~~~r~~p~~~~veyRSaGE~l 107 (410) T protein:vir:83 29 IYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQ-AQEVNRIAFETRSKGQAVDAAISAMRGSPVGTEVEYRSAGEYM 107 (410) T ss_pred cccccccccccchhhhccccccccCcccchhhhhHH-HHHHHHHHHHHHHHHHHHHhhhccCcCCCCCCCcccccHHHHH Confidence 0000000111111110 00000 00000111110 0011111111111000 011110 0 Q ss_pred HhhhhhccccHHHHH---H-HHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcceE Q lcl|Aclame:pro 57 SLPKSAQSLSANQRS---F-FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAV 131 (381) Q Consensus 57 ~~~~~~~~lt~~e~~---~-~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a~ 131 (381) .+.-....-.+.=.+ + .++...+.+.+--..||+++....++.+.+..++.++....|..| ++++|+.+...+.. T Consensus 108 kal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~ 187 (410) T protein:vir:83 108 LDMWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVG 187 (410) T ss_pred HHHhccCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeeccccccc Confidence 000000000111111 1 122223333333446888899999999999999999877778766 46777765544321 Q ss_pred -Eeccccccccc-----ccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhh---eeeccCCC Q lcl|Aclame:pro 132 -WGKIYGEIKGQ-----LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA---FLKGTGKD 202 (381) Q Consensus 132 -w~~e~~~~~~~-----~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a---~l~G~G~~ 202 (381) .+.+++...+- ...+|+..+-..+.++++..+|++-++-|..+.-+...+.+..+.+++-+.+ +|.++= T Consensus 188 ~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~-- 265 (410) T protein:vir:83 188 LQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTS-- 265 (410) T ss_pred ccccccccccccccccccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh-- Confidence 22222211122 2444555555788999999999999999999999999999988877766543 343321 Q ss_pred 
cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh Q lcl|Aclame:pro 203 QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) Q Consensus 203 qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~ 282 (381) ++.. .... ..++.++...-+..... ++.++. .+-..+.++|..+-..-+..- T Consensus 266 ------t~~~--------------a~~~---~Tad~~~~~i~da~~~v--~da~~~---~~~~~i~vS~DVl~~~~~~f~ 317 (410) T protein:vir:83 266 ------TGAV--------------GYGN---ATADNVASAIWQAAGAV--YTAVKG---MGRLVIAIAPDVLGDFGPLFA 317 (410) T ss_pred ------hhhh--------------hhhh---ccHHHHHHHHHHHHHHH--hhhhcc---ceeeeEEechhhhhhccceee Confidence 0000 0001 11222222221111100 000000 001123456654322221111 Q ss_pred cc-----CCCC-------ceeeccCCCceEEecCCCCCccEEEEeccceEEEecce--eeEeeehhhhhhcCceEEEEEE Q lcl|Aclame:pro 283 HL-----NANG-------VYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGG--INVQKFKETLALDDMDLYTAKQ 348 (381) Q Consensus 283 ~~-----~~~G-------~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~--~~i~~~~~~~~~~d~~~~~~~~ 348 (381) .. +..| .-+.....++||+..+..+++++.|.|......+...+ +.+.-.+-+..+++-.+|.++ T Consensus 318 ~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~ySgY~a~- 396 (410) T protein:vir:83 318 PVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTAAIECFEQRVGTLQVVEPSVFGLQVAYAGYFST- 396 (410) T ss_pred ccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccceeeeeecCCceeEeeCCchhhhhhhheeeeee- Confidence 11 1112 22333446889999999999999999998877666554 777665555555555555433 Q ss_pred EEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 349 FAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 349 r~dgk~~~~~Af~v~~l~~~~~ 370 (381) .++++.+.+-+ ++. 
T Consensus 397 ----a~~~~~gliPv----~g~ 410 (410) T protein:vir:83 397 ----LVVNEDAIVPL----VGS 410 (410) T ss_pred ----ccccccceeee----ccC Confidence 34555554432 222 No 134 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.41 E-value=4.2e-08 Score=61.00 Aligned_cols=293 Identities=11% Similarity=0.037 Sum_probs=140.5 Q ss_pred Hhhh-hhccccHHHHHHHHHHhcccCCCCc--eEccHHHHHHHHHHHHhhhhhhhhceeeec-CCc-eEEEEecCCcceE Q lcl|Aclame:pro 57 SLPK-SAQSLSANQRSFFMDINKNVNYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVAV 131 (381) Q Consensus 57 ~~~~-~~~~lt~~e~~~~~~~~~~~~~~gg--~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~-~~~-~~ip~~~~~~~a~ 131 (381) |+.- +++.+.. ....+ ++.+- .+-=+.+..++.......+.+++++++.+. +|+ ..||+.... ++. T Consensus 1 ma~~~~~~~~~t-------~~~~~-~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~-t~~ 71 (347) T protein:vir:15 1 MANIQGGQQIGT-------NQGKG-QSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRT-KAA 71 (347) T ss_pred CCccccCCcccc-------ccccC-CCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccce-eee Confidence 1110 1111000 00000 11111 122377888999999998989999888775 454 678875443 343 Q ss_pred Eecccccccc-cccccccceecc--ceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee---ccCCCcc- Q lcl|Aclame:pro 132 WGKIYGEIKG-QLDAAFSEETAI--QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK---GTGKDQP- 204 (381) Q Consensus 132 w~~e~~~~~~-~~~~~f~~v~l~--~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~---G~G~~qP- 204 (381) ....+.++.. ..+.+..+.+|. ..++.. ..|.+-=-.++..|+.+.+.++.+.++++..|+.++. 
+-....| T Consensus 72 ~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~-~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~ 150 (347) T protein:vir:15 72 YLKPGENLDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDA 150 (347) T ss_pred eeccCCCCCCCCCCCccceEEEEechhhhhh-HHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 3333333322 223455664444 444433 2333222234566899999999999999999988862 1101111 Q ss_pred --eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh Q lcl|Aclame:pro 205 --IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) Q Consensus 205 --~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~ 282 (381) .+.- .+ +..... ......+....++......+.+.+.......+....+- .+.+.+++|..++.+++... T Consensus 151 ~~~~~~-~~-g~~~~~-----~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~-~gR~~vv~P~~y~~LL~~~~ 222 (347) T protein:vir:15 151 SNENIE-GL-GKPTVL-----TLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPA-ADRTFYTTPDNYSAILAALM 222 (347) T ss_pred cccccc-cc-Cccccc-----cccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHhcccc Confidence 0000 00 000000 00000000111222222222222211111111222222 24567889998888765432 Q ss_pred ccC----CCCceeec---cCCCceEEecCCCCCccE----------------------EEEeccce--------E--EEe Q lcl|Aclame:pro 283 HLN----ANGVYVTA---LPFNLNVIESTVQEAGKV----------------------LTYVKGLY--------D--GYL 323 (381) Q Consensus 283 ~~~----~~G~~~~~---l~~g~~vi~s~~~p~~~i----------------------~~gd~s~y--------~--i~~ 323 (381) ..+ +++.+.+. ...|++|+.|+.+|...+ .-++|+.. . .+. 
T Consensus 223 ~~~~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~ 302 (347) T protein:vir:15 223 PNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVK 302 (347) T ss_pred cccccccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeE Confidence 211 11122111 236999999999984321 11122111 1 112 Q ss_pred cceeeEeeeh-hhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 324 AGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 324 r~~~~i~~~~-~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) -++++++++. +.+| ...+++.+.++.++++|++.+.+.|+-..+ T Consensus 303 ~~~~~~e~~~~~~~~---~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 303 LKDLALERARRANYQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeceeeeecccchhh---hhhhehhhhcCCceeccccEEEEecCCCCC Confidence 2333444332 2223 356888889999999999999988877775 No 135 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.36 E-value=2.5e-08 Score=62.27 Aligned_cols=275 Identities=11% Similarity=0.033 Sum_probs=139.2 Q ss_pred HHHHHHHhc-------ccCCCCc---eEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceEEecccc Q lcl|Aclame:pro 70 RSFFMDINK-------NVNYKEE---KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIYG 137 (381) Q Consensus 70 ~~~~~~~~~-------~~~~~gg---~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~w~~e~~ 137 (381) ..+++.|.. ..+++|. .+.=+.+..++.+.....+.++++.++.+.. |+ +.||+..... +.....+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~-~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS-AGYHTPGT 79 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEecccee-EeeecCCC Confidence 222333321 1122232 1334889999999999999999998887754 43 7788764433 33323223 Q ss_pred cccccccccccceeccce--eeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccCCCcceeeeecc Q lcl|Aclame:pro 138 EIKGQLDAAFSEETAIQN--KLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQPIGLNRQV 211 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~--kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G~~qP~Gil~~~ 211 (381) ++....+++=.+++|... ++.. ..|..==-.++..|+.+.+.++.+.++++..|+.++. +...+-|.+.... T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~-~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g- 157 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG- 157 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhH-HHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccc- Confidence 332222233234444433 3333 3332222223556899999999999999999987752 2211111111000 Q ss_pred ccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc-------c Q lcl|Aclame:pro 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-------L 284 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~-------~ 284 (381) .+.. ..... ...++....+.+.++...|. ....+ ..+.+.+++|..++.+++..+. . 
T Consensus 158 ---~~~~-----~~~~~---~~~~~~~~~~~i~~a~~~Ld----e~~VP-~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~ 221 (332) T protein:vir:78 158 ---GFHV-----NIGAG---NTNDAQAIVDGFFEAAAVLD----ERSAP-QEGRVAVLSPRQYYSLISSVDTNILNREIG 221 (332) T ss_pred ---cccc-----ccCCc---cccCHHHHHHHHHHHHHHHh----hcCCC-ccCCEEEeCHHHHHHHHhhcCceeeeeecc Confidence 0000 00000 01123333333333333332 12222 2334567899988888763321 1 Q ss_pred CCCCceee----ccCCCceEEecCCCCCcc--------------EEEEeccceE--EEec--------ceeeEeee---- Q lcl|Aclame:pro 285 NANGVYVT----ALPFNLNVIESTVQEAGK--------------VLTYVKGLYD--GYLA--------GGINVQKF---- 332 (381) Q Consensus 285 ~~~G~~~~----~l~~g~~vi~s~~~p~~~--------------i~~gd~s~y~--i~~r--------~~~~i~~~---- 332 (381) +.+|.... ....|++|+.|+.+|... .+-|||+... ++-+ .++.+++. T Consensus 222 ~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~ 301 (332) T protein:vir:78 222 NSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF 301 (332) T ss_pred ccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhccc Confidence 23332222 122589999999999432 1334554422 2212 22333221 Q ss_pred hhhhhhcCceEEEEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 333 KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 333 ~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~ 366 (381) ++.+| ...+++.+.++.++++|++.+++.=- T Consensus 302 ~~~~~---~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 302 NVQYQ---GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred chhhh---HhhhhhhhhhcCceecccceEEEeeC Confidence 22333 34788888999999999998875221 No 136 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.34 E-value=2.6e-08 Score=62.17 Aligned_cols=295 Identities=9% Similarity=0.017 Sum_probs=149.7 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-c-eEEEEecCCcceEEeccc Q lcl|Aclame:pro 59 
PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-R-LKFLKSETSGVAVWGKIY 136 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~-~~ip~~~~~~~a~w~~e~ 136 (381) ....+.++. ...+...+--.|.-+.+..++.......+.++++..++++.+ + .++|+- +...+....-+ T Consensus 1 Ms~~n~~t~--------~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG 71 (401) T protein:vir:70 1 MSTPNNLTN--------VAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPG 71 (401) T ss_pred CCCCccccc--------cccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCC Confidence 000000000 000000111134567888899999999999999999988754 3 678876 33344444433 Q ss_pred ccccccccccccceeccceee-eeehhhhHHHHhcChhH-HHHHHHHHHHHHHHHHHhhheee-----ccCC-----Ccc Q lcl|Aclame:pro 137 GEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK-----GTGK-----DQP 204 (381) Q Consensus 137 ~~~~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~-l~~~i~~~la~a~a~~~d~a~l~-----G~G~-----~qP 204 (381) .++.. +.+..++..|....+ ++...|..=---++.+| +.+.+.+++++++++..|+.++. |-.+ ..| T Consensus 72 ~~ld~-~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p 150 (401) T protein:vir:70 72 QSPAA-TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNP 150 (401) T ss_pred CCcCC-CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCC Confidence 33322 355666655554443 23333322222234556 67889999999999999986621 2110 112 Q ss_pred eeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh--hh Q lcl|Aclame:pro 205 IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ--YT 282 (381) Q Consensus 205 ~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~--~~ 282 (381) .|.- +|...........+..++..+...+.+....+ +.+..+. +..+.++.|.-|..++.. +. 
T Consensus 151 ~~~~----------~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~L----dEkdVP~-~r~vvl~pp~~Ys~Ll~~d~L~ 215 (401) T protein:vir:70 151 RVKG----------HGFSINVEVAEGEALVNPQYVMAAVEFALEQQ----LEQEVDI-SDVAILMPWRYFNVLRDADRIV 215 (401) T ss_pred CcCC----------CceEEeccccccccccCHHHHHHHHHHHHHHH----HhcCCCc-cceEEEcCHHHHHHHHhcCccc Confidence 2111 00000011111112233333433333333222 1222222 245555555554444432 11 Q ss_pred cc----CCCCceeec---cCCCceEEecCCCCCcc---------------EE--EEeccce--EEEecceee-E------ Q lcl|Aclame:pro 283 HL----NANGVYVTA---LPFNLNVIESTVQEAGK---------------VL--TYVKGLY--DGYLAGGIN-V------ 329 (381) Q Consensus 283 ~~----~~~G~~~~~---l~~g~~vi~s~~~p~~~---------------i~--~gd~s~y--~i~~r~~~~-i------ 329 (381) .. .++|.|++. ...|+||+.|+++|.+. .+ -|||+.- +++.++.+- + T Consensus 216 nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt 295 (401) T protein:vir:70 216 DKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVT 295 (401) T ss_pred chhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccc Confidence 11 234556543 23699999999999632 11 1555542 233333221 1 Q ss_pred -ee-ehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC---CCC Q lcl|Aclame:pro 330 -QK-FKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE---ETL 381 (381) Q Consensus 330 -~~-~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~---~~~ 381 (381) +. .+...|..- +.+++=++-.+.+++|.+|++.+.+..++.++|++ -|. 
T Consensus 296 ~~~~~d~r~~~~~---id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~~~~~~ 349 (401) T protein:vir:70 296 GDIFYEKKEKTYY---IDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTDGAQHTI 349 (401) T ss_pred cchhhhhhhhHHH---HHHHHHhCCcccchhheEEEeecCcccccccccCCcchhhh Confidence 11 122233322 23455567889999999999999999999999887 333 No 137 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.32 E-value=4.8e-08 Score=60.73 Aligned_cols=287 Identities=13% Similarity=0.040 Sum_probs=145.8 Q ss_pred Hhh-hhhccccHHHHHHHHHHhcccC-CCCc--eEccHHHHHHHHHHHHhhhhhhhhceeeec-CCc-eEEEEecCCcce Q lcl|Aclame:pro 57 SLP-KSAQSLSANQRSFFMDINKNVN-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) Q Consensus 57 ~~~-~~~~~lt~~e~~~~~~~~~~~~-~~gg--~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~-~~~-~~ip~~~~~~~a 130 (381) ++. .+++.+ ..+-+.+ +++. .|-=+.+..++.......+.+++.+++.+. +|+ ..+|+-.... + T Consensus 1 ~a~~~~~~~~---------~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~-~ 70 (347) T protein:vir:88 1 MANATGGQQI---------GANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK-G 70 (347) T ss_pred CCCcccchhh---------hccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeeccee-e Confidence 111 111110 0111222 1222 233488999999999988999999888775 454 6788654433 3 Q ss_pred EEecccccccc-cccccccceeccceee-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccCCC-- Q lcl|Aclame:pro 131 VWGKIYGEIKG-QLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKD-- 202 (381) Q Consensus 131 ~w~~e~~~~~~-~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G~~-- 202 (381) .....+.++.. ..++..++++|...++ +.-..|..-=--.+..|+.+.+.++.++++++..|++++. +.... 
T Consensus 71 ~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:88 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) T ss_pred eeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 33333333222 1345667777766555 3444454444444567888999999999999999998752 21110 Q ss_pred ---cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 203 ---QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 203 ---qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) -+.|+-+. .....++..+.. ..........+.+..+...+ .....| - .+.+++++|..++.++. T Consensus 151 ~~~~~~g~~~~----~~~~~~~~~~~~----~~~~~~~~~~~~i~~a~~~L--de~~VP--~-~gR~~vv~P~~y~~Ll~ 217 (347) T protein:vir:88 151 SNENIAGLGQA----VVLNIGAAADLV----DVEARGKAILKGLTLARARL--TKNYVP--A-GDRRFYCAPEDYSAILS 217 (347) T ss_pred cccccCCcccc----cccccccccccc----chhhhHHHHHHHHHHHHHHH--hhcCCC--C-CCCEEEeCHHHHHHHhc Confidence 11222111 000000000000 00011112222222222222 112222 2 24567899988877764 Q ss_pred hhhc----cCCCCceeec---cCCCceEEecCCCCCccE-------------------------EEEeccc--eEEEec- Q lcl|Aclame:pro 280 QYTH----LNANGVYVTA---LPFNLNVIESTVQEAGKV-------------------------LTYVKGL--YDGYLA- 324 (381) Q Consensus 280 ~~~~----~~~~G~~~~~---l~~g~~vi~s~~~p~~~i-------------------------~~gd~s~--y~i~~r- 324 (381) .... .++.+.+.+. -..|++|+.|+++|.+.. 
+.+||++ ..++-+ T Consensus 218 ~~~~~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~ 297 (347) T protein:vir:88 218 ALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRS 297 (347) T ss_pred chhhhhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechh Confidence 2211 1122222221 125889999999984211 2234444 122211 Q ss_pred -------ceeeEeee-hhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 325 -------GGINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 325 -------~~~~i~~~-~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~ 369 (381) .++.++.. ++.+|.. .+++++.++.++++|++.+++.++.++ T Consensus 298 a~g~v~~~d~~~e~~r~~~~~~d---~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 298 AVGTVKLKDMALERARRPEFQAD---QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhheecccceeeeeechhhHHH---HhhhhhhhcCceeccceEEEEEeCCCC Confidence 22233332 2233433 788999999999999999998887766 No 138 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.29 E-value=2.3e-07 Score=57.02 Aligned_cols=293 Identities=10% Similarity=-0.023 Sum_probs=135.2 Q ss_pred HhhhhhccccHHHHHHHHHHhcccC-CCCceEccHHHHHHHHHHHHhhhhhhhhceeeec---CC-ceEEEEecCCcceE Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVN-YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA---GL-RLKFLKSETSGVAV 131 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~---~~-~~~ip~~~~~~~a~ 131 (381) ++.-++.. ..+..+-. ++--.+||+.+..+|.+.+.....+.++++.... .| .++||+.. .+.+. 
T Consensus 1 ~~~~~~~~---------~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~ 70 (381) T protein:vir:80 1 MATIQGTG---------GYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVY 70 (381) T ss_pred Cceecccc---------cccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceee Confidence 11100000 00000000 0112367999999999999888777777654332 34 47788864 34566 Q ss_pred Eecccccccccccccccceeccceee-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeecc--CCCcceee- Q lcl|Aclame:pro 132 WGKIYGEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT--GKDQPIGL- 207 (381) Q Consensus 132 w~~e~~~~~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~--G~~qP~Gi- 207 (381) ...++..+.. .+.+.+++++...++ +.-..|+..-...+..|+.+.+.+.++.++++..|+.++.-- ....+.+. T Consensus 71 d~~~g~~i~~-~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~ 149 (381) T protein:vir:80 71 DKQPQTPVNL-QARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRI 149 (381) T ss_pred eecCCCcccc-cccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 6666555433 244555666655333 344577776566677899999999999999999999886321 11111111 Q ss_pred eeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC-- Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-- 285 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~-- 285 (381) .+. .....++. .....+..........+..+...+.. ...| . 
.+.+++++|..+..++......+ T Consensus 150 ~t~---~~~i~~~~-----~~~~~t~~~~~~t~~~i~~a~~~Lde--~~VP--~-egR~lvv~P~~~~~Ll~~~~~~~ad 216 (381) T protein:vir:80 150 YSY---DTTLGDGT-----VNAHLTGTPAPLTYAALLLAKQKLDE--ADVP--Q-EGRIVMVSPAQYIDLLSINQFISVD 216 (381) T ss_pred ccc---cccccccc-----cccccccchhhHHHHHHHHHHHHHhh--cCCC--c-CCcEEEeCHHHHHHHhhchhhhhhh Confidence 000 00000000 00011111122223333333333311 1122 2 34578899998888765321111 Q ss_pred -------CCCceeeccCCCceEEecCCCCCccEE-----EEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCE Q lcl|Aclame:pro 286 -------ANGVYVTALPFNLNVIESTVQEAGKVL-----TYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 286 -------~~G~~~~~l~~g~~vi~s~~~p~~~i~-----~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk 353 (381) .+|... ..+|++|+.|+.+|.+.+. +|-... ....+.-.. ..-.|..+..+.+.....|.+ T Consensus 217 ~~~~~~l~~G~Ig--~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~----~~~~~~~~~-~~g~~s~~a~av~~~k~yd~~ 289 (381) T protein:vir:80 217 FSQVKPVTSGVVG--TILGMEVIVTTQIGINSLTGYVNGQGAPTQ----PTPGVLGSP-YLPDQAGTANVVNTGSASDLA 289 (381) T ss_pred hccchhhhceeee--EEcceEEEeecccccccccceeeecccccc----ccccccccc-cccccccceeeeeeeeeecee Confidence 123222 2369999999999965321 111000 000000000 011122233344444445555 Q ss_pred EecCcceEEEEEEecccccCCCCCC-CCC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLKGHKPALEGTE-ETL 381 (381) Q Consensus 354 ~~~~~Af~v~~l~~~~~~~~~~~~~-~~~ 381 (381) +.. +-+.+-+.+-+..+++...++ .|. 
T Consensus 290 ~~~-~~~~~~~~~g~~~~~~~~~~~~~~~ 317 (381) T protein:vir:80 290 VSL-SYFGLPVFSGAGATAADGGQTLGSF 317 (381) T ss_pred eee-eeccceeeecceeeecCCCceeeee Confidence 433 223333333333333322221 111 No 139 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.23 E-value=1.7e-07 Score=57.74 Aligned_cols=286 Identities=12% Similarity=-0.021 Sum_probs=143.5 Q ss_pred ccHHHHHHHHHHhcccCCCCceEc------cHHHHHHHHHHHHhhhhhhhhceeeec--CCceEEEEec---CCcceEEe Q lcl|Aclame:pro 65 LSANQRSFFMDINKNVNYKEEKLL------PEETIDRIFEDLTTNHPLLADLGIKNA--GLRLKFLKSE---TSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lv------P~~~~~~Ii~~l~~~~~l~~~~~v~~~--~~~~~ip~~~---~~~~a~w~ 133 (381) +|.. .- ...+..+|..+| |+-+-++|.+.++..-..-.+.+-... ++-...-... ..+.+.-+ T Consensus 1 ~~~~-----~~-i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAP-----TG-IVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCC-----Cc-ceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhc Confidence 1110 00 001112344444 777777777766554332233332222 2223332221 12445557 Q ss_pred cccccccccccccccceec-cceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccc Q lcl|Aclame:pro 134 KIYGEIKGQLDAAFSEETA-IQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQ 212 (381) Q Consensus 134 ~e~~~~~~~~~~~f~~v~l-~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~ 212 (381) .|+++++ ...+.++...+ ..+|++.-+.||.|++..+..+..+-....++.+|++..|...+.- |..+.. 
T Consensus 75 aEggEiP-~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~da--------l~sa~t 145 (318) T protein:vir:10 75 AEFGEIP-VSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKAL--------LQSPIV 145 (318) T ss_pred cCccccc-ccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHH--------Hhcccc Confidence 8888876 45677877777 5579999999999999999999999999999999999888765421 000000 Q ss_pred cccccccccccccchhhhccccChhHHHHH-HHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc-----cCC Q lcl|Aclame:pro 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNE-LTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-----LNA 286 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~-----~~~ 286 (381) ....++... .+. ........++...++. .+.+...- .-.......|..+ +++|||.++..++..... .++ T Consensus 146 ~~~~~s~~w-~~~-~~~~~d~~~A~e~v~~a~~~~~~a~-~~~~~~~~GY~pd-tIVlhP~~~~~l~~n~~~~~~y~~~a 221 (318) T protein:vir:10 146 PTLAVPTAW-DNG-GKVRTDIAIAIEQISTAAPTAYPAG-VGSSDEYFGFIPD-TIVMHYALLPILMDNENFMKVYERNA 221 (318) T ss_pred ccccCCcCC-CCc-ccccccchhhhhhhhhhhhhhhhhh-hhhhhhccCccce-eeEECHHHHHHHhcchhhhhhhhccc Confidence 000000000 000 0000000111111100 00000000 0000112345544 467999998877432211 111 Q ss_pred C-----Cceee---ccCCCceEEecCCCCCccEEEEeccc-eEEEecceeeEeeehh----hhh-hcCceEEEEEEEEcC Q lcl|Aclame:pro 287 N-----GVYVT---ALPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE----TLA-LDDMDLYTAKQFAYG 352 (381) Q Consensus 287 ~-----G~~~~---~l~~g~~vi~s~~~p~~~i~~gd~s~-y~i~~r~~~~i~~~~~----~~~-~~d~~~~~~~~r~dg 352 (381) + ..|.- ..++|+.|+.|..+|.+++++.+-.. -.+.|..+++...... 
..- .+.....++.++.-- T Consensus 222 ~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~ 301 (318) T protein:vir:10 222 NYVSTAPDWTGNFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRAL 301 (318) T ss_pred hhhhhcccccccccceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeee Confidence 1 11111 23468999999999999998777544 2345777777654321 111 122334456666666 Q ss_pred EEecCcceEEEEEEeccccc Q lcl|Aclame:pro 353 KAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 353 k~~~~~Af~v~~l~~~~~~~ 372 (381) .+.+|+|...+ +--.. | T Consensus 302 ~V~~PkA~~~i--tgi~~-~ 318 (318) T protein:vir:10 302 AVDQPKAALWL--TGIVT-P 318 (318) T ss_pred eeeCcceeEEE--eeccC-C Confidence 77888885543 33221 1 No 140 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.22 E-value=2.2e-07 Score=57.09 Aligned_cols=241 Identities=17% Similarity=0.193 Sum_probs=136.5 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceEEec Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~w~~ 134 (381) +.......+|-.| +.+. +-|......|+|.+.+.++|+..+.+.... ....-.+.++.|.+.|.. 
T Consensus 1 m~~~~~~a~TL~e------~AKr-------~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~ 67 (330) T protein:vir:10 1 MATLSTNNPTMAD------VAKR-------LDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) T ss_pred CCcCCCCcccHHH------HHhh-------cCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhh Confidence 2211222233222 1111 224556678999999999999999887532 222345567788999999 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHH---HHHHHHHHHHHHHhhheeeccCCCccee---ee Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERF---VRVQIEEAFAVALETAFLKGTGKDQPIG---LN 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~---i~~~la~a~a~~~d~a~l~G~G~~qP~G---il 208 (381) .+...+ ++.+++.+++-..+-|.+...|-+.|.+... |..+| -.....+++++.+.+.|++|+-+..|.+ |- T Consensus 68 lN~g~~-~s~~tt~qvt~~l~ilgg~~eVDr~la~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~ 145 (330) T protein:vir:10 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) T ss_pred cCCccc-cccceEEEEEEEeEEecchhhhhhHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchh Confidence 877664 5679999999999999999999999987644 55555 4456889999999999999997766654 42 Q ss_pred ecccccc-------cccc--------------------cccccc-------chhhhccc--cChh--------------- Q lcl|Aclame:pro 209 RQVQKGV-------SVTE--------------------GAYPEK-------EEQGTLTF--ANPR--------------- 237 (381) Q Consensus 209 ~~~~~~~-------~~~~--------------------~~~~~~-------~~~~~~t~--~~~~--------------- 237 (381) +.....+ ...+ +-+|.- ...+..+. .+.. 
T Consensus 146 kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~ 225 (330) T protein:vir:10 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) T ss_pred hhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeee Confidence 2221000 0000 011100 01110010 0000 Q ss_pred -------------------H--HHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc--------CCCC Q lcl|Aclame:pro 238 -------------------A--TVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL--------NANG 288 (381) Q Consensus 238 -------------------~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~--------~~~G 288 (381) . .-..-.+++..+....+.-|...+++.+|.||+.-...++.+...+ +..| T Consensus 226 Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g 305 (330) T protein:vir:10 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSG 305 (330) T ss_pred eeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCC Confidence 0 0000001111121112233455678889999998777776654322 2345 Q ss_pred ceeeccCCCceEEecCCCCCcc-EEE Q lcl|Aclame:pro 289 VYVTALPFNLNVIESTVQEAGK-VLT 313 (381) Q Consensus 289 ~~~~~l~~g~~vi~s~~~p~~~-i~~ 313 (381) ..++.. .|+||..++++-... .+. 
T Consensus 306 ~~~t~~-~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 306 ERVMTF-DGIPVQRTDALLNTESRVV 330 (330) T ss_pred eeeEEE-CCeEEEEEeeeecCccccC Confidence 555333 588888877764221 111 No 141 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.22 E-value=3.8e-07 Score=55.77 Aligned_cols=260 Identities=11% Similarity=-0.019 Sum_probs=145.8 Q ss_pred HhcccCCCCceEccH---HHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceEEeccccccccccccccc-- Q lcl|Aclame:pro 76 INKNVNYKEEKLLPE---ETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFS-- 148 (381) Q Consensus 76 ~~~~~~~~gg~lvP~---~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~-- 148 (381) |.+......--|+|. ++.++.-..+.+...++...+..|++ ..+++|+..-.+.+.-++|+.+++- +..+.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Ipl-skvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPL-SKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccch-hhheeeee Confidence 333322333334433 34445545555555666666777763 4589999887788888999887753 344433 Q ss_pred -ceeccceeeeeehhhhHHHHhcChh-HHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccc Q lcl|Aclame:pro 149 -EETAIQNKLTAFVVLPKDLNDFGPA-WIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKE 226 (381) Q Consensus 149 -~v~l~~~kl~~~~~iS~ell~ds~~-~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~ 226 (381) ..++..+|+..- +|.|-++.|.. +-...-.+.|..+++..++..|+.=- ..+.. 
T Consensus 80 ~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~l------------ktat~---------- 135 (295) T protein:vir:99 80 KDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFL------------KTKPT---------- 135 (295) T ss_pred eeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHh------------ccCce---------- Confidence 356666777764 48998854433 45566778899999999988886411 00000 Q ss_pred hhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCC-----CceeeccCCCce-E Q lcl|Aclame:pro 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNAN-----GVYVTALPFNLN-V 300 (381) Q Consensus 227 ~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~-----G~~~~~l~~g~~-v 300 (381) + .........++..+..+...... +..+.+..+||.|.+.+|+........ .+|+.. -.|.. | T Consensus 136 ---t---~tg~~lq~a~a~~~~al~~f~Ee----~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~n-fLG~q~I 204 (295) T protein:vir:99 136 ---K---VKGVGLQKALSASWAKLATFNEF----EGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKN-FLGMQNV 204 (295) T ss_pred ---e---eehhhHHHHHHHhhhhhhhcccc----cCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhh-hhccceE Confidence 0 01112223344444444332221 233568899999999998754322111 133322 13664 8 Q ss_pred EecCCCCCccEEEEec---cceEEEec-ceeeEeeehhhhhhcCceEEEEEEEEc-------------C---EEecCcce Q lcl|Aclame:pro 301 IESTVQEAGKVLTYVK---GLYDGYLA-GGINVQKFKETLALDDMDLYTAKQFAY-------------G---KAKDNKVA 360 (381) Q Consensus 301 i~s~~~p~~~i~~gd~---s~y~i~~r-~~~~i~~~~~~~~~~d~~~~~~~~r~d-------------g---k~~~~~Af 360 (381) |.|..+|+|+++.--. ..|++... +++. .-..+..|++++++..+.- | .|=..++. 
T Consensus 205 I~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~----~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgi 280 (295) T protein:vir:99 205 IVMPSVPEGKIYSTAVENLVFASLNVKGGDLG----GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGV 280 (295) T ss_pred EEcccCCCceEEEeeccceEEEEecCCchhhh----hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceE Confidence 9999999999875433 22333333 2232 3345556777887775531 1 12234566 Q ss_pred EEEEEEecccccCCCC Q lcl|Aclame:pro 361 AVWKLDLKGHKPALEG 376 (381) Q Consensus 361 ~v~~l~~~~~~~~~~~ 376 (381) ++.++.. +.+|.+-| T Consensus 281 v~~tI~~-~~~~~~~~ 295 (295) T protein:vir:99 281 VEATIEA-AAVPGIGG 295 (295) T ss_pred EEEEEec-CcCCCCCC Confidence 6666644 33444444 No 142 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.20 E-value=1.7e-07 Score=57.69 Aligned_cols=296 Identities=12% Similarity=0.018 Sum_probs=147.4 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceEEeccc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIY 136 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~w~~e~ 136 (381) ....+.+|.. -..+..++- .+.-+.+..++.+.+...+.++++..+.++. |+ ..+|+-. 
..++....-+ T Consensus 1 ms~~~~~tr~-------~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG-~~~~~~~~pG 71 (335) T protein:vir:63 1 MSFLNDLTRP-------NYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLG-NVEAKGRRAG 71 (335) T ss_pred CCCcccchhh-------hcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeee-eeeeecccCC Confidence 0011111110 011222222 2334899999999999999999999888864 43 6788763 3345444433 Q ss_pred ccccccccccccceeccceee-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceeeeecc Q lcl|Aclame:pro 137 GEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGLNRQV 211 (381) Q Consensus 137 ~~~~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l----~G~G~~qP~Gil~~~ 211 (381) .++.. +.+..++..|....+ ++...|..----++.+|+.+.+.+++++++++..|++++ .+.+..-|.++=... T Consensus 72 ~~l~~-~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~ 150 (335) T protein:vir:63 72 EELER-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAF 150 (335) T ss_pred cCcCC-CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCc Confidence 33322 234456655555443 233334433333466789999999999999999999763 444333222221000 Q ss_pred ccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc-----CC Q lcl|Aclame:pro 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-----NA 286 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~-----~~ 286 (381) ..+.+. .....+.....++..+...+......+.. ...|..-....+.+|+|.-++.++...... 
++ T Consensus 151 ~~G~~~------~~~~tg~~~~~~~~~l~~a~~~a~~~L~e--~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s 222 (335) T protein:vir:63 151 SPGVLE------KLDLTGLTAKQAADKIVRMHRRVVETFID--RDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQAT 222 (335) T ss_pred CCCcce------eeeeccCcccccHHHHHHHHHHHHHHHHh--ccCCCcccCceEEEeChHHHHHHhccccccccccccc Confidence 000000 00000100011233332222222222221 111111112356789999988887642211 12 Q ss_pred CC--ceeec---cCCCceEEecCCCCCccE-----------EEEeccceEE--Eec--------ceeeEee-ehhhhhhc Q lcl|Aclame:pro 287 NG--VYVTA---LPFNLNVIESTVQEAGKV-----------LTYVKGLYDG--YLA--------GGINVQK-FKETLALD 339 (381) Q Consensus 287 ~G--~~~~~---l~~g~~vi~s~~~p~~~i-----------~~gd~s~y~i--~~r--------~~~~i~~-~~~~~~~~ 339 (381) +| .|.+. ...|+||+.|+.+|.+.+ .-|||++... .-+ .++..+. .+...|.. T Consensus 223 ~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~ 302 (335) T protein:vir:63 223 GATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSW 302 (335) T ss_pred cccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhH Confidence 22 23321 235999999999995432 3356655332 111 1222222 12233433 Q ss_pred CceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 340 d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) .+.+++=++-.+.+|+++++++++ + .++...+- T Consensus 303 ---~i~~~~a~G~g~lRPe~a~~i~~t--g-~~~~~~~~ 335 (335) T protein:vir:63 303 ---VLDTFQMYNIGARRPDTAGAIELK--G-IGAFDITA 335 (335) T ss_pred ---HhHHHHHcCCcccccceEEEEEEc--C-CCceeecC Confidence 344555588999999999987753 3 22222222 No 143 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.18 E-value=3.1e-07 Score=56.26 Aligned_cols=296 Identities=11% Similarity=0.011 Sum_probs=145.8 Q ss_pred 
hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceEEeccc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIY 136 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~w~~e~ 136 (381) ....+.+|.. -..+.+++- .+.-+.+..++.+.+...+.++++..+.++. |+ ..+|+- +...+....-+ T Consensus 1 ms~~~~~t~~-------~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG 71 (335) T protein:vir:78 1 MSFLNDLTRP-------NYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAG 71 (335) T ss_pred CCcccccccc-------ccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccC Confidence 0000111110 011222222 2334899999999999999999999888864 43 778875 33344444433 Q ss_pred ccccccccccccceeccceee-eeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceeeeecc Q lcl|Aclame:pro 137 GEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGLNRQV 211 (381) Q Consensus 137 ~~~~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l----~G~G~~qP~Gil~~~ 211 (381) .++.. +.+..++..|..-.+ .+...|..----++..|+.+.+.+++++++++..|++++ .+.+..-|..+=... T Consensus 72 ~~l~~-~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~ 150 (335) T protein:vir:78 72 EELER-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAF 150 (335) T ss_pred cccCC-CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCc Confidence 33322 234556655554443 233334333333466789999999999999999999764 333322222110000 Q ss_pred ccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc-----CC Q lcl|Aclame:pro 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-----NA 286 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~-----~~ 286 (381) .. ++... ....+.....++..+.+.+......+. ....|.......+.+|+|.-++.++...... 
++ T Consensus 151 ~~-----G~~~~-~~~tg~~~~~~~~~l~~a~~~a~~~l~--ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s 222 (335) T protein:vir:78 151 SP-----GVLEK-LDLTGLTAKEAAEKIVRMHRRVVETFI--ERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQAT 222 (335) T ss_pred CC-----Cccee-eeeccccccccHHHHHHHHHHHHHHHH--hccCCCCCCCccEEEeChHHHHHHhccccccccccccc Confidence 00 00000 000000001123333333333322222 1112222223457899999998887642211 22 Q ss_pred CC--ceeec---cCCCceEEecCCCCCccE-----------EEEeccc-eE-EEecc--------eeeEeee-hhhhhhc Q lcl|Aclame:pro 287 NG--VYVTA---LPFNLNVIESTVQEAGKV-----------LTYVKGL-YD-GYLAG--------GINVQKF-KETLALD 339 (381) Q Consensus 287 ~G--~~~~~---l~~g~~vi~s~~~p~~~i-----------~~gd~s~-y~-i~~r~--------~~~i~~~-~~~~~~~ 339 (381) +| .|.+. ...|+||+.|+++|.+.+ .-+||+. .. ++-++ ++..+.. ++..|.. T Consensus 223 ~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~ 302 (335) T protein:vir:78 223 GATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSW 302 (335) T ss_pred ccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhH Confidence 22 23322 235999999999996532 1234433 11 11221 2222222 2233433 Q ss_pred CceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 340 d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) .+.+++=++-++.+|++.++++++ + .++...+- T Consensus 303 ---~i~~~~a~G~g~lRPe~a~~i~~t--g-~~~~~~~~ 335 (335) T protein:vir:78 303 ---VLDTFQMYNIGARRPDTAGAIELK--G-IEAFDITA 335 (335) T ss_pred ---hhhHHHHcCCcccCcceEEEEEec--C-CCcccccC Confidence 344555588999999998887643 3 22222222 No 144 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.12 E-value=5.5e-07 Score=54.89 Aligned_cols=297 Identities=10% Similarity=0.022 Sum_probs=146.5 Q ss_pred 
HhhhhhccccHHHHHHHHHHhcccCC-CCc-----eEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCc Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNY-KEE-----KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSG 128 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~-~gg-----~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~ 128 (381) +..-....+ -..+.++.. -|| .+--+.+..++.......+.++++.++.++. |+ .++|+-.. . T Consensus 1 ~~~~~~~~~--------~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~-~ 71 (375) T protein:vir:10 1 MANANQVAL--------GRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGR-M 71 (375) T ss_pred Ccccccccc--------CccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeee-e Confidence 100000000 011111111 112 2334788999999999999999999888764 44 67787633 3 Q ss_pred ceEEeccccccc--ccccccccc--eeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccC Q lcl|Aclame:pro 129 VAVWGKIYGEIK--GQLDAAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG 200 (381) Q Consensus 129 ~a~w~~e~~~~~--~~~~~~f~~--v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G 200 (381) ++....-+.++. +..+.+-.+ ++++..++.. ..|..-=--++..|+.+.+.++.+.++++..|++++. |-. T Consensus 72 t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~-~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~ 150 (375) T protein:vir:10 72 TSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISS-AFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGAR 150 (375) T ss_pred EEeeecCCcCcCCccccCCCCCceEEEecchhhhh-hhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 333333222221 111222233 4445444443 3333322334567899999999999999999987752 333 Q ss_pred CCcceeeeecccc-ccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 201 KDQPIGLNRQVQK-GVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~qP~Gil~~~~~-~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) ..-|.+.-..... +.....+. ........++..+.+.+.++...+. 
.+..+- .+.+.+++|..++.+++ T Consensus 151 ~~~p~~~~~~~~~Gg~~i~~~s-----g~~~~~~~ta~~~~~ai~~a~~~Ld----e~~VP~-~~R~~vv~P~~y~~Ll~ 220 (375) T protein:vir:10 151 SASPVSATNFVEPGGTQIRVGS-----GTNESDAFTASALVNAFYDAAAAMD----EKGVSS-QGRCAVLNPRQYYALIQ 220 (375) T ss_pred hccccccccccccCcceeeecc-----ccccccccCHHHHHHHHHHHHHHHh----hcCCCC-CCCEEEeChHHHHHHHh Confidence 3333222111000 00000000 0000011223344444443333332 222222 24567899998877765 Q ss_pred hhh-----ccC--CCCceee---ccCCCceEEecCCCCCcc------------------------------EEEE----- Q lcl|Aclame:pro 280 QYT-----HLN--ANGVYVT---ALPFNLNVIESTVQEAGK------------------------------VLTY----- 314 (381) Q Consensus 280 ~~~-----~~~--~~G~~~~---~l~~g~~vi~s~~~p~~~------------------------------i~~g----- 314 (381) ..+ ..+ .+|.+.. ....|++|+.|+.+|... +.-| T Consensus 221 ~~d~~~~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y 300 (375) T protein:vir:10 221 DIGSNGLVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDY 300 (375) T ss_pred cCCccceeeecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccc Confidence 321 111 1221221 123589999999998321 1112 Q ss_pred --ec---cc--eEEEec--------ceeeEeeeh-hhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCC Q lcl|Aclame:pro 315 --VK---GL--YDGYLA--------GGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 315 --d~---s~--y~i~~r--------~~~~i~~~~-~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~ 375 (381) || ++ -.+.-+ .++.++++. 
++.-.+-...+.+++=.+-.+.+|++++.+ +..+.-++-+ T Consensus 301 ~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l--~~~~~~~~~~ 375 (375) T protein:vir:10 301 GTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVEL--YIGATAPSAF 375 (375) T ss_pred cccccccCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEE--ecCcCccccC Confidence 33 21 111111 344555542 233445566788999999999999997764 5555555555 No 145 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.11 E-value=1.1e-07 Score=58.76 Aligned_cols=288 Identities=10% Similarity=-0.007 Sum_probs=135.6 Q ss_pred HhhhhhccccHHHHHHHHHHhcccC-CCCc--eEccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceE Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVN-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAV 131 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg--~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~ 131 (381) ++.-....+ ..+.+.+ ++|- .+-=+.+..++.......+.+++++++.++. |+ ..+|+-.. .++. T Consensus 1 m~~~~~~~~---------~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~-~tv~ 70 (347) T protein:vir:94 1 MANVPGQKI---------GTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGR-TSGV 70 (347) T ss_pred CCCCCcccc---------ccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccc-eeee Confidence 111111000 0011111 1111 1223788899998888888889998888754 44 67887633 3343 Q ss_pred Eeccccccccc-ccccccc--eeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee----ccC-CCc Q lcl|Aclame:pro 132 WGKIYGEIKGQ-LDAAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG-KDQ 203 (381) Q Consensus 132 w~~e~~~~~~~-~~~~f~~--v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G-~~q 203 (381) ....+.++... .+.+=.+ ++++..++.. ..|..-=-.++..|+.+.+.++.+.++++..|++++. ... ... 
T Consensus 71 ~~t~G~~l~~~~~~~~~~e~~itID~~~~~~-~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~ 149 (347) T protein:vir:94 71 YLAPGERLSDKRKGIKHTEKVITIDGLLTAD-VMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAA 149 (347) T ss_pred eecCCCCcCCCCCCCCcceEEEEecchhhhh-HHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 33333333211 1222344 4444333332 2232211223556788999999999999999987752 111 111 Q ss_pred ceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc Q lcl|Aclame:pro 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~ 283 (381) +.+.......+.....+...+... ...........+..+...|. ....| - .+.+.+++|..++.++..... T Consensus 150 ~~~~~~g~~~~s~~~~~~~~~~~~----~~~~~~~~~~~i~~a~~~Ld--e~~VP--~-~~R~~vv~P~~~~~Ll~~~~~ 220 (347) T protein:vir:94 150 SNENIAGLGTASVLEVGKKADLDT----PAKLGEAIIGQLTIARAKLT--SNYVP--A-GDRYFYTTPDNYSAILAALMP 220 (347) T ss_pred cccccCCCcccceeeccccccccc----hhhhHHHHHHHHHHHHHHHh--hcCCC--C-CCcEEEeCHHHHHHHhccchh Confidence 111111000000000000000000 00111222222222222221 11222 2 245678999988766432211 Q ss_pred cC---------CCCceeeccCCCceEEecCCCCCcc-----------EE---------------EEeccce--EEE---- Q lcl|Aclame:pro 284 LN---------ANGVYVTALPFNLNVIESTVQEAGK-----------VL---------------TYVKGLY--DGY---- 322 (381) Q Consensus 284 ~~---------~~G~~~~~l~~g~~vi~s~~~p~~~-----------i~---------------~gd~s~y--~i~---- 322 (381) .. .+|... ...|++|+.|+.+|.+. +. 
-+||+.- .++ T Consensus 221 ~~~~~~~~~~~~~G~Vg--~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A 298 (347) T protein:vir:94 221 NAANYAALIDPETGNIR--NVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSA 298 (347) T ss_pred hhhhccccccccccceE--EEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhh Confidence 11 123221 22689999999998421 11 1222221 111 Q ss_pred ----ecceeeEeeeh-hhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 323 ----LAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 323 ----~r~~~~i~~~~-~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~ 369 (381) ...+++++... +.+|. ..+++++.++.++++|++.+++.++.++ T Consensus 299 ~~~v~~~~~~~e~~r~~~~~~---d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 299 VGTVKLRDLALERDRDVDAQG---DLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhhhhcccccccchhchhhHH---HHhhhhhhhcCcccccceeEEEEecCCC Confidence 11223444322 33443 3789999999999999999999888544 No 146 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.11 E-value=1.4e-06 Score=52.74 Aligned_cols=295 Identities=8% Similarity=-0.045 Sum_probs=138.3 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCce-EccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceEEecc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEK-LLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKI 135 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~-lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~w~~e 135 (381) ....+.+|.. . ..+++.-+ +--+.+..++.+.....+.+++...+.++. |+ .++|+-... 
++....- T Consensus 1 ms~~n~~t~~--------~-~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~-~~~~~~~ 70 (364) T protein:vir:10 1 MSNPNVLTQP--------A-VSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGET-ELQVLSP 70 (364) T ss_pred CCCccccccc--------c-cccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeee-EEeeecc Confidence 0000111000 0 00111112 334888999999999999999999888864 43 778876333 3433332 Q ss_pred cccccccccccccceeccceeee-eehhhhHHHHhcChhH-HHHHHHHHHHHHHHHHHhhheee---ccCCCcceeeeec Q lcl|Aclame:pro 136 YGEIKGQLDAAFSEETAIQNKLT-AFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK---GTGKDQPIGLNRQ 210 (381) Q Consensus 136 ~~~~~~~~~~~f~~v~l~~~kl~-~~~~iS~ell~ds~~~-l~~~i~~~la~a~a~~~d~a~l~---G~G~~qP~Gil~~ 210 (381) +.++- ...+..++.+|....+- +...|-.=---++.+| +.+.+.+++++++++..|++++. --+..+-.+.... T Consensus 71 G~~ld-~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~ 149 (364) T protein:vir:10 71 GKSPD-ASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKN 149 (364) T ss_pred CcccC-CCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccC Confidence 22322 23455566565544432 2233322112234566 67899999999999999998741 0010000000000 Q ss_pred cccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc------ Q lcl|Aclame:pro 211 VQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL------ 284 (381) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~------ 284 (381) ....+ +|.... .+ .+..+.......+.+.+.......+.+..+.. +.+.+|+|..++.+++..... 
T Consensus 150 --~~~~~-~g~~i~---~~-~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~-~R~~vv~P~~y~~Ll~~~~lvn~d~~~ 221 (364) T protein:vir:10 150 --PRVAG-HGFSIH---IV-GLASSFLTSPQYMMAAIEMAMEQQTEQEVDTS-ELCGLMPWTAFNCLRDADRIVDKSYTI 221 (364) T ss_pred --CcccC-Ccceee---ec-ccCcchhhhHHHHHHHHHHHHHHHhhcCCCcc-ccEEEeChHHHHHHhcCCccccccccc Confidence 00000 000000 00 01111111112222211111111222333333 357789999988887642211 Q ss_pred CCCCceeec---cCCCceEEecCCCCCcc---------------------E--EEEeccce--EEEec--------ceee Q lcl|Aclame:pro 285 NANGVYVTA---LPFNLNVIESTVQEAGK---------------------V--LTYVKGLY--DGYLA--------GGIN 328 (381) Q Consensus 285 ~~~G~~~~~---l~~g~~vi~s~~~p~~~---------------------i--~~gd~s~y--~i~~r--------~~~~ 328 (381) .++|.|.+. ...|+||+.|+++|... . ..+||+.. .++-+ .++. T Consensus 222 ~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t 301 (364) T protein:vir:10 222 AASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISIT 301 (364) T ss_pred cCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecce Confidence 133444432 23699999999998420 0 12455442 22333 3444 Q ss_pred Eeee-hhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC---CCC Q lcl|Aclame:pro 329 VQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE---ETL 381 (381) Q Consensus 329 i~~~-~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~---~~~ 381 (381) .+.. ++.++.. ...+++=++-.+.+|+++++++. 
.+.-++- .|+ T Consensus 302 ~e~~~~~~~~~~---~ida~~a~G~g~lRPeaa~~i~~------~~~~~~~~~~~~~ 349 (364) T protein:vir:10 302 GDIFYEKKEKTW---YIDTFLAEGAIPDRWEAVAVVTA------ADTAELATDHNAI 349 (364) T ss_pred eeeeeccceeee---eeeeehcccCcccCccceEEEEe------cCCCCCccchhhh Confidence 4443 3333333 33345558899999999988721 1111211 122 No 147 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.06 E-value=3.5e-07 Score=55.96 Aligned_cols=294 Identities=9% Similarity=-0.023 Sum_probs=144.3 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCce-EccHHHHHHHHHHHHhhhhhhhhceeeecC-Cc-eEEEEecCCcceEEecc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEK-LLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKI 135 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~-lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~-~~-~~ip~~~~~~~a~w~~e 135 (381) ....+.+|.. . ..+++.-+ +--+.+..++.+.....+.+++...+.++. |+ .++|+-.. .++....- T Consensus 1 Ms~~n~~t~~--------~-~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~-~~a~y~~~ 70 (402) T protein:vir:97 1 MSTPNTLTNV--------A-VSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE-TELQVLAP 70 (402) T ss_pred CCCccccccc--------c-cccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEee-eEEeeecc Confidence 0000111000 0 00111112 334888999999999999999999888864 43 77887633 33444333 Q ss_pred cccccccccccccceeccceeee-eehhhhHHHHhcChhH-HHHHHHHHHHHHHHHHHhhheee-----ccCCCcceeee Q lcl|Aclame:pro 136 YGEIKGQLDAAFSEETAIQNKLT-AFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK-----GTGKDQPIGLN 208 (381) Q Consensus 136 ~~~~~~~~~~~f~~v~l~~~kl~-~~~~iS~ell~ds~~~-l~~~i~~~la~a~a~~~d~a~l~-----G~G~~qP~Gil 208 (381) +.+.- .+.+..++..|....+- +...|..=---++.+| +.+.+.+++++++++..|++++. |--+..|.+-. 
T Consensus 71 G~~ld-g~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~ 149 (402) T protein:vir:97 71 GQSPN-ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) T ss_pred ccccC-CCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 23322 23455566555544332 2222222111234456 67889999999999999997742 11111111100 Q ss_pred eccccccccccccccccchhhhccc----cChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc- Q lcl|Aclame:pro 209 RQVQKGVSVTEGAYPEKEEQGTLTF----ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH- 283 (381) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~t~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~- 283 (381) ....... ... ..+.+. .++..+...+......+ +.+..+..+ .+.+|+|..++.+++.... T Consensus 150 ~~~~~~g-~s~--------~~~~t~~~a~~~~~~l~~ai~~a~~~L----dEkdVP~~d-Rv~vv~P~~y~~Ll~~~rl~ 215 (402) T protein:vir:97 150 PRVKGHG-FSI--------NVNVTESEALANPQYVMAAVEYALEQQ----LEQEVDISD-VAIMMPWKFFNALRDADRIV 215 (402) T ss_pred Ccccccc-ccc--------ccccccchhhcCHHHHHHHHHHHHHHH----HhcCCCccc-cEEEeChHHHHHHhhccccc Confidence 0000000 000 001111 12222222222222222 223333443 5789999988877754211 Q ss_pred -----cCCCCceeec---cCCCceEEecCCCCCcc--E---------------EEEeccc--eEEEecceeeE------- Q lcl|Aclame:pro 284 -----LNANGVYVTA---LPFNLNVIESTVQEAGK--V---------------LTYVKGL--YDGYLAGGINV------- 329 (381) Q Consensus 284 -----~~~~G~~~~~---l~~g~~vi~s~~~p~~~--i---------------~~gd~s~--y~i~~r~~~~i------- 329 (381) ..++|.|.+. ...|++|+.|+++|... + .-|||+. .+++.+..+-. 
T Consensus 216 n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT 295 (402) T protein:vir:97 216 DKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVT 295 (402) T ss_pred chhhccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccc Confidence 1234555543 23699999999999531 1 1255554 33333322211 Q ss_pred -ee-ehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 330 -QK-FKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 330 -~~-~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) +. .+..++..- +.+++=++-.+.+++|..|+.++. ..||+..+.+-|= T Consensus 296 ~~~~~d~r~~~~~---id~~~a~G~g~~RPeaa~vv~~~~-~~t~~~~~~~~~~ 345 (402) T protein:vir:97 296 GDIFYEKKEKTYY---IDTFMAEGAIPDRWEAVSVVTTKR-DATTGDAGGPGDD 345 (402) T ss_pred cchhhchhHHHHH---HHHHHHhCCcccCccceEEEEEec-ccccccCCccccc Confidence 11 122233322 333444677889999999999988 4455555544433 No 148 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.97 E-value=1.1e-06 Score=53.28 Aligned_cols=238 Identities=13% Similarity=0.100 Sum_probs=133.0 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceEEec Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~w~~ 134 (381) +.-.+...+|-.| ..+..+++ ..+...|+|.+.+.++|+..+.+.... ......+.++.+.+.|.. 
T Consensus 1 m~~~~~~~~TL~e------~Ak~~~~~------~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~ 68 (331) T protein:vir:10 1 MPTLSTTNPTLAD------VAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) T ss_pred CCccccCcccHHH------HHHhcCcc------hhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhc Confidence 1111112222221 11122222 234567999999999999999998643 224456778889999999 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceee---e Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i---~~~la~a~a~~~d~a~l~G~G~~qP~Gi---l 208 (381) .+...+ ++.+++.+++-..+-|.+.+.|.+.|.+... +..+|- ...+.++++..+...|++|+-+..|.++ - T Consensus 69 lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~ 146 (331) T protein:vir:10 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) T ss_pred cCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccch Confidence 877664 5688999999999999999999999988754 555553 4457889999999999999976566544 2 Q ss_pred eccccc------ccc-cccc--------------------cccc-------chhhhcccc-------------------- Q lcl|Aclame:pro 209 RQVQKG------VSV-TEGA--------------------YPEK-------EEQGTLTFA-------------------- 234 (381) Q Consensus 209 ~~~~~~------~~~-~~~~--------------------~~~~-------~~~~~~t~~-------------------- 234 (381) +..... ... .+|+ +|.- ...+..+.. 
T Consensus 147 kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl 226 (331) T protein:vir:10 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) T ss_pred hhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeee Confidence 211100 000 0000 0100 000000000 Q ss_pred ---------------------ChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC-------- Q lcl|Aclame:pro 235 ---------------------NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-------- 285 (381) Q Consensus 235 ---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~-------- 285 (381) ++.+..+ ++..+....+.-|...+++.+|.||+.-...++++...++ T Consensus 227 ~i~d~r~v~ri~NIdvs~l~~~~~~~~d----l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~ 302 (331) T protein:vir:10 227 TLRDWRYVVRIANVDVSELTKNASAGAD----LIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTME 302 (331) T ss_pred EEcCcccEEEEeccchhccCCCcchhhh----HHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeee Confidence 0111112 1222211122234445778899999977666665433221 Q ss_pred -CCCceeeccCCCceEEecCCCCCcc-EEE Q lcl|Aclame:pro 286 -ANGVYVTALPFNLNVIESTVQEAGK-VLT 313 (381) Q Consensus 286 -~~G~~~~~l~~g~~vi~s~~~p~~~-i~~ 313 (381) ..|..++.. .|+||..++++-... .+. 
T Consensus 303 ~~~g~~~t~~-~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 303 EIAGKKVVAF-DGIPCRRTDALLLTEARVV 331 (331) T ss_pred ecCCcceeEE-CCeeEEEeeeeecCccccC Confidence 123333332 477877777654221 111 No 149 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.97 E-value=1.1e-06 Score=53.28 Aligned_cols=238 Identities=13% Similarity=0.100 Sum_probs=133.0 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceEEec Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~w~~ 134 (381) +.-.+...+|-.| ..+..+++ ..+...|+|.+.+.++|+..+.+.... ......+.++.+.+.|.. T Consensus 1 m~~~~~~~~TL~e------~Ak~~~~~------~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~ 68 (331) T protein:vir:98 1 MPTLSTTNPTLAD------VAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) T ss_pred CCccccCcccHHH------HHHhcCcc------hhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhc Confidence 1111112222221 11122222 234567999999999999999998643 224456778889999999 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceee---e Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i---~~~la~a~a~~~d~a~l~G~G~~qP~Gi---l 208 (381) .+...+ ++.+++.+++-..+-|.+.+.|.+.|.+... 
+..+|- ...+.++++..+...|++|+-+..|.++ - T Consensus 69 lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~ 146 (331) T protein:vir:98 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) T ss_pred cCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccch Confidence 877664 5688999999999999999999999988754 555553 4457889999999999999976566544 2 Q ss_pred eccccc------ccc-cccc--------------------cccc-------chhhhcccc-------------------- Q lcl|Aclame:pro 209 RQVQKG------VSV-TEGA--------------------YPEK-------EEQGTLTFA-------------------- 234 (381) Q Consensus 209 ~~~~~~------~~~-~~~~--------------------~~~~-------~~~~~~t~~-------------------- 234 (381) +..... ... .+|+ +|.- ...+..+.. T Consensus 147 kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl 226 (331) T protein:vir:98 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) T ss_pred hhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeee Confidence 211100 000 0000 0100 000000000 Q ss_pred ---------------------ChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC-------- Q lcl|Aclame:pro 235 ---------------------NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-------- 285 (381) Q Consensus 235 ---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~-------- 285 (381) ++.+..+ ++..+....+.-|...+++.+|.||+.-...++++...++ T Consensus 227 ~i~d~r~v~ri~NIdvs~l~~~~~~~~d----l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~ 302 (331) T protein:vir:98 227 TLRDWRYVVRIANVDVSELTKNASAGAD----LIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTME 302 (331) T ss_pred EEcCcccEEEEeccchhccCCCcchhhh----HHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeee Confidence 0111112 1222211122234445778899999977666665433221 Q ss_pred -CCCceeeccCCCceEEecCCCCCcc-EEE Q lcl|Aclame:pro 286 -ANGVYVTALPFNLNVIESTVQEAGK-VLT 313 (381) Q Consensus 286 
-~~G~~~~~l~~g~~vi~s~~~p~~~-i~~ 313 (381) ..|..++.. .|+||..++++-... .+. T Consensus 303 ~~~g~~~t~~-~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 303 EIAGKKVVAF-DGIPCRRTDALLLTEARVV 331 (331) T ss_pred ecCCcceeEE-CCeeEEEeeeeecCccccC Confidence 123333332 477877777654221 111 No 150 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.97 E-value=1.1e-06 Score=53.28 Aligned_cols=238 Identities=13% Similarity=0.100 Sum_probs=133.0 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceEEec Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~w~~ 134 (381) +.-.+...+|-.| ..+..+++ ..+...|+|.+.+.++|+..+.+.... ......+.++.+.+.|.. T Consensus 1 m~~~~~~~~TL~e------~Ak~~~~~------~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~ 68 (331) T protein:vir:10 1 MPTLSTTNPTLAD------VAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRK 68 (331) T ss_pred CCccccCcccHHH------HHHhcCcc------hhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhc Confidence 1111112222221 11122222 234567999999999999999998643 224456778889999999 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceee---e Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i---~~~la~a~a~~~d~a~l~G~G~~qP~Gi---l 208 (381) .+...+ ++.+++.+++-..+-|.+.+.|.+.|.+... 
+..+|- ...+.++++..+...|++|+-+..|.++ - T Consensus 69 lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~ 146 (331) T protein:vir:10 69 LNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) T ss_pred cCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccch Confidence 877664 5688999999999999999999999988754 555553 4457889999999999999976566544 2 Q ss_pred eccccc------ccc-cccc--------------------cccc-------chhhhcccc-------------------- Q lcl|Aclame:pro 209 RQVQKG------VSV-TEGA--------------------YPEK-------EEQGTLTFA-------------------- 234 (381) Q Consensus 209 ~~~~~~------~~~-~~~~--------------------~~~~-------~~~~~~t~~-------------------- 234 (381) +..... ... .+|+ +|.- ...+..+.. T Consensus 147 kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl 226 (331) T protein:vir:10 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) T ss_pred hhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeee Confidence 211100 000 0000 0100 000000000 Q ss_pred ---------------------ChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC-------- Q lcl|Aclame:pro 235 ---------------------NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-------- 285 (381) Q Consensus 235 ---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~-------- 285 (381) ++.+..+ ++..+....+.-|...+++.+|.||+.-...++++...++ T Consensus 227 ~i~d~r~v~ri~NIdvs~l~~~~~~~~d----l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~ 302 (331) T protein:vir:10 227 TLRDWRYVVRIANVDVSELTKNASAGAD----LIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTME 302 (331) T ss_pred EEcCcccEEEEeccchhccCCCcchhhh----HHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeee Confidence 0111112 1222211122234445778899999977666665433221 Q ss_pred -CCCceeeccCCCceEEecCCCCCcc-EEE Q lcl|Aclame:pro 286 -ANGVYVTALPFNLNVIESTVQEAGK-VLT 313 (381) Q Consensus 286 
-~~G~~~~~l~~g~~vi~s~~~p~~~-i~~ 313 (381) ..|..++.. .|+||..++++-... .+. T Consensus 303 ~~~g~~~t~~-~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 303 EIAGKKVVAF-DGIPCRRTDALLLTEARVV 331 (331) T ss_pred ecCCcceeEE-CCeeEEEeeeeecCccccC Confidence 123333332 477877777654221 111 No 151 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=97.95 E-value=9.9e-07 Score=53.51 Aligned_cols=279 Identities=9% Similarity=-0.032 Sum_probs=133.8 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee---cCC-ceEEEEecCCcceEEec Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN---AGL-RLKFLKSETSGVAVWGK 134 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~---~~~-~~~ip~~~~~~~a~w~~ 134 (381) ...++.+|.. ++ +.+.--..||+.+..+|++.+.....+.++++-.+ .+| .++||+.. .+++.-.. T Consensus 1 ~~~~~~~~~~------~~---~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~ 70 (341) T protein:vir:94 1 MALGNTITGP------SI---NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKA 70 (341) T ss_pred Ccchhhhccc------cc---cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeec Confidence 0111111110 00 01122235899999999999988877777664322 234 47888764 34444444 Q ss_pred ccccccccccccccceecccee-eeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeecc--CCCcceeeeecc Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNK-LTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT--GKDQPIGLNRQV 211 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~k-l~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~--G~~qP~Gil~~~ 211 (381) .+..+..+ +.+-.++++...+ .+.-+.|+..-...+..|+.+.+.+..++++++..|+.++.-- ++.++.+-. 
T Consensus 71 ~~~~i~~~-~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~--- 146 (341) T protein:vir:94 71 TDVPVGVQ-PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNV--- 146 (341) T ss_pred CCCccccc-cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcc--- Confidence 44444322 3333455555523 3444666665555667899999999999999999998875311 111111100 Q ss_pred ccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc--C---- Q lcl|Aclame:pro 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL--N---- 285 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~--~---- 285 (381) .. .. ....+........+.+......+. ....| . .+.+++++|..+..+++..... + T Consensus 147 --------~~--~~--~~~~t~~~~~~~~~~i~~a~~~Ld--e~~VP--~-~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~ 209 (341) T protein:vir:94 147 --------FS--SS--NGAITGNGQAFSFAVFLAARRLLL--EADVP--E-EKIVLLISPGQESALFTIPQFISKDFINN 209 (341) T ss_pred --------cc--Cc--cccccCchhhhhHHHHHHHHHHHh--hcCCC--c-cCCEEEeCHHHHHHHhhchhhhhhhcccc Confidence 00 00 000000111111122222222221 11122 2 3456788999888776421111 1 Q ss_pred ---CCCceeeccCCCceEEecCCCCCccEEE---------------------------Eeccce--EEEeccee---eE- Q lcl|Aclame:pro 286 ---ANGVYVTALPFNLNVIESTVQEAGKVLT---------------------------YVKGLY--DGYLAGGI---NV- 329 (381) Q Consensus 286 ---~~G~~~~~l~~g~~vi~s~~~p~~~i~~---------------------------gd~s~y--~i~~r~~~---~i- 329 (381) .+|... -.+|++|+.|+.+|.+.... 
+|++.+ .++-+..+ .+ T Consensus 210 ~~l~~G~ig--~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~ 287 (341) T protein:vir:94 210 APIAQGQIG--SLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMC 287 (341) T ss_pred chhheeeee--eEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeee Confidence 122211 13699999999998643210 011111 11111110 00 Q ss_pred -------------eeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 330 -------------QKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 330 -------------~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) ...-++.-.+-...+++.+-++.++.+|++.+ .|+..+.|- T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v--~~~~~~~~~ 341 (341) T protein:vir:94 288 HMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAV--NIHTTGDTV 341 (341) T ss_pred cchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeE--EEecCcCCC Confidence 00000111122346778888899999999964 455544332 No 152 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=97.83 E-value=2.3e-06 Score=51.49 Aligned_cols=239 Identities=13% Similarity=0.095 Sum_probs=131.8 Q ss_pred HhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceEEec Q lcl|Aclame:pro 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~w~~ 134 (381) +.......+|-.| ..+..+ |......|+|.+.+.++|+..+.+.... ....-.+.++.|.+.|.. 
T Consensus 1 m~~~~~~a~TL~E------~Akr~~-------~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~ 67 (335) T protein:vir:73 1 MALIGQTLPSLLD------IYNRTD-------KNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRR 67 (335) T ss_pred CCcCCCCchhHHH------HHhhcC-------cchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhh Confidence 2111112222222 111122 3445667999999999999999887532 222345567788999998 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceee---e Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i---~~~la~a~a~~~d~a~l~G~G~~qP~Gi---l 208 (381) .+...+ ++.+++.+++-..+-|.+...|-+.|.+... |..+|- .....++++..+.+.|++|+-+..|.++ - T Consensus 68 lN~g~~-~s~~tt~qvt~~l~ilgg~~eVDr~La~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~ 145 (335) T protein:vir:73 68 YNQGVQ-PTKTQTVPVTDTTGMLYDLGFVDKALADRSN-NAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLA 145 (335) T ss_pred cCCccc-cccceEEEEEEEEEEecchhhhhHHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchh Confidence 877664 5679999999999999999999998877654 555444 4457899999999999999877666544 2 Q ss_pred ecccccc----------cccc--------------------ccccccch-------hhhccccC---------------- Q lcl|Aclame:pro 209 RQVQKGV----------SVTE--------------------GAYPEKEE-------QGTLTFAN---------------- 235 (381) Q Consensus 209 ~~~~~~~----------~~~~--------------------~~~~~~~~-------~~~~t~~~---------------- 235 (381) +...+.. ...+ +-+|.-.. 
.+..+..+ T Consensus 146 kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~ 225 (335) T protein:vir:73 146 PRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWD 225 (335) T ss_pred hhhcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeee Confidence 2211000 0001 11111000 00000000 Q ss_pred -------------------------hhHHHHHHHHH-HHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC---- Q lcl|Aclame:pro 236 -------------------------PRATVNELTQV-FKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN---- 285 (381) Q Consensus 236 -------------------------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~---- 285 (381) +.+..+ |.++ +..+. ....+..-.++.+|.||+.-...++.+...+. T Consensus 226 ~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~-l~~lmi~a~~--~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l 302 (335) T protein:vir:73 226 IGLSVRDWRSISRICNIDVTTLTKDASTGAD-LISMMVDAYY--ARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNL 302 (335) T ss_pred eeeEEeCcccEEEEeecccccccccccchhh-HHhhHHHHHH--HHhccCCCCCceEEEechHHHHHHHHHHhccCceee Confidence 001111 1111 11110 01123344566899999977666666543321 Q ss_pred ----CCCceeeccCCCceEEecCCCCCcc-EEEE Q lcl|Aclame:pro 286 ----ANGVYVTALPFNLNVIESTVQEAGK-VLTY 314 (381) Q Consensus 286 ----~~G~~~~~l~~g~~vi~s~~~p~~~-i~~g 314 (381) ..|..++-. .|+||..++++-... .+.. 
T Consensus 303 ~~~~~~g~~~t~~-~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 303 TIEEYGGKKIVSF-LGIPIRRVDAILNTESAVTA 335 (335) T ss_pred eeeccCCceeEEE-CCeEEEEEeeeecCcccccC Confidence 234444333 488888777664221 1111 No 153 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=97.82 E-value=1.4e-05 Score=47.26 Aligned_cols=270 Identities=7% Similarity=-0.035 Sum_probs=129.1 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhh------c---eeee--cCC-ceEEEEecCC-cceEEeccccccccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLAD------L---GIKN--AGL-RLKFLKSETS-GVAVWGKIYGEIKGQ 142 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~------~---~v~~--~~~-~~~ip~~~~~-~~a~w~~e~~~~~~~ 142 (381) |.. +.-.-..+|+.+..-+.+.+.+.+.+++- + ...+ .+| .+.+|..... +.+.-+.++.+++.+ T Consensus 1 MA~--T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MAY--TKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CCc--eeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 332 22244567888777666666666555332 1 1222 234 3678876543 455555555555432 Q ss_pred ccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccc Q lcl|Aclame:pro 143 LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 143 ~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~ 222 (381) ..+-++-.-..++.+.-..++++-..-+.-|....+.+.+++..++..+..+|.-- .|++.... ..+.. 
T Consensus 79 -~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l-----~g~~~~~~----~~~~~- 147 (324) T protein:vir:59 79 -KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAEL-----AGVFSNDD----MKDNK- 147 (324) T ss_pred -hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhhhccc----cccce- Confidence 33333333344444444556665444566678888999999998888876665310 11111000 00000 Q ss_pred cccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCCCCceeeccCCCce Q lcl|Aclame:pro 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTALPFNLN 299 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~~G~~~~~l~~g~~ 299 (381) .+... ...+..++.. +.+....+ |.. ...=.+|+||+.++..+++... .+.++|...-....|++ T Consensus 148 ~dvsa-~~~~~~s~~~----l~~A~~~~-----GD~--~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~ 215 (324) T protein:vir:59 148 LDISG-TADGIYSAET----FVDASYKL-----GDH--ESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFPTYMNKR 215 (324) T ss_pred eeeec-cccceecHHH----HHHHHHHh-----CCc--ccCcEEEEEchHHHHHHHHhhhhhhccccccCceeeeecccE Confidence 00000 0001112222 22211112 111 1123479999999999886421 12333332222336999 Q ss_pred EEecCCCCCcc----------EEEEeccceEEEe-cceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 300 VIESTVQEAGK----------VLTYVKGLYDGYL-AGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 300 vi~s~~~p~~~----------i~~gd~s~y~i~~-r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) |+.++.||... .+|+.-. ..+.. +.++.++..++ ...++..+....++ .+++..+....-.+. 
T Consensus 216 VivdD~~p~~~~~~~~~~y~s~l~~~GA-i~~~~~~~~v~vE~dRd--~~~g~~~l~~r~~~---~~~p~G~s~~~~~~~ 289 (324) T protein:vir:59 216 VIVDDSMPVETLEDGTKVFTSYLFGAGA-LGYAEGQPEVPTETARN--ALGSQDILINRKHF---VLHPRGVKFTENAMA 289 (324) T ss_pred EEEeCCCCccccCCCCceEEEEEEecCe-EEEeecCCCcceecccC--ccccceEEEEeeEE---EeEeeeEEecccccC Confidence 99999998421 1222111 11112 22333443333 34566677776664 356555554332222 Q ss_pred ccccCCCCCCCCC Q lcl|Aclame:pro 369 GHKPALEGTEETL 381 (381) Q Consensus 369 ~~~~~~~~~~~~~ 381 (381) ...++++- T Consensus 290 -----~~sPt~~~ 297 (324) T protein:vir:59 290 -----GTTPTDEE 297 (324) T ss_pred -----CCCCChhh Confidence 23444443 No 154 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=97.78 E-value=9.7e-07 Score=53.55 Aligned_cols=299 Identities=9% Similarity=0.006 Sum_probs=146.0 Q ss_pred hhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-c-eEEEEecCCcceEEeccc Q lcl|Aclame:pro 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-R-LKFLKSETSGVAVWGKIY 136 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~-~~ip~~~~~~~a~w~~e~ 136 (381) ....+.++. 
...+...+--.|.-+.+..++.......+.++++..+.++.+ + .++|+- +...+....-+ T Consensus 1 Ms~~n~~t~--------p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG 71 (400) T protein:vir:10 1 MSTPNNLTN--------VAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPG 71 (400) T ss_pred CCCCccccc--------cccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCC Confidence 000000000 000001111134567888999999999999999999988754 3 678876 44445555444 Q ss_pred ccccccccccccceeccceee-eeehhhhHHHHhcChhH-HHHHHHHHHHHHHHHHHhhheee----c-c-CCCcceeee Q lcl|Aclame:pro 137 GEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK----G-T-GKDQPIGLN 208 (381) Q Consensus 137 ~~~~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~-l~~~i~~~la~a~a~~~d~a~l~----G-~-G~~qP~Gil 208 (381) .++.. +.+..++..|....+ .+...|..=---++.+| +.+.+.+++++++++..|++++. + - -+..|.|.- T Consensus 72 ~~ldg-~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~ 150 (400) T protein:vir:10 72 QSPAA-TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNP 150 (400) T ss_pred CCcCC-CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC Confidence 44332 345556665554443 24444433222334566 78899999999999999987752 1 0 012232221 Q ss_pred eccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh--hhccC- Q lcl|Aclame:pro 209 RQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ--YTHLN- 285 (381) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~--~~~~~- 285 (381) .....+.+... .........++..+...+......+. .+..++ ...+.++.|.-|..++.. 
+..++ T Consensus 151 ~g~~~g~s~~v------~~~~~~~~~~~~~l~~A~~~A~~~Ld----EkdVP~-~d~vvl~pp~~Ys~Ll~~dkLvnrdf 219 (400) T protein:vir:10 151 RVKGHGFSVNV------EVNEGEALVNPQYVMAAVEFALEQQL----EQEVDI-SDVAILMPWRYFNVLRDADRIVDKSY 219 (400) T ss_pred Cccccccceee------cccccccccCHHHHHHHHHHHHHHHH----hcCCCc-cceEEEcCHHHHHHHHhCCcccchhc Confidence 11000000000 00000011233333322222222221 222222 244556655555455432 11111 Q ss_pred ---CCCceeec---cCCCceEEecCCCCCcc---------------E--EEEeccce--EEEecceeeE--------ee- Q lcl|Aclame:pro 286 ---ANGVYVTA---LPFNLNVIESTVQEAGK---------------V--LTYVKGLY--DGYLAGGINV--------QK- 331 (381) Q Consensus 286 ---~~G~~~~~---l~~g~~vi~s~~~p~~~---------------i--~~gd~s~y--~i~~r~~~~i--------~~- 331 (381) ++|.|++. ...|+||+.|+.+|... . .-|||+.- +++.++.+-+ +. T Consensus 220 ~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~ 299 (400) T protein:vir:10 220 TISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIF 299 (400) T ss_pred cccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccc Confidence 23555543 23699999999998521 1 22566552 2333332221 11 Q ss_pred ehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 332 FKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 332 ~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) .+...|.. 
.+.+++=++-.+.+++|.+|++.+=....+.--|+-+-- T Consensus 300 ~d~r~~~~---~id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~~~ 346 (400) T protein:vir:10 300 YEKKEKTY---YIDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAAQH 346 (400) T ss_pred cchhhHHH---HHHHHHHhCCcccchhheEEEEecCCcccccccCcchhH Confidence 12333433 334445567889999999999887755444444443332 No 155 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=97.68 E-value=3.7e-06 Score=50.39 Aligned_cols=258 Identities=13% Similarity=-0.019 Sum_probs=117.4 Q ss_pred hceeeecCCceEEEEecCCcceEEecccccccc-cccccccc--eeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHH Q lcl|Aclame:pro 109 DLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKG-QLDAAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEE 185 (381) Q Consensus 109 ~~~v~~~~~~~~ip~~~~~~~a~w~~e~~~~~~-~~~~~f~~--v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~ 185 (381) +++-+.-+...++|+-.. ..+....-+.++.. ..+++=.+ ++++..++..+..=..+=. ++..|+.+.+.++.+. T Consensus 1 ~vr~i~~g~s~~~~~iG~-~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~-qa~~Dlr~e~s~~~G~ 78 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGR-TKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDA-MNHYDVRSEYSTQMGE 78 (324) T ss_pred CeeeeecCceEEEeeeee-eEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHH-hcCccchhHHHHHHHH Confidence 332222233478887633 33443333333321 12223344 5555555554332222222 3667899999999999 Q ss_pred HHHHHHhhheee----ccCCCcceeeeeccccccccccccccccchhhhc-cccChhHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 186 AFAVALETAFLK----GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTL-TFANPRATVNELTQVFKYHSTNEKGKSVA 260 (381) Q Consensus 186 a~a~~~d~a~l~----G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~-t~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (381) ++++..|++++. +-....|.+--. ....++........+.. 
...++..+++.+.++...| +.+..+ T Consensus 79 aLA~~~Dq~i~~~~a~~~~~~a~~~~~~-----~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~L----de~~VP 149 (324) T protein:vir:99 79 ALAMAADVANYAEMAKLVNSRKETTNEN-----IEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAF----AKKYIP 149 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcccccccCC-----cccCCccceecccccccccccCHHHHHHHHHHHHHHH----hhcCCC Confidence 999999987741 110011110000 00000000000000000 0111223333333322222 222222 Q ss_pred ccCceEEEEchhhHHHHHhhhhc----cCCCCceeecc---CCCceEEecCCCCCccE---------------------- Q lcl|Aclame:pro 261 VKGNVTMVVNPSDAFEVQAQYTH----LNANGVYVTAL---PFNLNVIESTVQEAGKV---------------------- 311 (381) Q Consensus 261 ~~~~~~~imn~~~~~~~~~~~~~----~~~~G~~~~~l---~~g~~vi~s~~~p~~~i---------------------- 311 (381) - .+.+.+++|.-++.++..... ..+.|.+.+.. ..|++|+.|+++|...+ T Consensus 150 ~-~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~ 228 (324) T protein:vir:99 150 A-GDRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTT 228 (324) T ss_pred C-CCCEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccc Confidence 2 245678999888766432111 11233333321 25999999999995321 Q ss_pred ---EEEeccce--EE--------EecceeeEeee-hhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 312 ---LTYVKGLY--DG--------YLAGGINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 312 ---~~gd~s~y--~i--------~~r~~~~i~~~-~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) +.+||+.- ++ +.-.++..+.+ ++.+|. 
..+++++-++.++++|+++++++|.-...+.++.-- T Consensus 229 ~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~ 305 (324) T protein:vir:99 229 TGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDV 305 (324) T ss_pred ccccccccCceeEEEEehhheEEEeeecceecceechhhHH---HhhhhhhhhcCcccccceEEEEEEccCccccccchh Confidence 11222221 11 12222333332 233343 467788888999999999999888654432222211 Q ss_pred CCCC Q lcl|Aclame:pro 378 EETL 381 (381) Q Consensus 378 ~~~~ 381 (381) ..|+ T Consensus 306 ~~~~ 309 (324) T protein:vir:99 306 ITGV 309 (324) T ss_pred hhhh Confidence 2222 No 156 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=97.68 E-value=1.3e-05 Score=47.37 Aligned_cols=280 Identities=7% Similarity=-0.089 Sum_probs=121.3 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceee---------ecCC-ceEEEEecCC-cceEEecccc-cccccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---------NAGL-RLKFLKSETS-GVAVWGKIYG-EIKGQL 143 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~---------~~~~-~~~ip~~~~~-~~a~w~~e~~-~~~~~~ 143 (381) |...++.-.-..+|+.+..-+.+.+.+.+.+++-.-+. ..+| .+.+|.-... +.+.-..++. .++.. 
T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~- 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETG- 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchh- Confidence 44333334456778887766666665555554321111 1234 3788876533 4444344432 23322 Q ss_pred cccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccccccccccc Q lcl|Aclame:pro 144 DAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYP 223 (381) Q Consensus 144 ~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~ 223 (381) ..+-++-.-..++.+.-..++.+-.--+.-|....+.+++++...+..+..++.- -.|++..........-.... T Consensus 80 ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gvf~~~~~~~~~~~~~~~ 154 (330) T protein:vir:10 80 KITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIAT-----LNGIFATGTAGEKGALEETH 154 (330) T ss_pred hcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHH-----HHhhhhhhhcccchhhhhhh Confidence 2222233333334444444444444446667788899999888777666555421 11222211000000000000 Q ss_pred ccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCCCCceeeccCCCceE Q lcl|Aclame:pro 224 EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTALPFNLNV 300 (381) Q Consensus 224 ~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~~G~~~~~l~~g~~v 300 (381) .....+.....+.... .+....+. +.. ..-.+|+||+.++..+++... 
.+.++|...-....|++| T Consensus 155 ~~~~~~~~a~~s~~~l----~~A~~~~G---D~~----~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~V 223 (330) T protein:vir:10 155 VSDQSKASTGIDAGMV----LDAKQLLG---DSA----DQVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIPTYLGYRV 223 (330) T ss_pred eecccccccccCHHHH----HHHHHHhc---ccc----ccceEEEEcHHHHHHHHHhhhhhhhcccccCcccccccceEE Confidence 0000000111222222 12111111 111 123479999999999887432 122333211122248999 Q ss_pred EecCCCCCcc-----EEEEeccceEEEec---ceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEeccccc Q lcl|Aclame:pro 301 IESTVQEAGK-----VLTYVKGLYDGYLA---GGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 301 i~s~~~p~~~-----i~~gd~s~y~i~~r---~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~ 372 (381) +.++.||... .+|+. ....+.+. ..+.++..++ ...++..+....++ .+++..+.--.-. -+. T Consensus 224 ivdD~~p~~~~~yt~yl~~~-GAi~~~~~~~~~~v~~EtdRd--~~~g~~~l~~r~~~---~~hp~G~s~~~~~---~~~ 294 (330) T protein:vir:10 224 IIDDGIAPTGDIYTSYLFRT-GSIGLNTGNPSGLTTFETSRE--AAKGNDMIYTRRAL---VMHPYGVKWTGAE---VDA 294 (330) T ss_pred EEeCCCCCCCCceeEEEEec-CceeeecccCCccccccccCC--ccccceEEEEeeEE---Eeeeeeeeecccc---ccc Confidence 9999998432 12221 11122221 1123333333 23455566666553 3555554432111 111 Q ss_pred CCCCCCCCC Q lcl|Aclame:pro 373 ALEGTEETL 381 (381) Q Consensus 373 ~~~~~~~~~ 381 (381) ..+.++++- T Consensus 295 ~~~sPt~~~ 303 (330) T protein:vir:10 295 GNITPSNAD 303 (330) T ss_pred CcCCcChHH Confidence 223344443 No 157 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.59 E-value=8.2e-06 Score=48.48 Aligned_cols=265 Identities=11% Similarity=-0.028 Sum_probs=141.8 Q ss_pred ccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceE---EEEecCCcceEEecccccc Q lcl|Aclame:pro 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLK---FLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 65 
lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~---ip~~~~~~~a~w~~e~~~~ 139 (381) ++.++ . .....+=+..+--++.+++-..+.+..-++...+..|+. ..++ +|..+-.+.+.-++|++.+ T Consensus 1 M~~e~-----n--l~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~I 73 (303) T protein:vir:10 1 MSAEN-----N--LINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVI 73 (303) T ss_pred CCCCc-----C--CcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCccc Confidence 11111 0 011112122334566777777776666677777888864 2344 4444555777778898877 Q ss_pred cccccccc---cceeccceeeeeehhhhHHHHhcChh-HHHHHHHHHHHHHHHHHHhhheee----ccCCCcceeeeecc Q lcl|Aclame:pro 140 KGQLDAAF---SEETAIQNKLTAFVVLPKDLNDFGPA-WIERFVRVQIEEAFAVALETAFLK----GTGKDQPIGLNRQV 211 (381) Q Consensus 140 ~~~~~~~f---~~v~l~~~kl~~~~~iS~ell~ds~~-~l~~~i~~~la~a~a~~~d~a~l~----G~G~~qP~Gil~~~ 211 (381) + .+..+- ...++..+|++.-+ |.|-++.|.. +-...--+.|..+++..++..|+. |+|+. T Consensus 74 p-lskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~--------- 141 (303) T protein:vir:10 74 P-LTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENG--------- 141 (303) T ss_pred c-hhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccc--------- Confidence 6 333332 34677778888754 9999854433 355666778888888888877754 22210 Q ss_pred ccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCC---C Q lcl|Aclame:pro 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNAN---G 288 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~---G 288 (381) .. ...+..........++..+..+..... . 
..+.+..|||.|.+++|+.......+ | T Consensus 142 ------~~---------t~~t~~s~~glq~Al~~~~~kl~~~~e----d-~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG 201 (303) T protein:vir:10 142 ------KR---------TNKTKLSAENLQGALSKGRANLSVLLD----D-EITPIAFVNPNDTAEYLANGFINSTGAQFG 201 (303) T ss_pred ------cc---------ccceeecHHHHHHHHHhhhhhcccccc----c-cccEEEEEchHHHHHHhhcCCcchhhhhhh Confidence 00 000112222333333333333322211 1 23568899999999988644322111 2 Q ss_pred -ceeeccCCCceEEecCCCCCccEEEEe---ccceEEEecceeeEeeehhhhhhcCceEEEEEEEEc------------- Q lcl|Aclame:pro 289 -VYVTALPFNLNVIESTVQEAGKVLTYV---KGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAY------------- 351 (381) Q Consensus 289 -~~~~~l~~g~~vi~s~~~p~~~i~~gd---~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~d------------- 351 (381) +|+.. -.|..||.|..+|+|+++.-- ...|++..++.+ .....+..|+++++|..+.- T Consensus 202 ~n~L~n-fLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l----~~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~ 276 (303) T protein:vir:10 202 VNLLTP-YVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGEL----SRAFAFATDATGFVGVLHDIQPQRLTSDTIYAS 276 (303) T ss_pred hhhhhh-hhcceEEEeccCCCceEEEeeccceEEEEecCchhh----hhhhhhccccccceEEEeccccceeeehhHhHh Confidence 23322 136678999999999987543 333444444322 23445667788888876531 Q ss_pred C---EEecCcceEEEEEEecc--cccC Q lcl|Aclame:pro 352 G---KAKDNKVAAVWKLDLKG--HKPA 373 (381) Q Consensus 352 g---k~~~~~Af~v~~l~~~~--~~~~ 373 (381) | .|=..++.++.+++-.+ +.|+ T Consensus 277 ~~~lfpE~~dgiv~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 277 AISMFPENIDAVIKVTIKKDEAGELPS 303 (303) T ss_pred HHHhcccccceEEEEEEeccccCCCCC Confidence 1 12234556666664333 3344 No 158 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=97.53 E-value=5.2e-06 Score=49.57 Aligned_cols=280 Identities=12% Similarity=0.064 Sum_probs=133.9 Q ss_pred HhcccCCCCc--eEccHHHHHHHHHHHHhhhhhhhhceeeec-CC-ceEEEEecCCcceEEeccccccc-ccccccccce Q lcl|Aclame:pro 76 
INKNVNYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GL-RLKFLKSETSGVAVWGKIYGEIK-GQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg--~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~-~~-~~~ip~~~~~~~a~w~~e~~~~~-~~~~~~f~~v 150 (381) |..|..+..+ +.+|+.++.+|..-|.+.-....++++... .| ++.||.-........ ...+.+. .+.+.+=-.+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY-~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSR-PEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccc-cCCCCcccccCCCceEEE Confidence 6666554433 445999999999777765443444444332 23 577876544333222 1122211 1111121255 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee--ccCCCcceeeeeccccccccccccccccchh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK--GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~--G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) .++..|+.++. |+.+.. +...+|.+...++.+.+++...|..+.. =+|..+-.++ +..++..+....... T Consensus 80 ~IDq~KYfaf~-VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~-----~~p~vin~~~~~iv~- 151 (322) T protein:vir:31 80 ILRDEVYAGNA-ISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQ-----NDPNVINGVPHRFVG- 151 (322) T ss_pred EEehhhhhccc-cchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc-----CCcceecCCccceec- Confidence 66777777655 788665 5678999999999999999988876622 1121110000 000000000000001 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh-------hhc----cCCCC----ceeec Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ-------YTH----LNANG----VYVTA 293 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~-------~~~----~~~~G----~~~~~ 293 (381) +..++....+.+.++...+ +....+- .+.+.|++|.-+..+... .+. -+.+| ++.-. 
T Consensus 152 ---~gt~~~~ay~~lv~l~~kL----dkanVP~-~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg 223 (322) T protein:vir:31 152 ---TGTDQTMDVTDFSRVNYVM----TQSKMPM-GGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVR 223 (322) T ss_pred ---cCCCchhhHHHHHHHHHHh----ccccCCC-CCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHH Confidence 1112222223333332222 2222222 244567888754433211 010 11222 22112 Q ss_pred cCCCceEEecCCCCCcc--EEE---------EeccceE-EEecceeeE----eeehhhh----hhcCceEEEEEEEEcCE Q lcl|Aclame:pro 294 LPFNLNVIESTVQEAGK--VLT---------YVKGLYD-GYLAGGINV----QKFKETL----ALDDMDLYTAKQFAYGK 353 (381) Q Consensus 294 l~~g~~vi~s~~~p~~~--i~~---------gd~s~y~-i~~r~~~~i----~~~~~~~----~~~d~~~~~~~~r~dgk 353 (381) -..|..|+.|+.+++++ +.. |-++-+. +.+.+-..+ ++.+... -.+....+|+.+|++.. T Consensus 224 ~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g 303 (322) T protein:vir:31 224 SVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNG 303 (322) T ss_pred HHhceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecce Confidence 23588999999987543 222 2222221 111111000 0111100 11234579999999999 Q ss_pred EecCcceEEEEEEecccccCCC Q lcl|Aclame:pro 354 AKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 354 ~~~~~Af~v~~l~~~~~~~~~~ 375 (381) ++.+|..+++.-. 
..|.++ T Consensus 304 ~~r~e~l~~~~a~---~~~~~~ 322 (322) T protein:vir:31 304 LVRDENLVCVLAN---ADKVTF 322 (322) T ss_pred eecccceEEEEec---cccccC Confidence 9999998877333 333334 No 159 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.51 E-value=5.4e-05 Score=44.00 Aligned_cols=296 Identities=12% Similarity=0.047 Sum_probs=139.0 Q ss_pred HHHHHHHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEcc---HHHHHHHHHHHHhhhhhhhhceeee-cC-Cc Q lcl|Aclame:pro 44 TKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLP---EETIDRIFEDLTTNHPLLADLGIKN-AG-LR 118 (381) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP---~~~~~~Ii~~l~~~~~l~~~~~v~~-~~-~~ 118 (381) .+....++.+......+..+ ++ +......+.|++.. +.+..++++.....-..+.++.+.. .+ +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~---------~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~ 70 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQ-AG---------VKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTD 70 (319) T ss_pred CCCcchhHHhhHHHHHHHhh-cc---------chhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCce Confidence 01011111111111111100 00 01111122343333 2344456655544434444444432 22 12 Q ss_pred --eEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcC---hhHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 119 --LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALET 193 (381) Q Consensus 119 --~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~l~~~i~~~la~a~a~~~d~ 193 (381) ..+...+..+.+.|.+..+..-+..+..+.+.....+.++.-+.++..=|+.+ ..+++.--....++++++.+|+ T Consensus 71 ~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~ 150 (319) T protein:vir:10 71 KTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNR 150 (319) T ss_pred EEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhce Confidence 
33445556677888765443223345667777777888887777776555444 4667888888899999999999 Q ss_pred heeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|Aclame:pro 194 AFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 194 a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~ 273 (381) -+++|+...+-.||+++........ .++.... ...++...+++..++..+.....+.. ....++++|.. T Consensus 151 i~f~G~~~~g~~GLlN~p~~~~~~~----~~~~~~~---t~t~~~i~~di~~~~~~l~~~s~g~~----~p~~L~L~p~~ 219 (319) T protein:vir:10 151 LVFKGSAPHKIVSVFNHPNITKITS----GKWIDVS---TMKPETAEAELTQAIETIETITRGQH----RATNILIPPSM 219 (319) T ss_pred EEEeecccccceeEEeCCCceeeec----CCCCCcc---ccCHHHHHHHHHHHHHHHHHhcCcee----eceEEEecHHH Confidence 9999998888899998754322211 1111111 12344445555555544433222322 23468889987 Q ss_pred HHHHHhhhhccCCCCce----eeccCCCceEEecCCCCC----c--cEEEEeccc-eE-EEecceeeEeeehhhhhhcCc Q lcl|Aclame:pro 274 AFEVQAQYTHLNANGVY----VTALPFNLNVIESTVQEA----G--KVLTYVKGL-YD-GYLAGGINVQKFKETLALDDM 341 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~----~~~l~~g~~vi~s~~~p~----~--~i~~gd~s~-y~-i~~r~~~~i~~~~~~~~~~d~ 341 (381) +..+.. ..+..|.. +.....++.|.....+.. + .+++...+. |+ +...+.++.-.. +.+. -. T Consensus 220 ~~~L~~---~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~~~--l~ 293 (319) T protein:vir:10 220 RKVLAI---RMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QPKD--LH 293 (319) T ss_pred HHhhhc---ccCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeee-eecC--ce Confidence 655521 12223321 111223445554443331 1 133333322 32 223333332211 1111 12 Q ss_pred eEEEEEEEEcCEE-ecCcceEEEEEEeccc Q lcl|Aclame:pro 342 DLYTAKQFAYGKA-KDNKVAAVWKLDLKGH 370 (381) Q Consensus 342 ~~~~~~~r~dgk~-~~~~Af~v~~l~~~~~ 370 (381) ....+..|+.|.. ..|.|++.+ .+. 
T Consensus 294 ~~~~~~~r~~Gv~i~~P~ai~~~----dGI 319 (319) T protein:vir:10 294 FKVPCTSKCTGLTIYRPMTIVLI----TGV 319 (319) T ss_pred EEEeeeeeeEEEEEEccceeEee----ecC Confidence 2334466666544 455555543 222 No 160 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.50 E-value=1.6e-06 Score=52.33 Aligned_cols=267 Identities=11% Similarity=-0.031 Sum_probs=138.9 Q ss_pred hhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEE-EEecCCcceEEecccc Q lcl|Aclame:pro 61 SAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKF-LKSETSGVAVWGKIYG 137 (381) Q Consensus 61 ~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~i-p~~~~~~~a~w~~e~~ 137 (381) .....+-.|. +.....+=+...--++.+++-..+.+..-++...+..|++. .+++ |...-.+.+.-++|++ T Consensus 1 ~~~~~~~~e~------nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe 74 (296) T protein:vir:98 1 MVTSRTYPEE------NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) T ss_pred CCCccccCcC------CCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCc Confidence 1110000010 01111222223345667777777776666777778888753 3533 4456667777888987 Q ss_pred ccccccccccc---ceeccceeeeeehhhhHHHHhcCh-hHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccc Q lcl|Aclame:pro 138 EIKGQLDAAFS---EETAIQNKLTAFVVLPKDLNDFGP-AWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQK 213 (381) Q Consensus 138 ~~~~~~~~~f~---~v~l~~~kl~~~~~iS~ell~ds~-~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~ 213 (381) +++- +..+-. ..++..+|++.-+ |.|-++.|. -+-...--+.|..+++..++..|+.=- .. 
T Consensus 75 ~Ipl-skvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~L------------kt 139 (296) T protein:vir:98 75 VIPL-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTAL------------KT 139 (296) T ss_pred ccch-hhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHH------------hc Confidence 7753 333332 3566667777664 999986443 345566778888999998888876411 00 Q ss_pred ccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccC---CCCce Q lcl|Aclame:pro 214 GVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN---ANGVY 290 (381) Q Consensus 214 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~---~~G~~ 290 (381) +... +......+...+++.+..+...-.... ....+..|||.|.+..++...... -.++| T Consensus 140 aT~t--------------~~~t~~~lQ~Ala~~~~~l~~~feded---~~~~V~FVnP~D~a~ylg~a~it~qt~fG~ty 202 (296) T protein:vir:98 140 GTGT--------------QDALGAGLQGALASAWGKLQVLFEDYG---SERAIVFANSLDVAEYIAKAGITTQTAFGLTY 202 (296) T ss_pred ccce--------------eeechhhHHHHHHHHhhhhhhhccccC---CCceEEEEehHHHHHHhcCCccchhheechhh Confidence 0000 000111222223322222222111110 134678899999999886433211 12455 Q ss_pred eeccCCCceEEecCCCCCccEEEEec---cceEEEecceeeEeeehhhhhhcCceEEEEEEEEc-------------C-- Q lcl|Aclame:pro 291 VTALPFNLNVIESTVQEAGKVLTYVK---GLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAY-------------G-- 352 (381) Q Consensus 291 ~~~l~~g~~vi~s~~~p~~~i~~gd~---s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~d-------------g-- 352 (381) +... .|..||.|..+|+|+++.--. 
..|++-.+.| +......+..|++++++..+.- | T Consensus 203 l~nf-LG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~---~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~ 278 (296) T protein:vir:98 203 LVDF-TGTVIISTNDVTKGEIWATVPENIIFAYINPNNS---ELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGML 278 (296) T ss_pred hhhc-cccEEEEcCcCCCceEEEeeecceEEEeeccccc---chhhhhccccccccceEEEeccccceeeehhHhHhHHH Confidence 5422 367899999999999876533 3344433323 3333444556788888776531 0 Q ss_pred -EEecCcceEEEEEEecccccCC Q lcl|Aclame:pro 353 -KAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 353 -k~~~~~Af~v~~l~~~~~~~~~ 374 (381) .|=..++.++.++ +|++ T Consensus 279 lfpE~~dgiv~~tI-----~~~~ 296 (296) T protein:vir:98 279 MYPERIDGIVKVTL-----TPGV 296 (296) T ss_pred hcccccceEEEEEe-----cCCC Confidence 1112233333333 3444 No 161 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=97.38 E-value=5.5e-05 Score=43.92 Aligned_cols=277 Identities=8% Similarity=-0.061 Sum_probs=121.6 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhce------e---eecCC-ceEEEEecCC-cceEEeccccccccccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG------I---KNAGL-RLKFLKSETS-GVAVWGKIYGEIKGQLD 144 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~------v---~~~~~-~~~ip~~~~~-~~a~w~~e~~~~~~~~~ 144 (381) |.. +.-.-..+|+.+..-+.+...+.+.+++-.- + ...+| .+.+|.-... +.+.-+.++.+++.+. 
T Consensus 1 MA~--T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~k- 77 (351) T protein:vir:15 1 MAE--THLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNN- 77 (351) T ss_pred CCc--eeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhe- Confidence 332 2224456788776666565555555543211 1 11234 3788976543 4555555555554332 Q ss_pred ccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccc Q lcl|Aclame:pro 145 AAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPE 224 (381) Q Consensus 145 ~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~ 224 (381) .+-++-.-..+..+.-..++.+-..-+.-|....+.++++...++..+..+|.- -.|++.... ...+..... T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gv~~~~~---~~~~~~~d~ 149 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSV-----LKGVMGVTK---IANSKVYDQ 149 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhchh---hcccceecc Confidence 222222233344444455565544445667888899999988888877766531 011111000 000000000 Q ss_pred cchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCCCCceeeccCCCceEE Q lcl|Aclame:pro 225 KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTALPFNLNVI 301 (381) Q Consensus 225 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~~G~~~~~l~~g~~vi 301 (381) ....+.....+... +.+....+. + ... ..=.+|+||+..+..+++... 
.+.++|...-..-.|++|+ T Consensus 150 t~~~~~~~~is~~~----l~~A~~~~G---D--~~~-~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~t~~G~~Vi 219 (351) T protein:vir:15 150 TKVSPSEPMFGAKG----FTGAIGLMG---D--LQD-TAFGAIAVNSATYSLMKVQGLIETIQPQNGATPFEAYNGLRIV 219 (351) T ss_pred ccccccccccCHHH----HHHHHHHhc---c--ccc-cceEEEEEChHHHHHHHhhhhhhhccccccCcccceecceEEE Confidence 00001111122222 222221111 1 110 112579999999998876432 2223332111112489999 Q ss_pred ecCCCCCcc----------EEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccc Q lcl|Aclame:pro 302 ESTVQEAGK----------VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 302 ~s~~~p~~~----------i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~ 371 (381) .++.||... .+||.-. ..+..+ +..++..++.....++.......| ..+++..+..-.-..+ T Consensus 220 vdD~~p~~~~~~~~~~ytsyl~~~GA-i~~~~~-~~~ve~~rd~~~~~g~d~l~~r~~---~~~hp~G~s~~~~~~~--- 291 (351) T protein:vir:15 220 LDDDIEIDLTDKTKPVSTSYIFAPGA-VRYSTN-MRSTETKYDPLINGGQDVIVQKRV---GTIHVAGTSIKASFSP--- 291 (351) T ss_pred EcCCCccccCCCCCceeEEEEEecce-eeeecC-CcCcceeecccCCCCceEEEEeee---eeeeeeeeeecccccc--- Confidence 999998421 1222111 111122 222333333333344444444433 3366655553211111 Q ss_pred cCCCCCCCCC Q lcl|Aclame:pro 372 PALEGTEETL 381 (381) Q Consensus 372 ~~~~~~~~~~ 381 (381) .....++++- T Consensus 292 ~~~~sPt~~~ 301 (351) T protein:vir:15 292 SKASFPTIDE 301 (351) T ss_pred cCcCCcChHH Confidence 1122344442 No 162 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=97.27 E-value=0.00011 Score=42.31 Aligned_cols=273 Identities=13% Similarity=0.075 Sum_probs=137.4 Q ss_pred Hhcc-cCCCCceEcc--HHHHHHHHHHHHhhhhhhhhceeee-cCC---ceEEEEecCCcceEEeccccccccccccccc Q lcl|Aclame:pro 76 INKN-VNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AGL---RLKFLKSETSGVAVWGKIYGEIKGQLDAAFS 148 (381) Q Consensus 76 
~~~~-~~~~gg~lvP--~~~~~~Ii~~l~~~~~l~~~~~v~~-~~~---~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~ 148 (381) |.-. .++.|-|++. +.+...|++.....-..+.++.+.. .+. ...+...+..+.+.|.+..+..-+..+..+. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 1110 1222333332 2345556554444333333333332 221 1344455566778887654432233456677 Q ss_pred ceeccceeeeeehhhhHHHHhcC---hhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeecccccccccccccccc Q lcl|Aclame:pro 149 EETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEK 225 (381) Q Consensus 149 ~v~l~~~kl~~~~~iS~ell~ds---~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~ 225 (381) +.....+.++.-+.++.+=|+.+ ..+++.--....++++++.+|+.+++|+....-.|+|++.........+ ++ T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~---~W 157 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGG---SW 157 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccC---Cc Confidence 78888888888788776666544 5678888888899999999999999999877889999875432222211 11 Q ss_pred chhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCc----eeeccCCCceEE Q lcl|Aclame:pro 226 EEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGV----YVTALPFNLNVI 301 (381) Q Consensus 226 ~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~----~~~~l~~g~~vi 301 (381) .++....+++..++..+.....+. -+...++++|..+..+... .+..|. ++.....++.|. 
T Consensus 158 --------~~~t~i~~Di~~~~~~l~~~s~g~----~~p~~l~L~p~~~~~L~~~---~~~~~~t~l~~ik~~~~~l~i~ 222 (296) T protein:vir:10 158 --------SQPTTAVSDITSLLDIIETSTNGQ----HRATHLLLPTTARRIMQNL---VPGTSVSYGEFFRQNNSGVTVE 222 (296) T ss_pred --------cCHHHHHHHHHHHHHHHHHhhCce----ecceeEEeCHHHHHHHhhc---cCCCCccHHHHHHHhcCCceEE Confidence 112233344444443332222222 2234678888766544211 122231 111112244444 Q ss_pred ecCCCCC------ccEEEEeccc-eE-EEecceeeEeeehhhhhhcCceEEEEEEEEcC-EEecCcceEEEEEEeccccc Q lcl|Aclame:pro 302 ESTVQEA------GKVLTYVKGL-YD-GYLAGGINVQKFKETLALDDMDLYTAKQFAYG-KAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 302 ~s~~~p~------~~i~~gd~s~-y~-i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dg-k~~~~~Af~v~~l~~~~~~~ 372 (381) ....+.. +.+++.+.+. |+ +...+.++.-. .....-...++...|+.| .+..|.|+++++ +.|= T Consensus 223 ~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~---~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~d----GI~~ 295 (296) T protein:vir:10 223 FVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALP---AQPKDLHFKIPVTSKATGLIVYRPLTMAVMK----GITF 295 (296) T ss_pred EeeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeec---ccccCceEEEeeEeeEEEEEEECCceeEEEe----eeec Confidence 3333321 1234444333 32 33333333321 112223456677888864 566677777642 2222 Q ss_pred C Q lcl|Aclame:pro 373 A 373 (381) Q Consensus 373 ~ 373 (381) + T Consensus 296 ~ 296 (296) T protein:vir:10 296 A 296 (296) T ss_pred C Confidence 2 No 163 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.02 E-value=0.00021 Score=40.76 Aligned_cols=277 Identities=8% Similarity=-0.040 Sum_probs=140.6 Q ss_pred cccCCCCceEcc--HHHHHHHHHHHHhhhhhhhhceeee-cC-Cc--eEEEEecCCcceEEeccccccccccccccccee Q lcl|Aclame:pro 78 KNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AG-LR--LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEET 151 (381) Q Consensus 78 ~~~~~~gg~lvP--~~~~~~Ii~~l~~~~~l~~~~~v~~-~~-~~--~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~ 151 (381) -.+++.|.|++- +.+...+++.+.+.--.|.++.+.. .+ +. 
..+...+..+.+.|....+..-+..+..+++.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 233445554432 2445566666655544555554432 22 11 344555666778887654432234466677778 Q ss_pred ccceeeeeehhhhHHHHhcC---hhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchh Q lcl|Aclame:pro 152 AIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 152 l~~~kl~~~~~iS~ell~ds---~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ...+.++.-+.++..=|+.+ ..+++.--....++++++.+|+.+++|+....-.||+++.......+... ..... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~--~~~~~ 158 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTT--GVGNV 158 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCc--ccccc Confidence 88888887777776655544 56788888889999999999999999998878899998754222111110 00001 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCceee----ccCCCceEEecC Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT----ALPFNLNVIEST 304 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~~~~----~l~~g~~vi~s~ 304 (381) ......+++...+++..++..+.....+.. ....++|+|..+..+..-. ..+..|.-+. 
.-..+..|+..+ T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g~~----~p~~L~L~p~~~~~L~~~~-~~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGYGT----ASLKLCLPPKQFELINKKR-YSNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCcee----cccEEEecHHHHHhhhhcc-ccCCCCeeHHHHHHHHcCcceEEEcc Confidence 111223445555555555544432222221 2346889998766553211 1122232111 111233444443 Q ss_pred CCCC----c--cEEEEec-cce-EEEecceeeEeeehhhhhhcCceEE--EEEEEEcC-EEecCcceEEEEEEeccc Q lcl|Aclame:pro 305 VQEA----G--KVLTYVK-GLY-DGYLAGGINVQKFKETLALDDMDLY--TAKQFAYG-KAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 305 ~~p~----~--~i~~gd~-s~y-~i~~r~~~~i~~~~~~~~~~d~~~~--~~~~r~dg-k~~~~~Af~v~~l~~~~~ 370 (381) .+.. + .+++..- .++ .+...+.++.. ........| ....|+.| .+..|.|++.++ +. T Consensus 234 ~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~-----~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~----GI 301 (301) T protein:vir:80 234 DLAGMGTAGSDSFAVIHDSNETAELIIPMDITRH-----PEEYSFPRTKVPFEERTAGVVVRFPAAIVRVD----GI 301 (301) T ss_pred eeccCCCCcccEEEEEecCCcEEEEEecCceeee-----cceecCceeEeeeeeeeEEEEEEccceEEEEe----cC Confidence 3321 1 1222221 122 23333333221 111111223 34667765 556677766542 22 No 164 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=96.80 E-value=0.00032 Score=39.76 Aligned_cols=278 Identities=10% Similarity=-0.037 Sum_probs=123.4 Q ss_pred HHHHHHhcccCCCCceEcc----HHHHHHHHHHHHhh-hhhhhhceeeecCCc---eEEEEecCCcceEEecccccc--- Q lcl|Aclame:pro 71 SFFMDINKNVNYKEEKLLP----EETIDRIFEDLTTN-HPLLADLGIKNAGLR---LKFLKSETSGVAVWGKIYGEI--- 139 (381) Q Consensus 71 ~~~~~~~~~~~~~gg~lvP----~~~~~~Ii~~l~~~-~~l~~~~~v~~~~~~---~~ip~~~~~~~a~w~~e~~~~--- 139 (381) -.+++...+.+.= ...|| +++.+++.-.+.+. +.|++-++..+-.+. ...+.....+... ..+.... 
T Consensus 1 ~~~~~~~~~~~~M-s~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLI-AGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVK-RKRSRQQSAD 78 (322) T ss_pred Ccccceeeeeeee-echhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccc-cccccccccC Confidence 0011111110000 00123 45555555444444 566666654443222 1222221111110 0000000 Q ss_pred ----cccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeecc-CCCcceeeeeccccc Q lcl|Aclame:pro 140 ----KGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT-GKDQPIGLNRQVQKG 214 (381) Q Consensus 140 ----~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~-G~~qP~Gil~~~~~~ 214 (381) .+...-.++........++....|.+.-+-....|..+...+..+.+++++.|+.++.|- |... +|- .+ T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~----~g- 152 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKG----TG- 152 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccc----cc- Confidence 011112244444444445555677776666677889999999999999999999887632 2110 000 00 Q ss_pred cccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc--cC------- Q lcl|Aclame:pro 215 VSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LN------- 285 (381) Q Consensus 215 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~--~~------- 285 (381) ..+. .++...+. ......+...+..+...+... ..+-.++-+.+++|..+..++.-... 
.+ T Consensus 153 t~v~---~~ss~~i~---~g~~g~t~~kl~~a~~~l~~~----dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l 222 (322) T protein:vir:10 153 QPVE---FLATQEIG---DGTKPISFDYVTEITERFLEN----EIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDL 222 (322) T ss_pred cccc---cCCCcccc---cCccchhHHHHHHHHHHHHhc----CCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhh Confidence 0000 00000000 000011112222222222111 11222344678899887777542211 11 Q ss_pred -CCCceeeccCCCceEEecCCCCCcc------------------EEEEeccceEEEecceeeEeeehhhhhhcCceEEEE Q lcl|Aclame:pro 286 -ANGVYVTALPFNLNVIESTVQEAGK------------------VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTA 346 (381) Q Consensus 286 -~~G~~~~~l~~g~~vi~s~~~p~~~------------------i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~ 346 (381) .+|..-. ..|+.++.++.+|... .+++.-+....+....+..+.+... -.....-+++ T Consensus 223 ~~~G~ig~--~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~-~~~~a~~I~~ 299 (322) T protein:vir:10 223 QSKGIITN--WMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDP-SASFAWRIYS 299 (322) T ss_pred hhcCeeee--eeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccC-Ccchhhhhhh Confidence 1233222 2588899999888321 2223333333444444444432211 1122334566 Q ss_pred EEEEcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 347 KQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 347 ~~r~dgk~~~~~Af~v~~l~~~~ 369 (381) .+-++++.++++.++.++.+-+= T Consensus 300 ~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 300 AFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hhhhCceEeccCcEEEEEEeccC Confidence 67789999999998887775433 No 165 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=95.36 E-value=0.0022 Score=35.13 Aligned_cols=268 Identities=11% Similarity=0.016 Sum_probs=111.0 Q ss_pred CCCceEccHHHHHHHHHHHHhhhhhhhhceee---ec---CC-ceEEEEecCCcceEEec-----ccccccccccccccc Q lcl|Aclame:pro 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---NA---GL-RLKFLKSETSGVAVWGK-----IYGEIKGQLDAAFSE 149 (381) Q 
Consensus 82 ~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~---~~---~~-~~~ip~~~~~~~a~w~~-----e~~~~~~~~~~~f~~ 149 (381) -..-+++|+.+..++++.|++...+.+++..- .. .| .++||+..... +.+.. .++++.. .+.+-+. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~ 78 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSR-GHTRKLRGAGAERNLTV-SDFTEDS 78 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccccc-ceeeeccccccCCcccc-cccccce Confidence 22345889999999999999988777776432 11 23 47888754432 32221 1111111 1222233 Q ss_pred --eeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccch Q lcl|Aclame:pro 150 --ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) Q Consensus 150 --v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~ 227 (381) +.+..+++.++ .|+.+-...+..++..-+.+...++++..+|..++. .=.+.|.+.. .. T Consensus 79 ~~~~id~~k~~~~-~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~-~~~~a~~~~~---------~~-------- 139 (392) T protein:vir:99 79 FPVTLTDVAYHLG-VLTDEELTFDLESFATQILPRQVRGVADILEEGVRD-MIVGAPYEAA---------GA-------- 139 (392) T ss_pred EEEEEeeeeecce-eechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHH-HHhccccccc---------cc-------- Confidence 44444554444 444444444566777777788899999999987652 1111111000 00 Q ss_pred hhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc-----CC-------CCceeeccC Q lcl|Aclame:pro 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-----NA-------NGVYVTALP 295 (381) Q Consensus 228 ~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~-----~~-------~G~~~~~l~ 295 (381) .+..++....+.+.++...|.. ...| .+.+++++|..+..+++..... +. +|... .. 
T Consensus 140 ---~~~~~~~~~~~~i~~a~~~L~~--~~vP----~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg--~i 208 (392) T protein:vir:99 140 ---VHEVAPDEFFKGVNGARRALNE--LYIP----QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLG--RI 208 (392) T ss_pred ---ccccChhhhHHHHHHHHHHHhh--cCCC----CCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceee--ee Confidence 0001111222223333332211 1122 2346788998777765421111 01 22221 22 Q ss_pred CCceEEecCCCCCccEEEEeccceEEEecceeeE-------------------eeehhhhhhcCceE---EEEEEEEc-- Q lcl|Aclame:pro 296 FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINV-------------------QKFKETLALDDMDL---YTAKQFAY-- 351 (381) Q Consensus 296 ~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i-------------------~~~~~~~~~~d~~~---~~~~~r~d-- 351 (381) +|.+|+.+..+|.+..+.+..+.+....+..... .++.......+... +.+...++ T Consensus 209 ~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~ 288 (392) T protein:vir:99 209 YGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDP 288 (392) T ss_pred eeeEEEeecccccccceeeeccccccccccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeec Confidence 5789999999988765554433332222211110 00001111111111 11111110 Q ss_pred -C-EEecCcceEEE--EEEecccccC------CCCCCCCC Q lcl|Aclame:pro 352 -G-KAKDNKVAAVW--KLDLKGHKPA------LEGTEETL 381 (381) Q Consensus 352 -g-k~~~~~Af~v~--~l~~~~~~~~------~~~~~~~~ 381 (381) + .........+. 
.+.+...+.+ ..+...|+ T Consensus 289 ~~~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~~~~~~ 328 (392) T protein:vir:99 289 NGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTV 328 (392) T ss_pred cccceeeeeeeeeecceeeeeeeecccceeEeeeccceeE Confidence 0 00000000000 0000000000 00111111 No 166 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=95.08 E-value=0.0028 Score=34.56 Aligned_cols=300 Identities=14% Similarity=0.050 Sum_probs=144.6 Q ss_pred ccHHHHHHHHHH----hc--cc---CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEe Q lcl|Aclame:pro 65 LSANQRSFFMDI----NK--NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~----~~--~~---~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~ 133 (381) ++.+-|..|+++ .+ +. +....|.|-+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 333333333321 11 11 2245788989999999999999999999999988752 2344444333333322 Q ss_pred cc--cccccccccccccceeccceeeeeehhhhHHHHhcCh--hHHHHHHHHHHHHHHHHHHhhheeeccCC----C--- Q lcl|Aclame:pro 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGK----D--- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~l~~~i~~~la~a~a~~~d~a~l~G~G~----~--- 202 (381) .- ..+.........+.-.+..++.-.-..|+.+.|+.-+ .+|..-+++.+.++++.=.-.--++|+-. 
+ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 11 1122222222344445555555555678888887543 46888888888888886555555677641 1 Q ss_pred cc------eeeeecccccc---ccccccc------cccchhhh-ccccChhHHHHHHHHHHHHhhhccccccccc--cCc Q lcl|Aclame:pro 203 QP------IGLNRQVQKGV---SVTEGAY------PEKEEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAV--KGN 264 (381) Q Consensus 203 qP------~Gil~~~~~~~---~~~~~~~------~~~~~~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 264 (381) .| +|+|..+-... .-+.++. .+....+. ..+.+-+.++..+.+ .+ -+..+ .+. T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~---~l------I~~~~~~d~d 231 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTN---TL------IDEIYQDDPK 231 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCC Confidence 23 35553211100 0011110 01111111 112222222222211 00 01122 235 Q ss_pred eEEEEchhhHH-HHHhhhhccCCC-----Cceee--ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeeh Q lcl|Aclame:pro 265 VTMVVNPSDAF-EVQAQYTHLNAN-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFK 333 (381) Q Consensus 265 ~~~imn~~~~~-~~~~~~~~~~~~-----G~~~~--~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~ 333 (381) .|.||.+.-.. +..++....+.+ ++.+. 
...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+ T Consensus 232 LVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p 311 (355) T protein:vir:18 232 LVAIVGRKLLADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENP 311 (355) T ss_pred EEEEEchhhhHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 78888876443 222222111110 11111 112389999999999999887666553332 112111 21111 Q ss_pred hh----hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 334 ET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 334 ~~----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) ++ .+..-..+|..-.+.-+..++ . +++.-...+++++++. T Consensus 312 ~r~rie~y~s~Ne~YvVEd~~~~a~ie--n---i~~~~~~~~~~~~~g~ 355 (355) T protein:vir:18 312 KKDRVENYESMNIDYVVEAYAAGCLLE--N---ITLGDFTAPAAPEGGE 355 (355) T ss_pred ccccccchhhhcceeeeeccccEEEEe--e---eeecCCCCcccccCCC Confidence 11 112223455555444444444 1 3334445667888888 No 167 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=94.74 E-value=0.0036 Score=33.98 Aligned_cols=305 Identities=12% Similarity=0.040 Sum_probs=137.0 Q ss_pred HHHHHH-HHH-HHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEcc--HHHHHHHHHHHHhhhhhhhhceeee- Q lcl|Aclame:pro 40 LFEETK-LQA-KAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN- 114 (381) Q Consensus 40 ~~~~~~-~~~-~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP--~~~~~~Ii~~l~~~~~l~~~~~v~~- 114 (381) +..... +.. ..+++.+..+.+.... 
-+....+..|.|++- +.+...|++.....-..+.++.+.+ T Consensus 1 ~~~~~~~~~~~~d~~~~~~~a~~~~~~----------~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~ 70 (329) T protein:vir:79 1 MRGNIMSKEMKYDEFEANVIANHMQLR----------GAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSE 70 (329) T ss_pred CccchhhhhhccchhhhhhHhhhcccc----------cceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccC Confidence 000000 000 0011111111100000 000111112334432 2345566665544434444444432 Q ss_pred cC-Cc--eEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcC---hhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 AG-LR--LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFA 188 (381) Q Consensus 115 ~~-~~--~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~l~~~i~~~la~a~a 188 (381) .+ +. ..+...+..+.+.|.+..+..-+..+..+.+.....+.++.-+.++..=|+-+ ..++..--....+++++ T Consensus 71 ~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~ 150 (329) T protein:vir:79 71 LSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHD 150 (329) T ss_pred CCCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHH Confidence 22 11 34455566677888765432222345556666666777777666765544433 46688888888999999 Q ss_pred HHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEE Q lcl|Aclame:pro 189 VALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMV 268 (381) Q Consensus 189 ~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 268 (381) +.+|+-+++|++..+-.|+|++......+++.. ........++....+++..++..+.....+.. 
....++ T Consensus 151 ~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~-----~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~----~p~~L~ 221 (329) T protein:vir:79 151 QLVNHLVFKGSKPHKIISVFEHPNLTTINSAGW-----NNAAGTGKKPETAQDELEQAIEKIETLTNGQH----RANMIL 221 (329) T ss_pred HhhccEEEeecccccceeeecCCCccccccCCC-----CCccccccCHHHHHHHHHHHHHHHHHhcCcee----cccEEE Confidence 999999999998888899998755332221110 01112234455555555555544433222221 234578 Q ss_pred EchhhHHHHHhhhhccCCCCc----eeeccCCCceEEecCCCC-C-----ccEEEEeccc-eE-EEecceeeEeeehhhh Q lcl|Aclame:pro 269 VNPSDAFEVQAQYTHLNANGV----YVTALPFNLNVIESTVQE-A-----GKVLTYVKGL-YD-GYLAGGINVQKFKETL 336 (381) Q Consensus 269 mn~~~~~~~~~~~~~~~~~G~----~~~~l~~g~~vi~s~~~p-~-----~~i~~gd~s~-y~-i~~r~~~~i~~~~~~~ 336 (381) |+|..+..+.- ..+..|. ++.....++.|.....+. + +.+++.+.+. ++ +...+.++... -+.+ T Consensus 222 Lpp~~~~~L~~---~~~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-~q~~ 297 (329) T protein:vir:79 222 IPPSMRKVLMV---RMPETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLT-AQPK 297 (329) T ss_pred ecHHHHHHhhc---ccCCCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCcceeeee-ceec Confidence 88875543321 1222332 111111233444333221 1 1234433333 22 22223333221 1111 Q ss_pred hhcCceEEEEEEEEcCE-EecCcceEEEEEEecc Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGK-AKDNKVAAVWKLDLKG 369 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk-~~~~~Af~v~~l~~~~ 369 (381) .. ........|+.|. 
+..|.|++.++==+.+ T Consensus 298 ~~--~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 298 DL--HFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred Cc--eEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 11 1223345555543 4456666655433433 No 168 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=94.58 E-value=0.004 Score=33.70 Aligned_cols=281 Identities=11% Similarity=0.017 Sum_probs=125.3 Q ss_pred cccCCC-----CceEccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceEEEEecCCcce-EEecccccccc-cccccccc Q lcl|Aclame:pro 78 KNVNYK-----EEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVA-VWGKIYGEIKG-QLDAAFSE 149 (381) Q Consensus 78 ~~~~~~-----gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~-~~~ip~~~~~~~a-~w~~e~~~~~~-~~~~~f~~ 149 (381) ...++. .....=+++.+.|..--....|+.+++.-.+... ..+|...+-...+ .-..|+++.+. ...+. . T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r--~ 78 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFT--T 78 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCC--E Confidence 111111 1112346677777766666678888765444333 3566643322211 11123332211 11111 1 Q ss_pred eecc-ceeeeeehhhhHHHHhcChhHHHHHHHH---HHHHHHHHHHhhheeecc-----CCC----cceeeeeccccccc Q lcl|Aclame:pro 150 ETAI-QNKLTAFVVLPKDLNDFGPAWIERFVRV---QIEEAFAVALETAFLKGT-----GKD----QPIGLNRQVQKGVS 216 (381) Q Consensus 150 v~l~-~~kl~~~~~iS~ell~ds~~~l~~~i~~---~la~a~a~~~d~a~l~G~-----G~~----qP~Gil~~~~~~~~ 216 (381) ..-+ ..-+...+.||..+..-+.......+.. .-...+.+-+|.+||+|. |+. +-.||+..+..... 
T Consensus 79 ~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~ 158 (317) T protein:vir:88 79 MLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGS 158 (317) T ss_pred EeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCce Confidence 1001 1122344555555544433333333333 334456677899999985 222 33577654422111 Q ss_pred cc-cccccccchhhhccccChh-HHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc------cCC-C Q lcl|Aclame:pro 217 VT-EGAYPEKEEQGTLTFANPR-ATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH------LNA-N 287 (381) Q Consensus 217 ~~-~~~~~~~~~~~~~t~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~------~~~-~ 287 (381) .. +|..+........+..... -..+.+.++++.+-. .|. ..+. ++||+...-.+-++... .+. . T Consensus 159 ~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~--~Gg----~~~~-i~v~a~~k~~i~~~~~~~~~~i~~~~~~ 231 (317) T protein:vir:88 159 LGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWR--NGG----QANS-IQTSSSIKKAISKNMKGRATEITLDASD 231 (317) T ss_pred eccCccccccCCCccccccccccccHHHHHHHHHHHHh--cCC----CCCE-EEeChHHHHHHHHHhcCCceeEEEcccC Confidence 10 0000000000000111100 122223333333211 111 1123 46788654433333110 001 1 Q ss_pred Ccee-----eccCCC-ceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceE Q lcl|Aclame:pro 288 GVYV-----TALPFN-LNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 288 G~~~-----~~l~~g-~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~ 361 (381) +... ...+|| ++++.+.+||++++++.|+++.-+..=.++..+.+-.. 
.|..-+.....+.=++.+++|.+ T Consensus 232 ~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laKt---Gd~~k~~i~~E~tLe~~N~~a~a 308 (317) T protein:vir:88 232 NRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAKT---GDSEKRQLLVEYTFRVNNEKSGA 308 (317) T ss_pred eEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceeeccCCC---cccceeEEEEEEEEEEcCcccee Confidence 1111 112455 46789999999999999999866533234444332222 24445666666778889999999 Q ss_pred EEEEEeccc Q lcl|Aclame:pro 362 VWKLDLKGH 370 (381) Q Consensus 362 v~~l~~~~~ 370 (381) ++..-.+.. T Consensus 309 ~i~~l~~~~ 317 (317) T protein:vir:88 309 LIRDVVAQL 317 (317) T ss_pred EEEEecccC Confidence 875333322 No 169 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=94.32 E-value=0.0047 Score=33.32 Aligned_cols=291 Identities=13% Similarity=0.096 Sum_probs=133.7 Q ss_pred HHHHHHHHHHhhhhhccccHHHHHHHHHHh-cccCCCCceEcc--HHHHHHHHHHHHhhhhhhhhceeeecCC----ceE Q lcl|Aclame:pro 48 AKAEAERVSSLPKSAQSLSANQRSFFMDIN-KNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKNAGL----RLK 120 (381) Q Consensus 48 ~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~-~~~~~~gg~lvP--~~~~~~Ii~~l~~~~~l~~~~~v~~~~~----~~~ 120 (381) ...+++. .++....+ ...+. ...++.|-|++. +.+..+|++.....-.-+.++.+.+..+ ... 
T Consensus 1 ~~~~~~~-~~~~~~~~---------~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~ 70 (314) T protein:vir:10 1 MAIKFDA-EQAKITTH---------LEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFE 70 (314) T ss_pred CccchHH-HHHHHHHH---------HHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEE Confidence 1111110 00000000 00111 112233445544 3444556554333322233333322111 134 Q ss_pred EEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcC---hhHHHHHHHHHHHHHHHHHHhhheee Q lcl|Aclame:pro 121 FLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLK 197 (381) Q Consensus 121 ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~l~~~i~~~la~a~a~~~d~a~l~ 197 (381) ++..+..+.+.|.+..+..-+..+..+.+.....+.++.-+.++..=|+-+ ..+++.--....++++.+.+|+.+++ T Consensus 71 ~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 150 (314) T protein:vir:10 71 YPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWS 150 (314) T ss_pred eeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe Confidence 455566677888775443223456667788888888888888865555444 45678888888999999999999999 Q ss_pred ccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHH Q lcl|Aclame:pro 198 GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEV 277 (381) Q Consensus 198 G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~ 277 (381) |+...+-.|+|++........ ..++ .++....+++..++..+.....+.. 
....++++|..+..+ T Consensus 151 G~~~~g~~GLlN~p~v~~~~~---~~~W--------aT~~ei~~Di~~~~~~l~~~s~g~~----~p~~l~Lpp~~~~~L 215 (314) T protein:vir:10 151 GSAPHGIVSVFDQPNINNVVA---TPNW--------SVPQNAIDDVTAMIDAVESSTQGLH----HVTDILLPASARRVM 215 (314) T ss_pred ecccccceeEeecCCCccccC---CCCc--------ccHHHHHHHHHHHHHHHHHhcCccc----cceeEEecHHHHHhh Confidence 998778899998764321111 1112 1334444555554444433222222 123577888755433 Q ss_pred HhhhhccCCCCce----eeccCCCceEEecCCCCC----cc--EEEEeccc-eE-EEecceeeEeeehhhhhhcCceEEE Q lcl|Aclame:pro 278 QAQYTHLNANGVY----VTALPFNLNVIESTVQEA----GK--VLTYVKGL-YD-GYLAGGINVQKFKETLALDDMDLYT 345 (381) Q Consensus 278 ~~~~~~~~~~G~~----~~~l~~g~~vi~s~~~p~----~~--i~~gd~s~-y~-i~~r~~~~i~~~~~~~~~~d~~~~~ 345 (381) .. ..+..|.- +..-..++.|.....+.. ++ +++.+-+. ++ +...+.++.-. -+.+ .-..... T Consensus 216 ~~---~~~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-~e~~--~~~~~~~ 289 (314) T protein:vir:10 216 QG---LVPQTNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLP-AQPK--DLHFRYP 289 (314) T ss_pred cc---cccCCCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeec-ceec--CceEEEc Confidence 21 11122321 111112444444333321 11 22222222 21 22222222211 0111 1122333 Q ss_pred EEEEEcCE-EecCcceEEEEEEecccccC Q lcl|Aclame:pro 346 AKQFAYGK-AKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 346 ~~~r~dgk-~~~~~Af~v~~l~~~~~~~~ 373 (381) +..|+.|. 
+..|.|++.+ .+.|=+ T Consensus 290 ~~~r~~Gv~i~~P~ai~~~----dGI~~~ 314 (314) T protein:vir:10 290 VTSKATGLIVYRPLTMAVI----KGITFA 314 (314) T ss_pred ceeeeEEEEEECcceeEee----eeeecC Confidence 46677554 4556666543 232222 No 170 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=337 Identities=14% Similarity=0.140 Sum_probs=128.6 Q ss_pred CCc----cHHHHHHH-------------------------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 1 MTI----NLSETFAN-------------------------AKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKA- 50 (381) Q Consensus 1 m~~----~l~~~~~e-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 50 (381) |.- +..+.+.| +.++|...+..++.+.+.. ++.++.-.+..+...+. T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~---en~LN~~eE~~KGK~kMt 77 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKI---ENELNAQEEKPKGKDKMT 77 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhh---hhhhhhhhhcchhhHHHH Confidence 221 11111111 1111111111111111000 01111100000000000 Q ss_pred HHH---HHHH-hh--hhhccccHHHHHHHHH-Hhccc--CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eE Q lcl|Aclame:pro 51 EAE---RVSS-LP--KSAQSLSANQRSFFMD-INKNV--NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LK 120 (381) Q Consensus 51 ~~~---~~~~-~~--~~~~~lt~~e~~~~~~-~~~~~--~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~-~~ 120 (381) ++- ++.. .. 
...+...++-|++..+ +.+.+ -++--+.+|+.+.-.|-..+..+.|+++...|++.+.- ++ T Consensus 78 ~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~ 157 (393) T protein:vir:16 78 NFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS 157 (393) T ss_pred HHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHH Confidence 000 0000 00 0112223333444332 22222 24666788999999999999999999998888877643 22 Q ss_pred EEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHH---HhcChhHHHHHHHHHHHHHHH-HHHhhhee Q lcl|Aclame:pro 121 FLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL---NDFGPAWIERFVRVQIEEAFA-VALETAFL 196 (381) Q Consensus 121 ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~el---l~ds~~~l~~~i~~~la~a~a-~~~d~a~l 196 (381) ....+.. .|.- ...|..+.+...+|..-++.+-.++..-.+ -++ ++.|...+-.||..+++.+|- ++.+.|++ T Consensus 158 ~s~~s~~-eAq~-HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV 234 (393) T protein:vir:16 158 RSFDSAN-EAQV-HKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALV 234 (393) T ss_pred hhhhhhh-hhhh-hccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 2222222 3332 223444444455555555554333322222 222 345565678999999999999 99999999 Q ss_pred eccCCCcceeeee--ccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|Aclame:pro 197 KGTGKDQPIGLNR--QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 197 ~G~G~~qP~Gil~--~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~ 274 (381) -|+|++...-+-+ ++......+. ...+.+...++++.. ...++ -++..|...+++...+. 
T Consensus 235 ~GDG~N~f~~~DK~advK~I~k~Tt----kaksagktpfadaie---eavdf-----------vrptagrrylivktedr 296 (393) T protein:vir:16 235 EGDGTNGFKSIDKEADVKKIKKITT----KAKSAGKTPFADAIE---EAVDF-----------VRPTAGRRYLIVKTEDR 296 (393) T ss_pred eecCCCCccchhhHHHHHHHHHHhh----hhhhcCCCchhHHHH---HHHhh-----------hccCCCceEEEEeccch Confidence 9999876544421 1110000000 001112222333221 11111 12334444566665554 Q ss_pred HHHHhhhhccCCCCceee-------ccCCCce---EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhcCceEE Q lcl|Aclame:pro 275 FEVQAQYTHLNANGVYVT-------ALPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLY 344 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~-------~l~~g~~---vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~ 344 (381) -.++-.+..-+++.+.-. ..-.|+. |+.....-.-.+ +-| .+|.|-+ ++++ .-+-.-+..+.--+ T Consensus 297 kalldelrqatananvriknddteiasevgvdeiivytgskalkptv-lvd-qkyhidm-qdlt--kvdafewktnsnmi 371 (393) T protein:vir:16 297 KALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTV-LVD-QKYHIDM-QDLT--KVDAFEWKTNSNMI 371 (393) T ss_pred HHHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeecccccccee-eec-cccccch-hhhh--hhhhheeccCCceE Confidence 433322221222221100 0001111 111110000011 111 1233311 1111 01111122222222 Q ss_pred EEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~ 366 (381) ..-...-|-+---+|-+|.++- T Consensus 372 lvetltsghvetynagavitvs 393 (393) T protein:vir:16 372 LVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred EEeecccCcceeeccceeEeeC Confidence 2222334444444555555444 No 171 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=94.08 E-value=0.0055 Score=32.99 Aligned_cols=300 Identities=15% Similarity=0.058 Sum_probs=140.8 Q ss_pred ccHHHHHHHHHH----hc--cc---CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEe Q lcl|Aclame:pro 65 LSANQRSFFMDI----NK--NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q 
Consensus 65 lt~~e~~~~~~~----~~--~~---~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~ 133 (381) ++.+-|+.|+++ .+ +. +....|.|-+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 333333333322 11 12 2345788988999999999999999999999988752 2344443333333221 Q ss_pred cc--cccccccccccccceeccceeeeeehhhhHHHHhcCh--hHHHHHHHHHHHHHHHHHHhhheeeccCC----C--- Q lcl|Aclame:pro 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGK----D--- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~l~~~i~~~la~a~a~~~d~a~l~G~G~----~--- 202 (381) .- ..+.........+.-.+..++.-.-..|+.+.|+.-+ .+|..-+++.+.++++.=.-.--++|+-. + T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 11 1122222222334444555555555678888887542 46888888888888886555555677641 1 Q ss_pred cc------eeeeecccccc--------ccccccc-cccchhhh-ccccChhHHHHHHHHHHHHhhhccccccccc--cCc Q lcl|Aclame:pro 203 QP------IGLNRQVQKGV--------SVTEGAY-PEKEEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAV--KGN 264 (381) Q Consensus 203 qP------~Gil~~~~~~~--------~~~~~~~-~~~~~~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 264 (381) .| +|+|..+-... ...++.. .+....+. ..+.+.+.++..+.+ .+ -+..| .+. 
T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~---~l------I~~~~~~d~d 231 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATN---NL------IDEVYQDDPN 231 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHh---cc------CChHHhcCCC Confidence 23 35553211100 0011110 00001111 112232222222111 00 01122 235 Q ss_pred eEEEEchhhHH-HHHhhhhccCCC-----Cceee--ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeeh Q lcl|Aclame:pro 265 VTMVVNPSDAF-EVQAQYTHLNAN-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFK 333 (381) Q Consensus 265 ~~~imn~~~~~-~~~~~~~~~~~~-----G~~~~--~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~ 333 (381) .|.||.+.-.. +..++....+.+ ++.+. ...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+ T Consensus 232 LVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p 311 (355) T protein:vir:98 232 LVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) T ss_pred EEEEEchhhhHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 78888876443 222322111111 11111 112389999999999999887666553332 112111 21111 Q ss_pred hh----hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCC Q lcl|Aclame:pro 334 ET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 334 ~~----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~ 378 (381) ++ .+..-..+|..-.+.-+..++ . 
+++.-...+++++++- T Consensus 312 ~r~rie~y~s~Ne~YvVEd~~~~a~ie--n---I~~~~~~~~~~~~~~a 355 (355) T protein:vir:98 312 KKDRVENYESMNIDYVVEVYAAGCLLE--N---ITLGDFTAPAAPESGA 355 (355) T ss_pred ccccccchhhhcceeeeeccccEEEee--c---eeeeCCCCCcccccCC Confidence 11 112223455555444455554 1 2222333445555554 No 172 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=93.00 E-value=0.0092 Score=31.76 Aligned_cols=339 Identities=14% Similarity=0.149 Sum_probs=131.4 Q ss_pred CCccHHH----HHHHHHHHHHHHHhhhhHH-------------------HHHHH----------HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MTINLSE----TFANAKNEFINAVNNGEPQ-------------------ERQNE----------LYGDMINQLFEETKLQ 47 (381) Q Consensus 1 m~~~l~~----~~~e~~~~~~~~~~~~~~~-------------------~~~~~----------~~~~~~~~~~~~~~~~ 47 (381) |.+-.++ .++|+++.+.+--+++..- .+-.. ..++.+++..+..+.. T Consensus 1 mriS~~~~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhh Confidence 5542211 2333332222111111100 00000 0011111111111111 Q ss_pred HHH-HHH---HHHH-hh--hhhccccHHHHHHHHH-Hhccc--CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC Q lcl|Aclame:pro 48 AKA-EAE---RVSS-LP--KSAQSLSANQRSFFMD-INKNV--NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL 117 (381) Q Consensus 48 ~~~-~~~---~~~~-~~--~~~~~lt~~e~~~~~~-~~~~~--~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~ 117 (381) .+. ++- ++.. .. ...+...++-|++..+ +.+.+ -++--+.+|+.+.-.|-..+..+.|+++...|++.+. 
T Consensus 81 ~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~ 160 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) T ss_pred HHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchh Confidence 110 000 0000 00 0112222333433322 22222 2456678899999999999999999999888887764 Q ss_pred c-eEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHH---HhcChhHHHHHHHHHHHHHHH-HHHh Q lcl|Aclame:pro 118 R-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL---NDFGPAWIERFVRVQIEEAFA-VALE 192 (381) Q Consensus 118 ~-~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~el---l~ds~~~l~~~i~~~la~a~a-~~~d 192 (381) - ++....+.. .|.-+ ..|..+.+...+|..-++.+-.++..-.+ -++ ++.|...+-.||..+++.+|. ++.+ T Consensus 161 ~~V~~s~~s~~-~Aq~H-kdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd 237 (400) T protein:vir:93 161 LLVSRSFDSAN-EAQVH-KDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVD 237 (400) T ss_pred hhHHhhhhhhh-hhhhh-ccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 3 222222222 33322 23444444455565555554333322222 122 345666678999999999999 8999 Q ss_pred hheeeccCCCcceeeee--ccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEc Q lcl|Aclame:pro 193 TAFLKGTGKDQPIGLNR--QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVN 270 (381) Q Consensus 193 ~a~l~G~G~~qP~Gil~--~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn 270 (381) .|++-|+|++...-+-+ ++......+. ...+.+...++++.. ...++ -++..|...+++. 
T Consensus 238 ~AlV~GDG~N~f~~~DK~advK~I~~~Tt----kaksagktpfadaie---eavdf-----------vrptagrrylivk 299 (400) T protein:vir:93 238 LALVEGDGTNGFKSIDKEADVKKIKKITT----KAKSAGKTPFADAIE---EAVDF-----------VRPTAGRRYLIVK 299 (400) T ss_pred hhhheecCCCCccchhhHHHHHHHHHHhh----hhhhcCCCchhHHHH---HHHhh-----------hccCCCceEEEEe Confidence 99999999886544421 1110000000 001112222333221 11111 1233444456666 Q ss_pred hhhHHHHHhhhhccCCCCceeec--------cCCCce---EEecCCCCCccEEEEeccceEEEecceeeEeeehhhhhhc Q lcl|Aclame:pro 271 PSDAFEVQAQYTHLNANGVYVTA--------LPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALD 339 (381) Q Consensus 271 ~~~~~~~~~~~~~~~~~G~~~~~--------l~~g~~---vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~ 339 (381) ..+.-.++--+..-+++.. +.. .-.|+. |+.....-.-.+ +-| .+|.|-+ ++++ .-+-.-+.. T Consensus 300 tedrkalldelrqatanah-vriknddaeiasevgvdeiivytgskalkptv-lvd-qkyhidm-qdlt--kvdafewkt 373 (400) T protein:vir:93 300 TEDRKALLDELRQATANAH-VRIKNDDAEIASEVGVDEIIVYTGSKALKPTV-LVD-QKYHIDM-QDLT--KVDAFEWKT 373 (400) T ss_pred ccchHHHHHHHHhhccccc-eEeecchhhhhhhcCcceeeeeecccccccee-eec-cccccch-hhhh--hhhhheecc Confidence 5554333322211122221 100 001111 111110000011 111 1233311 1111 011111222 Q ss_pred CceEEEEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 340 DMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 340 d~~~~~~~~r~dgk~~~~~Af~v~~l~ 366 (381) +.--+..-...-|-+---+|-+|.++- T Consensus 374 nsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 374 NSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred CCceEEEeecccCcceeeccceeEeeC Confidence 222222222334444444555555444 No 173 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=92.42 E-value=0.011 Score=31.22 Aligned_cols=298 Identities=13% Similarity=0.065 Sum_probs=140.1 Q ss_pred ccHHHHHHHHHH----hc--cc---CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEe Q lcl|Aclame:pro 65 
LSANQRSFFMDI----NK--NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~----~~--~~---~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~ 133 (381) ++.+-|+.|+++ .+ +. +....|.|-+.+...+...+.+.+-+++..+++++.- +..+....+.+.++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 333333333322 11 11 1245788999999999999999999999999888652 2344443333333221 Q ss_pred cc--cccccccccccccceeccceeeeeehhhhHHHHhcCh--hHHHHHHHHHHHHHHHHHHhhheeeccCCC------- Q lcl|Aclame:pro 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGKD------- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~l~~~i~~~la~a~a~~~d~a~l~G~G~~------- 202 (381) .- +.+..+..-..++.-.+..++.-.-..|+.++|+.-+ .+|..-+++.+.++++.=.-.--+||+-.. T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 11 1122221112344445555555555678888887532 467788888888888765444446776421 Q ss_pred cc------eeeeeccccc--------ccccccccccc-chhhh-ccccChhHHHHHHHHHHHHhhhcccccccccc--Cc Q lcl|Aclame:pro 203 QP------IGLNRQVQKG--------VSVTEGAYPEK-EEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GN 264 (381) Q Consensus 203 qP------~Gil~~~~~~--------~~~~~~~~~~~-~~~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 264 (381) .| +|+|..+-.. ....++..... ...+. ..+.+.+.++..+.+ .+ -+..++ +. 
T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~---~l------I~~~~~~d~d 231 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATN---NL------IEPWYQEDPD 231 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCC Confidence 23 3555321110 00011111100 10111 122232332222111 00 012233 35 Q ss_pred eEEEEchhhHHH-HHhhhhccCCCCce--------e--eccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Ee Q lcl|Aclame:pro 265 VTMVVNPSDAFE-VQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQ 330 (381) Q Consensus 265 ~~~imn~~~~~~-~~~~~~~~~~~G~~--------~--~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~ 330 (381) .|.||-+.-... ..++. +..+.+ + ....=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +. T Consensus 232 LVvivG~dLla~k~~~l~---n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~ 308 (357) T protein:vir:56 232 LVVIVGRQLLADKYFPIV---NKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIE 308 (357) T ss_pred EEEEEchhhhhhhhhhHh---hccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 778888765442 22221 111111 1 0112388999999999999887655553222 222222 21 Q ss_pred eehhh----hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 331 KFKET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 331 ~~~~~----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) -.+++ .+..-..+|..-.+.-+..++ .+++...++.+++.++.- T Consensus 309 d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE-------~i~i~~~~~~~~~~~~~~ 356 (357) T protein:vir:56 309 ENPKLDRVENYESMNIDYVVEDYAAGCLVE-------KIKVGDFSTPAKATEEPG 356 (357) T ss_pred eccccccccchhhhcceeeeeccccEEEee-------eeeeccCCCCcccCCCCC Confidence 11111 111223345444444344443 244444455555555555 No 174 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=91.64 E-value=0.015 Score=30.60 
Aligned_cols=352 Identities=11% Similarity=0.056 Sum_probs=128.7 Q ss_pred CCccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---H Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---F 72 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~ 72 (381) |+|+-++++.||=+-+++. |.+. +...-.+.+++..+.++.+. +++........+..|++.+.. . T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~~~~i~~~-~~~~~a~~~enq~~~~~~~~------~~~~~~~~~~~~~~l~e~~~~~~~~ 73 (521) T protein:vir:10 1 MTIKTKAELLNKWKPLLEGEGLPEIANS-KQAIIAKIFENQEKDFQTAP------EYKDEKIAQAFGSFLTEAEIGGDHG 73 (521) T ss_pred CCcchhHHHHHhhhhhhccCCCCccccc-hhhhhhhhhhhhhhhhhhcc------ccchhHHHHHHhhhhhhhcccCccc Confidence 9999999999998888876 1111 11111222233222221111 111111111111112211100 0 Q ss_pred H--HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEE-----E-ecCC--------------cce Q lcl|Aclame:pro 73 F--MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFL-----K-SETS--------------GVA 130 (381) Q Consensus 73 ~--~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip-----~-~~~~--------------~~a 130 (381) + ..+.+++.+..=.-.-|.+.. ++++.-..=.-.+++.|.|+++..-.. + .... 
+.+ T Consensus 74 ~~~~~i~es~~t~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada 152 (521) T protein:vir:10 74 YNATNIAAGQTSGAVTQIGPAVMG-MVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDA 152 (521) T ss_pred cccccccccccccccccCCchhhh-HHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccccc Confidence 0 011112211110001122221 111111111234567777765432110 0 0000 000 Q ss_pred EEeccc--------------------------------------------------------------------c----- Q lcl|Aclame:pro 131 VWGKIY--------------------------------------------------------------------G----- 137 (381) Q Consensus 131 ~w~~e~--------------------------------------------------------------------~----- 137 (381) .|.+.+ + T Consensus 153 ~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~ 232 (521) T protein:vir:10 153 MFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSI 232 (521) T ss_pred cccccccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhh Confidence 000000 0 Q ss_pred -cc----cccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHHhhheeeccC- Q lcl|Aclame:pro 138 -EI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKGTG- 200 (381) Q Consensus 138 -~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~d~a~l~G~G- 200 (381) |. -..+...|.+..|...|.++- ...|-||.+| -.+|.|++|.+.|+..|...+|+.||. 
+= T Consensus 233 aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~-~i~ 311 (521) T protein:vir:10 233 AELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD-WIN 311 (521) T ss_pred HhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhh-hhh Confidence 00 001122366666666665533 4567777776 356899999999999999999999983 20 Q ss_pred C-Cc--ceeeeeccccccccccccccccchhhhcccc---C---hhHHHHHHHHHHHHhhhcccccccc-ccCce-EEEE Q lcl|Aclame:pro 201 K-DQ--PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFA---N---PRATVNELTQVFKYHSTNEKGKSVA-VKGNV-TMVV 269 (381) Q Consensus 201 ~-~q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~---~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~im 269 (381) . .+ -.|+-+ +.+ ...+...+. + .......+..++..+-...+...+. -++++ ..|+ T Consensus 312 ~sa~~~~~g~t~--------~~~-----~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~ 378 (521) T protein:vir:10 312 YSAQVGKSGMTL--------TPG-----SKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIA 378 (521) T ss_pred heeeeeeeeeee--------ccC-----ccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEE Confidence 0 00 112210 000 001111111 1 1111111111221111111111111 12333 3467 Q ss_pred chhhHHHHHhhhh------ccC-CCC--------ceeeccCCCceEEecCCCCCccEEEEeccce-------EEEeccee Q lcl|Aclame:pro 270 NPSDAFEVQAQYT------HLN-ANG--------VYVTALPFNLNVIESTVQEAGKVLTYVKGLY-------DGYLAGGI 327 (381) Q Consensus 270 n~~~~~~~~~~~~------~~~-~~G--------~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y-------~i~~r~~~ 327 (381) ++.-...+ .... .++ +.| .+.-.|.-+++|+.+.+.|.+-++.|-.... +-=-.... 
T Consensus 379 S~~Va~~L-~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~ 457 (521) T protein:vir:10 379 SRNVVNVL-ASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALT 457 (521) T ss_pred chHHHHHH-hhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccc Confidence 76543322 2110 010 011 1222455578999999998877766633111 00000111 Q ss_pred eEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEE-----EEecccccCCCCCCCCC Q lcl|Aclame:pro 328 NVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWK-----LDLKGHKPALEGTEETL 381 (381) Q Consensus 328 ~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~-----l~~~~~~~~~~~~~~~~ 381 (381) .+...|...| |-.+-...|+ |-.++| |+.-. -.|.+..|....+...+ T Consensus 458 ~~~~~dp~sf---qP~~g~~tRY-~l~~NP--~~~~~~~~~~~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:10 458 PLRGSDPKNF---QPVMGFKTRY-GIGINP--FAESAAQAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred cccccCCccc---cceeeeeeee-ceeecC--cccccCCccceeecccchhhhcccccc Confidence 1122233323 2233334445 555565 22210 01111111111122222 No 175 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=89.77 E-value=0.024 Score=29.42 Aligned_cols=298 Identities=12% Similarity=0.058 Sum_probs=136.7 Q ss_pred ccHHHHHHHHHH----hc--cc---CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEe Q lcl|Aclame:pro 65 LSANQRSFFMDI----NK--NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~----~~--~~---~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~ 133 (381) ++.+-|+.|+++ .+ +. +....|.|-+.+...+...+.+.+-+++..+++++.- +..+....+.+.++-. 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 333333333322 11 11 1245788999999999999999999999999888652 2344443333333221 Q ss_pred cc--cccccccccccccceeccceeeeeehhhhHHHHhcCh--hHHHHHHHHHHHHHHHHHHhhheeeccCCC------- Q lcl|Aclame:pro 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGKD------- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~l~~~i~~~la~a~a~~~d~a~l~G~G~~------- 202 (381) .- +.+..+..-..++.-.+..++.-.-..|+.++|+.-+ .+|..-+++.+.++++.=.-.--+||+-.. T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 11 1122221112344445555555555678888887532 467788888888888765444446776421 Q ss_pred cc------eeeeecccc--------cccccccccccc-chhhh-ccccChhHHHHHHHHHHHHhhhcccccccccc--Cc Q lcl|Aclame:pro 203 QP------IGLNRQVQK--------GVSVTEGAYPEK-EEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GN 264 (381) Q Consensus 203 qP------~Gil~~~~~--------~~~~~~~~~~~~-~~~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 264 (381) .| +|+|..+-. ..+...+..... ...+. ..+.+.+.++..+.+ .+ -+..++ +. 
T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~---~l------I~~~~~~d~d 231 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATN---NL------IEPWYQEDPD 231 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCC Confidence 23 355532110 001111111111 11111 122233332222111 00 012233 35 Q ss_pred eEEEEchhhHHH-HHhhhhccCCCCce--------e--eccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Ee Q lcl|Aclame:pro 265 VTMVVNPSDAFE-VQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQ 330 (381) Q Consensus 265 ~~~imn~~~~~~-~~~~~~~~~~~G~~--------~--~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~ 330 (381) .|.||-+.-... ..++. +..+.+ + ....=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +. T Consensus 232 LVvivG~dLla~k~~~l~---n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~ 308 (357) T protein:vir:20 232 LVVIVGRQLLADKYFPIV---NKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIE 308 (357) T ss_pred EEEEEchhhhhhhhhhHh---hccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 778888765442 22221 111111 1 0112388999999999999887655553222 222222 21 Q ss_pred eehhh----hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 331 KFKET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 331 ~~~~~----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) -.+++ .+..-..+|..-.+.-+..++. 
+++...+...+++.|.- T Consensus 309 d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~-------i~~~~~~~p~~~~~~~~ 356 (357) T protein:vir:20 309 ENPKLDRVENYESMNIDYVVEDYAAGCLVEK-------IKVGDFSTPAKATAEPG 356 (357) T ss_pred eccccccccchhhhcceeeeeccccEEEeee-------eeeccccCCccCCCCCC Confidence 11111 1112233444443333333332 33333222223333333 No 176 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=89.57 E-value=0.026 Score=29.31 Aligned_cols=299 Identities=12% Similarity=0.051 Sum_probs=136.7 Q ss_pred ccHHHHHHHHHH----hc--cc---CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEe Q lcl|Aclame:pro 65 LSANQRSFFMDI----NK--NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~----~~--~~---~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~ 133 (381) ++.+-|+.|+++ .+ +. +....|.|-+.+...+...+.+.+-+++..+++++.- +..+....+.+.++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 333333333322 11 11 1245788999999999999999999999999888642 2344443333333221 Q ss_pred cc--cccccccccccccceeccceeeeeehhhhHHHHhcCh--hHHHHHHHHHHHHHHHHHHhhheeeccCCC------- Q lcl|Aclame:pro 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGKD------- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~l~~~i~~~la~a~a~~~d~a~l~G~G~~------- 202 (381) .- +.+..+..-..++.-.+..++.-.-..|+.++|+.-+ .+|..-+++.+.++++.=.-.--+||+-.. 
T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 11 1122221112344445555555555678888887532 467788888888888765544446776421 Q ss_pred cc------eeeeeccccc--------ccccccccccc-chhhh-ccccChhHHHHHHHHHHHHhhhcccccccccc--Cc Q lcl|Aclame:pro 203 QP------IGLNRQVQKG--------VSVTEGAYPEK-EEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GN 264 (381) Q Consensus 203 qP------~Gil~~~~~~--------~~~~~~~~~~~-~~~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 264 (381) .| +|+|..+-.. ....++..... ...+. ..+.+.+.++..+.+ .+ -+..++ +. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~---~l------I~~~~~~d~d 231 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATN---NL------IEPWYQEDPD 231 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCC Confidence 23 3555321110 00011111100 10111 122232332222111 00 112233 35 Q ss_pred eEEEEchhhHHH-HHhhhhccCCCCce--------e--eccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Ee Q lcl|Aclame:pro 265 VTMVVNPSDAFE-VQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQ 330 (381) Q Consensus 265 ~~~imn~~~~~~-~~~~~~~~~~~G~~--------~--~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~ 330 (381) .|.||-+.-... ..++. +..+.+ + ....=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +. 
T Consensus 232 LVvivG~dLla~k~~~l~---n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~ 308 (357) T protein:vir:60 232 LVVIVGRQLLADKYFPIV---NREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIE 308 (357) T ss_pred EEEEEchhhhhHHhhhHh---hcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 778888765442 22222 111211 1 0112388999999999999887655553222 222221 21 Q ss_pred eehhh----hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 331 KFKET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 331 ~~~~~----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) -.+++ .+..-..+|..-.+.-+..++. ..+...+-.++.++..+. T Consensus 309 d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~--i~~~~~~~pa~~~~~~~a 357 (357) T protein:vir:60 309 ENPKLDRVENYESMNIDYVVEDYAAGCLVEK--IKVGDFSTPAKATAEPGA 357 (357) T ss_pred eccccccccchhhhcceeeeeccccEEEeee--eeeccCcccccCCCCCCC Confidence 11111 1112233454444444444442 112112222222222233 No 177 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=88.27 E-value=0.033 Score=28.68 Aligned_cols=288 Identities=13% Similarity=0.074 Sum_probs=135.8 Q ss_pred ccHHHHHHHHHH-------hcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEec- Q lcl|Aclame:pro 65 LSANQRSFFMDI-------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGK- 134 (381) Q Consensus 65 lt~~e~~~~~~~-------~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~- 134 (381) +..+-|+.|++. .--.+....|.|.+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-.. 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 333333333221 11123456789999999999999999999999999988752 23444433333332221 Q ss_pred -ccccccccccc-cccceeccceeeeeehhhhHHHHhcC--hhHHHHHHHHHHHHHHHHHHhhheeeccCC-------Cc Q lcl|Aclame:pro 135 -IYGEIKGQLDA-AFSEETAIQNKLTAFVVLPKDLNDFG--PAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQ 203 (381) Q Consensus 135 -e~~~~~~~~~~-~f~~v~l~~~kl~~~~~iS~ell~ds--~~~l~~~i~~~la~a~a~~~d~a~l~G~G~-------~q 203 (381) ..++..+ .++ ..+.-.+..++.-.-..|+.+.|+.- ..+|..-+++.+.++++.=.-.--++|+-. .. T Consensus 81 ~~~~~R~~-~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~n 159 (338) T protein:vir:11 81 TGDGVRKP-RDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAAN 159 (338) T ss_pred CCCCcccc-ccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhC Confidence 1122221 222 34444555555555677888888753 346888888899998886555555677651 12 Q ss_pred c------eeeeecccc---ccccccccccccchhhhc---cccChhHHHHHHHHHHHHhhhccccccccc--cCceEEEE Q lcl|Aclame:pro 204 P------IGLNRQVQK---GVSVTEGAYPEKEEQGTL---TFANPRATVNELTQVFKYHSTNEKGKSVAV--KGNVTMVV 269 (381) Q Consensus 204 P------~Gil~~~~~---~~~~~~~~~~~~~~~~~~---t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~im 269 (381) | +|++..+-. ...-+.++...+..++.. 
.+.+.+.++..+.+ .+ -+..| .+..|.|| T Consensus 160 PllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~---~l------I~~~~~~d~dLVviv 230 (338) T protein:vir:11 160 PLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVS---SL------IDPWHRRDPGLVVIL 230 (338) T ss_pred cCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHh---cc------CChHHhcCCCEEEEE Confidence 3 355432111 111112222222222211 12233333222211 00 01122 23578888 Q ss_pred chhhHHHH-HhhhhccCCC-----Ccee--eccCCCceEEecCCCCCccEEEEeccceEEEe-cceee--Eeeehhh--- Q lcl|Aclame:pro 270 NPSDAFEV-QAQYTHLNAN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGYL-AGGIN--VQKFKET--- 335 (381) Q Consensus 270 n~~~~~~~-~~~~~~~~~~-----G~~~--~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~-r~~~~--i~~~~~~--- 335 (381) .+.-.... .++....+.+ ++.+ ....=|+|.+.-+++|.+.+++--|+..-|+. ++..+ +.-.+++ T Consensus 231 G~dLladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 310 (338) T protein:vir:11 231 GRELVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRI 310 (338) T ss_pred chhhhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccc Confidence 87644422 2221110100 1111 11223899999999999998876665533321 22111 2111111 Q ss_pred -hhhcCceEEEEEEEEcCEEecCcceEEEEEEecc Q lcl|Aclame:pro 336 -LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 336 -~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~ 369 (381) .+..-..+|..-.+.-+..++. +++.. 
T Consensus 311 e~y~s~Ne~YvVEd~~~~a~ien-------i~~~~ 338 (338) T protein:vir:11 311 ENYESSNDAYVVEDYGLGCLVEN-------IEVAE 338 (338) T ss_pred cchhhhccceeeeccccEEEeec-------ceecC Confidence 1111222343333333333321 22222 No 178 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=88.09 E-value=0.035 Score=28.60 Aligned_cols=288 Identities=14% Similarity=0.071 Sum_probs=132.5 Q ss_pred ccHHHHHHHHHHhc------c-cCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEecc Q lcl|Aclame:pro 65 LSANQRSFFMDINK------N-VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~~------~-~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~e 135 (381) ++.+-|..|+++.. + .+..-.|.|-+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 33333333332211 1 12344688888999999999999999999999988752 233444333333322221 Q ss_pred c-ccccccccccccceeccceeeeeehhhhHHHHhcC--hhHHHHHHHHHHHHHHHHHHhhheeeccCC-------Ccc- Q lcl|Aclame:pro 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG--PAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) Q Consensus 136 ~-~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds--~~~l~~~i~~~la~a~a~~~d~a~l~G~G~-------~qP- 204 (381) + .+..+..-...+.-.+..++.-.-..|+.+.|+.- ..+|..-+++.+.++++.=.-.--++|+-. 
..| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 1 11211111223444455555555567888888753 246888888899988886555555677651 123 Q ss_pred -----eeeeecccc---cccccccccc-ccchhhh-ccccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|Aclame:pro 205 -----IGLNRQVQK---GVSVTEGAYP-EKEEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 205 -----~Gil~~~~~---~~~~~~~~~~-~~~~~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~~ 272 (381) +|++..+-. ....+.++.. .+..+++ ..+.+.+.++..+.+ .+ -+..++ +..+.||-+. T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~---~l------I~~~~~~d~~LVvivG~d 231 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVS---SM------IDPWFQEDTGLVAICGRE 231 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCCEEEEEchh Confidence 355532211 0011111110 0111111 122233332222111 00 012222 3577788876 Q ss_pred hHHHHHhhhhccCCCCce--------ee--ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh---- Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVY--------VT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET---- 335 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~--------~~--~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~---- 335 (381) -.......+ .+....+ +. 
...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ T Consensus 232 Lladk~~~l--~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 309 (337) T protein:vir:79 232 LLHDKYFPI--VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIE 309 (337) T ss_pred hhhHHhhHH--hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcccccccc Confidence 544222111 1111111 11 112389999999999999887666553332 222221 2111111 Q ss_pred hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 336 ~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) .+..-..+|..-.+.-+..++ .+++... T Consensus 310 ~y~s~Ne~YvVEd~~~~a~ie-------nI~~~~a 337 (337) T protein:vir:79 310 NYESSNDAYVVEDFGCGCVAE-------NIELAAA 337 (337) T ss_pred chhhccceeeeeccccEEEEe-------ceeecCC Confidence 011112233333222222222 2233222 No 179 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=87.47 E-value=0.039 Score=28.34 Aligned_cols=288 Identities=14% Similarity=0.086 Sum_probs=132.9 Q ss_pred ccHHHHHHHHHHh----c--c-cCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEecc Q lcl|Aclame:pro 65 LSANQRSFFMDIN----K--N-VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~----~--~-~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~e 135 (381) ++.+-|..|+++. + + .+....|.|-+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-... 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 3333333333221 1 1 13345788888999999999999999999999988752 233444333333322221 Q ss_pred c-ccccccccccccceeccceeeeeehhhhHHHHhcC--hhHHHHHHHHHHHHHHHHHHhhheeeccCC-------Ccc- Q lcl|Aclame:pro 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG--PAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) Q Consensus 136 ~-~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds--~~~l~~~i~~~la~a~a~~~d~a~l~G~G~-------~qP- 204 (381) + .+..+..-...+.-.+..++.-.-..|+.+.|+.- ..+|..-+++.+.++++.=.-.--++|+-. ..| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 1 11211111224444455555555567888888753 246888888899988886555555677651 123 Q ss_pred -----eeeeecccc---cccccccccc-ccchhhhc-cccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|Aclame:pro 205 -----IGLNRQVQK---GVSVTEGAYP-EKEEQGTL-TFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 205 -----~Gil~~~~~---~~~~~~~~~~-~~~~~~~~-t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~~ 272 (381) +|++..+-. ....+.++.. .+..+++. .+.+.+.++..+.+ .+ -+..++ +..+.||-+. 
T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~---~l------I~~~~~~d~~LVvivG~d 231 (337) T protein:vir:10 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVS---SM------IDPWFQEDTGLVVICGRE 231 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCCEEEEEchh Confidence 355532211 0011111110 01111111 22333332222211 00 012222 3577788876 Q ss_pred hHHHHHhhhhccCCCCce--------ee--ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh---- Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVY--------VT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET---- 335 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~--------~~--~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~---- 335 (381) -.......+ .+....+ +. ...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ T Consensus 232 Lladk~~~l--~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 309 (337) T protein:vir:10 232 LLHDKYFPI--VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIE 309 (337) T ss_pred hhhHHhhHH--hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcccccccc Confidence 544222111 1111111 11 112389999999999999887666553332 222222 2111111 Q ss_pred hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 336 ~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) .+..-..+|..-.+.-+..++ .+++... 
T Consensus 310 ~y~s~Ne~YvVEd~~~~a~ie-------nI~~~~a 337 (337) T protein:vir:10 310 NYESSNDAYVVEDFGCGCVAE-------NIELAAA 337 (337) T ss_pred chhhccceeeeeccccEEEEe-------ceeecCC Confidence 011112233333222222222 2233222 No 180 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=86.77 E-value=0.043 Score=28.06 Aligned_cols=286 Identities=13% Similarity=0.099 Sum_probs=125.5 Q ss_pred ccHH-HHHHHHHHhc--cc-----CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEec Q lcl|Aclame:pro 65 LSAN-QRSFFMDINK--NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGK 134 (381) Q Consensus 65 lt~~-e~~~~~~~~~--~~-----~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~ 134 (381) ++.. ...+...+.+ +. ..+.-|.|.+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-.. T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccC Confidence 1110 0111111111 11 1224599999999999999999999999999988742 23344433333332211 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHH----HHHHHHHHHhhheeeccC----CCcc-- Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQ----IEEAFAVALETAFLKGTG----KDQP-- 204 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~----la~a~a~~~d~a~l~G~G----~~qP-- 204 (381) . + .. ..+...+.-.+..++.-.-..|+.+.|+.-+ ++..+..+. 
+.++|+.=.-.--++|+- ++.| T Consensus 81 t-~-R~-~~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPll 156 (336) T protein:vir:37 81 T-G-RN-LANLDHTQNGFELAETDSGIIVPWALFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADL 156 (336) T ss_pred C-C-cc-ccccCcCCcccEEEEeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCcc Confidence 1 1 11 1223344445555555556789999998754 244443333 344444322233356654 2245 Q ss_pred ----eeeeeccc---ccccccccccc-ccch-hhh-ccccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|Aclame:pro 205 ----IGLNRQVQ---KGVSVTEGAYP-EKEE-QGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 205 ----~Gil~~~~---~~~~~~~~~~~-~~~~-~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~~ 272 (381) +|++..+- ....-+.++.. ++.. .++ ..+.+.+.++..+.+. -+..++ +..+.||-+. T Consensus 157 qDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~----------I~~~~~~d~dLVvivG~d 226 (336) T protein:vir:37 157 SDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQG----------LDFRHQNRNDLVFLVGAD 226 (336) T ss_pred cccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhc----------CchHHhcCCCeEEEEchh Confidence 35553221 11111111100 1100 111 1123333332222221 122233 3667788775 Q ss_pred hHHHHHhhhhccCCCCc-e----------eeccCCCceEEecCCCCCccEEEEeccceEEEe-cceee--Eeeehhh--- Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGV-Y----------VTALPFNLNVIESTVQEAGKVLTYVKGLYDGYL-AGGIN--VQKFKET--- 335 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~-~----------~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~-r~~~~--i~~~~~~--- 335 (381) -...-.... .+.+|. | +....=|+|.+.-+++|.+.+++--|+..-|+. 
++..+ +.-.+++ T Consensus 227 Lla~~~~~l--~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 304 (336) T protein:vir:37 227 LVSKETKLI--QQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGL 304 (336) T ss_pred hhhhhhhhh--hhhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccc Confidence 433221111 112221 1 111123899999999999998876665533321 22221 2111111 Q ss_pred -hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 336 -LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 336 -~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) .+..-..+|..-.+.-+..++.-. +++.+ |. T Consensus 305 e~y~s~Ne~YvVEd~~~~a~iE~i~-----v~~~~-----e~ 336 (336) T protein:vir:37 305 VTSYYRQEGYVVEDLGLMTAIDHTK-----VKLNG-----EV 336 (336) T ss_pred cchhhhcceeeeeccccEEEeeeee-----eeecC-----cC Confidence 111223344444444344444322 22222 22 No 181 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=86.67 E-value=0.044 Score=28.03 Aligned_cols=292 Identities=11% Similarity=0.055 Sum_probs=131.3 Q ss_pred ccHHHHHHHHHH----hc--cc-----CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecC--CceEEEEecCCcceE Q lcl|Aclame:pro 65 LSANQRSFFMDI----NK--NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAV 131 (381) Q Consensus 65 lt~~e~~~~~~~----~~--~~-----~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~--~~~~ip~~~~~~~a~ 131 (381) ++.+-|..|+++ .+ +. +.+.-|.|.+.+...+.+.+.+.+-+++..+++++. 
+........+...++ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 333333333322 11 11 223459999999999999999999999999988763 222222222211111 Q ss_pred EecccccccccccccccceeccceeeeeehhhhHHHHhcCh--hH-HHHHHHHHHHHHHHHHHhhheeeccCC----Ccc Q lcl|Aclame:pro 132 WGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AW-IERFVRVQIEEAFAVALETAFLKGTGK----DQP 204 (381) Q Consensus 132 w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~-l~~~i~~~la~a~a~~~d~a~l~G~G~----~qP 204 (381) -....+..... .+ -+.-.+..++.-.-..|+.++|+.-+ .| |..-+++.+.++++.=.-.--+||+-. ..| T Consensus 81 r~~t~~~~~~~-~~-~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nP 158 (343) T protein:vir:98 81 AHDRRTPIQQR-WT-RQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDP 158 (343) T ss_pred ccccCCCcccc-cc-CCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCc Confidence 11111110000 00 01112344444445778888887653 45 777888888888775444444577642 245 Q ss_pred ------eeeeeccc---ccccccccccccc-chhhhc-cccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEch Q lcl|Aclame:pro 205 ------IGLNRQVQ---KGVSVTEGAYPEK-EEQGTL-TFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNP 271 (381) Q Consensus 205 ------~Gil~~~~---~~~~~~~~~~~~~-~~~~~~-t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~ 271 (381) +|+|..+- ....-+.++.... ...+.. 
.+.+.+.++..+..+ -+..++ +..+.||.+ T Consensus 159 llqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~----------I~~~~~~d~dLVvivG~ 228 (343) T protein:vir:98 159 NLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQG----------LDARHRDAGDLVFLVGA 228 (343) T ss_pred chhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhc----------CchHHhcCCCEEEEEch Confidence 35553221 1111111111111 111111 133333333222211 122233 357778887 Q ss_pred hhHHHHHhhhhccCCCCceee-----------ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh-- Q lcl|Aclame:pro 272 SDAFEVQAQYTHLNANGVYVT-----------ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET-- 335 (381) Q Consensus 272 ~~~~~~~~~~~~~~~~G~~~~-----------~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~-- 335 (381) .-...-.... .+..++..+ ...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ T Consensus 229 dLla~~~~~l--~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 306 (343) T protein:vir:98 229 DLVAKEASLV--YKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKA 306 (343) T ss_pred hhhhhhhhhh--hhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 6443322111 112222111 112388999999999999887665553222 222222 2111111 Q ss_pred --hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 336 --~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) .+..-..+|..-.+.-+..++.-.+++ .+ ..|+=+ T Consensus 307 ie~y~s~Ne~YvVEd~~~~a~iE~i~v~~-----~~----~~g~w~ 343 (343) T protein:vir:98 307 VRDSYYRNEAYAVEDCGKFMAVDFTKVKL-----SS----GKGTWK 343 (343) T ss_pred ccchhhhcceeeeeccccEEEeeeeeeee-----cC----CCCCCC Confidence 111223355544444444444333332 21 112222 No 182 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=86.18 E-value=0.047 Score=27.84 Aligned_cols=286 Identities=14% Similarity=0.085 Sum_probs=123.8 Q ss_pred 
ccHH-HHHHHHHHhc--cc-----CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEec Q lcl|Aclame:pro 65 LSAN-QRSFFMDINK--NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGK 134 (381) Q Consensus 65 lt~~-e~~~~~~~~~--~~-----~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~ 134 (381) ++.. ...+...+.+ +. ..+.-|.|.+.+...+.+.+++.+-+++..+++++.- +..+....+.+.++-.. T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccC Confidence 1110 0111111111 11 1234699999999999999999999999999988742 23344433333332222 Q ss_pred ccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHH--HHhhh--eeeccC----CCcc-- Q lcl|Aclame:pro 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV--ALETA--FLKGTG----KDQP-- 204 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~--~~d~a--~l~G~G----~~qP-- 204 (381) .+.. ... ...+.-.+..++.-.-..|+.+.|+.-+ ++..+..+.+...+.+ ++|.- -++|+- ++.| T Consensus 81 t~r~-r~~--~~l~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPll 156 (336) T protein:vir:37 81 TGRN-LAT--LDHSQNGYELSETDSGILVNWSLFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDL 156 (336) T ss_pred CCCC-ccc--cCCCCCccEEEEeeeeeeccHHHHHHHh-cChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccc Confidence 1111 111 1233344444444455788999998754 2444443333333332 24433 356654 2245 Q ss_pred ----eeeeecccc---cccccccccc-ccch-hhh-ccccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|Aclame:pro 205 ----IGLNRQVQK---GVSVTEGAYP-EKEE-QGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 205 ----~Gil~~~~~---~~~~~~~~~~-~~~~-~~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~~ 272 (381) +|++..+-. ...-+.++.. ++.. .++ ..+.+.+.++..+.+. -+..++ +..+.||-+. 
T Consensus 157 qDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~----------I~~~~~~d~dLVvivG~d 226 (336) T protein:vir:37 157 SDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQG----------LDFRHQNRNDLVFLVGAD 226 (336) T ss_pred cccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhc----------cchHHhcCCCeEEEEchh Confidence 355532211 1111111100 1100 111 1123333333222221 122233 3667788775 Q ss_pred hHHHHHhhhhccCCCCc-e----------eeccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh--- Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGV-Y----------VTALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET--- 335 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~-~----------~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~--- 335 (381) -...-.... .+.+|. | +....=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ T Consensus 227 Lla~~~~~l--~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 304 (336) T protein:vir:37 227 LVSKETKLI--QQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGL 304 (336) T ss_pred hhhhhhhhh--hhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEccccccc Confidence 433221111 111221 1 11112389999999999999887666553332 122221 2111111 Q ss_pred -hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCC Q lcl|Aclame:pro 336 -LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 336 -~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~ 376 (381) .+..-..+|..-.+.-+..++.-.+ ++.+ |. 
T Consensus 305 e~y~s~Ne~YvVEd~~~~a~iE~i~v-----~~~~-----e~ 336 (336) T protein:vir:37 305 VTSYYRQEGYVVEDLGLMTAIDHTKV-----KLNG-----EV 336 (336) T ss_pred cchhhhcceeeeeccccEEEeeeeee-----eccc-----cC Confidence 1112233444443333344433222 2222 22 No 183 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=85.84 E-value=0.05 Score=27.72 Aligned_cols=300 Identities=7% Similarity=-0.023 Sum_probs=139.6 Q ss_pred hhccccHHHHHHHHHHhc------c-cCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceE Q lcl|Aclame:pro 61 SAQSLSANQRSFFMDINK------N-VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAV 131 (381) Q Consensus 61 ~~~~lt~~e~~~~~~~~~------~-~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~ 131 (381) +-+.++..-|+.|+++.. + .+....|.|-+.+...+.+.+++.+-+++.++++++.- +..+....+.+.++ T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iag 80 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTG 80 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceee Confidence 222344444444443221 1 13355788988999999999999999999999888642 22344333333332 Q ss_pred EecccccccccccccccceeccceeeeeehhhhHHHHhcCh-----hHHHHHHHHHHHHHHHHHHhhheeeccCC----- Q lcl|Aclame:pro 132 WGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP-----AWIERFVRVQIEEAFAVALETAFLKGTGK----- 201 (381) Q Consensus 132 w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~-----~~l~~~i~~~la~a~a~~~d~a~l~G~G~----- 201 (381) -... + .. ..++..+...+..++.-.-..|+.+.|+.-+ .+|..-+++.+.++++.=.-.--++|+-. 
T Consensus 81 rtdt-~-R~-~r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td 157 (341) T protein:vir:27 81 RKAG-G-RF-TKQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTD 157 (341) T ss_pred ccCC-C-ce-ecccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCC Confidence 2221 1 11 1223444555555555555777888886655 77899999999999987665666777651 Q ss_pred --Ccc------eeeeeccccccccccccccc-cchhh-hccccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEE Q lcl|Aclame:pro 202 --DQP------IGLNRQVQKGVSVTEGAYPE-KEEQG-TLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVV 269 (381) Q Consensus 202 --~qP------~Gil~~~~~~~~~~~~~~~~-~~~~~-~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~im 269 (381) ..| +|++..+-.. .+. ..-.+ ....+ ...+.+.+.++..+.+ .+ -+..++ +..+.|| T Consensus 158 ~~anPllqDVNkGWlQ~~Re~-a~~-rVl~~~~~~~g~~gdy~nLDAlV~D~~~---~l------I~~~~~~d~dLVviv 226 (341) T protein:vir:27 158 PSANPLGQDVNEGWIAFVKNR-KAS-QVVDVDVYFDETNGDYRTLDAMASDIIN---NQ------IHPMFRNDPRLTVFV 226 (341) T ss_pred hhhcccccccchhHHHHHHhh-ccc-ceeccceeeccCCCccccHHHHHHHHHh---cc------cChHHhcCCCEEEEE Confidence 123 3555322111 010 00000 01111 0112222222222111 00 011222 3467778 Q ss_pred chhhHH-HHHhhhhccCCC-----CceeeccCCCceEEecCCCCCccEEEEeccceEEEe-cceee--Eeeehhh-hhhc Q lcl|Aclame:pro 270 NPSDAF-EVQAQYTHLNAN-----GVYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYL-AGGIN--VQKFKET-LALD 339 (381) Q Consensus 270 n~~~~~-~~~~~~~~~~~~-----G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~~-r~~~~--i~~~~~~-~~~~ 339 (381) -+.-.. +..+++...+.+ ++-+....-|+|.+.-+++|.+.+++--|+..-|+. ++..+ +.-.+++ ++.. 
T Consensus 227 G~dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 306 (341) T protein:vir:27 227 GSGLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKT 306 (341) T ss_pred chhhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccccc Confidence 765443 222222111100 111112223899999999999998876665533322 11111 2111111 1111 Q ss_pred CceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCC Q lcl|Aclame:pro 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 340 d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~ 377 (381) -+.+|+.-. ++. ...-.|.-+.+..++.--..|-+ T Consensus 307 yes~YvVEd-yg~--~~~~~~~~vkl~~~~~~~~~~~~ 341 (341) T protein:vir:27 307 HTGAWKVTQ-WVC--WKRSPLTTQKKSTSALNHRSERN 341 (341) T ss_pred hhhhheeeh-hhh--hhhccccccccCccccccccccC Confidence 133443322 211 11111222223332322222333 No 184 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=85.46 E-value=0.053 Score=27.59 Aligned_cols=303 Identities=10% Similarity=0.015 Sum_probs=143.8 Q ss_pred hhccccHHHHHHHHHH----hc--cc---CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcc Q lcl|Aclame:pro 61 SAQSLSANQRSFFMDI----NK--NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGV 129 (381) Q Consensus 61 ~~~~lt~~e~~~~~~~----~~--~~---~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~ 129 (381) +-+.++..-+..|+++ .+ +. +.+..|.|.+.+...+.+.+.+.+-+++..+++++.- +..+....+.+. 
T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~i 80 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLY 80 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCccc Confidence 2233444444444432 11 22 2346799999999999999999999999999888642 223444333333 Q ss_pred eEEecccccccccccccccceeccceeeeeehhhhHHHHhcCh-----hHHHHHHHHHHHHHHHHHHhhheeeccCCC-- Q lcl|Aclame:pro 130 AVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP-----AWIERFVRVQIEEAFAVALETAFLKGTGKD-- 202 (381) Q Consensus 130 a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~-----~~l~~~i~~~la~a~a~~~d~a~l~G~G~~-- 202 (381) ++-... .........+.-.+..++.-.-..|+.++|+.-+ .+|..-+++.+.++++.=.-.--+||+-.. T Consensus 81 agrt~t---r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 157 (358) T protein:vir:78 81 TGRKKG---GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADD 157 (358) T ss_pred ceecCC---CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccC Confidence 322211 1112223344445555555556788999998754 268899999999998865555556776421 Q ss_pred -----cc------eeeeeccc---cccccccccccccchhhh---ccccChhHHHHHHHHHHHHhhhcccccccccc--C Q lcl|Aclame:pro 203 -----QP------IGLNRQVQ---KGVSVTEGAYPEKEEQGT---LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--G 263 (381) Q Consensus 203 -----qP------~Gil~~~~---~~~~~~~~~~~~~~~~~~---~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 263 (381) .| +|+|..+- .......++..+...++. ..+.+.+.++..+ +.. 
.-+..++ + T Consensus 158 Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~---~~~------lI~~~~~~d~ 228 (358) T protein:vir:78 158 TDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDL---INT------TIDPLFQQDP 228 (358) T ss_pred CChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHH---Hhc------cCChHHhcCC Confidence 23 35553211 111111111111111111 1223333332221 110 1112233 3 Q ss_pred ceEEEEchhhHHH-HHhhhhccCCC-----CceeeccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehh Q lcl|Aclame:pro 264 NVTMVVNPSDAFE-VQAQYTHLNAN-----GVYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKE 334 (381) Q Consensus 264 ~~~~imn~~~~~~-~~~~~~~~~~~-----G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~ 334 (381) ..|.||-+.-... ..++....+.+ ++-+....=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.++ T Consensus 229 dLVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~ 308 (358) T protein:vir:78 229 RLVVLVGTDLVAAAQAKLYSEATKPSEQIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQD 308 (358) T ss_pred CEEEEEchhhhhHHhhhHhhcCCCcHHHHHHHHHHHHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc Confidence 5778888765542 22322111100 11111112388999999999999887655553222 222222 211111 Q ss_pred h----hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 335 T----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 335 ~----~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) + .+..-..+|..-.+.-+..++.-.+. +.+ .|++.....-. 
T Consensus 309 r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~-----~~~-~pa~~~~~~~~ 353 (358) T protein:vir:78 309 SKSFDNQYWRMEGYALGEHKAYGGFEEADIE-----IGA-DPAVLAVEAAA 353 (358) T ss_pred cccccchhhhcceeeeeccccEEEEeeeeee-----eCC-CCCccccCCcc Confidence 1 11122335554444444444433322 222 22221111111 No 185 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=84.73 E-value=0.058 Score=27.36 Aligned_cols=297 Identities=15% Similarity=0.163 Sum_probs=124.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHH-Hhccc--CCCCceEccHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 33 YGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD-INKNV--NYKEEKLLPEETIDRIFEDLTTNHPLLAD 109 (381) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~-~~~~~--~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~ 109 (381) +.+.++. .++..++-+..+. +...++-|++.++ +.+.+ -++--+.+|+.+.-.|-..+..+.|+++. T Consensus 1 mtn~ies------q~A~~eF~~vL~~----N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~v 70 (318) T protein:vir:86 1 MTNFIES------QNAVTEFFDVLKK----NSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKV 70 (318) T ss_pred Ccchhhh------hHHHHHHHHHHhc----cCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceee Confidence 1111110 0111111111111 1112233334332 22222 24566788999999999999999999998 Q ss_pred ceeeecCCc-eEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHH---HhcChhHHHHHHHHHHHH Q lcl|Aclame:pro 110 LGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL---NDFGPAWIERFVRVQIEE 185 (381) Q Consensus 110 ~~v~~~~~~-~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~el---l~ds~~~l~~~i~~~la~ 185 (381) ..|++.+.- ++...++. ..|. +-..|..+.+...+|..-++.+-.++..-.+ -++ ++.|...+-.||..+++. 
T Consensus 71 fHVT~~~~~~V~~s~~s~-AeAq-~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ 147 (318) T protein:vir:86 71 FHVTNVGALLVSRSFDSS-AEAQ-VHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQ 147 (318) T ss_pred eeeccchhhhhhhhhhhh-hhhh-hhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHH Confidence 888877653 22222222 2333 2233444444455565555554333322222 222 345566678999999999 Q ss_pred HHH-HHHhhheeeccCCCcceeeee--ccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 186 AFA-VALETAFLKGTGKDQPIGLNR--QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK 262 (381) Q Consensus 186 a~a-~~~d~a~l~G~G~~qP~Gil~--~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 262 (381) +|- ++.|.|++-|+|++...-+-+ ++......+. ...+.++..++++.. ...++ -++.. T Consensus 148 ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Tt----kaksagttpfanaie---eavdf-----------vrpta 209 (318) T protein:vir:86 148 AIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITT----KAKSAGTTPFANAIE---EAVDF-----------VRPTA 209 (318) T ss_pred HHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhh----hhhccCCCchhhHHH---HHHhh-----------hccCC Confidence 999 899999999999886544421 1111000000 001122223333221 11111 12334 Q ss_pred CceEEEEchhhHHHHHhhhhccCCCCceeec--------cCCCce---EEecCCCCCccEEEEeccceEEEecceeeEee Q lcl|Aclame:pro 263 GNVTMVVNPSDAFEVQAQYTHLNANGVYVTA--------LPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQK 331 (381) Q Consensus 263 ~~~~~imn~~~~~~~~~~~~~~~~~G~~~~~--------l~~g~~---vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~ 331 (381) |...+++...+.-.++-.+..-++|.. +.. .-.|+. |+.....-.-.+ +-| .+|.|-+ ++++ . 
T Consensus 210 grrylivkaedrkalldelrqatanah-vriknddteiasevgvdeiivytgskalkptv-lvd-qkyhidm-qdlt--k 283 (318) T protein:vir:86 210 GRRYLIVKAEDRKALLDELRQATANAH-VRIKNDDTEIASEVGVDEIIVYTGSKALKPTV-LVD-QKYHIDM-QDLT--K 283 (318) T ss_pred CceEEEEeecchHHHHHHHHhhcccce-eEEeccchhhhhhcCcceeeeeecccccccee-eec-cceecch-hhhh--h Confidence 444566665554433322211222221 100 001111 111110000011 111 1233311 1111 0 Q ss_pred ehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 332 FKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 332 ~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~ 366 (381) -+-.-+..+.--+..-...-|-+---+|-+|.++. T Consensus 284 vdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 284 VDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred hhcceeccCCceEEEeecccCcceeecCceeEEeC Confidence 01111222222222222334444444555555444 No 186 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=83.97 E-value=0.064 Score=27.12 Aligned_cols=341 Identities=8% Similarity=-0.056 Sum_probs=138.5 Q ss_pred CC----------------c-----------------------cHHHHHHHHHHHHHHHHhhhhHHHH----------HHH Q lcl|Aclame:pro 1 MT----------------I-----------------------NLSETFANAKNEFINAVNNGEPQER----------QNE 31 (381) Q Consensus 1 m~----------------~-----------------------~l~~~~~e~~~~~~~~~~~~~~~~~----------~~~ 31 (381) .. + +..-.+++.+.++++++........ .-. 
T Consensus 224 ~~De~airAq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~g~ 303 (652) T protein:vir:79 224 VVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGN 303 (652) T ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEeeccch Confidence 11 0 0000111122222222210000000 000 Q ss_pred HH-HHHHHHHHHHHHH---HH--------HHHHHHHHHhhhhhccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHH Q lcl|Aclame:pro 32 LY-GDMINQLFEETKL---QA--------KAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFED 99 (381) Q Consensus 32 ~~-~~~~~~~~~~~~~---~~--------~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~ 99 (381) .. ..+.++|..+... +. -.+..+..+..++.....-...+.....-..++++ .|--+.+-+-+. T Consensus 304 ~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsD----Fp~IL~~~~nk~ 379 (652) T protein:vir:79 304 FVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSD----FGNILLDVANKA 379 (652) T ss_pred hhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcch----HHHHHHHHHHHH Confidence 00 0000111100000 00 00111112222221111111111111111122333 233333333333 Q ss_pred HHhh-----hhhhhhceeeecCC--ceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcCh Q lcl|Aclame:pro 100 LTTN-----HPLLADLGIKNAGL--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP 172 (381) Q Consensus 100 l~~~-----~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~ 172 (381) |++. ...+..|+..+++- ..+..+..+-+.-.-+.|++|.+--+- .=+..++...+++.++.||++.+-.-. 
T Consensus 380 l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~-~e~~e~~~l~tyG~~~~iTRqaiINDD 458 (652) T protein:vir:79 380 ILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTT-GDKQATIALATYGELFSITRQAIINDD 458 (652) T ss_pred HHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeee-cCccceeeeecccCeeeeehheeeccc Confidence 3322 23455555544432 112233344555556778888764322 225667888899999999999988777 Q ss_pred hHHHHHHHHHHHHHHHHHHhhhe---eeccCCC--cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHH Q lcl|Aclame:pro 173 AWIERFVRVQIEEAFAVALETAF---LKGTGKD--QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVF 247 (381) Q Consensus 173 ~~l~~~i~~~la~a~a~~~d~a~---l~G~G~~--qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~ 247 (381) .+.-.-|...++++.++.+++.+ |.+..+- --+.+|.....++-.++ . ..+...+......+. T Consensus 459 L~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~-a-----------a~~~~~l~~ar~aM~ 526 (652) T protein:vir:79 459 LNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLES-A-----------AMDVASLDKARQLMR 526 (652) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeeccccccccccc-c-----------cCCHHHHHHHHHHHH Confidence 88888889999999888887544 3333210 11234521111111111 0 111111111111111 Q ss_pred HHhhhccccccccccCceEEEEchhhHHHHHhhh---hccCCC--CceeeccCCC-ceEEecCCCCCcc---EEEEecc- Q lcl|Aclame:pro 248 KYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNAN--GVYVTALPFN-LNVIESTVQEAGK---VLTYVKG- 317 (381) Q Consensus 248 ~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~---~~~~~~--G~~~~~l~~g-~~vi~s~~~p~~~---i~~gd~s- 317 (381) .+ .++...-......|++.+.-...-..+. ...+++ ...+.-+ .+ ..||.++.+.++. .++++-. 
T Consensus 527 ~Q----k~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~-~~~~~~i~eprL~~~s~~~wylaa~~~ 601 (652) T protein:vir:79 527 VQ----KEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPV-KDFATVIAEPRLDDNSQTTFYLAASKG 601 (652) T ss_pred Hh----ccCCccccccccEEEecchhHHHHHHHhccCCCccccccccccccc-ccccccccccccCCCCcccEEEecCCC Confidence 11 1233222233456777776443332222 111111 1011101 12 2456666654432 2222211 Q ss_pred c--e---EEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEE Q lcl|Aclame:pro 318 L--Y---DGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKL 365 (381) Q Consensus 318 ~--y---~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l 365 (381) . + ++.-.++..|+. +..|..|-+-|++..-++.+++|-.+++-.+- T Consensus 602 ~dtiev~yL~G~~~P~ie~--~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 602 SDTIEVAYLNGVDTPYIDQ--MEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CCeEEEEEecCCCCCeeee--cCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 1 1 122233444443 33599999999999999999999988764322 No 187 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=82.44 E-value=0.077 Score=26.69 Aligned_cols=288 Identities=14% Similarity=0.084 Sum_probs=130.9 Q ss_pred ccHHHHHHHHHHh----c--c-cCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEecc Q lcl|Aclame:pro 65 LSANQRSFFMDIN----K--N-VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~----~--~-~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~e 135 (381) ++.+-|+.|+++. + + .+....|.|-+.+...+.+.+.+.+-+++..+++++.- +..+....+.+.++-... 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC Confidence 3333333333221 1 1 13356788999999999999999999999999887642 233443333333322211 Q ss_pred c-ccccccccccccceeccceeeeeehhhhHHHHhcC--hhHHHHHHHHHHHHHHHHHHhhheeeccCCC-------cc- Q lcl|Aclame:pro 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG--PAWIERFVRVQIEEAFAVALETAFLKGTGKD-------QP- 204 (381) Q Consensus 136 ~-~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds--~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~-------qP- 204 (381) + .+..+..-..++.-.+..++.-.-..|+.++|+.- ..+|..-+++.+.++++.=.-.--+||+-.. .| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 1 11211111223444444444444567888888753 2468888888888888765544456776421 23 Q ss_pred -----eeeeecccc---cccccccccc-ccchhhhc-cccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|Aclame:pro 205 -----IGLNRQVQK---GVSVTEGAYP-EKEEQGTL-TFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 205 -----~Gil~~~~~---~~~~~~~~~~-~~~~~~~~-t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~~ 272 (381) +|++..+-. ....+.++.. .+..+++. .+.+-+.++..+.+ .+ -+..++ +..+.||-+. 
T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~---~l------I~~~~~~d~dLVvivG~d 231 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVS---SM------IDPWFQEDTGLVVICGRE 231 (337) T ss_pred ccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCCEEEEEchh Confidence 355532211 0011111111 11111111 22233333222211 00 012222 3577888876 Q ss_pred hHHHHHhhhhccCCCCce--------ee--ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh---- Q lcl|Aclame:pro 273 DAFEVQAQYTHLNANGVY--------VT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET---- 335 (381) Q Consensus 273 ~~~~~~~~~~~~~~~G~~--------~~--~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~---- 335 (381) -.......+. +..+.+ +. ...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ T Consensus 232 Lladk~~~l~--n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 309 (337) T protein:vir:78 232 LLHDKYFPIV--NATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIE 309 (337) T ss_pred hhHHHHHHHH--hcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEecccccccc Confidence 5543222211 111111 11 112388999999999999887666553222 222222 2111111 Q ss_pred hhhcCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 336 ~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) .+..-..+|..-.+.-+..++ .+++... 
T Consensus 310 ~y~s~Ne~YvVEd~~~~a~iE-------nI~~~~a 337 (337) T protein:vir:78 310 NYESSNDAYVVEDFGCGCVAE-------NIELAAA 337 (337) T ss_pred chhhccceeeeeccccEEEEe-------ceeecCC Confidence 011112233333222222222 2233222 No 188 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=81.30 E-value=0.087 Score=26.40 Aligned_cols=189 Identities=13% Similarity=0.066 Sum_probs=82.0 Q ss_pred eeehhhhHHHHh-----cChhHHHHHHHHHHHHHHHHHHhhheee----ccCCCcceeeeeccccccccccccccccchh Q lcl|Aclame:pro 158 TAFVVLPKDLND-----FGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 158 ~~~~~iS~ell~-----ds~~~l~~~i~~~la~a~a~~~d~a~l~----G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ---..+|.-+++ ++..|+.+...++++++++...|+.++. |....-|..- ...++... ..+. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~--------~~~g~~~~-~~a~ 71 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTG--------QDGGFSVN-IGAG 71 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc--------cccCccee-cccc Confidence 111223333333 4677888899999999999999988742 3222222100 00000000 0000 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhcc-------CCCCceee----ccCCC Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-------NANGVYVT----ALPFN 297 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~-------~~~G~~~~----~l~~g 297 (381) ...++..+.+.+.++...+ +.+..+ ..+.+++++|..++.+++..+.. +.+|.... 
....| T Consensus 72 ---~t~~~~~l~dai~~a~~~L----dekdVP-~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G 143 (221) T protein:vir:17 72 ---NTNNAQAIVDGFFEAAAVL----DERSAP-MDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAG 143 (221) T ss_pred ---ccCCHHHHHHHHHHHHHHH----hhcCCC-CCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecC Confidence 0012233333333222222 222222 23445678998888887642211 12222111 11248 Q ss_pred ceEEecCCCCC--ccEEEEeccceEE--EecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccC Q lcl|Aclame:pro 298 LNVIESTVQEA--GKVLTYVKGLYDG--YLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 298 ~~vi~s~~~p~--~~i~~gd~s~y~i--~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~ 373 (381) ++|+.|+++|. ++-+..+.+.+.. .....++.. |.+ .=|.+.+++|... +|.-+++.- T Consensus 144 ~~V~~SnnlP~~~gt~~~~~ag~~~~~~~~~~~yr~~------fs~----------~~glv~~~~Avgt--vkl~~~~~~ 205 (221) T protein:vir:17 144 IRIYKSNVLASLYGTNLVTDPGDATTSGENNGSYRPA------ITD----------RAGLVFHKEAADT--VEVLLPPSR 205 (221) T ss_pred cEEEEeccCCcccccccccCCcccccccccccccccc------ccc----------eEEEEEcchheee--eeeecCCCC Confidence 99999999996 2222222121111 000011111 111 1288889998554 555443322 Q ss_pred C--CCCCCCC Q lcl|Aclame:pro 374 L--EGTEETL 381 (381) Q Consensus 374 ~--~~~~~~~ 381 (381) | ...-=.+ T Consensus 206 ~~~~~~~~~~ 215 (221) T protein:vir:17 206 PPLVISMFSI 215 (221) T ss_pred Cceeeeeeec Confidence 2 1111111 No 189 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=81.10 E-value=0.089 Score=26.35 Aligned_cols=291 Identities=12% Similarity=0.048 Sum_probs=131.9 Q ss_pred ccHHHHHHHHHH----hc--cc-----CCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceE Q lcl|Aclame:pro 65 LSANQRSFFMDI----NK--NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAV 131 (381) Q Consensus 65 
lt~~e~~~~~~~----~~--~~-----~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~ 131 (381) +..+-|+.|+++ .+ +. +..--|.|-+.+...+.+.+.+.+-+++..+++++.- +..+....+.+.++ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 222223333322 11 22 2223588999999999999999999999999888642 23444443333333 Q ss_pred Eecc--cccccccccccccceeccceeeeeehhhhHHHHhcC--hhHHHHHHHHHHHHHHHHHHhhheeeccCCC----- Q lcl|Aclame:pro 132 WGKI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG--PAWIERFVRVQIEEAFAVALETAFLKGTGKD----- 202 (381) Q Consensus 132 w~~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds--~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~----- 202 (381) -..- .++.....-..++.-.+..++.-.-..|+.++|+.- ..+|..-+++.+.++++.=.-.--+||+-.. T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 2211 112222222234455555555555567888888753 2468888888888888765544456776421 Q ss_pred --cc------eeeeeccc---cccccccccccccchhhhc-cccChhHHHHHHHHHHHHhhhcccccccccc--CceEEE Q lcl|Aclame:pro 203 --QP------IGLNRQVQ---KGVSVTEGAYPEKEEQGTL-TFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMV 268 (381) Q Consensus 203 --qP------~Gil~~~~---~~~~~~~~~~~~~~~~~~~-t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~i 268 (381) .| +|++..+- ......+++..++..+++. 
.+.+-+.++..+.+ .+ -+..++ +..|.| T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~---~l------I~~~~~~d~dLVvi 231 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATE---EL------IDEWHRDDTDLVVI 231 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHh---cc------CChHHhcCCCEEEE Confidence 23 35553211 1111112221121111211 22233332222111 00 012222 357788 Q ss_pred EchhhHHHHHhhhhcc-CCC-----Ccee--eccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh-- Q lcl|Aclame:pro 269 VNPSDAFEVQAQYTHL-NAN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET-- 335 (381) Q Consensus 269 mn~~~~~~~~~~~~~~-~~~-----G~~~--~~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~-- 335 (381) |-+.-.......+..+ +.+ ++-+ ....=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ T Consensus 232 vG~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 311 (342) T protein:vir:10 232 TGRKLLADKYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDR 311 (342) T ss_pred EchhhhHHHHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 8876554322211111 100 1111 0112388999999999999887655553222 222222 2111111 Q ss_pred --hhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCC Q lcl|Aclame:pro 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 336 --~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~ 379 (381) .+..-..+|..-.+.-+..++ .+++.. 
+| T Consensus 312 ie~y~s~Ne~YvVEd~~~~a~iE-------~i~i~~--------~~ 342 (342) T protein:vir:10 312 IETYESENIDYVVEDYGCAALIE-------NITLKD--------KE 342 (342) T ss_pred ccchhhhccceeeeccccEEEee-------cceecC--------CC Confidence 011112233322222222222 122222 12 No 190 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=80.94 E-value=0.09 Score=26.31 Aligned_cols=354 Identities=10% Similarity=0.050 Sum_probs=128.5 Q ss_pred CCccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---H Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---F 72 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~ 72 (381) |+|+-++++.||=+-+++. |.+. +...-.+.+++..+.++.+. +++...........|.+.+.. . T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~~~~i~~~-~~~~~a~~~enq~~~~~~~~------~~~~~~~~~~~~~~l~e~~~~~~~~ 73 (521) T protein:vir:72 1 MTIKTKAELLNKWKPLLEGEGLPEIANS-KQAIIAKIFENQEKDFQTAP------EYKDEKIAQAFGSFLTEAEIGGDHG 73 (521) T ss_pred CCcchhHHHHHhhhhhhccCCCCccccc-hhhhhhhhhhhhhhhhhhcc------cccchHHHHHHhhhhhhhcccCccc Confidence 9999999999998888876 1111 11111122232222221111 111111111111111111000 0 Q ss_pred HH--HHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEE------EEecCC--------------cce Q lcl|Aclame:pro 73 FM--DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKF------LKSETS--------------GVA 130 (381) Q Consensus 73 ~~--~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~i------p~~~~~--------------~~a 130 (381) ++ .+.+++.+..=.-.-|.+.. ++++.-..=.-.+++.|.|++|.+-. -..... 
+.+ T Consensus 74 ~~~~~iaes~~t~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da 152 (521) T protein:vir:72 74 YNATNIAAGQTSGAVTQIGPAVMG-MVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDA 152 (521) T ss_pred cCcccccccccccccccCCchhhh-HHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhccccc Confidence 00 01111111110001122221 11111111122455667766543210 000000 000 Q ss_pred EEec---------------------------------------------------------------------c-----c Q lcl|Aclame:pro 131 VWGK---------------------------------------------------------------------I-----Y 136 (381) Q Consensus 131 ~w~~---------------------------------------------------------------------e-----~ 136 (381) .|.+ . . T Consensus 153 ~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~ 232 (521) T protein:vir:72 153 MFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSI 232 (521) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhh Confidence 0000 0 0 Q ss_pred ccc----cccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHHhhheeeccC- Q lcl|Aclame:pro 137 GEI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKGTG- 200 (381) Q Consensus 137 ~~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~d~a~l~G~G- 200 (381) +|. -..++..|.+..|...|.++- ...|-||.+| -.+|.|++|.+.|+..|...+|+.||. 
+= T Consensus 233 aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~-~i~ 311 (521) T protein:vir:72 233 AELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD-WIN 311 (521) T ss_pred hhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhh-hhh Confidence 000 001123366666666655433 4467777766 356899999999999999999999983 20 Q ss_pred C-Cc--ceeeeeccccccccccccccccchhhhcccc---C---hhHHHHHHHHHHHHhhhcccccccc-ccCce-EEEE Q lcl|Aclame:pro 201 K-DQ--PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFA---N---PRATVNELTQVFKYHSTNEKGKSVA-VKGNV-TMVV 269 (381) Q Consensus 201 ~-~q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~---~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~im 269 (381) . .+ -.|+-+ +.+ ...+...+. + .......+..++..+-...+...+. -++++ ..|+ T Consensus 312 ~sa~~g~~g~t~--------~~~-----~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~ 378 (521) T protein:vir:72 312 YSAQVGKSGMTL--------TPG-----SKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIA 378 (521) T ss_pred heeeeeeeeeee--------ccC-----ccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEE Confidence 0 00 112210 000 001111111 1 1111111111221111111111111 12333 3467 Q ss_pred chhhHHHHHhhhh------ccC-CCC--------ceeeccCCCceEEecCCCCCccEEEEeccce-------EEEeccee Q lcl|Aclame:pro 270 NPSDAFEVQAQYT------HLN-ANG--------VYVTALPFNLNVIESTVQEAGKVLTYVKGLY-------DGYLAGGI 327 (381) Q Consensus 270 n~~~~~~~~~~~~------~~~-~~G--------~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y-------~i~~r~~~ 327 (381) ++.-...+ .... .++ +.| .+.-.|.-+++|+.+.+.|.+-++.|-.... +-=-.... 
T Consensus 379 S~~Va~~L-~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~ 457 (521) T protein:vir:72 379 SRNVVNVL-ASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALT 457 (521) T ss_pred chHHHHHH-hhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccc Confidence 76543322 2110 011 111 1233455678999999998877766633111 00000111 Q ss_pred eEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEE---EEEecccccCCCCCCCCC Q lcl|Aclame:pro 328 NVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVW---KLDLKGHKPALEGTEETL 381 (381) Q Consensus 328 ~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~---~l~~~~~~~~~~~~~~~~ 381 (381) .+...|...| |-.+-...|+ |-.++|-+-..- .-++.+..|....+...+ T Consensus 458 ~~~~~dp~sf---qP~~g~~tRY-~l~~NP~~~~~~~~~a~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:72 458 PLRGSDPKNF---QPVMGFKTRY-GIGINPFAESAAQAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred cccccCCccc---cceeeeeeee-ceeecCcccccCcccceeecCcChhhhcCcccc Confidence 1112233323 2233333444 444554321100 011223333333333333 No 191 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=78.28 E-value=0.12 Score=25.71 Aligned_cols=291 Identities=13% Similarity=0.051 Sum_probs=131.9 Q ss_pred ccHHHHHHHHHHhc-------ccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCC--ceEEEEecCCcceEEecc Q lcl|Aclame:pro 65 LSANQRSFFMDINK-------NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~~-------~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~--~~~ip~~~~~~~a~w~~e 135 (381) ++.+-|+.|+++.. 
-.+.+..|.|-+.+...+...+.+.+-+++..+++++.- +..+....+.+.++-..- T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 33333333332211 123456789999999999999999999999999887642 234444333333322111 Q ss_pred -cccccccccccccceeccceeeeeehhhhHHHHhcC--hhHHHHHHHHHHHHHHHHHHhhheeeccCCC-------cc- Q lcl|Aclame:pro 136 -YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG--PAWIERFVRVQIEEAFAVALETAFLKGTGKD-------QP- 204 (381) Q Consensus 136 -~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds--~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~-------qP- 204 (381) ..+..+..-..++.-.+..++.-.-..|+.++|+.- ..+|..-+++.+.++++.=.-.--+||+-.. .| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 112221111234444555555555567888888753 2468888888888888765444445776421 23 Q ss_pred -----eeeeecccc---ccccccccc-cccchh-hh-ccccChhHHHHHHHHHHHHhhhcccccccccc--CceEEEEch Q lcl|Aclame:pro 205 -----IGLNRQVQK---GVSVTEGAY-PEKEEQ-GT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNP 271 (381) Q Consensus 205 -----~Gil~~~~~---~~~~~~~~~-~~~~~~-~~-~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~imn~ 271 (381) +|+|..+-. ...-..++. 
..+..+ ++ ..+.+-+.++..+.+ .+ -+..|+ +..+.||-+ T Consensus 161 lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~---~l------Id~~~~~d~dLVvivG~ 231 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITN---HL------VEPWYAEDPDLVVVCGR 231 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHh---cc------CChHHhcCCCEEEEEch Confidence 355432111 000111111 011111 11 012232333222221 00 012222 357778887 Q ss_pred hhHHH-HHhhhhccCC-----CCceee--ccCCCceEEecCCCCCccEEEEeccceEEE-ecceee--Eeeehhh----h Q lcl|Aclame:pro 272 SDAFE-VQAQYTHLNA-----NGVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET----L 336 (381) Q Consensus 272 ~~~~~-~~~~~~~~~~-----~G~~~~--~l~~g~~vi~s~~~p~~~i~~gd~s~y~i~-~r~~~~--i~~~~~~----~ 336 (381) .-... ..++....+. .++.+. ...=|+|.+.-+++|.+.+++--|+..-|+ .++..+ +.-.+++ . T Consensus 232 dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 311 (339) T protein:vir:79 232 NLLSDKYFPLVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIEN 311 (339) T ss_pred hhhhhHhhhHhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccc Confidence 65442 2222211000 011111 112388999999999999887666553222 222222 2111111 1 Q ss_pred hhcCceEEEEEEEEcCEEecCcceEEEEEEecccccC Q lcl|Aclame:pro 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 337 ~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~ 373 (381) +..-..+|..-.+.-+..++ .+++. 
..| T Consensus 312 y~s~Ne~YvVEd~~~~a~iE-------ni~~~--~aa 339 (339) T protein:vir:79 312 YESSNDAYVIEDLACAAMAE-------NIALA--AAA 339 (339) T ss_pred hhhccceeeeeccccEEEee-------eeecc--cCC Confidence 11112234333332222222 12221 111 No 192 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=74.51 E-value=0.16 Score=24.99 Aligned_cols=354 Identities=8% Similarity=0.022 Sum_probs=119.8 Q ss_pred ccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---HH- Q lcl|Aclame:pro 3 INLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FF- 73 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~~- 73 (381) |+-.+++.||=.-+++. |++.-++..-.+.+++..+.+..+. .++........+..|.+.+-. -+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~------~~~~~~~~~~~~~~l~ea~~~~~~~~~ 74 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDP------IYKDEKVVEAFGGFIAEAEVAGDHGYD 74 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccc------cccchHHHHhhhhhccccccccccCCc Confidence 77777777777777664 2222222212222333222222211 111111111111222221100 00 Q ss_pred -HHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEE------EecCC--------------cc--- Q lcl|Aclame:pro 74 -MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFL------KSETS--------------GV--- 129 (381) Q Consensus 74 -~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip------~~~~~--------------~~--- 129 (381) ..+.+++.+..=--.-|.+.. ++++.-..=.-.+++.|.|++|.+-.. ..... +. 
T Consensus 75 ~~~i~es~~t~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~f 153 (528) T protein:vir:80 75 ASQIAAGQTTGAITNVGPAVIG-MVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFH 153 (528) T ss_pred cccccccccccccccCCchhhh-HHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCcccccccccccccccccccc Confidence 001111111100001122211 111111111223556666665431000 00000 00 Q ss_pred ---------------------------------------------------------------------------eEEec Q lcl|Aclame:pro 130 ---------------------------------------------------------------------------AVWGK 134 (381) Q Consensus 130 ---------------------------------------------------------------------------a~w~~ 134 (381) ..-.. T Consensus 154 S~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~ 233 (528) T protein:vir:80 154 SSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIA 233 (528) T ss_pred ccccccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccc Confidence 00000 Q ss_pred c-----ccc----ccccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 135 I-----YGE----IKGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 135 e-----~~~----~~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~d~a 194 (381) . .+| .-..++..|.+..|...|..+- ...|-||.+| -.+|.|++|.+.|+..|...+|+. 
T Consensus 234 ~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINRe 313 (528) T protein:vir:80 234 FGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINRE 313 (528) T ss_pred cccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 0 000 0011223366666666665432 4467777766 457899999999999999999999 Q ss_pred eeeccCCC-c--ceeeeeccccccccccccccccchhhhc--cccChhHHHHHHHHHHHHhhhccccccccccCceEEEE Q lcl|Aclame:pro 195 FLKGTGKD-Q--PIGLNRQVQKGVSVTEGAYPEKEEQGTL--TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) Q Consensus 195 ~l~G~G~~-q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~--t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~im 269 (381) ||.=-.+. + -.|+...+. ...|.. |-...... ..-..+..-..+..+.+......... .+-+.-..++ T Consensus 314 ii~~i~~~a~~~~~~~t~~~~----~~~G~~-dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T--~~~~gn~vi~ 386 (528) T protein:vir:80 314 IVDVINFTAQVGKTGMTQTVG----SKAGVF-DLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQT--GRGAGNFVIA 386 (528) T ss_pred HHhhhhheeeeeeeeeeeccc----ccccee-eccccccccccchhHHHHHHHHHHHHHHHHHHHHhh--ccccccEEEE Confidence 95311110 1 112211000 000000 00000000 00001111111222222211111111 1112234677 Q ss_pred chhhHHHHHhhh-------------hccCCCC-ceeeccCCCceEEecCCCCCccEEEEeccc-------eEE-Eeccee Q lcl|Aclame:pro 270 NPSDAFEVQAQY-------------THLNANG-VYVTALPFNLNVIESTVQEAGKVLTYVKGL-------YDG-YLAGGI 327 (381) Q Consensus 270 n~~~~~~~~~~~-------------~~~~~~G-~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~-------y~i-~~r~~~ 327 (381) ++.-...+...- ...+..+ .+.-.|.-+++|+.+.+.|.+-++.|-... |+- +. .+. 
T Consensus 387 S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv-~l~ 465 (528) T protein:vir:80 387 SRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYV-ALT 465 (528) T ss_pred chHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccc-cce Confidence 776444332110 0011111 223345557889999998877666653211 110 00 111 Q ss_pred eEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 328 NVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 328 ~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) -....|...| |-.+-...|+ |-.++| |+. ...+.+..-....+. T Consensus 466 ~~~~~dp~sf---qP~~g~~tRY-~l~~NP--~~~----~~~~~~~~r~~~g~~ 509 (528) T protein:vir:80 466 PLRATDPQSF---HPVLGFKTRY-GIGINP--FAD----SKSQAPSARITSGML 509 (528) T ss_pred eeEeeCCccc---cceeeeeeee-ceeecC--ccc----ccCCcccccccccch Confidence 1122233323 2222223333 334444 211 000000000000011 No 193 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=70.84 E-value=0.2 Score=24.37 Aligned_cols=268 Identities=12% Similarity=0.024 Sum_probs=86.6 Q ss_pred ccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhce-----ee--ecCCc-eEEEEecCC-cce----E Q lcl|Aclame:pro 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG-----IK--NAGLR-LKFLKSETS-GVA----V 131 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~-----v~--~~~~~-~~ip~~~~~-~~a----~ 131 (381) ++-....+|| +.+....++.+.+.....+.+. .. +..|. ..+|.-.+. +.. . 
T Consensus 1 m~lsD~~vfN---------------~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~ 65 (325) T protein:vir:95 1 MALSDLAVYS---------------EYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRN 65 (325) T ss_pred Cchhhhhhhh---------------hhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeecccccccccccccccc Confidence 1111111222 2222333333333222222211 11 12344 345543221 110 1 Q ss_pred EecccccccccccccccceeccceeeeeehhhhHHHH---hcChhHHHHHHHHHHHHHHHHHHhhheeecc-CCCcceee Q lcl|Aclame:pro 132 WGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGT-GKDQPIGL 207 (381) Q Consensus 132 w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell---~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~-G~~qP~Gi 207 (381) +.+ .+......-.++.++....+.-.++.....+.+ .+.... +.+.+++.+++...+.++++- |. -.|. T Consensus 66 ~~~-~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~----~~~~Ig~~~a~~~~~~~l~~~~~~--l~~a 138 (325) T protein:vir:95 66 AYG-SGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEV----AGAAMGQQLAVDTMADMLNVGLGS--VYSA 138 (325) T ss_pred CCC-CceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHH----HHHHHHHHHHHHHHHHHHHHHHHH--HHHh Confidence 111 122222111223333333322222222111111 111222 223344444433332222210 00 0000 Q ss_pred eeccccccccccccccccchhh-hccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc--- Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEEQG-TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--- 283 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~-~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~--- 283 (381) ++.. ++.+....+..+ .....+..+ +.+....+. +... .=..|+||...+..++++... 
T Consensus 139 ~~~~------~~~v~dis~~~~~~~~~~s~~~----l~~A~~klG---D~~~----~l~~~~MHS~v~~~L~~~~L~~~~ 201 (325) T protein:vir:95 139 LSQV------SDVVYDATANTDAADKLPTWNN----LNNGQAKFG---DQSS----QIAAWIMHSTPMHKLYGSNLTNGE 201 (325) T ss_pred hccc------ccceeeeecccCcccccccHHH----HHHHHHHhc---cccc----ceeEEEEchHHHHHHHHhhccccc Confidence 1000 000000000000 000011111 111111111 1111 013589999999988765432 Q ss_pred --cCCCCceeeccCCCceEEecCCCCCccEE-EEeccceEEEecceeeEeeehhh--------hhhcCceEEEEEEEEcC Q lcl|Aclame:pro 284 --LNANGVYVTALPFNLNVIESTVQEAGKVL-TYVKGLYDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYG 352 (381) Q Consensus 284 --~~~~G~~~~~l~~g~~vi~s~~~p~~~i~-~gd~s~y~i~~r~~~~i~~~~~~--------~~~~d~~~~~~~~r~dg 352 (381) .+..|..+.....|++||.++.||....- -+-+.-|.++ .+.+.+-...+. .-.....+|++... T Consensus 202 ~~~~~~g~~~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg-~GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t--- 277 (325) T protein:vir:95 202 RLFTYGTVNVVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLV-PGGVLIGQNNDFDANEETKNGDENIIRTYQAEWS--- 277 (325) T ss_pred cccccCCcccccccCCcEEEEeCCCCCCCccCceeEEEEEEe-cCeEEecCCCCccccccccCcccceeeeeeeeee--- Confidence 23445443334569999999999954310 0111112221 222323222221 11223444443221 Q ss_pred EEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 353 KAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 353 k~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) -.+++..+.-- -...+.+ ++++- T Consensus 278 f~lhp~G~sw~-~s~~g~s-----Pt~ae 300 (325) T protein:vir:95 278 YNIGVKGFAWD-KANGGKS-----PTDAA 300 (325) T ss_pred EEeecceeeee-cccccCC-----cChHh Confidence 46777777752 1121222 33333 No 194 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=68.71 E-value=0.23 Score=24.05 Aligned_cols=347 Identities=11% Similarity=0.035 Sum_probs=117.7 Q ss_pred CCccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHH--- Q lcl|Aclame:pro 1 
MTINLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSF--- 72 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~--- 72 (381) |+|+-+ ++.||=.-+++. |++.-++..-...+++..+.+.++. .++...+...-.+.|.+.+..- T Consensus 1 ~~~~~~-~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~e~~~~~l~e~~~~~~~~ 73 (529) T protein:vir:10 1 MSLKTK-EILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDP------VYRDDKLIEAFGQSLMEAEVAGDHG 73 (529) T ss_pred CccchH-HHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhccc------ccchhhhhhhhhhccchhhcccccc Confidence 999766 466666666554 2222112212222333222221111 1111111111111222211100 Q ss_pred H--HHHhcccCCCCce-EccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEE------EecCC---------------- Q lcl|Aclame:pro 73 F--MDINKNVNYKEEK-LLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFL------KSETS---------------- 127 (381) Q Consensus 73 ~--~~~~~~~~~~gg~-lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip------~~~~~---------------- 127 (381) + ..+.+++.+ |.. -.-|.+.. ++++.-..=.-.+++.|.|+++.+-.. ..... T Consensus 74 ~~~~~ia~s~~t-~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~d 151 (529) T protein:vir:10 74 YDPTNIAAGQSS-GAITNIGPAVIG-MVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPD 151 (529) T ss_pred cccccccccccc-cccccccchhhh-hHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccc Confidence 0 000111111 110 00111111 111111111123445555554321000 00000 Q ss_pred ---------------------------------cceEEec---------------------------------------- Q lcl|Aclame:pro 128 ---------------------------------GVAVWGK---------------------------------------- 134 (381) Q Consensus 128 ---------------------------------~~a~w~~---------------------------------------- 134 (381) +...|.. 
T Consensus 152 t~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~ 231 (529) T protein:vir:10 152 AWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELA 231 (529) T ss_pred ccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCcccccccccccccccc Confidence 0000000 Q ss_pred ---c-----cccc----cccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 135 ---I-----YGEI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVAL 191 (381) Q Consensus 135 ---e-----~~~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~ 191 (381) + .+|. -..+.-.|.+..|...|..+- ...|-||.+| -.+|.|++|.+.|+..|...+ T Consensus 232 ~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEI 311 (529) T protein:vir:10 232 EIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEI 311 (529) T ss_pred ccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHh Confidence 0 0000 001122356666665555432 4467777766 356899999999999999999 Q ss_pred hhheee--------cc-CC----CcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 192 ETAFLK--------GT-GK----DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKS 258 (381) Q Consensus 192 d~a~l~--------G~-G~----~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (381) |+.||. |. |- +...|++.-.. ..... +. . -.....-..+..+.+........ 
T Consensus 312 NReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~-~~d~~-~~--------~---~~~e~~~~L~~~i~~~an~I~~~-- 376 (529) T protein:vir:10 312 NREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQD-PIDVR-GA--------R---WAGESYKALLIQIDKEANEIARQ-- 376 (529) T ss_pred hHHHHHHhhhhceeeeeeeeccccccccceeccc-ccccc-cc--------c---hhHHHHHHHHHHHHHHHHHHHHh-- Confidence 999986 11 10 01222221000 00000 00 0 00111111111222211111110 Q ss_pred ccccCc-eEEEEchhhHHHHHhhhh----c----------c-CCCCceeeccCCCceEEecCCCCCccEEEEeccc--eE Q lcl|Aclame:pro 259 VAVKGN-VTMVVNPSDAFEVQAQYT----H----------L-NANGVYVTALPFNLNVIESTVQEAGKVLTYVKGL--YD 320 (381) Q Consensus 259 ~~~~~~-~~~imn~~~~~~~~~~~~----~----------~-~~~G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~--y~ 320 (381) --+++ -..++++.-...+ .... . . +..+.+.-.|.-+++|+.+.+.|.+-++.|-... |. T Consensus 377 -T~rg~~n~vi~S~~Va~~L-~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 454 (529) T protein:vir:10 377 -TGRGAGNFIIASRNVVSAL-ALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLD 454 (529) T ss_pred -hccccceEEEEchHHHHHH-hhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccc Confidence 11222 2456776543333 1100 0 0 0112333345567899999999877776663311 11 Q ss_pred -----E-EecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 321 -----G-YLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 -----i-~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) - +. ...-....|...| |-.+-...|+ |-.++|-+-.. +-.....-+-|.+-.. 
T Consensus 455 ~glfy~PYv-~l~~~~~~dp~sf---qP~~g~~tRY-~l~~NP~~~~~---~~~~~~r~~~g~~~~~ 513 (529) T protein:vir:10 455 AGIYYCPYV-ALTPLRGSDPKNF---QPVMGFKTRY-AIGVNPFAESR---TQAPTSRISNGMPGAH 513 (529) T ss_pred cceeecccc-ccccccccCCCcc---cceeeeeeee-ceeecCccccc---cccccccccCCcchhh Confidence 0 00 0011112222222 2233333344 44455422110 0000001111211111 No 195 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=65.08 E-value=0.29 Score=23.54 Aligned_cols=346 Identities=9% Similarity=-0.000 Sum_probs=124.2 Q ss_pred CCc-cHHHHHHHHHHHHHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---H Q lcl|Aclame:pro 1 MTI-NLSETFANAKNEFINAVNNGE----PQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---F 72 (381) Q Consensus 1 m~~-~l~~~~~e~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~ 72 (381) |+- +-++++.||=+-+++.-...+ +...-.+.+++..+.+..+. +++........++.|++.+-. - T Consensus 1 ~~~~~~~e~l~~kw~p~l~~~~~~~~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~~~~~~~l~ea~~~~~~~ 74 (522) T protein:vir:69 1 MTTIKTKAQLVDKWKELLEGEGLPEIANSKQAIIAKIFENQEKDFEVSP------EYKDEKIAQAFGSFLTEAEIGGDHG 74 (522) T ss_pred CCccchHHHHHHhhHHHhcCCCCCccccchhhhhhhhhhhhhHHhhccc------ccchhHHHHhhhhhhhhhccccccC Confidence 765 555557777677766411001 11111222233222222111 111111111122333332210 0 Q ss_pred H--HHHhcccCCCCceEccHHHHHHHHHHHHhh---hhhhhhceeeecCCceEEE------EecC--------------C Q lcl|Aclame:pro 73 F--MDINKNVNYKEEKLLPEETIDRIFEDLTTN---HPLLADLGIKNAGLRLKFL------KSET--------------S 127 (381) Q Consensus 73 ~--~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~---~~l~~~~~v~~~~~~~~ip------~~~~--------------~ 127 (381) + ..+.+++.+. .. ..+.-.++..+|+- =.-.+++.|.|+++.+-.. .... . 
T Consensus 75 ~~~~~i~es~~t~-~v---~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~ne 150 (522) T protein:vir:69 75 YNAQNIAAGQTSG-AV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYA 150 (522) T ss_pred CCccccccccccc-cc---ccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccc Confidence 0 0111222211 10 11111222222111 1224566777765432100 0000 0 Q ss_pred cceEEecc------------------------------------------------------------------------ Q lcl|Aclame:pro 128 GVAVWGKI------------------------------------------------------------------------ 135 (381) Q Consensus 128 ~~a~w~~e------------------------------------------------------------------------ 135 (381) +.+.|.+. T Consensus 151 adt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~Gms 230 (522) T protein:vir:69 151 PDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMA 230 (522) T ss_pred cccccccccccccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccc Confidence 00000000 Q ss_pred --cccc----cccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHHhhheeec Q lcl|Aclame:pro 136 --YGEI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKG 198 (381) Q Consensus 136 --~~~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~d~a~l~G 198 (381) .+|. -..++..|.+..|...|.++- ...|-||.+| -.+|.|++|.+.|+..|...+|+.||. 
T Consensus 231 Ta~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~- 309 (522) T protein:vir:69 231 TSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD- 309 (522) T ss_pred hhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHh- Confidence 0000 001122366666666665433 4567777776 356899999999999999999999873 Q ss_pred cC--CC------------cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccc-cccC Q lcl|Aclame:pro 199 TG--KD------------QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV-AVKG 263 (381) Q Consensus 199 ~G--~~------------qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 263 (381) += +. .+.|++.-.... . ..........+..++..+-...+...+ --++ T Consensus 310 ~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~-~----------------~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg 372 (522) T protein:vir:69 310 WINYSAQVGKSGMTNIVGSKAGVFDFQDPI-D----------------IRGARWAGESFKALLFQIDKEAVEIARQTGRG 372 (522) T ss_pred hhhhhheeeccccccccccccceeeccccc-c----------------cccchhHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 20 01 112222100000 0 000001111111111111111111111 1122 Q ss_pred ce-EEEEchhhHHHHHhhhh------cc--------CCCC-ceeeccCCCceEEecCCCCCccEEEEeccc--e-----E Q lcl|Aclame:pro 264 NV-TMVVNPSDAFEVQAQYT------HL--------NANG-VYVTALPFNLNVIESTVQEAGKVLTYVKGL--Y-----D 320 (381) Q Consensus 264 ~~-~~imn~~~~~~~~~~~~------~~--------~~~G-~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~--y-----~ 320 (381) .+ ..|+++.-...+ .... .+ +..+ .+.-.|.-+++|+.+.+.|.+-++.|-... 
+ + T Consensus 373 ~~n~~i~S~~Va~~L-~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy 451 (522) T protein:vir:69 373 EGNFIIASRNVVNVL-ASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYY 451 (522) T ss_pred cccEEEEchhHHHHH-hhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceee Confidence 22 456777543333 2100 01 1111 122234557889999998877776664321 1 1 Q ss_pred EEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEE-----EecccccCCCCCCCCC Q lcl|Aclame:pro 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKL-----DLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l-----~~~~~~~~~~~~~~~~ 381 (381) -=-.....+...|...| |-.+-...|+ |-.++| |+.-.- .+....|......++- T Consensus 452 aPYv~l~~~~~~dp~sf---qP~~g~~tRY-~l~vNP--~~~~~~~~~~~ri~~g~p~~~~~~~~n 511 (522) T protein:vir:69 452 APYVALTPLRGSDPKNF---QPVMGFKTRY-GIGVNP--FAESSLQAPGARIQSGMPSILNSLGKN 511 (522) T ss_pred ccccccccccccCCccc---cceeeeeeee-ceeecC--cccccCCcccceeecccchhhcccCCc Confidence 00001111112233222 2233333444 555555 222100 1222222222222221 No 196 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=64.96 E-value=0.29 Score=23.52 Aligned_cols=287 Identities=11% Similarity=-0.023 Sum_probs=106.6 Q ss_pred HHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceee---------ecCCc-eEEEEecCC-cc-eEEeccc Q lcl|Aclame:pro 69 QRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---------NAGLR-LKFLKSETS-GV-AVWGKIY 136 (381) Q Consensus 69 e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~---------~~~~~-~~ip~~~~~-~~-a~w~~e~ 136 (381) ..+| ++. +.- .-..+|+.|..-+.+...+.+.|++-.-+. ..+|. ..+|.-... +. ..|.+.. 
T Consensus 1 M~~~-~~~--T~l--~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~ 75 (367) T protein:vir:80 1 MPDF-NNQ--VRL--VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDN 75 (367) T ss_pred Ccch-hhh--hhh--hhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCC Confidence 0000 000 000 113456666554444444444444332121 22343 678864332 21 2222211 Q ss_pred c--cccccccccccceeccc--eeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee---ec---c---CCCc Q lcl|Aclame:pro 137 G--EIKGQLDAAFSEETAIQ--NKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL---KG---T---GKDQ 203 (381) Q Consensus 137 ~--~~~~~~~~~f~~v~l~~--~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l---~G---~---G~~q 203 (381) + +.+...-.+..++-... -|--..-.++..+- .-|..+.|.+++++--.+.....+| .| + ++.+ T Consensus 76 ~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~ls---G~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~ 152 (367) T protein:vir:80 76 PNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELA---GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFA 152 (367) T ss_pred CcccccccccccchheeeeehhcccchhhhHHHHhh---CchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchh Confidence 1 11111111222222222 22223344565553 2366777777777554444333322 11 1 1100 Q ss_pred ce---eeeeccccccccccccccccchh-h-hccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHH Q lcl|Aclame:pro 204 PI---GLNRQVQKGVSVTEGAYPEKEEQ-G-TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) Q Consensus 204 P~---Gil~~~~~~~~~~~~~~~~~~~~-~-~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~ 278 (381) .. |.+.+. .....+....+.+.. + .....++.. +.+....+.. .. 
..=..++||+..+..++ T Consensus 153 ~~~~~~~~~a~--~~~~~~~~~~Dis~~t~~~~~~~s~~~----~~~A~~~lGD-----~~--~~l~~i~mHS~V~~~L~ 219 (367) T protein:vir:80 153 TIKTRGRVPAE--VLGTAGDMVIDISGQTNPADAVFNREA----FVDAAFTMGD-----HV--GSIAAIAVHSMVYKRMT 219 (367) T ss_pred hhhhhhccccc--cccccCceeeeeeccCCCccceecHHH----HHHHHHHhcc-----cc--ccccEEEEchHHHHHHH Confidence 00 000000 000000000111100 0 001122222 2222222211 11 11235799999988887 Q ss_pred hhhh---ccCCCCceeeccCCCceEEecCCCCCc-----c----EEEEe--ccceEEEecceeeEeeehhhhhh--cCce Q lcl|Aclame:pro 279 AQYT---HLNANGVYVTALPFNLNVIESTVQEAG-----K----VLTYV--KGLYDGYLAGGINVQKFKETLAL--DDMD 342 (381) Q Consensus 279 ~~~~---~~~~~G~~~~~l~~g~~vi~s~~~p~~-----~----i~~gd--~s~y~i~~r~~~~i~~~~~~~~~--~d~~ 342 (381) ++.. .+.++|..--..-.|++||+++.||.. . .+||. |.+.-..-..+. +..++.... .++. T Consensus 220 ~~~li~~i~~sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~--E~~Rd~~~~~~gG~d 297 (367) T protein:vir:80 220 NNDEIEFIPDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPV--AVGRRELRGNGSGLE 297 (367) T ss_pred hccccccccCCCCccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccce--ecccchhhhcCCceE Confidence 6532 234455311111138999999999942 1 13322 111111111222 333333321 3344 Q ss_pred EEEEEEEEcCEEecCcceEEEEEEecccc---------cCCCCCCCCC Q lcl|Aclame:pro 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHK---------PALEGTEETL 381 (381) Q Consensus 343 ~~~~~~r~dgk~~~~~Af~v~~l~~~~~~---------~~~~~~~~~~ 381 (381) ...-+.| .++++..+....-.+++++ +..+.++++= T Consensus 298 ~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~e 342 (367) T protein:vir:80 298 YILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLAN 342 (367) T ss_pred EEEeeee---EEeecceeeecccccccccccccccccccccCCCChHH Confidence 4444434 6788888876544443221 1222333332 No 197 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=60.35 E-value=0.37 Score=22.92 Aligned_cols=301 Identities=14% 
Similarity=0.147 Sum_probs=122.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHHH-hc--ccCCCCceEccHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 33 YGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDI-NK--NVNYKEEKLLPEETIDRIFEDLTTNHPLLAD 109 (381) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~-~~--~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~ 109 (381) +.++++. . .+..++- ..+.+..+ .++-+.+.|+- .+ -+-++.-|-+|..+...|...|....|+.+. T Consensus 1 mtnfies----q--navteff-dvlkknsg---kseiknawnaklaengvtitdttfqlprklvesintallntnpvfkv 70 (318) T protein:vir:94 1 MTNFIES----Q--NAVTEFF-DVLKKNSG---KSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKV 70 (318) T ss_pred Cccchhh----h--hhHHHHH-HHHhcccC---hhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceee Confidence 1111110 0 0000000 01111111 11122222211 11 1123445667888888888888899999999 Q ss_pred ceeeecCCceEEEEe-cCCcceEEecccccccccccccccceeccceeeeeehhhhHH--HHhcChhHHHHHHHHHHHHH Q lcl|Aclame:pro 110 LGIKNAGLRLKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKD--LNDFGPAWIERFVRVQIEEA 186 (381) Q Consensus 110 ~~v~~~~~~~~ip~~-~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~e--ll~ds~~~l~~~i~~~la~a 186 (381) ..+.+++.-+ ..++ ..+..+..... |..+.+...++.--++.+-.++.+-.+... -|++|...+...|..++..+ T Consensus 71 fhvtnvgall-vsrsfdssneaqvhkd-gqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqa 148 (318) T protein:vir:94 71 FHVTNVGALL-VSRSFDSSNEAQVHKD-GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 148 (318) T ss_pred eeehhhhhee-eeccccccchhhhhcc-cccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Confidence 8888887642 2232 23334444333 222333344454445555555544433332 36778888899999999999 Q ss_pred HHHH-HhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCce Q lcl|Aclame:pro 187 FAVA-LETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV 265 (381) Q Consensus 187 ~a~~-~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 265 (381) |..+ .|-+++-|+|++..+.|-+....... ...+ ....+.+...++++.. ...++ -++..|.. 
T Consensus 149 ivnkivdlalvegdgtngfksidkeadvkki-kkit-tkaksagktpfadaie---eavdf-----------vrptagrr 212 (318) T protein:vir:94 149 IVNKIVDLALVEGDGTNGFKSIDKEADVKKI-KKIT-TKAKSAGKTPFADAIE---EAVDF-----------VRPTAGRR 212 (318) T ss_pred HHhhhhheeeeecCCcchhhhhchhhhHHHH-HHhh-hhhhhcCCCchhHHHH---HHHhh-----------hccCCCce Confidence 8865 47888999999877666432210000 0000 0001112222332221 11111 12334444 Q ss_pred EEEEchhhHHHHHhhhhccCCCCceee-------ccCCCce---EEecCCCCCccEEEEeccceEEEecceeeEeeehhh Q lcl|Aclame:pro 266 TMVVNPSDAFEVQAQYTHLNANGVYVT-------ALPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKET 335 (381) Q Consensus 266 ~~imn~~~~~~~~~~~~~~~~~G~~~~-------~l~~g~~---vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~ 335 (381) .+++...+.-.++-.+..-++|.+.-. ..-.|+. |+.....-.-. ++-| .+|.|-+ ++++ .-+-. T Consensus 213 ylivktedrkalldelrqatananvriknddteiasevgvdeiivytgskavkpt-vlvd-qkyhidm-qdlt--kvdaf 287 (318) T protein:vir:94 213 YLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVKPT-VLVD-QKYHIDM-QDLT--KVDAF 287 (318) T ss_pred EEEEeccchHHHHHHHHhhhcccceEEeccchhhhhhcCcceeEEeeccccccce-eEec-cceecch-hhhh--hhhce Confidence 566665554433322221222221100 0001111 11111000001 1111 1233321 1111 00111 Q ss_pred hhhcCceEEEEEEEEcCEEecCcceEEEEEE Q lcl|Aclame:pro 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 336 ~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~ 366 (381) -+..+.--+..-...-|-+---+|-+|.++. 
T Consensus 288 ewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 288 EWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred eeccCCceEEEEecccCcceeecCceeEEeC Confidence 1122221222222234444444555555444 No 198 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=53.09 E-value=0.54 Score=22.06 Aligned_cols=335 Identities=11% Similarity=-0.022 Sum_probs=131.4 Q ss_pred CCccHHHHHHHHHHHHHHHHhhhhH---------------HHHHHHHHHHHHHHHHHHHH---HHH--------HHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINAVNNGEP---------------QERQNELYGDMINQLFEETK---LQA--------KAEAER 54 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~---~~~--------~~~~~~ 54 (381) +.+-+ ++.+.++++++..... .+.-.+. +.+++..... .+. -.+.-+ T Consensus 301 ~~~s~----d~ar~~lL~~l~~~~~p~~~~~~~~~~~~~~g~~~~d~---~~~al~~R~g~~~~~~~n~~~g~~L~elAr 373 (693) T protein:vir:95 301 MNITV----DQAREKLLAAIGADTQPAAALSAGAHIHAGNGNLVGDS---VRASVLARIGRGERQADNAYNGMTLRELAR 373 (693) T ss_pred cCCCH----HHHHHHHHHHHhhccCCCCCcCcCccccCCchhHHHHH---HHHHHHHhcCcccccCCccccCCcHHHHHH Confidence 11111 1222222222211000 0000000 0011110000 000 001112 Q ss_pred HHHhhhhhcc--ccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhh-----hhhhhhceeeecCC--ceEEEEec Q lcl|Aclame:pro 55 VSSLPKSAQS--LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTN-----HPLLADLGIKNAGL--RLKFLKSE 125 (381) Q Consensus 55 ~~~~~~~~~~--lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~-----~~l~~~~~v~~~~~--~~~ip~~~ 125 (381) ..+..++... ++..| .....-..++++= |--+.+-+.+.|++. ...+..|+..+.+- ..+..... 
T Consensus 374 ~~L~~rg~~~~~~~~~~--~~~~a~~htTSDF----p~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg 447 (693) T protein:vir:95 374 ASLVDRGIGVASLNAPQ--MVGLAFTHTSSDF----GLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLG 447 (693) T ss_pred HHHHhcCCccCCCCHHH--HHHHHHhcCcchh----HHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecC Confidence 2222222221 11111 1111111233332 222222222222221 23344444333321 01111222 Q ss_pred CCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhe---eeccCCC Q lcl|Aclame:pro 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF---LKGTGKD 202 (381) Q Consensus 126 ~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~---l~G~G~~ 202 (381) +-+.-.-+.|++|.+-.+- .=+.-++...+++..+.||++.+-.-..++-+-|-..++++.++.+++.+ |.|..+- T Consensus 448 ~~~~L~~V~E~gEyk~~t~-~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m 526 (693) T protein:vir:95 448 EFSSLRQVREGAEYKYVTL-GERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAM 526 (693) T ss_pred CCCChhhcCCCCceeeeec-CCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccc Confidence 3333334566676643111 11234566778999999999999888888888889999999988887644 3332110 Q ss_pred -cceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhh--ccccccccccCceEEEEchhhHHHHHh Q lcl|Aclame:pro 203 -QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHST--NEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 203 -qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~imn~~~~~~~~~ 279 (381) --+.||.. ..++-.++. ....+...+......+..+-.. ...+.....+ ...|++.+.-...... 
T Consensus 527 ~DGk~LFha-dH~Nl~tga----------~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~-P~~llvP~~le~~a~~ 594 (693) T protein:vir:95 527 SDGKTLFHA-DHSNLLTGA----------ASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIR-PGFVLTPVALEDKANQ 594 (693) T ss_pred cCCcceeec-ccccccccc----------ccccChHHHHHHHHHHHHhhcchhccCCceeecc-cceEEecchHHHHHHH Confidence 01234431 111101100 0111122222222222222111 1122233233 3467776654333222 Q ss_pred hhhcc---CC---CC--ceeeccCCCceEEecCCCCC--ccE--EEEeccc--e---EEEecceeeEeeehhhhhhcCce Q lcl|Aclame:pro 280 QYTHL---NA---NG--VYVTALPFNLNVIESTVQEA--GKV--LTYVKGL--Y---DGYLAGGINVQKFKETLALDDMD 342 (381) Q Consensus 280 ~~~~~---~~---~G--~~~~~l~~g~~vi~s~~~p~--~~i--~~gd~s~--y---~i~~r~~~~i~~~~~~~~~~d~~ 342 (381) +.... .+ .| +|+... ..||.++.+.+ ... ++.|... + ++.-.++..|+. +-.|..|-+ T Consensus 595 l~~s~~~~~a~~~~~~~NP~~~~---~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~--~~gf~~dG~ 669 (693) T protein:vir:95 595 IINSESVPGADVNSGIVNPIRAF---AQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQ--QEGFTVDGV 669 (693) T ss_pred Hhccccccccccccccccchhcc---ccccccceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEee--cCCCCcceE Confidence 22111 11 11 222111 14555565542 122 2334322 1 122223444443 335889999 Q ss_pred EEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 343 ~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) -|++..-++.+++|-.+++ |+++- T Consensus 670 ~~kvr~D~G~~~iD~Rg~~----kn~GA 693 (693) T protein:vir:95 670 ASKVRIDAGVAPLDFRGLQ----KSNGA 693 (693) T ss_pred EEEEEEeccCceeeccccc----cCCCC Confidence 9999999999999987765 45443 No 199 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=52.38 E-value=0.56 Score=21.98 Aligned_cols=273 Identities=10% Similarity=0.047 Sum_probs=120.4 Q ss_pred CCCCceEccH--HHHHHHHHHHHhhhhhhhhceeeecC---C-ceEEEEecCCcceE--Eecccccccccccccccceec Q lcl|Aclame:pro 81 
NYKEEKLLPE--ETIDRIFEDLTTNHPLLADLGIKNAG---L-RLKFLKSETSGVAV--WGKIYGEIKGQLDAAFSEETA 152 (381) Q Consensus 81 ~~~gg~lvP~--~~~~~Ii~~l~~~~~l~~~~~v~~~~---~-~~~ip~~~~~~~a~--w~~e~~~~~~~~~~~f~~v~l 152 (381) -+...||+.+ -+..+|.+.....-..+.++.+.+.. . .......+..+.+. |+...+..-+..+..+++-.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 1222333321 22233333221111222333332211 1 12334445556666 866544333345666677777 Q ss_pred cceeeeeehhhhHHHHhcC---hhHHHHHHHHHHHHHHHHHHhhheeeccCC-Ccceeeeeccccccccccccccccchh Q lcl|Aclame:pro 153 IQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 153 ~~~kl~~~~~iS~ell~ds---~~~l~~~i~~~la~a~a~~~d~a~l~G~G~-~qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ..+..+.-+..|.+=|+.+ ..++++-=.+...+++...+|+-.+.|+-. ..-.|++++.........+.... T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~---- 156 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQN---- 156 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccC---- Confidence 7777776666554433333 234666666667778888899999999733 34789998765433221111100 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhccCCCCc----ee-----eccCCCce Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGV----YV-----TALPFNLN 299 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~~~~~G~----~~-----~~l~~g~~ 299 (381) ......++....+++..++..+.....+ ....+ .++|.|..+..+.. +..+..|. |+ ..-+.++. 
T Consensus 157 ~~w~~~T~~eI~~di~~~~~~i~~~s~~---~~~p~-tl~Lpp~~~~~l~~--~~~~~~~~Tvl~~l~~n~~~~~g~~l~ 230 (304) T protein:vir:52 157 TKVQAMDFDKAVAFFKEIFLKGMEKTKR---IEAPN-TFAIDSLDLAHLAL--VQRANTDTTALEFLTKHLSAAAGRQVA 230 (304) T ss_pred CccccCCHHHHHHHHHHHHHHHHhccCc---eecCc-eEEeCHHHHHHHhh--ccCCCCCchHHHHHHHhcccccCCcce Confidence 1122334555555555555444322222 12222 46677765543321 11111111 11 00111222 Q ss_pred E--EecCCCCC-----ccEEEEeccceEEEecceeeEeeehhhhhhcCceEE-E-EEEEEcCEE-ecCcceEEEEE Q lcl|Aclame:pro 300 V--IESTVQEA-----GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLY-T-AKQFAYGKA-KDNKVAAVWKL 365 (381) Q Consensus 300 v--i~s~~~p~-----~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~-~-~~~r~dgk~-~~~~Af~v~~l 365 (381) | +.+..... +.+++.+.+.=++...-++.+.++..- .++...| . +..|++|-. ..|.|++.++- T Consensus 231 I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q--~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 231 IKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQ--PKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred EEEecccccccCCCCceEEEEEecChhheEEecCccccccchh--hcCCceEEecceeeeeeEEEEccceeeeecC Confidence 2 22222221 124444444422222233333333321 1233233 2 567777754 45667777655 No 200 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=50.90 E-value=0.6 Score=21.81 Aligned_cols=344 Identities=7% Similarity=-0.002 Sum_probs=115.5 Q ss_pred ccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---HH- Q lcl|Aclame:pro 3 INLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FF- 73 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~~- 73 (381) |+-.+++.||=.-+++. |++.-+...-.+.+++..+.+..+. .++...+.......|++.+.. 
.+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~~~~~~~l~ea~~~~~~~~~ 74 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDP------IYKDEKVVEAFGGFIAEAEVAGDHGYD 74 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhccc------chhhHHHHHhhhhhhhhhccccccccc Confidence 77777777777777664 3222222222223333333222221 011111100001111111100 00 Q ss_pred -HHHhcccCCCCceEccHHHHHHHHHHHHh-h--hhhhhhceeeecCCce--------EEE------------------- Q lcl|Aclame:pro 74 -MDINKNVNYKEEKLLPEETIDRIFEDLTT-N--HPLLADLGIKNAGLRL--------KFL------------------- 122 (381) Q Consensus 74 -~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~-~--~~l~~~~~v~~~~~~~--------~ip------------------- 122 (381) +.+.+++.+..=--+-+. ++..+|+ . =.-.+++.|.|+++.+ ..+ T Consensus 75 ~~~i~es~~t~~v~~~~P~----Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~e 150 (528) T protein:vir:66 75 ASQIAAGQTTGAITNVGPA----VIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPD 150 (528) T ss_pred chhccccccccccccCchh----HHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccc Confidence 001111111100000111 1111111 0 0112344455443310 000 Q ss_pred -------------------------E--ec--------------------------------------------CCcceE Q lcl|Aclame:pro 123 -------------------------K--SE--------------------------------------------TSGVAV 131 (381) Q Consensus 123 -------------------------~--~~--------------------------------------------~~~~a~ 131 (381) + .. ..+... 
T Consensus 151 a~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~ 230 (528) T protein:vir:66 151 AFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLA 230 (528) T ss_pred ccccccccccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccce Confidence 0 00 000000 Q ss_pred Eecc-----cccc----cccccccccceeccceeeee-------ehhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 132 WGKI-----YGEI----KGQLDAAFSEETAIQNKLTA-------FVVLPKDLNDF----GPAWIERFVRVQIEEAFAVAL 191 (381) Q Consensus 132 w~~e-----~~~~----~~~~~~~f~~v~l~~~kl~~-------~~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~ 191 (381) -.+. .+|. -..++..|.+..|...|.++ ....|-||.+| -.+|.|+.|.+.|+..|...+ T Consensus 231 ~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEI 310 (528) T protein:vir:66 231 EIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEI 310 (528) T ss_pred ecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHh Confidence 0000 0000 00112235666665555543 34567777776 357889999999999999999 Q ss_pred hhheeeccCCC-c--ceeeeeccccccccccccccccchhhhc--cccChhHHHHHHHHHHHHhhhccccccccccCceE Q lcl|Aclame:pro 192 ETAFLKGTGKD-Q--PIGLNRQVQKGVSVTEGAYPEKEEQGTL--TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVT 266 (381) Q Consensus 192 d~a~l~G~G~~-q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~--t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (381) |+.||.=-.+. + -.|+...+.. ..|.. |-...... ..-..+..-..+..+.+......... .+-+.-. 
T Consensus 311 NREii~~i~~~a~~~~~~~t~~~~~----~aG~~-dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T--~r~~gn~ 383 (528) T protein:vir:66 311 NREIVDVINFTAQVGKTGMTQTVGS----KAGVF-DLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQT--GRGAGNF 383 (528) T ss_pred hHHHHhhhhheeeeeeeeeeecccc----cccee-ecccccccccchhHHHHHHHHHHHHHHHHHHHHHhh--ccccccE Confidence 99995311111 1 1222111000 00000 00000000 00000111111222222211111111 1112234 Q ss_pred EEEchhhHHHHHhhh-------------hccCCC-CceeeccCCCceEEecCCCCCccEEEEeccc-------eEE-Eec Q lcl|Aclame:pro 267 MVVNPSDAFEVQAQY-------------THLNAN-GVYVTALPFNLNVIESTVQEAGKVLTYVKGL-------YDG-YLA 324 (381) Q Consensus 267 ~imn~~~~~~~~~~~-------------~~~~~~-G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~-------y~i-~~r 324 (381) .++++.-...+...- ...+.. ..+.-.|.-+++|+.+.+.|.+-++.|-... |+- +. T Consensus 384 vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv- 462 (528) T protein:vir:66 384 VIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYV- 462 (528) T ss_pred EEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccc- Confidence 577776444332110 001111 1223344557889999998877666653211 100 00 Q ss_pred ceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcc--------------------------eEEEEEEec Q lcl|Aclame:pro 325 GGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKV--------------------------AAVWKLDLK 368 (381) Q Consensus 325 ~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~A--------------------------f~v~~l~~~ 368 (381) .+.-....|...| |-.+-...|+ |-.++|-+ |..+-+|.. 
T Consensus 463 ~l~~~~~~dp~sf---qP~~g~~tRY-~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 463 ALTPLRATDPQSF---HPVLGFKTRY-GIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cceeeEeeCCccc---cceeeeeeee-ceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 1111122233222 1122222333 33344311 111111111 No 201 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=49.50 E-value=0.64 Score=21.66 Aligned_cols=262 Identities=13% Similarity=0.069 Sum_probs=83.3 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhce---ee----ecCCceEE-EEec-CCcceE-Eecccccccccccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG---IK----NAGLRLKF-LKSE-TSGVAV-WGKIYGEIKGQLDA 145 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~---v~----~~~~~~~i-p~~~-~~~~a~-w~~e~~~~~~~~~~ 145 (381) |.++.-++= .+--+.+....++.+.+.-.+++.+. +. +..|.+.. +.-. +..... -+...+...+..-. T Consensus 1 ~~~t~~sdl-~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSDL-VIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeecce-eeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 222222220 01123344444454444333333211 11 11232211 1000 000000 00000111110001 Q ss_pred cccceeccceeee-eehh--hhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheee-ccCCCcceeeeecccccccccccc Q lcl|Aclame:pro 146 AFSEETAIQNKLT-AFVV--LPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK-GTGKDQPIGLNRQVQKGVSVTEGA 221 (381) Q Consensus 146 ~f~~v~l~~~kl~-~~~~--iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~-G~G~~qP~Gil~~~~~~~~~~~~~ 221 (381) +..++.. |++ +.-+ .+.+.+.-..-+.+.++. .+...+..+.-+.++. +- .|++..+....... 
T Consensus 80 ~~~dvaV---k~~~~~~~~~~~~~~~a~~g~dp~~~~~-~i~~~~~~~~l~~~l~~~l-----~~~~aai~~~t~~~--- 147 (315) T protein:vir:96 80 ADEMVSV---KVPWKYGPYETTEEAFKRRARSPEEFSM-LIGQDMADATMAGWIGYAL-----NALQGAIGSNAGMN--- 147 (315) T ss_pred cccceeE---EEeecCCchhccHHHHHHhhcCHHHHHH-HHHHHHHHHHHHHHHHHHH-----hhhhhhhccccccc--- Confidence 1111111 222 2222 233333322333344433 2222222222222221 10 00110000000000 Q ss_pred ccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCCCCceeecc---C Q lcl|Aclame:pro 222 YPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTAL---P 295 (381) Q Consensus 222 ~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~~G~~~~~l---~ 295 (381) . .+.....+..++. +....+ |... ..=..|+||...+..+.+... .....+.-+..+ - T Consensus 148 ~-----~~~~a~~~~~~l~----dA~~kl-----GD~~--~~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~~~~~~~~~ 211 (315) T protein:vir:96 148 V-----SGELATEGKKVLT----KGLRTM-----GDKA--SSIAIWVMDSTSYFDIVDEAIDNKLYEEAGVVVYGGTPGT 211 (315) T ss_pred c-----cccccccCHHHHH----HHHHHh-----cccc--cCeeEEEEchHHHHHHHHhhhhhhcccccceeEecCcCcc Confidence 0 0011112222221 111111 1110 111369999999988876421 112222222211 1 Q ss_pred CCceEEecCCCCCccEEEEeccceEEEecceeeEeeehhhh----hhcCceEEEEEEEEcC-EEecCcceEEEEEEeccc Q lcl|Aclame:pro 296 FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETL----ALDDMDLYTAKQFAYG-KAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 296 ~g~~vi~s~~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~----~~~d~~~~~~~~r~dg-k~~~~~Af~v~~l~~~~~ 370 (381) +|++|+.++.||...++ | |. .+.+.+....... -..++.......|.++ ..+++..|.--+. 
T Consensus 212 lGkrViVdD~~P~~~~~-g-l~------~GAi~~~~~~~~~~~~~~~~g~e~l~~~~r~e~tf~l~p~G~sw~~~----- 278 (315) T protein:vir:96 212 LGKPVLVTDQCPATKIF-G-LV------AGAVMITESQAPGMRSYQIDDQENLAIGFRAEGTANVEVLGYKWKTK----- 278 (315) T ss_pred cccEEEEECCCCcceee-e-ee------cceeeecCCCccccccccCCCcceeEEEEeeeeEeeeeeeeEEeecC----- Confidence 49999999999974332 2 00 1222222111111 1123333444444444 4677777665321 Q ss_pred ccCCCCCCCCC Q lcl|Aclame:pro 371 KPALEGTEETL 381 (381) Q Consensus 371 ~~~~~~~~~~~ 381 (381) ...-++|+- T Consensus 279 --~~~sPt~ae 287 (315) T protein:vir:96 279 --TNVNPASAT 287 (315) T ss_pred --CCcCCChHH Confidence 112234433 No 202 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=44.30 E-value=0.81 Score=21.08 Aligned_cols=262 Identities=11% Similarity=-0.006 Sum_probs=110.2 Q ss_pred ccCCCCceEccHHHHHHHHHHHHhhhhhhhhceee------ecCCceEEEEecCCcceEEeccccccccccccccc--ce Q lcl|Aclame:pro 79 NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK------NAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFS--EE 150 (381) Q Consensus 79 ~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~------~~~~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~--~v 150 (381) -..-++-+|-|+.+..++++.+++...+.++|..- ..+.+++||+........ 
+..+..+ +.+=+ .+ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~d----g~~~~~~-~~te~~v~l 75 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSAS----GRTLVKQ-PMVDQTIPF 75 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecc----cCCcccc-ccccceEEE Confidence 12224556679999999999999999888777542 122357888743222111 1111111 11222 35 Q ss_pred eccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhh Q lcl|Aclame:pro 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .|+-+|+..+ .++.+=+..+..++..-+.+....+++..+|..++. +++...- ...+.++ T Consensus 76 ~id~~k~~~~-~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~---------l~~~a~~-~~gt~gt--------- 135 (418) T protein:vir:10 76 KIAYQEHVGL-EYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLAL---------TLKKAFH-SSGTPGV--------- 135 (418) T ss_pred EEecccccce-eechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhccc-ccccCCc--------- Confidence 5555555544 444444445566787777788899999999987652 1111100 0000000 Q ss_pred ccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc-cCC--------CCceeeccCCCceEE Q lcl|Aclame:pro 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-LNA--------NGVYVTALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~-~~~--------~G~~~~~l~~g~~vi 301 (381) .+ ...+.+.++...|.. .+.| -.++.+.+++|..++.+..-... .+. +|.. 
...+|..|+ T Consensus 136 ----~~-~~~~~i~~a~~~Ld~--~~VP--~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~I--G~i~GF~V~ 204 (418) T protein:vir:10 136 ----RP-GAFIDFANAGAKQTT--YAVP--QDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYR--GNVAAYEVY 204 (418) T ss_pred ----Cc-chHHHHHHHHHHHHh--cCCC--CCCceEEEeCHHHHHHHhhhccccccccccchhhheeee--eeeeceEEE Confidence 00 012223333332211 1122 12335668999877655432111 011 1221 123688999 Q ss_pred ecCCCCCccEEEEeccc-eEE-Ee-cce--eeEe---eehhhhhh-cCceEEEEEE---EEcCEEe-cCcceEEE----- Q lcl|Aclame:pro 302 ESTVQEAGKVLTYVKGL-YDG-YL-AGG--INVQ---KFKETLAL-DDMDLYTAKQ---FAYGKAK-DNKVAAVW----- 363 (381) Q Consensus 302 ~s~~~p~~~i~~gd~s~-y~i-~~-r~~--~~i~---~~~~~~~~-~d~~~~~~~~---r~dgk~~-~~~Af~v~----- 363 (381) .|+.+|.... |.+.. ..+ +- ..+ +.+. .+..-... .|...|-+.. ++.+.+. +..=|+|. T Consensus 205 ~S~nip~~ta--g~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~ 282 (418) T protein:vir:10 205 ESQNLPKHTV--GDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDT 282 (418) T ss_pred EecCCCcccc--cccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccc Confidence 9999995432 22211 111 00 111 1111 11111111 1122222211 1100000 11122221 Q ss_pred ------EEEecccccCCCCCCCCC Q lcl|Aclame:pro 364 ------KLDLKGHKPALEGTEETL 381 (381) Q Consensus 364 ------~l~~~~~~~~~~~~~~~~ 381 (381) ++++ .|++.-..++. 
T Consensus 283 ~~~~~~tv~i---~p~~~~~~~~~ 303 (418) T protein:vir:10 283 DAGGAGSIKI---SPSLNDGTATI 303 (418) T ss_pred cccCcceeEe---ccccccccccc Confidence 1111 12211111111 No 203 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=41.84 E-value=0.91 Score=20.81 Aligned_cols=268 Identities=11% Similarity=0.049 Sum_probs=110.3 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeee--------cCCceEEEEecCCcceEEecc-ccccc-c-ccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN--------AGLRLKFLKSETSGVAVWGKI-YGEIK-G-QLD 144 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~--------~~~~~~ip~~~~~~~a~w~~e-~~~~~-~-~~~ 144 (381) |.-.-.+ .||+.+..++++.+++...+-++++.-- .+-+++||+.........-.. ...+. . ..+ T Consensus 1 MAN~llT----~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MANNLES----NISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 2111000 2799999999999999988877765421 122467776543222221111 11111 1 111 Q ss_pred ccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccc Q lcl|Aclame:pro 145 AAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPE 224 (381) Q Consensus 145 ~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~ 224 (381) .+ -.+.|+.+|+.++-.=..|+.. 
+..++++++...+ .+++..+|..++.--=.+-|.-+ .+.++ T Consensus 77 ~~-v~l~id~~k~~a~~v~d~e~~l-~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~~v---------gt~~t--- 141 (423) T protein:vir:35 77 AK-ATGKVGKYITVAVEWTQIEEAL-KLNQLDQILSPIH-ERMVTDLETELAHFMMNNGALSL---------GSPNT--- 141 (423) T ss_pred ce-eeEEeccceeccceeCHHHHHh-hHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcccccc---------ccccC--- Confidence 11 2466777777666555555544 5667888777664 66777777777531000001000 00000 Q ss_pred cchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc-cCC---------CCceeecc Q lcl|Aclame:pro 225 KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-LNA---------NGVYVTAL 294 (381) Q Consensus 225 ~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~-~~~---------~G~~~~~l 294 (381) ..+ ..+.+.++...|. ..+.|. ++.+.+++|..+..++..... ..+ +|. +... T Consensus 142 --------~~~---~~~~i~~a~~~Ld--~~~vP~---~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~-i~G~ 204 (423) T protein:vir:35 142 --------AIK---KWADVAQTASFIK--DIGIKT---GENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQ-ISGN 204 (423) T ss_pred --------Ccc---hHHHHHHHHHHHH--HhcCCc---CCCEEEeCHHHHHHHhccccceeccccchhHHHhhcc-ceee Confidence 001 1122333333221 112232 344568999887776532111 111 111 2122 Q ss_pred CCCceEEecCCCCCccEEE------------------EeccceEEEecceeeEeeehhhhhhcCceEEEEEEEE---cCE Q lcl|Aclame:pro 295 PFNLNVIESTVQEAGKVLT------------------YVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFA---YGK 353 (381) Q Consensus 295 ~~g~~vi~s~~~p~~~i~~------------------gd~s~y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~---dgk 353 (381) .+|..|+.|+.+|..+..- .+.+.+.+.. .+..+.... ..-..|...|-|..-+ .+. 
T Consensus 205 i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~-~~~~~~~~g-~l~~GD~~t~aGv~~v~~~t~~ 282 (423) T protein:vir:35 205 FGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVAL-TGATPSKTG-FLKAGDQLKFTSTHWLNQQSKQ 282 (423) T ss_pred ecceEEEEcCCCccccccccccceeeccccccccccccccccceeee-eeeeeccCC-cEEecceEEeeeeeeccccccc Confidence 3688999999999643110 0111111111 111111111 1122333333332221 111 Q ss_pred EecC-cceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 354 AKDN-KVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 354 ~~~~-~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) ++-. .--...+..+++..-..-++..|+ T Consensus 283 ~~~~~~t~~~~~~~V~~~~~~~a~g~~~v 311 (423) T protein:vir:35 283 TLYNGSTAMSFTATVLEETNSTASGDVTV 311 (423) T ss_pred eeecccCCceeEEEEeccccccccCceeE Confidence 1000 000111222222222222223334 No 204 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=38.92 E-value=1 Score=20.48 Aligned_cols=346 Identities=10% Similarity=0.008 Sum_probs=115.5 Q ss_pred CCccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---H Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---F 72 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~ 72 (381) |+|+.+ ++.||=.-+++. |++.-++..-...+++..+.+.++. .++...+...-.+.|++.+.. . 
T Consensus 1 ~~~~~~-~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~e~~~~~l~~~~~~~~~~ 73 (529) T protein:vir:10 1 MSLKNK-EILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDP------VYRDDKLIEAFGQSLMEAEVAGDHG 73 (529) T ss_pred CcccHH-HHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhcc------ccchhhhhhhhhcccchhhcccccc Confidence 999877 476776666665 3222222212222332222221111 000001111111122221100 0 Q ss_pred HH--HHhcccCCCCce-EccHHHHHHHHHHHHhhhhhhhhceeeecCCceE--------EEEec---------------- Q lcl|Aclame:pro 73 FM--DINKNVNYKEEK-LLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLK--------FLKSE---------------- 125 (381) Q Consensus 73 ~~--~~~~~~~~~gg~-lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~--------ip~~~---------------- 125 (381) ++ -+.+++.+ |.. -.-|.+.. ++++.-..=.-.+++.|.|+++.+- .+... T Consensus 74 ~~~~~i~est~t-~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pd 151 (529) T protein:vir:10 74 YDPTNIAAGQSS-GAITNIGPAVIG-MVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPD 151 (529) T ss_pred cccccccccccc-ccccccCchhhh-hHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccc Confidence 00 00011111 000 00011111 1111001111223444555432210 00000 Q ss_pred --------------------------------------------------------------------------CCcceE Q lcl|Aclame:pro 126 --------------------------------------------------------------------------TSGVAV 131 (381) Q Consensus 126 --------------------------------------------------------------------------~~~~a~ 131 (381) ..+... 
T Consensus 152 a~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~ 231 (529) T protein:vir:10 152 AWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLA 231 (529) T ss_pred cccccccccccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCccccccccccccccccc Confidence 000000 Q ss_pred Eeccc-----ccc----cccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 132 WGKIY-----GEI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVAL 191 (381) Q Consensus 132 w~~e~-----~~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~ 191 (381) -..++ +|. -..++..|.+..|...|.++- ...|-||.+| -.+|.|++|.+.|+..|...+ T Consensus 232 ~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEI 311 (529) T protein:vir:10 232 EIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEI 311 (529) T ss_pred ccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHh Confidence 00000 000 011233466666666665432 4567777776 356899999999999999999 Q ss_pred hhheeec--------c--CC---CcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 192 ETAFLKG--------T--GK---DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKS 258 (381) Q Consensus 192 d~a~l~G--------~--G~---~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (381) |+.||.= . |+ +...|++.- .....+. ........+..++.++-...+... 
T Consensus 312 NReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~-~~~~~~~----------------~~~~~~e~~k~L~~~i~~~an~I~ 374 (529) T protein:vir:10 312 NREVIDWINYTAQVGKSGWTKTDGSASGVFDF-QDPIDVR----------------GARWAGESYKALLIQIDKEANEIA 374 (529) T ss_pred hHHHHHhHhhhhhhhhcccccccccccceeec-ccCcccc----------------ccchHHHHHHHHHHHHHHHHHHHH Confidence 9988741 1 00 011233210 0000000 000011111111111111111111 Q ss_pred cc-ccCce-EEEEchhhHHHHHhhhhc---------------cCCCCceeeccCCCceEEecCCCCCccEEEEeccc--e Q lcl|Aclame:pro 259 VA-VKGNV-TMVVNPSDAFEVQAQYTH---------------LNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGL--Y 319 (381) Q Consensus 259 ~~-~~~~~-~~imn~~~~~~~~~~~~~---------------~~~~G~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~--y 319 (381) +. -++++ ..++++.-...+ ..... .+..+.+.-.|.-+++|+.+.+.|.+-++.|-... | T Consensus 375 ~~T~rg~~n~vi~S~~Va~~L-~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 453 (529) T protein:vir:10 375 RQTGRGAGNFIIASRNVVSAL-ALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNL 453 (529) T ss_pred HhhccccceEEEEchHHHHHH-HhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCccc Confidence 11 12222 456676543322 21000 00111233345557889999998877666663211 1 Q ss_pred E-----E-EecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 320 D-----G-YLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 320 ~-----i-~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) . - +. ....+...|...| |-.+-...|+ |-.++|-+-. .+-..+-..+.|.+-.. 
T Consensus 454 ~~glfy~PYv-~l~~~~~~dp~sf---qP~~g~~tRY-~l~~NP~~~~---~~~~~~~r~~~g~~~~~ 513 (529) T protein:vir:10 454 DAGIYYCPYV-ALTPLRGSDPKNF---QPVMGFKTRY-AIGVNPFAES---RTQAPQGRITSGMPGVN 513 (529) T ss_pred ccceeecccc-ccccccccCCCcc---cceeeeeeee-ceeecCcccc---ccccccccccCCcchhh Confidence 1 0 00 0111112222222 2222223333 3334432210 00000001122222111 No 205 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=37.53 E-value=1.1 Score=20.33 Aligned_cols=354 Identities=8% Similarity=0.037 Sum_probs=117.7 Q ss_pred ccHHHHHHHHHHHHHHHHhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHH---HHH--H Q lcl|Aclame:pro 3 INLSETFANAKNEFINAVNNGEP--QERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FFM--D 75 (381) Q Consensus 3 ~~l~~~~~e~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~~~--~ 75 (381) |..++++.||=.-+++..+.... ...++.....+++.-.++.+.+ + .++...+....+..|.+.|.- -+. . T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~-~-~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~ 78 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETD-P-VYRDEKIVESFGGFLAEAEIAGDHNYDQTN 78 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcC-c-cccchHHHHhhhcccccccccccccccccc Confidence 65667777777777654322111 0111111111111111111111 0 111111111112333332210 000 0 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceE--------EEEecCC-c---------------ceE Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLK--------FLKSETS-G---------------VAV 131 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~--------ip~~~~~-~---------------~a~ 131 (381) +.+++.+..=--.-|.+.. ++++.-..=.-.+++.|.|+++..- .+-.... + .+. 
T Consensus 79 i~~s~~t~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~ 157 (524) T protein:vir:98 79 IASGKSSGAITNIGPAVIG-MVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTM 157 (524) T ss_pred ccccccccccccccchhhh-HHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccc Confidence 1111111100001122211 1111111112234556665543220 0000000 0 000 Q ss_pred Ee---------------------------------------------------------------------ccc-----c Q lcl|Aclame:pro 132 WG---------------------------------------------------------------------KIY-----G 137 (381) Q Consensus 132 w~---------------------------------------------------------------------~e~-----~ 137 (381) |. ..+ + T Consensus 158 fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~a 237 (524) T protein:vir:98 158 YSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVA 237 (524) T ss_pred cCCccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhh Confidence 00 000 0 Q ss_pred cc----cccccccccceeccceeeeee-------hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHHhhheeeccC-- Q lcl|Aclame:pro 138 EI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKGTG-- 200 (381) Q Consensus 138 ~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~d~a~l~G~G-- 200 (381) |. -..+...|.+..|...|..+- ...|-||.+| -.+|.|++|.+.|+..|...+|+.||. 
+= T Consensus 238 EaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~-~i~~ 316 (524) T protein:vir:98 238 ELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVD-LINY 316 (524) T ss_pred hhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH-HHhh Confidence 00 001233466666666666533 4467777776 356899999999999999999999983 20 Q ss_pred CC--cceeeeeccccccccccccccccchhhhccccC------hhHHHHHHHHHHHHhhhcccccccc-ccC-ceEEEEc Q lcl|Aclame:pro 201 KD--QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFAN------PRATVNELTQVFKYHSTNEKGKSVA-VKG-NVTMVVN 270 (381) Q Consensus 201 ~~--qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~imn 270 (381) +. +-.|+-+.. ....+.+.+.+ .......+..++.++-...+...+. -++ .-..|++ T Consensus 317 ~a~~~~~g~t~~~-------------~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S 383 (524) T protein:vir:98 317 TAQVGKSGFTQTV-------------GSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIAS 383 (524) T ss_pred hheeceeeccccc-------------ccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEc Confidence 01 112221110 00011111100 0001111111111111111111111 122 2345777 Q ss_pred hhhHHHHHh-------------hhhccCCCC-ceeeccCCCceEEecCCCCCccEEEEeccce-------EEEecceeeE Q lcl|Aclame:pro 271 PSDAFEVQA-------------QYTHLNANG-VYVTALPFNLNVIESTVQEAGKVLTYVKGLY-------DGYLAGGINV 329 (381) Q Consensus 271 ~~~~~~~~~-------------~~~~~~~~G-~~~~~l~~g~~vi~s~~~p~~~i~~gd~s~y-------~i~~r~~~~i 329 (381) +.-...+-. .....+..| .+.-.|.-+++|+.+.+.|.+-++.|-.... 
+-=-.....+ T Consensus 384 ~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~ 463 (524) T protein:vir:98 384 RNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPL 463 (524) T ss_pred hHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccc Confidence 754332221 000011111 1222344578899999988776666633111 0000001111 Q ss_pred eeehhhhhhcCceEEEEEEEEcCEEecCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 330 QKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 330 ~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) ...|...| |-.+-...|+ |-.++| |+.- .+-....+.+.|. ++- T Consensus 464 ~~~dp~sf---qP~~g~~tRY-~l~~NP--~~~~-~~~~~~~ri~~g~-~~~ 507 (524) T protein:vir:98 464 RGSDPKNF---QPVMGFKTRY-GIGINP--FANS-RSQAPADRITSGM-ISK 507 (524) T ss_pred cccCCccc---cceeeeeeee-ceeecC--cccc-cCCccccccccCc-chH Confidence 12222222 2222223333 344444 2210 0000000111111 110 No 206 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=34.48 E-value=1.3 Score=19.98 Aligned_cols=349 Identities=8% Similarity=-0.006 Sum_probs=130.3 Q ss_pred CCccH-HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH-hhhhhccccHHHHHHHHHHh Q lcl|Aclame:pro 1 MTINL-SETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAE-AERVSS-LPKSAQSLSANQRSFFMDIN 77 (381) Q Consensus 1 m~~~l-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~lt~~e~~~~~~~~ 77 (381) |+.++ ++++.||=.-+++.+++..+...-...+++..+ +.+.. .+.... ...+-..+-.-.||++-.+. 
T Consensus 1 ~~~~~~~e~l~~kw~p~l~~~~~~~~~~~~a~llenq~~--------~~~~~l~e~~~~~~~~~~~~~~~~v~r~~p~l~ 72 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEGCRNDWERHTLATLLENQYR--------EAKKHLMETTQTTEVDGWNLALPIVRRVFANLR 72 (523) T ss_pred CCcchhhHHHHHhhhhhhcccCChhHHHHHHHHhhhhhH--------HHHHhhhhhhhccccccccchhhhhhhHhhhhh Confidence 98875 666667767777765443322111222222111 11100 010000 00010112233455443321 Q ss_pred c---------------------------ccCC-------------------CCce-------E-c--------------- Q lcl|Aclame:pro 78 K---------------------------NVNY-------------------KEEK-------L-L--------------- 88 (381) Q Consensus 78 ~---------------------------~~~~-------------------~gg~-------l-v--------------- 88 (381) . ++.+ +..+ . - T Consensus 73 a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a 152 (523) T protein:vir:59 73 ATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDENARLSRREYETTITVDLATAQQATMRDVG 152 (523) T ss_pred hhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccccccccccccccccCccCCCccccccccccccc Confidence 1 0000 0000 0 0 Q ss_pred -cHH-----HHHHHHH---------HHHhhhhhhhhceeeecCC--c----eEEEEe----------------------- Q lcl|Aclame:pro 89 -PEE-----TIDRIFE---------DLTTNHPLLADLGIKNAGL--R----LKFLKS----------------------- 124 (381) Q Consensus 89 -P~~-----~~~~Ii~---------~l~~~~~l~~~~~v~~~~~--~----~~ip~~----------------------- 124 (381) |+- ....+.+ .....+++... ......+ . ..++.. 
T Consensus 153 ~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~-~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~ 231 (523) T protein:vir:59 153 FDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFW-QYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDF 231 (523) T ss_pred ccccchhhccccceeeeecccccccccccccccccc-ccccccccccccccchhhccccccccccccccccccccccccc Confidence 000 0000000 00000000000 0000000 0 000000 Q ss_pred --c----------------CCcceEEeccc-cc--ccccccccccceeccceeeee-------ehhhhHHHHhc-----C Q lcl|Aclame:pro 125 --E----------------TSGVAVWGKIY-GE--IKGQLDAAFSEETAIQNKLTA-------FVVLPKDLNDF-----G 171 (381) Q Consensus 125 --~----------------~~~~a~w~~e~-~~--~~~~~~~~f~~v~l~~~kl~~-------~~~iS~ell~d-----s 171 (381) . +.+...-..|. +. ........|.+..|...|.++ ....|-||.+| . T Consensus 232 at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~ 311 (523) T protein:vir:59 232 ATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHK 311 (523) T ss_pred cccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhc Confidence 0 00000000000 00 001123345555555555543 35577788777 3 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhheeeccC----CC-----cceeeeeccccccccccccccccchhhhccccChhHHHHH Q lcl|Aclame:pro 172 PAWIERFVRVQIEEAFAVALETAFLKGTG----KD-----QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNE 242 (381) Q Consensus 172 ~~~l~~~i~~~la~a~a~~~d~a~l~G~G----~~-----qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~ 242 (381) .+|.|++|.+.|+..|...+|+.||.=-= .. .+.|++. +. ....+. +. .+.......+..... 
T Consensus 312 GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~-~~---~~~~~~---~~-~~~~~~~~~e~~~~l 383 (523) T protein:vir:59 312 GVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGE-YY---DETSGN---FV-AGNFYGSKQEWLATL 383 (523) T ss_pred CCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceee-ec---ccccch---hh-hhhhhhhhHHHHHHH Confidence 56799999999999999999998874110 00 1222221 00 000000 00 000000001111111 Q ss_pred HHHHHHHhhhccccccccccCc-eEEEEchhhHHHHHhhhhcc-------CCCCc-eeeccCCCceEEecCCCCCccEEE Q lcl|Aclame:pro 243 LTQVFKYHSTNEKGKSVAVKGN-VTMVVNPSDAFEVQAQYTHL-------NANGV-YVTALPFNLNVIESTVQEAGKVLT 313 (381) Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~-~~~imn~~~~~~~~~~~~~~-------~~~G~-~~~~l~~g~~vi~s~~~p~~~i~~ 313 (381) +..+.+........ --+++ -..||++.-...+....-.+ +..|. +.-.|.-+++|+.+.+.|.+-++. T Consensus 384 ~~~~~~~~n~i~~~---t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~~~~dy~~~ 460 (523) T protein:vir:59 384 MIELNKVSNRIQQK---TAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRLYKNIYQNQPVIIM 460 (523) T ss_pred HHHHHHHHHHHHHh---cccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEEEecCCCCcceEEE Confidence 12222211111110 11222 24567776544332211111 11122 233455578999999999887777 Q ss_pred EeccceEEEecceeeEeeehhh---hhh----cCceEEEEEEEEcCEEecCcceEEEEEEeccc Q lcl|Aclame:pro 314 YVKGLYDGYLAGGINVQKFKET---LAL----DDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 314 gd~s~y~i~~r~~~~i~~~~~~---~~~----~d~~~~~~~~r~dgk~~~~~Af~v~~l~~~~~ 370 (381) |-...+.-.+ .++-...+-+. +.. 
.-|-.+-...|+.-.+.+|.+..++-+|+.-+ T Consensus 461 g~k~~~~~~~-~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 461 GNQDLNTPWQ-TGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred EecccCCccc-ccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 7443221111 22222111111 111 12445566678876677999988888887443 No 207 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=30.88 E-value=1.5 Score=19.55 Aligned_cols=302 Identities=12% Similarity=0.053 Sum_probs=122.7 Q ss_pred HHHHHHHHHHhhhhh------ccccHHHH-HHHHHHhcc----cCCCCceEccHHHHH----HHHHHHHhhhhhhhhcee Q lcl|Aclame:pro 48 AKAEAERVSSLPKSA------QSLSANQR-SFFMDINKN----VNYKEEKLLPEETID----RIFEDLTTNHPLLADLGI 112 (381) Q Consensus 48 ~~~~~~~~~~~~~~~------~~lt~~e~-~~~~~~~~~----~~~~gg~lvP~~~~~----~Ii~~l~~~~~l~~~~~v 112 (381) .++..+-..+.+-+. ..++.+-. -.+.++-.+ +++.+ -||..+.+ ++++.+..---...++.+ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~--~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv 78 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCc--chHHHHHHhhccceEeeecchhhhhhhccc Confidence 000000000000010 01111100 011111010 11112 14554443 223322222112223333 Q ss_pred eecC----CceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhh-HHHHhcC--hhHHHHHHHHHHHH Q lcl|Aclame:pro 113 KNAG----LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLP-KDLNDFG--PAWIERFVRVQIEE 185 (381) Q Consensus 113 ~~~~----~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS-~ell~ds--~~~l~~~i~~~la~ 185 (381) .+.+ ....++..+..+.+.+.+..... 
+..+..-...+-..+.+..-+.++ .|+-.-+ ..++.+--+...++ T Consensus 79 ~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ 157 (336) T protein:vir:36 79 SKKGDWTTLVAAFITAEPTTKVATYGDYSSD-GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSAL 157 (336) T ss_pred cccCCccceeEEEeeeeceeeEEEeeccCCC-ceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHH Confidence 3322 12345666666667666544444 334434444445566777667776 4444322 34566777788888 Q ss_pred HHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCce Q lcl|Aclame:pro 186 AFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV 265 (381) Q Consensus 186 a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 265 (381) ++.+.+|.-.++|++..+-.|++++.+.....+..+ .+ .....++...+++..++..+.....|.-.. .... T Consensus 158 ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t--~~-----~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~-~~~~ 229 (336) T protein:vir:36 158 GLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT--PW-----SGSPAVEAVVNEVVALFQVLQTQSQGIITQ-EDVL 229 (336) T ss_pred HHHHhhCcEEEEeccccceEEEEecCCCccccccCC--Cc-----ccccCHHHHHHHHHHHHHHHHHhcCCeeee-cccc Confidence 899999999999999889999999754322111110 00 011233445555555555554433332111 1123 Q ss_pred EEEEchhhHHHHHhhhhccCCCCceeec---cCC-CceEEecCCCCCccEEEEeccceEEEecc---eeeEeeehhhh-- Q lcl|Aclame:pro 266 TMVVNPSDAFEVQAQYTHLNANGVYVTA---LPF-NLNVIESTVQEAGKVLTYVKGLYDGYLAG---GINVQKFKETL-- 336 (381) Q Consensus 266 ~~imn~~~~~~~~~~~~~~~~~G~~~~~---l~~-g~~vi~s~~~p~~~i~~gd~s~y~i~~r~---~~~i~~~~~~~-- 336 (381) .++|.+.-+..+ +..+..|.-+.. -.| ++.++..+.... .-|+..+++..... ...+....... 
T Consensus 230 tL~LP~~~~~~L----s~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~---a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l 302 (336) T protein:vir:36 230 RMGLPPTAMSDL----SKTNQYGLAAAAKLKDIFPKLEFVTIPEYDT---ASGRLVQLWAPRVEGKDTATCGFTEKMRAH 302 (336) T ss_pred EEEechHHHHhc----cCCCccCccHHHHHHHhcCccEEEEcccccc---CCCceEEEEEEecCCCcceeeecchhhhcc Confidence 456666533222 222333321111 012 244443322211 01222121111111 11221111110 Q ss_pred -hh--cCceEEEEEEEEcCEEecCcceEEEEEEe Q lcl|Aclame:pro 337 -AL--DDMDLYTAKQFAYGKAKDNKVAAVWKLDL 367 (381) Q Consensus 337 -~~--~d~~~~~~~~r~dgk~~~~~Af~v~~l~~ 367 (381) .. .-.....+..|..|.++.--.+++...=| T Consensus 303 ~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 303 SIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ceeecCceeEeccccceeeeeeeccchheeeecC Confidence 11 12334456777877776544444332333 No 208 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=29.48 E-value=1.7 Score=19.38 Aligned_cols=263 Identities=10% Similarity=0.005 Sum_probs=111.0 Q ss_pred CCCce--EccHHHHHHHHHHHHhhhhhhhhceeee--------cCCceEEEEecCCcc---eEEeccccccccccccccc Q lcl|Aclame:pro 82 YKEEK--LLPEETIDRIFEDLTTNHPLLADLGIKN--------AGLRLKFLKSETSGV---AVWGKIYGEIKGQLDAAFS 148 (381) Q Consensus 82 ~~gg~--lvP~~~~~~Ii~~l~~~~~l~~~~~v~~--------~~~~~~ip~~~~~~~---a~w~~e~~~~~~~~~~~f~ 148 (381) -..-+ ++|+-++.++++.+++...+.+++..-- .+-+++||+...... 
+.+...........+.+ - T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~-v 79 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAK-A 79 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccce-E Confidence 12334 7899999999999999988877776421 122467776432211 11111101000001111 2 Q ss_pred ceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchh Q lcl|Aclame:pro 149 EETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 149 ~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) .+.|+.+|+.++-.=+.|+. .+..+++++++.. .++++..+|..+.......-+..+ .+.++ .. T Consensus 80 ~l~id~~k~~a~~v~d~E~~-l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~v---------gt~~t-----~~ 143 (423) T protein:vir:10 80 TGEVGNYITVAVEYRQIEEA-LKLNQLDQILVPI-NERMVTDLETELALFMMKHGALSL---------GSPNT-----PI 143 (423) T ss_pred EEEecceeeeeeeeChHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccccc---------ccccc-----cc Confidence 56677777776655566655 5677888876555 678888888877532111101000 00000 00 Q ss_pred hhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhhc-c--CC-------CCceeeccCCCc Q lcl|Aclame:pro 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-L--NA-------NGVYVTALPFNL 298 (381) Q Consensus 229 ~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~~-~--~~-------~G~~~~~l~~g~ 298 (381) + ..+.+......|. ..+.|. ++.+.+++|..+..+...... . +. +|+.+. -.+|. 
T Consensus 144 ------~---a~~~~a~a~~~L~--~~~vP~---~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G-~~~GF 208 (423) T protein:vir:10 144 ------K---KWSDVAQTASFLK--DLGINS---GENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISG-NFGGI 208 (423) T ss_pred ------c---cHHHHHHHHHHHh--hccCCc---CCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccce-eecce Confidence 0 0122222222221 122232 345678999887776532111 1 11 122211 22588 Q ss_pred eEEecCCCCC---ccEE-EEeccceEEEecce-----------eeEeeehh-hhhhcCceEE---EEEEEEcCEEe---- Q lcl|Aclame:pro 299 NVIESTVQEA---GKVL-TYVKGLYDGYLAGG-----------INVQKFKE-TLALDDMDLY---TAKQFAYGKAK---- 355 (381) Q Consensus 299 ~vi~s~~~p~---~~i~-~gd~s~y~i~~r~~-----------~~i~~~~~-~~~~~d~~~~---~~~~r~dgk~~---- 355 (381) .|+.|..+|. ++.. .+--+-.....+.. +....+.. .....|..-| .+..+..+..+ T Consensus 209 di~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~ 288 (423) T protein:vir:10 209 RALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGA 288 (423) T ss_pred EEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeeccc Confidence 9999999984 2211 00001111100000 00011000 0001222222 22334444332 Q ss_pred --cCcceEEEEEEecccccCCCCCCCCC Q lcl|Aclame:pro 356 --DNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 356 --~~~Af~v~~l~~~~~~~~~~~~~~~~ 381 (381) ...-|+|. 
+-.-+.-++.=|+ T Consensus 289 ~~~~~~~~V~-----~~~~~~a~~~~tv 311 (423) T protein:vir:10 289 SALSFTATVM-----EDANAHSSGDVTV 311 (423) T ss_pred CCcceEEEEE-----ecccccccCceEE Confidence 11223331 1110111111123 No 209 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=29.07 E-value=1.7 Score=19.33 Aligned_cols=301 Identities=12% Similarity=0.066 Sum_probs=126.9 Q ss_pred HHHHHHHHHHhhhhhccccH-------HHHH-HHHHH-h---cccCCCCceEccHHHHH----HHHHHHHhhhhhhhhce Q lcl|Aclame:pro 48 AKAEAERVSSLPKSAQSLSA-------NQRS-FFMDI-N---KNVNYKEEKLLPEETID----RIFEDLTTNHPLLADLG 111 (381) Q Consensus 48 ~~~~~~~~~~~~~~~~~lt~-------~e~~-~~~~~-~---~~~~~~gg~lvP~~~~~----~Ii~~l~~~~~l~~~~~ 111 (381) .++..+-..+. +-+-.|.. +-+. ++.++ . -.++++.| ||..+.+ ++++.+...--...+.. T Consensus 1 ~~~~~~~~~l~-~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~~~~~l~~ 77 (336) T protein:vir:78 1 MRDAQRIQNLA-RAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVG 77 (336) T ss_pred CchHHHHHHHh-ccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcc--hHHHHHHhcccceeeehhhhhhhhhhcc Confidence 00000000000 01111111 1000 11011 0 01122222 4554443 22332222212223333 Q ss_pred eeecC----CceEEEEecCCcceEEecccccccccccccccceeccceeeeeehhhhHHHHhc---ChhHHHHHHHHHHH Q lcl|Aclame:pro 112 IKNAG----LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF---GPAWIERFVRVQIE 184 (381) Q Consensus 112 v~~~~----~~~~ip~~~~~~~a~w~~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d---s~~~l~~~i~~~la 184 (381) +.+.+ ....++..+..+.+.+.+-.... 
+..+..-...+-..+.+..-+.++..=+.- ...++.+--+...+ T Consensus 78 v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:78 78 ESKKGDWTTLVAAFITAEPTTTVATYGDYSSD-GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred cccCCCccccEEEEeeeecceeeEEeecccCC-CeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHH Confidence 33332 13456666677777776654444 345555556666677777777777333332 34567777788888 Q ss_pred HHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCc Q lcl|Aclame:pro 185 EAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGN 264 (381) Q Consensus 185 ~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 264 (381) +++.+.+|.-.++|++..+-.|++++.......+.. .+.+ ...+++...+++..++..+.....|.... ... T Consensus 157 ~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~-~~~w------~~~T~~~I~~Di~~~~~~l~~qt~g~~~~-~~~ 228 (336) T protein:vir:78 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITAT-TPWS------GSPAVEAVVNEVVTLFQVLQTQSQGIITQ-EAV 228 (336) T ss_pred HHHHHhhCeEEEEeccccceEEEEeCCCCCcccccC-cCcc------cccCHHHHHHHHHHHHHHHHHhcCCeeee-ccc Confidence 999999999999999988999999975432221111 1111 11234455555555555554333322111 112 Q ss_pred eEEEEchhhHHHHHhhhhccCCCCceeec-c--CC-CceEEecCCCCCccEEEEeccceEEEecc---eeeEeeehhh-- Q lcl|Aclame:pro 265 VTMVVNPSDAFEVQAQYTHLNANGVYVTA-L--PF-NLNVIESTVQEAGKVLTYVKGLYDGYLAG---GINVQKFKET-- 335 (381) Q Consensus 265 ~~~imn~~~~~~~~~~~~~~~~~G~~~~~-l--~~-g~~vi~s~~~p~~~i~~gd~s~y~i~~r~---~~~i~~~~~~-- 335 (381) ..++|.+.-+..+ ...+..|.-+.. + .| ++.|+.-+.... + -|+..+.+..+.. -.++...... 
T Consensus 229 ~tL~Lp~~~~~~L----~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~--A-gg~~~~~~~~~~~~~~t~~~~~p~~f~~ 301 (336) T protein:vir:78 229 LHMGLPPTAMSDL----SKTNQYGLSAAAKLKEIFPKLEFVTIPEYDT--A-SGRLVQLWAPRVEGKDTATCGFTEKMRA 301 (336) T ss_pred eEEEechHHHHhc----cCCCccCccHHHHHHHhcCccEEEEcccccc--c-CcceEEEEEeeccCCcceeeecchhhhc Confidence 3456665543322 122333321111 0 12 244444332211 0 1222111111110 1111111100 Q ss_pred ---hhhcCceEEEEEEEEcCEEecCcceEEEEEEe Q lcl|Aclame:pro 336 ---LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDL 367 (381) Q Consensus 336 ---~~~~d~~~~~~~~r~dgk~~~~~Af~v~~l~~ 367 (381) ....-.....+..|..|.++.--..+....=| T Consensus 302 lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 302 HSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cceeecCceeEeccccceeeeeeeccchheeeccC Confidence 01112334456677777776544444332223 No 210 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=28.87 E-value=1.7 Score=19.31 Aligned_cols=344 Identities=12% Similarity=0.080 Sum_probs=117.1 Q ss_pred CCccHHHHHHHHHHHHHHH-----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccHHHHHHHHH Q lcl|Aclame:pro 1 MTINLSETFANAKNEFINA-----VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD 75 (381) Q Consensus 1 m~~~l~~~~~e~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~ 75 (381) |.|+-++++.|+=.-+++. |++.-++..-...+++..+.+.++.. ...++... .+.+.-+. 
+- T Consensus 1 ~~~~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~-----~l~e~~~~-~~~~~~~~------~~ 68 (470) T protein:vir:10 1 MQMFNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERN-----FLSEAPNV-NTNSGATA------GF 68 (470) T ss_pred CCcchhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccc-----hhhhhhhc-cccccccc------cc Confidence 9999999998888887765 22221111112222222222111110 00000000 00000000 00 Q ss_pred HhcccCCCCceEccHHHHHHHHHHHHhhhhhhhhceeeecCCceEEE---E---ecCCcc-e-------EEeccc----- Q lcl|Aclame:pro 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFL---K---SETSGV-A-------VWGKIY----- 136 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~~~v~~~~~~~~ip---~---~~~~~~-a-------~w~~e~----- 136 (381) +.+++.+..=.-.-|.+.. ++++....=.-.+++.|.|++|..-.. + .+..++ + .|.+.. T Consensus 69 i~~st~t~~v~~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~ 147 (470) T protein:vir:10 69 SADATAAGPVAGFDPVLIS-LIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDD 147 (470) T ss_pred cccccccccccccCchhhh-hHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCcccccccc Confidence 1111111100001122221 111111111234567777776543211 0 111100 0 010000 Q ss_pred -------------------------------------------------cc-ccccccccccceeccceeeeee------ Q lcl|Aclame:pro 137 -------------------------------------------------GE-IKGQLDAAFSEETAIQNKLTAF------ 160 (381) Q Consensus 137 -------------------------------------------------~~-~~~~~~~~f~~v~l~~~kl~~~------ 160 (381) ++ .-..+...|.+..|...|...- T Consensus 148 ~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaL 227 (470) T protein:vir:10 148 TSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRAL 227 (470) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccce Confidence 00 0001223366666666665432 Q ss_pred -hhhhHHHHhc----ChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccccccccccccchhhhccc-- Q lcl|Aclame:pro 161 
-VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF-- 233 (381) Q Consensus 161 -~~iS~ell~d----s~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~t~-- 233 (381) ...|-||.+| -.+|.|++|.+.|+..|...+|+.||.=- ...+. -++.......+.+.+ T Consensus 228 KAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l---------~~~a~-----~~k~~~~~~~Gv~Dl~~ 293 (470) T protein:vir:10 228 KAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTI---------YNVAE-----PGAQANVAAAGTFDLDT 293 (470) T ss_pred eccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHH---------hhhhh-----hceeccccccceEEeec Confidence 4467777766 45689999999999999999999887511 00000 000000011111100 Q ss_pred -cChhHHHHHHHHHHHHhhhccc---cccccccCceEEEEchhhHHHHHhhhh------------ccCCCCc-eeeccCC Q lcl|Aclame:pro 234 -ANPRATVNELTQVFKYHSTNEK---GKSVAVKGNVTMVVNPSDAFEVQAQYT------------HLNANGV-YVTALPF 296 (381) Q Consensus 234 -~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~imn~~~~~~~~~~~~------------~~~~~G~-~~~~l~~ 296 (381) .+.......+..++..+....+ -+.+.+.+|. .++++.-...+ ...- ..+..|. +.-.|.- T Consensus 294 ~~~gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~-~i~S~~Va~~L-a~sG~l~~~~~~~~~~~~D~t~~~~~G~l~~ 371 (470) T protein:vir:10 294 DSNGRWSVEKFKGLIFQIERDANAIAQRTRRGKGNM-ILCSADVASAL-TMAGVLDYTPALNANLNVDDTGNTFAGILQG 371 (470) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHhhccccceE-EEEchhHHhHh-hhccccccccccccccccCCCCceEEEEecC Confidence 0111111111122111110000 0112233332 35666543322 1100 1111222 2334555 Q ss_pred CceEEecCCCCC------ccEEEEeccceEEEecceeeEeee---hhhhhh---cCceEEEEEEEEcCEEecCcceEEEE Q lcl|Aclame:pro 297 NLNVIESTVQEA------GKVLTYVKGLYDGYLAGGINVQKF---KETLAL---DDMDLYTAKQFAYGKAKDNKVAAVWK 364 (381) Q Consensus 297 g~~vi~s~~~p~------~~i~~gd~s~y~i~~r~~~~i~~~---~~~~~~---~d~~~~~~~~r~dgk~~~~~Af~v~~ 364 (381) +++|+.+.++.. +-++.|-....-+ ..++-...+ +.+... 
.-|-.+-...|+ |-.++|-+- T Consensus 372 ~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~--~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY-~l~~NP~~~---- 444 (470) T protein:vir:10 372 KYRVYIDPFSASGGAAATQYYVVGYKGSSPY--DAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRY-GLVENPFSQ---- 444 (470) T ss_pred ceEEEeeccccccCcccccEEEEEEecCcce--ecceeeccccccccCCCCCCccccceeeeeeee-ceeecCccc---- Confidence 678887765432 2333332211000 011111100 000000 112233333344 444444321 Q ss_pred EEecccccCCCCCCCCC Q lcl|Aclame:pro 365 LDLKGHKPALEGTEETL 381 (381) Q Consensus 365 l~~~~~~~~~~~~~~~~ 381 (381) ..+...+...-+.|-. T Consensus 445 -~~~~~~~~i~~~~n~y 460 (470) T protein:vir:10 445 -GTTQGLGTLTRNSNRY 460 (470) T ss_pred -CCCcccccccCCCCce Confidence 1111111111122222 No 211 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=27.12 E-value=1.9 Score=19.09 Aligned_cols=281 Identities=10% Similarity=-0.070 Sum_probs=128.6 Q ss_pred ccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhh-hhhhhceeeecCCc-eEEEEecCCcc-eEEecccccc Q lcl|Aclame:pro 63 QSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNH-PLLADLGIKNAGLR-LKFLKSETSGV-AVWGKIYGEI 139 (381) Q Consensus 63 ~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~-~l~~~~~v~~~~~~-~~ip~~~~~~~-a~w~~e~~~~ 139 (381) ..+|++--+ .+. ..+...+.+...... ..++.|++.+.... .+..+...-+. -.|.+| . 
T Consensus 1 m~it~~~l~---~l~------------~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~ 62 (302) T protein:vir:10 1 MLINKQSLN---AAF------------VAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---K 62 (302) T ss_pred CcccHHHHH---HHH------------HHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---e Confidence 122221111 110 111122222222222 23445655553222 23333333332 245433 2 Q ss_pred cccccccccceeccceeeeeehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhhee----eccCC----Ccceeeeec- Q lcl|Aclame:pro 140 KGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGK----DQPIGLNRQ- 210 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l----~G~G~----~qP~Gil~~- 210 (381) +- ....=..-++...+++..+.||++.+.|-...+-.-+...++++.++.+++.+. .|.+. +|+ ++.. T Consensus 63 ~~-~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~--fF~~d 139 (302) T protein:vir:10 63 VV-KNLKAYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQY--FIDTD 139 (302) T ss_pred ee-ccccccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcc--eeccc Confidence 21 122334566788899999999999999999999999999999999988876553 23221 222 3321 Q ss_pred ccccc-ccccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhh-hhccCCCC Q lcl|Aclame:pro 211 VQKGV-SVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ-YTHLNANG 288 (381) Q Consensus 211 ~~~~~-~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~-~~~~~~~G 288 (381) ...+. +..++..... ..................+... ....|.+...+.. 
.||+.|.-...-+.+ ...+..+| T Consensus 140 H~~g~~~~~N~g~~~~--~~~~~~l~~~~~~aa~~am~~~--k~~~G~~L~i~P~-~LiVp~~le~~A~~ll~~~~~~~g 214 (302) T protein:vir:10 140 HPVGDASVSNKGTAPL--SNASQAAAKAGYGAARTAMKKF--KDEEGRSLNVSPN-VLLVGPALEDVAKMLLTNPKLADN 214 (302) T ss_pred ccccccccccccchhh--hhcccccchHHHHHHHHHHHHH--hhhcccccccCCC-EEEecchhHHHHHHHhhccccCCC Confidence 00000 0000000000 0000111222222222222211 1233444444433 477776554433332 22333333 Q ss_pred --ceeeccCCCceEEecCCCCCccEE--EEeccc---eEEEecceeeEeeehhhhhhcCceEEEEEEEEcCEEecCcceE Q lcl|Aclame:pro 289 --VYVTALPFNLNVIESTVQEAGKVL--TYVKGL---YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 289 --~~~~~l~~g~~vi~s~~~p~~~i~--~gd~s~---y~i~~r~~~~i~~~~~~~~~~d~~~~~~~~r~dgk~~~~~Af~ 361 (381) ++... -..++.++.+.++..+ +.|.+. +++.-+++..+... ..|..|.+-++....++..-.-.-+|. T Consensus 215 ~~Np~~g---~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~--~~~~~dgv~~k~~~d~Gvd~R~~~G~~ 289 (302) T protein:vir:10 215 TPNPYVG---TAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQ--VNLDSDDVFNLRKLKFGAEARAAAGYG 289 (302) T ss_pred Ccceecc---ceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEec--cCCCCCceEEEEEEEEeeeeeeecchh Confidence 22221 1356777777665543 345544 33344566666653 347778888888877765445555555 Q ss_pred EEEEEecccccCC Q lcl|Aclame:pro 362 VWKLDLKGHKPAL 374 (381) Q Consensus 362 v~~l~~~~~~~~~ 374 (381) ...+-+...-.+- T Consensus 290 ~wq~a~~s~g~~~ 302 (302) T protein:vir:10 290 FWQLAYGSTGTGA 302 (302) T ss_pred hhhhhhccCccCC Confidence 5555553322222 No 212 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=25.88 E-value=2 Score=18.92 Aligned_cols=286 Identities=6% Similarity=-0.069 Sum_probs=110.4 Q ss_pred HhcccCCCCceEccH--HHHHHHHHHHHhhhhhhhhceee---------ecCCc-eEEEEecCC-cc---eEEeccc-cc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPE--ETIDRIFEDLTTNHPLLADLGIK---------NAGLR-LKFLKSETS-GV---AVWGKIY-GE 138 (381) Q Consensus 76 
~~~~~~~~gg~lvP~--~~~~~Ii~~l~~~~~l~~~~~v~---------~~~~~-~~ip~~~~~-~~---a~w~~e~-~~ 138 (381) |..+.- .-..||+ .|..-+.+.-.+.+.|++-.-++ ..+|. ..+|.-... +. -+|..-. +. T Consensus 1 Ma~T~l--~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MAITTI--GNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEE--eeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 222211 2234555 24443434444444444421111 12343 678864332 22 2333211 11 Q ss_pred ccccccccccceeccceeee--eehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccc Q lcl|Aclame:pro 139 IKGQLDAAFSEETAIQNKLT--AFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~--~~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~ 216 (381) .+...-.++.++-...+.-. ..-.++.+|-- -|..+.|.+.+++...+.....+|. --.|+|..-..... T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~~~~ 150 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHH-----HHHhhhcccccccc Confidence 21111122333333333222 33445666643 2567777777777666655444432 11233321100000 Q ss_pred cccccccccchhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCCCCceeec Q lcl|Aclame:pro 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTA 293 (381) Q Consensus 217 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~~G~~~~~ 293 (381) .......-...+......++....+....+ .....|..... =..++||+..+..++++.. .++++|...-. 
T Consensus 151 ~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~----Gdaa~Gd~~~~--lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ 224 (349) T protein:vir:94 151 AYHEQNDMVVDVSATSGFDAGAFIDATQTM----GDALMGNGGEV--LGAIAMHSFVYAQARKAQLIDFIRDAENNTMFA 224 (349) T ss_pred cccccCceeEEecccCCCChhhHHHHHHHH----HHHhccccccc--eeEEEEchHHHHHHHhcchhhhccCcccCcccc Confidence 000000000001111112222222221111 11011110000 0247899999888766432 23444432211 Q ss_pred cCCCceEEecCCCCCcc---------EEEEeccceEEEecc-eeeEeeehhhh--hhcCceEEEEEEEEcCEEecCcceE Q lcl|Aclame:pro 294 LPFNLNVIESTVQEAGK---------VLTYVKGLYDGYLAG-GINVQKFKETL--ALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 294 l~~g~~vi~s~~~p~~~---------i~~gd~s~y~i~~r~-~~~i~~~~~~~--~~~d~~~~~~~~r~dgk~~~~~Af~ 361 (381) .-.|++||.++.||-.. .+||.- .+.+.+.. ...++..++.. -..++..+....|+ ++++..+. T Consensus 225 ty~G~~VivDD~~Pv~~~g~~~~yttylfg~G-Ai~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s 300 (349) T protein:vir:94 225 TYQGYRVIVDDSMTVVGQDTSRKFISIIFGQG-AIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYS 300 (349) T ss_pred eecCcEEEEeCCCccccCCCCceEEEEEeecc-eEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---Eeeeeeee Confidence 22489999999999411 123311 11111211 12233333332 23456666666665 57777777 Q ss_pred EEEEEecccc--cCCCCCCCC-C Q lcl|Aclame:pro 362 VWKLDLKGHK--PALEGTEET-L 381 (381) Q Consensus 362 v~~l~~~~~~--~~~~~~~~~-~ 381 (381) .-.-.++... 
.-.+.++++ | T Consensus 301 ~~~a~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:94 301 FTSAVITGNGTETIARSASWQDL 323 (349) T ss_pred ecccccCCCccccccCCCChHHh Confidence 5432221100 011334444 3 No 213 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=24.45 E-value=2.2 Score=18.73 Aligned_cols=285 Identities=6% Similarity=-0.076 Sum_probs=107.5 Q ss_pred HhcccCCCCceEccH--HHHHHHHHHHHhhhhhhhhceee---------ecCCc-eEEEEecCC-c--ce-EEecc-ccc Q lcl|Aclame:pro 76 INKNVNYKEEKLLPE--ETIDRIFEDLTTNHPLLADLGIK---------NAGLR-LKFLKSETS-G--VA-VWGKI-YGE 138 (381) Q Consensus 76 ~~~~~~~~gg~lvP~--~~~~~Ii~~l~~~~~l~~~~~v~---------~~~~~-~~ip~~~~~-~--~a-~w~~e-~~~ 138 (381) |..+.- .-..||+ .|..-+.+.-.+.+.|++-.-++ ..+|. ..+|.-... + .. .|..- .+. T Consensus 1 Ma~T~l--~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTI--GDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEE--eeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 322221 2234565 24433334334444444321111 22343 678875432 2 21 23221 111 Q ss_pred ccccccccccceeccceeeee--ehhhhHHHHhcChhHHHHHHHHHHHHHHHHHHhhheeeccCCCcceeeeeccccccc Q lcl|Aclame:pro 139 IKGQLDAAFSEETAIQNKLTA--FVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~~--~~~iS~ell~ds~~~l~~~i~~~la~a~a~~~d~a~l~G~G~~qP~Gil~~~~~~~~ 216 (381) .+...-.++.++-...+.-.+ .-.++.+|-- -|..+.|.+.+++...+.....+|. .-.|++..-..... 
T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~a~~ 150 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhhcccccccc Confidence 121112223333333333333 3345665533 3567777777776655544433321 01223321000000 Q ss_pred ccccccccc-chhhhccccChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHHhhhh---ccCCCCceee Q lcl|Aclame:pro 217 VTEGAYPEK-EEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVT 292 (381) Q Consensus 217 ~~~~~~~~~-~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~~~~~~---~~~~~G~~~~ 292 (381) ..... .+. ..+...+..++....+....+ ... ..|..... =..++||+..+..++++.. .++++|...- T Consensus 151 ~~~~~-~~~t~d~s~~a~~~~~~~~dA~~~l-gda---~~Gd~~~~--lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i 223 (349) T protein:vir:78 151 AYHEQ-NDMVVDVSATLGFDAGAFIDATQTM-GDA---LMGNGGEV--LGAIAMHSFVYAQARKAQLIDFIRDAENNTMF 223 (349) T ss_pred hhhhc-ccceeeeccccCCChhhhhhhHHHH-HHH---hccccccc--eeEEEEchHHHHHHHhhhhhhhccCcccCccc Confidence 00000 000 000000111222211111111 000 00100000 0257899999888766432 2344443211 Q ss_pred ccCCCceEEecCCCCCcc---------EEEEeccceEEEecce-eeEeeehhhh--hhcCceEEEEEEEEcCEEecCcce Q lcl|Aclame:pro 293 ALPFNLNVIESTVQEAGK---------VLTYVKGLYDGYLAGG-INVQKFKETL--ALDDMDLYTAKQFAYGKAKDNKVA 360 (381) Q Consensus 293 ~l~~g~~vi~s~~~p~~~---------i~~gd~s~y~i~~r~~-~~i~~~~~~~--~~~d~~~~~~~~r~dgk~~~~~Af 360 (381) ..-.|++||+++.||-.. .+||.-. +.+.+.++ ..++..++.. 
-..++..+....|+ ++++..+ T Consensus 224 ~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GA-i~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~ 299 (349) T protein:vir:78 224 ATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQGA-IGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGY 299 (349) T ss_pred ceecCeEEEEeCCCccccCCCCceEEEEEeecce-EEEccCCCccceeeecccccCCcceeEEEEEeeEE---Eeeeeee Confidence 122489999999999421 1233111 11122121 1233322332 22456666666554 4677776 Q ss_pred EEEEEEecccccC--CCCCCCC-C Q lcl|Aclame:pro 361 AVWKLDLKGHKPA--LEGTEET-L 381 (381) Q Consensus 361 ~v~~l~~~~~~~~--~~~~~~~-~ 381 (381) ..-.-.++..+.. ...++++ | T Consensus 300 s~~~a~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:78 300 RFTSAVITGNGTETIARSASWQDL 323 (349) T ss_pred eeccccccCCccccccCCCChHHh Confidence 6543222221111 1233433 2 No 214 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=21.70 E-value=2.6 Score=18.34 Aligned_cols=283 Identities=14% Similarity=0.043 Sum_probs=121.8 Q ss_pred ccccHHHHHHHHHHhcccCCCCceEccHHHHHHHHHHHHhhhhhhhh----ceeeecCCce--EEEEe-cCCcceEEecc Q lcl|Aclame:pro 63 QSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLAD----LGIKNAGLRL--KFLKS-ETSGVAVWGKI 135 (381) Q Consensus 63 ~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~Ii~~l~~~~~l~~~----~~v~~~~~~~--~ip~~-~~~~~a~w~~e 135 (381) .+. +. +.++.+.+ =...+.++.+.+-..+||+.. .++.+.+|+. ..|.. ...+++.|-.. 
T Consensus 1 mp~-~~----lsel~t~t--------l~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~G 67 (321) T protein:vir:34 1 MPF-PN----ISDIITTT--------IESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSG 67 (321) T ss_pred CCC-ch----HHHHHHHH--------HHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEe Confidence 111 00 11111111 012233444445555666544 3345556644 44544 33567788543 Q ss_pred cccccccccccccceeccceeeeeehhhhH-HHHhcCh----hHHHHHHHHHHHHHHHHHHhhheee-ccC--CCcceee Q lcl|Aclame:pro 136 YGEIKGQLDAAFSEETAIQNKLTAFVVLPK-DLNDFGP----AWIERFVRVQIEEAFAVALETAFLK-GTG--KDQPIGL 207 (381) Q Consensus 136 ~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~-ell~ds~----~~l~~~i~~~la~a~a~~~d~a~l~-G~G--~~qP~Gi 207 (381) .+.......-.|.+=++..+.++.-+.||- |+|+.+. +||...=.+...+.++..++..+.. |+| ..+..|+ T Consensus 68 yd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL 147 (321) T protein:vir:34 68 YDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGL 147 (321) T ss_pred eeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhh Confidence 333332233468888889888887777763 5555443 2222222334445566666666654 665 4466676 Q ss_pred eeccccccccccccccccch----h--hhccc----cChhHHHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHH Q lcl|Aclame:pro 208 NRQVQKGVSVTEGAYPEKEE----Q--GTLTF----ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEV 277 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~----~--~~~t~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~imn~~~~~~~ 277 (381) =-.+. ..|..|++.+... . ...++ .++.+....+..++.... .+. ...-.||+...-+... 
T Consensus 148 ~~lv~--~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~---Rg~----~~PDlii~~~~~y~~y 218 (321) T protein:vir:34 148 DGAVP--VDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCV---RGA----DMPDLIMSGNDAWTTY 218 (321) T ss_pred hhhcc--cCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhc---cCC----CCccEEEechHHHHHH Confidence 22111 1122222222110 0 00111 122222222222221111 111 1112455554333222 Q ss_pred Hhhhhc----cC---CCCceeeccC-CCceEEecC----CCCCccEEEEeccceEEEecceeeEeeehhhhhh-cCceEE Q lcl|Aclame:pro 278 QAQYTH----LN---ANGVYVTALP-FNLNVIEST----VQEAGKVLTYVKGLYDGYLAGGINVQKFKETLAL-DDMDLY 344 (381) Q Consensus 278 ~~~~~~----~~---~~G~~~~~l~-~g~~vi~s~----~~p~~~i~~gd~s~y~i~~r~~~~i~~~~~~~~~-~d~~~~ 344 (381) +..... .+ ++..+ +.|- .|..|+.+. .+|++..+|-+=++..++...+-.+......++. -+|.+. T Consensus 219 ~~s~q~~qR~~~~~~a~~Gf-~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~NqdA~ 297 (321) T protein:vir:34 219 SNSLQVLQRFTSAEEANLGF-RSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFNQDAE 297 (321) T ss_pred HHhhheeeeecccccccccc-eeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccchhHH Confidence 221110 11 11111 1111 256677765 6899988888888766765444444433333222 233333 Q ss_pred EEEEEEcCEEecCcceEEEEEEec Q lcl|Aclame:pro 345 TAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 345 ~~~~r~dgk~~~~~Af~v~~l~~~ 368 (381) ....-.-|.++-+++..-..|+=. T Consensus 298 ~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 298 AQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred hhhhhhhheeeeecccceeEEeeC Confidence 333334456666665544333321 Done!