Query lcl|Aclame:protein:vir:7855|NCBI_annot:gp12|genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Match_columns 497 No_of_seqs 180 out of 996 Neff 10.1 Searched_HMMs 1612 Date Sat Nov 30 17:59:14 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_12 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_12_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:101650 Length: 497 100.0 1.6E-96 1E-99 545.8 49.1 497 1-497 1-497 (497) 2 protein:vir:7855 Length: 497 # 100.0 1.6E-96 1E-99 545.8 49.1 497 1-497 1-497 (497) 3 protein:vir:100135 Length: 418 100.0 3.2E-70 2E-73 401.6 42.0 411 1-496 4-418 (418) 4 protein:vir:4339 Length: 395 # 100.0 6E-68 3.7E-71 389.1 40.6 394 1-493 1-395 (395) 5 protein:vir:1886 Length: 385 # 100.0 7.3E-68 4.5E-71 388.7 39.6 384 1-494 1-385 (385) 6 protein:vir:191 Length: 385 # 100.0 7.3E-68 4.5E-71 388.7 39.6 384 1-494 1-385 (385) 7 protein:vir:81227 Length: 413 100.0 4.8E-67 3E-70 384.2 41.8 405 14-496 1-413 (413) 8 protein:vir:10364 Length: 390 100.0 1.2E-67 7.6E-71 387.4 38.1 388 1-491 1-390 (390) 9 protein:vir:81070 Length: 390 100.0 2.4E-67 1.5E-70 385.8 38.0 387 1-491 1-390 (390) 10 protein:vir:97053 Length: 390 100.0 8.9E-67 5.6E-70 382.7 37.9 387 1-491 1-390 (390) 11 protein:vir:100247 Length: 425 100.0 7.3E-66 4.5E-69 377.7 38.5 404 1-494 1-425 (425) 12 protein:vir:1328 Length: 392 # 100.0 1.1E-65 6.7E-69 376.8 36.5 385 1-494 1-392 (392) 13 protein:vir:101607 Length: 379 100.0 1.8E-65 1.1E-68 375.6 37.7 378 1-493 1-379 (379) 14 protein:vir:485 Length: 407 # 100.0 2.3E-64 1.4E-67 369.5 38.3 397 1-497 1-404 (407) 15 protein:vir:6242 Length: 390 # 100.0 1.2E-64 7.7E-68 370.9 36.0 385 1-494 1-390 (390) 16 protein:vir:95376 Length: 425 100.0 2.1E-63 1.3E-66 364.3 42.4 413 5-497 1-425 (425) 17 protein:vir:105038 Length: 428 100.0 4.3E-64 2.7E-67 368.0 37.9 402 1-493 1-428 (428) 18 protein:vir:94673 Length: 419 100.0 3.6E-63 2.2E-66 362.9 39.8 410 1-495 1-419 (419) 19 protein:vir:4456 Length: 401 # 100.0 1.2E-63 7.8E-67 365.4 36.1 393 1-493 1-401 (401) 20 protein:vir:104256 Length: 458 100.0 6.3E-61 3.9E-64 350.6 44.7 430 1-493 3-458 (458) 21 protein:vir:1433 Length: 435 # 100.0 1.7E-62 1E-65 359.2 35.1 402 2-497 1-435 (435) 22 protein:vir:4511 Length: 409 # 100.0 2.3E-62 1.4E-65 358.6 34.7 395 1-496 1-409 (409) 23 protein:vir:8102 Length: 543 # 100.0 1.3E-60 7.9E-64 348.9 44.1 428 1-494 40-543 (543) 24 protein:vir:80376 Length: 435 100.0 6.7E-62 4.1E-65 356.0 35.8 402 1-497 1-435 (435) 25 protein:vir:1268 Length: 397 # 100.0 1.6E-61 9.8E-65 353.9 36.6 386 1-493 1-397 (397) 26 protein:vir:81160 Length: 371 100.0 1.2E-61 7.3E-65 354.6 35.2 356 1-493 1-371 (371) 27 protein:vir:8420 Length: 477 # 100.0 1.2E-60 7.4E-64 349.1 38.4 433 7-497 1-475 (477) 28 protein:vir:4953 Length: 397 # 100.0 4.9E-61 3.1E-64 351.2 36.2 378 1-497 1-389 (397) 29 protein:vir:80128 Length: 466 100.0 3.9E-60 2.4E-63 346.3 40.3 443 2-497 1-452 (466) 30 protein:vir:102119 Length: 404 100.0 5.5E-61 3.4E-64 350.9 35.5 390 1-497 1-404 (404) 31 protein:vir:3870 Length: 400 # 100.0 3E-60 1.9E-63 346.9 38.5 391 5-494 1-400 (400) 32 protein:vir:1025 Length: 408 # 100.0 1.6E-60 1E-63 348.4 36.8 383 1-497 4-397 (408) 33 protein:vir:4700 Length: 415 # 100.0 3.5E-60 2.2E-63 346.5 38.5 400 1-497 1-408 (415) 34 protein:vir:4600 Length: 415 # 100.0 3.5E-60 2.2E-63 346.5 38.5 400 1-497 1-408 (415) 35 protein:vir:4997 Length: 397 # 100.0 2.1E-60 1.3E-63 347.8 36.9 378 1-497 1-389 (397) 36 protein:vir:98339 Length: 415 100.0 6.5E-60 4E-63 345.1 39.4 399 1-497 1-408 (415) 37 protein:vir:79987 Length: 415 100.0 6.5E-60 4E-63 345.1 39.4 399 1-497 1-408 (415) 38 protein:vir:81100 Length: 415 100.0 6.5E-60 4E-63 345.1 39.4 399 1-497 1-408 (415) 39 protein:vir:3991 Length: 404 # 100.0 2.8E-60 1.7E-63 347.1 36.9 386 1-497 1-397 (404) 40 protein:vir:9410 Length: 415 # 100.0 8.6E-60 5.3E-63 344.4 39.3 399 1-497 1-408 (415) 41 protein:vir:7409 Length: 408 # 100.0 2.4E-60 1.5E-63 347.5 36.0 383 1-497 1-397 (408) 42 protein:vir:3845 Length: 395 # 100.0 1.4E-59 8.5E-63 343.3 38.7 376 1-497 1-387 (395) 43 protein:vir:6212 Length: 434 # 100.0 9.2E-60 5.7E-63 344.2 36.9 412 5-497 1-434 (434) 44 protein:vir:1084 Length: 437 # 100.0 1.6E-58 9.8E-62 337.5 43.6 411 1-497 1-431 (437) 45 protein:vir:4830 Length: 397 # 100.0 1.5E-59 9.2E-63 343.1 37.4 378 1-497 1-389 (397) 46 protein:vir:102082 Length: 392 100.0 1.5E-59 9E-63 343.2 35.4 374 1-497 1-388 (392) 47 protein:vir:105004 Length: 392 100.0 1.5E-59 9E-63 343.2 35.4 374 1-497 1-388 (392) 48 protein:vir:102873 Length: 392 100.0 1.5E-59 9E-63 343.2 35.4 374 1-497 1-388 (392) 49 protein:vir:107593 Length: 392 100.0 1.5E-59 9E-63 343.2 35.4 374 1-497 1-388 (392) 50 protein:vir:4092 Length: 390 # 100.0 3.7E-59 2.3E-62 340.9 35.1 366 1-497 1-372 (390) 51 protein:vir:9704 Length: 394 # 100.0 1.5E-58 9.4E-62 337.6 37.9 387 1-497 1-394 (394) 52 protein:vir:100172 Length: 394 100.0 1.2E-58 7.7E-62 338.1 37.0 380 1-497 1-388 (394) 53 protein:vir:962 Length: 397 # 100.0 8E-58 4.9E-61 333.6 39.2 388 1-493 1-397 (397) 54 protein:vir:100884 Length: 389 100.0 2.8E-58 1.7E-61 336.1 36.2 377 1-497 1-386 (389) 55 protein:vir:7771 Length: 330 # 100.0 2.8E-59 1.7E-62 341.6 26.7 302 143-497 1-326 (330) 56 protein:vir:4226 Length: 326 # 100.0 2.2E-59 1.4E-62 342.1 25.7 311 127-496 1-326 (326) 57 protein:vir:93616 Length: 645 100.0 2.5E-57 1.6E-60 330.9 35.4 425 1-497 155-642 (645) 58 protein:vir:1383 Length: 421 # 100.0 4.9E-57 3E-60 329.3 36.3 379 1-497 1-396 (421) 59 protein:vir:2430 Length: 318 # 100.0 1.1E-58 6.6E-62 338.4 26.4 302 137-497 1-317 (318) 60 protein:vir:104085 Length: 320 100.0 1.7E-58 1E-61 337.3 26.6 305 138-496 1-320 (320) 61 protein:vir:9574 Length: 300 # 100.0 2E-58 1.3E-61 336.9 26.7 279 152-493 1-300 (300) 62 protein:vir:41 Length: 299 # N 100.0 1.4E-58 8.7E-62 337.7 25.7 280 148-494 1-299 (299) 63 protein:vir:98635 Length: 377 100.0 8E-58 5E-61 333.6 28.2 365 1-493 1-377 (377) 64 protein:vir:80684 Length: 315 100.0 5.6E-58 3.5E-61 334.5 26.2 287 151-497 1-310 (315) 65 protein:vir:9361 Length: 402 # 100.0 4.3E-57 2.6E-60 329.6 31.0 383 1-497 16-400 (402) 66 protein:vir:96762 Length: 632 100.0 2.3E-56 1.4E-59 325.6 35.0 415 1-492 185-632 (632) 67 protein:vir:5739 Length: 366 # 100.0 1.1E-57 6.9E-61 332.8 26.5 344 67-493 1-366 (366) 68 protein:vir:96978 Length: 387 100.0 2.2E-56 1.4E-59 325.7 32.8 383 1-497 1-385 (387) 69 protein:vir:94424 Length: 387 100.0 2.2E-56 1.4E-59 325.7 32.8 383 1-497 1-385 (387) 70 protein:vir:2685 Length: 387 # 100.0 2.2E-56 1.4E-59 325.7 32.8 383 1-497 1-385 (387) 71 protein:vir:8187 Length: 311 # 100.0 2.6E-57 1.6E-60 330.8 26.5 281 153-494 1-311 (311) 72 protein:vir:2344 Length: 397 # 100.0 2.2E-57 1.4E-60 331.2 25.7 295 142-497 1-310 (397) 73 protein:vir:93881 Length: 387 100.0 8.5E-56 5.3E-59 322.5 34.3 382 1-497 1-385 (387) 74 protein:vir:97148 Length: 324 100.0 1.6E-56 9.6E-60 326.5 28.7 303 108-497 1-319 (324) 75 protein:vir:105905 Length: 304 100.0 5.8E-57 3.6E-60 328.9 25.9 285 143-492 1-304 (304) 76 protein:vir:94142 Length: 304 100.0 5.8E-57 3.6E-60 328.9 25.9 285 143-492 1-304 (304) 77 protein:vir:9759 Length: 303 # 100.0 1.2E-56 7.7E-60 327.1 26.7 285 153-493 1-303 (303) 78 protein:vir:9309 Length: 324 # 100.0 2.9E-56 1.8E-59 325.1 28.7 303 108-497 1-319 (324) 79 protein:vir:78830 Length: 324 100.0 3.6E-56 2.2E-59 324.6 28.5 303 108-497 1-322 (324) 80 protein:vir:96392 Length: 324 100.0 3.6E-56 2.2E-59 324.6 28.5 303 108-497 1-322 (324) 81 protein:vir:1638 Length: 298 # 100.0 3.4E-56 2.1E-59 324.7 26.7 280 155-492 1-298 (298) 82 protein:vir:78223 Length: 333 100.0 5.3E-56 3.3E-59 323.6 26.6 294 142-494 1-333 (333) 83 protein:vir:99749 Length: 324 100.0 1.3E-55 8.1E-59 321.5 28.5 303 108-497 1-319 (324) 84 protein:vir:9643 Length: 377 # 100.0 4.6E-55 2.8E-58 318.5 31.3 363 1-493 1-377 (377) 85 protein:vir:78523 Length: 338 100.0 7E-56 4.3E-59 323.0 26.7 297 142-496 1-338 (338) 86 protein:vir:101291 Length: 381 100.0 4.2E-55 2.6E-58 318.7 30.9 361 21-497 1-372 (381) 87 protein:vir:9509 Length: 381 # 100.0 4.2E-55 2.6E-58 318.7 30.9 361 21-497 1-372 (381) 88 protein:vir:103955 Length: 324 100.0 2.5E-55 1.6E-58 319.9 28.2 303 108-497 1-319 (324) 89 protein:vir:96223 Length: 324 100.0 4.2E-55 2.6E-58 318.7 28.3 303 108-497 1-319 (324) 90 protein:vir:100632 Length: 381 100.0 2.1E-54 1.3E-57 314.8 30.8 361 21-497 1-377 (381) 91 protein:vir:95963 Length: 395 100.0 6.2E-54 3.9E-57 312.3 33.3 374 1-497 1-380 (395) 92 protein:vir:99920 Length: 311 100.0 2.3E-55 1.5E-58 320.1 25.2 285 152-493 1-311 (311) 93 protein:vir:2504 Length: 305 # 100.0 2.5E-55 1.6E-58 319.9 25.2 284 151-497 1-302 (305) 94 protein:vir:94771 Length: 298 100.0 4.8E-55 3E-58 318.4 26.5 277 155-492 1-298 (298) 95 protein:vir:78640 Length: 352 100.0 4.3E-54 2.7E-57 313.1 30.2 348 26-497 1-350 (352) 96 protein:vir:95763 Length: 297 100.0 8.8E-55 5.5E-58 316.9 26.2 281 143-494 1-297 (297) 97 protein:vir:4856 Length: 293 # 100.0 1.6E-54 1E-57 315.4 25.8 274 147-497 1-285 (293) 98 protein:vir:78350 Length: 383 100.0 4.2E-53 2.6E-56 307.7 30.7 363 1-497 1-379 (383) 99 protein:vir:4197 Length: 314 # 100.0 1.1E-41 6.8E-45 245.1 23.5 292 139-496 1-314 (314) 100 protein:vir:4159 Length: 315 # 100.0 2E-41 1.2E-44 243.7 21.8 295 127-492 1-315 (315) 101 protein:vir:97397 Length: 517 100.0 4E-40 2.5E-43 236.6 28.2 390 1-496 122-517 (517) 102 protein:vir:4074 Length: 480 # 100.0 5.6E-37 3.5E-40 219.3 22.2 364 1-496 109-480 (480) 103 protein:vir:3158 Length: 321 # 100.0 3.3E-35 2E-38 209.6 24.7 302 130-497 1-321 (321) 104 protein:vir:9820 Length: 272 # 100.0 1.9E-31 1.2E-34 189.0 22.7 266 151-496 1-272 (272) 105 protein:vir:3033 Length: 272 # 100.0 1.9E-31 1.2E-34 189.0 22.7 266 151-496 1-272 (272) 106 protein:vir:3613 Length: 272 # 99.8 5.8E-22 3.6E-25 137.0 19.0 264 151-493 1-272 (272) 107 protein:vir:93742 Length: 274 99.8 4.4E-21 2.7E-24 132.2 20.9 268 151-497 1-274 (274) 108 protein:vir:80930 Length: 278 99.8 1.3E-20 8.2E-24 129.6 19.6 271 151-494 1-278 (278) 109 protein:vir:105334 Length: 276 99.8 3.7E-20 2.3E-23 127.1 19.6 268 151-497 1-274 (276) 110 protein:vir:94933 Length: 330 99.8 1.4E-20 8.9E-24 129.4 16.2 308 108-496 1-330 (330) 111 protein:vir:96833 Length: 275 99.8 7.6E-20 4.7E-23 125.4 19.4 269 151-497 1-275 (275) 112 protein:vir:96123 Length: 274 99.7 3.8E-19 2.4E-22 121.5 20.5 268 151-497 1-274 (274) 113 protein:vir:94494 Length: 274 99.7 2.4E-18 1.5E-21 117.2 20.9 268 151-497 1-274 (274) 114 protein:vir:97433 Length: 274 99.7 2.4E-18 1.5E-21 117.2 20.9 268 151-497 1-274 (274) 115 protein:vir:1239 Length: 274 # 99.6 2.2E-17 1.4E-20 111.9 19.4 268 151-497 1-274 (274) 116 protein:vir:96262 Length: 274 99.6 3.2E-17 2E-20 111.0 19.7 265 151-497 1-271 (274) 117 protein:vir:95898 Length: 274 99.6 3.2E-17 2E-20 111.0 19.7 265 151-497 1-271 (274) 118 protein:vir:95107 Length: 270 99.6 8.2E-16 5.1E-19 103.3 18.9 262 151-497 1-269 (270) 119 protein:vir:739 Length: 231 # 99.5 8.7E-16 5.4E-19 103.1 15.8 228 185-493 1-231 (231) 120 protein:vir:79928 Length: 393 99.5 3.6E-14 2.3E-17 94.3 20.9 356 43-497 1-382 (393) 121 protein:vir:97255 Length: 310 99.4 5.4E-14 3.3E-17 93.3 20.9 282 151-493 1-310 (310) 122 protein:vir:99424 Length: 360 99.4 1E-13 6.2E-17 91.9 21.3 329 120-496 1-360 (360) 123 protein:vir:108211 Length: 318 99.4 2.5E-14 1.5E-17 95.2 16.0 290 148-494 1-318 (318) 124 protein:vir:8324 Length: 410 # 99.3 4.6E-14 2.9E-17 93.7 13.7 382 1-497 1-410 (410) 125 protein:vir:93858 Length: 400 99.2 1.1E-11 6.6E-15 80.7 20.2 391 1-491 1-400 (400) 126 protein:vir:7990 Length: 273 # 99.1 1.2E-11 7.4E-15 80.5 17.5 261 151-493 1-273 (273) 127 protein:vir:105822 Length: 273 99.1 2.6E-11 1.6E-14 78.6 17.4 259 151-493 1-273 (273) 128 protein:vir:102605 Length: 273 99.1 2.6E-11 1.6E-14 78.6 17.4 259 151-493 1-273 (273) 129 protein:vir:2201 Length: 345 # 99.0 1.8E-11 1.1E-14 79.5 12.9 298 135-493 1-345 (345) 130 protein:vir:8885 Length: 347 # 99.0 3.5E-11 2.2E-14 77.9 14.3 296 139-494 1-347 (347) 131 protein:vir:103285 Length: 296 99.0 1.1E-10 6.7E-14 75.2 16.3 278 151-491 1-296 (296) 132 protein:vir:94576 Length: 347 98.9 1.2E-10 7.4E-14 75.0 15.5 295 139-493 1-347 (347) 133 protein:vir:103323 Length: 364 98.9 7.1E-10 4.4E-13 70.7 19.2 297 142-497 1-343 (364) 134 protein:vir:10450 Length: 344 98.9 2E-10 1.3E-13 73.7 14.6 298 135-493 1-344 (344) 135 protein:vir:80213 Length: 334 98.8 4.4E-10 2.7E-13 71.9 15.8 294 139-495 1-334 (334) 136 protein:vir:3364 Length: 347 # 98.8 1.8E-10 1.1E-13 74.0 13.5 299 139-495 1-347 (347) 137 protein:vir:9927 Length: 295 # 98.8 2.6E-10 1.6E-13 73.1 13.1 265 151-497 1-292 (295) 138 protein:vir:78739 Length: 332 98.8 3.2E-10 2E-13 72.6 13.5 293 136-491 1-332 (332) 139 protein:vir:107687 Length: 319 98.8 2E-09 1.3E-12 68.2 17.6 301 109-491 1-319 (319) 140 protein:vir:1541 Length: 347 # 98.8 7.9E-10 4.9E-13 70.5 15.2 293 139-495 1-347 (347) 141 protein:vir:95318 Length: 328 98.8 4.6E-11 2.8E-14 77.3 8.3 284 145-435 1-328 (328) 142 protein:vir:94622 Length: 341 98.8 1.2E-09 7.7E-13 69.4 15.8 285 142-495 1-341 (341) 143 protein:vir:94711 Length: 347 98.8 3.8E-10 2.4E-13 72.2 12.5 290 139-494 1-347 (347) 144 protein:vir:6324 Length: 335 # 98.7 1.8E-09 1.1E-12 68.6 16.1 290 142-497 1-332 (335) 145 protein:vir:100057 Length: 375 98.7 2.6E-09 1.6E-12 67.6 16.8 310 135-497 1-374 (375) 146 protein:vir:78935 Length: 335 98.7 3.3E-09 2.1E-12 67.1 17.3 290 142-497 1-332 (335) 147 protein:vir:105645 Length: 400 98.7 2E-09 1.3E-12 68.2 15.2 298 142-497 1-337 (400) 148 protein:vir:80068 Length: 301 98.7 1.1E-08 6.8E-12 64.2 17.9 283 153-491 1-301 (301) 149 protein:vir:97031 Length: 402 98.6 2.1E-09 1.3E-12 68.1 13.7 292 142-497 1-337 (402) 150 protein:vir:9875 Length: 296 # 98.6 6.9E-09 4.3E-12 65.3 15.9 273 141-494 1-296 (296) 151 protein:vir:7019 Length: 401 # 98.6 2E-09 1.3E-12 68.3 12.7 306 142-497 1-342 (401) 152 protein:vir:5974 Length: 324 # 98.6 4.7E-08 2.9E-11 60.8 19.3 271 151-497 1-292 (324) 153 protein:vir:103759 Length: 330 98.6 1.2E-09 7.2E-13 69.6 10.1 282 145-435 1-330 (330) 154 protein:vir:104342 Length: 314 98.5 1.8E-08 1.1E-11 63.0 16.1 297 127-494 1-314 (314) 155 protein:vir:7324 Length: 335 # 98.5 1.7E-09 1.1E-12 68.6 10.0 284 145-435 1-335 (335) 156 protein:vir:1583 Length: 351 # 98.5 7.6E-08 4.7E-11 59.6 18.7 277 151-497 1-296 (351) 157 protein:vir:102944 Length: 330 98.5 6.3E-08 3.9E-11 60.0 17.5 280 151-497 1-298 (330) 158 protein:vir:99675 Length: 324 98.5 1.1E-08 6.6E-12 64.3 13.2 253 186-497 1-301 (324) 159 protein:vir:79642 Length: 329 98.5 7.9E-08 4.9E-11 59.5 17.9 310 109-496 1-329 (329) 160 protein:vir:107388 Length: 331 98.4 5.7E-09 3.6E-12 65.8 10.1 283 145-435 1-331 (331) 161 protein:vir:98525 Length: 331 98.4 5.7E-09 3.6E-12 65.8 10.1 283 145-435 1-331 (331) 162 protein:vir:107826 Length: 331 98.4 5.7E-09 3.6E-12 65.8 10.1 283 145-435 1-331 (331) 163 protein:vir:106647 Length: 303 98.4 1.4E-08 8.5E-12 63.7 11.6 271 147-497 1-300 (303) 164 protein:vir:8843 Length: 317 # 98.4 1.1E-07 6.6E-11 58.8 15.9 300 148-495 1-317 (317) 165 protein:vir:80180 Length: 381 98.1 4.4E-07 2.7E-10 55.4 14.9 297 135-497 1-312 (381) 166 protein:vir:3136 Length: 322 # 98.0 9.6E-07 6E-10 53.6 13.7 280 151-497 1-322 (322) 167 protein:vir:79548 Length: 652 97.8 1.9E-05 1.2E-08 46.5 23.1 420 1-490 168-652 (652) 168 protein:vir:5255 Length: 304 # 97.6 1.1E-05 6.7E-09 47.8 14.5 280 157-490 1-304 (304) 169 protein:vir:102655 Length: 322 97.4 3.8E-05 2.4E-08 44.8 14.7 285 145-494 1-322 (322) 170 protein:vir:95512 Length: 693 97.3 0.0001 6.4E-08 42.4 19.9 421 1-494 220-693 (693) 171 protein:vir:101557 Length: 336 97.3 4.8E-05 3E-08 44.3 14.4 310 99-491 1-336 (336) 172 protein:vir:3643 Length: 336 # 97.1 6.8E-05 4.2E-08 43.4 13.7 309 99-491 1-336 (336) 173 protein:vir:107732 Length: 379 96.9 3.6E-05 2.2E-08 45.0 10.4 341 74-492 1-379 (379) 174 protein:vir:78558 Length: 336 96.6 0.00048 3E-07 38.8 14.4 308 99-491 1-336 (336) 175 protein:vir:94070 Length: 339 96.5 0.00057 3.6E-07 38.4 14.9 312 96-491 1-339 (339) 176 protein:vir:99075 Length: 392 96.3 0.00079 4.9E-07 37.6 16.6 268 151-497 1-317 (392) 177 protein:vir:96079 Length: 382 96.2 0.00034 2.1E-07 39.6 11.7 343 74-491 1-382 (382) 178 protein:vir:103886 Length: 302 96.1 0.00069 4.3E-07 37.9 13.1 285 151-497 1-302 (302) 179 protein:vir:106734 Length: 336 95.5 0.0018 1.1E-06 35.6 13.1 308 99-491 1-336 (336) 180 protein:vir:95131 Length: 325 95.3 0.0023 1.4E-06 35.1 17.2 274 151-497 1-295 (325) 181 protein:vir:99576 Length: 388 94.9 0.00075 4.7E-07 37.7 9.1 351 74-491 1-388 (388) 182 protein:vir:1153 Length: 338 # 94.7 0.0037 2.3E-06 33.9 18.7 326 130-495 1-338 (338) 183 protein:vir:1781 Length: 221 # 94.4 0.0035 2.2E-06 34.1 11.6 187 233-497 1-206 (221) 184 protein:vir:3525 Length: 423 # 94.3 0.0048 3E-06 33.3 16.7 275 151-497 1-308 (423) 185 protein:vir:100331 Length: 342 94.1 0.0055 3.4E-06 33.0 17.8 325 130-497 1-342 (342) 186 protein:vir:94989 Length: 349 94.0 0.0057 3.5E-06 32.9 17.7 278 151-497 1-318 (349) 187 protein:vir:348 Length: 321 # 93.9 0.006 3.7E-06 32.8 11.9 302 151-491 1-321 (321) 188 protein:vir:4600 Length: 415 # 93.7 0.0066 4.1E-06 32.5 20.7 387 1-471 8-415 (415) 189 protein:vir:4700 Length: 415 # 93.7 0.0066 4.1E-06 32.5 20.7 387 1-471 8-415 (415) 190 protein:vir:96792 Length: 315 93.6 0.0069 4.3E-06 32.4 17.1 265 152-497 1-282 (315) 191 protein:vir:1663 Length: 393 # 93.4 0.0077 4.8E-06 32.2 14.0 382 1-491 1-393 (393) 192 protein:vir:3870 Length: 400 # 93.3 0.0082 5.1E-06 32.0 19.7 379 1-484 1-400 (400) 193 protein:vir:78777 Length: 358 92.9 0.0095 5.9E-06 31.7 17.0 326 114-497 1-352 (358) 194 protein:vir:105374 Length: 423 92.3 0.012 7.4E-06 31.1 16.4 274 151-497 1-332 (423) 195 protein:vir:1829 Length: 355 # 92.3 0.012 7.4E-06 31.1 18.8 332 130-497 1-346 (355) 196 protein:vir:174 Length: 423 # 92.0 0.013 8.3E-06 30.8 16.4 275 151-497 1-308 (423) 197 protein:vir:93966 Length: 400 91.9 0.014 8.4E-06 30.8 15.8 391 1-491 1-400 (400) 198 protein:vir:98566 Length: 355 91.5 0.016 9.6E-06 30.5 18.9 333 130-497 1-353 (355) 199 protein:vir:104011 Length: 337 91.3 0.016 1E-05 30.4 19.7 324 130-496 1-337 (337) 200 protein:vir:79171 Length: 337 91.0 0.018 1.1E-05 30.2 19.7 324 130-496 1-337 (337) 201 protein:vir:78186 Length: 337 90.4 0.021 1.3E-05 29.8 19.1 324 130-496 1-337 (337) 202 protein:vir:80446 Length: 367 89.6 0.025 1.6E-05 29.3 18.7 297 151-497 1-337 (367) 203 protein:vir:5694 Length: 357 # 89.0 0.029 1.8E-05 29.0 18.4 332 130-497 1-346 (357) 204 protein:vir:78387 Length: 349 88.7 0.031 1.9E-05 28.9 18.5 278 151-497 1-318 (349) 205 protein:vir:6061 Length: 357 # 88.2 0.034 2.1E-05 28.7 18.5 331 130-497 1-352 (357) 206 protein:vir:79157 Length: 339 88.2 0.034 2.1E-05 28.6 19.2 329 130-497 1-339 (339) 207 protein:vir:95875 Length: 401 87.9 0.036 2.2E-05 28.5 15.0 309 142-494 1-401 (401) 208 protein:vir:99311 Length: 463 87.5 0.038 2.4E-05 28.3 13.5 314 108-497 1-354 (463) 209 protein:vir:95603 Length: 463 87.5 0.038 2.4E-05 28.3 13.5 314 108-497 1-354 (463) 210 protein:vir:2016 Length: 357 # 86.4 0.046 2.8E-05 27.9 18.6 333 130-497 1-346 (357) 211 protein:vir:108303 Length: 418 85.9 0.049 3.1E-05 27.8 18.3 264 151-497 1-287 (418) 212 protein:vir:270 Length: 341 # 84.9 0.057 3.5E-05 27.4 16.7 326 114-497 1-338 (341) 213 protein:vir:1084 Length: 437 # 84.8 0.058 3.6E-05 27.4 24.2 397 1-469 7-437 (437) 214 protein:vir:98856 Length: 343 83.5 0.068 4.2E-05 27.0 18.4 321 130-497 1-337 (343) 215 protein:vir:95451 Length: 313 80.1 0.098 6.1E-05 26.1 14.1 278 151-495 1-313 (313) 216 protein:vir:94800 Length: 319 78.9 0.11 6.8E-05 25.9 18.0 286 98-497 1-298 (319) 217 protein:vir:97331 Length: 319 78.9 0.11 6.8E-05 25.9 18.0 286 98-497 1-298 (319) 218 protein:vir:3783 Length: 336 # 78.1 0.12 7.3E-05 25.7 17.4 316 127-497 1-334 (336) 219 protein:vir:3746 Length: 336 # 78.1 0.12 7.3E-05 25.7 17.4 316 127-497 1-334 (336) 220 protein:vir:96666 Length: 462 76.6 0.13 8.3E-05 25.4 15.5 311 108-497 1-343 (462) 221 protein:vir:105522 Length: 423 76.3 0.14 8.5E-05 25.3 17.0 273 151-497 1-308 (423) 222 protein:vir:107120 Length: 329 64.2 0.3 0.00019 23.4 18.6 296 114-497 1-310 (329) 223 protein:vir:78148 Length: 123 58.0 0.38 0.00024 22.9 6.7 106 377-493 1-123 (123) 224 protein:vir:80835 Length: 464 57.9 0.42 0.00026 22.6 14.3 312 92-497 1-340 (464) 225 protein:vir:81100 Length: 415 54.0 0.51 0.00032 22.2 21.5 397 1-471 8-415 (415) 226 protein:vir:98339 Length: 415 54.0 0.51 0.00032 22.2 21.5 397 1-471 8-415 (415) 227 protein:vir:79987 Length: 415 54.0 0.51 0.00032 22.2 21.5 397 1-471 8-415 (415) 228 protein:vir:94870 Length: 318 48.8 0.66 0.00041 21.6 10.8 308 92-491 1-318 (318) 229 protein:vir:4953 Length: 397 # 47.4 0.7 0.00044 21.4 17.0 371 1-469 8-397 (397) 230 protein:vir:100135 Length: 418 45.4 0.77 0.00048 21.2 19.6 371 1-465 21-418 (418) 231 protein:vir:9410 Length: 415 # 45.4 0.77 0.00048 21.2 19.1 387 1-471 8-415 (415) 232 protein:vir:100851 Length: 514 42.1 0.9 0.00056 20.8 11.2 329 92-497 1-386 (514) 233 protein:vir:861 Length: 318 # 40.3 0.98 0.00061 20.6 10.6 311 92-491 1-318 (318) 234 protein:vir:104915 Length: 470 38.5 1.1 0.00066 20.4 18.2 352 61-497 1-457 (470) 235 protein:vir:6242 Length: 390 # 36.0 1.2 0.00074 20.2 14.7 360 1-458 6-390 (390) 236 protein:vir:8846 Length: 705 # 35.9 1.2 0.00075 20.1 16.8 140 1-150 544-705 (705) 237 protein:vir:962 Length: 397 # 35.1 1.2 0.00077 20.1 24.4 378 1-476 9-397 (397) 238 protein:vir:97397 Length: 517 33.9 1.3 0.00082 19.9 17.8 380 1-486 127-517 (517) 239 protein:vir:107947 Length: 519 32.8 1.4 0.00087 19.8 21.6 354 68-497 1-505 (519) 240 protein:vir:1328 Length: 392 # 32.5 1.4 0.00088 19.8 19.8 366 1-486 6-392 (392) 241 protein:vir:4456 Length: 401 # 32.2 1.4 0.00089 19.7 20.8 386 1-484 5-401 (401) 242 protein:vir:102823 Length: 470 32.0 1.5 0.0009 19.7 10.2 301 124-497 1-369 (470) 243 protein:vir:8420 Length: 477 # 31.6 1.5 0.00092 19.6 24.4 442 1-486 1-477 (477) No 1 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1.6e-96 Score=545.83 Aligned_cols=497 Identities=100% Similarity=1.396 Sum_probs=433.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) ||+..+++++.++++++++++.+++.+..+|+++.++++.++++++.+.++..++..+...+++++.+++++++.++.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999988888888888888899999888888888776 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........+..........+.+.............................+.+..+............+..++++++| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 65544443433333333333333333322222222222222222223333334445555555555566666777888888 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) ++||+++..+||+.+++.++|+++++++++++++++||++++.++.++||+||+.+|+++++|++|++.+|||+++++|| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:10 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 99999999999999999999999999999999999999998877789999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhh Q lcl|Aclame:pro 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 241 ~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) +|||+|++++++||.++|++++++++|.+||+|+|+++|.||++.++..+.+.......+.............+.+.+.. T Consensus 241 ~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) T protein:vir:10 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhh Confidence 99999999999999999999999999999999999999999999999988888888888888888888889999999999 Q ss_pred hhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccc Q lcl|Aclame:pro 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~ 400 (497) +..++..+.+.....+.....++.....++..+....++.+......+++..+++|+|||.+|..|+++||++|+|||++ T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:10 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred hhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEee Q lcl|Aclame:pro 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~ 480 (497) +.....+.+....++|||+||+++++||+|+++||||++++|.|++|.+++|+++++..++|++|+|+||+++|+|+.|+ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 480 (497) T protein:vir:10 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) T ss_pred cccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceee Confidence 98888888899999999999999999999999999999999999999999999999988899999999999999999999 Q ss_pred cccceEEEEecCCCCCC Q lcl|Aclame:pro 481 RPSAFQLIQLKKGATGS 497 (497) Q Consensus 481 ~~~Af~~~~~~~~a~~~ 497 (497) +|+|||+|+++++++|| T Consensus 481 ~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 481 RPSAFQLIQLKKGATGS 497 (497) T ss_pred ccccEEEEEecCCccCC Confidence 99999999999999999 No 2 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1.6e-96 Score=545.83 Aligned_cols=497 Identities=100% Similarity=1.396 Sum_probs=433.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) ||+..+++++.++++++++++.+++.+..+|+++.++++.++++++.+.++..++..+...+++++.+++++++.++.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999988888888888888899999888888888776 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........+..........+.+.............................+.+..+............+..++++++| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 65544443433333333333333333322222222222222222223333334445555555555566666777888888 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) ++||+++..+||+.+++.++|+++++++++++++++||++++.++.++||+||+.+|+++++|++|++.+|||+++++|| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:78 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 99999999999999999999999999999999999999998877789999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhh Q lcl|Aclame:pro 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 241 ~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) +|||+|++++++||.++|++++++++|.+||+|+|+++|.||++.++..+.+.......+.............+.+.+.. T Consensus 241 ~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) T protein:vir:78 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhh Confidence 99999999999999999999999999999999999999999999999988888888888888888888889999999999 Q ss_pred hhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccc Q lcl|Aclame:pro 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~ 400 (497) +..++..+.+.....+.....++.....++..+....++.+......+++..+++|+|||.+|..|+++||++|+|||++ T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:78 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred hhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEee Q lcl|Aclame:pro 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~ 480 (497) +.....+.+....++|||+||+++++||+|+++||||++++|.|++|.+++|+++++..++|++|+|+||+++|+|+.|+ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 480 (497) T protein:vir:78 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVY 480 (497) T ss_pred cccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceee Confidence 98888888899999999999999999999999999999999999999999999999988899999999999999999999 Q ss_pred cccceEEEEecCCCCCC Q lcl|Aclame:pro 481 RPSAFQLIQLKKGATGS 497 (497) Q Consensus 481 ~~~Af~~~~~~~~a~~~ 497 (497) +|+|||+|+++++++|| T Consensus 481 ~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 481 RPSAFQLIQLKKGATGS 497 (497) T ss_pred ccccEEEEEecCCccCC Confidence 99999999999999999 No 3 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=3.2e-70 Score=401.60 Aligned_cols=411 Identities=25% Similarity=0.322 Sum_probs=275.4 Q ss_pred CchHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQG---RQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 m~~~~~~~~~~---~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) |+...++.+.. +++.+.++++.+...+...++++..++...+.+... +...+..++++++.++.++++.++ T Consensus 4 ~~~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~------~~~~e~~~~~~~l~~~~~~l~~~~ 77 (418) T protein:vir:10 4 MNEPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAG------DLGVETKATVDELLIKQGELQARL 77 (418) T ss_pred chhHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------hhhHHHHHHHHHHHHHHHHHHHHH Confidence 66666655322 222333333322222222222222222211111111 112233344444444444444433 Q ss_pred HHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) .+.+......... . .....+. .+........... ... .......+...............++++ T Consensus 78 ~~~e~~~~~~~~~---~---~~~~~~~-----~~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 141 (418) T protein:vir:10 78 LEAEQKLARGGGS---A---ELETPKT-----LGQLVTESEEMKG---MDG--SARKSVRVRVDRKSIMNVPATVGSGVS 141 (418) T ss_pred HHHHHHHhhcccc---c---ccchhhh-----hhHHhhhHHHHHH---HHH--HHhhhhhhhhHHHHHHHhhhhccCCCC Confidence 3322211110000 0 0000000 0000000000000 000 000000011111111122223334455 Q ss_pred cCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeec Q lcl|Aclame:pro 158 TFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) Q Consensus 158 ~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~ 237 (497) .+|++||+++...||+.+++.++|++++++++++++++++|++++.++.+.|++||+.+|+++++|++|++.++|+++++ T Consensus 142 ~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~ 221 (418) T protein:vir:10 142 GSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLF 221 (418) T ss_pred CCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEee Confidence 56778888999999999999999999999999999999999998877889999999999999999999999999999999 Q ss_pred hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+|+|++++++||.++|+++++.++|.+||+|+|+++ |.||++.++..+..... T Consensus 222 ~is~ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~--------------------- 280 (418) T protein:vir:10 222 KASRQILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITL--------------------- 280 (418) T ss_pred hhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccc--------------------- Confidence 99999999999999999999999999999999999999975 99999876544332211 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .....+++++.++..+.. .+..++.|+|||.+|..|+++||++|+| T Consensus 281 ---------------------------------~~~~~~~~i~~~~~~~~~-~~~~~~~~v~n~~~~~~L~~lkd~~G~~ 326 (418) T protein:vir:10 281 ---------------------------------ANATPIDKIRLALLQAVL-AEFPATGIVLNPIDWASIELTKDSQGRY 326 (418) T ss_pred ---------------------------------cccccHHHHHHHHHhhcc-ccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 111223445555555543 3456678999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeec Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLG 476 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~ 476 (497) ||.++. ...+++|+|+||+++++||+++++||||++. |.++++.+++|+++++.+.+|++|+|.||++.|+| T Consensus 327 i~~~~~-------~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~-~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d 398 (418) T protein:vir:10 327 IVGNPV-------NGTTPRLWNLPVVETQAMTANEFLVGAFSMA-AQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLA 398 (418) T ss_pred eccccc-------cCCCceecceeeEEcCCCCCCcEEEeeccce-EEEEEecceEEEEecccchhhhcCceEEEEEEeec Confidence 996532 2334689999999999999999999999984 77999999999999999889999999999999999 Q ss_pred cEeecccceEEEEecCCCCC Q lcl|Aclame:pro 477 LLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 477 ~~v~~~~Af~~~~~~~~a~~ 496 (497) |++++|+||+++++++++.| T Consensus 399 ~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 399 LAVYRPESFVTGALVEQAGG 418 (418) T ss_pred cEEecccceEEEEeccCCCC Confidence 99999999999999999999 No 4 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=6e-68 Score=389.10 Aligned_cols=394 Identities=25% Similarity=0.355 Sum_probs=268.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |... +++++++.++++++.+ ++++..+++.++.+...+ ..+++.++++++..+.+.++..+.+. T Consensus 1 m~~~---~k~l~el~~~~~~~~~-------~~~~~~e~~~~~~~~~~~------~~~e~~~~~~~~~~~~~~~~~~~~~~ 64 (395) T protein:vir:43 1 MSDF---EKQIGELNASLKQVGD-------QIKSQAEQVNTQIANFGE------MNKETRAKVDELLTAQGELQARLSAA 64 (395) T ss_pred ChhH---HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhh------hhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5432 2333333333333222 122222222222221111 12233333444444444433333222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +............ ....+... .... ............+. ............++++.+| T Consensus 65 ~~~~~~~~~~~~~-----~~~~~~~~------~~~~------~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~g 122 (395) T protein:vir:43 65 EQAMLANEKRDGG-----EEAPKTAG------QMVA------ESLKEQGVTSSLRG-----SHRVSMPRSAITSIDGSGG 122 (395) T ss_pred HHHHHhhhccccc-----cchhhhHH------HHHH------HHHHHHHHHHHhhh-----hhhhhhhhhhhcccCCCCc Confidence 2111111000000 00000000 0000 00000000000000 0111112233445666678 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) +++||++..+||+.+++.++|+++|+++++++++++||+.++.++.++||+||+.+|+++++|++++++++|++++++|| T Consensus 123 ~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is 202 (395) T protein:vir:43 123 ALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKAS 202 (395) T ss_pred cccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhh Confidence 88999999999999999999999999999999999999998877889999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhh Q lcl|Aclame:pro 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 241 ~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) +|||+|++++++||.++|+++++.++|.+||+|+|+++ |.||++..+..+...+.. T Consensus 203 ~ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~----------------------- 259 (395) T protein:vir:43 203 RQILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVV----------------------- 259 (395) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccc----------------------- Confidence 99999999999999999999999999999999999986 589998765433322110 Q ss_pred hhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccc Q lcl|Aclame:pro 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~ 399 (497) ......++.+..++..+... +..+.+|+|||.+|..|+++||++|+|+|. T Consensus 260 -----------------------------~~~~~~~~~i~~~~~~~~~~-~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~ 309 (395) T protein:vir:43 260 -----------------------------VTAEQRIDRIRLAILQAQLA-EFPASGIVLNPIDWALIELNKDAENRYIIG 309 (395) T ss_pred -----------------------------cccchhHHHHHHHHHhhccc-cCCCcEEEEcHHHHHHHHHhhccCCceecc Confidence 11122345555555555544 445678999999999999999999999997 Q ss_pred ccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEe Q lcl|Aclame:pro 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ++. ....++|+|+||+++++||+++++||||+. +|.+++|.+++|+++++.+.+|++|+++||++.|+||++ T Consensus 310 ~~~-------~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v 381 (395) T protein:vir:43 310 SPQ-------NGTTPTLWRLPVVETQAITQDEFLTGAFSL-GAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAV 381 (395) T ss_pred ccc-------cCCCceecceeeEEcCCCCCCcEEEEeccc-eEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 542 223568999999999999999999999998 477889999999999999889999999999999999999 Q ss_pred ecccceEEEEecCC Q lcl|Aclame:pro 480 YRPSAFQLIQLKKG 493 (497) Q Consensus 480 ~~~~Af~~~~~~~~ 493 (497) ++|+||++++++++ T Consensus 382 ~~~~a~~~~~~taa 395 (395) T protein:vir:43 382 YRPEAFVTGSLTAS 395 (395) T ss_pred ecccceEEEEeccC Confidence 99999999999998 No 5 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=7.3e-68 Score=388.66 Aligned_cols=384 Identities=24% Similarity=0.337 Sum_probs=275.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++++++.++.++++++.++...+..++.+..+++.++++...+. +++++..+.+. T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~--------------------~~~~~~~~~~~ 60 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEE--------------------LTKSGTRLFDL 60 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHH Confidence 9999999988888877777765544433333333332222222221111 11111111100 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +... ......... ........... ......... .............++++.+| T Consensus 61 ~~~~--------~~~~~~~~~-----~~~~~~~~~~~-------------~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g 113 (385) T protein:vir:18 61 EQKL--------ASGAENPGE-----KKSFSERAAEE-------------LIKSWDGKQ-GTFGAKTFNKSLGSDADSAG 113 (385) T ss_pred HHHh--------hccccccch-----hhhhHHHHHHH-------------HHHHHHHhh-ccchhhHHHhhhccccccCC Confidence 0000 000000000 00000000000 000000000 00001111223345566678 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) +++||++...||+.+++.++|+++++++++++++++||+.++..+.+.|++||+.+|+++++|+++++.+||++++++|| T Consensus 114 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is 193 (385) T protein:vir:18 114 SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQAS 193 (385) T ss_pred ceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhh Confidence 88999999999999999999999999999999999999998766789999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhh Q lcl|Aclame:pro 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 241 ~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) +|+|+|++++++||.++|+++++.++|.+||+|+|+++ |.||++.++..+.... T Consensus 194 ~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~------------------------- 248 (385) T protein:vir:18 194 RQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN------------------------- 248 (385) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc------------------------- Confidence 99999999999999999999999999999999999986 6899876543322111 Q ss_pred hhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccc Q lcl|Aclame:pro 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~ 399 (497) .......+.+..++..+.. .+..+++|+|||.+|..|+++||++|+|+|. T Consensus 249 -----------------------------~~~~~~~d~i~~~~~~l~~-~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~ 298 (385) T protein:vir:18 249 -----------------------------ATGDTRADIIAHAIYQVTE-SEFSASGIVLNPRDWHNIALLKDNEGRYIFG 298 (385) T ss_pred -----------------------------ccccchHHHHHHHHHhhcc-ccCCCCEEEEcHHHHHHHHHhhcCCCceecc Confidence 1111234555566655544 4566779999999999999999999999997 Q ss_pred ccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEe Q lcl|Aclame:pro 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ++. ...+++|+|+||++++++|+++++||||++ +|.++++.+++|+++++.+++|++|++.||+++|+|++| T Consensus 299 ~~~-------~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v 370 (385) T protein:vir:18 299 GPQ-------AFTSNIMWGLPVVPTKAQAAGTFTVGGFDM-ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAH 370 (385) T ss_pred Ccc-------cCCCceecceeeEEcCcCCCCcEEEeeccc-EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 643 234568999999999999999999999998 578999999999999999889999999999999999999 Q ss_pred ecccceEEEEecCCC Q lcl|Aclame:pro 480 YRPSAFQLIQLKKGA 494 (497) Q Consensus 480 ~~~~Af~~~~~~~~a 494 (497) ++|+||+++++++++ T Consensus 371 ~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 371 YRPTAIIKGTFSSGS 385 (385) T ss_pred ecccceEEEEeccCC Confidence 999999999999999 No 6 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=7.3e-68 Score=388.66 Aligned_cols=384 Identities=24% Similarity=0.337 Sum_probs=275.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++++++.++.++++++.++...+..++.+..+++.++++...+. +++++..+.+. T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~--------------------~~~~~~~~~~~ 60 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEE--------------------LTKSGTRLFDL 60 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHH Confidence 9999999988888877777765544433333333332222222221111 11111111100 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +... ......... ........... ......... .............++++.+| T Consensus 61 ~~~~--------~~~~~~~~~-----~~~~~~~~~~~-------------~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g 113 (385) T protein:vir:19 61 EQKL--------ASGAENPGE-----KKSFSERAAEE-------------LIKSWDGKQ-GTFGAKTFNKSLGSDADSAG 113 (385) T ss_pred HHHh--------hccccccch-----hhhhHHHHHHH-------------HHHHHHHhh-ccchhhHHHhhhccccccCC Confidence 0000 000000000 00000000000 000000000 00001111223345566678 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) +++||++...||+.+++.++|+++++++++++++++||+.++..+.+.|++||+.+|+++++|+++++.+||++++++|| T Consensus 114 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is 193 (385) T protein:vir:19 114 SLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQAS 193 (385) T ss_pred ceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhh Confidence 88999999999999999999999999999999999999998766789999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhh Q lcl|Aclame:pro 241 DEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 241 ~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) +|+|+|++++++||.++|+++++.++|.+||+|+|+++ |.||++.++..+.... T Consensus 194 ~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~------------------------- 248 (385) T protein:vir:19 194 RQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN------------------------- 248 (385) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc------------------------- Confidence 99999999999999999999999999999999999986 6899876543322111 Q ss_pred hhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccc Q lcl|Aclame:pro 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~ 399 (497) .......+.+..++..+.. .+..+++|+|||.+|..|+++||++|+|+|. T Consensus 249 -----------------------------~~~~~~~d~i~~~~~~l~~-~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~ 298 (385) T protein:vir:19 249 -----------------------------ATGDTRADIIAHAIYQVTE-SEFSASGIVLNPRDWHNIALLKDNEGRYIFG 298 (385) T ss_pred -----------------------------ccccchHHHHHHHHHhhcc-ccCCCCEEEEcHHHHHHHHHhhcCCCceecc Confidence 1111234555566655544 4566779999999999999999999999997 Q ss_pred ccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEe Q lcl|Aclame:pro 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) ++. ...+++|+|+||++++++|+++++||||++ +|.++++.+++|+++++.+++|++|++.||+++|+|++| T Consensus 299 ~~~-------~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v 370 (385) T protein:vir:19 299 GPQ-------AFTSNIMWGLPVVPTKAQAAGTFTVGGFDM-ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAH 370 (385) T ss_pred Ccc-------cCCCceecceeeEEcCcCCCCcEEEeeccc-EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 643 234568999999999999999999999998 578999999999999999889999999999999999999 Q ss_pred ecccceEEEEecCCC Q lcl|Aclame:pro 480 YRPSAFQLIQLKKGA 494 (497) Q Consensus 480 ~~~~Af~~~~~~~~a 494 (497) ++|+||+++++++++ T Consensus 371 ~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 371 YRPTAIIKGTFSSGS 385 (385) T ss_pred ecccceEEEEeccCC Confidence 999999999999999 No 7 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=4.8e-67 Score=384.17 Aligned_cols=405 Identities=29% Similarity=0.420 Sum_probs=263.0 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH Q lcl|Aclame:pro 14 LAKSIKDINADET-KTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHL 92 (497) Q Consensus 14 l~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 92 (497) |+++..+...++. .+.+|+++..+++....+...+..+ +++.+...++.++........... .... T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 67 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAK----------SVKANQDFLRELQEATAGSVDSEK---SGEL 67 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHhHHhHHH---hhhH Confidence 3333332222111 1122222222222211111111111 111111111111000000000000 0000 Q ss_pred HhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHH-HHHHhhhhhhhhhhhhhhcccccCCcccccchhhHH Q lcl|Aclame:pro 93 ARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELM-GAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGI 171 (497) Q Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i 171 (497) ..... ............... ...... ........ ..................+++..+++++|+++..+| T Consensus 68 ~~~~~---~~~~~~~~~~~~~~~-~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i 138 (413) T protein:vir:81 68 TRKGE---GYKSIGEFFAKRAGD-QIKQQA-----GGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNI 138 (413) T ss_pred hhhhh---hhhhhhhhhhhhhhh-HHHHHH-----HHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHH Confidence 00000 000000000000000 000000 00000000 000000000011122233455667778888999999 Q ss_pred HHHHHhhhhHHhhccceecCCCceEEEEeecC---Cccceeecccccccccc-ccceeEEeeeeeeeeechhhHHHHhhH Q lcl|Aclame:pro 172 VEQLFYELSLADLISSRPVTSPNLSYLTESAA---HNNAAAVAEAGTYPFSS-EEFARVYEQVGKVANALTITDEGLRDA 247 (497) Q Consensus 172 i~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~---~~~a~~v~Eg~~~~~s~-~~~~~v~~~~~kia~~~~iS~ell~ds 247 (497) |+.+++.++|+++++++++++++++||+.++. ...++||+||+.+|+++ ++|+.+++.+||++++++||+|||+|+ T Consensus 139 i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds 218 (413) T protein:vir:81 139 IYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY 218 (413) T ss_pred HHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHH Confidence 99999999999999999999999999998753 24589999999999987 789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhh Q lcl|Aclame:pro 248 PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVA 326 (497) Q Consensus 248 ~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (497) ++|++||+++|+++++.++|.+||+|+|+++ |.||++.++..+..... T Consensus 219 ~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~------------------------------- 267 (413) T protein:vir:81 219 DFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSN------------------------------- 267 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccc------------------------------- Confidence 8899999999999999999999999999986 58998876554333221 Q ss_pred hhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccc Q lcl|Aclame:pro 327 SLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAY 406 (497) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~ 406 (497) .....+.+..++........+.+++|+|||.+|..|++|||++|+|||.++..... T Consensus 268 ------------------------~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~ 323 (413) T protein:vir:81 268 ------------------------KDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQY 323 (413) T ss_pred ------------------------cchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccc Confidence 11233445555555555666677889999999999999999999999988776655 Q ss_pred ccc-ccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccce Q lcl|Aclame:pro 407 GNP-VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 407 ~~~-~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af 485 (497) +.+ ....++|||+||++++++|+++++||||++ +|.+++|.+++++++++..++|++|++.||+++|+|+.+++|+|| T Consensus 324 ~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 402 (413) T protein:vir:81 324 GSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRS-AASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAI 402 (413) T ss_pred cccccccCceecceeeEEcCCCCcccEEEEeccc-EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccce Confidence 443 235568999999999999999999999998 478999999999999999889999999999999999999999999 Q ss_pred EEEEecCCCCC Q lcl|Aclame:pro 486 QLIQLKKGATG 496 (497) Q Consensus 486 ~~~~~~~~a~~ 496 (497) ++|+++++++= T Consensus 403 ~~l~~~~~~~p 413 (413) T protein:vir:81 403 VQLDVAEVVTP 413 (413) T ss_pred EEEEecCCCCC Confidence 99999887777 No 8 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.2e-67 Score=387.41 Aligned_cols=388 Identities=27% Similarity=0.354 Sum_probs=257.1 Q ss_pred CchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQ-LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |...-+ +++++.++.++++. +.++..... +...+..++++++.+++++++.++++ T Consensus 1 m~e~~~~l~~~~~~~~~~~~~------------------~~e~~~~~~------~~~~e~~~~~~~~~~e~~~l~~~i~~ 56 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRA------------------FGERAVRDG------ELNASARSKVDELFATVGNLSAEVQA 56 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHH------------------HHHHHHhhc------ccCHHHHHHHHHHHHHHHHHHHHHHH Confidence 222211 11111111111111 111111100 01112222333333333333333322 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+........... . .....+... ......... ...... .....................++++.+ T Consensus 57 ~~~~~~~~~~~~~-~---~~~~~~~~~-----~~~~~~~~~--~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 (390) T protein:vir:10 57 ARQRVAELEGNGA-G---GDVQHVSVG-----DLFVASEQF--QASAGR----WNDRSARATMNIKAALNTASTDAAGSA 121 (390) T ss_pred HHHHHHHHHhhcc-c---ccccccchh-----hhhhhhHHH--HHHHHh----hhhhhhhhhhHHHHHHHhhhccccccc Confidence 2111100000000 0 000000000 000000000 000000 000000001111122233344566677 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~i 239 (497) |+++||++...||+.+++.++|+++|+++++++++++||++++.++.+.|++||+.+|+++++|+++++.++|++++++| T Consensus 122 g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~i 201 (390) T protein:vir:10 122 GALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKA 201 (390) T ss_pred ccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehh Confidence 88999999999999999999999999999999999999999887778999999999999999999999999999999999 Q ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 240 TDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|++++++||.++|++++++++|.+||+|+|+++ |.||++.++....+.. T Consensus 202 s~ell~d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~------------------------ 257 (390) T protein:vir:10 202 TRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT------------------------ 257 (390) T ss_pred hHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccc------------------------ Confidence 999999999999999999999999999999999999876 9999987543322111 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) .......+.+..++..+.. .+..+++|+|||.+|..|+++||++|+||| T Consensus 258 ------------------------------~~~~~~~~~~~~~~~~l~~-~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~ 306 (390) T protein:vir:10 258 ------------------------------IAGATRVDQLRLAMLQASL-AEYPASGIVINPIDWAAIELAKDANNQYLI 306 (390) T ss_pred ------------------------------ccccchHHHHHHHHHhhcc-ccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 0111223444555555544 456678899999999999999999999999 Q ss_pred cccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccE Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .++.. ..+++|+|+||+.++.||+++++||||++ +|.+++|.+++|++++++ .+|++|++.||++.|+||+ T Consensus 307 ~~~~~-------~~~~~l~G~pv~~~~~~p~~~~~~gdf~~-~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~~~r~d~~ 377 (390) T protein:vir:10 307 GNARG-------TLTPTLWGLPVVATQAMAPGEFLVGAFDL-AAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALV 377 (390) T ss_pred cCCcC-------cCCceecceeeEEcCCCCCCcEEEEeccc-eEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccE Confidence 87542 23568999999999999999999999998 477899999999998764 4799999999999999999 Q ss_pred eecccceEEEEec Q lcl|Aclame:pro 479 VYRPSAFQLIQLK 491 (497) Q Consensus 479 v~~~~Af~~~~~~ 491 (497) |++|+||+++++. T Consensus 378 v~~~~a~~~~~~a 390 (390) T protein:vir:10 378 VYRPEALISGSFA 390 (390) T ss_pred EeccccEEEEEeC Confidence 9999999999999 No 9 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=2.4e-67 Score=385.80 Aligned_cols=387 Identities=27% Similarity=0.388 Sum_probs=260.5 Q ss_pred CchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQ-LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQA-EVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |..+.+ ++++++++.++++.+ .++.+...+ ..+..++.+++.++++++++++++++..+. T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~------------------~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~ 62 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAF------------------GERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVA 62 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHH------------------HHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433322 222232222222221 111111100 011112223333333333333333332222 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) +.+..... .. ...+........... . ......... .. ................++++. T Consensus 63 ~~~~~~~~--------~~---~~~~~~~~~~~~~~~-----~--~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~~~~ 120 (390) T protein:vir:81 63 ELEGNGAG--------GD---VQHVSVGDMFVASEQ-----F--QASAGRWND---RS-ARATMNIKAALNTASTDAAGS 120 (390) T ss_pred HHHhcccc--------cc---cccccchhhhhhhHH-----H--HHHHHHHhh---hh-hhhhhHHHHHHHhhccccccC Confidence 21111000 00 000000000000000 0 000000000 00 000001111222334456677 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeech Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~ 238 (497) +|+++||++...||+.+++.++|++++++++++++.+++|+.++.++.++||+||+.+|+++++|+++++.++|++++++ T Consensus 121 ~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~ 200 (390) T protein:vir:81 121 AGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMK 200 (390) T ss_pred CcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeeh Confidence 78899999999999999999999999999999999999999988767899999999999999999999999999999999 Q ss_pred hhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|+|+|++++++||.++|++++++++|.+||+|+|+++ |.||++.++....+... T Consensus 201 is~ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~---------------------- 258 (390) T protein:vir:81 201 ATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTI---------------------- 258 (390) T ss_pred hhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeeccccccccccc---------------------- Confidence 9999999999999999999999999999999999999986 99999876543222110 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) ......+.+..++..+... +..++.|+|||.+|..|+++||++|+|+ T Consensus 259 --------------------------------~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~lkd~~G~~l 305 (390) T protein:vir:81 259 --------------------------------AGATRVDQLRLAMLQASLA-EYNPSGIVINPIDWAAIELAKDANNQYL 305 (390) T ss_pred --------------------------------ccchhHHHHHHHHHhhccc-cCCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 1112234455555555444 4567789999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeecc Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGL 477 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~ 477 (497) |.++.. ...++|+|+||+.++++|+++++||||++ +|.+++|.+++|+++++. .+|++|++.||++.|+|+ T Consensus 306 ~~~~~~-------~~~~~l~G~pv~~~~~~p~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~-~~~~~~~v~~r~~~r~d~ 376 (390) T protein:vir:81 306 IGNARG-------TLTPTLWGLPVVATQAMAPGEFLVGAFDL-AAQIFDQWDARVEIGYVG-EDFQRNMITVLAEERLAL 376 (390) T ss_pred ecCccc-------ccCceecceeeEEcCCCCCCcEEEEehhc-eEEEEEecceEEEEeccc-chhhcCcEEEEEEEeecc Confidence 987432 34468999999999999999999999998 477899999999998865 479999999999999999 Q ss_pred EeecccceEEEEec Q lcl|Aclame:pro 478 LVYRPSAFQLIQLK 491 (497) Q Consensus 478 ~v~~~~Af~~~~~~ 491 (497) +|++|+|||++++. T Consensus 377 ~v~~~~a~v~~t~a 390 (390) T protein:vir:81 377 VVYRPEALISGSFA 390 (390) T ss_pred EEecccceEEEEeC Confidence 99999999999999 No 10 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=8.9e-67 Score=382.68 Aligned_cols=387 Identities=27% Similarity=0.394 Sum_probs=260.4 Q ss_pred CchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQ-LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQA-EVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |....+ ++++++++..+++.+.++... ..+ ..+..++.+++..+++.+++++++++.... T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~------------------~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~ 62 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVR------------------DGELNASARSKVDELFATVGNLSAEVQAARQRVA 62 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHh------------------hcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433322 333333333333322221111 000 001111222233333333333333322221 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) +.+..... . ....+.......... .... . ........................++++. T Consensus 63 ~~~~~~~~-------~----~~~~~~~~~~~~~~~-----~~~~---~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 120 (390) T protein:vir:97 63 ELEGNGAG-------G----DVQHVSVGDMFVASE-----QFQA---S---TGRWNDRSARATMNIKAALNTASTDAAGS 120 (390) T ss_pred HHHhcccc-------c----ccccccchhhhhhhH-----HHHH---H---HHHhhhhhhhhhhHHHHHHHhhhcccccc Confidence 11110000 0 000000000000000 0000 0 00000000000111112223344556777 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeech Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~ 238 (497) +|+++||++...||+.+++.++|++++++++++++.++||+.++.++.+.||+||+++|+++++|+++++.+||++++++ T Consensus 121 ~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~ 200 (390) T protein:vir:97 121 AGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMK 200 (390) T ss_pred cccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeeh Confidence 78889999999999999999999999999999999999999988777899999999999999999999999999999999 Q ss_pred hhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) +|+|+++|++++++||.++|++++++++|.+||+|+|+++ |.||++.++..+.... T Consensus 201 is~ell~ds~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~----------------------- 257 (390) T protein:vir:97 201 ATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT----------------------- 257 (390) T ss_pred hhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccccccccc----------------------- Confidence 9999999999999999999999999999999999999886 9999987654322211 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) .......+.+..++..+.. .+..+++|+|||.+|..|+++||++|+|| T Consensus 258 -------------------------------~~~~~~~d~~~~~~~~~~~-~~~~~~~~v~n~~~~~~L~~lkd~~G~~l 305 (390) T protein:vir:97 258 -------------------------------IAGATRVDQLRLAMLQASL-AEYPASGIVINPIDWAAIELAKDANNQYL 305 (390) T ss_pred -------------------------------ccccchHHHHHHHHHhhcc-ccCCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 0111223444555555544 34567789999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeecc Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGL 477 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~ 477 (497) |.++.. ...++|+|+||++++++|+++++||||+. +|.+++|.++++++++++ .+|++|+++||++.|+|| T Consensus 306 ~~~~~~-------~~~~~l~G~pV~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~-~~f~~~~~~~r~~~r~d~ 376 (390) T protein:vir:97 306 IGNARG-------TLTPTLWGLPVVATQAMAPGEFLVGAFDL-AAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLAL 376 (390) T ss_pred ecCccC-------CCCceecceeeEEcCCCCCCcEEEEeccc-eEEEEEecceEEEEeecc-cccccCcEEEEEEEeecc Confidence 987432 23468999999999999999999999998 477899999999998765 469999999999999999 Q ss_pred EeecccceEEEEec Q lcl|Aclame:pro 478 LVYRPSAFQLIQLK 491 (497) Q Consensus 478 ~v~~~~Af~~~~~~ 491 (497) .|+||+|||++++. T Consensus 377 ~v~~~~a~v~~~~a 390 (390) T protein:vir:97 377 VVYRPEALITGSFA 390 (390) T ss_pred EEeccccEEEEEeC Confidence 99999999999999 No 11 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=7.3e-66 Score=377.70 Aligned_cols=404 Identities=19% Similarity=0.238 Sum_probs=254.3 Q ss_pred CchH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPST-------AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAH-------ERAQEMLKSLGGA 66 (497) Q Consensus 1 m~~~-------~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~-------e~~~e~~~~~~~~ 66 (497) |-+- ++|.....++.+.+.++.++.. .|+++.++++..+.+..+++.+.. ....+...+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~e~ra~~~---~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~ 77 (425) T protein:vir:10 1 MSKKLLIAVLTAALTGPVGAVPRGIISVRAEGP---TEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKV 77 (425) T ss_pred CchhHHHHhhHHHhhhhhhhhhHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHH Confidence 4331 1112222222222222222222 222222222222222211111100 0001111112222 Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 67 DAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPA 146 (497) Q Consensus 67 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (497) ..+++.++..+.+.... ... .... ...... ....+.+..+....+... T Consensus 78 ~~ei~~~~~~~~~~~~~--------~~~-----~~~~-----~~~~~~--------------~~~~~~~~af~~~l~~~e 125 (425) T protein:vir:10 78 SADLEALQAAVDEANIK--------IAA-----AQMG-----ANGVKP--------------LRDPEYTEAFKAHVKRGD 125 (425) T ss_pred HHHHHHHHHHHHHHHHH--------HHh-----hhcc-----cccccc--------------cccHHHHHHHHHHhhhhh Confidence 22222211111110000 000 0000 000000 000000111111111112 Q ss_pred hhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccccc-cccee Q lcl|Aclame:pro 147 AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSS-EEFAR 225 (497) Q Consensus 147 ~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~-~~~~~ 225 (497) .......++++.+|.++|+++...|++.+++.++|+++|++++++++.+++|+.++. +.++||+|++.+|+++ ++|++ T Consensus 126 ~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~-~~a~wv~E~~~~~~~~~~~f~~ 204 (425) T protein:vir:10 126 VQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGG-TTSGWVGEASQRPQTNAATFQP 204 (425) T ss_pred hHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCC-cceeeeccccccccccccccce Confidence 222233345555566778888899999999999999999999999999999998874 6899999999999876 79999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++.++|++++++||+|+|+|+ +++++||.++|+++++.++|.+||+|+|+++|.||++..+..+.......... T Consensus 205 v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~---- 280 (425) T protein:vir:10 205 LSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAI---- 280 (425) T ss_pred eeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccccccccccccc---- Confidence 9999999999999999999997 69999999999999999999999999999999999987654433221100000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) . ............+++.+.+..+... +..+..|+|||.+|. T Consensus 281 ----------------------------------~----~~~~~~~~~~~~d~l~~l~~~l~~~-~~~~a~~vmn~~~~~ 321 (425) T protein:vir:10 281 ----------------------------------E----VVNSGAAADITSDGIIDLVYDLPSA-FTGNARFAMNRNTQR 321 (425) T ss_pred ----------------------------------c----cccccccccccHHHHHHHHhhhhhh-hccCCEEEEchHHHH Confidence 0 0000111222345566666665544 455678999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCc-----CcEEEeeccceEEEEEeccccEEEEeccch Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) .|+++||++|+|+|.+.... +.+.+|+|+||+++++||+ ..++||||+. +|.+++|.++++..++ T Consensus 322 ~L~~lkD~~G~~l~~~~~~~------g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~~i~~~~~~~v~~d~--- 391 (425) T protein:vir:10 322 QVRKLKDGQGNYLWQPSYVA------GQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQ-TYLIIDRIGVRVLRDP--- 391 (425) T ss_pred HHHHhhcCCCceeeccCccC------CCCceecceeeEEecCcCCccCCccEEEEEehhc-cEEEEEecceEEEecc--- Confidence 99999999999999886432 3446899999999999984 2388999998 5789999998886554 Q ss_pred hhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 460 ~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) +|.+|++.||++.|+|++|+||+||++++++++- T Consensus 392 -~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 392 -YTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred -cccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 4679999999999999999999999999988877 No 12 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.1e-65 Score=376.75 Aligned_cols=385 Identities=15% Similarity=0.119 Sum_probs=252.3 Q ss_pred CchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTA--QLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~--~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) ||+.. ++++++.++.++++.+.++... ....++..++++.++.+++.++.+++ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~-------------------------~~~~~e~~~~~~~l~~e~~~l~~~i~ 55 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAG-------------------------KEMTAEAREKEERLLTAVADFDGRIK 55 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhc-------------------------ccccHHHHHHHHHHHHHHHHHHHHHH Confidence 77543 4445555555555444332110 00111111222222222222222221 Q ss_pred HHhHHHHH--HHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccc Q lcl|Aclame:pro 79 EVEVRNLK--QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST 156 (497) Q Consensus 79 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (497) +..+.... ................. ....... .. ..+.......+..........+++ T Consensus 56 ~~~e~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~----~~--------~~r~g~~~~~~~~~~~~~~~~~t~ 115 (392) T protein:vir:13 56 RGIDAIKATDAVTSLLSGLQGSGSGAQ--------RSADHDD----DA--------VLRAGNLGEARSFEFAPEKRDGTK 115 (392) T ss_pred HHHHHHHHHHHHHHHhcccCCcccchh--------hhhhHHH----HH--------HHhccchhhhHHHHhhhhhhcccc Confidence 11000000 00000000000000000 0000000 00 000000000011111112223445 Q ss_pred ccCCcccccchhhHHHHHHHhhh-hHHhhccceecCCC-ceEEEEeecCCccceeeccccccccccccceeEEeeeeeee Q lcl|Aclame:pro 157 GTFAPGILPTFLPGIVEQLFYEL-SLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) Q Consensus 157 ~~~g~~i~~~~~~~ii~~~~~~~-~l~~~~~~~~~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia 234 (497) +++|+++||++...+|..++..+ .+++++++++++++ .+.+|+.++ .+.++||+|++.+|+++++|+.+++.+||++ T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~ 194 (392) T protein:vir:13 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITG-RATAGIVGETAEIPESYPATTQRSMGGFKYG 194 (392) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcC-CcceeeecccccccccccceeeEEeeeeeEE Confidence 55677889998888887776655 57788888888654 588998877 5789999999999999999999999999999 Q ss_pred eechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 235 NALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 235 ~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) ++++||+|+|+|+ +++++||.++|+++++.++|.+||+|+|+++|.||++..+..+...... T Consensus 195 ~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~----------------- 257 (392) T protein:vir:13 195 FASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEA----------------- 257 (392) T ss_pred eeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccccccccc----------------- Confidence 9999999999997 5899999999999999999999999999999999998764332211100 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhccc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) .......+.+...+..+... +..+..|+|||.++..|+++||++ T Consensus 258 -----------------------------------~~~~~~~d~l~~~~~~l~~~-~~~~a~~v~n~~~~~~l~~lkd~~ 301 (392) T protein:vir:13 258 -----------------------------------DADSKVSDALIDLFHEVPSA-YRKNAKFVVNDLRAAQMRKLKDAN 301 (392) T ss_pred -----------------------------------ccccccHHHHHHHHHhhhhh-hhcCCEEEEcHHHHHHHHHhhccC Confidence 00111234444555554433 445668999999999999999999 Q ss_pred CcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEe Q lcl|Aclame:pro 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 394 G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~ 473 (497) |+|+|.+.... +.+.+|+|+||++++++|+++++||||++ |.++++.+++|+.+.+. +|.+|++.||++. T Consensus 302 G~~l~~~~~~~------g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~--~~i~~~~~~~i~~~~~~--~~~~~~~~~r~~~ 371 (392) T protein:vir:13 302 GQYLWQSALTV------GAPDTFNGKVVETDDGMPADKVLFADLSK--YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQ 371 (392) T ss_pred CceeecCCcCC------CCCceecceeeEEcCCCCCCcEEEeeccc--eeEEeecceEEEeeccc--cccCCcEEEEEEE Confidence 99999876433 34458999999999999999999999997 67889999999988765 5999999999999 Q ss_pred eeccEeecccceEEEEecCCC Q lcl|Aclame:pro 474 RLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 474 r~~~~v~~~~Af~~~~~~~~a 494 (497) |+|+++.||+||+.++++++| T Consensus 372 r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 372 RADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EeccEEecccceEEEEeeccC Confidence 999999999999999999999 No 13 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.8e-65 Score=375.58 Aligned_cols=378 Identities=13% Similarity=0.161 Sum_probs=254.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. ..+++++++++..+++.......++ .+...+....++.. +...+.++++++..++..++.+. T Consensus 1 m~-~~e~~~~~~~~~~~l~~~~~~~~~e---~~~~~e~~~~~~~~------------~~~~~~~e~~~~~~~l~~~~~~~ 64 (379) T protein:vir:10 1 ME-ALEIKVALEAIKGQVDSKSSAQALE---VKGLIEALEAKMTS------------EKDLAVNELKSDMAALQAHADKL 64 (379) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHhHhhH------------HHHHHHHHHHHHHHHHHHHHHHH Confidence 76 4556666666555554433322221 11111111111111 11112223333333332222221 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.+.... .......+ ......... .....+.+.. ...........++++.++ T Consensus 65 e~~~~~~--------~~~~~~~~-----~~~~~~~~~----------~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ 116 (379) T protein:vir:10 65 DVKLKEK--------AKSEDKSD-----SLVKSITEN----------FNDIKEVRNG-----KSIQVKAVGDMTLPVNLT 116 (379) T ss_pred HHHHHhc--------ccccccch-----hHHHHHHHH----------HHhHHHHHhh-----hhhhhhhhcccccCCCCc Confidence 1111000 00000000 000000000 0000000000 000111122234444556 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCC-ccceeeccccccccccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAH-NNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~-~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~i 239 (497) +.+|+++...||+.+++.++|+++|+++++++++++||+.++.+ +.+.|++||+.+|+++++|++|++++||++++++| T Consensus 117 ~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~i 196 (379) T protein:vir:10 117 GAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRY 196 (379) T ss_pred cccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehh Confidence 67889999999999999999999999999999999999998643 45789999999999999999999999999999999 Q ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhh Q lcl|Aclame:pro 240 TDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 240 S~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) |+|||+|++++++||.++|+++++.++|.+|++|+|+..+.+..... T Consensus 197 S~ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~--------------------------------- 243 (379) T protein:vir:10 197 SKKMANNLPFLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIIT--------------------------------- 243 (379) T ss_pred hHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc--------------------------------- Confidence 99999999999999999999999999999999998865433322111 Q ss_pred hhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccc Q lcl|Aclame:pro 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~ 399 (497) ....++.+..++..+..+ ++.+++|+|||.+|..|+++||++|+|+|+ T Consensus 244 -------------------------------~~~~~d~i~~~~~~~~~~-~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~ 291 (379) T protein:vir:10 244 -------------------------------NKNKVEMLINEIAKQENL-DFPVTAIVLRPTDYYDILVTQKSVGAGYGL 291 (379) T ss_pred -------------------------------CcccHHHHHHHHHhhhhc-cCCCCEEEEcHHHHHHHHHhhccCCceecc Confidence 011133445555554443 556778999999999999999999999998 Q ss_pred ccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEe Q lcl|Aclame:pro 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLV 479 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v 479 (497) +..... .+.+++|||+||++|++||+|+++||||++ |.+.+|++++|+++++..++|++|++.||+++|+|+.| T Consensus 292 ~~~~~~----~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~--~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v 365 (379) T protein:vir:10 292 PGVVTQ----DNGVLRINGIPLFRATWLAANKYYVGDWTR--VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAV 365 (379) T ss_pred CCccCC----CCCcceecceeeEecCCCCCCceEEeeccc--EEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEE Confidence 755332 234569999999999999999999999998 34667889999999998889999999999999999999 Q ss_pred ecccceEEEEecCC Q lcl|Aclame:pro 480 YRPSAFQLIQLKKG 493 (497) Q Consensus 480 ~~~~Af~~~~~~~~ 493 (497) +||+|||++++++. T Consensus 366 ~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 366 EQPAALIFGDFTAV 379 (379) T ss_pred ecCccEEEEEecCC Confidence 99999999999999 No 14 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=2.3e-64 Score=369.46 Aligned_cols=397 Identities=14% Similarity=0.155 Sum_probs=256.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-.+.+++++..++.+...++.++..+.+.+.++...++ ..+++.+.+.+++++....+. T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~~~~~e~~~~~l--------------------~~~~e~~~~~~~~~e~~~~~~ 60 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKRIDAIEQEKGKL--------------------AGEVETLNGKLAELENLKSDL 60 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHH Confidence 555555555444444444433322222222221111111 111111111111111111111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.... ...+.... ....... +....+. ...+... ..............++...+| T Consensus 61 ~~~~~-~~~~~~~~----~~~~~~~---e~~~a~~----------------~~l~~g~-~~~~~~~e~~a~~~~t~~~gG 115 (407) T protein:vir:48 61 EAELA-EVKRPAGG----TQNKVAS---EHKEAFI----------------GFMRKGR-EDGLRELERKALQVGNDEDGG 115 (407) T ss_pred HHHHH-Hhhccccc----cccchhh---HHHHHHH----------------HHHhccc-hhhhhHHHHHhhhcccCCCCc Confidence 00000 00000000 0000000 0000000 0000000 000001111222333444555 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccc-cccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++.+.|++.+++.++|+++|++++++++.+.+|+.++. ..+.|++|++.+|++ .++|+++++.+||++++++| T Consensus 116 ~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~i 194 (407) T protein:vir:48 116 YAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGG-TTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQA 194 (407) T ss_pred ccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCC-cceeeecccccccccccccceeEEeeeeeeEeehhh Confidence 5677788999999999999999999999999999999998774 679999999999976 58999999999999999999 Q ss_pred hHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|+|+|+. ++++||.++|+++++.++|.+|++|+|+++|.||++............... T Consensus 195 S~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~------------------- 255 (407) T protein:vir:48 195 TQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGK------------------- 255 (407) T ss_pred HHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccccccc------------------- Confidence 999999985 899999999999999999999999999999999998755432221100000 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) .. ............+++...+..+... +.....|+||+.+|..|+++||++|||+| T Consensus 256 ----------------------~~-~~~~~~~~~~~~d~i~~l~~~l~~~-~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~ 311 (407) T protein:vir:48 256 ----------------------LQ-HIASGAASGVTADAIIKLIYTLRKA-HRSGAKFMMNNSSLFAIRLLKDNDGNYLW 311 (407) T ss_pred ----------------------cc-ccccccccccChHHHHHHHHhhchh-hhcCCEEEEcHHHHHHHHHhhccCCceee Confidence 00 0000011112345666666666554 44566899999999999999999999999 Q ss_pred cccccccccccccccccccccceeecCCCCc-----CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEe Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~ 473 (497) ++.... +.+.+|||+||+++++||. ..++||||+. +|.+++|.+++|..++ +|.+|++.||++. T Consensus 312 ~~~~~~------g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~~i~~~~~~~i~~d~----~~~~~~~~~~~~~ 380 (407) T protein:vir:48 312 RPGIEL------GQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKR-GYTIVDRIGTRILRDP----YTNKPFVGFYTTK 380 (407) T ss_pred ccCcCC------CCCceecceeeEEecCcCCccCCccEEEEEeccc-cEEEEEeeceEEEeec----cccCCcEEEEEEE Confidence 886533 3345899999999999985 2378899998 5889999999987654 4679999999999 Q ss_pred eeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 474 RLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 474 r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |+|++|++|+||++++++++++++ T Consensus 381 r~d~~v~~~~a~~~l~~~aa~~~~ 404 (407) T protein:vir:48 381 RTGGMLVDSQAIKLMKIGAATRQK 404 (407) T ss_pred EeccEEecccceEEEEeeccCCCC Confidence 999999999999999999999998 No 15 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.2e-64 Score=370.94 Aligned_cols=385 Identities=15% Similarity=0.130 Sum_probs=251.3 Q ss_pred CchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTA--QLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~--~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) ||+.. ++++++..+.++++.+.++.. ..+..++..++++.++++++.++.+++ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~-------------------------~~~lt~e~~~~~~~l~~e~~~l~~~i~ 55 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFA-------------------------GKEMTDEAREKEERLITAVSDYDARIK 55 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhh-------------------------cccccHHHHHHHHHHHHHHHHHHHHHH Confidence 66543 344444444444444332111 011112223333334444444333332 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) +......... ... .................. ... .. ..+.......+..........+++.+ T Consensus 56 ~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~--~~~----~~--------~~r~~~~~~~r~~~~~~~~~~~t~~~ 117 (390) T protein:vir:62 56 RGIEAIKAID--PVT--SLLSGLQGSGSGAQRSAD--VDD----DA--------TLRAGNLGEARSFEFAPEKRDGTKAG 117 (390) T ss_pred HHHHHHHHHH--HHH--HHHhhcccccccchhhcc--hHH----HH--------HHhhhhhhhhHHHHhhhhhhcccccC Confidence 2111100000 000 000000000000000000 000 00 00000000001111111222345556 Q ss_pred CCcccccchhhHHHHHHH-hhhhHHhhccceecCCC-ceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeee Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLF-YELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANA 236 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~-~~~~l~~~~~~~~~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~ 236 (497) +|+++||++...+|..++ ..+.+++++++++++++ .+++|+.++. +.+.||+|++.+|+++++|+++++++||++++ T Consensus 118 ~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~-~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~ 196 (390) T protein:vir:62 118 NPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGR-SSASIVGETAEIPESYPATAQRSMGGFKYGFA 196 (390) T ss_pred CCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCC-cceeeecccccccccccceeeeEeeeeeEEee Confidence 678888888887776554 55567889999998764 5899998874 67999999999999999999999999999999 Q ss_pred chhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|+|+|+ +++++||.++|+++++.++|.+||+|+| +|.||++.....+...... T Consensus 197 ~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--~p~Gi~~~~~~~~~~~~~~------------------- 255 (390) T protein:vir:62 197 SVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG--QPRGILTDASPATATFLAT------------------- 255 (390) T ss_pred hHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC--ccccccccccccccceecc------------------- Confidence 99999999998 5899999999999999999999999988 6899998754432221100 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc Q lcl|Aclame:pro 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) .......+.+...++.+... +.....|+||+.++..|++|||++|+ T Consensus 256 ---------------------------------~~~~~~~~~l~~~~~~l~~~-~~~~a~~vmn~~~~~~L~~lkd~~g~ 301 (390) T protein:vir:62 256 ---------------------------------DTDSKVSDALIDLFHEVPSA-YRANAKYVVNDLRAAQMRKLKDANGQ 301 (390) T ss_pred ---------------------------------cccccchHHHHHHHHhhhhh-hhcCCEEEEchHHHHHHHHhhccCCC Confidence 00112234444555554433 34455799999999999999999999 Q ss_pred ccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeee Q lcl|Aclame:pro 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) |||++.... +.+.+|+|+||++++++|++.++||||++ |.++++.+++++++.+. +|.+|++.||++.|+ T Consensus 302 ~l~~~~~~~------g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~--~~i~~~~~~~v~~~~~~--~~~~~~~~~~~~~r~ 371 (390) T protein:vir:62 302 YLWQSGLTV------GAPSLFNGKVVETDDGMPADKILFADLSK--YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQRA 371 (390) T ss_pred eeecCCcCC------CccceecccceEEecCCCCccEEEeeccc--eeEEeecceEEEeeccc--cccCCcEEEEEEEEe Confidence 999886543 23458999999999999999999999997 67899999999998765 599999999999999 Q ss_pred ccEeecccceEEEEecCCC Q lcl|Aclame:pro 476 GLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 476 ~~~v~~~~Af~~~~~~~~a 494 (497) |++|+||+||+.|+++++| T Consensus 372 d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 372 DGLLVDARGAKVLTVTPGA 390 (390) T ss_pred CcEeechhheEEEEeecCC Confidence 9999999999999999999 No 16 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=2.1e-63 Score=364.26 Aligned_cols=413 Identities=15% Similarity=0.177 Sum_probs=244.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH Q lcl|Aclame:pro 5 AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRN 84 (497) Q Consensus 5 ~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~ 84 (497) ++|++.. +.++++.+.... .++.+.++.+++...+++...+.....++...+..+++.++++++++.......+... T Consensus 1 ~~~~~~~--~~~el~~~~~~l-~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~ 77 (425) T protein:vir:95 1 MALRQLM--LTKKIEQRKAAL-DELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEI 77 (425) T ss_pred CchHHHH--HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333321 112222211111 1111111111111111111111111111111222222223333333322222221111 Q ss_pred HHHHHHHHHhhh--hhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhh-hhhhcccccCCc Q lcl|Aclame:pro 85 LKQIRKHLARAV--IMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIG-QNPFGSTGTFAP 161 (497) Q Consensus 85 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~ 161 (497) . .....+.... ....+............. .......... ......+.... ...... ....++++++|. T Consensus 78 ~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~----~~~~~~~~~~~-~~~~~~~~~~~~~~~~gg~ 148 (425) T protein:vir:95 78 A-QLEDELEQINSKQPSNQSRQKMQGSKGDVV---EMNRLQVREM----LKTGEYYKRSE-VVEFYEKFRNLRAVAGGEL 148 (425) T ss_pred H-HHHHHHHHhhhhccchhhhhhhhhhhhhHH---HHHHHHHHHH----HhhhhhhhhhH-HHHHHHHHHhhcccccCce Confidence 0 0001110000 000000000000000000 0000000000 00000000000 011111 112223344455 Q ss_pred ccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccccc-ccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 162 GILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSS-EEFARVYEQVGKVANALTIT 240 (497) Q Consensus 162 ~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~-~~~~~v~~~~~kia~~~~iS 240 (497) +||+++...|++.+++.++|+++++++++++ ..++|+.++ .+.++||+|++.+|+++ ++|++|++++||++++++|| T Consensus 149 ~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS 226 (425) T protein:vir:95 149 TIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTD-TSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVD 226 (425) T ss_pred eccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecC-Cccccccccccccccccccccceeeeeheeeeeeehhh Confidence 6666788889999999999999999999875 579999876 57899999999999887 79999999999999999999 Q ss_pred HHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCc--cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 241 DEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYP--GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 241 ~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~--~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) +|||+|++ ++++||.++|+++++.++|.+||+|+|++ +|.||++..+....... T Consensus 227 ~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~----------------------- 283 (425) T protein:vir:95 227 NYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTV----------------------- 283 (425) T ss_pred HHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccccccc----------------------- Confidence 99999985 89999999999999999999999999975 79999975433211100 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-cCCceEEEehhHH----HHHHHHhcc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPRDW----ELLRLTKDA 392 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~~~~----~~l~~lkd~ 392 (497) .......+++...+..+...+. ....+|+||+.++ ..|+++||+ T Consensus 284 -------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~ 332 (425) T protein:vir:95 284 -------------------------------EADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDS 332 (425) T ss_pred -------------------------------ccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCC Confidence 0001112233333333333332 2345699999986 467889999 Q ss_pred cCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 393 ~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) +|+|+|..+ ....++|||+||+.++++|++.++||||++ |.+++|.+++|.++++. +|.+|+++||++ T Consensus 333 ~g~~i~~~~--------~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~--~~~~~~~~~~i~~~~~~--~f~~~~~~~~~~ 400 (425) T protein:vir:95 333 NGNVVGKLP--------NLRTPDLLGLRVVFNNFLDDDTVLFGEFEQ--YTLVERENITIDSSTHV--KFTEDQTAFRGK 400 (425) T ss_pred CCceeeccC--------CCCCccccceeeEEcCcCCCccEEEEeccc--EEEEeecceEEEeeccc--ccccCceEEEEE Confidence 999999764 234568999999999999999999999997 67889999999999876 599999999999 Q ss_pred eeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .|+|+.++||+||+++++++...|- T Consensus 401 ~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 401 GRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred EeeCcEeecccceEEEEecCcCCCC Confidence 9999999999999999999966666 No 17 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=4.3e-64 Score=367.97 Aligned_cols=402 Identities=17% Similarity=0.174 Sum_probs=255.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKT---AAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~---~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) |+++.+|++++.++.++++.+..+..+. -++..+.++++..+++.+.++++..+..++..+... ... T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~----------~~~ 70 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVA----------KPV 70 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------hhh Confidence 9999999999999888888766543211 111222223333333333333222211111100000 000 Q ss_pred HHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) .... .........+.+... +......... .......... ... +...............++++ T Consensus 71 ~~~~----------~~~~~~~~~~~~~~~----~~~~~~~~~~--~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~ 132 (428) T protein:vir:10 71 KATQ----------HGPAVIVKAEPKQYT----GAGMTRMVMS--IAAAQGNLQD-AAK-FASDELNDQSVSMAISTAAG 132 (428) T ss_pred hchh----------hccccccccccchhh----hHHHHHHHHH--HHHhhhhHHH-HHH-HhhhhhhhhhHhhhhccccc Confidence 0000 000000000000000 0000000000 0000000000 000 00000001111222233344 Q ss_pred cCCcccccchhhHHHHHHHhhhhHHhh-ccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeee Q lcl|Aclame:pro 158 TFAPGILPTFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANA 236 (497) Q Consensus 158 ~~g~~i~~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~ 236 (497) .+|.+||+++..+||+.+++.++|+++ +++++++++.++||+.++. +.++|++||+.+|+++++|++|++.++|++++ T Consensus 133 ~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~ 211 (428) T protein:vir:10 133 SGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGG-ATASYTGENQDAKVSEARFDDVKLTAKTMIAM 211 (428) T ss_pred CCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCC-cceeeeccCccccccccceeeEEeeeEEEEEe Confidence 455567778889999999999999998 6788888888999999874 78999999999999999999999999999999 Q ss_pred chhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhc Q lcl|Aclame:pro 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 237 ~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) ++||+|||+|+ +++++||.++|+++++.++|.+||+|+|++ +|.||++.++............ T Consensus 212 v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~--------------- 276 (428) T protein:vir:10 212 VPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADA--------------- 276 (428) T ss_pred ehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccc--------------- Confidence 99999999997 789999999999999999999999999986 7999998765432211100000 Q ss_pred chhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHH---hhhhhhhhhccCCceEEEehhHHHHHHHHhc Q lcl|Aclame:pro 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFD---AFVDIQLTLFQTPNAVVMNPRDWELLRLTKD 391 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd 391 (497) ....+.++.... ..... ...+.....|+|||.++..|+++|| T Consensus 277 ----------------------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~n~~~~~~L~~lkd 321 (428) T protein:vir:10 277 ----------------------------------AVNLDTIDTYLDSIILMSMD-GNSNMISSGWGMSNRTYMKLFGLRD 321 (428) T ss_pred ----------------------------------cccHHHHHHHHHHHHHhhhc-cccccccCEEEEcHHHHHHHHHhhc Confidence 000001111111 11111 1122335689999999999999999 Q ss_pred ccCcccccccccccccccccccccccccceeecCCCCcC--------cEEEeeccceEEEEEeccccEEEEeccch---- Q lcl|Aclame:pro 392 ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG---- 459 (497) Q Consensus 392 ~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~---- 459 (497) ++|+|+|++. ..++|+|+||+++++||++ .++||||+. |.++++.+++|+++++.. T Consensus 322 ~~G~~i~~~~----------~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~--~~i~~~~~i~i~~~~~~~~~~~ 389 (428) T protein:vir:10 322 GNGNKVYPEM----------AQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFND--VVIGEDGNMKVDFSKEASYIDT 389 (428) T ss_pred cCCceeccCC----------CCCeeeceeeEEeccccccccCCCccceEEEEecce--EEEEEecceEEEeecccccccc Confidence 9999999653 2237999999999999864 389999997 568899999999998753 Q ss_pred -----hhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 460 -----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 460 -----~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ++|++|+++||+++|+||.|.||+||++++-..= T Consensus 390 ~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 390 DGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 6799999999999999999999999999986555 No 18 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=3.6e-63 Score=362.95 Aligned_cols=410 Identities=25% Similarity=0.307 Sum_probs=261.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) ||....+++.+.++.+....... ..++..++.++..+..+++++ +.+.+..+++.++...+.. T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~-~~~~~~~~~~e~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~ 63 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSL-TTEQVQEIVAEARGLADALQA----------------ESDRAAARAALLRTAPPAP 63 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHH Confidence 99998888766555443322211 111111111111111111111 1111111111111100000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... .. ......+ .................... .......................++...++ T Consensus 64 ~~~~~~~~~~--~~--~~~~~~~-----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (419) T protein:vir:94 64 KGPADGGTPL--TP--AEAGTFR-----SLAQRFADSDGLREYRARDK--RGQFQVEMRDIDPNRLLSRDAPAGTITNPN 132 (419) T ss_pred HHHhhhhccc--cc--ccccccc-----chhhhhhhHHHHHHHHHhhh--hhhhhHHHHHHHHHHhhccccccccccCCc Confidence 0000000000 00 0000000 00000000000000000000 000000000000111111222334445566 Q ss_pred cccccchhhHHHHH-HHhhhhHHhhccceecCCCceEEEEeecCC-------ccceeeccccccccccccceeEEeeeee Q lcl|Aclame:pro 161 PGILPTFLPGIVEQ-LFYELSLADLISSRPVTSPNLSYLTESAAH-------NNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 161 ~~i~~~~~~~ii~~-~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~-------~~a~~v~Eg~~~~~s~~~~~~v~~~~~k 232 (497) ..++|+...++|.. +...+.|++++++++++++.++||++++.+ +.++||+||+.+|+++++|+++++.+|| T Consensus 133 ~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k 212 (419) T protein:vir:94 133 VPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKT 212 (419) T ss_pred ccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeee Confidence 77888887776654 456678999999999999999999876532 3578999999999999999999999999 Q ss_pred eeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhh Q lcl|Aclame:pro 233 VANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 233 ia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) ++++++||+|+|+|++++++||.++|+++++.++|.+||+|+|+++|+||++.++..+...... T Consensus 213 ~~~~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~---------------- 276 (419) T protein:vir:94 213 VAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP---------------- 276 (419) T ss_pred EEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccc---------------- Confidence 9999999999999999999999999999999999999999999999999998765432221110 Q ss_pred hcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcc Q lcl|Aclame:pro 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDA 392 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~ 392 (497) ..........+++..++..+..++ ..+++|+|||.+|..|+++||+ T Consensus 277 ---------------------------------~~~~t~~~~~~~l~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~~k~~ 322 (419) T protein:vir:94 277 ---------------------------------TAPATDEPPLVDIRRAKTVAEIAG-FPPDGVVVHPQDWESIELDQAP 322 (419) T ss_pred ---------------------------------ccccccchhHHHHHHHHHhhhhcc-CCCCEEEEcHHHHHHHHHHhhc Confidence 011122234566677777766554 4667899999999999999998 Q ss_pred cCcc-cccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 393 NGQY-MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 393 ~G~~-~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) +|++ ++.+... ...+++|+|+||++++++|+++++||||+.. |.+++|.+++++++++.+++|++|+++||+ T Consensus 323 ~~~~~~~~~~~~------~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~ 395 (419) T protein:vir:94 323 GSGVFRVIANVQ------GEATPRIWGLNVVSTVAIAQGTALVGGFRQG-ATLWSRQGITVLMTDSHADFFTANTLVILA 395 (419) T ss_pred CCCceeecCCcc------cCCCccccceeeEEcCCCCCccEEEeeccce-EEEEEecceEEEEeccccchhhcCcEEEEE Confidence 6664 5555432 2345689999999999999999999999984 678999999999999998899999999999 Q ss_pred EeeeccEeecccceEEEEecCCCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~a~ 495 (497) +.|+|+.|++|+|||+++++++.+ T Consensus 396 ~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 396 EFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEeeccEEeccccEEEEEeccCCC Confidence 999999999999999999999999 No 19 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.2e-63 Score=365.44 Aligned_cols=393 Identities=16% Similarity=0.149 Sum_probs=248.9 Q ss_pred CchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPST-AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~-~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |... .++++.+.++.+..+++.+ ..++..++++...+ ++..+++.+++++++++..+.+ T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~-----------~~~~~~~~~e~~~~---------~l~~~~~~l~~~~~~~~~~~~~ 60 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKA-----------KNDKRVEAIEQEKG---------KLAGQVETLNGKLSELENLKSD 60 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHH Confidence 4433 2333333333222222211 11111111111111 1111222222222222222211 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+.......+.... .......+ ....+... .+...... ......+....++.+.+ T Consensus 61 ~~~~~~~~~~~~~~-----~~~~~~~e---~~~a~~~~----------------lr~~~~~~-~~~~e~~a~~~~~~~~G 115 (401) T protein:vir:44 61 LEKELLELKRPARG-----AQNKVAAE---HKDAFVGF----------------LRKGREDG-LRDLERKALQVGTDEDG 115 (401) T ss_pred HHHHHHHhhccccc-----cccchhHH---HHHHHHHH----------------Hhhhhhhh-hHHHHHHHhhcCCCCCC Confidence 11110000000000 00000000 00000000 00000000 00011122233344445 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccc-cccceeEEeeeeeeeeech Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALT 238 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~~ 238 (497) |.+||+++.+.|++.+++.++|+++|++++++++.+.+|+.++. ..+.|++|++.+|.+ .++|++|++.+||++++++ T Consensus 116 G~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~ 194 (401) T protein:vir:44 116 GYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGG-TASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQ 194 (401) T ss_pred ceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCC-ccceeeccccccCccccccceeeeeehhheeeehh Confidence 55677788999999999999999999999999999999998774 678999999999865 5899999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|+|+|+ +++++||.++|+++++.++|.+||+|+|+++|.||++.....+........... T Consensus 195 iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~---------------- 258 (401) T protein:vir:44 195 ATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQ---------------- 258 (401) T ss_pred hhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccc---------------- Confidence 999999998 589999999999999999999999999999999999876544332211100000 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) ............+++..++..+...+ ..+.+|+||+.+|..|+++||++|||+ T Consensus 259 --------------------------~~~t~~~~~~~~d~i~~~~~~l~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~~l 311 (401) T protein:vir:44 259 --------------------------HIVSGEATAVTADAIIKLIYTLRKAH-RTGAKFMMNNNSLFAIRLLKDTEGNYL 311 (401) T ss_pred --------------------------ccccccccccCHHHHHHHHHhcchhh-hcCCEEEEcHHHHHHHHHhhccCCcee Confidence 00000111123455666666665443 445689999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcC-----cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |.++... +.+.+|+|+||+++++||.. .++||||++ +|.+++|.++++..++ +|.+|+|+||++ T Consensus 312 ~~~~~~~------g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~-~~~i~~~~~~~~~~~~----~~~~~~v~~~a~ 380 (401) T protein:vir:44 312 WRPGLEL------GQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKR-GYTIVDRIGTRILRDP----YTNKPFVGFYTT 380 (401) T ss_pred ecCCcCC------CCCceecceeeEEecCcCCccCCccEEEEeehhc-cEEEEEecceEEeeec----cccCCcEEEEEE Confidence 9886433 23458999999999999852 278899998 5789999999987654 477999999999 Q ss_pred eeeccEeecccceEEEEecCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~ 493 (497) .|+|++|++|+||++|+++++ T Consensus 381 ~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 381 KRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEeccEEecccceEEEEeecC Confidence 999999999999999999999 No 20 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=6.3e-61 Score=350.64 Aligned_cols=430 Identities=17% Similarity=0.113 Sum_probs=251.4 Q ss_pred CchHHHHHHH--HHHHHHHHHHHHHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQ--GRQLAKSIKDINADET-KTAAEKK-EALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDND 76 (497) Q Consensus 1 m~~~~~~~~~--~~~l~~~~~~~~~~~~-~~~~e~~-~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~ 76 (497) |+.-. +++| +.++++..+.++.... .+..+++ +..++...++.... + ....|.++++++..++++.+..+ T Consensus 3 ~~~~~-~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~---~--k~~~E~~~~le~~~ee~k~l~ee 76 (458) T protein:vir:10 3 IDINK-LKEELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLV---S--KAVGEDRKRLEEALELVKSLDEK 76 (458) T ss_pred cchhh-hhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H--HHHHHHHHHHHHHHHHHHHHHHH Confidence 22211 1111 1112222222111111 0111110 00000000000000 0 00112222222222222222222 Q ss_pred HHHHhHHHHHHHHHHHHhhhhhhHH---------Hhhh-hhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 77 IPEVEVRNLKQIRKHLARAVIMNPE---------LKNA-TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPA 146 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (497) ..+..........+..........+ .+.. .............................+........... T Consensus 77 ~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 156 (458) T protein:vir:10 77 SKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQR 156 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhh Confidence 2221111111000000000000000 0000 00000000000000000000000000000000001111111 Q ss_pred hhhh-hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc------c Q lcl|Aclame:pro 147 AIGQ-NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF------S 219 (497) Q Consensus 147 ~~~~-~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~------s 219 (497) .... ...++.+.+|.++|+++...||+.+++.++|+++|++++++++...+|+.++. +.|+||+|++.+|+ + T Consensus 157 ~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~e~~~~~~~~~~~~~ 235 (458) T protein:vir:10 157 HLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDA-GKATWVAASTYGTDTTTGEEV 235 (458) T ss_pred hhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCC-cceeecccccccccccccccc Confidence 1111 22234445666788889999999999999999999999999999999998774 77999999988875 3 Q ss_pred cccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhh Q lcl|Aclame:pro 220 SEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLF 298 (497) Q Consensus 220 ~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~ 298 (497) +++|+++++.+||++++++||+|+|+|+ +++++||.++|+++++.++|.+||+|+|+++|.||++.++.......... T Consensus 236 ~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~- 314 (458) T protein:vir:10 236 KGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEA- 314 (458) T ss_pred cccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecc- Confidence 6789999999999999999999999998 68999999999999999999999999999999999987654322111000 Q ss_pred hHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEE Q lcl|Aclame:pro 299 GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVM 378 (497) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (497) ..........+++..+++.+... +..++.|+| T Consensus 315 -----------------------------------------------~~~~~~~~~~~~i~~~~~~l~~~-~~~~~~~v~ 346 (458) T protein:vir:10 315 -----------------------------------------------KADGSVLVTAKTISKLRRKLGRH-GLKLSKLVL 346 (458) T ss_pred -----------------------------------------------cccccccccHHHHHHHHHhhhhh-hcCCCEEEE Confidence 00011112345556666665544 455678999 Q ss_pred ehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC----cEEEeeccceEEEEEeccccEEEE Q lcl|Aclame:pro 379 NPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----TILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 379 n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~----~~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) ||.+|..|+++||++|+|+|.+..... ...+.+++|||+||+++++||++ .++||||.. +|.+++|.+++|.+ T Consensus 347 ~~~~~~~l~~lkd~~G~~i~~~~~~~~--~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~-~~~~~~~~~~~v~~ 423 (458) T protein:vir:10 347 IVSMDAYYDLLEDEEWQDVAQVGNDSV--KLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKD-NFVMPRQRAVTVER 423 (458) T ss_pred cHHHHHHHHhhcccCCceeeccccccc--cccCcCceecceeeEEccccccccCCcceEEEEecc-cEEEEEeeceEEEe Confidence 999999999999999999998765432 22345568999999999999974 589999987 47899999999987 Q ss_pred eccchhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 455 TNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 455 ~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) +++ +.+|+|+||++.|+|+.|++|+|||++++.++ T Consensus 424 d~~----~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 424 ERQ----AGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ecc----cCCCceEEEEEEEecceEecccceEEEeeccC Confidence 654 46999999999999999999999999998887 No 21 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.7e-62 Score=359.25 Aligned_cols=402 Identities=15% Similarity=0.166 Sum_probs=243.7 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 2 PSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVE 81 (497) Q Consensus 2 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~ 81 (497) =++.+|++++.++.++++++.+... +. +.+ .++..++++.+++++++++.+++..+ T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~-e~------------------~~l-----t~ee~~~~~~l~~ei~~l~~~I~~~e 56 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEV-GG------------------TAL-----SVEQQAEFDQLSSKFSELTAQIERAE 56 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHh-cc------------------CCC-----CHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1344455544444444444332111 00 000 11112223333333333333332222 Q ss_pred HHHHHHHHHH--HHhhh--hhh-----HHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhh-h Q lcl|Aclame:pro 82 VRNLKQIRKH--LARAV--IMN-----PELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQ-N 151 (497) Q Consensus 82 ~~~~~~~~~~--~~~~~--~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 151 (497) .......... ..... ... ..........+...+.................. .............. . T Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 132 (435) T protein:vir:14 57 AAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLAS----KLAIERGFGEEVAMSL 132 (435) T ss_pred HHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHH----HHHHhhhhhhhhhhhc Confidence 1110000000 00000 000 000000000000000000000000000000000 00000000111111 2 Q ss_pred hhcccccCCcccccchhhHHHHHHHhhhhHHhh-ccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 152 PFGSTGTFAPGILPTFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 152 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ..++.+.+|.+||+++..+||+.+++.++|+++ +++++++++.++||+.++. +.++||+|++.+|+++++|+.|++.+ T Consensus 133 ~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~f~~i~~~~ 211 (435) T protein:vir:14 133 NTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGG-AIVGYIGADTDIPTTQQQFDDLKLTA 211 (435) T ss_pred ccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCC-cceeeeccCccccccccceeEEEeee Confidence 223334445566777888999999999999997 7788988888999999874 67999999999999999999999999 Q ss_pred eeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHH Q lcl|Aclame:pro 231 GKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVS 306 (497) Q Consensus 231 ~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~ 306 (497) +|++++++||+|||+|+ ++|++||.++|++++++++|.+|++|+|++ +|.||++.+............ T Consensus 212 ~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~-------- 283 (435) T protein:vir:14 212 KKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDAS-------- 283 (435) T ss_pred EEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceecccccc-------- Confidence 99999999999999997 469999999999999999999999999986 699998765432221111000 Q ss_pred HHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhh-hhccCCceEEEehhHHHH Q lcl|Aclame:pro 307 NVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL-TLFQTPNAVVMNPRDWEL 385 (497) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~n~~~~~~ 385 (497) +......++...+..+.. ..++.+.+|+|||.+|.. T Consensus 284 -------------------------------------------~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~ 320 (435) T protein:vir:14 284 -------------------------------------------TLQKIETDLGKVILALENADANLTQPGWIMAPRTFRF 320 (435) T ss_pred -------------------------------------------chhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHH Confidence 000111122222222222 123446689999999999 Q ss_pred HHHHhcccCcccccccccccccccccccccccccceeecCCCCcC--------cEEEeeccceEEEEEeccccEEEEecc Q lcl|Aclame:pro 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 386 l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) |+++||++|+|+|... ..++|+|+||++++.||.+ .++||||++ |.+++|.+++++++++ T Consensus 321 L~~lkd~~G~~l~~~~----------~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~--~~i~~~~~~~~~~~~~ 388 (435) T protein:vir:14 321 LEGLRDGNGNKVYPEL----------ANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD--VFIGEEETLEIDYSKE 388 (435) T ss_pred HHHhhccCCceeccCC----------CCCeeecceeEeeccccccccCCCccceEEEeeccc--EEEEEecccEEEEecc Confidence 9999999999999532 2348999999999999863 589999998 5588999999999987 Q ss_pred ch---------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 458 NG---------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 458 ~~---------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .. .+|++|+++||+++|+||+|++|+||++|+ .++-|+ T Consensus 389 ~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~--~~~~~~ 435 (435) T protein:vir:14 389 ATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLA--GVAWGA 435 (435) T ss_pred ccccccccchhhhhhcChhheeeeeeeCceeecccceEEEe--cCCCCC Confidence 54 679999999999999999999999999977 333344 No 22 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=2.3e-62 Score=358.56 Aligned_cols=395 Identities=15% Similarity=0.138 Sum_probs=252.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETK--TAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |- +.+|++++.++.++++.+.++... ...|.++.++++..+++ .++.+++ T Consensus 1 M~-l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~---------------------------~l~~~i~ 52 (409) T protein:vir:45 1 MK-LHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELE---------------------------ALDERIA 52 (409) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHH---------------------------HHHHHHH Confidence 65 556666666666655555443211 11122222222222222 2222221 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..+...... ............. ........ .................. ..............+....++... T Consensus 53 ~~e~~~~~~-~~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~a~~~~l~~~~---~~~~~~e~~~~~~~~a~~~~~~~~ 124 (409) T protein:vir:45 53 REEELRRQD-QAYIESNEEEQRQ---NLDPENNS-QQDEKRAQVFDKWMRHGA---SELTSEERKALRELRAQGVAQDEK 124 (409) T ss_pred HHHHHHHHH-HHHHhhhhhhhcc---cCCCCCcc-hhhHHHHHHHHHHHHhhh---hhccHHHHHHHHHHhhccCccCcC Confidence 111100000 0000000000000 00000000 000000000000000000 000000111111222233344445 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeeccccccccccccceeEEeeeeeee-ee Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA-NA 236 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia-~~ 236 (497) +|.+||+++..+|++.+++.++|+++|++++++++. +.+|+..+....+.|++||+.+|+++++|+.+++.++|++ ++ T Consensus 125 gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~ 204 (409) T protein:vir:45 125 GGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKI 204 (409) T ss_pred CceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeee Confidence 566778888999999999999999999999997765 4555555444567899999999999999999999999986 57 Q ss_pred chhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc---cccceeccccccccchhhhhhhHHHHHHHHHhhhh Q lcl|Aclame:pro 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 237 ~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~---~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) ++||+|||+|+ +++++||.++|+++++.++|.+||+|+|++ +|+||++..+........ T Consensus 205 i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~----------------- 267 (409) T protein:vir:45 205 IRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAA----------------- 267 (409) T ss_pred hhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccc----------------- Confidence 89999999998 689999999999999999999999999986 699998865533221110 Q ss_pred hcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCc-eEEEehhHHHHHHHHhc Q lcl|Aclame:pro 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRLTKD 391 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~n~~~~~~l~~lkd 391 (497) .....+++..++..+...+..++. +|+||+.++..|++||| T Consensus 268 --------------------------------------~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd 309 (409) T protein:vir:45 268 --------------------------------------NAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMED 309 (409) T ss_pred --------------------------------------cccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhc Confidence 011234455555655555544443 35789999999999999 Q ss_pred ccCcccccccccccccccccccccccccceeecCCCCc-----CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 392 ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 392 ~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) ++|+|+|++.... +.+.+|||+||+++++||. ..++||||+. |.+.++.+++++++.+. +|.+|+ T Consensus 310 ~~G~~i~~~~~~~------~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~--~~i~~~~~~~~~~~~d~--~~~~~~ 379 (409) T protein:vir:45 310 GQGRPLWLPDIVG------VAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDR--FIIRRVRYMILKRLVER--YAEYDQ 379 (409) T ss_pred CCCceeeccCcCC------CCCceecceeeEEecCcCCccCCccEEEEeehhh--hheeeccceEEEEeecc--cccCCc Confidence 9999999876543 3446899999999999985 2378899997 56788999999988765 588999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) ++||++.|+|+++++|+||+.+++++++.| T Consensus 380 ~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 380 TGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EEEEEEEEeccEeechhheEEEEeccCCCC Confidence 999999999999999999999999999999 No 23 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=1.3e-60 Score=348.94 Aligned_cols=428 Identities=16% Similarity=0.092 Sum_probs=241.5 Q ss_pred CchH--HHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------- Q lcl|Aclame:pro 1 MPST--AQLEAQGRQLAKSIKDI------NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQE-------------- 58 (497) Q Consensus 1 m~~~--~~~~~~~~~l~~~~~~~------~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e-------------- 58 (497) =|+. ++...+++++.++++++ ..++.+++.++....+++.++++.+.+. +..++..+ T Consensus 40 ~~~~~~~~~~~~~~e~~~~~e~l~~~~~~~~~e~~~~~~~~~e~~el~~~~~~l~~~-e~~~~~~e~~~~~~~~~~~~~~ 118 (543) T protein:vir:81 40 APTLTYSQARNRADEVHARMEQIAELDKPTDEENEEFRALGAEFDSLVNHMSRLERA-AELARVRSTHEQIGKPQSGGQR 118 (543) T ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 1111 11111222222222211 1112222222222222222222211100 00000000 Q ss_pred -----------------------------HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhh Q lcl|Aclame:pro 59 -----------------------------MLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFE 109 (497) Q Consensus 59 -----------------------------~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (497) +..+.....++.+.+.+...........+.............+....... T Consensus 119 e~r~e~~a~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~- 197 (543) T protein:vir:81 119 RMRVEAGSSQGGRGDYDRDAILEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATK- 197 (543) T ss_pred HhhhhhhhHHHhhHHHHHhhhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 00000000000000000000000000000000000000000000000000 Q ss_pred hhhhhhhhhhhhh------hhhhHHHHHHH-HHHHHH---hhhhhhhhh-hhhhhcccccCCcccccchhhHHH-HHHHh Q lcl|Aclame:pro 110 KGTKFDVSFNVSA------KAADPGTAAAE-LMGAFA---DGETAPAAI-GQNPFGSTGTFAPGILPTFLPGIV-EQLFY 177 (497) Q Consensus 110 ~~~~~~~~~~~~~------~~~~~~~~~~~-~~~~~~---~~~~~~~~~-~~~~~~~~~~~g~~i~~~~~~~ii-~~~~~ 177 (497) ............. ........... .+.... ......... ......+++++|.+||+++...+| ..+.. T Consensus 198 ~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~ 277 (543) T protein:vir:81 198 IIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGS 277 (543) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhh Confidence 0000000000000 00000000000 000000 000001111 112223444556667778888877 54567 Q ss_pred hhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhHHHHHHHHHHH Q lcl|Aclame:pro 178 ELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPELFNFVQGR 257 (497) Q Consensus 178 ~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~ 257 (497) .++|++++++.+++ +.+.+|+.++ .+.++||+||+.+|.++++|+++++.++|++++++||+|+|+|++++++||.++ T Consensus 278 ~~~l~~~~~~~~~~-g~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~~ 355 (543) T protein:vir:81 278 LNDIRRFARQVVAT-GDVWHGVSSA-AVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDEANVTETVALL 355 (543) T ss_pred hchhhhhcccccCC-cceEEEEecC-CcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhccHHHHHHHHHH Confidence 78899999887664 5688999876 478999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcc Q lcl|Aclame:pro 258 LLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTG 336 (497) Q Consensus 258 la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (497) |+++++.++|.+||+|+|++ +|.||++.....+..... T Consensus 356 l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~----------------------------------------- 394 (543) T protein:vir:81 356 FAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAP----------------------------------------- 394 (543) T ss_pred HHHHHHHHHHHHHhccCCCCcccccchhhcccccccccc----------------------------------------- Confidence 99999999999999999987 799998765432211100 Q ss_pred cccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccc Q lcl|Aclame:pro 337 AAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 337 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l 416 (497) ........+++..++..+.. .+....+|+|||.+|..|+++||++|+|||.+.. .+.+++| T Consensus 395 -----------~~~~~~~~~~~~~~~~~l~~-~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~-------~g~~~~l 455 (543) T protein:vir:81 395 -----------VTAETFALADVYAVYEQLAA-RHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIG-------NGEPSQL 455 (543) T ss_pred -----------cccccccHHHHHHHHHhhhc-cccCCcEEEEcHHHHHHHHHhhcCCCceeccCcC-------CCCCccc Confidence 00111223444455555443 3445568999999999999999999999997643 2345689 Q ss_pred cccceeecCCCCcCc----------EEEeeccceEEEEEeccccEEEEeccch--hhhhcCceEEEEEeeeccEeecccc Q lcl|Aclame:pro 417 WGVPVVTTPLIPLGT----------ILVGHFAPSVIQTARREGVTMQMTNSNG--TDFVDGKVTVRAEERLGLLVYRPSA 484 (497) Q Consensus 417 ~G~pvv~s~~~~~~~----------~~~gd~~~~~~~i~~r~~~~i~~~~~~~--~~f~~~~v~~r~~~r~~~~v~~~~A 484 (497) +|+||+++++||.+. ++||||+. |.|+++.+++|.++++.. ..|.+|+++||++.|+||.|.+|+| T Consensus 456 ~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~--~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A 533 (543) T protein:vir:81 456 LGRPVGEAEAMDANWNTSASADNFVLLYGNFQN--YVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNA 533 (543) T ss_pred cceeeEEeccccccccccccCCcceEEEeeccc--eeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccc Confidence 999999999998653 78999985 678899999999988643 4688999999999999999999999 Q ss_pred eEEEEecCCC Q lcl|Aclame:pro 485 FQLIQLKKGA 494 (497) Q Consensus 485 f~~~~~~~~a 494 (497) |++++++++| T Consensus 534 ~~~l~~~~~a 543 (543) T protein:vir:81 534 FRLLNVETAS 543 (543) T ss_pred eEEEEecccC Confidence 9999999999 No 24 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=6.7e-62 Score=355.97 Aligned_cols=402 Identities=15% Similarity=0.167 Sum_probs=244.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |- +.+|++++..+.++++++.++..+ . +.+. ++..++++++++++++++.++++. T Consensus 1 M~-l~eL~~~r~~~~~~~~~l~~~~~e-~------------------~~l~-----~ee~~~~~~l~~ei~~l~~~i~~~ 55 (435) T protein:vir:80 1 MN-VNELRRERAAVNQRVQALAQIEVG-G------------------TALS-----VEQQAEFDQLSSKFNELTAQIERA 55 (435) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHhc-c------------------CCCC-----HHHHHHHHHHHHHHHHHHHHHHHH Confidence 42 345555555555555444322110 0 0000 011112222222222222222222 Q ss_pred hHHHHHHHHH--HHHhhh-hhhHHH-hh-----hhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|Aclame:pro 81 EVRNLKQIRK--HLARAV-IMNPEL-KN-----ATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQN 151 (497) Q Consensus 81 ~~~~~~~~~~--~~~~~~-~~~~~~-~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (497) +......... ...... ...... .. .....++..+.................. .... ........... T Consensus 56 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~ 131 (435) T protein:vir:80 56 EAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLAS--KLAI--ERGFGEEVAMS 131 (435) T ss_pred HHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHH--HHHH--hhhhhhhhhhh Confidence 1110000000 000000 000000 00 0000000000000000000000000000 0000 00011111111 Q ss_pred hhccc-ccCCcccccchhhHHHHHHHhhhhHHhh-ccceecCCCceEEEEeecCCccceeeccccccccccccceeEEee Q lcl|Aclame:pro 152 PFGST-GTFAPGILPTFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQ 229 (497) Q Consensus 152 ~~~~~-~~~g~~i~~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~ 229 (497) ..+++ +.+|.+||+++..+||+.+++.++|+++ ++++++.++.++||+.++. +.+.||+|++.+|+++++|++|++. T Consensus 132 ~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~f~~i~~~ 210 (435) T protein:vir:80 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGG-AIVGYIGADTDIPTTQQQFDDLKLT 210 (435) T ss_pred hcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCC-cceeeeccCccccccccceeeEEEe Confidence 22333 3344456667788999999999999998 7888998889999999874 7799999999999999999999999 Q ss_pred eeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHH Q lcl|Aclame:pro 230 VGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATV 305 (497) Q Consensus 230 ~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) +||++++++||+|+|+|+ +++++||.++|+++++.++|.+|++|+|++ +|.||++.+............ T Consensus 211 ~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~------- 283 (435) T protein:vir:80 211 AKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGS------- 283 (435) T ss_pred eEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeeccccc------- Confidence 999999999999999997 469999999999999999999999999986 699999876443322111000 Q ss_pred HHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh-hccCCceEEEehhHHH Q lcl|Aclame:pro 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT-LFQTPNAVVMNPRDWE 384 (497) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~n~~~~~ 384 (497) .......++..++..+..+ .+..+.+|+|||.++. T Consensus 284 --------------------------------------------~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~ 319 (435) T protein:vir:80 284 --------------------------------------------TLQKIETDLGKAILALENADANLTQPGWIMAPRTFR 319 (435) T ss_pred --------------------------------------------chhhHHHHHHHHHHHhhccccccccCEEEEcHHHHH Confidence 0001111222222222222 2345678999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC--------cEEEeeccceEEEEEeccccEEEEec Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) .|+++||++|+|+|... ..++|+|+||+++++||.+ .++||||+. |.|++|.+++|++++ T Consensus 320 ~L~~lkd~~G~~l~~~~----------~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~--~~i~~~~~~~i~~~~ 387 (435) T protein:vir:80 320 FLEGLRDGNGNKVYPEL----------ANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGD--VFIGEEETLEIDYSK 387 (435) T ss_pred HHHhhhccCCceeccCC----------CCCeEeeeeeEEeccccccccCCCCcceEEEEEccc--EEEEeecceEEEEec Confidence 99999999999999532 2348999999999999863 589999997 458899999999998 Q ss_pred cch---------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 457 SNG---------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 457 ~~~---------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.. ++|++|+++||++.|+||+|+||+||++|+= +.-|+ T Consensus 388 ~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~--~~~~~ 435 (435) T protein:vir:80 388 EATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSG--VAWGA 435 (435) T ss_pred cccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEec--cCCCC Confidence 864 5799999999999999999999999999873 33333 No 25 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.6e-61 Score=353.92 Aligned_cols=386 Identities=16% Similarity=0.122 Sum_probs=240.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAE-KKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e-~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) ||.....+ ++++.++.+++.......+.+ ..+..++..++++.+.+.++..++..+... .........+.. T Consensus 1 ~~~~m~k~--l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~------~~~~~~~~~~~~ 72 (397) T protein:vir:12 1 MPMQMSKK--EIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDV------PDLPGGVNFVPE 72 (397) T ss_pred CCCcHHHH--HHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHhhhhhh Confidence 77655433 222223333222211111111 112233333333333333332221111110 000000000000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .. ........ ..+............+..... ........+.. ......+....++++++ T Consensus 73 ~~-------~~~~~~~~--~~~~~~~~~~~~~~a~~~~~~-------~~~~~~~~~~~-----~~~~~~~a~~~~~~~~g 131 (397) T protein:vir:12 73 QE-------RNPEGQRS--QGQGNEERQQQYSKAFLKGLR-------GKRLTDEERDL-----LDSPEFRAMSGINDEDG 131 (397) T ss_pred hh-------hhhccccc--ccchhhHHHHHHHHHHHHHHh-------ccCCcHHHHHH-----HhhhhhhhccccccccC Confidence 00 00000000 000000000000000000000 00000000000 00111122223344455 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCc--eEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeee Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANA 236 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~ 236 (497) |.+||+++...||+.+++.++|+++++++++++++ +.+|+.++ .+.++||+||+++|+ +.++|++|++.++|++++ T Consensus 132 g~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~ 210 (397) T protein:vir:12 132 GILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNAD-MVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGI 210 (397) T ss_pred cccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecC-CcceeeecccccccccccccceeEEeeheeeEee Confidence 56677788899999999999999999999998654 45566655 367999999999996 569999999999999999 Q ss_pred chhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 237 LTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|+++|++ ++++||.+.|++++++++|.+|++|+|++.|.|+++.. T Consensus 211 ~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~~~----------------------------- 261 (397) T protein:vir:12 211 MTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDIDGLD----------------------------- 261 (397) T ss_pred ehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHH----------------------------- Confidence 999999999974 89999999999999999999999999999988875421 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc Q lcl|Aclame:pro 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) ++..++.....+.+.....|+|||.+|.+|+++||++|+ T Consensus 262 -----------------------------------------~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~ 300 (397) T protein:vir:12 262 -----------------------------------------GIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGR 300 (397) T ss_pred -----------------------------------------HHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCc Confidence 111111111123344566899999999999999999999 Q ss_pred ccccccccccccccccccccccccceeecCCCCc------CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL------GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~------~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) |+|.+.... +.+++|||+||++++++.. ..++||||+. +|.+++|.+++|+++++.+.+|++|++.| T Consensus 301 ~l~~~~~~~------g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 373 (397) T protein:vir:12 301 YLLQPDPTN------PTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKE-AIVLFDREQQSIASTDTGAGAFETNSTKV 373 (397) T ss_pred eeecccccC------CCCccccceeeEEecccccccCCCccEEEEEehhc-eEEEEeecceEEEEeccccchhhcCceEE Confidence 999875432 3456899999998765432 2389999998 47789999999999999989999999999 Q ss_pred EEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) |++.|+|+.+++|+||+++++++- T Consensus 374 r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 374 RGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEeeccEEecccceEEEEEeeC Confidence 999999999999999999999888 No 26 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1.2e-61 Score=354.62 Aligned_cols=356 Identities=18% Similarity=0.188 Sum_probs=234.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |.+ ++++.+.++..... |++....+ +..++++.+.++++.++.++... T Consensus 1 M~k--~l~~l~e~~~~~~~-----------e~~~~~~~-------------------~~~e~~~~~~~ei~~l~~~i~~~ 48 (371) T protein:vir:81 1 MPK--ELRELLEQINNKKE-----------EARKLLAE-------------------NKIEEAKKLKEEIVALQEKFDVA 48 (371) T ss_pred CcH--HHHHHHHHHHHHHH-----------HHHHHhhH-------------------HHHHHHHHHHHHHHHHHHHHHHH Confidence 442 22222211111111 11110000 00001122222222222222111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +..... ..+........... .. .. .+.+..+....+. ........+++..+| T Consensus 49 ~~~~~~-~~~~~~~~~~~~~~----------~~----~~------------~~~~~~~~~~l~~-~~~~a~~~~t~~~gg 100 (371) T protein:vir:81 49 KELYEE-QKQTIEDKEPLKPT----------VQ----VK------------ENEVEAFVNHIRT-RFRNAMSEGSNQDGG 100 (371) T ss_pred HHHHHH-HHHhhccccccccc----------hh----hH------------HHHHHHHHHHHHH-HHHHhhccCCCccCc Confidence 110000 00000000000000 00 00 0000011110000 011223344555566 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecC-Cccceeecccccccc-ccccceeEEeeeeeeeeech Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA-HNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~ 238 (497) .+||+++..+||+.+++.++|+++++++++++++.++++.... .+.++||+||+.+|+ ++++|++++++++|++++++ T Consensus 101 ~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~ 180 (371) T protein:vir:81 101 YTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFR 180 (371) T ss_pred eeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeeh Confidence 6778888999999999999999999999998876665544322 357899999999986 67999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|+|+|+ +++++||.+.|++++++++|.+|++|+|++.|.|+.+... T Consensus 181 iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~~~------------------------------ 230 (371) T protein:vir:81 181 VTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADLDG------------------------------ 230 (371) T ss_pred hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccHHH------------------------------ Confidence 999999997 6899999999999999999999999999988877643210 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) +...+.....+.+.....|+|||.+|..|+++||++|+|+ T Consensus 231 ----------------------------------------i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l 270 (371) T protein:vir:81 231 ----------------------------------------LKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYL 270 (371) T ss_pred ----------------------------------------HHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCee Confidence 0111111112233445689999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcC------------cEEEeeccceEEEEEeccccEEEEeccchhhhhcC Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG 465 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~ 465 (497) |.+... .+.+++|+|+||++++++|.+ .++||||+. +|.+++|.+++|+++++..++|++| T Consensus 271 ~~~~~~------~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~ 343 (371) T protein:vir:81 271 LQPSIS------SPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKE-AVVMFDRQRTEIMSSNVAMDAFETD 343 (371) T ss_pred eecccC------CCCCceecceeEEEecccccCccccccccCCcceEEEEehhc-eEEEEeecceEEEEeccccchhhcC Confidence 987643 234579999999999998743 489999998 4789999999999999998999999 Q ss_pred ceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 466 KVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 466 ~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ++.||++.|+|+.+++|+||++++++++ T Consensus 344 ~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 344 ATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ceEEEEEEeeccEEecccceEEEEEecC Confidence 9999999999999999999999999999 No 27 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.2e-60 Score=349.12 Aligned_cols=433 Identities=15% Similarity=0.109 Sum_probs=263.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH Q lcl|Aclame:pro 7 LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLK 86 (497) Q Consensus 7 ~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 86 (497) ++++.++|...++++.+++.....+ ++.+.++.++..+.....+...+..+++++++++++.++..+.+.+..... T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e----~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~ 76 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAE----RQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESE 76 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444444444555544433333333 333334433333322223333444555555555555544333322211110 Q ss_pred HHHH---HHHhh-----hhhhHHHhhhhhhhhhhhhhhhhhhhhh--hhh--HHHHHHHH--HHHHHh--hhhhhhhhhh Q lcl|Aclame:pro 87 QIRK---HLARA-----VIMNPELKNATSFEKGTKFDVSFNVSAK--AAD--PGTAAAEL--MGAFAD--GETAPAAIGQ 150 (497) Q Consensus 87 ~~~~---~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--~~~~~~~~--~~~~~~--~~~~~~~~~~ 150 (497) ..+. ..... ............. .............. ... ........ ...... .......... T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (477) T protein:vir:84 77 IERSGKLEAETKTVRKATVEVNEALTYEKG-NGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYR 155 (477) T ss_pred HHHhhcchhhhhhhcccccccccchhhhhh-HHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhc Confidence 0000 00000 0000000000000 00000000000000 000 00000000 000000 0001111223 Q ss_pred hhhcccccCCcccccchh-hHHHHHHHhhhhHHhhccceecCC--CceEEEEeecCCccceeecccc-----cccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRPVTS--PNLSYLTESAAHNNAAAVAEAG-----TYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~-~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~p~~~~~~~~a~~v~Eg~-----~~~~s~~~ 222 (497) ...++++.+|+++||+++ ..||+.+++.++|+++++++++++ +++.||+.+++...++|++||+ .+|.++++ T Consensus 156 ~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~ 235 (477) T protein:vir:84 156 DLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLT 235 (477) T ss_pred cccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccc Confidence 334455566778888875 569999999999999999998865 4689999877666788999985 45788999 Q ss_pred ceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhH Q lcl|Aclame:pro 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 223 ~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) |+.+++++||++++++||+|||+|+ +++++||.++|+++++.++|.+||+|+|++ +|.||++.++......+... T Consensus 236 f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~--- 312 (477) T protein:vir:84 236 DGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAG--- 312 (477) T ss_pred eeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccc--- Confidence 9999999999999999999999997 699999999999999999999999999975 79999987654322211100 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ..........+.+..+.......++.++.+|+||| T Consensus 313 ---------------------------------------------~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~ 347 (477) T protein:vir:84 313 ---------------------------------------------SALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHP 347 (477) T ss_pred ---------------------------------------------cchhhHHHHHHHHHHHHhhccccccCCccEEEEcH Confidence 00011122344556666666667777788999999 Q ss_pred hHHHHHHHHhcccCccccccccccccc-------ccccccccccccceeecCCCCcC--------cEEEeeccceEEEEE Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYG-------NPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTA 445 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~-------~~~~~~~~l~G~pvv~s~~~~~~--------~~~~gd~~~~~~~i~ 445 (497) .+|..|+++||++|+|+|.+....... ......++|||+||+.+++||++ .++||||+. |.++ T Consensus 348 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~--~~i~ 425 (477) T protein:vir:84 348 RRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASD--LALF 425 (477) T ss_pred HHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccccCCcceEEEEEece--EEEE Confidence 999999999999999999876432211 12234568999999999999964 479999986 4455 Q ss_pred eccccEEEEeccchhhhhcCceEEEEEeeecc-EeecccceEEEEecCCCCCC Q lcl|Aclame:pro 446 RREGVTMQMTNSNGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 446 ~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~-~v~~~~Af~~~~~~~~a~~~ 497 (497) . .+++++++++.. +.++++.|++..++++ .+++|+||+.+|.++...-+ T Consensus 426 ~-~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~ 475 (477) T protein:vir:84 426 E-SSVRMRALQETR--AENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPT 475 (477) T ss_pred e-eceeEEeccccc--cccceeeeeehhhhhhhhhccccceEEeecccccccc Confidence 4 578888887764 5578899999888886 45569999999987776666 No 28 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=4.9e-61 Score=351.20 Aligned_cols=378 Identities=15% Similarity=0.161 Sum_probs=252.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++++++.++.++++.+.+.......+ .....++++.+ ..+++.+.++.+.+...+.+. T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~----~~~~~ee~~~~-------------~~~i~~~~~~~e~~~~~~~~~ 63 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLD----DSVSAEELQAI-------------KNERDTAKMKRDMFKEQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhh----hhcCHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHH Confidence 9999988888887777776654432211111 11111112211 122222222222222111111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ........... .......... ..............+... .........+++..+| T Consensus 64 ~~~~~~~~~~~---------~~~~~~~~~~----------~~~~~~~~~~~~~l~~~~------~~~~~~~~~~t~~~gg 118 (397) T protein:vir:49 64 RANEVANMSEE---------EKKPLTKSEE----------EVKAGFVKDFKNLVRGRY------QNLLDSKTDASGSDAG 118 (397) T ss_pred HHHhhhccccc---------cccccccchh----------HHHHHHHHHHHHHHhcch------hHHHHHhhccccccCc Confidence 10000000000 0000000000 000000000000000000 0111122233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCc--eEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~ 237 (497) .+||+++...|++.+++.++|+++|+++++++.+ +.||+.....+.++||+||+.+|+ ++++|+++++++||+++++ T Consensus 119 ~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~ 198 (397) T protein:vir:49 119 LTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGIS 198 (397) T ss_pred ccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeee Confidence 5677788899999999999999999999987654 556666665567999999999996 6899999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.++|++++++++|.+|++|+|++.+.+... T Consensus 199 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~~~-------------------------------- 246 (397) T protein:vir:49 199 TVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKPTLT-------------------------------- 246 (397) T ss_pred hhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc-------------------------------- Confidence 9999999998 6899999999999999999999999999876433110 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ..+++..++..+... +...+.|+|||.+|..|+++||++|+| T Consensus 247 -------------------------------------~~d~i~~~~~~l~~~-~~~~a~~vmn~~~~~~l~~lkd~~G~~ 288 (397) T protein:vir:49 247 -------------------------------------KWDDIIDLEAKVDPA-IKQTSFFLTNTSGFTALKKVKNALGDY 288 (397) T ss_pred -------------------------------------cHHHHHHHHHhhhhh-hcCCCEEEEcHHHHHHHHHhhcCCCce Confidence 123344444444444 345678999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecC--CCCc-----CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~--~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) +|.+.... +.+++|+|+||++++ .+|. ..++||||+. +|.+++|.+++++++++.+++|++|+++| T Consensus 289 l~~~~~~~------~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 289 LMERDVKS------PTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQ-AVTLFDRQHMSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred eeccCcCC------CCCceecceeeEEecccccccccCCceeEEEeeccc-eEEEEeecceEEEEeccccchhhcCceeE Confidence 99875432 345699999998854 3443 3489999998 57899999999999999888999999999 Q ss_pred EEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |++.|+|+.+++|+||+++++++++++. T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 362 RVIDRFDVVATDTEAFVPASFKAIADQK 389 (397) T ss_pred EEEeeeCcEEecccceEEEEeecccCCC Confidence 9999999999999999999999998887 No 29 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=3.9e-60 Score=346.26 Aligned_cols=443 Identities=13% Similarity=0.077 Sum_probs=244.5 Q ss_pred chHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 2 PSTAQLE--AQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 2 ~~~~~~~--~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) =++-||+ ++.+++..+++++.++... +....+.+.+..++.+...+.....+..+.+.++..++.+++..++.++.+ T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~-l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~ 79 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKA-LQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKE 79 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222 2233333333332221111 111011111111111100000000011122222222222222223222222 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhh-hhhhhhhhhhhhhhhhhhhh-hhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKN-ATSFEKGTKFDVSFNVSAKA-ADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) .+.+....... ............. ......+... ........ ....+.....+....... ...........+. T Consensus 80 le~el~e~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 154 (466) T protein:vir:80 80 LENELEQLNNK-EPKNNSEPAQVSGARTQQFVGGET--RMKGFFRNMPYEQRAALIARSEVKEFL--AQVRTLAQQKRAV 154 (466) T ss_pred HHHHHHHHHHh-hhccCchhHHHHhhhhhHHhhHHH--HHHHHHHhhhhhhHHHHHHHHHHHHHH--HHHHHHhhhhhhh Confidence 22111000000 0000000000000 0000000000 00000000 000000000000000000 0000111111222 Q ss_pred cCCc-ccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeee Q lcl|Aclame:pro 158 TFAP-GILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANA 236 (497) Q Consensus 158 ~~g~-~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~ 236 (497) ++++ ++|.++...|++.+++.++|++++++.++++. .++|+.+. .+.+.|++|++.+|+++|+|++|++.+||++++ T Consensus 155 ~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~-~~~~~~~~-~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~ 232 (466) T protein:vir:80 155 SGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGT-ARQNIAGA-IPEGVWTEAVANLNELSLSFSQIEVDGYKVGGF 232 (466) T ss_pred ccccccccHHHHHHHHHhhhhhhhhhhheeeeecCce-eEeeeecC-Ccceeecccccccccccccccceeecceeeeee Confidence 3344 45556777899999999999999999998764 68888765 467899999999999999999999999999999 Q ss_pred chhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|||+|+ +++++||..+|+++++.++|.+||+|+|+++|.||++..+..+.................... T Consensus 233 ~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 307 (466) T protein:vir:80 233 IPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNL----- 307 (466) T ss_pred hhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhh----- Confidence 99999999998 589999999999999999999999999999999999876554433322111111000000000 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHh---cc Q lcl|Aclame:pro 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK---DA 392 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk---d~ 392 (497) .. ...........+.............+..+...|+||+.++..|..++ +. T Consensus 308 ------------------~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~ 361 (466) T protein:vir:80 308 ------------------LK--------IDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNS 361 (466) T ss_pred ------------------hh--------hhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccC Confidence 00 00000001111222222333333444445557999999999999888 67 Q ss_pred cCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 393 ~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) +|.|++.+. ....|+|+||+++++||++.+++|||+. |.+++|.+++|.++++. .|.+|++.||++ T Consensus 362 ~g~~~~~~~----------~~~~i~G~pvv~s~~~~~~~~~~g~~~~--y~i~~r~~~~i~~~~~~--~f~~d~~~~r~~ 427 (466) T protein:vir:80 362 AGALVASLN----------NTMPIVGGDIVILDFIPDNDIIGGYGSL--YLLAERADIKLAQSEHV--RFIEDQTVFKGT 427 (466) T ss_pred CccccccCC----------CcccccccceeecCccCccceeeecccc--EEEEeecceEEEechhh--hhhcCcEEEEEE Confidence 788776542 1235999999999999999999999996 77999999999998765 599999999999 Q ss_pred eeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +|+||+|++|+||++++++...... T Consensus 428 ~r~dg~~~~~~afv~~~~~~~~~~~ 452 (466) T protein:vir:80 428 ARYDGKPVFGEGFVAVNIANANPTT 452 (466) T ss_pred EEEccEEeccCceEEEEecCCCccc Confidence 9999999999999999998876655 No 30 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=5.5e-61 Score=350.95 Aligned_cols=390 Identities=15% Similarity=0.138 Sum_probs=244.8 Q ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPS-TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |.+ +.++++++.++.++++.+..+..... ++++ .+.++++.++++++..... .+ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~----ee~~--------------------~~~~e~~~l~~~i~~~~~~-~~ 55 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTA----EELN--------------------KTSNEIDILQAKIEAQKRK-EN 55 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCH----HHHH--------------------HHHHHHHHHHHHHHHHHHH-HH Confidence 554 22333333322222222111100000 0111 1111122222211111000 00 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhh-hhhhhhhhhhhhccccc Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADG-ETAPAAIGQNPFGSTGT 158 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 158 (497) .+........+. .................. ... ....+...... ............+++++ T Consensus 56 ~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~-------~~~----~~~~~~~~~~~~~~~~~e~~a~~~~~~~~ 117 (404) T protein:vir:10 56 IENNFNEDNVKS-------LNTGKEENVIYNGALFVR-------AIA----DNLLKQKNQRGLNLSEKEINAISENIDED 117 (404) T ss_pred HHHHHhhhhccc-------cccccchhhHHHHHHHHH-------HHH----HHHHHHHHhhhhcchhhHHhhhccccCCC Confidence 000000000000 000000000000000000 000 00000000000 01111112222334445 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCC--ceEEEEeecCCccceeeccccccccc--cccceeEEeeeeeee Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP--NLSYLTESAAHNNAAAVAEAGTYPFS--SEEFARVYEQVGKVA 234 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~s--~~~~~~v~~~~~kia 234 (497) +|.+||+++...|++.+++.++|++++++.++++. .+.||+.++ .+.++|++|++.+|.+ +++|++++++++|++ T Consensus 118 gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~-~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~ 196 (404) T protein:vir:10 118 GGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSK-QKPMKPLSENQQIPTNGDNGKLERFNFKLKDLA 196 (404) T ss_pred CceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecC-CcceeeccccccccccccccceeeeEeeheeeE Confidence 55567778889999999999999999999998754 567787766 4679999999999875 589999999999999 Q ss_pred eechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhh Q lcl|Aclame:pro 235 NALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 235 ~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) ++++||+|||+|+ +++++||.++|++++++++|.+||+|+|+++ |.||++..+..+.+.... T Consensus 197 ~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~---------------- 260 (404) T protein:vir:10 197 DFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKS---------------- 260 (404) T ss_pred eeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeecccc---------------- Confidence 9999999999997 5899999999999999999999999999875 788887655443322211 Q ss_pred hcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcc Q lcl|Aclame:pro 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDA 392 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~ 392 (497) ...+++..++.....+.+.....|+|||.+|..|+++||+ T Consensus 261 ----------------------------------------~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~ 300 (404) T protein:vir:10 261 ----------------------------------------PALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDK 300 (404) T ss_pred ----------------------------------------ccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhcc Confidence 1122333333333344555566899999999999999999 Q ss_pred cCcccccccccccccccccccccccccceeec-CCCCcC-----cEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLIPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 393 ~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s-~~~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) +|+|+|.+... .+.+++|||+||++. +.++.+ .++||||+. +|.+++|.+++|+++++.+.+|++|+ T Consensus 301 ~G~~l~~~~~~------~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~i~~~~~~~~~~~~~~ 373 (404) T protein:vir:10 301 TGRPYLQPDPK------DPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKE-AYKYVSDGAYELATTNIGAGAFETNT 373 (404) T ss_pred CCceeeccCcC------CCCCccccceeeEEecccccCCCCCccEEEEEeccc-cEEEEEecceEEEEeccccchhhcCc Confidence 99999987543 234568999999864 445443 389999998 47899999999999999888999999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.||++.|+|+.|.+|+||++++++++|..+ T Consensus 374 ~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 374 TKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred eEEEEEEeeccEEecccceEEEEeecccCCC Confidence 9999999999999999999999999999988 No 31 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=3e-60 Score=346.89 Aligned_cols=391 Identities=15% Similarity=0.121 Sum_probs=248.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH Q lcl|Aclame:pro 5 AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRN 84 (497) Q Consensus 5 ~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~ 84 (497) +-++++++++.++++++.+.......++++.+++ .+.. +...+.+++.++++++..++++++.++...+... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~----~~~~----~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~ 72 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEG----EDSE----ENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAAL 72 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4555555555555555433333222333322222 1110 1112334455556666666655554443332211 Q ss_pred HHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHH-HHHHHHHhhhhhhhhhhhhhhc-ccccCCcc Q lcl|Aclame:pro 85 LKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA-ELMGAFADGETAPAAIGQNPFG-STGTFAPG 162 (497) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~ 162 (497) ... ......... ......... .................... .....................+ ++..+|.+ T Consensus 73 ~~~-~~~~~~~~~-~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~ 145 (400) T protein:vir:38 73 KGN-EQSSGKKPD-HPEEHSYRD-----ALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAAST 145 (400) T ss_pred HHH-hhccccccc-chhhhhHHH-----HHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCccc Confidence 111 000000000 000000000 00000000000000000000 0000000000011111112222 33444556 Q ss_pred cccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeeechhhH Q lcl|Aclame:pro 163 ILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTITD 241 (497) Q Consensus 163 i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~iS~ 241 (497) ||+++...|++.+++.++|++++++++++++++++|+.+..++.+.|++|++..|. ++++|++|++.+||++++++||+ T Consensus 146 vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ 225 (400) T protein:vir:38 146 IPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQ 225 (400) T ss_pred ccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHH Confidence 77788999999999999999999999999999999998876678999999999986 68999999999999999999999 Q ss_pred HHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhh Q lcl|Aclame:pro 242 EGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) Q Consensus 242 ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) |||+|+ +++++||.++|+++++.++|.+|++|+|.+.+.|+.+.. T Consensus 226 ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~---------------------------------- 271 (400) T protein:vir:38 226 ESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTISSVD---------------------------------- 271 (400) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccHH---------------------------------- Confidence 999997 689999999999999999999999999887665543210 Q ss_pred hhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccc Q lcl|Aclame:pro 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~ 400 (497) ++...+.....+. ...+|+|||.+|..|+++||++|+|||.+ T Consensus 272 ------------------------------------~~~~~~~~~~~~~--~~a~~v~~~~~~~~l~~lkd~~G~~i~~~ 313 (400) T protein:vir:38 272 ------------------------------------DLKHINNVDLDPA--YSRVIIASQSFYNFLDTVKDGNGRYLLQD 313 (400) T ss_pred ------------------------------------HHHHHHHhhhhhh--hCcEEEEcHHHHHHHHHhhccCCCeeeec Confidence 0111111111111 13579999999999999999999999987 Q ss_pred cccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeee Q lcl|Aclame:pro 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERL 475 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~ 475 (497) .... +.+++|+|+||++++++|.+. ++||||++ +|.+++|.++++.++++.. | ..+||+.+|+ T Consensus 314 ~~~~------~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~-~~~~~~~~~~~~~~~~~~~--~---~~~~~~~~r~ 381 (400) T protein:vir:38 314 SILT------PSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKR-AILFANRADFMVRWVDDQI--Y---GQFLQAGMRF 381 (400) T ss_pred CcCC------CCccccccceeEEecccccCCCCceEEEEEeccc-cEEEEeecceEEEEecccc--c---ceeEEEEEEe Confidence 5432 345689999999999988542 79999998 4889999999999987653 3 4589999999 Q ss_pred ccEeecccceEEEEecCCC Q lcl|Aclame:pro 476 GLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 476 ~~~v~~~~Af~~~~~~~~a 494 (497) |+.|.+|+||++|+++++| T Consensus 382 d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 382 GVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ccEEecccceEEEEeecCC Confidence 9999999999999999999 No 32 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1.6e-60 Score=348.39 Aligned_cols=383 Identities=14% Similarity=0.124 Sum_probs=247.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-.+.++++++.++.++++++.+.....+.+.. ...+ +..++..+++.+.++++.+..++.+. T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~e-------------e~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDN----FSAE-------------AMSELKNKRDNEKVRRDALREQLVEA 66 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 224556666666665555544332211111110 0011 11122222333333333332222221 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +............. . ......... ..... ......+. ...............++.+.+| T Consensus 67 ~~~~~~~~~~~~~~-~-~~~~~~~~~-----~~~~~------------~~~~~~~~--~~~~~~~~~~~a~~~~t~~~gg 125 (408) T protein:vir:10 67 QAEQVVNMREEEKG-P-LNKSENELK-----DKFVK------------DFVNMVRN--PMAFMNTVSSKTETSGSDSAAG 125 (408) T ss_pred HHHHHhcccccccc-c-cccchhhhH-----HHHHH------------HHHHHhhc--chhhhhhhhhhhhhcccccCCc Confidence 11100000000000 0 000000000 00000 00000000 0000111122233344455556 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEE--EeecCCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYL--TESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p--~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) .+||+++...||+.+++.++|+++++++++++++..+| +..+..+.+.|++||+.+|++ .++|++|++++||+++++ T Consensus 126 ~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~ 205 (408) T protein:vir:10 126 LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGII 205 (408) T ss_pred eeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeee Confidence 66777888999999999999999999999987765555 444545678999999999975 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.++|+++++.++|.+|++|+|++.+.+-. T Consensus 206 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~~~--------------------------------- 252 (408) T protein:vir:10 206 TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI--------------------------------- 252 (408) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc--------------------------------- Confidence 9999999997 589999999999999999999999999976543110 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ...+++..++.....+.+.....|+|||.+|..|+++||++|+| T Consensus 253 ------------------------------------~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~ 296 (408) T protein:vir:10 253 ------------------------------------AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKY 296 (408) T ss_pred ------------------------------------ccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCce Confidence 01222333333323344455668999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecC--CCCcC-----cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~--~~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) +|++.... +.+++|+|+||++++ .+|+. .++||||+. +|.+++|.+++|+++++.+..|++|++.| T Consensus 297 i~~~~~~~------~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~v~~~~~~~~~f~~~~~~~ 369 (408) T protein:vir:10 297 LLEPDPTK------PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTTKI 369 (408) T ss_pred EeccCcCC------CCCceecceeeEEecccccCccCCCceEEEEEehhc-cEEEEEecceEEEEcccccchhhcCceEE Confidence 99876533 345699999999965 45543 289999998 47899999999999999988999999999 Q ss_pred EEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |++.|+|+.|++|+||+++++++++..+ T Consensus 370 r~~~r~d~~v~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:10 370 RVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) T ss_pred EEEEeeccEEeccccEEEEEeeccccCC Confidence 9999999999999999999999987666 No 33 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=3.5e-60 Score=346.53 Aligned_cols=400 Identities=12% Similarity=0.097 Sum_probs=255.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-+.-++++++.++.+++........+.+.+.. .++.+++.+++++++++++++...+.+. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~-------------------~e~~~~~~~ev~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE-------------------LEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhh-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777777777666665544433221111111100 0111122222233333332222222111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... .......+.+......... ...............+..+.......... .....+++.++ T Consensus 62 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~t~~g~ 130 (415) T protein:vir:47 62 KEKDRTSENN---QQSVEVNEARTYRNQANIN-------DLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSGF 130 (415) T ss_pred HHHHHhhhhc---ccccccchhhhhHHHHHHH-------HHHHhhhhhhhhHHHHHHHHHHHhhhhhh-hhccccccCCc Confidence 1110000000 0000000000000000000 00000000111111112222222111111 11223344455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeec-CCccceeecccccccc-ccccceeEEeeeeeeeeech Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~ 238 (497) .+||+++.+.|++.+++.++|++++++++++++..+||+... ....++||+||+.+|+ +.++|+.|++.+||++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:47 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeeh Confidence 677778889999999999999999999999988888876543 2357899999999997 57999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|||+|+ .++++||.++|++++++++|.+|++|+|++.+.++.......... T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~------------------------- 265 (415) T protein:vir:47 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK------------------------- 265 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccce------------------------- Confidence 999999998 579999999999999999999999999998766654332111100 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) .........+++..++..+..+++ .++.|+|||.+|..|+++||++|+|| T Consensus 266 -----------------------------~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~i 315 (415) T protein:vir:47 266 -----------------------------LEVKKAKSLDDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) T ss_pred -----------------------------eccccccchHHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCCee Confidence 001111234555666666655544 57789999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |.+.... ..+++|||+||++++++|.+. ++||||++ +|.+++|.++++++++ |.++++.+|++ T Consensus 316 ~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~~ 383 (415) T protein:vir:47 316 IQPDVKE------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMIA 383 (415) T ss_pred eccCcCC------CCCccccceeeEEeccccccCCCccEEEEEehhc-cEEEEeecceEEEeec-----cccCceEEEEE Confidence 9875432 345689999999999998643 89999998 4778999999998875 56778899999 Q ss_pred eeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +|+|++|++|+||+++++++++.+. T Consensus 384 ~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:47 384 VRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEeccEEeccccEEEEEeeccCCCC Confidence 9999999999999999999999998 No 34 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=3.5e-60 Score=346.53 Aligned_cols=400 Identities=12% Similarity=0.097 Sum_probs=255.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |-+.-++++++.++.+++........+.+.+.. .++.+++.+++++++++++++...+.+. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~-------------------~e~~~~~~~ev~~l~~~i~~~~~~~~~~ 61 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDE-------------------LEKAEKLEQEITDLRSQIQEKQEELDKL 61 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhh-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777777777666665544433221111111100 0111122222233333332222222111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... .......+.+......... ...............+..+.......... .....+++.++ T Consensus 62 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~t~~g~ 130 (415) T protein:vir:46 62 KEKDRTSENN---QQSVEVNEARTYRNQANIN-------DLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSGF 130 (415) T ss_pred HHHHHhhhhc---ccccccchhhhhHHHHHHH-------HHHHhhhhhhhhHHHHHHHHHHHhhhhhh-hhccccccCCc Confidence 1110000000 0000000000000000000 00000000111111112222222111111 11223344455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeec-CCccceeecccccccc-ccccceeEEeeeeeeeeech Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~ 238 (497) .+||+++.+.|++.+++.++|++++++++++++..+||+... ....++||+||+.+|+ +.++|+.|++.+||++++++ T Consensus 131 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~ 210 (415) T protein:vir:46 131 VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR 210 (415) T ss_pred ccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeeh Confidence 677778889999999999999999999999988888876543 2357899999999997 57999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|||+|+ .++++||.++|++++++++|.+|++|+|++.+.++.......... T Consensus 211 iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~------------------------- 265 (415) T protein:vir:46 211 ISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK------------------------- 265 (415) T ss_pred hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccce------------------------- Confidence 999999998 579999999999999999999999999998766654332111100 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) .........+++..++..+..+++ .++.|+|||.+|..|+++||++|+|| T Consensus 266 -----------------------------~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~i 315 (415) T protein:vir:46 266 -----------------------------LEVKKAKSLDDIKDAINLNVKPNY-EHNVAIVSQTMFAKLDKMKDKLGNYL 315 (415) T ss_pred -----------------------------eccccccchHHHHHHHHhhhhhcc-CCCEEEEcHHHHHHHHHhhccCCCee Confidence 001111234555666666655544 57789999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |.+.... ..+++|||+||++++++|.+. ++||||++ +|.+++|.++++++++ |.++++.+|++ T Consensus 316 ~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~~ 383 (415) T protein:vir:46 316 IQPDVKE------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMIA 383 (415) T ss_pred eccCcCC------CCCccccceeeEEeccccccCCCccEEEEEehhc-cEEEEeecceEEEeec-----cccCceEEEEE Confidence 9875432 345689999999999998643 89999998 4778999999998875 56778899999 Q ss_pred eeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +|+|++|++|+||+++++++++.+. T Consensus 384 ~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:46 384 VRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEeccEEeccccEEEEEeeccCCCC Confidence 9999999999999999999999998 No 35 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=2.1e-60 Score=347.75 Aligned_cols=378 Identities=16% Similarity=0.153 Sum_probs=252.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+-+|++++.++.+++.++.+... +.+...+...++++.+ ..+++.+.++.+.+...+.+. T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~----~~~~~~~~~~ee~~~l-------------~~ei~~~~~~~~~~~~~~~~~ 63 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLN----VAMLDDSVSAEELQAI-------------KNERDTAKMKRDLFKEQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH----HHHhcchhhHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHH Confidence 99999998888887777766543222 1111111111222222 222222222222222111111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .......... ......... .... ............++. ..........++++.+| T Consensus 64 ~~~~~~~~~~---------~~~~~~~~~------~~~~----~~~~~~~~~~~l~~~------~~~~~~~~~~~t~~~gg 118 (397) T protein:vir:49 64 RANEVANMSE---------EEKKPLTKN------EEEV----KANFVKDFKNLVRGR------YQNLLDSKTDGSGSDAG 118 (397) T ss_pred HHhhhhcccc---------cccccccch------hhHH----HHHHHHHHHHHhhcc------hhhHHHhhhccCCccCc Confidence 1000000000 000000000 0000 000000000000000 01111223334445556 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCce--EEEEeecCCccceeecccccccccc-ccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL--SYLTESAAHNNAAAVAEAGTYPFSS-EEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~s~-~~~~~v~~~~~kia~~~ 237 (497) .+||+++...|++.+++.++|+++++++++++++. .||+.....+.+.||+||+.+|+++ ++|++|++.++|+++++ T Consensus 119 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 198 (397) T protein:vir:49 119 LTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGIS 198 (397) T ss_pred ceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeeh Confidence 66777888999999999999999999999987654 5566655556789999999999875 89999999999999999 Q ss_pred hhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+|+|+. ++++||.++|+++++.++|.+||+|+|++.+.+... T Consensus 199 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~~~~~-------------------------------- 246 (397) T protein:vir:49 199 TVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNKPTLA-------------------------------- 246 (397) T ss_pred hhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-------------------------------- Confidence 99999999985 799999999999999999999999999876432110 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ..+++..++..+.. .+..+..|+|||.+|..|++|||++|+| T Consensus 247 -------------------------------------~~d~i~~~~~~l~~-~~~~~a~~v~n~~~~~~l~~lkd~~g~~ 288 (397) T protein:vir:49 247 -------------------------------------KWDDIIDLQAKVDP-AIKQTSLFLTNTSGFTALKKVKNAMGDY 288 (397) T ss_pred -------------------------------------CHHHHHHHHHhhhh-hhcCCCEEEEcHHHHHHHHHhhccCCce Confidence 12233444444443 3456779999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecC--CCCc-----CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~--~~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) +|.+.... +.+.+|+|+||++++ .+|. ..++||||+. +|.+++|.+++|+++++.+++|++|+++| T Consensus 289 l~~~~~~~------g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 289 LMERDVKS------PTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQ-AVTLFDRQHLSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred eecccccC------CCCceecceeeEEecccccccccCCceeEEEeeccc-eEEEEeecccEEEEeccccchhhcCeeeE Confidence 99875432 345689999998854 4453 3489999998 47899999999999999988999999999 Q ss_pred EEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |++.|+|+.+++|+||++++++++++.. T Consensus 362 ~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 362 RVIDRFDVVSTDTEAFVPASFKAIADQK 389 (397) T ss_pred EEEEeeccEEecccceEEEEeccccccc Confidence 9999999999999999999999988866 No 36 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=6.5e-60 Score=345.06 Aligned_cols=399 Identities=12% Similarity=0.076 Sum_probs=253.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEK-KEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |.++.++++++.++.+++....+...+.+.+. .+..+ ++..++++++++++.++..+.+ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~--------------------~~~~e~~~l~~~i~~~~~~~~~ 60 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAE--------------------KLEQEITDLRSQIQEKQEELDK 60 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888877766655433222211111 01111 1122222222222222221111 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+........ ... .....+.+......... ...............+..+.......... .....+++.+ T Consensus 61 ~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g 129 (415) T protein:vir:98 61 LKEKDGTSEN--NQQ-SVEVNEARTYRNQANIN-------DLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSG 129 (415) T ss_pred HHHHHhhhhh--ccc-ccccchhhhHHHHHHHH-------HHhhhhhhhhhHHHHHHHHHHHHhhhhhh-hhcccccccc Confidence 1110000000 000 00000000000000000 00000000000111111122111111111 1122233344 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeec-CCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) |.++|+++...|++.+++.++|++++++++|++++.++|+... ....++|++|++.+|+. .++|+++++.++|+++++ T Consensus 130 g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:98 130 FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 4566667889999999999999999999999887766665432 23578999999999964 689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ +++++||.++|+++++.++|.+|++|+|++.+.+............ T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:98 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999997 5799999999999999999999999999887665543221111000 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ........+++..++..+..++ +.++.|+|||.+|..|+++||++|+| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:98 267 -------------------------------EVKKAKSLDDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchhHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCce Confidence 0011123455556666655444 45778999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+.... +.+.+|+|+||++++++|.+. ++||||++ +|.+++|.++++++++ |.++++.+|+ T Consensus 315 l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~ 382 (415) T protein:vir:98 315 LIQPDVKE------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCC------CCCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 99875432 345689999999999998654 89999998 4779999999998875 4567789999 Q ss_pred EeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++|+|++|+||+||+++++++++.++ T Consensus 383 ~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:98 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 37 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=6.5e-60 Score=345.06 Aligned_cols=399 Identities=12% Similarity=0.076 Sum_probs=253.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEK-KEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |.++.++++++.++.+++....+...+.+.+. .+..+ ++..++++++++++.++..+.+ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~--------------------~~~~e~~~l~~~i~~~~~~~~~ 60 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAE--------------------KLEQEITDLRSQIQEKQEELDK 60 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888877766655433222211111 01111 1122222222222222221111 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+........ ... .....+.+......... ...............+..+.......... .....+++.+ T Consensus 61 ~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g 129 (415) T protein:vir:79 61 LKEKDGTSEN--NQQ-SVEVNEARTYRNQANIN-------DLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSG 129 (415) T ss_pred HHHHHhhhhh--ccc-ccccchhhhHHHHHHHH-------HHhhhhhhhhhHHHHHHHHHHHHhhhhhh-hhcccccccc Confidence 1110000000 000 00000000000000000 00000000000111111122111111111 1122233344 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeec-CCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) |.++|+++...|++.+++.++|++++++++|++++.++|+... ....++|++|++.+|+. .++|+++++.++|+++++ T Consensus 130 g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:79 130 FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 4566667889999999999999999999999887766665432 23578999999999964 689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ +++++||.++|+++++.++|.+|++|+|++.+.+............ T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:79 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999997 5799999999999999999999999999887665543221111000 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ........+++..++..+..++ +.++.|+|||.+|..|+++||++|+| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:79 267 -------------------------------EVKKAKSLDDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchhHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCce Confidence 0011123455556666655444 45778999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+.... +.+.+|+|+||++++++|.+. ++||||++ +|.+++|.++++++++ |.++++.+|+ T Consensus 315 l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~ 382 (415) T protein:vir:79 315 LIQPDVKE------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCC------CCCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 99875432 345689999999999998654 89999998 4779999999998875 4567789999 Q ss_pred EeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++|+|++|+||+||+++++++++.++ T Consensus 383 ~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:79 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 38 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=6.5e-60 Score=345.06 Aligned_cols=399 Identities=12% Similarity=0.076 Sum_probs=253.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEK-KEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |.++.++++++.++.+++....+...+.+.+. .+..+ ++..++++++++++.++..+.+ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~--------------------~~~~e~~~l~~~i~~~~~~~~~ 60 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAE--------------------KLEQEITDLRSQIQEKQEELDK 60 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888877766655433222211111 01111 1122222222222222221111 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+........ ... .....+.+......... ...............+..+.......... .....+++.+ T Consensus 61 ~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g 129 (415) T protein:vir:81 61 LKEKDGTSEN--NQQ-SVEVNEARTYRNQANIN-------DLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSG 129 (415) T ss_pred HHHHHhhhhh--ccc-ccccchhhhHHHHHHHH-------HHhhhhhhhhhHHHHHHHHHHHHhhhhhh-hhcccccccc Confidence 1110000000 000 00000000000000000 00000000000111111122111111111 1122233344 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeec-CCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) |.++|+++...|++.+++.++|++++++++|++++.++|+... ....++|++|++.+|+. .++|+++++.++|+++++ T Consensus 130 g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:81 130 FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 4566667889999999999999999999999887766665432 23578999999999964 689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+||++|+ +++++||.++|+++++.++|.+|++|+|++.+.+............ T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:81 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999997 5799999999999999999999999999887665543221111000 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ........+++..++..+..++ +.++.|+|||.+|..|+++||++|+| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:81 267 -------------------------------EVKKAKSLDDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchhHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCce Confidence 0011123455556666655444 45778999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+.... +.+.+|+|+||++++++|.+. ++||||++ +|.+++|.++++++++ |.++++.+|+ T Consensus 315 l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~~~ 382 (415) T protein:vir:81 315 LIQPDVKE------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCC------CCCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 99875432 345689999999999998654 89999998 4779999999998875 4567789999 Q ss_pred EeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++|+|++|+||+||+++++++++.++ T Consensus 383 ~~r~d~~v~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:81 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 39 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=2.8e-60 Score=347.06 Aligned_cols=386 Identities=14% Similarity=0.110 Sum_probs=244.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+= +..+++|.++..++.+.......+..+.+.. ... ..+...++.++++++..+.+.+..++.+. T Consensus 1 ~~~----~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~-------~~~---~~ee~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (404) T protein:vir:39 1 MGV----KLTVNQLNEAWIASGDKVTDFNDQINMALND-------DNF---SAEAMSELKNKRDNEKVRRDALREQLVEA 66 (404) T ss_pred CCh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------ccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 432 2223333333333222111111111111110 000 01112233334444444444444333322 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .. ... . ....... .... .......+.. ..............++++++| T Consensus 67 ~~~~~~~~~~~-~~---~~~-----~-----~~~~~~~-~~~~----~~~~~~~~~~--~~~~~~~e~~a~~~~t~~~gg 125 (404) T protein:vir:39 67 QAEQVVNMREE-EK---GPL-----N-----KSEYELK-DKFV----KEFVNMVRNP--MAFLNTVSSKTETSGSDSAAG 125 (404) T ss_pred HHHHHhccccc-cc---ccc-----c-----cchhhhH-HHHH----HHHHHHHhcc--hhhhhhhhhhhhhcccccCCc Confidence 21111100000 00 000 0 0000000 0000 0000000000 000011112222334444555 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEE--EeecCCccceeecccccccc-ccccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYL--TESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p--~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~ 237 (497) .+||+++...|++.+++.++|++++++++++++...+| +..+..+.+.||+||+.+|+ ++++|++++++++|+++++ T Consensus 126 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~ 205 (404) T protein:vir:39 126 LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGII 205 (404) T ss_pred eeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeee Confidence 66777888999999999999999999999987665554 44555567899999999997 6799999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+++|+ +++++||.++|+++++.++|.+||+|+|++.+.+.... T Consensus 206 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~~~~~~------------------------------- 254 (404) T protein:vir:39 206 TATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIAK------------------------------- 254 (404) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc------------------------------- Confidence 9999999997 68999999999999999999999999998765443211 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .+++..++.......+....+|+|||.+|..|+++||++|+| T Consensus 255 --------------------------------------~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~ 296 (404) T protein:vir:39 255 --------------------------------------FDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKY 296 (404) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCce Confidence 111222222222334445668999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCC--CCcC-----cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~--~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) +|.+.... +.+++|+|+||+++++ +|.. .+++|||++ +|.+++|.+++++++++..++|++|++.| T Consensus 297 l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 369 (404) T protein:vir:39 297 LLEPDPTK------PNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTTKI 369 (404) T ss_pred eeccCcCC------CCcceecceeEEEecccccCccCCCccEEEEEeccc-cEEEEeecceEEEEeccchhhhhhceeeE Confidence 99875432 3446899999999754 4432 489999998 47789999999999999988999999999 Q ss_pred EEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |++.|+|+.+++|+||+++++++++.++ T Consensus 370 r~~~r~d~~~~~~~a~~~~~~~~~a~~~ 397 (404) T protein:vir:39 370 RVIDRFDVKTTDSEALVAGSFTAIADQV 397 (404) T ss_pred EEEeeeccEEecccceEEEEeeccccCC Confidence 9999999999999999999999988766 No 40 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=8.6e-60 Score=344.41 Aligned_cols=399 Identities=12% Similarity=0.088 Sum_probs=255.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAE-KKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e-~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |....+++++++++.+++.+..+...+.+.+ ..+..+++.++++.++ ++++.+...+.+ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~--------------------~~i~~~~~~~~~ 60 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLR--------------------SQIQEKQEELDK 60 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHH--------------------HHHHHHHHHHHH Confidence 8888888888877766655544322221111 1122222222222222 222222111111 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+.......... .............. .... ...............+..+......... .....++++++ T Consensus 61 ~~~~~~~~~~~~--~~~~~~~~~~~~~~-~~~~-------~~~~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~~~~~~g 129 (415) T protein:vir:94 61 LKEKDGTSENNQ--QSVEVNEASTYRNQ-ANIN-------DLGISIQNTKVTSQEVRDFTEYLETRND-IQGGSLKTDSG 129 (415) T ss_pred HHHHHHhhhhcc--ccccccchhhHHHH-HHHH-------HHHhhhhhhhhhHHHHHHHHHHhhhhhh-hhhhccccccc Confidence 111000000000 00000000000000 0000 0000000000011111122222112111 12223344455 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeec-CCccceeecccccccc-ccccceeEEeeeeeeeeec Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESA-AHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANAL 237 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~ 237 (497) |.++|+++...|++.+++.++|++++++++|++++.++|+... ..+.+.|++||+.+|+ +.++|++|++.+||+++++ T Consensus 130 ~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~ 209 (415) T protein:vir:94 130 FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred cccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeec Confidence 6667778889999999999999999999999887666654432 2357899999999996 5689999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.++|+++++.++|.+|++|+|++.+.++........... T Consensus 210 ~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~----------------------- 266 (415) T protein:vir:94 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL----------------------- 266 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccccc----------------------- Confidence 9999999997 5799999999999999999999999999987666543322111100 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ........+++..++..+..++ +.++.|+|||.+|..|+++||++|+| T Consensus 267 -------------------------------~~~~~~~~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:94 267 -------------------------------EVKKAKSLDDIKDAINLNVKPN-YEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred -------------------------------ccccccchHHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHHhhccCCCe Confidence 0011122445556666555444 45778999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCCCCcCc-----EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-----ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-----~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ||.+.... +.+++|||+||++++++|.+. ++||||++ +|.+++|.++++++++ |.++++.||+ T Consensus 315 l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~v~~~~-----~~~~~~~~r~ 382 (415) T protein:vir:94 315 LIQPDVKE------KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD-AIVLFDRSQYQASWTD-----YMHFGECLMI 382 (415) T ss_pred eeccCcCC------CCCceecceeeEEecccccCCCCccEEEEEehhc-cEEEEeecceEEEEec-----cccCceEEEE Confidence 99875432 345689999999999998664 89999998 4778999999998775 5678889999 Q ss_pred EeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.|+|++|++|+||+++++++++.++ T Consensus 383 ~~r~d~~~~~~~a~~~~~~~~~~~~~ 408 (415) T protein:vir:94 383 AVRQDCRILDYKSAIVIEYDDSERGE 408 (415) T ss_pred EEEeccEEeccccEEEEEEeccCCCC Confidence 99999999999999999999999999 No 41 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=2.4e-60 Score=347.48 Aligned_cols=383 Identities=15% Similarity=0.126 Sum_probs=243.3 Q ss_pred CchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTA---QLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 m~~~~---~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) |..-+ ++++++.++.++++++.++ +++ ..+..... .+...++.++++.+.++.+.++.++ T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e-------~~~-------~~~~~~~~---~e~i~e~~~~~~~~~~~~~~~~~~~ 63 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQ-------INM-------ALNDDNFS---AEAMSELKNKRDNEKVRRDALREQL 63 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHH-------HHH-------HHhhhccc---HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44333 3333333333333332211 111 11000000 0112233333344444444433333 Q ss_pred HHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) .+.+......... .... ............... ......+. . ..............++.. T Consensus 64 ~~~~~~~~~~~~~----~~~~---~~~~~~~~~~~~~~~------------~~~~~~~~-~-~~~~~~~~~~a~~~~~~~ 122 (408) T protein:vir:74 64 VEAQAEQVVNMRE----EEKG---PLNKSENELKDKFVK------------DFVNMVRN-P-MAFLNTVSSKTETSGSDS 122 (408) T ss_pred HHHHHHHHhhccc----cccc---cccchhhhhHHHHHH------------HHHHHHhc-c-hhhhhhhhhhhhcccccC Confidence 2222111110000 0000 000000000000000 00000000 0 000111122222334444 Q ss_pred cCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceE--EEEeecCCccceeecccccccc-ccccceeEEeeeeeee Q lcl|Aclame:pro 158 TFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVA 234 (497) Q Consensus 158 ~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia 234 (497) .+|.+||+++...||+.+++.++|+++|+++++++++.+ +++..+..+.++|++|++.+|+ ++++|++++++++|++ T Consensus 123 ~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~ 202 (408) T protein:vir:74 123 AAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYA 202 (408) T ss_pred CCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEE Confidence 556667778889999999999999999999999876554 5555554556789999999996 6799999999999999 Q ss_pred eechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 235 NALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 235 ~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) ++++||+|+++|+ +++++||.++|+++++.++|.+||+|+|++.+.+.... T Consensus 203 ~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~~~~~---------------------------- 254 (408) T protein:vir:74 203 GIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIAN---------------------------- 254 (408) T ss_pred eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc---------------------------- Confidence 9999999999997 58999999999999999999999999998865432110 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhccc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) .+++..++.....+.+....+|+|||.+|..|+++||++ T Consensus 255 -----------------------------------------~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~ 293 (408) T protein:vir:74 255 -----------------------------------------FDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAE 293 (408) T ss_pred -----------------------------------------HHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCC Confidence 111222222222334445668999999999999999999 Q ss_pred CcccccccccccccccccccccccccceeecCC--CCc-----CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 394 G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~--~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) |+|+|.+.... +.+.+|+|+||+++++ +|. +.++||||++ +|.+++|.+++++++++.++.|.+|+ T Consensus 294 G~~l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~ 366 (408) T protein:vir:74 294 GKYLLEPDPTK------PNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDT 366 (408) T ss_pred CceEeccCcCC------CCCceecceeeEEecCcccccccCCcceEEEEehhc-cEEEEEecceEEEEeccccchhhcce Confidence 99999876433 3446899999998653 553 3489999998 47899999999999999888999999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.||++.|+||.+++|+||+++++++++++- T Consensus 367 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 397 (408) T protein:vir:74 367 TKIRVIDRFDVKATDSEALVAGSFTAIADQV 397 (408) T ss_pred eeEEEEEeeCcEEecccceEEEEeecccCCC Confidence 9999999999999999999999998776655 No 42 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.4e-59 Score=343.30 Aligned_cols=376 Identities=17% Similarity=0.153 Sum_probs=243.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKK-EALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~-~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |-- .+|++++.++.++++++.++..+...+.+ +......++++ .+.++++.+++..+..+....+ T Consensus 1 M~~-~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~-------------~l~~~i~~~~~~~~~~~~~~~~ 66 (395) T protein:vir:38 1 MNI-NQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDIN-------------KLNASLKNAKMAQELAKSAYED 66 (395) T ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 544 44666666666666655443222111111 00000001111 1111122111111111000000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .......... ....... ...... .......+.... ........++++.+ T Consensus 67 --------~~~~~~~~~~------~~~~~~~----------~~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~~~g 115 (395) T protein:vir:38 67 --------ARANLNAEPV------NKKPLPV----------KDGKPD----AQAMKNQFVKDF---KNLVTSGTTGTGNA 115 (395) T ss_pred --------HHhhhhhccc------cccccch----------hhhhHH----HHHHHHHHHHHH---HHHHhhccCccCCC Confidence 0000000000 0000000 000000 000011111110 01111223344556 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEE--EeecCCccceeeccccccccc-cccceeEEeeeeeeeee Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYL--TESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANA 236 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p--~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~ 236 (497) |.+||+++...|++.+++.++|+++|++++++++...++ +..+..+.++|++|++.+|++ +++|+.|++++||++++ T Consensus 116 g~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~ 195 (395) T protein:vir:38 116 GLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGI 195 (395) T ss_pred ceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEee Confidence 667888889999999999999999999999987665554 444445678999999999976 59999999999999999 Q ss_pred chhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+||++|+ ++|++||.++|+++++.++|.+|++|+|++.+.+.... T Consensus 196 ~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~------------------------------ 245 (395) T protein:vir:38 196 TTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPTISQ------------------------------ 245 (395) T ss_pred hhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc------------------------------ Confidence 99999999997 57999999999999999999999999998754321100 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc Q lcl|Aclame:pro 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) .+++..++.......+....+|+|||.+|..|+++||++|+ T Consensus 246 ---------------------------------------~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~ 286 (395) T protein:vir:38 246 ---------------------------------------FDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGR 286 (395) T ss_pred ---------------------------------------HHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCc Confidence 11112222222233444566899999999999999999999 Q ss_pred ccccccccccccccccccccccccceeecCCCCc------CcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL------GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~------~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) |+|.+.... +.+.+|+|+||+++++++. ..++||||++ +|.+++|.+++|+++++.+.+|++|+++| T Consensus 287 ~l~~~~~~~------~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~ 359 (395) T protein:vir:38 287 YLMQPDVTS------PDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQ-GITLFDRQQMQIDTTNVGAGSFEHDTTKL 359 (395) T ss_pred eeeccCcCC------CCcceeccceeEEecccccCcCCCcceEEEEeccc-cEEEEEecceEEEEeccccchhhcCceEE Confidence 999875432 3456899999999987643 2389999998 57899999999999999888999999999 Q ss_pred EEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |++.|+|+.+.+|+||++++++++++.+ T Consensus 360 r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 387 (395) T protein:vir:38 360 RFIDRFDVQLIDDGAFAAASFKTVANQA 387 (395) T ss_pred EEEEeeccEEecccceEEEEeecccCCC Confidence 9999999999999999999999888777 No 43 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=9.2e-60 Score=344.24 Aligned_cols=412 Identities=16% Similarity=0.106 Sum_probs=235.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 5 AQLEAQGRQLAKSIKDINADETKTA--AE-KKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVE 81 (497) Q Consensus 5 ~~~~~~~~~l~~~~~~~~~~~~~~~--~e-~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~ 81 (497) +.+++.++.+.+..++......... .+ ..+.+.++..+++ ++.++++.+..+++.++....... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~-------------~l~~~~~~l~~~i~~le~~~~~~~ 67 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVE-------------QLTKEIQTISEELAKLEEKEKEED 67 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHH Confidence 3344433333332222211000000 00 0111111112222 122222222222222211111100 Q ss_pred HHHHHHHHHHHH--hhhhhhHHHhhhhhhhhhhhhhhhhhhhhh-hhhHHHHHHHHHHHHHhhhhhh--hhhhhhhhccc Q lcl|Aclame:pro 82 VRNLKQIRKHLA--RAVIMNPELKNATSFEKGTKFDVSFNVSAK-AADPGTAAAELMGAFADGETAP--AAIGQNPFGST 156 (497) Q Consensus 82 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 156 (497) ............ ...............+.............. .........+.+..+....... .........++ T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t 147 (434) T protein:vir:62 68 PAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVT 147 (434) T ss_pred HHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccc Confidence 000000000000 000000000000000000000000000000 0000011111222221111110 01111222334 Q ss_pred ccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceee---ccccccccccccceeEEeeeeee Q lcl|Aclame:pro 157 GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAV---AEAGTYPFSSEEFARVYEQVGKV 233 (497) Q Consensus 157 ~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v---~Eg~~~~~s~~~~~~v~~~~~ki 233 (497) +.+|.+||+++...|++.+++.++|+++++++++++ +++||+.+.. +.+.|+ +|+..+|.++++|++|++.+||+ T Consensus 148 ~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~-~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~ 225 (434) T protein:vir:62 148 GNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKK-AEAQGHKNERTNNEMPETDIEFDEIELSPTEF 225 (434) T ss_pred cccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecC-CcccceecccccccccccccceeeEEeeheee Confidence 445556677788899999999999999999988865 5899988763 455654 56788999999999999999999 Q ss_pred eeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhh Q lcl|Aclame:pro 234 ANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFP 311 (497) Q Consensus 234 a~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (497) +++++||+|||+|+ +++++||.++|+++++.++|.+||+|+|+++ +.|+++..+....+ T Consensus 226 ~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~------------------- 286 (434) T protein:vir:62 226 DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKT------------------- 286 (434) T ss_pred EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccc------------------- Confidence 99999999999998 5899999999999999999999999999987 45665543221100 Q ss_pred hhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhc Q lcl|Aclame:pro 312 ADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKD 391 (497) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd 391 (497) ......+++..+...+...+ .....|+|||.+|..|+++|| T Consensus 287 --------------------------------------~~~~~~d~l~~l~~~l~~~~-~~~a~~v~n~~~~~~L~~lkd 327 (434) T protein:vir:62 287 --------------------------------------DEKNLYDALVKMKNTPVKEV-RKKARWVLNTAALTKIETMKT 327 (434) T ss_pred --------------------------------------cccchhhHHHHHHhhcchhh-hcCCEEEEcHHHHHHHHHhhc Confidence 11123455556666555444 455689999999999999999 Q ss_pred ccCcccccccccccccccccccccccccceeecCCCCcCc------EEEeeccceEEEEEecc-ccEEEEeccchhhhhc Q lcl|Aclame:pro 392 ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARRE-GVTMQMTNSNGTDFVD 464 (497) Q Consensus 392 ~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------~~~gd~~~~~~~i~~r~-~~~i~~~~~~~~~f~~ 464 (497) ++|+|+|++..... .+.+++|+|+||++++.+|.+. ++||||++ |.|++|. .++|+++++. +|.+ T Consensus 328 ~~G~~l~~~~~~~~----~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~--~~i~~~~g~~~i~~~~~~--~~~~ 399 (434) T protein:vir:62 328 DDGFPLLRPFNQAE----GGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSK--FYIQDVIGSLEVQKLVEL--FSRT 399 (434) T ss_pred cCCCEeeccCCCcc----CCCCceecceeeEEecCccCccCCCceEEEEeeccc--eEEEEeeceeEEEeehhh--hccc Confidence 99999998744321 2345689999999999998654 78999997 4577775 4778877655 5889 Q ss_pred CceEEEEEeeeccEeec-ccceEEEEec-CCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYR-PSAFQLIQLK-KGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~-~~Af~~~~~~-~~a~~~ 497 (497) |+|+||++.|+|+++++ |.++..++++ .+|+++ T Consensus 400 ~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 400 NRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred CceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 99999999999999876 8887777665 444444 No 44 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.6e-58 Score=337.47 Aligned_cols=411 Identities=14% Similarity=0.081 Sum_probs=237.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAH-ERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~-e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |- +.+++++++++..+++..... +.++.+...+..++++....++... ++.+++..++++..........+... T Consensus 1 Mk-i~elk~el~~~~~el~~~~~e----lr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~ 75 (437) T protein:vir:10 1 MK-IEKLKKDLATKTAELNTKKAE----IRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRD 75 (437) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76 777888777776665554321 1122222222222222222111111 11112222222221111111000000 Q ss_pred HhHHHHH-----HHHHHHHhhhhhhHHHhhhhhhhhhhhhhh---hhh-hhhh-hhhHHHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 80 VEVRNLK-----QIRKHLARAVIMNPELKNATSFEKGTKFDV---SFN-VSAK-AADPGTAAAELMGAFADGETAPAAIG 149 (497) Q Consensus 80 ~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (497) ....... ..............+............... ... .... ..............+..... .... T Consensus 76 ~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~e~- 153 (437) T protein:vir:10 76 DSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLK-TGEV- 153 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHH-hhhh- Confidence 0000000 000000000000000000000000000000 000 0000 00000000000011111111 1111 Q ss_pred hhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc-ccccceeEEe Q lcl|Aclame:pro 150 QNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYE 228 (497) Q Consensus 150 ~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~ 228 (497) ......++..|++++|+....+|..+.+.+.|++++++++++++.+.+|+.....+.++|++|++..|+ ++++|++|++ T Consensus 154 ~~~~~~~~~~~g~lvp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~ 233 (437) T protein:vir:10 154 RDVTGIALKDGKVIIPETILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILW 233 (437) T ss_pred hhhhhcccccccccchHHHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeee Confidence 122223334444444544555666778888999999999999999999998776678999999999996 5699999999 Q ss_pred eeeeeeeechhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHH Q lcl|Aclame:pro 229 QVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSN 307 (497) Q Consensus 229 ~~~kia~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~ 307 (497) .+||++++++||+|||+|++ +|++||.++|+++++.++|.+|++|+|++.+.+.... T Consensus 234 ~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~---------------------- 291 (437) T protein:vir:10 234 DLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTY---------------------- 291 (437) T ss_pred ehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc---------------------- Confidence 99999999999999999985 7999999999999999999999999998765432111 Q ss_pred HhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHH Q lcl|Aclame:pro 308 VKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR 387 (497) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~ 387 (497) ..+++.+++.....+.+..+..|+|||.+|..|+ T Consensus 292 ----------------------------------------------~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~ 325 (437) T protein:vir:10 292 ----------------------------------------------LLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFD 325 (437) T ss_pred ----------------------------------------------chhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHH Confidence 0111112221112233444568999999999999 Q ss_pred HHhcccCcccccccccccccccccccccccccceeecCCC--Cc---Cc--EEEeeccceEEEEEeccccEEEEeccchh Q lcl|Aclame:pro 388 LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PL---GT--ILVGHFAPSVIQTARREGVTMQMTNSNGT 460 (497) Q Consensus 388 ~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~--~~---~~--~~~gd~~~~~~~i~~r~~~~i~~~~~~~~ 460 (497) ++||++|+|||.+.... +.+++|||+||++++++ |. |+ ++||||++ +|.+++|.++++.+++. T Consensus 326 ~lkd~~g~~~~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~r~~~~~~~~~~--- 395 (437) T protein:vir:10 326 MATDAMGRPLLQPNVTA------ATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKK-AVINFKLTEITGQFQDT--- 395 (437) T ss_pred HhhccCCCeeeccCccC------CCCcccccceeEEecccccCCcCCCceEEEEeeccc-cEEEEeeeceEEEEecc--- Confidence 99999999999876432 34568999999998764 43 22 89999998 47899999999987753 Q ss_pred hhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 461 DFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 461 ~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |..+...+|+.+|+||.|+||+|||+|+.+.++..+ T Consensus 396 -~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~ 431 (437) T protein:vir:10 396 -YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV 431 (437) T ss_pred -cccccceeeEEEEEccEEecccceEEEEeecccccc Confidence 445667899999999999999999999977665555 No 45 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.5e-59 Score=343.11 Aligned_cols=378 Identities=15% Similarity=0.160 Sum_probs=251.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++++.+.++.++++++.++......+. +...++++ ++..+++.+.++++.++...... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~----~~~~ee~~-------------~l~~ei~~~~~~~~~~~~~~~~~ 63 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDD----SVTAEELQ-------------AIKNERDTAKMKRDMFKEQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcch----hhhHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHH Confidence 99998888888777776666544322211111 01111111 12222222222222222111111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .......... ....... ...... ............+... .........++++.+| T Consensus 64 ~~~~~~~~~~---------~~~~~~~-----~~~~~~-----~~~~~~~~~~~~~~~~------~~~~~~~~~~t~~~gg 118 (397) T protein:vir:48 64 RANEVVNMSE---------EEKKPLT-----KSEEEV-----KAGFVKDFKNLVRGRY------QNLLDSKTDASGSDAG 118 (397) T ss_pred HHhhhhhhhh---------hcccccc-----chhhHH-----HHHHHHHHHHHHhhhh------hHHHHHhhccCCcccc Confidence 0000000000 0000000 000000 0000000000000000 0011112233444556 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEe--ecCCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTE--SAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~--~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) .+||+++...||+.+++.++|+++++++++++++..+|+. .+..+.++|++||+.+|++ +++|++|+++++|+++++ T Consensus 119 ~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 198 (397) T protein:vir:48 119 LTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGIS 198 (397) T ss_pred ccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeeh Confidence 6788889999999999999999999999998877666543 4444568999999999986 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|+|+|+ .++++||.++|+++++.++|.+|++|+|++.+.+... T Consensus 199 ~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~~~~-------------------------------- 246 (397) T protein:vir:48 199 TVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTKPTLT-------------------------------- 246 (397) T ss_pred hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-------------------------------- Confidence 9999999997 5799999999999999999999999999876533111 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ..+++..+...+... +..+..|+|||.+|..|+++||++|+| T Consensus 247 -------------------------------------~~d~i~~~~~~l~~~-~~~~a~~v~n~~~~~~L~~lkd~~G~~ 288 (397) T protein:vir:48 247 -------------------------------------KWDDIIDLQAKVDPA-IKQTSFFLTNTSGFTALKKVKNAFGDY 288 (397) T ss_pred -------------------------------------cHHHHHHHHHHhhhh-hcCCCEEEECHHHHHHHHHhhcCCCce Confidence 112233344444433 455679999999999999999999999 Q ss_pred cccccccccccccccccccccccceeecCC--CC-----cCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IP-----LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~--~~-----~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) +|++.... +.+++|+|+||++++. +| ...++||||+. +|.++++.+++++++++.+++|.+|++.| T Consensus 289 i~~~~~~~------~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:48 289 LMERDVKS------PTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQ-AVTLFDRQQMSLLSTNIGGGAFETDTTKI 361 (397) T ss_pred eeccCcCC------CCCceeccceeEEecccccCCcCCCceEEEEEeccc-eEEEEeecceEEEEeccchhhhhcCceeE Confidence 99876432 3456999999988543 33 34589999998 47789999999999999888999999999 Q ss_pred EEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |+++|+|+.+++|+||++++++++++.+ T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:48 362 RVIDRFDVVATDTESFVPASFKAIADQK 389 (397) T ss_pred EEEeeeccEEecccceEEEEecccccCC Confidence 9999999999999999999999999888 No 46 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1.5e-59 Score=343.15 Aligned_cols=374 Identities=16% Similarity=0.145 Sum_probs=230.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. +++++ ++++++++.+++++...+ +..++.+++.++++.++++++..+. +.+. T Consensus 1 M~-------------k~l~e-----------l~~~~~~~~~e~~~~~~~-~~~~e~~~~~~e~~~l~~~i~~~~~-~~~~ 54 (392) T protein:vir:10 1 MS-------------KELRE-----------LLAKLEGKKEEVRSLMGE-DKVAEAEQMMEEVRSLQKKIDLQRS-LDEA 54 (392) T ss_pred Cc-------------HHHHH-----------HHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 33 11111 111111111111111100 0000111111222222222211110 0000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +...... .......... ...+....+..... ........+ .+.. ..........++++.+| T Consensus 55 ~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~l~-------~~~~~~~~~-~~~~---~~~~~~~~~~~t~~~gg 115 (392) T protein:vir:10 55 ETEERNN------GREVETRNVD--GEMEYRDVFMKALR-------NKPLNAEER-EFLE---DDLEQRAMSGLTGEDGG 115 (392) T ss_pred HHHHhhc------cccccccCcc--chHHHHHHHHHHHh-------cccccHHHH-HHHh---hhhhhhhccccccCCCc Confidence 0000000 0000000000 00000000000000 000000000 0000 01111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceE--EEEeecCCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) .+||+++.+.|++.+++.++|+++++++++++++.+ +|+.++ .+.++||+||+.+|++ .++|+++++.+||+++++ T Consensus 116 ~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~ 194 (392) T protein:vir:10 116 LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL 194 (392) T ss_pred eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEee Confidence 677888889999999999999999999999877655 455444 3579999999999976 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.+.|+++++.++|.+|++|+|++.+.++.+ T Consensus 195 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------------------------- 242 (392) T protein:vir:10 195 PLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------------------------- 242 (392) T ss_pred hhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------------------------- Confidence 9999999997 6899999999999999999999999999876544321 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .+++..++.....+.+..+..|+|||.+|..|+++||++|+| T Consensus 243 --------------------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~ 284 (392) T protein:vir:10 243 --------------------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCe Confidence 111222222223344556678999999999999999999999 Q ss_pred cccccccccccccccccccccccceeec-CCC-C------cC--cEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s-~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) +|.+.... +.+++|+|+|+|++ +.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|+ T Consensus 285 l~~~~~~~------~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~ 357 (392) T protein:vir:10 285 ILQSDPTQ------KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNT 357 (392) T ss_pred EeecCccC------CccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCc Confidence 99876533 34568999876653 222 1 12 378999998 57899999999999999888999999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++||++.|+|+.|++|+||+++++++++-.. T Consensus 358 ~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 358 LDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred eEEEEEEeeccEEecccceEEEEeccccccc Confidence 9999999999999999999999998776666 No 47 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1.5e-59 Score=343.15 Aligned_cols=374 Identities=16% Similarity=0.145 Sum_probs=230.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. +++++ ++++++++.+++++...+ +..++.+++.++++.++++++..+. +.+. T Consensus 1 M~-------------k~l~e-----------l~~~~~~~~~e~~~~~~~-~~~~e~~~~~~e~~~l~~~i~~~~~-~~~~ 54 (392) T protein:vir:10 1 MS-------------KELRE-----------LLAKLEGKKEEVRSLMGE-DKVAEAEQMMEEVRSLQKKIDLQRS-LDEA 54 (392) T ss_pred Cc-------------HHHHH-----------HHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 33 11111 111111111111111100 0000111111222222222211110 0000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +...... .......... ...+....+..... ........+ .+.. ..........++++.+| T Consensus 55 ~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~l~-------~~~~~~~~~-~~~~---~~~~~~~~~~~t~~~gg 115 (392) T protein:vir:10 55 ETEERNN------GREVETRNVD--GEMEYRDVFMKALR-------NKPLNAEER-EFLE---DDLEQRAMSGLTGEDGG 115 (392) T ss_pred HHHHhhc------cccccccCcc--chHHHHHHHHHHHh-------cccccHHHH-HHHh---hhhhhhhccccccCCCc Confidence 0000000 0000000000 00000000000000 000000000 0000 01111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceE--EEEeecCCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) .+||+++.+.|++.+++.++|+++++++++++++.+ +|+.++ .+.++||+||+.+|++ .++|+++++.+||+++++ T Consensus 116 ~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~ 194 (392) T protein:vir:10 116 LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL 194 (392) T ss_pred eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEee Confidence 677888889999999999999999999999877655 455444 3579999999999976 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.+.|+++++.++|.+|++|+|++.+.++.+ T Consensus 195 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------------------------- 242 (392) T protein:vir:10 195 PLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------------------------- 242 (392) T ss_pred hhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------------------------- Confidence 9999999997 6899999999999999999999999999876544321 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .+++..++.....+.+..+..|+|||.+|..|+++||++|+| T Consensus 243 --------------------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~ 284 (392) T protein:vir:10 243 --------------------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCe Confidence 111222222223344556678999999999999999999999 Q ss_pred cccccccccccccccccccccccceeec-CCC-C------cC--cEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s-~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) +|.+.... +.+++|+|+|+|++ +.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|+ T Consensus 285 l~~~~~~~------~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~ 357 (392) T protein:vir:10 285 ILQSDPTQ------KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNT 357 (392) T ss_pred EeecCccC------CccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCc Confidence 99876533 34568999876653 222 1 12 378999998 57899999999999999888999999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++||++.|+|+.|++|+||+++++++++-.. T Consensus 358 ~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 358 LDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred eEEEEEEeeccEEecccceEEEEeccccccc Confidence 9999999999999999999999998776666 No 48 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1.5e-59 Score=343.15 Aligned_cols=374 Identities=16% Similarity=0.145 Sum_probs=230.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. +++++ ++++++++.+++++...+ +..++.+++.++++.++++++..+. +.+. T Consensus 1 M~-------------k~l~e-----------l~~~~~~~~~e~~~~~~~-~~~~e~~~~~~e~~~l~~~i~~~~~-~~~~ 54 (392) T protein:vir:10 1 MS-------------KELRE-----------LLAKLEGKKEEVRSLMGE-DKVAEAEQMMEEVRSLQKKIDLQRS-LDEA 54 (392) T ss_pred Cc-------------HHHHH-----------HHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 33 11111 111111111111111100 0000111111222222222211110 0000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +...... .......... ...+....+..... ........+ .+.. ..........++++.+| T Consensus 55 ~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~l~-------~~~~~~~~~-~~~~---~~~~~~~~~~~t~~~gg 115 (392) T protein:vir:10 55 ETEERNN------GREVETRNVD--GEMEYRDVFMKALR-------NKPLNAEER-EFLE---DDLEQRAMSGLTGEDGG 115 (392) T ss_pred HHHHhhc------cccccccCcc--chHHHHHHHHHHHh-------cccccHHHH-HHHh---hhhhhhhccccccCCCc Confidence 0000000 0000000000 00000000000000 000000000 0000 01111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceE--EEEeecCCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) .+||+++.+.|++.+++.++|+++++++++++++.+ +|+.++ .+.++||+||+.+|++ .++|+++++.+||+++++ T Consensus 116 ~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~ 194 (392) T protein:vir:10 116 LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL 194 (392) T ss_pred eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEee Confidence 677888889999999999999999999999877655 455444 3579999999999976 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.+.|+++++.++|.+|++|+|++.+.++.+ T Consensus 195 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------------------------- 242 (392) T protein:vir:10 195 PLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------------------------- 242 (392) T ss_pred hhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------------------------- Confidence 9999999997 6899999999999999999999999999876544321 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .+++..++.....+.+..+..|+|||.+|..|+++||++|+| T Consensus 243 --------------------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~ 284 (392) T protein:vir:10 243 --------------------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCe Confidence 111222222223344556678999999999999999999999 Q ss_pred cccccccccccccccccccccccceeec-CCC-C------cC--cEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s-~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) +|.+.... +.+++|+|+|+|++ +.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|+ T Consensus 285 l~~~~~~~------~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~ 357 (392) T protein:vir:10 285 ILQSDPTQ------KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNT 357 (392) T ss_pred EeecCccC------CccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCc Confidence 99876533 34568999876653 222 1 12 378999998 57899999999999999888999999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++||++.|+|+.|++|+||+++++++++-.. T Consensus 358 ~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 358 LDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred eEEEEEEeeccEEecccceEEEEeccccccc Confidence 9999999999999999999999998776666 No 49 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1.5e-59 Score=343.15 Aligned_cols=374 Identities=16% Similarity=0.145 Sum_probs=230.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |. +++++ ++++++++.+++++...+ +..++.+++.++++.++++++..+. +.+. T Consensus 1 M~-------------k~l~e-----------l~~~~~~~~~e~~~~~~~-~~~~e~~~~~~e~~~l~~~i~~~~~-~~~~ 54 (392) T protein:vir:10 1 MS-------------KELRE-----------LLAKLEGKKEEVRSLMGE-DKVAEAEQMMEEVRSLQKKIDLQRS-LDEA 54 (392) T ss_pred Cc-------------HHHHH-----------HHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Confidence 33 11111 111111111111111100 0000111111222222222211110 0000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +...... .......... ...+....+..... ........+ .+.. ..........++++.+| T Consensus 55 ~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~l~-------~~~~~~~~~-~~~~---~~~~~~~~~~~t~~~gg 115 (392) T protein:vir:10 55 ETEERNN------GREVETRNVD--GEMEYRDVFMKALR-------NKPLNAEER-EFLE---DDLEQRAMSGLTGEDGG 115 (392) T ss_pred HHHHhhc------cccccccCcc--chHHHHHHHHHHHh-------cccccHHHH-HHHh---hhhhhhhccccccCCCc Confidence 0000000 0000000000 00000000000000 000000000 0000 01111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceE--EEEeecCCccceeeccccccccc-cccceeEEeeeeeeeeec Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLS--YLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANAL 237 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~~a~~v~Eg~~~~~s-~~~~~~v~~~~~kia~~~ 237 (497) .+||+++.+.|++.+++.++|+++++++++++++.+ +|+.++ .+.++||+||+.+|++ .++|+++++.+||+++++ T Consensus 116 ~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~ 194 (392) T protein:vir:10 116 LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL 194 (392) T ss_pred eecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEee Confidence 677888889999999999999999999999877655 455444 3579999999999976 599999999999999999 Q ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 238 TITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 238 ~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) +||+|||+|+ +++++||.+.|+++++.++|.+|++|+|++.+.++.+ T Consensus 195 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------------------------- 242 (392) T protein:vir:10 195 PLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------------------------- 242 (392) T ss_pred hhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------------------------- Confidence 9999999997 6899999999999999999999999999876544321 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .+++..++.....+.+..+..|+|||.+|..|+++||++|+| T Consensus 243 --------------------------------------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~ 284 (392) T protein:vir:10 243 --------------------------------------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) T ss_pred --------------------------------------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCe Confidence 111222222223344556678999999999999999999999 Q ss_pred cccccccccccccccccccccccceeec-CCC-C------cC--cEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLI-P------LG--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s-~~~-~------~~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) +|.+.... +.+++|+|+|+|++ +.+ + .+ .++||||++ +|.+++|.+++++++++.+.+|++|+ T Consensus 285 l~~~~~~~------~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~-~~~i~~~~~~~~~~~~~~~~~f~~~~ 357 (392) T protein:vir:10 285 ILQSDPTQ------KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNT 357 (392) T ss_pred EeecCccC------CccccccCcccEEEecccccCCCcccCCceEEEEEehhc-eEEEEeecceEEEEeccccchhhcCc Confidence 99876533 34568999876653 222 1 12 378999998 57899999999999999888999999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++||++.|+|+.|++|+||+++++++++-.. T Consensus 358 ~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~ 388 (392) T protein:vir:10 358 LDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) T ss_pred eEEEEEEeeccEEecccceEEEEeccccccc Confidence 9999999999999999999999998776666 No 50 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=3.7e-59 Score=340.95 Aligned_cols=366 Identities=15% Similarity=0.138 Sum_probs=232.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+-++..+..++ .+++.+.++...+..+..+.+++.. +.+..+.. T Consensus 1 ik~L~e~~~e~~e~----------------------------~~~~~~~~~~~~~~~e~~~~~~~~~---~~~~~~~~-- 47 (390) T protein:vir:40 1 MNNLDKKDSETLNI----------------------------STAFLNAIKEGATEAEQVTAFTNMA---EQIQNNII-- 47 (390) T ss_pred CchHHHHHHHHHHH----------------------------HHHHHHHHhhhhhHHHHHHHHHHHH---HHHHHHHH-- Confidence 22222222211111 1111111110000001111111100 00000000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .+.+..... ..........++.. . ...+.+..+. .....+++.++| T Consensus 48 -----~~~~~~~~~-----~~~~~~~~~~~~~~--------~-------l~~~~r~~~~---------~~~~~~~~~~gg 93 (390) T protein:vir:40 48 -----AQARKEVNR-----EMNDNNVLASRGAN--------A-------LTSDESKYYN---------EVIAGNGFAGVT 93 (390) T ss_pred -----HHHHHHHHH-----HHHHHHHHHhcCch--------h-------ccHHHHHHHH---------HHHhccCcccCc Confidence 000000000 00000000000000 0 0000111110 011123445566 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccc-cccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~-~s~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++...|++.+++.++|+++|++++++++...+|+.++ .+.+.|++|++..+ .++++|+++++++||++++++| T Consensus 94 ~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~-~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~i 172 (390) T protein:vir:40 94 ALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGD-VATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPV 172 (390) T ss_pred ccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcC-CcceeeeccccccCccccccceeeEeeeeeEEEeehh Confidence 778888999999999999999999999999999999999876 57899999998876 4689999999999999999999 Q ss_pred hHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|++ ++++||.++|+++++.++|.+||+|+|+++|.||++..+..+............ T Consensus 173 S~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~---------------- 236 (390) T protein:vir:40 173 CNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPL---------------- 236 (390) T ss_pred hHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccccccccccccccc---------------- Confidence 999999985 799999999999999999999999999999999998665433222111000000 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHH----HHHHHHhcccC Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW----ELLRLTKDANG 394 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~----~~l~~lkd~~G 394 (497) ........+..+...+.. .........+|+|||.++ ..+++++|.+| T Consensus 237 ----------------------------t~~~~~~~~~~l~~~~~~-~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G 287 (390) T protein:vir:40 237 ----------------------------TDLTPATLATKVMLPLTD-NGKKSVSDAILVINPADYWSKIYAATSYMTPQG 287 (390) T ss_pred ----------------------------chhhHHHHHHHHHHHhhc-chhhhhcCceEEEcchhHHHHHHHHhhccCCCC Confidence 000000111111111111 111233456799999985 46668999999 Q ss_pred cccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEee Q lcl|Aclame:pro 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEER 474 (497) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r 474 (497) +|+|.. .++|+||+.+++||+++++||||++ |.+++|.+++|+++++. +|.+|++.||+++| T Consensus 288 ~~v~~~--------------~~~g~pvv~~~~~p~~~i~~Gd~s~--~~i~~~~~~~v~~~~~~--~f~~~~~~~r~~~r 349 (390) T protein:vir:40 288 VWVTGI--------------LPVPLEIVQSVAVPVGKAVAGRAKD--YFMGIGSEQVIRTSTEY--RLLDDETLYYAKQY 349 (390) T ss_pred cccccc--------------CCCceeEEEcCCCCCCcEEEEeece--EEEEeecceEEEecchh--hhhcCcEEEEEEEE Confidence 999854 3579999999999999999999997 56889999999998765 59999999999999 Q ss_pred eccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 475 LGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 475 ~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +|++|+||+||+++++++++... T Consensus 350 ~dg~v~~~~A~~~l~~~~~~~~~ 372 (390) T protein:vir:40 350 ANGRPKDNSSFLVFDITGLEGSP 372 (390) T ss_pred eCCEEecccceEEEEeeccCCCC Confidence 99999999999999999998542 No 51 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=1.5e-58 Score=337.56 Aligned_cols=387 Identities=14% Similarity=0.107 Sum_probs=233.7 Q ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MP--STAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~--~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |- .+.++++++.++.+++.+..+ +.+..+ . .+..++.+++.++++.+.+++++++.++. T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~-------e~~~~~-------~-----~~~~~~~~~l~~eie~l~~ei~~l~~~~~ 61 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTA-------QVKNAL-------E-----SDDLEAARSIKAEVEQAKANLVEAENDLK 61 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHH-------HHHHhh-------c-----hhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 122222222222222111111 110000 0 00112222333444444444444443333 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccc-c Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST-G 157 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 157 (497) +.+.................. +.+..+ ..+............... .......................+.+ . T Consensus 62 ~~e~~~e~~~~~~~~~~~~~~-~~~~~~-----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~t~~ 134 (394) T protein:vir:97 62 LYESSVEVGGAENIGGKEVTQ-EEKTYR-----ESVNDFIRSKGKIVNDSL-RFEGKDEVLMPINETTPVEPQKDGIKKE 134 (394) T ss_pred HHHHHhhhhccccccccccch-hhHHHH-----HHHHHHHHHHHHHhhhhh-hhhhHHHHHHHHHhhhhhhhhccccccc Confidence 222211100000000000000 000000 000000000000000000 00000000000011111112222333 3 Q ss_pred cCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeee Q lcl|Aclame:pro 158 TFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANA 236 (497) Q Consensus 158 ~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~ 236 (497) .+|.++|+++...|++.+++.++|+++|++++++++++.+|+....++.++|++||+.+|+ ++++|+.|++.+||++++ T Consensus 135 ~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~ 214 (394) T protein:vir:97 135 NAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGA 214 (394) T ss_pred cccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeee Confidence 4455677788899999999999999999999999999999998776678999999999996 679999999999999999 Q ss_pred chhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 237 LTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGT 315 (497) Q Consensus 237 ~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (497) ++||+|||+|+ +++++||.++|+++++.++|.+|++|.+++.+.+..+ T Consensus 215 i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~------------------------------- 263 (394) T protein:vir:97 215 IPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKN------------------------------- 263 (394) T ss_pred hhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc------------------------------- Confidence 99999999997 5799999999999999999999999987655433211 Q ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc Q lcl|Aclame:pro 316 NGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ 395 (497) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~ 395 (497) .+++..++.....+ ++ ...|+|||.+|..|+++||++|+ T Consensus 264 ---------------------------------------~~~~~~~~~~~~~~-~~-~a~~v~n~~~~~~l~~lkd~~G~ 302 (394) T protein:vir:97 264 ---------------------------------------LDEIKALLNGGFDP-AY-NVSLIVSQSFYQTLDTLKDGNGR 302 (394) T ss_pred ---------------------------------------HHHHHHHHHhhhhh-hh-CCEEEEcHHHHHHHHHhhccCCC Confidence 01111111111111 11 35799999999999999999999 Q ss_pred ccccccccccccccccccccccccceeecC--CCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEe Q lcl|Aclame:pro 396 YMGGNFFGNAYGNPVNGGKNIWGVPVVTTP--LIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~--~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~ 473 (497) |||.+.... +.+++|||+||++++ .+++++++||||++ +|.+++|.+++++++++. .+..+||+++ T Consensus 303 ~i~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 370 (394) T protein:vir:97 303 YLLQDDITA------VSGKVLLGKPVFVLSDEVLGANKAFIGDFKR-GVLFADRKDLGLRWADNE-----IYGQYLQAVL 370 (394) T ss_pred eeeecCcCC------CCCceeccceeEEecccccCCccEEEeeccc-cEEEEEecceEEEEeccc-----ccceeEEEEE Confidence 999875432 345699999999954 56777899999998 478999999999988654 3456899999 Q ss_pred eeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 474 RLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 474 r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |+|+.|.+|+||++|+++++++== T Consensus 371 r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 371 RFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred EEccEEecccceEEEEecccccCC Confidence 999999999999999997666555 No 52 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.2e-58 Score=338.06 Aligned_cols=380 Identities=17% Similarity=0.176 Sum_probs=244.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++.+++++...+++........+.....+.++ ++.++++.+.++.+.++.++++. T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~--------------------~~~~~~~~~~~~~~~l~~~i~~~ 60 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQ--------------------KIKDDLTAAKARRDAINDQIKDL 60 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHHH Confidence 8877777777766655555432211111000001111 12222222333333333322222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +........ ...... ...+. ........ ..... .....+...... ........++++.+| T Consensus 61 e~~~~~~~~--~~~~~~--~~~~~------~~~~~~~~------~~~~~---~~~~~~l~~~~~-~~~~~~~~~t~~~gg 120 (394) T protein:vir:10 61 EAENKANSD--PDKPVD--NAQPN------GTDLKKKP------IDAKK---KAINDFIHSHGK-VIDNAAGHVTSTEAG 120 (394) T ss_pred HHHHHhhcc--hhhhhh--hhccc------ccchhhhH------HHHHH---HHHHHHHhccch-hhhhhhcccccccCc Confidence 111100000 000000 00000 00000000 00000 000000000000 011122233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++...|++.+++.++|+++|+++++++++++||+....++.+.|++|++.+|+ ++++|++|++.+||++++++| T Consensus 121 ~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~i 200 (394) T protein:vir:10 121 VLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPL 200 (394) T ss_pred eeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehh Confidence 6677788999999999999999999999999999999988876677899999999996 679999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|+ +++++||.++|+++++.++|.+|++|+|++.+.++.+..+ T Consensus 201 S~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~------------------------------- 249 (394) T protein:vir:10 201 SEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTDTL------------------------------- 249 (394) T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccc------------------------------- Confidence 99999998 6899999999999999999999999999876655432210 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) .+.+...+.......+ .++|+|||.+|..|+++||++||||| T Consensus 250 ------------------------------------~d~l~~~~~~~~~~~~--~a~~vmn~~~~~~l~~lkd~~G~~i~ 291 (394) T protein:vir:10 250 ------------------------------------VDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLL 291 (394) T ss_pred ------------------------------------HHHHHHHHHhhhhhhc--cCEEEecHHHHHHHHHhhccCCCeee Confidence 1111111211111222 35799999999999999999999999 Q ss_pred cccccccccccccccccccccceeecCCC--Cc--C--cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PL--G--TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~--~~--~--~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) .+...... ....+.+|||+||++++++ |. + .++||||++ +|.++++.++++.++++.. |. ..||+. T Consensus 292 ~~~~~~~~--~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~-~~~~~~~~~~~v~~~~~~~--~~---~~~~~~ 363 (394) T protein:vir:10 292 HDASDSIT--DGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKR-GVLFADRQQVTLAWEDSKI--YG---RYLGAA 363 (394) T ss_pred eccccccc--cCCcccccccceeEEecccccCCCCCceEEEEeeccc-cEEEEeecceEEEEecccc--cc---eeEEEE Confidence 88764432 2345568999999987654 32 2 289999999 4779999999999887653 54 468999 Q ss_pred eeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +|+|++|++|+||+.+++++++.++ T Consensus 364 ~r~d~~~~~~~ai~~~~~~~~~~~~ 388 (394) T protein:vir:10 364 FRFGVKQADSNAGYFVTNTDAASGS 388 (394) T ss_pred EEeccEEeccccEEEEEeecccCCC Confidence 9999999999999999999999999 No 53 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=8e-58 Score=333.62 Aligned_cols=388 Identities=15% Similarity=0.138 Sum_probs=238.1 Q ss_pred CchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPST-AQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~-~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |-.- ..++++.+++.+.++++.+ +++.+++...+++...+.....+...+..++.++++.+++.++.++.+ T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e--------~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~ 72 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLS--------QRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAE 72 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4221 1123333333333333222 222121111111111111111122333444445555555555544443 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+..... ............. . ............. ..........+..+....... ........+...+ T Consensus 73 ~~~~~~~-l~~~~~~~~~~~~--~--~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 140 (397) T protein:vir:96 73 LQKEKQD-LEDELAKAADPTD--Q--KPKDGEKRKMKKF------KVTEEELAEKRSAINAFVKSK-GAEKRDGFTSVEG 140 (397) T ss_pred HHHHHHH-HHHHHHhhhhhhh--h--hhHHHHHHHHHHH------hhhhHHHHHHHHHHHHHHHhh-hhhhhhccccccc Confidence 3321111 0000000000000 0 0000000000000 000000011111111111111 1111222344455 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeeech Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~ 238 (497) +..+|+++...|++ +.+..+|+++|++++++++++.+|+....+..++|++|++..|+ ++++|++|++.+|+++++++ T Consensus 141 ~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~ 219 (397) T protein:vir:96 141 GALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIP 219 (397) T ss_pred ccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchh Confidence 56677777888876 57788999999999999999999988766677899999999986 68999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) +|+|||+|+ +++++||.++|+++++.++|.+|++|+|.+.|.|+.+. T Consensus 220 ~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~-------------------------------- 267 (397) T protein:vir:96 220 ISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSVVGV-------------------------------- 267 (397) T ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccch-------------------------------- Confidence 999999998 57999999999999999999999999998877665321 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) +++..++.....+++ ..+|+|||.+|..|+++||++|+|+ T Consensus 268 --------------------------------------d~~~~~~~~~~~~~~--~a~~v~n~~~~~~l~~lkd~~G~~~ 307 (397) T protein:vir:96 268 --------------------------------------DGLKDLINKEIKKVY--DVKLFISASMYSELDKLKDKNGRYL 307 (397) T ss_pred --------------------------------------HHHHHHHHHhhhhhc--CcEEEEcHHHHHHHHHhhccCCCeE Confidence 111111211112221 4579999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcC------cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) |.+.... +.+++|||+||++++++..+ .++||||++ +|.+++|.++++.++++. .+.+.||+ T Consensus 308 ~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~ 375 (397) T protein:vir:96 308 LQDSITA------ASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKA-FASFFDRKQVSVSWVDNN-----IYGQLLAG 375 (397) T ss_pred eccCccC------CCcccccccceEEecccccCCCCCceEEEEeehhc-ceEeEeecceEEEEeccc-----ccceeEEE Confidence 9876533 34568999999987665432 389999999 478999999999988754 33568999 Q ss_pred EeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ++|+|+.|+||+|||+|++++| T Consensus 376 ~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 376 IIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEEccEEecccceEEEEeecC Confidence 9999999999999999999999 No 54 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=2.8e-58 Score=336.15 Aligned_cols=377 Identities=16% Similarity=0.133 Sum_probs=239.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++..++.+..+++++..+ ++....++.. ++..++..+++++.++++++++++... T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~-------~~~~~~~~~~-------------e~~~~l~~ei~~~~~~~~~l~~~~~~~ 60 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLN-------AKLQDENASV-------------DDFQKIKDDLTAAKARRDAINDQIKAL 60 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHH-------HHHHhHhhhH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65555555544433333222111 1110000011 111122233333333333333333322 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhh-hhhhhhhhcccccC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAP-AAIGQNPFGSTGTF 159 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 159 (497) +........ ... ... .......... ...... +..+....+.. ........++++.+ T Consensus 61 ~~~~~~~~~-~~~----~~~---~~~~~~~~~~---------~~~~~~------~~~~~~~lr~~~~~~~~~~~~t~~~g 117 (389) T protein:vir:10 61 EAEKPAEPK-TEP----KDD---GSKKGTDLSK---------KPIDAK------KKAINDFIHSHGKVIDATSKVTSTEA 117 (389) T ss_pred HHHHHhhhh-ccc----ccc---ccccccccch---------hHHHHH------HHHHHHHhhcchhhhhhhcccccCCc Confidence 211100000 000 000 0000000000 000000 00000000111 11122233455555 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccc-ccccceeEEeeeeeeeeech Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALT 238 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~~~~kia~~~~ 238 (497) |.+||+++...|++.+++.++|+++|+++++++++++||+.+..++.+.|++|++.+|. ++++|+++++.+||++++++ T Consensus 118 g~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~ 197 (389) T protein:vir:10 118 GVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIP 197 (389) T ss_pred ceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeeh Confidence 66777888999999999999999999999999999999998876677889999999885 78999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) +|+|||+|+ +++++||.++|+++++.++|.+|++|.|++.+.+.....+ T Consensus 198 iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~------------------------------ 247 (389) T protein:vir:10 198 LSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKKTTTDTL------------------------------ 247 (389) T ss_pred hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccc------------------------------ Confidence 999999997 5899999999999999999999999988766554322110 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) .+.+...+.....+.+ .++|+|||.+|..|+++||++|+|| T Consensus 248 -------------------------------------~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i 288 (389) T protein:vir:10 248 -------------------------------------VDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYL 288 (389) T ss_pred -------------------------------------HHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCee Confidence 1111111111111111 3579999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCC-CcC-----cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLI-PLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~-~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) |++...... ....+++|||+||++++++ +.. .++||||++ +|.+++|++++|.++++.. |. ..||+ T Consensus 289 ~~~~~~~~~--~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~--~~---~~~~~ 360 (389) T protein:vir:10 289 LHDASDSIT--DGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKR-GVLFTDRQQVTLAWEDSKI--YG---KYLGA 360 (389) T ss_pred eecCccccc--ccccccccccceeEEecccccCCCCCceEEEEeeccc-cEEEEeecceEEEeecccc--cc---ceEEE Confidence 987654322 2345678999999886554 322 279999998 4789999999999988653 44 46899 Q ss_pred EeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .+|+|++|++|+||+++++++++.++ T Consensus 361 ~~r~d~~~~~~~a~~~~~~~~~~~~~ 386 (389) T protein:vir:10 361 AFRFGVQKADSKAGYFVTNTDVPGSA 386 (389) T ss_pred EEEeccEEecccceEEEEeeccCCCC Confidence 99999999999999999999999888 No 55 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=2.8e-59 Score=341.58 Aligned_cols=302 Identities=16% Similarity=0.167 Sum_probs=240.8 Q ss_pred hhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccccccc Q lcl|Aclame:pro 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~ 222 (497) ......+.....+++++|+++||++..+|++.+++.++|++++++++++++.++||+.++. +.+.|++||+.+|+++++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~ 79 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGA-VSASWTGEAERKPITKGS 79 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCC-cceeEecCCCccccccce Confidence 1122223333455677888999999999999999999999999999999989999999874 679999999999999999 Q ss_pred ceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhH Q lcl|Aclame:pro 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 223 ~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~ 300 (497) |+++++.+||++++++||+|||+|+ +++++||.++|+++++.++|.+||+|+|+++ |.||++............. T Consensus 80 f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~--- 156 (330) T protein:vir:77 80 FGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL--- 156 (330) T ss_pred eeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc--- Confidence 9999999999999999999999997 6899999999999999999999999999975 6788876543322211100 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ........+..+++..++..+... +..+..|+||| T Consensus 157 --------------------------------------------~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~vmn~ 191 (330) T protein:vir:77 157 --------------------------------------------TTASGPQGNAYLAVNNALSLLVNS-GKKWTGTLLDN 191 (330) T ss_pred --------------------------------------------cccccccchhHHHHHHHHHhhhhc-CCCccEEEEcH Confidence 001111223344455555544433 45667899999 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc------EEEeeccceEEEEEeccccEEEE Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) .+|..|+++||++|+|+|.+....+... .....+|+|+||++++++|++. +++|||++ |.++++.+++|++ T Consensus 192 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~-~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~--~~i~~~~~~~i~~ 268 (330) T protein:vir:77 192 VTEPILNTAVDGNGRPLFVESTYTEQVG-AIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQ--VIWGQIGGLSFDV 268 (330) T ss_pred HHHHHHHHHhccCCceeecCcccccccc-ccCCceecceeeEEeccccCCCCCCccEEEEEecce--EEEEEecCcEEEE Confidence 9999999999999999998865443322 2345689999999999999764 89999998 4588999999998 Q ss_pred eccch----------------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 455 TNSNG----------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 455 ~~~~~----------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +++.. ++|++|+++||+++|+|+.|.||+||++|+.++ +.+. T Consensus 269 ~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~-~~~~ 326 (330) T protein:vir:77 269 TDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQV-AGTD 326 (330) T ss_pred eecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEecc-CCcC Confidence 77642 679999999999999999999999999998766 4444 No 56 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=2.2e-59 Score=342.13 Aligned_cols=311 Identities=11% Similarity=0.067 Sum_probs=239.6 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcc Q lcl|Aclame:pro 127 PGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) .+....+... +......+....+++.+|++||+++..+||+.+++.++|++++++++++++.+++|+.++. +. T Consensus 1 ~~~~~~r~~~------~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~-~~ 73 (326) T protein:vir:42 1 MAVNPDRTTP------FLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGD-VS 73 (326) T ss_pred CCCCccchhh------hcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCC-cc Confidence 0000000000 0011122333345566778999999999999999999999999999999999999999874 78 Q ss_pred ceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceecc Q lcl|Aclame:pro 207 AAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQR 285 (497) Q Consensus 207 a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~ 285 (497) ++||+||+.+|+++++|+++++.++|+++++++|+|+++|+ +++++||.++|++++++++|.++|+|+|+++|.||++. T Consensus 74 a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~ 153 (326) T protein:vir:42 74 ASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQT 153 (326) T ss_pred eEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccc Confidence 99999999999999999999999999999999999999997 68999999999999999999999999999999999876 Q ss_pred ccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhh Q lcl|Aclame:pro 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) ............... ......+......... T Consensus 154 ~~~~~~~~~~~~~~~-------------------------------------------------~~~~~~~~~~~~~~~~ 184 (326) T protein:vir:42 154 TKEVSLVDPDGTGSN-------------------------------------------------ADLTVYDAVAVNALSL 184 (326) T ss_pred ccccceeeccccccc-------------------------------------------------ccchhHHHHHHHHHhh Confidence 554332221100000 0000111111112222 Q ss_pred hhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcE--EEeeccceEEE Q lcl|Aclame:pro 366 QLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI--LVGHFAPSVIQ 443 (497) Q Consensus 366 ~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~--~~gd~~~~~~~ 443 (497) ....+...+.|+|||.++..|+++||++|+|||.+....+.... ....+++|+||+.++++|+++. ++|||++. . T Consensus 185 ~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~-~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~--~ 261 (326) T protein:vir:42 185 LVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSP-FRLGRIVARPTILSDHVASGTVVGYQGDFRQL--V 261 (326) T ss_pred hhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCcccc-ccCceeeeeeEEEcCCCCCCceEEEEeecceE--E Confidence 33445567789999999999999999999999988654433222 2345899999999999999874 67999984 3 Q ss_pred EEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 444 TARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 444 i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) ++++.+++|+++++.. ++|++|+++||+++|+|++|.||+||++|+.++++++ T Consensus 262 ~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 262 WGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred EEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 6788999998877643 5699999999999999999999999999999999888 No 57 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2.5e-57 Score=330.88 Aligned_cols=425 Identities=15% Similarity=0.131 Sum_probs=237.3 Q ss_pred Cch----HHHHHHHHHHHH------HHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH Q lcl|Aclame:pro 1 MPS----TAQLEAQGRQLA------KSIKDINA------DETKTAAEKKEALAKIEPDFKAHQAEVE--AHERAQEMLKS 62 (497) Q Consensus 1 m~~----~~~~~~~~~~l~------~~~~~~~~------~~~~~~~e~~~~~~~~~~~~~~~~~~~e--~~e~~~e~~~~ 62 (497) |+. ....+.....+. ...++... ...+.+.++++...++.++++++..... .+....+..++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~ 234 (645) T protein:vir:93 155 YDRQFSAASGNRKPVVKIASSAGAAAQSTTVFHKEKTIMNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEH 234 (645) T ss_pred ccchhhhhhhhhcchhhhhhhhcchhhccccccccccccchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHH Confidence 111 000000000000 00000000 0011122233333333333222211111 11112344566 Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHH--HHHhhhhhhHH-------HhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|Aclame:pro 63 LGGADAAKDGLDNDIPEVEVRNLKQIRK--HLARAVIMNPE-------LKNATSFEKGTKFDVSFNVSAKAADPGTAAAE 133 (497) Q Consensus 63 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (497) ++.+.+++++++.++.+.+......... ........... ........++..+......-...........+ T Consensus 235 ~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e 314 (645) T protein:vir:93 235 YDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALE 314 (645) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHH Confidence 6777777777776665554322111110 00000000000 00000011111111111000000000000000 Q ss_pred H-HHHHHh-hhhhhh---hhhhhhhcccccCCcccc-cchhhHHHHHHHhhhhHHhhccceecC----CCceEEEEeecC Q lcl|Aclame:pro 134 L-MGAFAD-GETAPA---AIGQNPFGSTGTFAPGIL-PTFLPGIVEQLFYELSLADLISSRPVT----SPNLSYLTESAA 203 (497) Q Consensus 134 ~-~~~~~~-~~~~~~---~~~~~~~~~~~~~g~~i~-~~~~~~ii~~~~~~~~l~~~~~~~~~~----~~~~~~p~~~~~ 203 (497) . +..... ...... .......++++.+|++++ +++..+||+.+++.+++++++....++ .+.+++|+++++ T Consensus 315 ~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~ 394 (645) T protein:vir:93 315 VARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSG 394 (645) T ss_pred HHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecC Confidence 0 111000 011111 111112223333456554 556778999999999999987654332 235899999885 Q ss_pred CccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc----c Q lcl|Aclame:pro 204 HNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP----G 278 (497) Q Consensus 204 ~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~----~ 278 (497) +.++||+||+.+|+++++|+++++++|||++++++|+|||+|+ +++++||+++|+++++.++|.+||+|+|++ . T Consensus 395 -~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~ 473 (645) T protein:vir:93 395 -GAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVS 473 (645) T ss_pred -cceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCcc Confidence 7899999999999999999999999999999999999999987 799999999999999999999999998765 3 Q ss_pred ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhh Q lcl|Aclame:pro 279 VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENV 358 (497) Q Consensus 279 ~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (497) |.|+++......... ....++ T Consensus 474 p~gi~~~~~~~~~~~-----------------------------------------------------------~~~~d~ 494 (645) T protein:vir:93 474 PASITHDVKGTASSG-----------------------------------------------------------NPDADA 494 (645) T ss_pred ccceecccccccccc-----------------------------------------------------------chHHHH Confidence 777765322111000 000111 Q ss_pred HHhhhhhhhhh-ccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeec Q lcl|Aclame:pro 359 FDAFVDIQLTL-FQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHF 437 (497) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~ 437 (497) ..++..+..+. .....+|+|||.++..|+++||++|+|+|.... ...++|+|+||++|++||+ .+++||| T Consensus 495 ~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~--------~~~~tL~G~PV~~s~~vp~-~~~~gd~ 565 (645) T protein:vir:93 495 EAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMT--------LLGGSFQGLPVIVSQYVGD-QLVLVNA 565 (645) T ss_pred HHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCC--------CCCceeeceeeEEeccCCc-ceeEecc Confidence 12222222222 223457999999999999999999999984321 1235899999999999986 4678999 Q ss_pred cceEEEEEeccccEEEEeccch--------------------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 438 APSVIQTARREGVTMQMTNSNG--------------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 438 ~~~~~~i~~r~~~~i~~~~~~~--------------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.. .++++.++.|.++++.. ++|++|+|+||+++|+||.++||+||++|+ .+.=.++ T Consensus 566 s~~--~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt-~~~~g~~ 642 (645) T protein:vir:93 566 PDI--YLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVIT-GVNYGSA 642 (645) T ss_pred ccE--EEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEe-cccCCcc Confidence 974 45667777777655432 469999999999999999999999999987 2222222 No 58 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=4.9e-57 Score=329.30 Aligned_cols=379 Identities=13% Similarity=0.104 Sum_probs=235.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |....+++ ++.++++++.++......+.+ ....+.+ .+..+++.+++++++++++.++..+... T Consensus 1 Mn~~e~lk----el~~~~~el~~~~~~~~~~~~-----------~~~~e~~-~~e~~~~~~e~~~l~~~i~~~~~~~~~~ 64 (421) T protein:vir:13 1 MNLFERLK----ELRAKKKELEEKRCGIVEEIR-----------SLAKEKK-EEEARSKALEREKIEARMEIIEEEIESV 64 (421) T ss_pred CCHHHHHH----HHHHHHHHHHHHHHHHHHHHH-----------HHhhccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54333322 222223322222221111121 1111100 0111222223333333333332222111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhh-hhhhhhhcccccC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPA-AIGQNPFGSTGTF 159 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 159 (497) ........ +. ... ... ........ ..... ...+..+........ ........+++.+ T Consensus 65 ~~~~~~~~-~~----~~~---~~~-~~~~~~~~-----~~~~~--------~~~~~~~~~~~~~~~~~~~~ra~~t~~~g 122 (421) T protein:vir:13 65 MTAIDEER-KN----TNF---TGG-RVIINGDS-----KEEKR--------SLQLSAMSKTIRGIQLSEEERDIMSSTNN 122 (421) T ss_pred HHHHHHHH-hh----hcc---ccc-ccccccch-----hHHHH--------HHHHHHHHHhhhccchhHHHhhccccCCc Confidence 11000000 00 000 000 00000000 00000 000011111100000 0011112334445 Q ss_pred CcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCC-ccceeeccccccccccccceeEEeeeeeeeeech Q lcl|Aclame:pro 160 APGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAH-NNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 160 g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~-~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~ 238 (497) |.+||+++...|++.+++.++|+++|+++++++++++||+.+... ..+.|++|++.+|.++++|+.|++.++|++++++ T Consensus 123 g~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~ 202 (421) T protein:vir:13 123 GAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAP 202 (421) T ss_pred ceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehh Confidence 667777888999999999999999999999999999999887643 3467899999999999999999999999999999 Q ss_pred hhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchh Q lcl|Aclame:pro 239 ITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNG 317 (497) Q Consensus 239 iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (497) ||+|||+|+ +++++||.++|++++..++|.++++ .|+||++.++.. T Consensus 203 iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~-----~~~g~~~~~~~~---------------------------- 249 (421) T protein:vir:13 203 IDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVK-----QAKAVLAEETIN---------------------------- 249 (421) T ss_pred hhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhh-----hhhhcccccccc---------------------------- Confidence 999999998 5799999999999999999877763 577776543211 Q ss_pred hhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccc Q lcl|Aclame:pro 318 AFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYM 397 (497) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~ 397 (497) ..+++..++..+..+ +..+..|+|||.+|..|+++||++|+|| T Consensus 250 ------------------------------------~~d~i~~~~~~l~~~-~~~~a~~v~n~~~~~~l~~lkd~~G~~i 292 (421) T protein:vir:13 250 ------------------------------------DYAGLVKTINSLVPN-ARKRAIIVTNSDGRAYLDGLMDKQGRPL 292 (421) T ss_pred ------------------------------------chHHHHHHHHHhhhh-hcCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 123445555555444 4456789999999999999999999999 Q ss_pred ccccccccccccccccccccccceeecCCCCcC-----cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-----TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~-----~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |.++. .+.+++|||+||++++++|.+ .++||||+. +|.+++|.+++|+++++. +|.+|+++||++ T Consensus 293 ~~~~~-------~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~v~~~~~~--~f~~~~~~~r~~ 362 (421) T protein:vir:13 293 LKELS-------DGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKT-LIKFMDRKQYLIDQSKEA--GYTKNETIARII 362 (421) T ss_pred ecCcC-------CCCCceecceeeEEeccccccCCCceEEEEEeccc-cEEEEEecceEEEeeccc--ccccCeeEEEEE Confidence 97743 234568999999999999854 379999998 578999999999999876 599999999999 Q ss_pred eeeccEeecccceEEEEecC---------CCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKK---------GATGS 497 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~---------~a~~~ 497 (497) .|+|+.+++|+||+.+.... ++.+| T Consensus 363 ~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~ 396 (421) T protein:vir:13 363 ERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSS 396 (421) T ss_pred eeecceeecchhhheeeecccceeeccccccCCC Confidence 99999999999976544331 11122 No 59 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.1e-58 Score=338.43 Aligned_cols=302 Identities=12% Similarity=0.099 Sum_probs=240.5 Q ss_pred HHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccc Q lcl|Aclame:pro 137 AFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTY 216 (497) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~ 216 (497) ...+... ......+..++++++|++||+++..+||+.+++.++|++++++++++++.++||+.++ .+.++|++||+.+ T Consensus 1 ~~~~~~~-~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~ 78 (318) T protein:vir:24 1 MAAGTAF-AVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVG-DVSAQWIGEGDMK 78 (318) T ss_pred CCCCCCC-CHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CcceEEecCCccc Confidence 0001000 1122233445566778889999999999999999999999999999999999999987 4789999999999 Q ss_pred ccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhh Q lcl|Aclame:pro 217 PFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 217 ~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~ 295 (497) |+++++|+++++.+||+++++++|+|+|+|+ +++++||.++|++++++++|.+|++|+|++.|.|++............ T Consensus 79 ~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~ 158 (318) T protein:vir:24 79 PITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTT 158 (318) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccc Confidence 9999999999999999999999999999987 589999999999999999999999999999999998765433222111 Q ss_pred hhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce Q lcl|Aclame:pro 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA 375 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (497) .. .....+.+..+.. .....+..+.. T Consensus 159 ~~-----------------------------------------------------~~~~~~~~~~~~~-~~~~~~~~~~~ 184 (318) T protein:vir:24 159 GA-----------------------------------------------------TTVYDQVAVNGLS-LLVNDGKKWTH 184 (318) T ss_pred cc-----------------------------------------------------cchHHHHHHHHHH-hhccccCCCCE Confidence 00 0000111122222 22344566778 Q ss_pred EEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc--EEEeeccceEEEEEeccccEEE Q lcl|Aclame:pro 376 VVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQ 453 (497) Q Consensus 376 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~--~~~gd~~~~~~~i~~r~~~~i~ 453 (497) |+|||.++..|+++||++|+|||.+....+.... .....++|+||+.++++|+++ +++|||+. +.++++.+++|+ T Consensus 185 ~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~-~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~--~~~~~~~~l~i~ 261 (318) T protein:vir:24 185 TLLDDITEPILNGAKDQNGRPLFIESTYGEAASP-FRSGRIVARPTILSDHVVEGTTVGFMGDFSQ--LIWGQIGGLSFD 261 (318) T ss_pred EEEcHHHHHHHHHhhccCCceeecCccccCcccc-ccCceEEEEeeEEeCCCCCCccEEEEeecce--EEEEEecCeEEE Confidence 9999999999999999999999988665543332 223579999999999999876 57899997 447788999999 Q ss_pred Eeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 454 MTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 454 ~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++++.. ++|++|++.||+++|+|+.|.+|+||++|+.++++.|+ T Consensus 262 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~ 317 (318) T protein:vir:24 262 VTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGE 317 (318) T ss_pred EeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCC Confidence 887643 56999999999999999999999999999999999888 No 60 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1.7e-58 Score=337.33 Aligned_cols=305 Identities=13% Similarity=0.093 Sum_probs=240.2 Q ss_pred HHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccc Q lcl|Aclame:pro 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~ 217 (497) .............+..++++.+|++||+++..+||+.+++.++|+++++++++++++++||+.++ .+.++||+|++.+| T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIG-DVSAQWIGEGDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC-CcceEEecCCcccc Confidence 00011111122334455667777899999999999999999999999999999999999999886 46799999999999 Q ss_pred cccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhh Q lcl|Aclame:pro 218 FSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) Q Consensus 218 ~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~ 296 (497) +++++|+++++.+||++++++||+|+|+|+ +++++||.+.|++++++++|.+||+|+|++.|.++.............. T Consensus 80 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (320) T protein:vir:10 80 ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGG 159 (320) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceeccc Confidence 999999999999999999999999999986 6899999999999999999999999999998888765543332221110 Q ss_pred hhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceE Q lcl|Aclame:pro 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) ... ......+..+..........+..+.+| T Consensus 160 ~~~--------------------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (320) T protein:vir:10 160 ATA--------------------------------------------------SDLTAYDAVAVNGLSLLVNAKKKWTHT 189 (320) T ss_pred ccc--------------------------------------------------cccccHHHHHHHHHhhhhcccCCCcEE Confidence 000 000011111222222233455667899 Q ss_pred EEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc--EEEeeccceEEEEEeccccEEEE Q lcl|Aclame:pro 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQM 454 (497) Q Consensus 377 ~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~--~~~gd~~~~~~~i~~r~~~~i~~ 454 (497) +|||.+|..|+++||++|+|+|.+....+.... ....+++|+||+.++++|+++ +++|||+. +.++++.+++|++ T Consensus 190 v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~-~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~ 266 (320) T protein:vir:10 190 LLDDIVEPILNGAKDKNGRPLFIESTYTDENSP-FRAGRIVSRPTILSDHVADGTTVGYMGDFRN--VIWGQVGGLSFDV 266 (320) T ss_pred EEcHHHHHHHHHhhccCCceeeccccccCcccc-ccCceeeeeeeEecCCCCCCceEEEEeecce--EEEEEecCeEEEE Confidence 999999999999999999999987655443332 334579999999999999987 57899997 4478899999998 Q ss_pred eccch------------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 455 TNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 455 ~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) +++.. ++|++|+++||+++|+|++|.+|+||++|+-.++.++ T Consensus 267 ~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 267 TDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred eecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 87654 5799999999999999999999999999998888888 No 61 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=2e-58 Score=336.89 Aligned_cols=279 Identities=15% Similarity=0.110 Sum_probs=224.0 Q ss_pred hhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeee Q lcl|Aclame:pro 152 PFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVG 231 (497) Q Consensus 152 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~ 231 (497) +..++.++|.+|||++..+||+.+++.++|++++++++++++.+++|+.++. +.|+||+||+.+|+++++|+++++++| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 79 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFD-SDIDIVAENGKKTHGGVSLDPVTIVPL 79 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecC-cceEEeeCCcccccccccceeeEeeeE Confidence 5566667788999999999999999999999999999999999999999874 789999999999999999999999999 Q ss_pred eeeeechhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCcc--cc---ceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 232 KVANALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGGYPG--VN---GLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 232 kia~~~~iS~ell~---d-s~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~--~~---Gil~~~~~~~~~~~~~~~~~~~ 302 (497) |++++++||+|||+ | .++++++|.++|++++++++|.++++|++.+. +. |.....+..+.... T Consensus 80 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-------- 151 (300) T protein:vir:95 80 KVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVP-------- 151 (300) T ss_pred EEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeec-------- Confidence 99999999999994 4 47899999999999999999999999965432 22 22222111110000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhH Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) .......+.+..++..+.. .+..+++|+|||.+ T Consensus 152 ----------------------------------------------~~~~~~~~~i~~~~~~~~~-~~~~~~~~vmn~~~ 184 (300) T protein:vir:95 152 ----------------------------------------------FKDTNPDESMEDAVGMIDG-SERDITGAILDPIF 184 (300) T ss_pred ----------------------------------------------ccccchHHHHHHHHHHhhh-cCCCccEEEECHHH Confidence 0011122344444444433 45667789999999 Q ss_pred HHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc------EEEeeccceEEEEEeccccEEEEec Q lcl|Aclame:pro 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 383 ~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) +..|+++||++|+|||.+.... ..+++|||+||++++++|.+. +++|||+++ +.+..|+++++++++ T Consensus 185 ~~~L~~lkd~~G~~i~~~~~~~------~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~-~~~~~~~~~~~~v~~ 257 (300) T protein:vir:95 185 TTALSKMKNAEGGKLYPELAWG------GVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFETM-FKWGYAKEVPMEIIK 257 (300) T ss_pred HHHHHHhhccCCCeeccCcccc------CCCceecceeeEEecCCCCCCCCCccEEEEeeccce-EEEEEecccEEEEee Confidence 9999999999999999765432 345689999999999998653 788999985 456678999999887 Q ss_pred cch------hhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 457 SNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 457 ~~~------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) +.. ++|++|+++||+++|+||.|.||+||++|+-++. T Consensus 258 ~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 258 YGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 543 4699999999999999999999999999987777 No 62 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.4e-58 Score=337.75 Aligned_cols=280 Identities=21% Similarity=0.247 Sum_probs=235.5 Q ss_pred hhhh--hhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 148 IGQN--PFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 148 ~~~~--~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) +..+ ...+++.+|++||+++..+||+.+++.++|++++++++++++..++|+.+. +.++||+|++.+|+++++|++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--~~a~~v~E~~~~~~~~~~f~~ 78 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG--VGAFWVDEAERIQTSKPTFTK 78 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC--CceeeeecCccccccccceeE Confidence 2222 233445567788889999999999999999999999999999999998764 568999999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++.++|++++++||+|+++|+ +++++||.+.|++++++++|.+|++|+|+++|.||++........... T Consensus 79 v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~--------- 149 (299) T protein:vir:41 79 AKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEE--------- 149 (299) T ss_pred EEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeecc--------- Confidence 9999999999999999999997 689999999999999999999999999999999999765433221110 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+++..++..+...+ ..++.|+|||.++. T Consensus 150 ----------------------------------------------~~~~~~~l~~~~~~l~~~~-~~~~~~v~n~~~~~ 182 (299) T protein:vir:41 150 ----------------------------------------------TANKYDDLNEAIGLIEAED-LEPNGIATIRKQRV 182 (299) T ss_pred ----------------------------------------------ccccHHHHHHHHHhhhccc-CCcCEEEEcHHHHH Confidence 0112345556666555444 45678999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc----EEEeeccceEEEEEeccccEEEEeccch- Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT----ILVGHFAPSVIQTARREGVTMQMTNSNG- 459 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~----~~~gd~~~~~~~i~~r~~~~i~~~~~~~- 459 (497) .|+++||++|+|+|.+... ++.++|+|+||+.++++|++. ++||||+. |.+++|++++++++++.. T Consensus 183 ~L~~lkd~~G~~l~~~~~~-------~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~--~~i~~~~~~~i~~~~~~~~ 253 (299) T protein:vir:41 183 KYRSTKDGNGMPIFNTATS-------NGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQ--AYYGILRGVEYEILTEATL 253 (299) T ss_pred HHHHhhccCCceeecCCcC-------CCCceecceeeEEecccCCCCCceEEEEEeccc--EEEEEecCcEEEEeecccc Confidence 9999999999999987543 334589999999999999887 99999997 458899999999887643 Q ss_pred -----------hhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 460 -----------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 460 -----------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) ++|++|+++||+++|+|+++++|+||++++.+++- T Consensus 254 ~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 254 TTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred cccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 46999999999999999999999999999988887 No 63 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=8e-58 Score=333.60 Aligned_cols=365 Identities=14% Similarity=0.037 Sum_probs=234.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |....+...+.++.. +++.++.... ...++..+.+++ ..+.+..++... T Consensus 1 M~i~~k~~~~~~~~~---~~l~~~~~~~-------------------------~~~ee~~~~~~~---~~~~~~~~~~~~ 49 (377) T protein:vir:98 1 MAINLKELPKYREAV---AELSAKISAG-------------------------ATSEEQEKLFEA---AFTTMGDEILAK 49 (377) T ss_pred CCCcHHHHHHHHHHH---HHHHHHHHhh-------------------------hhhHHHHHHHHH---HHHhHHHHHHHH Confidence 544332222222211 1111100000 000000000100 011111111000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) . +.+.... .... ...... ..+.+..+.. ....++..++| T Consensus 50 ~-------~~e~~~~-------~~~~---~~~~~l---------------t~ee~~~~~~---------~~~~~~~~~gg 88 (377) T protein:vir:98 50 N-------EEEMERM-------FDLR---DKNREL---------------TAEEIKFFND---------IDKNVGGKDKF 88 (377) T ss_pred H-------HHHHHHH-------HHhc---cCCccc---------------CHHHHHHHHH---------HHhccCCCCCc Confidence 0 0000000 0000 000000 0000111100 11123444556 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccc-cccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~-~s~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++...|++.+.+.++|+++|++.+++++ .++|+.++ .+.+.|++|++..+ +++|+|+++++.+||++++++| T Consensus 89 ~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~~~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~i 166 (377) T protein:vir:98 89 KLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVI 166 (377) T ss_pred cccCHHHHHHHHHHHHHhhhhhhheeeEecCcc-eEEEEecC-CcceeEeecccccCcccCccceeEeecceeEEeeecc Confidence 678888999999999999999999999998764 79999876 57899999987765 6789999999999999999999 Q ss_pred hHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |+|||+|++ ++++||.++|++++++++|.+|++|+|+++|.||++..+..+........... T Consensus 167 s~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~----------------- 229 (377) T protein:vir:98 167 PKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITT----------------- 229 (377) T ss_pred cHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccc----------------- Confidence 999999985 89999999999999999999999999999999999865443332211100000 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) .....+.+......+... +.....|+||+.++..++++||.+|+|+| T Consensus 230 --------------------------------~~~~~~~~~~l~~~~~~~-~~~~a~~~m~~~t~~~~~klkd~~G~~i~ 276 (377) T protein:vir:98 230 --------------------------------YKTDKEAIADLSDLTPDN-APKKLVPVMKHLSVNDKKRPLKIAGQVKL 276 (377) T ss_pred --------------------------------ccchhhhHhhhhhhchhH-HHHHHHHHHHHHHHHHHhhhhccCCceEE Confidence 000011122222222222 23344799999999999999999999999 Q ss_pred cccccc--------cccccccccccccccc--eeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceE Q lcl|Aclame:pro 399 GNFFGN--------AYGNPVNGGKNIWGVP--VVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT 468 (497) Q Consensus 399 ~~~~~~--------~~~~~~~~~~~l~G~p--vv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~ 468 (497) ...+.. ......+.+.+++|+| |+.++++|+++++||||+. |.|++|.+++|+++++. .|.+|++. T Consensus 277 ~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~--Y~i~~r~~~~i~~~~~~--~~~~d~~~ 352 (377) T protein:vir:98 277 ILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQT--FAMEDLQL 352 (377) T ss_pred EecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecc--eeEEeecceEEEeechh--hhhcCceE Confidence 533221 1111223345788888 6789999999999999997 88999999999998765 59999999 Q ss_pred EEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 469 VRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 469 ~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ||+..|+|+.++||+||++++++-. T Consensus 353 f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEEEEcCEEeccCcEEEEEEecC Confidence 9999999999999999999999888 No 64 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=5.6e-58 Score=334.46 Aligned_cols=287 Identities=16% Similarity=0.103 Sum_probs=224.3 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ++.+++.++|.++|+++..+||+.+++.++|++++++++++++.++||+.++ .+.|+||+||+.+|+++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKPSASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeCCccccccccceeeeEeee Confidence 5566667777888889999999999999999999999999999999999987 478999999999999999999999999 Q ss_pred eeeeeechhhHHHHhhH-HH----HHHHHHHHHHHHHHHHHHhhhhccCCCcc---ccceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 231 GKVANALTITDEGLRDA-PE----LFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 231 ~kia~~~~iS~ell~ds-~~----l~~~i~~~la~~~~~~~d~a~l~G~g~~~---~~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) ||++++++||+||++|+ .+ ++++|.++|++++++++|.+|++|+|.+. +.|+.+.....+... T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~--------- 150 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIV--------- 150 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccccccee--------- Confidence 99999999999999765 22 88999999999999999999999987432 333332211110000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhH Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) ........++..++..+....+...++|+|||.+ T Consensus 151 ----------------------------------------------~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~ 184 (315) T protein:vir:80 151 ----------------------------------------------DATDSATADLVKAVGLIAGAGLQVPNGVALDPAF 184 (315) T ss_pred ----------------------------------------------eccccchHHHHHHHHHHhhccCccceEEEEcHHH Confidence 0001112333444444444444556689999999 Q ss_pred HHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC---------cEEEeeccceEEEEEeccccEEE Q lcl|Aclame:pro 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG---------TILVGHFAPSVIQTARREGVTMQ 453 (497) Q Consensus 383 ~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~---------~~~~gd~~~~~~~i~~r~~~~i~ 453 (497) +..|+++||.+|++.+....... ...+.+.+|+|+||+++++||++ .+++|||++.+ +..+.+++|+ T Consensus 185 ~~~L~~l~~~~g~~~~g~~~~~~--~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~--~g~~~~~~i~ 260 (315) T protein:vir:80 185 SFALSTEVYPKGSPLAGQPMYPA--AGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVH--WGFQRNFPIE 260 (315) T ss_pred HHHHHHHhhccCCcccccccccc--cccCCCceecceeeEecCcCCcccccccccccEEEEeecccEE--EEEecCeeEE Confidence 99999999887766544332211 12334568999999999999864 37899999854 4557888888 Q ss_pred Eeccch------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 454 MTNSNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 454 ~~~~~~------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++++.. ++|++|+++||+++|+||+|+||+||++|+.+++.+.+ T Consensus 261 i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~ 310 (315) T protein:vir:80 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) T ss_pred EeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCC Confidence 876532 57999999999999999999999999999999999988 No 65 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=4.3e-57 Score=329.62 Aligned_cols=383 Identities=15% Similarity=0.119 Sum_probs=241.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+++.++++++.++.++++++.+. +.++........+++++ +..+++.++++++.++.+++.. T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~e----l~~~~~~~~~~~ee~~~-------------~~~~~~~l~~~~~~l~~~~~~~ 78 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDE----LSQKATDPNIDMEDIKQ-------------LETEKAGLQQRFNIVERQVQDI 78 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHH----HHHHHhccCcCHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 999999998888777666665432 22222111111111221 1122222222333332222222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +......... ... .......... ....+....+ .... ...... ..............+++.++| T Consensus 79 e~~~~~~~~~-~~~--~~~~~~~~~~---~~~~~~~~~r----~~~~---~~~~~~---~~~~~~~~~~a~~~~t~~~GG 142 (402) T protein:vir:93 79 EEKEKAKVKD-KGE--AYQSLSDNEK---MVKAKAEFYR----HAIL---PNEFEK---PSMEAQRLLHALPTGNDSGGD 142 (402) T ss_pred HHHHHhhhhh-ccc--cCCCCchhHH---HHHHHHHHHH----HHHh---hhhHHH---HHHhHHHHHhhhccCCCcCCc Confidence 1111000000 000 0000000000 0000000000 0000 000000 000111111222333444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) .+||+++...||+.+++.++|+++|+++++++ ..+|+.+...+.+.|++||+..++++|+|+++++.+||++++++|| T Consensus 143 ~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS 220 (402) T protein:vir:93 143 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 220 (402) T ss_pred cccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeecceeeeeechhh Confidence 66777888999999999999999999998865 4577765544678999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~ds-~~l~~~i~~~la~~~~~~~d~-a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ +++++||.++|+++++.+++. .|.+|+|+++|.|++...+....+. T Consensus 221 ~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~------------------------- 275 (402) T protein:vir:93 221 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 275 (402) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999765 5778999999999987644322111 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) ....+.+..+++.+...+ .....|+||+.++..+..+++..|+|+| T Consensus 276 ---------------------------------~~~~d~l~~~~~~l~~~y-~~na~~imn~~t~~~~~~~~~d~~~~~~ 321 (402) T protein:vir:93 276 ---------------------------------ADMYDAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNGTTNFF 321 (402) T ss_pred ---------------------------------cchHHHHHHHHhccChhh-hcCCEEEEechHHHHHHHHHhcCCCccc Confidence 012345556666555544 4466899999999888777777777887 Q ss_pred cccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccE Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+.+ |.+++ .+.++..++. .++++.|++..|+|++ T Consensus 322 ~~~-----------~~~llG~PV~~t~~~~--~i~~GDf~~~-~~~~~--~~~~~~~~~~----~~~~~~~~~~~r~Dg~ 381 (402) T protein:vir:93 322 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINYD--GTTYDTDKDV----KKGEYLFVLTAWYDQQ 381 (402) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhhh--hhhhhhhhcc----cCCceEEEEEEEeCcE Confidence 542 3479999999999875 5899999984 55544 3445544433 2699999999999999 Q ss_pred eecccceEEEEecCCCCCC Q lcl|Aclame:pro 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~Af~~~~~~~~a~~~ 497 (497) |++|+||+++++++++.++ T Consensus 382 v~~~~A~~~l~ik~~~~~~ 400 (402) T protein:vir:93 382 RTLDSAFRIAKAKENTGPL 400 (402) T ss_pred EechhheEEEEeecCCCCC Confidence 9999999999999987777 No 66 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.3e-56 Score=325.59 Aligned_cols=415 Identities=15% Similarity=0.128 Sum_probs=230.7 Q ss_pred CchHHHHHHH-------HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHH Q lcl|Aclame:pro 1 MPSTAQLEAQ-------GRQLAKSIKD-----INADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLK---SLGG 65 (497) Q Consensus 1 m~~~~~~~~~-------~~~l~~~~~~-----~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~---~~~~ 65 (497) ||...+.... .+......+. ....+.....+.++....+....+ ..+..+..++... .+++ T Consensus 185 ~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~----~~~~~~~~~~ai~~g~sld~ 260 (632) T protein:vir:96 185 MPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQ----QFSQRSLAQEAIQKGHTVDQ 260 (632) T ss_pred ccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHH----HhhhhhhHHHHHhccccHHH Confidence 3321111000 0000000000 000000000000111111111111 0010000111110 0111 Q ss_pred HHHHHH-HHHHHHHH-HhHHHHHHH--HHHHHhhhhhhHHHhhhhhhhhhhhhhhhhh-hhhhhhhHHHHHHH---HHHH Q lcl|Aclame:pro 66 ADAAKD-GLDNDIPE-VEVRNLKQI--RKHLARAVIMNPELKNATSFEKGTKFDVSFN-VSAKAADPGTAAAE---LMGA 137 (497) Q Consensus 66 ~~a~~~-~~~~~~~~-~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~~~ 137 (497) .+++.. .+...... ......... ..............++............... .............. ..+. T Consensus 261 ~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~ 340 (632) T protein:vir:96 261 FRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK 340 (632) T ss_pred HHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhh Confidence 111110 00000000 000000000 0000000000000000000000000000000 00000000000000 0000 Q ss_pred HHhhh--hhhhhh-hhhhhcccccCCcccccchh-hHHHHHHHhhhhHHhh-ccceecCCCceEEEEeecCCccceeecc Q lcl|Aclame:pro 138 FADGE--TAPAAI-GQNPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAE 212 (497) Q Consensus 138 ~~~~~--~~~~~~-~~~~~~~~~~~g~~i~~~~~-~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~~v~E 212 (497) ...+. ...... +....++++++|.+||+++. ..||+.+++.+.++++ ++++++.+++++||+++++ +.++||+| T Consensus 341 ~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~-~~a~wv~E 419 (632) T protein:vir:96 341 EARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG-ANFYWIGE 419 (632) T ss_pred hhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCC-ceeEeecC Confidence 00000 001111 22233445556667777765 6799999999999887 6788888889999999884 78999999 Q ss_pred ccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCC-ccccceeccccccc Q lcl|Aclame:pro 213 AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGY-PGVNGLLQRSTGFT 290 (497) Q Consensus 213 g~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~-~~~~Gil~~~~~~~ 290 (497) ++.+|+++++|+++++++||++++++||+|||+|+ ++++++|+++|+++++.++|.+||+|+|+ ++|.||++.++..+ T Consensus 420 ~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~ 499 (632) T protein:vir:96 420 DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPA 499 (632) T ss_pred CccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccc Confidence 99999999999999999999999999999999986 79999999999999999999999999996 56999998765433 Q ss_pred cchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhh- Q lcl|Aclame:pro 291 ASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTL- 369 (497) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 369 (497) ....... .....+..+...+...+ T Consensus 500 ~~~~~~~-------------------------------------------------------~~~~~i~~~~~~i~~~~~ 524 (632) T protein:vir:96 500 LTYPAGG-------------------------------------------------------VDWASVVDMETKISTFNA 524 (632) T ss_pred eeccccc-------------------------------------------------------CCHHHHHHHHHHHhhccc Confidence 2211100 01112222222222222 Q ss_pred ccCCceEEEehhHHHHHHH--HhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEec Q lcl|Aclame:pro 370 FQTPNAVVMNPRDWELLRL--TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR 447 (497) Q Consensus 370 ~~~~~~~~~n~~~~~~l~~--lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r 447 (497) .....+|+|||.++..+.+ ++|++|+|+|.+ .+|+|+||+.+++||+++++||||+. |.+.++ T Consensus 525 ~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------------~~l~G~pv~~s~~ip~~~~~~gd~s~--~~i~~~ 589 (632) T protein:vir:96 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------------NEVNGYRAEASNQIPADTWIFGDWSQ--IVIAMW 589 (632) T ss_pred ccCccEEEEchhHHHHHHHHhccCCCCceeecC-------------CeecccceEeccccccCcEEEeecce--EEEEEe Confidence 2335579999998877765 789999999964 26999999999999999999999998 447788 Q ss_pred cccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecC Q lcl|Aclame:pro 448 EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 448 ~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~ 492 (497) .+++|.++++. +|.+|++.||+++|+|++|++|++|+.++.++ T Consensus 590 ~~~~i~~~~~~--~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 590 GVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred cceEEEEcccc--ccccCceEEEEEeecCceeechhhhhheeecC Confidence 99999999876 48899999999999999999999999999887 No 67 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.1e-57 Score=332.81 Aligned_cols=344 Identities=16% Similarity=0.122 Sum_probs=227.0 Q ss_pred HHHHHHHHHHHHHHhHHHHHHHHHH-HHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhh-hHHHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 67 DAAKDGLDNDIPEVEVRNLKQIRKH-LARAVIMNPELKNATSFEKGTKFDVSFNVSAKAA-DPGTAAAELMGAFADGETA 144 (497) Q Consensus 67 ~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 144 (497) .+++.. ...+.. .........+.+ ..++..+........... .........+..+.. T Consensus 1 ~a~~~a-------------~~~~~~~~~~~~~~~~~~~----~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~---- 59 (366) T protein:vir:57 1 MAAAVA-------------VPVKAHSVAPGIIIKEELQ----QYKGAGMTRMVMSIAAGKGNLADAAKFAATELGD---- 59 (366) T ss_pred Cccccc-------------ccccccccccccccccccc----cccchhHHHHHHHHHhcccchhHHHHHHHHhhcc---- Confidence 000000 000000 000000000000 011111111110000000 000000000011100 Q ss_pred hhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhh-ccceecCCCceEEEEeecCCccceeeccccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~ 223 (497) ........++++++|.+||+++..+||+.+++.++++++ ++++++.++.+++|+.++. +.++||+|++.+|+++++| T Consensus 60 -~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~-~~a~wv~E~~~~~~s~~~f 137 (366) T protein:vir:57 60 -TGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGG-ATAGYVGEGKDVVATGATF 137 (366) T ss_pred -hhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCC-cceeeeccCccccccccce Confidence 011112223344455567778888999999999999998 8889998889999999874 7899999999999999999 Q ss_pred eeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 224 ARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 224 ~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) +++++++||+++++++|+|||+|+ +++++||+++|++++++++|.+||+|+|++ +|.||++..+............ T Consensus 138 ~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~-- 215 (366) T protein:vir:57 138 DDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTA-- 215 (366) T ss_pred eEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccc-- Confidence 999999999999999999999997 699999999999999999999999999986 7999998765433221100000 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) ......+...+.+..... ....+.....|+|||. T Consensus 216 ---------------------------------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~vmn~~ 249 (366) T protein:vir:57 216 ---------------------------------------------INLTTIDEYLDSLILKHM-DSNSNMIRCGWGLSNR 249 (366) T ss_pred ---------------------------------------------cchhhHHHHHHHHHHhhh-ccccccccCEEEecHH Confidence 000000000111111111 1223345678999999 Q ss_pred HHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC--------cEEEeeccceEEEEEeccccEEE Q lcl|Aclame:pro 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQ 453 (497) Q Consensus 382 ~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~ 453 (497) ++..|+++||++|+|+|.+. ..++|+|+||+.+++||++ .++||||+. |.|.++.+++|+ T Consensus 250 ~~~~L~~lkd~~G~~l~~~~----------~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~--~~i~~~~~i~i~ 317 (366) T protein:vir:57 250 TYMTLFGLRDGNGNKVYPEM----------SQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFND--VVIGEDGMMKVD 317 (366) T ss_pred HHHHHHhhhccCCceeccCC----------CCCeecceeeEEccccccccccCCCccEEEEEecce--EEEEEecceEEE Confidence 99999999999999999542 2348999999999999862 489999997 558899999999 Q ss_pred Eeccch---------hhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 454 MTNSNG---------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 454 ~~~~~~---------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ++++.. ++|++|+++||+++|+||+|+||+||++++=..= T Consensus 318 ~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 318 FSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 887642 5799999999999999999999999999984333 No 68 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=2.2e-56 Score=325.69 Aligned_cols=383 Identities=15% Similarity=0.120 Sum_probs=244.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+++.++++++.++.++++++.+.. .++........+++++ +..+++.++++++.++.++... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el----~e~~~~~~~~~eei~~-------------~~~~~~~l~~~~~~l~~~~~~~ 63 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDEL----SQKATDPNIDMEDIKQ-------------LETEKAGLQQRFNIVERQVQDI 63 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCcCHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999998888777665432 1221111111111111 1122222233333332222222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .. ......... .....+....+ .... ....... .............+++..+| T Consensus 64 e~~~~~~~~~~-~~--~~~~~~~~~---~~~~~~~~~~r----~~~~---~~~~~~~---~~~~~~~~~a~~~~~~~~gG 127 (387) T protein:vir:96 64 EEKEKAKVKDK-GE--AYQSLSDNE---KMVKAKAEFYR----HAIL---PNEFEKP---SMEAQRLLHALPTGNDSGGD 127 (387) T ss_pred HHHHHhhhhhc-cc--cCCCCchhH---HHHHHHHHHHH----HHHh---hhhHHHH---HHHHHHHHhhhccCCCCCCc Confidence 11110000000 00 000000000 00000000000 0000 0000000 00111111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) .+||+++...||+.+++.++|+++++++++++ ..+|+.+...+.++|++||+..++++++|+++++.+||++++++|| T Consensus 128 ~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS 205 (387) T protein:vir:96 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeechheeeeechhh Confidence 56777889999999999999999999998875 4677766545679999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~ds-~~l~~~i~~~la~~~~~~~d~-a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ +++++||.++|+++++.+++. .|.+|+|+++|.|++...+....+. T Consensus 206 ~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~------------------------- 260 (387) T protein:vir:96 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999764 5778999999999987643322111 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) ....+.+..+++.+...+. ....|+||+.++..+..+++..|+|+| T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:96 261 ---------------------------------ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCCccc Confidence 1123455666666555543 456899999999888877777888887 Q ss_pred cccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccE Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.++ .++.++..++. .+|++.|++..|+|++ T Consensus 307 ~~~-----------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~~--~~~~~~~~~~~----~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:96 307 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINY--DGTTYDTDKDV----KKGEYLFVLTAWYDQQ 366 (387) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhh--hhhhheecccc----cCCceEEEEEEEeCcE Confidence 542 3479999999999875 5899999984 4443 45666655543 3789999999999999 Q ss_pred eecccceEEEEecCCCCCC Q lcl|Aclame:pro 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~Af~~~~~~~~a~~~ 497 (497) |++|+||+++++++++.+. T Consensus 367 v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:96 367 RTLDSAFRIAKAKENTGPL 385 (387) T ss_pred eechhheEEEEeecCCCCC Confidence 9999999999999888777 No 69 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=2.2e-56 Score=325.69 Aligned_cols=383 Identities=15% Similarity=0.120 Sum_probs=244.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+++.++++++.++.++++++.+.. .++........+++++ +..+++.++++++.++.++... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el----~e~~~~~~~~~eei~~-------------~~~~~~~l~~~~~~l~~~~~~~ 63 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDEL----SQKATDPNIDMEDIKQ-------------LETEKAGLQQRFNIVERQVQDI 63 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCcCHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999998888777665432 1221111111111111 1122222233333332222222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .. ......... .....+....+ .... ....... .............+++..+| T Consensus 64 e~~~~~~~~~~-~~--~~~~~~~~~---~~~~~~~~~~r----~~~~---~~~~~~~---~~~~~~~~~a~~~~~~~~gG 127 (387) T protein:vir:94 64 EEKEKAKVKDK-GE--AYQSLSDNE---KMVKAKAEFYR----HAIL---PNEFEKP---SMEAQRLLHALPTGNDSGGD 127 (387) T ss_pred HHHHHhhhhhc-cc--cCCCCchhH---HHHHHHHHHHH----HHHh---hhhHHHH---HHHHHHHHhhhccCCCCCCc Confidence 11110000000 00 000000000 00000000000 0000 0000000 00111111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) .+||+++...||+.+++.++|+++++++++++ ..+|+.+...+.++|++||+..++++++|+++++.+||++++++|| T Consensus 128 ~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS 205 (387) T protein:vir:94 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeechheeeeechhh Confidence 56777889999999999999999999998875 4677766545679999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~ds-~~l~~~i~~~la~~~~~~~d~-a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ +++++||.++|+++++.+++. .|.+|+|+++|.|++...+....+. T Consensus 206 ~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~------------------------- 260 (387) T protein:vir:94 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999764 5778999999999987643322111 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) ....+.+..+++.+...+. ....|+||+.++..+..+++..|+|+| T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:94 261 ---------------------------------ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCCccc Confidence 1123455666666555543 456899999999888877777888887 Q ss_pred cccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccE Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.++ .++.++..++. .+|++.|++..|+|++ T Consensus 307 ~~~-----------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~~--~~~~~~~~~~~----~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:94 307 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINY--DGTTYDTDKDV----KKGEYLFVLTAWYDQQ 366 (387) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhh--hhhhheecccc----cCCceEEEEEEEeCcE Confidence 542 3479999999999875 5899999984 4443 45666655543 3789999999999999 Q ss_pred eecccceEEEEecCCCCCC Q lcl|Aclame:pro 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~Af~~~~~~~~a~~~ 497 (497) |++|+||+++++++++.+. T Consensus 367 v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:94 367 RTLDSAFRIAKAKENTGPL 385 (387) T ss_pred eechhheEEEEeecCCCCC Confidence 9999999999999888777 No 70 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=2.2e-56 Score=325.69 Aligned_cols=383 Identities=15% Similarity=0.120 Sum_probs=244.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+++.++++++.++.++++++.+.. .++........+++++ +..+++.++++++.++.++... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el----~e~~~~~~~~~eei~~-------------~~~~~~~l~~~~~~l~~~~~~~ 63 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDEL----SQKATDPNIDMEDIKQ-------------LETEKAGLQQRFNIVERQVQDI 63 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCcCHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999998888777665432 1221111111111111 1122222233333332222222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .. ......... .....+....+ .... ....... .............+++..+| T Consensus 64 e~~~~~~~~~~-~~--~~~~~~~~~---~~~~~~~~~~r----~~~~---~~~~~~~---~~~~~~~~~a~~~~~~~~gG 127 (387) T protein:vir:26 64 EEKEKAKVKDK-GE--AYQSLSDNE---KMVKAKAEFYR----HAIL---PNEFEKP---SMEAQRLLHALPTGNDSGGD 127 (387) T ss_pred HHHHHhhhhhc-cc--cCCCCchhH---HHHHHHHHHHH----HHHh---hhhHHHH---HHHHHHHHhhhccCCCCCCc Confidence 11110000000 00 000000000 00000000000 0000 0000000 00111111222233444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) .+||+++...||+.+++.++|+++++++++++ ..+|+.+...+.++|++||+..++++++|+++++.+||++++++|| T Consensus 128 ~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS 205 (387) T protein:vir:26 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeechheeeeechhh Confidence 56777889999999999999999999998875 4677766545679999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~ds-~~l~~~i~~~la~~~~~~~d~-a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ +++++||.++|+++++.+++. .|.+|+|+++|.|++...+....+. T Consensus 206 ~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~------------------------- 260 (387) T protein:vir:26 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999764 5778999999999987643322111 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMG 398 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~ 398 (497) ....+.+..+++.+...+. ....|+||+.++..+..+++..|+|+| T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~y~-~na~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:26 261 ---------------------------------ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCCccc Confidence 1123455666666555543 456899999999888877777888887 Q ss_pred cccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccE Q lcl|Aclame:pro 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) Q Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~ 478 (497) .+. +.+|+|+||++++.++ +++||||+++ |.++ .++.++..++. .+|++.|++..|+|++ T Consensus 307 ~~~-----------~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~~--~~~~~~~~~~~----~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:26 307 DTP-----------AEKVFGKPVVFTDAAV--KPIVGDFNYF-GINY--DGTTYDTDKDV----KKGEYLFVLTAWYDQQ 366 (387) T ss_pred ccC-----------CccccccceEEecCCC--ceeeechhhh-hhhh--hhhhheecccc----cCCceEEEEEEEeCcE Confidence 542 3479999999999875 5899999984 4443 45666655543 3789999999999999 Q ss_pred eecccceEEEEecCCCCCC Q lcl|Aclame:pro 479 VYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 479 v~~~~Af~~~~~~~~a~~~ 497 (497) |++|+||+++++++++.+. T Consensus 367 v~~~~A~~~l~~ka~~~~~ 385 (387) T protein:vir:26 367 RTLDSAFRIAKAKENTGPL 385 (387) T ss_pred eechhheEEEEeecCCCCC Confidence 9999999999999888777 No 71 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.6e-57 Score=330.82 Aligned_cols=281 Identities=15% Similarity=0.132 Sum_probs=223.8 Q ss_pred hcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeee Q lcl|Aclame:pro 153 FGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 153 ~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~k 232 (497) +-+.+++|.++|+++...||+.+++.++|++++++++++++.+++|+.++. +.++||+||+.+|+++++|+++++.+|| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAP-PRGEVVGEGAQKSESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCC-ceeEEeecCcccccccceeeEEEEeeEE Confidence 333444567888899999999999999999999999999999999999874 7899999999999999999999999999 Q ss_pred eeeechhhHHHHh---hH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc---cccceeccccccccchhhhhhhHHHHHH Q lcl|Aclame:pro 233 VANALTITDEGLR---DA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP---GVNGLLQRSTGFTASSASSLFGATSATV 305 (497) Q Consensus 233 ia~~~~iS~ell~---ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~---~~~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) +++++++|+|||+ |+ .+++++|.+++++++++++|.+|++|++.+ .+.||++.....+..... T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~---------- 149 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL---------- 149 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeee---------- Confidence 9999999999995 33 579999999999999999999999997543 355666543221111100 Q ss_pred HHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHH Q lcl|Aclame:pro 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 385 (497) . .......+..+.........++.++++|+|||.++.+ T Consensus 150 -----------------------------------------~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~ 187 (311) T protein:vir:81 150 -----------------------------------------T-TGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFM 187 (311) T ss_pred -----------------------------------------c-ccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHH Confidence 0 0000111122222333334556677889999999999 Q ss_pred HHHHhcccCcccccccccccccccccccccccccceeecCCCCcC------------------cEEEeeccceEEEEEec Q lcl|Aclame:pro 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------------TILVGHFAPSVIQTARR 447 (497) Q Consensus 386 l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------------------~~~~gd~~~~~~~i~~r 447 (497) |++|||++|+|+|.+... ...+.+|+|+||+.++.||.+ .+++|||++ |.+..+ T Consensus 188 l~~lkd~~G~~l~~~~~~------~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~--~~i~~~ 259 (311) T protein:vir:81 188 LATQRDSQGRKLYPELGF------GTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSA--FRWGVQ 259 (311) T ss_pred HHhhhccCCCeeecCccc------cCCCceecceeEEecccccccccccccccchhcccCCccEEEEEeccc--EEEEEe Confidence 999999999999987543 234568999999999999853 368999998 446678 Q ss_pred cccEEEEeccch-----hhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 448 EGVTMQMTNSNG-----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 448 ~~~~i~~~~~~~-----~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) .+++++++++.. ++|++|+|+||+++|+|++|+||+||++|+-.+.| T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 260 VSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred ccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 889999887643 56999999999999999999999999999988888 No 72 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.2e-57 Score=331.22 Aligned_cols=295 Identities=14% Similarity=0.115 Sum_probs=231.9 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~ 221 (497) .........+...+++.++++|||++..+||+.+++.++|++++++++++++.++||+++. .+.++||+|++.+|++++ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~~s~~ 79 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTG-DVSAQWIGEGDMKPITKG 79 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcC-CcceEEecCCcccccccc Confidence 1111122334455667778899999999999999999999999999999999999999987 477999999999999999 Q ss_pred cceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhH Q lcl|Aclame:pro 222 EFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 222 ~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) +|+++++.+||++++++||+|||+|+ +++++||+++|++++++++|.+||+|+|++++.+.+......+..... T Consensus 80 ~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~----- 154 (397) T protein:vir:23 80 NMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISP----- 154 (397) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecc----- Confidence 99999999999999999999999987 689999999999999999999999999987644333222111111100 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ....+.+..+...+... +...+.|+||+ T Consensus 155 ---------------------------------------------------~~~~~~~~~~~~~l~~~-~~~~a~~vmn~ 182 (397) T protein:vir:23 155 ---------------------------------------------------NAYQGLGVSGLTKLVTD-GKKWTHTLLDD 182 (397) T ss_pred ---------------------------------------------------cchhHHHHHHHHhhhhc-ccCCCEEEEcH Confidence 01112222333333333 34567899999 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc--EEEeeccceEEEEEeccccEEEEeccc Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~--~~~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) .++..|+++||++|+|+|.+....+.... ....+|+|+||++++++|+++ +++|||+.. .++++.+++++++++. T Consensus 183 ~~~~~L~~lkd~~G~~i~~~~~~~~~~~~-~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~--~i~~~~~i~i~~~~e~ 259 (397) T protein:vir:23 183 TVEPVLNGSVDANGRPLFVESTYESLTTP-FREGRILGRPTILSDHVAEGDVVGYAGDFSQI--IWGQVGGLSFDVTDQA 259 (397) T ss_pred HHHHHHHHhhccCCceeeccccccccccc-ccCceeeeeeEEEeCCCCCCceEEEEeecceE--EEEEEeceEEEEeeee Confidence 99999999999999999998765443322 233579999999999999987 478999984 4677899999887654 Q ss_pred ------------hhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 459 ------------GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 459 ------------~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .++|++|+++||+++|+|++++||+||++++..+....- T Consensus 260 ~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~ 310 (397) T protein:vir:23 260 TLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTY 310 (397) T ss_pred eeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccccee Confidence 257999999999999999999999999999986664443 No 73 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=8.5e-56 Score=322.50 Aligned_cols=382 Identities=14% Similarity=0.113 Sum_probs=239.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+++.++++.+.++.++++.+.+...+ +....+...+++++ +..+++.++++++.++.++.+. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~----~~~~~~~~~ee~~~-------------~~~~~~~l~~~~~~l~~~~~~~ 63 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQ----KATDPNIDMEDIKQ-------------LETEKAGLQQRFNIVERQVKDI 63 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHH----HHhccCcCHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 999999999998887777766542221 11111111111111 1122222233333332222222 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......... .. .......+.+. ...+....+ ..... . ..................++++.+| T Consensus 64 e~~~~~~~~~~-~~--~~~~~~~~~~~---~~~~~~~~r----~~~~~---~---~~~~~~~~~~~~~~al~~~t~s~gG 127 (387) T protein:vir:93 64 EEKEKAKVKDT-GE--AYQSLNDHEKM---VKAKAEFYR----HAILP---N---EFEKPSMEAQRLLHALPTGNDSGGD 127 (387) T ss_pred HHHHHHhhhhc-cc--cCCCcchhhHH---HHHHHHHHH----HHhhh---h---hhhhhhhhhHHHHHhhccCcCCCCc Confidence 11110000000 00 00000000000 000000000 00000 0 0000011111122222333444455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) .+||+++...||+.+++.++|+++|+++++++ ..+|+.....+.++||+||+..++++++|+++++.+||++++++|| T Consensus 128 ~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS 205 (387) T protein:vir:93 128 KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAIS 205 (387) T ss_pred eeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhh Confidence 56777888999999999999999999998875 4677765544678999999999999999999999999999999999 Q ss_pred HHHHhhH-HHHHHHHHHHHHHHHHHHHHh-hhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 241 DEGLRDA-PELFNFVQGRLLEGIQRKEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 241 ~ell~ds-~~l~~~i~~~la~~~~~~~d~-a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) +|||+|+ +++++||.++|+++++.+++. .|.+|+|+++|.|++..++....+. T Consensus 206 ~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~------------------------- 260 (387) T protein:vir:93 206 DTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEG------------------------- 260 (387) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc------------------------- Confidence 9999997 689999999999999999766 5778999999999987543221111 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHH-HHhcccCccc Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR-LTKDANGQYM 397 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~-~lkd~~G~~~ 397 (497) ....+.+..+++.+...+. ....|+||+.++..+. +++|.+| |+ T Consensus 261 ---------------------------------~~~~d~i~~~~~~l~~~~~-~~a~~~mn~~t~~~~~~~~~d~~~-~~ 305 (387) T protein:vir:93 261 ---------------------------------ADMYDAIINALADLHEDYR-DNATIYMRYADYVKIISVLSNGTT-NF 305 (387) T ss_pred ---------------------------------cchHHHHHHHHhccChhhh-cCCEEEEechHHHHHHHHHhcCCC-cc Confidence 0123445566665555544 4568999999987665 5555555 44 Q ss_pred ccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeecc Q lcl|Aclame:pro 398 GGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGL 477 (497) Q Consensus 398 ~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~ 477 (497) |.+ .+.+|+|+||++++.++ +++||||+++ |.+ +.++.++..++ +.++++.|++..|+|+ T Consensus 306 ~~~-----------~~~~llG~PV~~~~~~~--~~~~GDf~~~-~~~--~~~~~~~~~~~----~~~~~~~~~~~~r~d~ 365 (387) T protein:vir:93 306 FDT-----------PAEKVFGKPVVFTDAAV--KPIVGDFNYF-GIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQ 365 (387) T ss_pred ccc-----------CCccccccceEEecCCC--ceeeeehhhh-hee--hhhheeeeccc----ccCCceeEEEEeeeCc Confidence 432 22479999999999875 5899999984 433 34566665543 4589999999999999 Q ss_pred EeecccceEEEEecCCCCCC Q lcl|Aclame:pro 478 LVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 478 ~v~~~~Af~~~~~~~~a~~~ 497 (497) .|++|+||+++++++++.+. T Consensus 366 ~v~~~eA~~~l~~k~~~~~~ 385 (387) T protein:vir:93 366 QRTLDSAFRIAKAKENTGSL 385 (387) T ss_pred eeechhheEEEEeecCCCCC Confidence 99999999999999888887 No 74 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.6e-56 Score=326.55 Aligned_cols=303 Identities=15% Similarity=0.141 Sum_probs=233.5 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++.+ ......+.|.................+..++++||+++..+|++.+++.++|++++++ T Consensus 1 ~~~~~-----------------~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:97 1 MEQTQ-----------------KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred Cccch-----------------hHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcce Confidence 00000 0000111222222222222223334445566788888999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) +++++++++||+.++ .+.+.|++||+.+|+++++|+++++.+||++++++||+|+|+|+ +++++||.++|++++++++ T Consensus 64 ~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:97 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEec-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999987 47899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.+||+|+|++ .|.||++.....+.... T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:97 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhccCCCCccCccccccccccceecc--------------------------------------------------- Confidence 99999999987 47777765332211110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+++..+...+... +..+..|+|||.+|..|+++||++|+|+|.+. ..++|+|+||+.++ T Consensus 172 -----~~~~~~~i~~~~~~l~~~-~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~----------~~~tl~G~PV~~~~ 235 (324) T protein:vir:97 172 -----GDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDTLDGLPVVNLK 235 (324) T ss_pred -----ccCCHHHHHHHHHhhhhc-cCCCCEEEEcHHHHHHHHHhhcCCCceeecCC----------CCccccceeeEeec Confidence 011233444454444443 45567899999999999999999999998642 34589999999988 Q ss_pred CCC--cCcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LIP--LGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~--~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) ..+ .+.+++|||++ +.++++.+++|+++++.. ++|++|+++||+++|+|+++.+|+||++|+.+ T Consensus 236 ~~~~~~~~~~~gd~~~--~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEeccc--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 754 56799999998 447789999999987643 67999999999999999999999999999987 Q ss_pred CCCCCC Q lcl|Aclame:pro 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) .+.+-. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:97 314 DKKTDS 319 (324) T ss_pred cCCCCC Confidence 776654 No 75 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=5.8e-57 Score=328.90 Aligned_cols=285 Identities=15% Similarity=0.148 Sum_probs=230.1 Q ss_pred hhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccccccc Q lcl|Aclame:pro 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~ 222 (497) -..........++++.+|.+||+++..+|++.+++.++|++++++++++++.++||+.++ .+.+.|++|++.+|+++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAK-GVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC-CcceEEeecCcccccccce Confidence 111222333345555667778888999999999999999999999999999999999986 4679999999999999999 Q ss_pred ceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 223 ~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) |++++++++|++++++||+|+++|+ +++++||.++|++++++++|.+|++|+|+++|.|+............. T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~------ 153 (304) T protein:vir:10 80 YAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG------ 153 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc------ Confidence 9999999999999999999999997 589999999999999999999999999998888765432211110000 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) ..........+++..++..+..+ +..+..|+|||. T Consensus 154 --------------------------------------------~~~~~~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~ 188 (304) T protein:vir:10 154 --------------------------------------------NVVTDTNNLYVDLSALMATIEDE-ELDPNGVLTTRS 188 (304) T ss_pred --------------------------------------------cccccccchHHHHHHHHHHhhhc-cCCcCEEEEcHH Confidence 00001112345555555555544 455668999999 Q ss_pred HHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCc----CcEEEeeccceEEEEEeccccEEEEecc Q lcl|Aclame:pro 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL----GTILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 382 ~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~----~~~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) +|..|+++||++|+|+|.+. .++|+|+||++++++|. +.+++|||++ +.++++.+++++++++ T Consensus 189 ~~~~L~~lkd~~G~~l~~~~-----------~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~e 255 (304) T protein:vir:10 189 FRSKMRNALDANDRPLFDAN-----------GNEIMGLPLSYTGADVYDKKKSLALMGDWDY--ARYGILQGIEYAISED 255 (304) T ss_pred HHHHHHHhhccCCcEeecCC-----------CccccceeeEEecccccCCCCcEEEEEehhh--EEEEEecceEEEEeec Confidence 99999999999999999763 24799999999999985 3599999998 4578899999998876 Q ss_pred ch--------------hhhhcCceEEEEEeeeccEeecccceEEEEecC Q lcl|Aclame:pro 458 NG--------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 458 ~~--------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~ 492 (497) .. ++|++|+++||+++|+|+.|++|+||++|+... T Consensus 256 ~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 256 ATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 43 579999999999999999999999999999877 No 76 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=5.8e-57 Score=328.90 Aligned_cols=285 Identities=15% Similarity=0.148 Sum_probs=230.1 Q ss_pred hhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccccccccc Q lcl|Aclame:pro 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~ 222 (497) -..........++++.+|.+||+++..+|++.+++.++|++++++++++++.++||+.++ .+.+.|++|++.+|+++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~ 79 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAK-GVGAYWVSETERIQTSKPE 79 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC-CcceEEeecCcccccccce Confidence 111222333345555667778888999999999999999999999999999999999986 4679999999999999999 Q ss_pred ceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 223 FARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 223 ~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) |++++++++|++++++||+|+++|+ +++++||.++|++++++++|.+|++|+|+++|.|+............. T Consensus 80 ~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~------ 153 (304) T protein:vir:94 80 YAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG------ 153 (304) T ss_pred eeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc------ Confidence 9999999999999999999999997 589999999999999999999999999998888765432211110000 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) ..........+++..++..+..+ +..+..|+|||. T Consensus 154 --------------------------------------------~~~~~~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~ 188 (304) T protein:vir:94 154 --------------------------------------------NVVTDTNNLYVDLSALMATIEDE-ELDPNGVLTTRS 188 (304) T ss_pred --------------------------------------------cccccccchHHHHHHHHHHhhhc-cCCcCEEEEcHH Confidence 00001112345555555555544 455668999999 Q ss_pred HHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCc----CcEEEeeccceEEEEEeccccEEEEecc Q lcl|Aclame:pro 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL----GTILVGHFAPSVIQTARREGVTMQMTNS 457 (497) Q Consensus 382 ~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~----~~~~~gd~~~~~~~i~~r~~~~i~~~~~ 457 (497) +|..|+++||++|+|+|.+. .++|+|+||++++++|. +.+++|||++ +.++++.+++++++++ T Consensus 189 ~~~~L~~lkd~~G~~l~~~~-----------~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~e 255 (304) T protein:vir:94 189 FRSKMRNALDANDRPLFDAN-----------GNEIMGLPLSYTGADVYDKKKSLALMGDWDY--ARYGILQGIEYAISED 255 (304) T ss_pred HHHHHHHhhccCCcEeecCC-----------CccccceeeEEecccccCCCCcEEEEEehhh--EEEEEecceEEEEeec Confidence 99999999999999999763 24799999999999985 3599999998 4578899999998876 Q ss_pred ch--------------hhhhcCceEEEEEeeeccEeecccceEEEEecC Q lcl|Aclame:pro 458 NG--------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 458 ~~--------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~ 492 (497) .. ++|++|+++||+++|+|+.|++|+||++|+... T Consensus 256 ~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 256 ATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 43 579999999999999999999999999999877 No 77 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=1.2e-56 Score=327.07 Aligned_cols=285 Identities=14% Similarity=0.097 Sum_probs=224.0 Q ss_pred hcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeee Q lcl|Aclame:pro 153 FGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 153 ~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~k 232 (497) +++++++|.+||+++..+||+.+++.++|++++++++++++.++||+.++. +.|+||+||+.+|+++++|+++++++|| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLD-SDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecC-cceEEeecCccccccccceeeEEeeeEE Confidence 445566788899999999999999999999999999999999999999874 6899999999999999999999999999 Q ss_pred eeeechhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHH Q lcl|Aclame:pro 233 VANALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNV 308 (497) Q Consensus 233 ia~~~~iS~ell~---d-s~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (497) +++++++|+|||+ | .+++++||.+++++++++++|.++++|++.+...+............. T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~-------------- 145 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKV-------------- 145 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccc-------------- Confidence 9999999999993 3 468999999999999999999999999765433322111110000000 Q ss_pred hhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHH Q lcl|Aclame:pro 309 KFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL 388 (497) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~ 388 (497) .............+++..++..+. ..+..++.|+|||.++..|++ T Consensus 146 ----------------------------------~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~vmn~~~~~~L~~ 190 (303) T protein:vir:97 146 ----------------------------------TQVVKFTESEDADANIEAAVNLIQ-GAEGVVTGLAMDTEFSTALAK 190 (303) T ss_pred ----------------------------------ccccccccccchHHHHHHHHHHHh-hcCCCccEEEEcHHHHHHHHH Confidence 000000111122344444444433 345667789999999999999 Q ss_pred HhcccCcccccccccccccccccccccccccceeecCCCCcC--------cEEEeeccceEEEEEeccccEEEEeccch- Q lcl|Aclame:pro 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG--------TILVGHFAPSVIQTARREGVTMQMTNSNG- 459 (497) Q Consensus 389 lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~--------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~- 459 (497) +||++|+|+|.+.... ...+.+|||+||++|++||.+ .++||||+.. |.++.|.+++++++++.. T Consensus 191 lkd~~g~~~~~~~~~~-----~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~-~~~~~~~~~~~~~~~~~~~ 264 (303) T protein:vir:97 191 VTNGEMGPKMYPELAW-----GANPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESM-FKWGYAKQIPMEIIKYGDP 264 (303) T ss_pred hhccCCCeEEecCccC-----CCCCceecceeeEEecccCCccccCCCccEEEEeecccc-EEEEEecCcEEEEeeccCC Confidence 9999999999875422 223458999999999999853 3889999874 567789999999887542 Q ss_pred -----hhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 460 -----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 460 -----~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ++|++|+++||+++|+|++|++|+||++|+.... T Consensus 265 d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 265 DNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 5799999999999999999999999999997777 No 78 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.9e-56 Score=325.06 Aligned_cols=303 Identities=16% Similarity=0.148 Sum_probs=232.4 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++. ...... .+.|.................++.++++||+++..+|++.+++.++|++++++ T Consensus 1 ~~~~----------------~~~~~~-~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:93 1 MEQT----------------QKLKLN-LQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred Cchh----------------HHHHHH-HHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 0000 000000 11122222222222333344455567789999999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.++||+.++ .+.++||+||+.+|+++++|+++++.++|++++++||+||++|+ +++++||.++|++++++++ T Consensus 64 ~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:93 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEec-CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999987 47899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.|+++.......... T Consensus 143 d~a~l~G~g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:93 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHHhcCCCCCCcCccccccccccceecc--------------------------------------------------- Confidence 99999999976 47777664432211110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+++..++..+..+ +..+..|+|||.+|..|+++||++|+|+|.+. .+++|+|+||+.++ T Consensus 172 -----~~~~~~~i~~~~~~l~~~-~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~----------~~~~l~G~PVv~~~ 235 (324) T protein:vir:93 172 -----GDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLK 235 (324) T ss_pred -----ccccHHHHHHHHHhhhhc-cCCCCEEEEcHHHHHHHHHhhCCCCCeeecCC----------CCCcccceeeEeec Confidence 012234455555555444 45567899999999999999999999998642 34589999999977 Q ss_pred CC--CcCcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LI--PLGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~--~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .. +.+.+++|||++ +.++++++++|+++++.. ++|++|+++||+++|+||.|.||+||++|+.. T Consensus 236 ~~~~~~~~i~~gdfs~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecc Confidence 64 566799999997 347889999999988753 57999999999999999999999999999844 Q ss_pred CCCCCC Q lcl|Aclame:pro 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) .+.+-. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:93 314 DKRTDS 319 (324) T ss_pred cccCCC Confidence 443322 No 79 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=3.6e-56 Score=324.57 Aligned_cols=303 Identities=15% Similarity=0.136 Sum_probs=231.1 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) ++.. .......+.+.................+..++++||+++..+||+.+++.++|++++++ T Consensus 1 ~~~~-----------------~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:78 1 MEQT-----------------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcc-----------------hhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcce Confidence 0000 00001111222222222222222334455566788889999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.++||+.++. +.++||+||+.+|+++++|+++++.+||++++++||+|+|+|+ +++++||.++|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:78 64 EPMEGTEKKFTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEecC-cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999989999999874 7899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++. |.||.+.......... T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:78 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHHhccCCCCCcCccccccccccceecc--------------------------------------------------- Confidence 999999999864 7777664432211110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+++..+...+.. .+..+++|+|||.+|..|+++||++|+|++.+. ..++|+|+||+.++ T Consensus 172 -----~~~t~~~i~~~~~~l~~-~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~----------~~~~l~G~PV~~~~ 235 (324) T protein:vir:78 172 -----GDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLK 235 (324) T ss_pred -----ccccHHHHHHHHHhhhh-ccCCCCEEEEcHHHHHHHHHhhccCCCeeecCC----------CCCcccceeeEeeC Confidence 11223445555554444 445677899999999999999999999998642 34589999999987 Q ss_pred CC--CcCcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LI--PLGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~--~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .+ +.+.+++|||++ +.++++.+++++++++.. ++|++|+++||+++|+||.|.||+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecc Confidence 64 456799999998 347789999999987643 57999999999999999999999999999864 Q ss_pred CCCC---CC Q lcl|Aclame:pro 492 KGAT---GS 497 (497) Q Consensus 492 ~~a~---~~ 497 (497) ...+ .+ T Consensus 314 ~~~~~~~~~ 322 (324) T protein:vir:78 314 DKRTDSVPG 322 (324) T ss_pred cccCCCCCC Confidence 4332 22 No 80 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=3.6e-56 Score=324.57 Aligned_cols=303 Identities=15% Similarity=0.136 Sum_probs=231.1 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) ++.. .......+.+.................+..++++||+++..+||+.+++.++|++++++ T Consensus 1 ~~~~-----------------~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:96 1 MEQT-----------------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcc-----------------hhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcce Confidence 0000 00001111222222222222222334455566788889999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.++||+.++. +.++||+||+.+|+++++|+++++.+||++++++||+|+|+|+ +++++||.++|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:96 64 EPMEGTEKKFTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEecC-cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999989999999874 7899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++. |.||.+.......... T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:96 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHHhccCCCCCcCccccccccccceecc--------------------------------------------------- Confidence 999999999864 7777664432211110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+++..+...+.. .+..+++|+|||.+|..|+++||++|+|++.+. ..++|+|+||+.++ T Consensus 172 -----~~~t~~~i~~~~~~l~~-~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~----------~~~~l~G~PV~~~~ 235 (324) T protein:vir:96 172 -----GDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLK 235 (324) T ss_pred -----ccccHHHHHHHHHhhhh-ccCCCCEEEEcHHHHHHHHHhhccCCCeeecCC----------CCCcccceeeEeeC Confidence 11223445555554444 445677899999999999999999999998642 34589999999987 Q ss_pred CC--CcCcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LI--PLGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~--~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .+ +.+.+++|||++ +.++++.+++++++++.. ++|++|+++||+++|+||.|.||+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecc Confidence 64 456799999998 347789999999987643 57999999999999999999999999999864 Q ss_pred CCCC---CC Q lcl|Aclame:pro 492 KGAT---GS 497 (497) Q Consensus 492 ~~a~---~~ 497 (497) ...+ .+ T Consensus 314 ~~~~~~~~~ 322 (324) T protein:vir:96 314 DKRTDSVPG 322 (324) T ss_pred cccCCCCCC Confidence 4332 22 No 81 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=3.4e-56 Score=324.67 Aligned_cols=280 Identities=15% Similarity=0.111 Sum_probs=220.7 Q ss_pred ccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeee Q lcl|Aclame:pro 155 STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) Q Consensus 155 ~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia 234 (497) -..++|.++||++..+||+.+++.++|++++++++++++.+++|+.++. +.|+||+|++.+|+++++|+++++.+||++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~-~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMD-SEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecC-cceEEecCCccccccccceeEEEEeeeeEE Confidence 3345577899999999999999999999999999999988999998874 789999999999999999999999999999 Q ss_pred eechhhHHHHh---hH-HHHHHHHHHHHHHHHHHHHHhhhhccCC--CccccceeccccccccchhhhhhhHHHHHHHHH Q lcl|Aclame:pro 235 NALTITDEGLR---DA-PELFNFVQGRLLEGIQRKEEVQLLAGGG--YPGVNGLLQRSTGFTASSASSLFGATSATVSNV 308 (497) Q Consensus 235 ~~~~iS~ell~---ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g--~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (497) +++++|+|||+ |+ .++++||.++|++++++++|.++++|++ ++.+.++............ T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-------------- 145 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ-------------- 145 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccc-------------- Confidence 99999999995 33 5799999999999999999999999964 3334433221111000000 Q ss_pred hhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHH Q lcl|Aclame:pro 309 KFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL 388 (497) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~ 388 (497) ............+++..++..+.. .+..+.+|+|||.++..|++ T Consensus 146 -----------------------------------~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~vmn~~~~~~l~~ 189 (298) T protein:vir:16 146 -----------------------------------KVEAPRGIADPNGAIENAVELLTG-VDADVTGIAINPSFRSALAK 189 (298) T ss_pred -----------------------------------ccccccccccHHHHHHHHHHHhhh-cCCCccEEEEcHHHHHHHHH Confidence 000000111223344455544444 34556789999999999999 Q ss_pred HhcccCcccccccccccccccccccccccccceeecCCCCcC------cEEEeeccceEEEEEeccccEEEEeccch--- Q lcl|Aclame:pro 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNG--- 459 (497) Q Consensus 389 lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------~~~~gd~~~~~~~i~~r~~~~i~~~~~~~--- 459 (497) +||++|+|+|++.... ..+.+|+|+||++++++|.+ .+++|||+++ +.++.|.+++++++++.. T Consensus 190 lkd~~G~~i~~~~~~~------~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~~-~~~~~~~~~~~~~~~~~~~~~ 262 (298) T protein:vir:16 190 QKDLQDNALFPELKWG------ATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANG-FKWGYAKEVPLEVIQYGDPDN 262 (298) T ss_pred hhccCCCeeecCcccC------CCCceecceeeEEecccccccCCCccEEEEeeccce-EEEEEecCceEEEeeccCCcC Confidence 9999999999875432 23458999999999999863 4889999985 557778999999877532 Q ss_pred ---hhhhcCceEEEEEeeeccEeecccceEEEEecC Q lcl|Aclame:pro 460 ---TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 460 ---~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~ 492 (497) ++|++||++||+++|+|++|+||+||++|+-.+ T Consensus 263 ~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 263 SGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 479999999999999999999999999998666 No 82 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=5.3e-56 Score=323.62 Aligned_cols=294 Identities=15% Similarity=0.144 Sum_probs=228.8 Q ss_pred hhhhhhhhhhhhcc------cccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccc-- Q lcl|Aclame:pro 142 ETAPAAIGQNPFGS------TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEA-- 213 (497) Q Consensus 142 ~~~~~~~~~~~~~~------~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg-- 213 (497) ....++.+.+..++ ...+++++|+++..+||+.+++.++|++++++++++++..++|+.++ .+.++||+|| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~eg~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVK-RPEVGQVGVGTS 79 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CceeEeecCccc Confidence 11111111122222 22234478889999999999999999999999999999999999987 4677787765 Q ss_pred ------cccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCcccc---cee Q lcl|Aclame:pro 214 ------GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN---GLL 283 (497) Q Consensus 214 ------~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~---Gil 283 (497) +.+|+++++|+++++++||+++++++|+|+++|+ +++++||+++|++++++++|.+||+|+|++.+. |+. T Consensus 80 ~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~ 159 (333) T protein:vir:78 80 NEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (333) T ss_pred ccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccc Confidence 5678899999999999999999999999999987 589999999999999999999999999987654 444 Q ss_pred ccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhh Q lcl|Aclame:pro 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV 363 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (497) +.......+.. ...........+.+..++. T Consensus 160 ~~~~~~~~~~~--------------------------------------------------~~~~~~~~~~~~~i~~~~~ 189 (333) T protein:vir:78 160 TDNVIANTTNV--------------------------------------------------DYLQETGDPLLDRLLDGYD 189 (333) T ss_pred ccccccccccc--------------------------------------------------cccccccchhHHHHHHHHH Confidence 33222111110 0000111223556666666 Q ss_pred hhhhhhccCCceEEEehhHHHHHHH---HhcccCcccccccccccccccccccccccccceeecCCCCcC---------c Q lcl|Aclame:pro 364 DIQLTLFQTPNAVVMNPRDWELLRL---TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG---------T 431 (497) Q Consensus 364 ~~~~~~~~~~~~~~~n~~~~~~l~~---lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~---------~ 431 (497) .+...++..+++|+|||.+|..|++ +||.+|+|+|.+.... ..+.+|+|+||++++++|.+ . T Consensus 190 ~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~------~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~ 263 (333) T protein:vir:78 190 LVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLA------AQTGDVLGLPAQFGRAVGGDLGAAVDSKTR 263 (333) T ss_pred hhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCcccc------CCCceeeceeeEEccccCCCccccCCCccE Confidence 6666777788899999999987765 7899999999775432 34568999999999999865 4 Q ss_pred EEEeeccceEEEEEeccccEEEEeccch---------hhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 432 ILVGHFAPSVIQTARREGVTMQMTNSNG---------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 432 ~~~gd~~~~~~~i~~r~~~~i~~~~~~~---------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) +++|||+. |.+.+|.+++|+++++.. ++|++|+++||+++|+|++|+||+||++|+.+++. T Consensus 264 ~~~gD~~~--~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 264 IIGGDFSQ--LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEEeccc--EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 89999998 457789999999988742 57999999999999999999999999999988888 No 83 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.3e-55 Score=321.49 Aligned_cols=303 Identities=16% Similarity=0.147 Sum_probs=234.0 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++... .. ...+.|.................+..++++||+++..+|++.+++.++|++++++ T Consensus 1 ~~k~~~----------------~~-~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:99 1 MEQTQK----------------LK-LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKY 63 (324) T ss_pred CCCchH----------------hh-HHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 000000 00 0011122222222222233334455566789999999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) +++++++++||+.++ .+.+.|++||+.+|+++++|+++++.++|+++++++|+||++|+ +++++||.++|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:99 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEec-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999876 47899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.|+++.....+.... T Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:99 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhhcCCCCccCccccccccccceecc--------------------------------------------------- Confidence 99999999987 47777664332221110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+.+..++..+.. .+..++.|+|||.+|..|+++||++|+|+|.+. .+++|+|+||+.++ T Consensus 172 -----~~~~~~~i~~~~~~l~~-~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~----------~~~~l~G~PVv~~~ 235 (324) T protein:vir:99 172 -----GDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDTLDGLPVVNLK 235 (324) T ss_pred -----ccCCHHHHHHHHHhhhh-ccCCCCEEEEcHHHHHHHHHhhcCCCceeecCC----------CCccccceeEEeec Confidence 01123444555555443 445667899999999999999999999998542 34589999999998 Q ss_pred CCCc--CcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LIPL--GTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~~--~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .++. +.+++|||+. +.++++.+++|+++++.. ++|++|+++||+++|+|+.|.||+||++|+.. T Consensus 236 ~~~~~~~~~i~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEeccc--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEec Confidence 8664 5699999998 447889999999987743 56999999999999999999999999999987 Q ss_pred CCCCCC Q lcl|Aclame:pro 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) .+.+-. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:99 314 DKKTDS 319 (324) T ss_pred cCCCCC Confidence 666654 No 84 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=4.6e-55 Score=318.48 Aligned_cols=363 Identities=15% Similarity=0.117 Sum_probs=222.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |....+-..++++. .+++..+ ++... ..++..+.+++. .+.+..++... T Consensus 1 M~i~~~~~~~~~e~---~~~l~~~------------------~~~~~-------~~e~~~~~~~~~---~~~~~~~~~~~ 49 (377) T protein:vir:96 1 MAINLKELPKYREA---VAELSAK------------------ISAGA-------TPEEQEKLFEAA---FTTMGDEILAK 49 (377) T ss_pred CCccHHHHHHHHHH---HHHHHHH------------------Hhhcc-------cHHHHHHHHHHH---HHHHHHHHHHH Confidence 54433322222211 1111110 00000 000000001100 01111111000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) . ..+... ..... ...... ..+.+..+.. ....++..++| T Consensus 50 ~-------~~e~~~-------~~~~~--~~~~~l----------------t~ee~~~~~~---------~~~~~~~~~gg 88 (377) T protein:vir:96 50 N-------EEEMER-------MFDLR--DKNREL----------------TAEEIKFFND---------IDKNVGGKDKF 88 (377) T ss_pred H-------HHHHHH-------HHHhc--cCCccc----------------CHHHHHHHHH---------HHhcCCCCCCc Confidence 0 000000 00000 000000 0000111100 01123344455 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccc-cccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~-~s~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++...|++.+.+.++|+++|+++++++ ..++|+.++ .+.|+|++|++..+ +++|+|+++++.+||++++++| T Consensus 89 ~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~-~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~i 166 (377) T protein:vir:96 89 KLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVI 166 (377) T ss_pred eecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-CcceeEeecccccccccCccceeEeeeeeeEEeechh Confidence 66777889999999999999999999999865 578998766 57899999988765 5789999999999999999999 Q ss_pred hHHHHhhHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhh Q lcl|Aclame:pro 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) Q Consensus 240 S~ell~ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) |++||+|++ ++++||.++|+++++.++|.+|++|+|+++|.||++.....+................. T Consensus 167 s~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~----------- 235 (377) T protein:vir:96 167 PKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKE----------- 235 (377) T ss_pred hHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccc----------- Confidence 999999985 79999999999999999999999999999999999876655443332111110000000 Q ss_pred hhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh----------hccCCceEEEehhHHHHHHH Q lcl|Aclame:pro 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT----------LFQTPNAVVMNPRDWELLRL 388 (497) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~n~~~~~~l~~ 388 (497) ..........+.+...+..+... ......+|+|||.++..+ T Consensus 236 ---------------------------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~-- 286 (377) T protein:vir:96 236 ---------------------------AIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL-- 286 (377) T ss_pred ---------------------------cccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc-- Confidence 00000000011111111111111 122345799999998655 Q ss_pred HhcccCcccccccccccccccccccccccccc--eeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCc Q lcl|Aclame:pro 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP--VVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) Q Consensus 389 lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~p--vv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~ 466 (497) .|+|.|.+.. +.+.+++|+| |+.++.+|+++++||||++ |.|++|.+++|+.+++. +|.+|+ T Consensus 287 ----~~~~~~~~~~--------G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~--Y~i~~r~~~~i~~~~~~--~~~~d~ 350 (377) T protein:vir:96 287 ----EAKFTSRNQF--------GEYVTVLPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQT--FAMEDL 350 (377) T ss_pred ----cccccccCCC--------CCceeccCCCceEEecCCCCcccEEEEEcCc--EEEEEecccEEEeehhh--hhhcCC Confidence 4667776532 1223566666 6789999999999999997 88999999999999865 599999 Q ss_pred eEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 467 VTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 467 v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) +.||+.+|+|+.++||+||++++++-. T Consensus 351 ~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 351 QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred eEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999999999999999999999888 No 85 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=7e-56 Score=322.98 Aligned_cols=297 Identities=14% Similarity=0.138 Sum_probs=230.3 Q ss_pred hhhhhhhhhhh------hcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecC-------Cccce Q lcl|Aclame:pro 142 ETAPAAIGQNP------FGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA-------HNNAA 208 (497) Q Consensus 142 ~~~~~~~~~~~------~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-------~~~a~ 208 (497) ....++.+.+. .+.+..++++||+++..+||+.+++.++|+++|++++++++.+++|+.+.. ...+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 11111111111 122333455788999999999999999999999999999999999998652 24567 Q ss_pred eeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCcc---ccceec Q lcl|Aclame:pro 209 AVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG---VNGLLQ 284 (497) Q Consensus 209 ~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~---~~Gil~ 284 (497) |++||+.+|+++++|+++++.++|++++++||+|+|+|+ +++++||.++|++++++++|.+||+|+|++. |.||.+ T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~ 160 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (338) T ss_pred cccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc Confidence 888999999999999999999999999999999999997 6899999999999999999999999999764 556654 Q ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhh Q lcl|Aclame:pro 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD 364 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (497) .......+.... .........+.+..+... T Consensus 161 ~~~~~~~~~~~~--------------------------------------------------~~~~~~~~~~~~~~~~~~ 190 (338) T protein:vir:78 161 NNVIVNTTNVDY--------------------------------------------------LQTGTTPLLDRFLDGYDL 190 (338) T ss_pred cccccccccccc--------------------------------------------------ccccchhhHHHHHHHHHH Confidence 432221111100 000111234455555555 Q ss_pred hhhhhccCCceEEEehhHHHHH---HHHhcccCcccccccccccccccccccccccccceeecCCCCc---------CcE Q lcl|Aclame:pro 365 IQLTLFQTPNAVVMNPRDWELL---RLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL---------GTI 432 (497) Q Consensus 365 ~~~~~~~~~~~~~~n~~~~~~l---~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~---------~~~ 432 (497) ........+++|+|||.++..| +++||.+|+|+|.+... .+.+.+|+|+||+++++||+ +.+ T Consensus 191 ~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~------~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~ 264 (338) T protein:vir:78 191 VSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINL------AASAGDLLGLPVQFGKAVGGDLGAATDSKVRV 264 (338) T ss_pred hhhhccccceEEEEchHHHHHHHHHhhhccCCCceeeccccc------CCCCceeeeeeEEEccccCccccccCCcccEE Confidence 5555666788999999998877 45789999999977543 33457899999999999985 238 Q ss_pred EEeeccceEEEEEeccccEEEEeccc------------hhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 433 LVGHFAPSVIQTARREGVTMQMTNSN------------GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~------------~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) ++|||+. |.+.++.+++|+++++. .++|++|+++||+++|+||+|+||+||++|+-.+++.+ T Consensus 265 ~~gdfs~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 265 VGGDFSQ--LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EEEecce--EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 8999997 56889999999998764 26799999999999999999999999999998888888 No 86 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=4.2e-55 Score=318.67 Aligned_cols=361 Identities=14% Similarity=0.067 Sum_probs=224.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhH Q lcl|Aclame:pro 21 INADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNP 100 (497) Q Consensus 21 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (497) |.-+..++..+.+.+ +.+.++..+..++..+...+. +..+..+. ..+.+.+ T Consensus 1 m~ik~~~~~~~~~~e----------~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-------~~~~~~e--------- 51 (381) T protein:vir:10 1 MTINLSETFANAKNE----------FINAVNNGEPQERQNELYGDM---INQLFEET-------KLQAKAE--------- 51 (381) T ss_pred CchhhHHHHHHHHHH----------HHHHHhhhhhhHHHHHHHHHH---HHhhhhhH-------HHHHHHH--------- Confidence 111111111111111 011111101001110101100 00000000 0000000 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhh Q lcl|Aclame:pro 101 ELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELS 180 (497) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~ 180 (497) .........+... .. .+.+..+ .....+++.++|.+||+++...|++.+++.++ T Consensus 52 -~~~~~~~~~~~~~--------lt-------~~e~~~~----------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~ 105 (381) T protein:vir:10 52 -AERVSSLPKSAQS--------LS-------ANQRSFF----------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHP 105 (381) T ss_pred -HHHHHHhccCccc--------cc-------HHHHHHH----------HHHhcccCCCCceecCHHHHHHHHHHHHhhcc Confidence 0000000000000 00 0001111 11222444556667888899999999999999 Q ss_pred HHhhccceecCCCceEEEEeecCCccceeeccccccc-cccccceeEEeeeeeeeeechhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 181 LADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRL 258 (497) Q Consensus 181 l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~-~s~~~~~~v~~~~~kia~~~~iS~ell~ds~-~l~~~i~~~l 258 (497) |+++|++++++++ .++|+.++ .+.|+|++|++..+ +++++|+++++.+||++++++||+|||+|++ ++++||.++| T Consensus 106 i~~~~~v~~~~~~-~~i~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~l 183 (381) T protein:vir:10 106 LLADLGIKNAGLR-LKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQI 183 (381) T ss_pred ceeheeeEecCcc-eEEEEecC-CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHH Confidence 9999999998764 78999876 57899999988775 5689999999999999999999999999975 8999999999 Q ss_pred HHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 259 LEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA 338 (497) Q Consensus 259 a~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (497) +++++.++|.+|++|+|+++|.||++................... T Consensus 184 a~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~----------------------------------- 228 (381) T protein:vir:10 184 EEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ----------------------------------- 228 (381) T ss_pred HHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccc----------------------------------- Confidence 999999999999999999999999986543322111100000000 Q ss_pred cccccccccccchhhhhhhhHHhhhhhh------hhhccCCceEEEehhHHHHHHHHh---cccCccccccccccccccc Q lcl|Aclame:pro 339 GSGSGVAGSYPTAAEIAENVFDAFVDIQ------LTLFQTPNAVVMNPRDWELLRLTK---DANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 339 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~n~~~~~~l~~lk---d~~G~~~~~~~~~~~~~~~ 409 (497) ............+.+......+. ...+.....|+|||.++..|+.++ +++|+|+|..+ T Consensus 229 -----~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~-------- 295 (381) T protein:vir:10 229 -----GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP-------- 295 (381) T ss_pred -----cccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCC-------- Confidence 00000000000111111111111 112334457999999999998776 56788887431 Q ss_pred ccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEE Q lcl|Aclame:pro 410 VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQ 489 (497) Q Consensus 410 ~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~ 489 (497) +|.+|+.++.||+++++||||++ |.|++|.+++|+++++. +|.+|+++||+.+|+|+.++||+||++++ T Consensus 296 -------~g~~vv~s~~~p~~~iifgDfs~--Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dg~~~~~~A~~v~~ 364 (381) T protein:vir:10 296 -------FNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWK 364 (381) T ss_pred -------CCceEEecCCCCcCcEEEEeccc--EEEEEecccEEEeechh--HhhcCCeEEEEEEEEcCEEecCceEEEEE Confidence 36789999999999999999997 88999999999999876 59999999999999999999999999998 Q ss_pred ecCCCCCC Q lcl|Aclame:pro 490 LKKGATGS 497 (497) Q Consensus 490 ~~~~a~~~ 497 (497) ++...+.. T Consensus 365 l~~~~~~~ 372 (381) T protein:vir:10 365 LDLKGHKP 372 (381) T ss_pred EEecCCCc Confidence 88754433 No 87 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=4.2e-55 Score=318.67 Aligned_cols=361 Identities=14% Similarity=0.067 Sum_probs=224.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhH Q lcl|Aclame:pro 21 INADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNP 100 (497) Q Consensus 21 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (497) |.-+..++..+.+.+ +.+.++..+..++..+...+. +..+..+. ..+.+.+ T Consensus 1 m~ik~~~~~~~~~~e----------~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-------~~~~~~e--------- 51 (381) T protein:vir:95 1 MTINLSETFANAKNE----------FINAVNNGEPQERQNELYGDM---INQLFEET-------KLQAKAE--------- 51 (381) T ss_pred CchhhHHHHHHHHHH----------HHHHHhhhhhhHHHHHHHHHH---HHhhhhhH-------HHHHHHH--------- Confidence 111111111111111 011111101001110101100 00000000 0000000 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhh Q lcl|Aclame:pro 101 ELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELS 180 (497) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~ 180 (497) .........+... .. .+.+..+ .....+++.++|.+||+++...|++.+++.++ T Consensus 52 -~~~~~~~~~~~~~--------lt-------~~e~~~~----------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~ 105 (381) T protein:vir:95 52 -AERVSSLPKSAQS--------LS-------ANQRSFF----------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHP 105 (381) T ss_pred -HHHHHHhccCccc--------cc-------HHHHHHH----------HHHhcccCCCCceecCHHHHHHHHHHHHhhcc Confidence 0000000000000 00 0001111 11222444556667888899999999999999 Q ss_pred HHhhccceecCCCceEEEEeecCCccceeeccccccc-cccccceeEEeeeeeeeeechhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 181 LADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRL 258 (497) Q Consensus 181 l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~-~s~~~~~~v~~~~~kia~~~~iS~ell~ds~-~l~~~i~~~l 258 (497) |+++|++++++++ .++|+.++ .+.|+|++|++..+ +++++|+++++.+||++++++||+|||+|++ ++++||.++| T Consensus 106 i~~~~~v~~~~~~-~~i~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~l 183 (381) T protein:vir:95 106 LLADLGIKNAGLR-LKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQI 183 (381) T ss_pred ceeheeeEecCcc-eEEEEecC-CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHH Confidence 9999999998764 78999876 57899999988775 5689999999999999999999999999975 8999999999 Q ss_pred HHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 259 LEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA 338 (497) Q Consensus 259 a~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (497) +++++.++|.+|++|+|+++|.||++................... T Consensus 184 a~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~----------------------------------- 228 (381) T protein:vir:95 184 EEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ----------------------------------- 228 (381) T ss_pred HHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccc----------------------------------- Confidence 999999999999999999999999986543322111100000000 Q ss_pred cccccccccccchhhhhhhhHHhhhhhh------hhhccCCceEEEehhHHHHHHHHh---cccCccccccccccccccc Q lcl|Aclame:pro 339 GSGSGVAGSYPTAAEIAENVFDAFVDIQ------LTLFQTPNAVVMNPRDWELLRLTK---DANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 339 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~n~~~~~~l~~lk---d~~G~~~~~~~~~~~~~~~ 409 (497) ............+.+......+. ...+.....|+|||.++..|+.++ +++|+|+|..+ T Consensus 229 -----~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~-------- 295 (381) T protein:vir:95 229 -----GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP-------- 295 (381) T ss_pred -----cccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCC-------- Confidence 00000000000111111111111 112334457999999999998776 56788887431 Q ss_pred ccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEE Q lcl|Aclame:pro 410 VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQ 489 (497) Q Consensus 410 ~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~ 489 (497) +|.+|+.++.||+++++||||++ |.|++|.+++|+++++. +|.+|+++||+.+|+|+.++||+||++++ T Consensus 296 -------~g~~vv~s~~~p~~~iifgDfs~--Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dg~~~~~~A~~v~~ 364 (381) T protein:vir:95 296 -------FNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWK 364 (381) T ss_pred -------CCceEEecCCCCcCcEEEEeccc--EEEEEecccEEEeechh--HhhcCCeEEEEEEEEcCEEecCceEEEEE Confidence 36789999999999999999997 88999999999999876 59999999999999999999999999998 Q ss_pred ecCCCCCC Q lcl|Aclame:pro 490 LKKGATGS 497 (497) Q Consensus 490 ~~~~a~~~ 497 (497) ++...+.. T Consensus 365 l~~~~~~~ 372 (381) T protein:vir:95 365 LDLKGHKP 372 (381) T ss_pred EEecCCCc Confidence 88754433 No 88 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=2.5e-55 Score=319.93 Aligned_cols=303 Identities=16% Similarity=0.137 Sum_probs=232.9 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++. .. .....+.|.................+..++++||+++..+|++.+++.++|++++++ T Consensus 1 ~~~~----------------~~-~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:10 1 MEQT----------------QK-LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCCc----------------hH-HHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 0000 00 000011122222222222223334445556789999999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) +++++++++||+.++ .+.+.|++||+.+|+++++|+++++.+||+++++++|+|+++|+ +++++||.+.|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:10 64 EPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEeC-CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 999999999999876 47899999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|++ .|.||++.....+.... T Consensus 143 d~a~l~G~g~~~~~~~i~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:10 143 DEAGILNQGNNPFGKSIAQSIEKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhhcCCCCccCccccccccccceecc--------------------------------------------------- Confidence 99999999987 47777664332211110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+++..++..+.. .+..++.|+|||.+|..|+++||++|+|+|.+. .+++|+|+||+.++ T Consensus 172 -----~~~t~~~i~~~~~~l~~-~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~----------~~~~l~G~PV~~~~ 235 (324) T protein:vir:10 172 -----GDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDTLDGLPVVNLK 235 (324) T ss_pred -----ccCCHHHHHHHHHhhhh-ccCCCCEEEEcHHHHHHHHHhhccCCceeecCC----------CCccccceeEEeec Confidence 01123444555555544 345667899999999999999999999998542 34579999999988 Q ss_pred CCC--cCcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LIP--LGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~~--~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .++ .+.+++|||+. +.++++.+++|+++++.. ++|++|+++||+++|+|+.|.+|+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEeccc--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEec Confidence 765 55699999998 446788999999987643 57999999999999999999999999999976 Q ss_pred CCCCCC Q lcl|Aclame:pro 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) ++.+-. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:10 314 DKKTDS 319 (324) T ss_pred cCCCCC Confidence 666543 No 89 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=4.2e-55 Score=318.70 Aligned_cols=303 Identities=16% Similarity=0.144 Sum_probs=229.2 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +++..+ . . ...+.|.................+..++++||+++..+|++.+++.++|++++++ T Consensus 1 ~~~~~~---------~-------~-~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:96 1 MEQTQK---------L-------K-LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcchh---------h-------h-HHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcce Confidence 000000 0 0 0011121111111111122223344556789999999999999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~ 266 (497) ++++++.++||++++. +.+.||+||+.+|+++++|+++++.++|++++++||+|||+|+ +++++||.++|++++++++ T Consensus 64 ~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:96 64 EPMEGTEKKFTFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred eeccCCceEEEEEecC-cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999874 6799999999999999999999999999999999999999997 6899999999999999999 Q ss_pred HhhhhccCCCcc-ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccc Q lcl|Aclame:pro 267 EVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVA 345 (497) Q Consensus 267 d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (497) |.++|+|+|.+. |.|+.+.......... T Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------- 171 (324) T protein:vir:96 143 DEAGILNQGNNPFGKSIAQSIKKTNKVIK--------------------------------------------------- 171 (324) T ss_pred HHHhhhcCCCCCcCccccccccccceecc--------------------------------------------------- Confidence 999999999864 6666554322211110 Q ss_pred ccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 346 GSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 346 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .....+++..++..+.. .+..++.|+|||.+|..|+++||++|+|++.+. .+++|+|+||++++ T Consensus 172 -----~~~~~~~i~~~~~~i~~-~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~----------~~~~l~G~PV~~~~ 235 (324) T protein:vir:96 172 -----GDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLK 235 (324) T ss_pred -----cccchHHHHHHHHhhhh-ccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCC----------CCCcccceeeEeec Confidence 01123444555555444 345677899999999999999999999998542 34589999999977 Q ss_pred CC--CcCcEEEeeccceEEEEEeccccEEEEeccch------------hhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 426 LI--PLGTILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 426 ~~--~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .. +.+.+++|||+. +.++++.+++|+++++.. ++|++|+++||+++|+||.|.+|+||++|+.. T Consensus 236 ~~~~~~~~~~~gd~s~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 236 SSNLKRGELITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred CCCCCcceEEEEecce--EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecc Confidence 65 456799999997 446788999999987643 67999999999999999999999999999855 Q ss_pred CCCCCC Q lcl|Aclame:pro 492 KGATGS 497 (497) Q Consensus 492 ~~a~~~ 497 (497) ...+.. T Consensus 314 ~~~~~~ 319 (324) T protein:vir:96 314 DKRTDS 319 (324) T ss_pred cccCCC Confidence 544443 No 90 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=2.1e-54 Score=314.84 Aligned_cols=361 Identities=15% Similarity=0.088 Sum_probs=222.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhH Q lcl|Aclame:pro 21 INADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNP 100 (497) Q Consensus 21 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (497) |.-|..+++.+.+.++.+ .++..+...+..+.++.. ...+.++.. ...+. T Consensus 1 m~~kl~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~---~~~~~~~~~-------~~~~~---------- 50 (381) T protein:vir:10 1 MTINLSETFANAKNEFIN----------AVNNGEPQERQNELYGDM---INQLFEETK-------LQAKA---------- 50 (381) T ss_pred CchhHHHHHHHHHHHHHH----------HHHhhhHHHHHHHHHHHH---HHhhhhhHH-------HHHHH---------- Confidence 211122222222211111 111000000000000000 000000000 00000 Q ss_pred HHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhh Q lcl|Aclame:pro 101 ELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELS 180 (497) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~ 180 (497) +.........+.. .. ..+.+..+ .....++...+|.+||+++...|++.+.+.++ T Consensus 51 e~~~~~~~~~~~~--------~l-------~~~e~~~~----------~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~sp 105 (381) T protein:vir:10 51 EAERVSSLPKSAQ--------TL-------SANQRNFF----------MDINKSVGYKEEKLLPEETIDRIFEDLTTNHP 105 (381) T ss_pred HHHHHHHhccccc--------cc-------CHHHHHHH----------HHHhhcCCCCCceecCHHHHHHHHHHHHhhcc Confidence 0000000000000 00 00001100 11223344455667778899999999999999 Q ss_pred HHhhccceecCCCceEEEEeecCCccceeecccccc-ccccccceeEEeeeeeeeeechhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 181 LADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTY-PFSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRL 258 (497) Q Consensus 181 l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~-~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~-~l~~~i~~~l 258 (497) ||++|+++++++ ..++|+.++ .+.+.|++|++.. ++++|+|+++++.+||++++++||+|||+|++ ++++||..+| T Consensus 106 ir~~a~v~~~~~-~~~i~~~~~-~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~l 183 (381) T protein:vir:10 106 LLADLGIKNAGL-RLKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQI 183 (381) T ss_pred eeeeeeeEecCc-ceEEEeecC-CcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHH Confidence 999999999865 468898876 4789999998775 46789999999999999999999999999985 8999999999 Q ss_pred HHHHHHHHHhhhhccCCCccccceeccccccccchhhhhh---hHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhc Q lcl|Aclame:pro 259 LEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLF---GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT 335 (497) Q Consensus 259 a~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (497) +++++.++|.+|++|+|+++|.||++.............. ........+.. T Consensus 184 a~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~-------------------------- 237 (381) T protein:vir:10 184 EEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPR-------------------------- 237 (381) T ss_pred HHHHHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccccchh-------------------------- Confidence 9999999999999999999999999754332211111000 00000000000 Q ss_pred ccccccccccccccchhhhhhhhHHhh---hhhhhhhccCCceEEEehhHHHHHHHHh---cccCccccccccccccccc Q lcl|Aclame:pro 336 GAAGSGSGVAGSYPTAAEIAENVFDAF---VDIQLTLFQTPNAVVMNPRDWELLRLTK---DANGQYMGGNFFGNAYGNP 409 (497) Q Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~n~~~~~~l~~lk---d~~G~~~~~~~~~~~~~~~ 409 (497) .....+..++... .......+.....|+|||.++..|+.++ +++|+|+|..+ T Consensus 238 --------------~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp-------- 295 (381) T protein:vir:10 238 --------------ATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP-------- 295 (381) T ss_pred --------------hHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCC-------- Confidence 0000000000000 0011112334557999999999998655 78899987542 Q ss_pred ccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEE Q lcl|Aclame:pro 410 VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQ 489 (497) Q Consensus 410 ~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~ 489 (497) +|.||+.+++||+++++||||++ |.|++|.+++|+++++. +|.+|+++||+..|+|+.++||+||++++ T Consensus 296 -------~g~~vv~~~~~p~~~i~fGDfs~--Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dG~~~~~~A~~v~~ 364 (381) T protein:vir:10 296 -------FNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWK 364 (381) T ss_pred -------CCceeEEcCCCCcCcEEEEEccc--EEEEEecccEEEeechh--hhhcCceEEEEEEEEcCEEecCCcEEEEE Confidence 47899999999999999999997 88999999999999876 59999999999999999999999999999 Q ss_pred ec-----CCCCCC Q lcl|Aclame:pro 490 LK-----KGATGS 497 (497) Q Consensus 490 ~~-----~~a~~~ 497 (497) ++ .+-..+ T Consensus 365 l~~~~~~~~~~~~ 377 (381) T protein:vir:10 365 LDLKGHKPALEDT 377 (381) T ss_pred EeecCCccccccc Confidence 96 333333 No 91 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.2e-54 Score=312.27 Aligned_cols=374 Identities=16% Similarity=0.123 Sum_probs=223.0 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |--..++.++...+.+..+++ .+ .++.. ...++..+.++ +.++++..++.+. T Consensus 1 mt~~~~~~e~~~~~~e~~~~~---------------~~---~~~~~-------~~~e~~~~~~~---~~~~~~~~~~~~~ 52 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQF---------------AN---LVQNG-------ASDEEQSKAFG---AMFDALSNDLQEE 52 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH---------------HH---HHhhh-------hhHHHHHHHHH---HHHHHHHHHHHHH Confidence 443333333332221111111 10 00000 00000001111 1111111111100 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ... ..+.. .........++.+ ... .+.+..+ .....++..++| T Consensus 53 ~~~---e~~~~---------~~~~~~~~~r~~~--------~l~-------~ee~~~~----------~~~~~~t~~~gG 95 (395) T protein:vir:95 53 ITA---EINNR---------VVDNGILAKRSQD--------PLT-------SEERKFF----------NDINYDVGYTDE 95 (395) T ss_pred HHH---HHHHH---------HHHHHHHhhcCcc--------ccc-------hHHHHHH----------HHHhhccCCCCc Confidence 000 00000 0000000000000 000 0001100 111223445566 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecccccc-ccccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTY-PFSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~-~~s~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++...|++.+++.++|+++|+++++++ ..++|+.++ .+.+.|++|++.. ++++++|+++++.+||++++++| T Consensus 96 ~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~i 173 (395) T protein:vir:95 96 KILPETVVERVFDDLQKDHPLLSKINFQNAGI-KTRVIKADP-AGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVL 173 (395) T ss_pred eeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-CcceEEeecccccCccccccceeeeeceeeEEEeecc Confidence 67778889999999999999999999999976 478998776 4789999987665 57899999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc--cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcch Q lcl|Aclame:pro 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP--GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTN 316 (497) Q Consensus 240 S~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~--~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (497) |+|||+|+ +++++||.++|+++++.++|.+||+|+|++ +|.||++.....+............... .......... T Consensus 174 S~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~-~~~~~~~~l~ 252 (395) T protein:vir:95 174 PDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFA-DADTTILELN 252 (395) T ss_pred cHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccchhhhh-hhHhhHHHHH Confidence 99999998 589999999999999999999999999997 5999998655443322211111110000 0000000000 Q ss_pred hhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 317 GAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) .. ... ...........+.....|+|||.++. |..|+| T Consensus 253 ~~------------~~~-------------------------~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~ 289 (395) T protein:vir:95 253 DV------------LKN-------------------------LSVDEKGKELKIDGKVALVVNPRDSW------DVQARY 289 (395) T ss_pred HH------------HHh-------------------------hccccccchhhhcCceEEEEcchhhh------hcCCcc Confidence 00 000 00000001122334557999998864 567999 Q ss_pred cccccccccccccccccccc--cccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEee Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNI--WGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEER 474 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l--~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r 474 (497) +|++..+ .+.++ +|+||+.+++||+++++||||++ |.|++|.+++|+++++. +|.+|++.||+..| T Consensus 290 ~~~~~~G--------~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~--y~i~~r~~~~i~~~~~~--~~~~d~~~f~~~~r 357 (395) T protein:vir:95 290 TYLTANG--------GFVTVLPYNVTIITSEFVPEGKLVAFVTDR--YNAVRGGGLTVKKFDQT--LALEDAVLFTAKTF 357 (395) T ss_pred eeccCCC--------cceeccCCcceEEEcCCCCCCcEEEEeccc--EEEEEecceEEEeccch--hhhCCcEEEEEEEE Confidence 9987421 22345 46779999999999999999997 78999999999999865 59999999999999 Q ss_pred eccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 475 LGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 475 ~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +|++++|++||+.|+++.+-..- T Consensus 358 ~dg~~~~~~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 358 AYGQPDDNKASAVYDLKVASAPR 380 (395) T ss_pred ECCEEeccccEEEEEeeccCCCC Confidence 99999999999999987322211 No 92 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=2.3e-55 Score=320.09 Aligned_cols=285 Identities=15% Similarity=0.069 Sum_probs=218.6 Q ss_pred hhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeee Q lcl|Aclame:pro 152 PFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVG 231 (497) Q Consensus 152 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~ 231 (497) +.+.++++|++||+++..+|++.+++.++|++++++++++++.++||+.++. +.|+||+||+.+|+++++|+++++.+| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~-~~a~wv~Eg~~~~~~~~~f~~v~l~~~ 79 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGR-PKAEFVGEGQQKSSTTGEFDFVTSTPK 79 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCC-ceeEEeecCcccccccceeeEEEEeeE Confidence 4455667778888899999999999999999999999999988999999874 689999999999999999999999999 Q ss_pred eeeeechhhHHHHh---hH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHH Q lcl|Aclame:pro 232 KVANALTITDEGLR---DA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSN 307 (497) Q Consensus 232 kia~~~~iS~ell~---ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~ 307 (497) |+++++++|+|||+ |+ .++++||.++|++++++++|.++|+|+|++.++++.............. T Consensus 80 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~----------- 148 (311) T protein:vir:99 80 KAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRV----------- 148 (311) T ss_pred EEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccccccee----------- Confidence 99999999999994 44 6899999999999999999999999999776555443222111100000 Q ss_pred HhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhh-hhccCCceEEEehhHHHHH Q lcl|Aclame:pro 308 VKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL-TLFQTPNAVVMNPRDWELL 386 (497) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~n~~~~~~l 386 (497) ............++..++..... ...+.+++|+|||.++..| T Consensus 149 -------------------------------------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L 191 (311) T protein:vir:99 149 -------------------------------------ELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGL 191 (311) T ss_pred -------------------------------------eccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHH Confidence 00000000001111112221111 1234456799999999999 Q ss_pred HHHhcccCcccccccccccccccccccccccccceeecCCCCcC----------------cEEEeeccceEEEEEecccc Q lcl|Aclame:pro 387 RLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----------------TILVGHFAPSVIQTARREGV 450 (497) Q Consensus 387 ~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~----------------~~~~gd~~~~~~~i~~r~~~ 450 (497) +++||++|||+|++.... ..+.+|+|+||++++.+|.+ .+++|||+++ +.+.++.++ T Consensus 192 ~~lkd~~G~~l~~~~~~~------~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~-~~~~~~~~~ 264 (311) T protein:vir:99 192 STARYTDGRKKFPELGLG------IGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANG-IHWGVQRDI 264 (311) T ss_pred HhhhccCCCeeecCcccC------CCCceecceeeEeecccccccccccccchhhccCcceEEEeecccc-EEEEEecCc Confidence 999999999999875533 23468999999999988632 2578999984 567788999 Q ss_pred EEEEeccch-----hhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 451 TMQMTNSNG-----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 451 ~i~~~~~~~-----~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) +++++++.. ++|++|+++||+++|+||.|.|| +|++++-++| T Consensus 265 ~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 265 PVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred eEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 999877643 56999999999999999999997 6777776666 No 93 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=2.5e-55 Score=319.92 Aligned_cols=284 Identities=12% Similarity=0.049 Sum_probs=221.6 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccc-----ccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGT-----YPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~-----~~~s~~~~~~ 225 (497) ++.++++++|.+||+++..+|++.+++.++|++++++++++++++++|+.+.. +.+.||+||+. +|.++++|++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~-~~a~wv~E~~~~~~~~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATL-PEADWVGESATDPKGVKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCC-cceEEeecccccccccccccccceee Confidence 66677777788899999999999999999999999999999999999999874 78999999975 5667899999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++++||++++++||+||++|+ +++++||+++|++++++++|.+|++|+|.+.+.+.............. T Consensus 80 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~--------- 150 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQA--------- 150 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccc--------- Confidence 9999999999999999999997 689999999999999999999999999875433222111110000000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ...........+..+.+..+.... ...++..+.|+|||.++. T Consensus 151 -------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~ 192 (305) T protein:vir:25 151 -------------------------------------VEVVGGVANESDIVGATNRAAKAV-ASAGWAPDTLLSSLALRY 192 (305) T ss_pred -------------------------------------ccccccchhhhHHHHHHHHHHHhh-hhcccccceeEecHHHHH Confidence 000000001111222233332222 233455667999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCc----CcEEEeeccceEEEEEeccccEEEEeccc-- Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL----GTILVGHFAPSVIQTARREGVTMQMTNSN-- 458 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~----~~~~~gd~~~~~~~i~~r~~~~i~~~~~~-- 458 (497) .|+++||++|+|+|++ .+|+|+||++++++|. +.+++|||++ |.++++.+++|+++++. T Consensus 193 ~l~~lkd~~G~~i~~~-------------~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~--~~i~~~~~~~i~~~~~~~~ 257 (305) T protein:vir:25 193 EVANIRDANGNPVFRD-------------DSFAGFRTFFNRNGAWDADAAIEVIADSSR--VKIGVRQDITVKFLDQATL 257 (305) T ss_pred HHHHhhccCCceeecC-------------CcccccceEEcCccCCCCCccEEEEEecce--EEEEEecCeEEEEeeeeee Confidence 9999999999999965 3799999999999874 3689999997 55788999999888754 Q ss_pred ------hhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 459 ------GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 459 ------~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .++|++|+++||++.|+||.|.||+||++++....+.=+ T Consensus 258 ~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~ 302 (305) T protein:vir:25 258 GTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) T ss_pred ecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccC Confidence 257999999999999999999999999999987554322 No 94 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=4.8e-55 Score=318.37 Aligned_cols=277 Identities=16% Similarity=0.111 Sum_probs=218.4 Q ss_pred ccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeee Q lcl|Aclame:pro 155 STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) Q Consensus 155 ~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia 234 (497) -+.++|.++|+++..+||+.+++.++|++++++++++++.++||+.++. +.|+||+||+.+|+++++|+++++.+||++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMD-SEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecC-cceEEeeCCccccccccceeEEEEeeeEEE Confidence 2334577899999999999999999999999999999999999999874 679999999999999999999999999999 Q ss_pred eechhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHhhhhccCCC--ccc---cceeccccccccchhhhhhhHHHHHH Q lcl|Aclame:pro 235 NALTITDEGLR---D-APELFNFVQGRLLEGIQRKEEVQLLAGGGY--PGV---NGLLQRSTGFTASSASSLFGATSATV 305 (497) Q Consensus 235 ~~~~iS~ell~---d-s~~l~~~i~~~la~~~~~~~d~a~l~G~g~--~~~---~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) +++++|+|+|+ | ..+++++|.++|++++++++|.++++|++. +.+ .|+.......+. T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-------------- 145 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ-------------- 145 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccc-------------- Confidence 99999999995 2 357999999999999999999999999532 221 111111100000 Q ss_pred HHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHH Q lcl|Aclame:pro 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 385 (497) ............+++..++..+..+ +..+.+|+|||.+|.. T Consensus 146 --------------------------------------~~~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~vmn~~~~~~ 186 (298) T protein:vir:94 146 --------------------------------------KVEAPRGIADPNGAIENAVELLTGV-DADVTGIAINPSFRSA 186 (298) T ss_pred --------------------------------------ccccccccccHHHHHHHHHHhhhhc-CCCccEEEEcHHHHHH Confidence 0000011112234455555554443 4566789999999999 Q ss_pred HHHHhcccCcccccccccccccccccccccccccceeecCCCCcC------cEEEeeccceEEEEEeccccEEEEeccc- Q lcl|Aclame:pro 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSN- 458 (497) Q Consensus 386 l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------~~~~gd~~~~~~~i~~r~~~~i~~~~~~- 458 (497) |+++||++|+|+|++.... ..+++|||+||++++++|.+ .+++|||+++ +.++.|.++++++.++. T Consensus 187 l~~lkd~~G~~l~~~~~~~------~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~-~~~~~~~~~~~~~~~~~~ 259 (298) T protein:vir:94 187 LAKQKDLQGNALFPELKWG------ATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANG-FKWGYAKEVPLEVIQYGD 259 (298) T ss_pred HHHhhccCCCeeecCcccC------CCCceecceeeEEecccccccCCCccEEEEeeccce-EEEEEecCceEEEeecCC Confidence 9999999999999875432 34468999999999999864 4889999985 45667889999887653 Q ss_pred -----hhhhhcCceEEEEEeeeccEeecccceEEEEecC Q lcl|Aclame:pro 459 -----GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 459 -----~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~ 492 (497) .++|++|+++||++.|+||++.||+||++|+-.+ T Consensus 260 ~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 2479999999999999999999999999998666 No 95 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=4.3e-54 Score=313.13 Aligned_cols=348 Identities=14% Similarity=0.089 Sum_probs=220.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhh Q lcl|Aclame:pro 26 TKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNA 105 (497) Q Consensus 26 ~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (497) .+++.+++++.+.+ .++.+.++++++.++.+.+.... ....... ......+.. T Consensus 1 ~eei~~l~~~~~~l--------------------~~~~~~l~~~~d~~e~e~~~~~~----~~~~~~~---~~~~~~~~~ 53 (352) T protein:vir:78 1 MEDIKQLETEKAGL--------------------QQRFNIVERQVQDIEEKEKAKVK----DKGEAYQ---SLNDNEKLV 53 (352) T ss_pred ChhHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHhh----hcccccc---ccchhhhHH Confidence 11111111111111 11111111111111111000000 0000000 000000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhc Q lcl|Aclame:pro 106 TSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLI 185 (497) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~ 185 (497) . .+..... .. ........... ...........+++..+|.+||+++..+||+.+++.++|++++ T Consensus 54 ~------~~~~~~r----~~---~~~~~~~~~~~---~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~ 117 (352) T protein:vir:78 54 K------AKAEFYR----HA---ILPNEFEKPSM---EAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKA 117 (352) T ss_pred H------HHHHHHH----HH---hhhhHHHHHHh---hHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhhe Confidence 0 0000000 00 00000000000 0111111222233444455666678899999999999999999 Q ss_pred cceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 186 SSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQR 264 (497) Q Consensus 186 ~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~ 264 (497) +++++++ ..+|+.+...+.+.||+||+.+|+++++|+++++.+||++++++||+|||+|+ +++++||.++|+++++. T Consensus 118 ~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~ 195 (352) T protein:vir:78 118 RLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAA 195 (352) T ss_pred eeEecCC--ceEEEEecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 9988765 46777766556799999999999999999999999999999999999999997 68999999999999999 Q ss_pred HHHh-hhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccc Q lcl|Aclame:pro 265 KEEV-QLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSG 343 (497) Q Consensus 265 ~~d~-a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 343 (497) +++. .|.+|+|+++|.|+++..+....+.. T Consensus 196 ~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~------------------------------------------------- 226 (352) T protein:vir:78 196 KERKDALAVSPKSGLEHMSFYNGSVKEVEGA------------------------------------------------- 226 (352) T ss_pred HHHHhhhhcCCCCcccccceecccccccccc------------------------------------------------- Confidence 8655 57789999999998876443221110 Q ss_pred ccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceee Q lcl|Aclame:pro 344 VAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVT 423 (497) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~ 423 (497) ...+.+..++..+...+ .....|+||+.++..|.++++.+|+|+|.+. +.+|+|+||++ T Consensus 227 ---------~~~d~i~~~~~~l~~~~-~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~~-----------~~~llG~PV~~ 285 (352) T protein:vir:78 227 ---------NMYDAIINALADLHEDY-RDNATIYMRYADYVKIISVLSNGTTNFFDTP-----------AEKVFGKPVVF 285 (352) T ss_pred ---------chHHHHHHHHhccChhh-hcCCEEEEehHHHHHHHHHHhccCCcccccC-----------CccccccceEE Confidence 12344555555554443 4456899999999999999999999998542 34799999999 Q ss_pred cCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 424 TPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 424 s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++.++ +++||||+.+ |.. +.++.++..++. .++++.|++..|+|++|+||+||+++++++++.+. T Consensus 286 ~~~~~--~~~~Gdf~~~-~~~--~~~~~~~~~~~~----~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~ 350 (352) T protein:vir:78 286 TDAAV--KPIVGDFNYF-GIN--YDGTTYDTDKDV----KKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSL 350 (352) T ss_pred ecCCC--ceeEeehhhh-hhh--hhhheeeeeccc----cCCeeEEEEEeeeCceeechhheEEEEeecccCCC Confidence 99765 5899999974 433 445666555443 37899999999999999999999999999888877 No 96 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=8.8e-55 Score=316.93 Aligned_cols=281 Identities=15% Similarity=0.148 Sum_probs=229.3 Q ss_pred hhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeeccccccccccc Q lcl|Aclame:pro 143 TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 143 ~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~ 221 (497) -...........+++++|++||+++..+|++.+++.++|++++++++++++. ..+|+.++ .+.+.|++||+.+|++++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~Eg~~~~~~~~ 79 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTD-GISAYWVNETEKIKTDKP 79 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcC-CceeEEeecCcccccccc Confidence 1111122233445566778899999999999999999999999999997764 56777665 467999999999999999 Q ss_pred cceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhH Q lcl|Aclame:pro 222 EFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 222 ~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) +|+++++.++|+++++++|+|+++|+ +++++||.++|++++++++|.++|+|+|++.|.||++.......... T Consensus 80 ~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~------ 153 (297) T protein:vir:95 80 EVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIG------ 153 (297) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecc------ Confidence 99999999999999999999999997 68999999999999999999999999999999999876443221111 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) .....+++..++..+... +..+++|+||| T Consensus 154 --------------------------------------------------~~~t~~~i~~~~~~l~~~-~~~~~~~v~~~ 182 (297) T protein:vir:95 154 --------------------------------------------------GPINYDNILKLQDALYDA-DVEPNAFVSKI 182 (297) T ss_pred --------------------------------------------------cccCHHHHHHHHHHhhhc-cCCcCEEEEcH Confidence 011234455555555444 44567899999 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccccccccccceeecCC--CCcCcEEEeeccceEEEEEeccccEEEEeccc Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) .++..|+++||++|+|+|.+. ..+|+|+||+.+.. ++.+++++|||++ +.++++++++++++++. T Consensus 183 ~~~~~L~~l~d~~G~~i~~~~-----------~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~--~~~~~~~~~~i~~~~~~ 249 (297) T protein:vir:95 183 QNRSALREARDGNKVSIYDKA-----------ANTIDGITTVDLKSARFEKGDLLAGDFDN--LIYGVPYNITYKISEEG 249 (297) T ss_pred HHHHHHHHhhccCCceeecCC-----------CCcccceeeEeecCCCCCCceEEEEeccc--EEEEEecCeEEEEeecc Confidence 999999999999999999653 24799999998654 5678899999998 44778899999988765 Q ss_pred h------------hhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 459 G------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 459 ~------------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) . ++|++|+++||+++|+|++|++|+||++|+..+.. T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 250 QISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 3 56999999999999999999999999999977777 No 97 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1.6e-54 Score=315.45 Aligned_cols=274 Identities=16% Similarity=0.167 Sum_probs=222.4 Q ss_pred hhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCc--eEEEEeecCCccceeecccccccc-ccccc Q lcl|Aclame:pro 147 AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPF-SSEEF 223 (497) Q Consensus 147 ~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-s~~~~ 223 (497) ..+.+..++++++|.+||+++..+||+.+++.++|+++++++++++.+ +.+|+.....+.++||+||+.+|+ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 333444445555556778888899999999999999999999987655 556666555577999999999997 57999 Q ss_pred eeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 224 ARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 224 ~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) +++++++||+++++++|+|+++|+ +++++||.++++++++.++|.+|++|.|...+.+ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~--------------------- 139 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKP--------------------- 139 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccc--------------------- Confidence 999999999999999999999998 6899999999999999999999999887532110 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhH Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 382 (497) .....+++..++..+... +.....|+|||.+ T Consensus 140 ------------------------------------------------~~~~~d~i~~~~~~l~~~-~~~~a~~vmn~~~ 170 (293) T protein:vir:48 140 ------------------------------------------------TLTKWDDIIDLEAKVDPA-IKQTSFFLTNTSG 170 (293) T ss_pred ------------------------------------------------cccCHHHHHHHHHhhhhh-hcCCCEEEEcHHH Confidence 001123445555555444 4456689999999 Q ss_pred HHHHHHHhcccCcccccccccccccccccccccccccceeecCC--CCc-----CcEEEeeccceEEEEEeccccEEEEe Q lcl|Aclame:pro 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPL-----GTILVGHFAPSVIQTARREGVTMQMT 455 (497) Q Consensus 383 ~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~--~~~-----~~~~~gd~~~~~~~i~~r~~~~i~~~ 455 (497) +..|+++||++|+|+|.+.... +.+++|+|+||++++. +|. ..++||||++ +|.+++|.+++++++ T Consensus 171 ~~~L~~lkd~~g~~l~~~~~~~------~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~ 243 (293) T protein:vir:48 171 FTALKKVKNALGDYLMERDVKS------PTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQ-AVTLFDRQQMSLLST 243 (293) T ss_pred HHHHHHhhccCCceEeecCcCC------CCCceecceeeEEecccccCCccCCceEEEEEeccc-eEEEEEecceEEEEe Confidence 9999999999999999886433 3456899999987543 332 2489999998 478999999999999 Q ss_pred ccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++.+++|++|++.||+++|+|+.+++|+||++++++++++.- T Consensus 244 ~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~ 285 (293) T protein:vir:48 244 NIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 285 (293) T ss_pred cccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCC Confidence 998889999999999999999999999999999988877665 No 98 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=4.2e-53 Score=307.74 Aligned_cols=363 Identities=13% Similarity=0.038 Sum_probs=216.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |+ .+++.+...+.++.+.+.+....+ +..++..+.+++ ..+.+..++.+ T Consensus 1 M~--~kl~~~~~~~~e~~~~l~~~~~~~-------------------------~~~~~~~~~~~~---~~~~~~~~~~~- 49 (383) T protein:vir:78 1 MT--IKLKNNLANYEEKRTAFVNAVKNE-------------------------DTQEIQNKAYVE---MVDAMAADIME- 49 (383) T ss_pred Cc--hhHHHHHHHHHHHHHHHHHHHhcc-------------------------ChHHHHHHHHHH---HHHHHHHHHHH- Confidence 33 344444433333333222111100 000000000000 00011100000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.+..... ..........+.. ....+.+..+ .....+++.++| T Consensus 50 ------~~~~~~~~------~~~~~~~~~~g~~---------------~lt~~e~~~~----------~~~~~~~~~~gg 92 (383) T protein:vir:78 50 ------QAKKEARQ------EADAYISASRTDK---------------NITNEEIKFF----------NDINKEVGYKEE 92 (383) T ss_pred ------HHHHHHHH------HHHHHHHhcCChh---------------hhhHHHHHHH----------HHHhccCCCCCc Confidence 00000000 0000000000000 0000001110 112234455666 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccc-cccccceeEEeeeeeeeeechh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTI 239 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~-~s~~~~~~v~~~~~kia~~~~i 239 (497) .+||+++...|++.+.+.++|+++|++++++++ .++|+.++ .+.+.|++|++..+ +++++|+++++.+||++++++| T Consensus 93 ~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~-~~i~~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~i 170 (383) T protein:vir:78 93 TLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLR-TKFLKSET-SGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVV 170 (383) T ss_pred cccCHHHHHHHHHHHHhhccceeeeeeEecCCc-eEEEEEcC-CcceEEeecccccccccCcceeeEeecceeeEeeccc Confidence 778888999999999999999999999998765 68999876 47899999987764 6789999999999999999999 Q ss_pred hHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhh---------hHHHHHHHHHh Q lcl|Aclame:pro 240 TDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLF---------GATSATVSNVK 309 (497) Q Consensus 240 S~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~---------~~~~~~~~~~~ 309 (497) |+|||+|+ .++++||.++++++++.++|.+|++|+|+++|.||++.............. ........... T Consensus 171 s~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 250 (383) T protein:vir:78 171 PKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNEL 250 (383) T ss_pred hHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHH Confidence 99999998 589999999999999999999999999999999999754322211111000 00000000000 Q ss_pred hhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHH-- Q lcl|Aclame:pro 310 FPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLR-- 387 (497) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~-- 387 (497) ........+. .............|+|||.++..+. T Consensus 251 ~~~~~~~~~~-------------------------------------------~~~~~~~~~~~~~~~~n~~~~~~~~~~ 287 (383) T protein:vir:78 251 TDVYKYHSVK-------------------------------------------ENGHPLNVAGKVTLLVNPTDAWDVKKQ 287 (383) T ss_pred HHHHhccchh-------------------------------------------cccchhhhcCceEEEEcCcchhhhccc Confidence 0000000000 0000000111235888887654332 Q ss_pred -HHhcccCccccccccccccccccccccccccc--ceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 388 -LTKDANGQYMGGNFFGNAYGNPVNGGKNIWGV--PVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 388 -~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~--pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) ..++.+|+|+ +++|+ +|+.++++|+++++||||++ |.|++|.+++|+++++. +|.+ T Consensus 288 ~~~~~~~G~~~-----------------t~l~~~~~iv~s~~~p~~~iifgdfs~--Y~i~~r~~~~i~~~~~~--~f~~ 346 (383) T protein:vir:78 288 YTSLNANGVYV-----------------TALPFNLNIIESLFVPEKKAISYVAER--YDALIGGPLDIGTYDQT--LAIE 346 (383) T ss_pred hhccCCCCcee-----------------eecCCCceEEecCCCCcccEEEeeccc--eEEEecccceEEecchh--hhhc Confidence 2233444432 44444 47889999999999999997 78999999999988765 6999 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |++.||+..|+|+.++||+||++++++.+.+.. T Consensus 347 d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~ 379 (383) T protein:vir:78 347 DLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQ 379 (383) T ss_pred CceEEEEEEEEcCEEecCCeEEEEEEEecCCCC Confidence 999999999999999999999999999777666 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.1e-41 Score=245.13 Aligned_cols=292 Identities=10% Similarity=0.001 Sum_probs=221.1 Q ss_pred HhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee-cCCCceEEEEeecC---Cccceeecccc Q lcl|Aclame:pro 139 ADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-VTSPNLSYLTESAA---HNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~p~~~~~---~~~a~~v~Eg~ 214 (497) .+..+........ .+.+..+||+++|+....+|+.+.+.+++++++++++ +++....+|+...+ .+.+.|.+|.. T Consensus 1 ~~~~~~~~~~~k~-it~~d~~gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~ 79 (314) T protein:vir:41 1 MDFLNKPFQITPK-IDVPDLGKGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKV 79 (314) T ss_pred CchhhhHHHhhcc-cccccCCCceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCc Confidence 1111111112222 3444556778888877899999999999999999985 46777888876432 23456778888 Q ss_pred ccccccccceeEEeeeeeeeeechhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHhhhhccCCCc--------ccccee Q lcl|Aclame:pro 215 TYPFSSEEFARVYEQVGKVANALTITDEGLRDAP---ELFNFVQGRLLEGIQRKEEVQLLAGGGYP--------GVNGLL 283 (497) Q Consensus 215 ~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~---~l~~~i~~~la~~~~~~~d~a~l~G~g~~--------~~~Gil 283 (497) ..++++++|+++.+.+||+...+.||+|+|+|+. +|+++|...++++++..++.+|++|+|+. +|.||+ T Consensus 80 ~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l 159 (314) T protein:vir:41 80 APTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWM 159 (314) T ss_pred cCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhh Confidence 8899999999999999999999999999999984 79999999999999999999999999852 577887 Q ss_pred ccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhh Q lcl|Aclame:pro 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV 363 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (497) +.++........ .......+.+..++. T Consensus 160 ~~a~~~~~~~~~-----------------------------------------------------~~~~~~~~~~~~l~~ 186 (314) T protein:vir:41 160 KLAGNQYTDAEP-----------------------------------------------------EDENWPLNLFDGMMD 186 (314) T ss_pred hhcccceeecCc-----------------------------------------------------cccccHHHHHHHHHH Confidence 754322111000 001112334455566 Q ss_pred hhhhhhccC--CceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCC-----cCcEEEee Q lcl|Aclame:pro 364 DIQLTLFQT--PNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-----LGTILVGH 436 (497) Q Consensus 364 ~~~~~~~~~--~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~-----~~~~~~gd 436 (497) .+...++.+ ..+|+||+.+...++++++.+|+++|.+... .+.+.+|+|+||+.++.|| ++.++||| T Consensus 187 sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~------~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd 260 (314) T protein:vir:41 187 ELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALI------GATGLQYDGIPIQYVPALDALGDDKARALLTV 260 (314) T ss_pred hcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhh------CCCCceecceeeEecccccccCCCCceEEEec Confidence 666665543 3479999999999999999999999987543 2345579999999999874 57799999 Q ss_pred ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 437 ~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) |+... ++.+..++++..+.. .++++.|.+..|+|+.+.+++|.|+..+..+..| T Consensus 261 ~~nlv--~~~~~~ir~~~~~~a----~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 261 PTNLV--YGFWRNIRIEPKRDA----AMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred hhheE--EEeeceeEEeecccC----cCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 99843 455667777666544 4889999999999999999999999999999888 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=2e-41 Score=243.70 Aligned_cols=295 Identities=10% Similarity=-0.054 Sum_probs=212.2 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee-cCCCceEEEEeecC-- Q lcl|Aclame:pro 127 PGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-VTSPNLSYLTESAA-- 203 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~p~~~~~-- 203 (497) .-. ....+ ....... ....+.++.+|++++|+....+|+.+.+.++++++|++++ +++....++..... T Consensus 1 ~~~-~~~~~-----~~~~~~~--~k~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~ 72 (315) T protein:vir:41 1 MLT-IEDIR-----GGKPFEI--VPKIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLD 72 (315) T ss_pred Ccc-cchhh-----cCChhhh--hhhcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcc Confidence 000 00000 0001111 1223445567888999988899999999999999999865 44444555443211 Q ss_pred -CccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCC--- Q lcl|Aclame:pro 204 -HNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGY--- 276 (497) Q Consensus 204 -~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~--- 276 (497) ..+..|.+|++..++++|+|+++.+.+|++.+.+.||+|+|+|+ +++++||...+++++++.++.+|++|+|+ T Consensus 73 ~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~ 152 (315) T protein:vir:41 73 VGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSD 152 (315) T ss_pred cccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcC Confidence 12456889999999999999999999999999999999999997 38999999999999999999999999985 Q ss_pred ---ccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhh Q lcl|Aclame:pro 277 ---PGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE 353 (497) Q Consensus 277 ---~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (497) .+|.||++.+......... ....... T Consensus 153 p~~~~~~G~l~~a~~~~~~~~~---------------------------------------------------~~~a~~~ 181 (315) T protein:vir:41 153 PLLRMSDGWLKLASEKLTESDV---------------------------------------------------DPEAEDW 181 (315) T ss_pred ccccccccceeccccccccccc---------------------------------------------------ccccccc Confidence 3567888765432111000 0000111 Q ss_pred hhhhhHHhhhhhhhhhccC--CceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCC--- Q lcl|Aclame:pro 354 IAENVFDAFVDIQLTLFQT--PNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP--- 428 (497) Q Consensus 354 ~~~~~~~~~~~~~~~~~~~--~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~--- 428 (497) ..+.+.+..+.+...++.+ ..+|+||+.++..|+++||++|+|+|++.... +.+.+|+|+||+.++.|| T Consensus 182 ~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~------g~~~tl~G~PV~~~~~m~~~~ 255 (315) T protein:vir:41 182 PMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTG------ANSILYDGRPVQYVPALEALN 255 (315) T ss_pred cHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhc------CCCceecccceEecccccccC Confidence 2234455556666666543 34799999999999999999999999876533 345689999999999885 Q ss_pred --cCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecC Q lcl|Aclame:pro 429 --LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) Q Consensus 429 --~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~ 492 (497) ++.++||||+.. .+.++.+++++..+.. .++.+.|.+..|+|+.+.++++.+.-.++. T Consensus 256 ~~~~~ilf~d~~nl--~~~~~~~i~i~~~~~a----~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 256 DGKSRALFVVPTQL--VYGFWRNIKVVPDYDA----EMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred CCCccEEEecccce--EEEeccccEEEeeecC----CCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 567999999974 4567788888877654 367888999999999988887755555555 No 101 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=4e-40 Score=236.58 Aligned_cols=390 Identities=12% Similarity=0.077 Sum_probs=206.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) .|+-...+-+ .+++...++. .++++....+.+ ..+..++..++.++++++..+...... T Consensus 122 ~pa~~~a~I~------~vke~~~~e~---~~~~~~~a~~ee-------~~e~~~k~~el~a~l~~~~~~~~~~~~----- 180 (517) T protein:vir:97 122 NPSNKNAVVT------YFREEKKKEE---NKMTFDQNLMQE-------LLDAKKLAADLNAKLKERENGGDNAAL----- 180 (517) T ss_pred hhhhhhhhhh------hhhhhhhhhh---hhhhhhhhhhhh-------hhhhhhhHHHHHHHHHHHHHHHHHHHH----- Confidence 4433332211 1111111111 111111100000 001111112222222222211111000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +............... ....................... ............ .............+.++ T Consensus 181 --e~~~~l~a~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 248 (517) T protein:vir:97 181 --KTVSELAANLMKQRES------EKILGVEALKVTPEATEFLKTRE-AEVAYMSASLTK---DPKAAWTAELKERGISG 248 (517) T ss_pred --hhhhhhhhhHHHHHHh------hhhcccccccccchhhHHHHHHH-HHHHHHHhcccc---cccceeeeecccccccc Confidence 0000000000000000 00000000000000000000000 000000000000 00000000111222234 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) ...|+.+...+...+...+++++++++.+++ ...+|..+.. ..+.|+.||..+|+++++|+.+++.++++++++++| T Consensus 249 ~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~~-~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S 325 (517) T protein:vir:97 249 MPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNAL-TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLP 325 (517) T ss_pred cccchHHHHHHHHhhhhhccceeeeeecccc--ceeeeccccc-ceeeeeecCCcccccccceeeEEeeHhhhhhhhhhh Confidence 4556677777888787777887777665543 3556665553 568899999999999999999999999999999999 Q ss_pred HHHHhhHH-H----HHHHHHHHHHHHHHHHHHhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhc Q lcl|Aclame:pro 241 DEGLRDAP-E----LFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 241 ~ell~ds~-~----l~~~i~~~la~~~~~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) ++||+|+. + |++||.++|+++++.+++.+||+|+|++ ++.|+++.++....... T Consensus 326 ~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~-------------------- 385 (517) T protein:vir:97 326 KIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNV-------------------- 385 (517) T ss_pred HHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccc-------------------- Confidence 99998763 3 9999999999999999999999999987 46677765321111000 Q ss_pred chhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccC Q lcl|Aclame:pro 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G 394 (497) ...+...+++..+..... ...+..|+|||.+|.+|++|||++| T Consensus 386 -----------------------------------~~~~~~~d~i~~l~~a~~--~a~~a~~vmn~~t~~~I~klKD~~G 428 (517) T protein:vir:97 386 -----------------------------------TGTTNIQELLEKLSVATP--KAADSTLVIHRNDLAAIRFLKDKNG 428 (517) T ss_pred -----------------------------------cccchHHHHHHHHHHHhh--hccCCEEEECHHHHHHHHHhhcCCC Confidence 000111112222211111 1234579999999999999999999 Q ss_pred cccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEee Q lcl|Aclame:pro 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEER 474 (497) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r 474 (497) ||||++.... ....+++|+.-+. +.++.+...+++++. |.++++.++.+..+ ..+.+|++.|+.++| T Consensus 429 ~Yl~~~~~~~------~~~~~l~G~~~~~-~~~~~~~~~~~~~~~--y~i~~~~g~~~~~~----fd~~~n~~~f~~~~~ 495 (517) T protein:vir:97 429 NYVFPVGVSN------QTIATHFGFNRLV-QSVAVDEKTAVSLSG--YVTNGSRGMEFEQG----TILVENNKEYLFEMP 495 (517) T ss_pred CeeccCcCCc------ccccccCCccccc-cccccCceeEeeccc--cEEEeecceeeeee----eecccCceeEeeeee Confidence 9999775432 3345777742222 234456556666553 67888877765322 124689999999999 Q ss_pred eccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 475 LGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 475 ~~~~v~~~~Af~~~~~~~~a~~ 496 (497) +++.|+.|++|+.+.+...+.| T Consensus 496 ~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 496 ISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred eccccccccceEEEEEcCCCCC Confidence 9999999999999999999999 No 102 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=5.6e-37 Score=219.32 Aligned_cols=364 Identities=12% Similarity=0.075 Sum_probs=188.8 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) +|+-...+.+. .+....+.. ......+.. +...+..+.+.+..++.++++++..+.+++ T Consensus 109 ~pa~~~a~v~~------vks~~~~~e-----~~~~~~e~~---e~~~e~~e~~~~~~el~akl~el~k~~ee~------- 167 (480) T protein:vir:40 109 LPSNKGAKVTK------VREENKGEQ-----EQMGANETQ---EIMKQAIEAGVKVRELEAKVEELNKEREEL------- 167 (480) T ss_pred cccchhhhhhh------hhhhhhhhh-----hhhhhHHHH---HHHHhhhhhhhhhhhHHHHHHHHHhHHHHH------- Confidence 77765555321 111000000 000000000 000111111112222223333222211111 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .......... ................ ...... ...+.. ..... ........+ T Consensus 168 k~~~~~~~~~------~~~~~~~~~e~r~~~~--------~~~~~~--------e~~~~~-----~~~~~-~~~~~~~~~ 219 (480) T protein:vir:40 168 KKEREASIPS------EKPEDAERKFMRELGS--------KMAEMP--------EQGFLR-----EFANG-ADLNVVNSL 219 (480) T ss_pred hhhhhhhccc------cchhhhhhHHHHHHHH--------Hhccch--------hhhhhh-----hhhhh-ccccccccc Confidence 0000000000 0000000000000000 000000 000000 00000 111222334 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccc--cccceeEEee---eeeeee Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFS--SEEFARVYEQ---VGKVAN 235 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s--~~~~~~v~~~---~~kia~ 235 (497) +.+++.+...+........++...++.... + .....|++|+...+.. ..++....+. ++++++ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------g-~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~ 287 (480) T protein:vir:40 220 GSITSKYARKSGIYDGAMKARFQGLTLAED-----------G-VDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQ 287 (480) T ss_pred cccccchhhheeechhhhhhhhhcceeeec-----------c-ccceeeeeeeecccccccccccccchhhHHHHHHHHH Confidence 556666555444433333343333322211 1 1235677776544332 1234444444 578999 Q ss_pred echhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCC--ccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 236 ALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGY--PGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 236 ~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~--~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) +..+|+++|+|+++|++||.++|++.++.+++.+||+|+|+ ..+.||.+.....+.. T Consensus 288 ~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~~~--------------------- 346 (480) T protein:vir:40 288 MDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGWTKQ--------------------- 346 (480) T ss_pred hHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeeccccccc--------------------- Confidence 99999999999999999999999999999999999999554 4566664322110000 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhccc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) .+..+.++ ...+++..++..++..|+|||.+|.+|++|||++ T Consensus 347 -----------------------------------~~~~d~id---~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~ 388 (480) T protein:vir:40 347 -----------------------------------IEYTDLFE---GITDAVAECSISDAITIVMSPQTFAELRKAKGTD 388 (480) T ss_pred -----------------------------------chhHHHHH---HHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCC Confidence 00011222 2334444555555557999999999999999999 Q ss_pred Ccccccccccccccccccccccccccceeec-CCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEE Q lcl|Aclame:pro 394 GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTT-PLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) Q Consensus 394 G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s-~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~ 472 (497) |+|||++... ...+++|||+||+.+ ..+|.+...+|.++. ++.++||. ++. .....+..|++.|+++ T Consensus 389 G~Yi~q~~~~------~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~-~~~~~d~~-~~~----~~~~~~~~~~~~~~~e 456 (480) T protein:vir:40 389 GHSRFNELAT------KEQIAQSFGAVNLETRVWMPKDEVAVYNHDE-YVLIGDLN-VEN----YNDFDLRYNVEQWLSE 456 (480) T ss_pred CCeeccCccc------ccCcceecccceeeeeccccCCcceeeeCCc-cEEEEecc-cce----ecccccccchhhhhhh Confidence 9999988543 345679999998765 567888888888887 57888874 332 2223466899999999 Q ss_pred eeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 473 ERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 473 ~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) .|+++.|..|+||++++++..=.= T Consensus 457 ~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 457 TLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred hhhceeeEccccEEEEEeccCcCC Confidence 999999999999999987643333 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=3.3e-35 Score=209.64 Aligned_cols=302 Identities=10% Similarity=0.021 Sum_probs=205.7 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~ 209 (497) ... +.+.... ...........++.++|++|||++...+++.+.+.++++++++++++++....+|..... +...| T Consensus 1 ~~~---k~~~~~l-~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~-~~~~~ 75 (321) T protein:vir:31 1 MAS---RTINNDL-SRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIG-ERHRR 75 (321) T ss_pred Cch---HHHHHHH-HHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccC-Ccccc Confidence 011 1111111 111112222234455677899999999999999999999999999999988999887553 45678 Q ss_pred ecc-c-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCcccc---- Q lcl|Aclame:pro 210 VAE-A-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN---- 280 (497) Q Consensus 210 v~E-g-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~---- 280 (497) +++ + ...+.++|+|+++++.+|++.+.++||+|+|+|+ ++++++|.+.++++++..++.++++|+|...+. T Consensus 76 ~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~ 155 (321) T protein:vir:31 76 PQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQ 155 (321) T ss_pred cccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCccccc Confidence 764 3 3456788999999999999999999999999986 489999999999999999999999999987664 Q ss_pred --ceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhh Q lcl|Aclame:pro 281 --GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENV 358 (497) Q Consensus 281 --Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (497) ||++.+....... .........+.+ T Consensus 156 n~G~l~~a~~~~~~~-----------------------------------------------------~~~~~~~~~d~l 182 (321) T protein:vir:31 156 NDGFITVAEGDVETI-----------------------------------------------------DAADDILDNDLV 182 (321) T ss_pred chhhhhhhccccccc-----------------------------------------------------cccccccCHHHH Confidence 4443221110000 000001122344 Q ss_pred HHhhhhhhhhhccCC-ceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeec Q lcl|Aclame:pro 359 FDAFVDIQLTLFQTP-NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHF 437 (497) Q Consensus 359 ~~~~~~~~~~~~~~~-~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~ 437 (497) ..+...+...++..+ -+|+||+.++..++......+.++|.+... ...+.+|+|+||+.+++||.+.++++|| T Consensus 183 ~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~------~~~~~tl~G~pvv~~~~mP~~~il~t~~ 256 (321) T protein:vir:31 183 IRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIM------GEADVNPFSFPIIGSGLWPDDKAMFTDP 256 (321) T ss_pred HHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCccccchhh------ccccccccceeEEEcCCCCCCcEEEecc Confidence 455555555554433 379999999988876444445577765432 2344589999999999999999999999 Q ss_pred cceEEEEEeccccEEEEeccchh-hhhcCceEEEEEeeeccEeecccceEEEEecCCCC------CC Q lcl|Aclame:pro 438 APSVIQTARREGVTMQMTNSNGT-DFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT------GS 497 (497) Q Consensus 438 ~~~~~~i~~r~~~~i~~~~~~~~-~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~------~~ 497 (497) +...|.+ ..++++++..+... ....+.+.+....++|+.|.+++|+|.++=..-+- .| T Consensus 257 ~nl~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 257 QNLIYAL--YRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred ccEEEEE--eeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcchhcccCCCC Confidence 9976555 45667776655431 12234555556667899999999999998322211 11 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.96 E-value=1.9e-31 Score=188.99 Aligned_cols=266 Identities=19% Similarity=0.192 Sum_probs=190.3 Q ss_pred hhhcccccCCcccccc-hhhHHHHHHHhhhhHHhhccce----ecCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~-~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.+++.. +..+.|+ +..-+++.+.+.+.+.+++.+. ..+|+++++|+... .+.+.|++||+.+|.++++++. T Consensus 1 MA~~~T~~-~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:98 1 MAVGTTKM-AQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CCCccccc-hheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcccccccccce Confidence 44444433 4455554 5555667777777777776553 23456799999865 5789999999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++.+++++..+++|++++.++ +++.+++.+.+++++++++|..++..-.. +.. .. T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---------a~~---~~----------- 135 (272) T protein:vir:98 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK---------STQ---TV----------- 135 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc---cc----------- Confidence 9999999999999999998876 79999999999999999999998852110 000 00 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+... ......|+|||.++. T Consensus 136 ---------------------------------------------~~~~t~d~i~da~~~l~~~-~~~~~~~vv~p~~~~ 169 (272) T protein:vir:98 136 ---------------------------------------------EATATVDGVSKALDIFNDE-DDAETVIVMNPADAS 169 (272) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHHhcc-CCCccEEEEcHHHHH Confidence 0000122333333333323 344568999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++.+..+..-. ...+.. ....+..++++|+||+.|+++|.++.++.+.. .+.++.+.+++++..++.. + T Consensus 170 ~L~k~~~~~~~~~--~~~~~~-~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~--a~~~~~~~~~~ve~~r~~~----~ 240 (272) T protein:vir:98 170 TLRLDAAKEWLGA--TEVGAN-RVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG--ALRIMLKRNTMVETDRDIT----K 240 (272) T ss_pred HHHHhcccccccc--cccccc-ccccccchhhcCeeEEEcCCCCcceEEEEcCC--eEEEEecCCceeeeccccc----c Confidence 9987643321100 000000 01112235899999999999999999886655 5667778888888777553 5 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) +...+++.+|+++++.+|++||+++++++++- T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 241 AINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 67899999999999999999999999999998 No 105 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.96 E-value=1.9e-31 Score=188.99 Aligned_cols=266 Identities=19% Similarity=0.192 Sum_probs=190.3 Q ss_pred hhhcccccCCcccccc-hhhHHHHHHHhhhhHHhhccce----ecCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~-~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.+++.. +..+.|+ +..-+++.+.+.+.+.+++.+. ..+|+++++|+... .+.+.|++||+.+|.++++++. T Consensus 1 MA~~~T~~-~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~-~~~a~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:30 1 MAVGTTKM-AQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDY-IGDAEDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CCCccccc-hheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecC-CCCcccccCCCcccccccccce Confidence 44444433 4455554 5555667777777777776553 23456799999865 5789999999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) +++.+++++..+++|++++.++ +++.+++.+.+++++++++|..++..-.. +.. .. T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---------a~~---~~----------- 135 (272) T protein:vir:30 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK---------STQ---TV----------- 135 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc---cc----------- Confidence 9999999999999999998876 79999999999999999999998852110 000 00 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+... ......|+|||.++. T Consensus 136 ---------------------------------------------~~~~t~d~i~da~~~l~~~-~~~~~~~vv~p~~~~ 169 (272) T protein:vir:30 136 ---------------------------------------------EATATVDGVSKALDIFNDE-DDAETVIVMNPADAS 169 (272) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHHhcc-CCCccEEEEcHHHHH Confidence 0000122333333333323 344568999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++.+..+..-. ...+.. ....+..++++|+||+.|+++|.++.++.+.. .+.++.+.+++++..++.. + T Consensus 170 ~L~k~~~~~~~~~--~~~~~~-~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~--a~~~~~~~~~~ve~~r~~~----~ 240 (272) T protein:vir:30 170 TLRLDAAKEWLGA--TEVGAN-RVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKG--ALRIMLKRNTMVETDRDIT----K 240 (272) T ss_pred HHHHhcccccccc--cccccc-ccccccchhhcCeeEEEcCCCCcceEEEEcCC--eEEEEecCCceeeeccccc----c Confidence 9987643321100 000000 01112235899999999999999999886655 5667778888888777553 5 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) +...+++.+|+++++.+|++||+++++++++- T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 241 AINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 67899999999999999999999999999998 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.82 E-value=5.8e-22 Score=137.03 Aligned_cols=264 Identities=16% Similarity=0.140 Sum_probs=175.6 Q ss_pred hhhcccccCCcccccchhhH-HHHHHHhhhhHHhhccceec----CCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~-ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.+ .+.....|.|+...+ +.+.+.....+.+++..-+. +|.++++|.... .+.+.++.||..++..+.+.++ T Consensus 1 ma~~-~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~-~gda~~~~eg~~i~~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQ-KTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTY-IGDAADVAEGGEISLDKIGTTT 78 (272) T ss_pred CCCc-ceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeecc-CccccccCCCCccChhhcCCcc Confidence 3322 334455666665554 55666666666777655432 356799999865 4678899999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++++..+.++++....+ .++.+.+.+.+++.+++.+|..++..-.. +.... . T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~---------~~~~~---~---------- 136 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT---------TSQTV---S---------- 136 (272) T ss_pred eeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccccc---c---------- Confidence 9999999998999999876655 68999999999999999999987642110 00000 0 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....+++|||..+. T Consensus 137 ----------------------------------------------~~~~~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 169 (272) T protein:vir:36 137 ----------------------------------------------TKANVDGVQAALDIFNDED-AQAYVLIVNPKDAA 169 (272) T ss_pred ----------------------------------------------ccccHHHHHHHHHHhhhcC-CCceEEEEcHHHHH Confidence 0001122222222222222 23567999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEee--ccceEEEEEeccccEEEEeccchhhh Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH--FAPSVIQTARREGVTMQMTNSNGTDF 462 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd--~~~~~~~i~~r~~~~i~~~~~~~~~f 462 (497) .|++...-. +. ....+....-.+.-++++|+||+.|+.+|.++.++.. |.++++.++....++++..+... T Consensus 170 ~L~k~~~~~--~~--~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~--- 242 (272) T protein:vir:36 170 KIRKDANAK--NI--GSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--- 242 (272) T ss_pred HHhcccccc--cc--cccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccccchh--- Confidence 987543221 11 1111100011122358999999999999998754321 33445666667788888776553 Q ss_pred hcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 463 VDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 463 ~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) +....+++.++++.++.+|+++|+++++-. T Consensus 243 -~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 243 -TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred -hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 334579999999999999999999999988 No 107 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.81 E-value=4.4e-21 Score=132.20 Aligned_cols=268 Identities=16% Similarity=0.143 Sum_probs=178.3 Q ss_pred hhhcccccCCcccccc-hhhHHHHHHHhhhhHHhhccceec----CCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~-~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.+. +.-+..+.|+ |..-+.+.+.+...+.+++..... ++..+++|+... .+.+.|+.||+.++.++.++++ T Consensus 1 ma~~~-T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~~~~~~eg~~i~~~~it~~~ 78 (274) T protein:vir:93 1 MPQGI-TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCccc-eehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeecc-CCCcccccCCCcccccccccce Confidence 33333 3444555565 455566667666666677655321 345799999865 4678999999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++...+.+.+++++++++|..++..-...... .. T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~---------~~------------- 136 (274) T protein:vir:93 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT---------VN------------- 136 (274) T ss_pred eEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------cc------------- Confidence 9999999998899999987665 688899999999999999999887532111000 00 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... .....++|||..+. T Consensus 137 ---------------------------------------------~~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:93 137 ---------------------------------------------ADITKLNGLQSAIDKFNDED-LEPMVLFINPLDAG 170 (274) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHhhhcc-CCccEEEeCHHHHH Confidence 00001122233333322222 24568999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+. .....+..+++.|+||+.|+.+|.++.++..... +.++.+..+.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~ga--i~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:93 171 KLRG--DASTNFTRATELGD-DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA--VKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHh--hhhhcccccccccc-cceeecccceecCeeEEEcCCCCcceEEEEeCCe--EEEEecCCcccccccchh----h Confidence 9964 33222221111110 0111223457999999999999999988877554 555667777887776543 3 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++..++++++.+|+++|+++++++..-- T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 242 KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 456899999999999999999999854433222 No 108 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.78 E-value=1.3e-20 Score=129.57 Aligned_cols=271 Identities=15% Similarity=0.124 Sum_probs=175.3 Q ss_pred hhhcccccCCcccccc-hhhHHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~-~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++..++ ..+..+.|+ |..-+.+.+.+...+.+++.... ..+..+++|+... .+.+.++.||+.++..+.++++ T Consensus 1 Ma~~~T-~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~a~~~~~g~~i~~~~lt~~~ 78 (278) T protein:vir:80 1 MADLTT-KLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKY-IGDAQDVAEGAAIDYSALETES 78 (278) T ss_pred CCCcce-ehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeecc-CCcceeecCCCcCcccccccce Confidence 333333 334455565 55566677776666667664432 2355799999864 4678899999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccC-CCccccceeccccccccchhhhhhhHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRSTGFTASSASSLFGATSA 303 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~-g~~~~~Gil~~~~~~~~~~~~~~~~~~~~ 303 (497) .++..++.+..+.++++....+ .++.+.+.+.+++.+++.+|..++..- |... +..+..+.. T Consensus 79 ~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-----~~~~~~t~~----------- 142 (278) T protein:vir:80 79 VKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-----EVKGAINIG----------- 142 (278) T ss_pred eeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccccccc----------- Confidence 9999999988899999877665 579999999999999999999877531 1000 000000000 Q ss_pred HHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHH Q lcl|Aclame:pro 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW 383 (497) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~ 383 (497) ......+.+.++...+..........++|||..+ T Consensus 143 ----------------------------------------------~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~ 176 (278) T protein:vir:80 143 ----------------------------------------------LIDKIENTFTDAPDAIEDESITTTGVLFLNYKDT 176 (278) T ss_pred ----------------------------------------------hhhhHHHHHHHHHHhhcccCCCcccEEEECHHHH Confidence 0000011122222222222222344688999999 Q ss_pred HHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhh Q lcl|Aclame:pro 384 ELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV 463 (497) Q Consensus 384 ~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~ 463 (497) ..|++... .+++-....+.+ ..-.+.-+++.|++|+.|+++|.++.++..- +++..+....++++..+... T Consensus 177 ~~L~k~~~--~~~~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~--gAi~~~~~~~~~vE~~Rd~~---- 247 (278) T protein:vir:80 177 AKLREEAA--GSWTKASQLGDD-LLVKGAFGELLGWEIVRTKKLADGNALAVKA--GALKTFLKRNLLAESGRDMD---- 247 (278) T ss_pred HHHHhhhh--hhcccccccccc-ceeeccceeecceeEEEcCCCCcceEEEEec--cceeeeecCCcccccccchh---- Confidence 88876532 222211111111 1112334579999999999999999876543 35666667778887776543 Q ss_pred cCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 464 DGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 464 ~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) +....+++..+++.++.+|+++|+++..++- T Consensus 248 ~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 248 HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 3456788999999999999999999966665 No 109 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.77 E-value=3.7e-20 Score=127.12 Aligned_cols=268 Identities=15% Similarity=0.144 Sum_probs=180.6 Q ss_pred hhhcccccCCcccccchhh-HHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLP-GIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~-~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.+ .+.-+..|.|+... -+.+.+...+.+.+++.+-+ .++..+++|.... .+.+.++.||..++..+.++++ T Consensus 1 Ma~~-~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~-igda~~~~eg~~i~~~~lt~~~ 78 (276) T protein:vir:10 1 MAQG-TTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVY-SGDATVVPEGQKIPVDKIETNR 78 (276) T ss_pred CCcc-eeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecC-CCccccccCCCccCccccccce Confidence 3322 33445566676555 45566666667777775533 3567899999865 4678899999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .+...++.+..+.++++....+ .+..+.+.+.++..+++.+|..++.-- ........+ T Consensus 79 ~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l---------~~~~~~~~~------------ 137 (276) T protein:vir:10 79 REAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEAL---------RGTKLTVSA------------ 137 (276) T ss_pred eeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHH---------hcccccccc------------ Confidence 9999999999999999987765 578889999999999999998876310 000000000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... ....+++|||..+. T Consensus 138 ----------------------------------------------~~~t~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 170 (276) T protein:vir:10 138 ----------------------------------------------DIGTLAGLEAAIDTFDDED-LEPMVLFINPKDAG 170 (276) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-CcccEEEEcHHHHH Confidence 0001122222222222221 24568899999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|+++...+ ++-.... +......+.-+++.|++|+.++.+|.++.++.. ++++.++....++++..+... + T Consensus 171 ~L~k~~~~~--f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~--~gAi~~~~~~~~~vE~dRd~~----~ 241 (276) T protein:vir:10 171 KLRSSASDN--FTRATEL-GDNIIVKGAFGEALGAVIVRSKKLDEGEAILAK--RGAVKLITKRDFFLETDRDPS----T 241 (276) T ss_pred HHHHhcccc--ccccccc-cccceeccccceecceeEEEcCCCCcceEEEEe--ccceeeeecCCceeecccchh----h Confidence 998764332 2211111 111111223458899999999999999987544 456667778888888887654 3 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++..+++.++.+|+.+|++++...+.-| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (276) T protein:vir:10 242 KTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS 274 (276) T ss_pred cccEEEEeeEEEEEEEcCcceEEEecCCcCCcC Confidence 466789999999999999999999966633333 No 110 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.76 E-value=1.4e-20 Score=129.37 Aligned_cols=308 Identities=14% Similarity=0.144 Sum_probs=196.6 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccc Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISS 187 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~ 187 (497) +.+.-... .+.+|.. .....-...+...+-..++.+.+......||+.+.+.+.|+++++. T Consensus 1 ~~~~~~~~------------------~~~~~~~-~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf 61 (330) T protein:vir:94 1 MVRICTPP------------------LRGRWRT-LTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPF 61 (330) T ss_pred CceecCCc------------------cccceee-hhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhccc Confidence 00000000 0000000 0000001112222223334445566778899999999999999999 Q ss_pred eecCCCceEEEEeecCCccceeeccccccccccc-cceeEEeeeeeeeeechhhHHHHh--hHH-HHHHHHHHHHHHHHH Q lcl|Aclame:pro 188 RPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSE-EFARVYEQVGKVANALTITDEGLR--DAP-ELFNFVQGRLLEGIQ 263 (497) Q Consensus 188 ~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~-~~~~v~~~~~kia~~~~iS~ell~--ds~-~l~~~i~~~la~~~~ 263 (497) ..+.++.+.|++.+. -++++|...++..+.+.+ +|.+++...+.+++.+.|.+++.+ .++ +...+-.....+++. T Consensus 62 ~~ve~~~~~~~r~~~-lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~ 140 (330) T protein:vir:94 62 TEIEGNALAYNRENV-LGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIG 140 (330) T ss_pred ccccCCcceeeeeec-CCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH Confidence 989899999999887 478999999998887664 899999999999999999999965 344 688888888999999 Q ss_pred HHHHhhhhccCCCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccc Q lcl|Aclame:pro 264 RKEEVQLLAGGGYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGS 342 (497) Q Consensus 264 ~~~d~a~l~G~g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 342 (497) .+.+.+||||+.++ ++.||+......... .. T Consensus 141 ~~~e~~linGDs~~~~F~GL~~~~~~~q~i------------------------------------------------~t 172 (330) T protein:vir:94 141 RQYQASMITGDGTGNSFQGMMGLVAASQTI------------------------------------------------SA 172 (330) T ss_pred HHHHHHhhccCCCCccccchhhcCCcccEE------------------------------------------------ec Confidence 99999999999764 466776533211000 00 Q ss_pred cccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccccccccccccccccccccee Q lcl|Aclame:pro 343 GVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVV 422 (497) Q Consensus 343 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv 422 (497) ... ..+...+.+|.++..... ....+.+|+||+....+|+.++...|+|-..+......|.+ -.+..|+|++ T Consensus 173 g~~-gg~~T~d~LDeLl~~v~~----~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~---v~~~~GvPi~ 244 (330) T protein:vir:94 173 GAN-GGTLTFELLDQLLDLVKD----KDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQ---IPTYRGVPWF 244 (330) T ss_pred CCC-CCCCCHHHHHHHHHHhcC----CCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCE---EeeeCCeEEE Confidence 000 111122334444433321 22347789999999999999999888776544433322222 2456799999 Q ss_pred ecCCCCcC----------cEEEeec-----cceEEEEEec--cccEEEEeccchhhhhcCceEEEEEeeeccEeecccce Q lcl|Aclame:pro 423 TTPLIPLG----------TILVGHF-----APSVIQTARR--EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 423 ~s~~~~~~----------~~~~gd~-----~~~~~~i~~r--~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af 485 (497) .++.+|.+ .||+..| .++..++... .++++..-.+.. .++.+.++++++++.+|..|+|+ T Consensus 245 ~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~---~k~v~~~~v~~y~~~av~~~~a~ 321 (330) T protein:vir:94 245 VNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKE---NADETITRVKMYCGFANFSQLGL 321 (330) T ss_pred ecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCcc---ccceeeEEEEEeeeeEEechhhe Confidence 99998863 2454333 2444444322 255554433221 36678899999999999999999 Q ss_pred EEEEecCCCCC Q lcl|Aclame:pro 486 QLIQLKKGATG 496 (497) Q Consensus 486 ~~~~~~~~a~~ 496 (497) .+|+-.. -| T Consensus 322 ~~L~~V~--~g 330 (330) T protein:vir:94 322 AAIKGLI--PG 330 (330) T ss_pred eeecccc--CC Confidence 9977332 23 No 111 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.76 E-value=7.6e-20 Score=125.40 Aligned_cols=269 Identities=13% Similarity=0.104 Sum_probs=181.7 Q ss_pred hhhcccccCCcccccchhhH-HHHHHHhhhhHHhhccceec----CCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~-ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++..+.+.-...|.|+.... +.+.+.....+.+++.+-+. ++..+++|+... .+.+.++.||+.++..+.++++ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVY-SGDAKVVPEGEEIPIDLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeecc-CCccccccCCCCcchhhcccce Confidence 44444444556777775554 55666666677777655432 356799999875 4678899999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .+...++.+..+.++++....+ .++...+.+.++.++++.+|..++.--++. ..... T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a---------~~~~~------------- 137 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA---------TLKVE------------- 137 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccccc------------- Confidence 9999999999999999976655 578888899999999999999877421110 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... .....++|||..+. T Consensus 138 ---------------------------------------------~~~~~~d~i~dA~~~lgd~~-~~~~~ivv~p~~~~ 171 (275) T protein:vir:96 138 ---------------------------------------------ADITKLAGLQTAIDKFNDED-LEPMVLFVNPLDAG 171 (275) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHhcccc-CCccEEEeCHHHHH Confidence 00001222233333332222 24567999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++... -+++-.+ ..+....-.+.-+++.|++|+.|+.+|.++.++.. ++++.++....++++..+... + T Consensus 172 ~L~k~~~--~~f~~~~-~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~--~gA~~~~~~~~~~vE~~Rd~~----~ 242 (275) T protein:vir:96 172 KLRASAT--DNFTRAT-LLGDNVIVKGAFGEALGAIIVRSNKIKEGEAILAK--RGAVKLITKRDFFLETERHAS----H 242 (275) T ss_pred HHHhccc--ccccccc-cccccceeccccceecCeeEEEeCCCCcceEEEEe--ccceeeeecCCcccccccchh----h Confidence 9977532 1222111 11111111223467999999999999999877643 445666777788888777553 3 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.++++.++++|+++|+++++++--|- T Consensus 243 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 243 KSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred cCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 456789999999999999999999987666666 No 112 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.74 E-value=3.8e-19 Score=121.55 Aligned_cols=268 Identities=14% Similarity=0.118 Sum_probs=176.8 Q ss_pred hhhcccccCCcccccchhh-HHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLP-GIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~-~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++... +..+..+.|+... -+.+.+.....+.+++..-. .++..+++|+... .+.+..+.||+.++..+.+++. T Consensus 1 ma~~~-T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~~~~~~~g~~i~~~~it~~~ 78 (274) T protein:vir:96 1 MAQGT-TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY-SGDAQVIAEGEKIPVDQIGTSK 78 (274) T ss_pred CCccc-cchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeecc-CCCccccCCCCcCchhhcccce Confidence 33333 3444566676544 45566655556666665432 1356799999864 5678889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++++..+.++++....+ .++.+.+.+.+++++++.+|..++.--... +.... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a---------~~~~~------------- 136 (274) T protein:vir:96 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA---------TLTVE------------- 136 (274) T ss_pred eEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC---------CCCcC------------- Confidence 9999999988899999886655 578899999999999999999877421100 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... .....++|||..+. T Consensus 137 ---------------------------------------------~~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:96 137 ---------------------------------------------ADITKLDGLQTAIDKFNDED-LEPMVLFVNPLDAG 170 (274) T ss_pred ---------------------------------------------cccccHHHHHHHHHHhcccC-CCceEEEeCHHHHH Confidence 00011222333333332222 24567999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++... .+++-....+. .....+.-+++.|++|+.|+++|.++.++..-. ++.++....++++..+... + T Consensus 171 ~L~k~~~--~~f~~~~~~g~-~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~g--A~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:96 171 GLRTSAS--DNFTRPTQLGD-NIIVKGAFGEALGAVIVRSNKLNKGEALLAKKG--AVKLITKRDFFLEKDRDAS----R 241 (274) T ss_pred HHHhccc--ccccccccccc-cceeecccceecCeeEEEcCCCCcceEEEEeCc--ceeeeecCCcccccccchh----h Confidence 9987532 22221111111 111123456889999999999999998765533 4566667777887666543 4 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.++++.++++|+++|+++..++-+-- T Consensus 242 ~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 242 KSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred cccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 456899999999999999999999955544444 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.71 E-value=2.4e-18 Score=117.16 Aligned_cols=268 Identities=16% Similarity=0.129 Sum_probs=175.9 Q ss_pred hhhcccccCCcccccch-hhHHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTF-LPGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~-~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.. .+.-+..|.|+. .+-+.+.+.......+++..-. .++..+++|+... .+.+..+.||+.++..+.+.++ T Consensus 1 ma~~-~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-~g~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:94 1 MPQG-LTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-ceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCcccccccccce Confidence 3332 334455666664 4455566666655566665532 2456799999764 4678889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++.+.+.+.+++++++.+|..++.--.+. ..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a---------~~~~~------------- 136 (274) T protein:vir:94 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------KLTVN------------- 136 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------Ccccc------------- Confidence 9999999998899999876654 678889999999999999999877421100 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... .....++|||..+. T Consensus 137 ---------------------------------------------~~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:94 137 ---------------------------------------------ADITKLNGLQSAIDKFNDED-LEPMVLFVNPLDAG 170 (274) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHhhccC-CCceEEEeCHHHHH Confidence 00001222333333332222 24567899999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+. .....+..+++.|++|+.|+.+|.++.++..-. ++.++....+.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g--A~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:94 171 KLRG--DASTNFTRATELGD-DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG--AVKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHh--hhhhhccccCcccc-cceeccccceecCeeEEEcCCCCcceEEEEeCc--ceEeeecCCceeccccchh----h Confidence 9864 33223322111111 011123345799999999999999998776544 4556667788888777543 3 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.+++++++.+|+++|+++++.+..-- T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 242 KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 355788899999999999999999965543333 No 114 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.71 E-value=2.4e-18 Score=117.16 Aligned_cols=268 Identities=16% Similarity=0.129 Sum_probs=175.9 Q ss_pred hhhcccccCCcccccch-hhHHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTF-LPGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~-~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.. .+.-+..|.|+. .+-+.+.+.......+++..-. .++..+++|+... .+.+..+.||+.++..+.+.++ T Consensus 1 ma~~-~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-~g~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:97 1 MPQG-LTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCcc-ceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCcccccccccce Confidence 3332 334455666664 4455566666655566665532 2456799999764 4678889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++.+.+.+.+++++++.+|..++.--.+. ..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a---------~~~~~------------- 136 (274) T protein:vir:97 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------KLTVN------------- 136 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------Ccccc------------- Confidence 9999999998899999876654 678889999999999999999877421100 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... .....++|||..+. T Consensus 137 ---------------------------------------------~~~~~~d~i~dA~~~l~d~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:97 137 ---------------------------------------------ADITKLNGLQSAIDKFNDED-LEPMVLFVNPLDAG 170 (274) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHhhccC-CCceEEEeCHHHHH Confidence 00001222333333332222 24567899999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+. .....+..+++.|++|+.|+.+|.++.++..-. ++.++....+.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~g--A~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:97 171 KLRG--DASTNFTRATELGD-DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKG--AVKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHh--hhhhhccccCcccc-cceeccccceecCeeEEEcCCCCcceEEEEeCc--ceEeeecCCceeccccchh----h Confidence 9864 33223322111111 011123345799999999999999998776544 4556667788888777543 3 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.+++++++.+|+++|+++++.+..-- T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 242 KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 355788899999999999999999965543333 No 115 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.65 E-value=2.2e-17 Score=111.87 Aligned_cols=268 Identities=15% Similarity=0.091 Sum_probs=171.9 Q ss_pred hhhcccccCCcccccchhh-HHHHHHHhhhhHHhhccce----ecCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLP-GIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~-~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++. ..+.-...|.|+... -+.+.+.....+.+++..- ..+++.+++|.... .+.+..+.||+.++..+.+.++ T Consensus 1 ma~-~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:12 1 MAQ-GLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCc-ceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCccchhhcccce Confidence 222 233445566666544 4555555555555665542 22466899999864 4678889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) ..+..++.+..+.++++....+ .++.+.+.+.++.++++.+|..++.--.+... ... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~---------~~~------------- 136 (274) T protein:vir:12 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL---------TVN------------- 136 (274) T ss_pred eeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------ccc------------- Confidence 9999999998999999765444 57888899999999999999987742111000 000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.+.++...+.... .....++|||..+. T Consensus 137 ---------------------------------------------~~a~~~d~i~dA~~~lgd~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:12 137 ---------------------------------------------ADITKLNGLQSAIDKFNDED-LEPMVLFINPLDAG 170 (274) T ss_pred ---------------------------------------------ccccCHHHHHHHHHHhcccc-ccccEEEeCHHHHH Confidence 00011222233333322222 24567899999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++. ..-+++-... .+....-.+.-+++.|++|+.|+.+|.++.++.- ++++.++....++++..+... + T Consensus 171 ~L~k~--~~~~fv~~s~-~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:12 171 KLRGD--ASTNFTRATE-LGDDIIVKGAFGEALGAIIVRSNKLEAGTAILAK--KGAVKLILKRDFFLEVARDAS----T 241 (274) T ss_pred HHHhh--hhhhcccccc-ccccceecccceeecCeeEEEeCCCCcceEEEEe--ccceeeeecCCceeccccchh----h Confidence 88753 2222221111 1100111223457899999999999999876533 344566667888888877654 3 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.++++.++++|+.+|++++..+..-- T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 242 KTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 445899999999999999999999944332222 No 116 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.64 E-value=3.2e-17 Score=110.99 Aligned_cols=265 Identities=15% Similarity=0.127 Sum_probs=171.6 Q ss_pred hhhcccccCCcccccchh-hHHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~-~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++. ..+.-...|.|+.. +-+.+.+.....+.+++.+-+ .++..+++|.... .+.+..+.||+.++..+.+.+. T Consensus 1 m~~-~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:96 1 MAQ-GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIPTDILETKK 78 (274) T ss_pred CCc-ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccchhhcccce Confidence 222 23344556666644 445566655555666654332 2466899999865 4678889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++.+.+.+.++.++++.+|..++.--.+.. ....+ T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~---------~~~~~------------ 137 (274) T protein:vir:96 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK---------LTVEA------------ 137 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------ccccc------------ Confidence 9999999888899999865554 5788889999999999999998764211100 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... .....++|||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:96 138 ----------------------------------------------DITKLTGLQTAIDKFNDED-LEPMVLFISPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-ccccEEEeCHHHHH Confidence 0001122222222222221 23557899999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+ ......+.-+++.|++|+.|+.+|.++.++.. ++++..+....+.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:96 171 KLRG--DATTNFTRATELG-DDVIVKGAFGEALGAVIVRSNKLEAGTAILAK--KGAVKLITKRDFFLETDRDPS----T 241 (274) T ss_pred HHHh--hcccccccccccc-ccceeccccceecCeEEEEeCCCCCceEEEEe--ccceeeeecCCcccccccccc----c Confidence 9875 3222222111111 11111223457899999999999998876533 334555667778888777553 4 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.+++++++++|+++|+++ ...|| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t---k~~~~ 271 (274) T protein:vir:96 242 KTTALYSDKHYVAYLYDESKAVKIT---KGSGS 271 (274) T ss_pred ccCEEEEeEEEEEEEEcCCcEEEEE---cCCcc Confidence 5677899999999999999999988 44566 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.64 E-value=3.2e-17 Score=110.99 Aligned_cols=265 Identities=15% Similarity=0.127 Sum_probs=171.6 Q ss_pred hhhcccccCCcccccchh-hHHHHHHHhhhhHHhhcccee----cCCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADLISSRP----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~-~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++. ..+.-...|.|+.. +-+.+.+.....+.+++.+-+ .++..+++|.... .+.+..+.||+.++..+.+.+. T Consensus 1 m~~-~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:95 1 MAQ-GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIPTDILETKK 78 (274) T ss_pred CCc-ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccchhhcccce Confidence 222 23344556666644 445566655555666654332 2466899999865 4678889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .++..++.+..+.++++....+ .++.+.+.+.++.++++.+|..++.--.+.. ....+ T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~---------~~~~~------------ 137 (274) T protein:vir:95 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK---------LTVEA------------ 137 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc---------ccccc------------ Confidence 9999999888899999865554 5788889999999999999998764211100 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) .....+.+.++...+.... .....++|||..+. T Consensus 138 ----------------------------------------------~~~~~d~i~~A~~~lgd~~-~~~~~ivv~p~~~~ 170 (274) T protein:vir:95 138 ----------------------------------------------DITKLTGLQTAIDKFNDED-LEPMVLFISPLDAG 170 (274) T ss_pred ----------------------------------------------cccCHHHHHHHHHHhcccc-ccccEEEeCHHHHH Confidence 0001122222222222221 23557899999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhc Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVD 464 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~ 464 (497) .|++ +..-+++-....+ ......+.-+++.|++|+.|+.+|.++.++.. ++++..+....+.++..+... + T Consensus 171 ~L~k--~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----~ 241 (274) T protein:vir:95 171 KLRG--DATTNFTRATELG-DDVIVKGAFGEALGAVIVRSNKLEAGTAILAK--KGAVKLITKRDFFLETDRDPS----T 241 (274) T ss_pred HHHh--hcccccccccccc-ccceeccccceecCeEEEEeCCCCCceEEEEe--ccceeeeecCCcccccccccc----c Confidence 9875 3222222111111 11111223457899999999999998876533 334555667778888777553 4 Q ss_pred CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 465 GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 465 ~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ....+++.+++++++++|+++|+++ ...|| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t---k~~~~ 271 (274) T protein:vir:95 242 KTTALYSDKHYVAYLYDESKAVKIT---KGSGS 271 (274) T ss_pred ccCEEEEeEEEEEEEEcCCcEEEEE---cCCcc Confidence 5677899999999999999999988 44566 No 118 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.55 E-value=8.2e-16 Score=103.30 Aligned_cols=262 Identities=10% Similarity=0.031 Sum_probs=169.9 Q ss_pred hhhcccccCCcccccchhhH-HHHHHHhhhhHHhhccceec----CCCceEEEEeecCCccceeecccccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPG-IVEQLFYELSLADLISSRPV----TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~-ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~ 225 (497) ++.+ .....|.|+...+ +.+.+.+.+.+.+++.+-+. +|..+++|.+.. .+.+.-+.||+.++..+.++++ T Consensus 1 Ma~T---~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~-igdae~~~eg~~i~~~~lt~~~ 76 (270) T protein:vir:95 1 MTQT---KKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAY-IGAAEDLQEGVAMDTTQMSMTT 76 (270) T ss_pred CCce---ehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecC-CCccccccCCCccchhhcccch Confidence 2221 2234667776555 45666566667777765332 466799999874 6778889999999999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) .....++.+..+.++++....+ .+....+.+.++..+++++|..++.- .+|.. ..... T Consensus 77 ~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~-----l~~a~----~~~~~------------ 135 (270) T protein:vir:95 77 TKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAE-----LNKSK----QTATV------------ 135 (270) T ss_pred heeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHH-----hcccc----ccccc------------ Confidence 9999999999999999976555 46778888999999999999887631 11100 00000 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ....+.+.++...+... ...+.+++|||.++. T Consensus 136 -----------------------------------------------~~t~~~~~dA~~~lgd~-~~~~~~i~vhs~~~~ 167 (270) T protein:vir:95 136 -----------------------------------------------SADATGILDAIEVFNSE-NDEDYVLYVNPKDYN 167 (270) T ss_pred -----------------------------------------------ccCHHHHHHHHHHhccc-cCCCcEEEEcHHHHH Confidence 00011112222221111 233568999999999 Q ss_pred HHHHHhcccCcccccccccccccccccccccccccceeecCCCC-cCcEEEeeccceEEEEEeccccEEEEeccchhhhh Q lcl|Aclame:pro 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV 463 (497) Q Consensus 385 ~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~-~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~ 463 (497) .|++...- .....+....-.+..+++.|++|++++.++ .++.++ |.++++.++...++.++..+... T Consensus 168 ~Lrk~~~~------~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l--~~~gAi~~~~~~~~~vEtdRd~~---- 235 (270) T protein:vir:95 168 KLVKSLFK------VGGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFL--QRYGAMEIVNKKKPEAYTDFDIL---- 235 (270) T ss_pred HHHhhhcc------cccccccchhcccccceecceeEEEeCCCCCceeEEE--EeccceeeeecCCceeeeccchh---- Confidence 99864211 111111111112346689999998876554 566554 34566777777888888777553 Q ss_pred cCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 464 DGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 464 ~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +....+++.+++..++.+|+.+|+++++.+-+-- T Consensus 236 ~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~ 269 (270) T protein:vir:95 236 KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLE 269 (270) T ss_pred hcccEEEeeeEEEEEEEccceEEEEEecCCCCcC Confidence 4456788889999999999999999985332222 No 119 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.51 E-value=8.7e-16 Score=103.14 Aligned_cols=228 Identities=15% Similarity=0.133 Sum_probs=157.3 Q ss_pred ccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 185 ISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQ 263 (497) Q Consensus 185 ~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~ 263 (497) -+-++ .+.++++|.. .+.|.-++||.+++....+++..++..++++-.+.|+++..-.+ .+......+.++.+++ T Consensus 1 ~~~~~-~Gdtit~P~~---iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGIN-LANLCEYPND---IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) T ss_pred Ccccc-CCceEEeccc---ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHH Confidence 12222 3457899976 35678899999999999999999999999999999999976544 5778889999999999 Q ss_pred HHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccc Q lcl|Aclame:pro 264 RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSG 343 (497) Q Consensus 264 ~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 343 (497) +++|..++.--.+ +.... . T Consensus 77 ~kvD~di~~~~~~---------a~l~~---~------------------------------------------------- 95 (231) T protein:vir:73 77 NKVDDDLLKAAKT---------TSQTV---S------------------------------------------------- 95 (231) T ss_pred HhhhHHHHHhhcc---------ccccc---c------------------------------------------------- Confidence 9999987742110 00000 0 Q ss_pred ccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceee Q lcl|Aclame:pro 344 VAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVT 423 (497) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~ 423 (497) .....+.+.++....... ...+.+++|||.++..||+..+..- + ....+......+.-+++.|+||+. T Consensus 96 -------~~~t~d~i~~A~~~fgde-~~~~~vivv~p~~~~~Lrk~~~~~~--~--~~~~g~~i~~~G~iG~i~G~~Vi~ 163 (231) T protein:vir:73 96 -------TKANVDGVQAALDIFNDE-DAQAYVLIVNPKDAAKIRKDANAKN--I--GSEVGANALINGTYADVLGAQIVR 163 (231) T ss_pred -------ccccHHHHHHHHHHhccc-cccceEEEEcchHHHhhhhccchhh--h--hhhhccceeeecccceEcceEEEE Confidence 000111122222221111 2345679999999999988554321 1 111111111223445889999999 Q ss_pred cCCCCcCcEEEee--ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 424 TPLIPLGTILVGH--FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 424 s~~~~~~~~~~gd--~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) |+.+|.++.+..- +.++++.++...++.|+..+... +....+++.+.+..++.+|+.+|+++++-. T Consensus 164 S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~----~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 164 SKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV----TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cCCCCCCceeeeeEEeeccceeeeecccceeecccccc----ccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 9999998876433 34567888888899998887553 556779999999999999999999999988 No 120 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.46 E-value=3.6e-14 Score=94.26 Aligned_cols=356 Identities=14% Similarity=0.137 Sum_probs=174.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 43 FKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSA 122 (497) Q Consensus 43 ~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (497) ++.-...+...--.+-. ..+.+++....++.+.. +....++....... . T Consensus 1 ~~~~~~~~~~~~~~~~~-------~~e~k~lr~~me~~et~-----------------~e~~~~~~~~~~~e-----~-- 49 (393) T protein:vir:79 1 MENWLKQLKESGFTETQ-------VQEQKSLRTRMERGETL-----------------AEADANKLALNEEE-----T-- 49 (393) T ss_pred CchHHHHHHhccCchhH-------HHHHHHHHHHhhhhhhh-----------------hhhhhhhhhcchhH-----H-- Confidence 00000000000000000 11111111111111100 00000000000000 0 Q ss_pred hhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceec-CCCceEEEEee Q lcl|Aclame:pro 123 KAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV-TSPNLSYLTES 201 (497) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~p~~~ 201 (497) .....+.....+..... .....-.-+++++.-+||..++.-+.+...+-...-.++..+.. .+.+..+|... T Consensus 50 ------el~E~f~Kmm~G~~p~~-eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g 122 (393) T protein:vir:79 50 ------QILESFAKMMEGETPTN-EVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIG 122 (393) T ss_pred ------HHHHHHHHHhcCCCchh-heehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchh Confidence 00000000000111111 11111112333444445545555555544444455567777777 45566666543 Q ss_pred cCCccceeecccccccccc---ccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCc Q lcl|Aclame:pro 202 AAHNNAAAVAEAGTYPFSS---EEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP 277 (497) Q Consensus 202 ~~~~~a~~v~Eg~~~~~s~---~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~ 277 (497) . --++-|+||++.|... .+++.++++.+|.+..+.+|+||+.|| -++.++.-....+++++..+...+++.-+. T Consensus 123 ~--~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ 200 (393) T protein:vir:79 123 I--MRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSH 200 (393) T ss_pred e--eeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcc Confidence 2 3467899999998744 679999999999999999999999998 599999999999999999999999988654 Q ss_pred cc---cceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhh Q lcl|Aclame:pro 278 GV---NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEI 354 (497) Q Consensus 278 ~~---~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (497) +. .++.+...+-.. +....+ .-..... T Consensus 201 ghtvfDa~st~t~ahpt---------------------------------------Gr~~~~-----------~qNGTlS 230 (393) T protein:vir:79 201 GHTVFDNYSTNKLAHTT---------------------------------------GLDKNG-----------VQNDTFS 230 (393) T ss_pred cceeeeccccCccceee---------------------------------------cCCccc-----------ccccccc Confidence 43 233222111100 000000 1112234 Q ss_pred hhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccc-----------cceee Q lcl|Aclame:pro 355 AENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-----------VPVVT 423 (497) Q Consensus 355 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G-----------~pvv~ 423 (497) ++++++...++.. ..+.+..++|||-.|..+.+-.-=.+-| +.+.+ ..+.-.....+..| +.|++ T Consensus 231 leDllDm~~av~~-~hyt~svi~MHPLAWnv~AKna~me~~~--~na~g-N~~~~~~~ts~algp~~i~~~~~~nlnv~~ 306 (393) T protein:vir:79 231 AEDFLDLIIAVMA-NEYTPSDLMMHPLAWTVFAKNELMGSLQ--ANPYG-NYPAKGAPSSMALGPDSIQGRLPFNFNVNL 306 (393) T ss_pred HHHHHHHHHHHhc-ccCCcceEEEcCchhhhhhhhhhhccee--ecccc-ccCccccchhhhhchhhhccccccceeEEE Confidence 5666666666554 4567889999999999987642111111 11111 11111222223333 68999 Q ss_pred cCCCCcCc------EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeeccc-ceEEEEecCCCCC Q lcl|Aclame:pro 424 TPLIPLGT------ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPS-AFQLIQLKKGATG 496 (497) Q Consensus 424 s~~~~~~~------~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~-Af~~~~~~~~a~~ 496 (497) |+.+|=.+ ++.-|=+...+.+ .+.+++++.-++ =..|...++..+|+|+.|++.. |++...--.-+++ T Consensus 307 sPfvp~d~k~~rFd~~~Vd~NnvgvlL-V~D~i~tdq~dd----k~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~ 381 (393) T protein:vir:79 307 SPFIPLDKKSRRFDVYAVDRNNVGVLL-VRDDLKTDQWDE----KARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKS 381 (393) T ss_pred ecccccccccceeeEEEeecCCceEEE-EecCcceecccc----ccccceeeeeeeeeceeeeeCCceEEEEecceeecc Confidence 99998432 2222222222222 222333332222 2468888999999999999854 5544433333332 Q ss_pred C Q lcl|Aclame:pro 497 S 497 (497) Q Consensus 497 ~ 497 (497) - T Consensus 382 y 382 (393) T protein:vir:79 382 Y 382 (393) T ss_pred c Confidence 2 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.44 E-value=5.4e-14 Score=93.34 Aligned_cols=282 Identities=13% Similarity=0.131 Sum_probs=173.9 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeecc-----cccccccccccee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAE-----AGTYPFSSEEFAR 225 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~E-----g~~~~~s~~~~~~ 225 (497) +..-+-...+.+.+......||+.+.+.+.|++.++..++.++.+.|.++... +.+.+.+- ....+.+..+|++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~-~~~~~~~v~~~~~~~g~~~~~~t~~~ 79 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVL-GDVIMAGVGTTFSGAGAGKAAATFTK 79 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeecc-CCcccccccccccCCCccccccccce Confidence 22111122233344556788999999999999999999999988999988763 33333322 2344578899999 Q ss_pred EEeeeeeeeeechhhHHHHhh--H-H-HHHHHHHHHHHHHHHHHHHhhhhccCCCcc-ccceeccccccccchhhhhhhH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRD--A-P-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~d--s-~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~-~~Gil~~~~~~~~~~~~~~~~~ 300 (497) ++...+.+++.+.|.+.+.+- + + +...+=.+...+++..+.+..||||+.+.+ ..|++........ T Consensus 80 ~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~--------- 150 (310) T protein:vir:97 80 VNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQK--------- 150 (310) T ss_pred eeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccce--------- Confidence 999999999999999876542 2 3 333333455678999999999999998655 3466554221100 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) .. . .....+...+.+|.++.... .....+.+++||| T Consensus 151 -------------------------------------i~--~-~~~gg~~t~d~LDeLl~~v~----~~~g~p~~~l~~~ 186 (310) T protein:vir:97 151 -------------------------------------AT--T-GATGSAISFAILDELMDLVV----DKDGQVDYLTMHA 186 (310) T ss_pred -------------------------------------ee--c-CCCCCCCCHHHHHHHHHHHh----cCCCCCCEEEecH Confidence 00 0 00011112233343333321 1233577899999 Q ss_pred hHHHHHHHH-hcccCcccccccccccccccccccccccccceeecCCCCcC----------cEE---Eeec--cceEEEE Q lcl|Aclame:pro 381 RDWELLRLT-KDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG----------TIL---VGHF--APSVIQT 444 (497) Q Consensus 381 ~~~~~l~~l-kd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~----------~~~---~gd~--~~~~~~i 444 (497) .+..+|+-+ +...++.++.... ...| ..-.++.|+|++.++.+|.+ .|| +|+. .++..++ T Consensus 187 ~~~r~i~A~~R~~~~~g~~~~~~-~~~G---~~v~~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl 262 (310) T protein:vir:97 187 RTLRSYKALLRALGGASINEVVE-LPSG---AEVPAYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGL 262 (310) T ss_pred HHHHHHHHHHHHhcCCCCCCccc-cCCC---CEEeeeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceecc Confidence 987777644 3333444443321 1111 12247889999999999853 244 3543 2444444 Q ss_pred Eecc--ccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 445 ARRE--GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 445 ~~r~--~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) .... ++++..-.+.. .++-..+|+++.++.+|..|+|+++|.-..- T Consensus 263 ~~~~~~glsVr~~G~~~---~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 263 TATQAAGIQVVDVGESE---DSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred ccCCccceeEEeCCccc---CCcceeEEEEEeeeEEEecccceeeeccccC Confidence 3222 45554433221 2567789999999999999999999885444 No 122 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.43 E-value=1e-13 Score=91.86 Aligned_cols=329 Identities=11% Similarity=0.106 Sum_probs=173.1 Q ss_pred hhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEE Q lcl|Aclame:pro 120 VSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLT 199 (497) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~ 199 (497) .... ... ......... .......+.+.-+++.++|+....+++.+.+.+++++.++++++.+.+..+++ T Consensus 1 ~~~~-~~~-------~~~~n~~~~---~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~k 69 (360) T protein:vir:99 1 MSSN-STI-------DSVRNQNMN---SLSQKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQ 69 (360) T ss_pred Ccch-hHH-------HHHhhhHHH---HHHhhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccc Confidence 0000 000 000111100 11111123233357899999999999999999999999999999988887776 Q ss_pred eecCCccceeecccccccc-ccccceeEEe-eeeeeeeechhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 200 ESAAHNNAAAVAEAGTYPF-SSEEFARVYE-QVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLA 272 (497) Q Consensus 200 ~~~~~~~a~~v~Eg~~~~~-s~~~~~~v~~-~~~kia~~~~iS~ell~ds~-----~l~~~i~~~la~~~~~~~d~a~l~ 272 (497) ...+...-.--.|+...+. .+++...+.+ ..+++-..+.++.+-+++.. .+++.|.+.+++++++-++.-+++ T Consensus 70 ig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~ 149 (360) T protein:vir:99 70 FGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIR 149 (360) T ss_pred cccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhh Confidence 5432111011123222222 3344444444 23455566677777666532 367999999999999999999999 Q ss_pred cCCCcc--------------ccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 273 GGGYPG--------------VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA 338 (497) Q Consensus 273 G~g~~~--------------~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (497) |+.... ..||+..+...... +..... ..+......... T Consensus 150 g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~--------------------------id~a~d--~t~~~~~~~~~~ 201 (360) T protein:vir:99 150 AGASSGNLQSIGGAAELDNTFKGWIARAEGDAQS--------------------------VDDAGD--STRIGLEDTATA 201 (360) T ss_pred ccchhcccccCcccchhhhhhHHHHHHhhcccch--------------------------hhcccc--cccccccccccc Confidence 875421 01111111100000 000000 000000000000 Q ss_pred ccccc--cc--ccccchhhhhhhhHHhhhhhhhhhccCC----ceEEEehhHHHHHHHHhcccCcccccccccccccccc Q lcl|Aclame:pro 339 GSGSG--VA--GSYPTAAEIAENVFDAFVDIQLTLFQTP----NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPV 410 (497) Q Consensus 339 ~~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~ 410 (497) ..... .. .+..........++.........-|.+. -.|+|+|.+...++..-..-.-.+... . ... T Consensus 202 ~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~-~-----l~g 275 (360) T protein:vir:99 202 DADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSA-V-----IFG 275 (360) T ss_pred ccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCcccchh-h-----eec Confidence 00000 00 0000001112233333333333334332 279999998777665432111111100 0 011 Q ss_pred cccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE--EeeeccEeecccceEEE Q lcl|Aclame:pro 411 NGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA--EERLGLLVYRPSAFQLI 488 (497) Q Consensus 411 ~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~--~~r~~~~v~~~~Af~~~ 488 (497) ...-+.+|+|++..+.+|.+.+++-+++...|+++ .+++|+.+.+..- .......++- ...+|+.+.+++|+|.+ T Consensus 276 ~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~~--~~iri~~~~e~~~-~~~~~~~~~~~~~~~~D~~iee~~Av~~v 352 (360) T protein:vir:99 276 DSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGLY--EEMELDQSTDTDK-VHEQRLHSRNWLEGQFDFQIKEQQAGVLV 352 (360) T ss_pred ccccccceeeeEEcCCCCCCceEEeccCceeEEee--eeeEEeecccchh-hhhhceeeeEEEEEEeeEEEEecccEEEE Confidence 22235679999999999999999999999888776 5677766554431 1222323333 45689999999999999 Q ss_pred EecCCCCC Q lcl|Aclame:pro 489 QLKKGATG 496 (497) Q Consensus 489 ~~~~~a~~ 496 (497) +=...+++ T Consensus 353 t~~~~~~~ 360 (360) T protein:vir:99 353 TDLETPTA 360 (360) T ss_pred ecCCCCCC Confidence 98888888 No 123 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.39 E-value=2.5e-14 Score=95.19 Aligned_cols=290 Identities=12% Similarity=0.068 Sum_probs=168.3 Q ss_pred hhh-hhhcccccCCcccc------cchhhHHHHHHHhhhhHHhh-ccc-eecCCCceEEEEeecC--Cccceeecccccc Q lcl|Aclame:pro 148 IGQ-NPFGSTGTFAPGIL------PTFLPGIVEQLFYELSLADL-ISS-RPVTSPNLSYLTESAA--HNNAAAVAEAGTY 216 (497) Q Consensus 148 ~~~-~~~~~~~~~g~~i~------~~~~~~ii~~~~~~~~l~~~-~~~-~~~~~~~~~~p~~~~~--~~~a~~v~Eg~~~ 216 (497) ++. ....++..++.+-+ |++++.-+..+.+...|.+. ... ...++..+.|-..... .+.+.-|+||+.+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 111 11122222332222 55666655555565555553 333 3334556666543321 2457789999999 Q ss_pred ccccccceeEEe-eeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhcc---CCCcc---ccceeccccc Q lcl|Aclame:pro 217 PFSSEEFARVYE-QVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAG---GGYPG---VNGLLQRSTG 288 (497) Q Consensus 217 ~~s~~~~~~v~~-~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G---~g~~~---~~Gil~~~~~ 288 (497) |.+.++++...+ ..+|.+.-+.||+|++..+ .+..+-....++..+++..|...+.. .++.. +.++.+.. T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~-- 158 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGG-- 158 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcc-- Confidence 999999988888 5589999999999999876 57777777888899998888765431 11110 00100000 Q ss_pred cccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh Q lcl|Aclame:pro 289 FTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT 368 (497) Q Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) ....+...+... .+. ...+....-....... T Consensus 159 -------~~~~d~~~A~e~-----------------------v~~-------------------a~~~~~~a~~~~~~~~ 189 (318) T protein:vir:10 159 -------KVRTDIAIAIEQ-----------------------IST-------------------AAPTAYPAGVGSSDEY 189 (318) T ss_pred -------cccccchhhhhh-----------------------hhh-------------------hhhhhhhhhhhhhhhc Confidence 000000000000 000 0000000111112246 Q ss_pred hccCCceEEEehhHHHHHHHHhc------ccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEE Q lcl|Aclame:pro 369 LFQTPNAVVMNPRDWELLRLTKD------ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVI 442 (497) Q Consensus 369 ~~~~~~~~~~n~~~~~~l~~lkd------~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~ 442 (497) .++.++.++|||.+|..|.+-++ .++.+++.... ....-+..++|+.|+.|+.+|.|+.|+.+=.... T Consensus 190 ~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~-----~tg~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG- 263 (318) T protein:vir:10 190 FGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPD-----WTGNFPGSVMGLNVIRSRTFPIDRVLIMERGTVG- 263 (318) T ss_pred cCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccc-----ccccccceeeceEEeecCccCCCeeEEEecCCcc- Confidence 67889999999999999954433 33333333222 1223345789999999999999999988855433 Q ss_pred EEEeccccEEEEeccc--hhhhhc-CceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 443 QTARREGVTMQMTNSN--GTDFVD-GKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 443 ~i~~r~~~~i~~~~~~--~~~f~~-~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) .++|.++++.+.-+.. ..++.. ....+|+.++....|.+|+|+|+||=--+. T Consensus 264 ~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 264 FYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 3567777776543321 122333 356678888899999999999999854444 No 124 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.33 E-value=4.6e-14 Score=93.70 Aligned_cols=382 Identities=12% Similarity=0.094 Sum_probs=175.8 Q ss_pred CchHHHHHHHH-HHHHHHHHHHHHHHH----HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQG-RQLAKSIKDINADET----KTAAE---KKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDG 72 (497) Q Consensus 1 m~~~~~~~~~~-~~l~~~~~~~~~~~~----~~~~e---~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~ 72 (497) |--.++.+-+- |+..+.++.++-... -+.+. -|+...+...+.+...+.++.. -|...++..++.+.+. T Consensus 1 ~~n~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~---~e~~~~~~~~~~E~Rs 77 (410) T protein:vir:83 1 MGNATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQ---MEQAQEVNRIAFETRS 77 (410) T ss_pred CCCcccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhh---hHHHHHHHHHHHHHHH Confidence 65544444221 222222211111000 01111 1221111111111111111111 1112222222222222 Q ss_pred HHHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHH---Hhhh---hhhh Q lcl|Aclame:pro 73 LDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAF---ADGE---TAPA 146 (497) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---~~~~ 146 (497) +...+.. ..... ...+.....+-...+. ..+..+ .+.. ...+ T Consensus 78 ~~~~i~~-------~~~~~-----r~~p~~~~veyRSaGE--------------------~lkal~~~~~Gd~~A~~~~e 125 (410) T protein:vir:83 78 KGQAVDA-------AISAM-----RGSPVGTEVEYRSAGE--------------------YMLDMWNSAQGNASAADRLE 125 (410) T ss_pred HHHHHHh-------hhccC-----cCCCCCCCcccccHHH--------------------HHHHHhccCCchHHHHHHHH Confidence 2111100 00000 0000000000000000 000010 0000 0011 Q ss_pred h-hhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccce-------eecccccccc Q lcl|Aclame:pro 147 A-IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA-------AVAEAGTYPF 218 (497) Q Consensus 147 ~-~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~-------~v~Eg~~~~~ 218 (497) . .......++++..+.|+|+++.+.|+.+.+..+|..++...|.++.++.||+.+. +.+.+ .-.||...+. T Consensus 126 ~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~-~~tV~~q~~~~kqa~EGd~L~~ 204 (410) T protein:vir:83 126 VYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQ-RPAVGLQGVAGGASDEKTELDS 204 (410) T ss_pred HHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecc-cccccccccccccccccccccc Confidence 1 1222334555566778889999999999999999999999999999999988765 34332 2358999999 Q ss_pred ccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHh---hhhccCCCccccceeccccccccchh Q lcl|Aclame:pro 219 SSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEV---QLLAGGGYPGVNGLLQRSTGFTASSA 294 (497) Q Consensus 219 s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~---a~l~G~g~~~~~Gil~~~~~~~~~~~ 294 (497) .+.+|+..+...++++++..+||+.++.+ ...-+...+.|.-+.+.+-+. ++|+++-++. .......+ T Consensus 205 gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~--------~a~~~~Ta 276 (410) T protein:vir:83 205 QKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTGA--------VGYGNATA 276 (410) T ss_pred cceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh--------hhhhhccH Confidence 99999999999999999999999999865 455555556665555555543 3444432210 00000000 Q ss_pred hhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhh-hhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCC Q lcl|Aclame:pro 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYG-RVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) ..+.. + ++ ++....... ++ ..- T Consensus 277 d~~~~--------~----------i~--da~~~v~da~~~-------------------------------------~~~ 299 (410) T protein:vir:83 277 DNVAS--------A----------IW--QAAGAVYTAVKG-------------------------------------MGR 299 (410) T ss_pred HHHHH--------H----------HH--HHHHHHhhhhcc-------------------------------------cee Confidence 00000 0 00 001000000 00 111 Q ss_pred ceEEEehhHHHHHHHHhcccCccccccccc-ccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEE Q lcl|Aclame:pro 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTM 452 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i 452 (497) ..+.+.|..+..+..+- .+++..|...-+ .......+-...++|+||+..+..++|+.+|.|-. ++..|......+ T Consensus 300 ~~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~--Ai~~~eS~~gp~ 376 (410) T protein:vir:83 300 LVIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTA--AIECFEQRVGTL 376 (410) T ss_pred eeEEechhhhhhcccee-eccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccc--eeeeeecCCcee Confidence 12223333322211110 111111111111 00000122345789999999999999999998755 577777765555 Q ss_pred EEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 453 QMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 453 ~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +..+.+-...++|-- .++.+.+..|.+++-|.= | T Consensus 377 qL~d~~i~nLt~~yS-----gY~a~a~~~~~gliPv~g------~ 410 (410) T protein:vir:83 377 QVVEPSVFGLQVAYA-----GYFSTLVVNEDAIVPLVG------S 410 (410) T ss_pred EeeCCchhhhhhhhe-----eeeeeccccccceeeecc------C Confidence 555544322233222 556889999999988763 3 No 125 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.20 E-value=1.1e-11 Score=80.75 Aligned_cols=391 Identities=15% Similarity=0.103 Sum_probs=194.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKA--HQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~--~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |- + +.+.+. ..++.++ ...+++.++.+.++..++.- .+..++...+.+|++.-+.++..++.+.++++. T Consensus 1 ~~----~--s~~~~~--k~~~~ek-~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln 71 (400) T protein:vir:93 1 MR----I--SKRNMN--KPDLIEK-QNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 71 (400) T ss_pred Cc----c--cccccc--cchHHHH-HHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhh Confidence 10 0 000000 0000000 01122333333333333321 233344445666666666666655544333321 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..++ ..... +....-.+..+.. ..+........... ..+..|.. .......+.+ + T Consensus 72 ~~~E-------~~Kgk-~~mtefLkT~~A~---~~fa~~l~~nsg~s-------d~knaW~A------~l~E~gvt~t-d 126 (400) T protein:vir:93 72 AQEE-------KPKGK-DKMTNFIESQNAV---TEFFDVLKKNSGKS-------EIKNAWSA------KLAENGVTIT-D 126 (400) T ss_pred hhhh-------hcccc-hhHHHhhhhHHHH---HHHHHHHHhhcCCc-------chhhhhhh------hhhhcccccC-C Confidence 1100 00000 0000000000000 00000000000000 11111210 0111111111 1 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee-eccccccccccccceeEEeeeeeeeeec Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA-VAEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~-v~Eg~~~~~s~~~~~~v~~~~~kia~~~ 237 (497) ....+|.-++..|-..++...++.+..++.++++--+..+-++. .-+| +--|.++.++..+|..-++.|.-++.+. T Consensus 127 ~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~~~dt~---~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~ 203 (400) T protein:vir:93 127 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA---NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 203 (400) T ss_pred chhhcchHHHHHHHHhhhccCCcccceeeecCCceeeecchhhh---cccceeccCCcccceeeeeeeeccCHHHHHHHh Confidence 11244555666777777888899998888888554334344333 2345 5668899999999999999999888888 Q ss_pred hhhHHHHhh---HHHHHHHHHHHHHHHHHH-HHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 238 TITDEGLRD---APELFNFVQGRLLEGIQR-KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 238 ~iS~ell~d---s~~l~~~i~~~la~~~~~-~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) .+.+-..++ +..|.+||.++|...+-. +.+.+++-|+|+....++..-+..-... .+ T Consensus 204 ~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~------~d------------- 264 (400) T protein:vir:93 204 SLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIK------KI------------- 264 (400) T ss_pred hhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhh------hh------------- Confidence 875555553 346899999999999996 5799999999987655543221110000 00 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhccc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) .+..... ....+.+.+--+.....+.......+++.|..|+.|++|||++ T Consensus 265 ---------------------t~kt~~a---------~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~ 314 (400) T protein:vir:93 265 ---------------------TTKAKSA---------GKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQAT 314 (400) T ss_pred ---------------------hhhhhhc---------CCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCc Confidence 0000000 0001112222222233444555667899999999999999999 Q ss_pred Cccccccccccccccccccccccccc-ceeecCCCCcCc-EEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 394 GQYMGGNFFGNAYGNPVNGGKNIWGV-PVVTTPLIPLGT-ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 394 G~~~~~~~~~~~~~~~~~~~~~l~G~-pvv~s~~~~~~~-~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) |.+.|.-..... .-.+-+|+ ..|+.+..|..+ .+.-|-. |.|. -.+++ ....-.|.+|+=.+.+ T Consensus 315 ~~a~f~~~n~d~------~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek---~~i~-~~~~~----t~~sf~~~tNs~~ilv 380 (400) T protein:vir:93 315 ANANVRIKNDDT------EIASEVGVDEIIVYTGSKALKPTVLVDQK---YHID-MQDLT----KVDAFEWKTNSNMILV 380 (400) T ss_pred ceeeeeeccccc------hhhhhcccceeeeeccCCCCCceeeeehh---hhcc-ccCce----eccceeeeeccceEEe Confidence 999984322111 11123342 334455555432 2222322 3332 22222 1223346788888999 Q ss_pred EeeeccEeecccceEEEEec Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~ 491 (497) +..++|.+.-|.+-+++++. T Consensus 381 etlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 381 ETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeccceecccceeeEeeC Confidence 99999999999999999987 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.13 E-value=1.2e-11 Score=80.47 Aligned_cols=261 Identities=14% Similarity=0.080 Sum_probs=145.4 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccce----ecCCCceEEEEeecCCccceeeccccccccccccceeE Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v 226 (497) ++. -.++|..|...+++.+...+.+.++++.- ...+.++++|+.... ..+.++.++..++..+++.+.+ T Consensus 1 MA~------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:79 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcc-cccccccCCCccCccccccceE Confidence 211 11455566777888888887777776442 223568999986542 3456788888888888888888 Q ss_pred Eeeeeee-eeechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc---cCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 227 YEQVGKV-ANALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 227 ~~~~~ki-a~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~---G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) ++...+. +.-+.|++ +...+..++.++ .+.+.++++.++|..++. +.++....+. +. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~---------~~-------- 135 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSA---------PS-------- 135 (273) T ss_pred EEEEeeecccceeeccHHHHhhcccHHHH-HHHHHHHHHHHHHHHHHHHHhhccccccccc---------cc-------- Confidence 8888664 33456665 344445567774 466889999999986542 2111110000 00 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-cCCceEEEeh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~ 380 (497) +.....+.+..+...+....- ...-.++++| T Consensus 136 ------------------------------------------------~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p 167 (273) T protein:vir:79 136 ------------------------------------------------DADDAFDLIASALKELTKANVPNVGRVVVVNA 167 (273) T ss_pred ------------------------------------------------chhhHHHHHHHHHHHhhhccCCccCcEEEECH Confidence 000011112222221111111 1123678899 Q ss_pred hHHHHHHHHhccc-CcccccccccccccccccccccccccceeecCCCCcCcEE-EeeccceEEEEEeccccEEEEeccc Q lcl|Aclame:pro 381 RDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTIL-VGHFAPSVIQTARREGVTMQMTNSN 458 (497) Q Consensus 381 ~~~~~l~~lkd~~-G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~-~gd~~~~~~~i~~r~~~~i~~~~~~ 458 (497) ..+..|.+..+-. ......... ..-.+.-.+++|++|+.|+.+|.+... +..+...+.....+. ..++..+.. T Consensus 168 ~~~~~Ll~~~~~~~~~~~~~~~~----~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~~-~~~e~~r~~ 242 (273) T protein:vir:79 168 EMAFWLRSSGSKLTSADTSGDAA----GLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQ 242 (273) T ss_pred HHHHHHhhchhhhhhhhhccccc----ceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeeeh-hhhhcccCc Confidence 9998886543211 111111100 001122347999999999999965421 222333334443332 244444433 Q ss_pred hhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 459 ~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) ..| -..+++.+.++..|+||++++.++.+.+ T Consensus 243 -~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 243 -DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -ccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 223 4468899999999999999999874443 No 127 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.08 E-value=2.6e-11 Score=78.62 Aligned_cols=259 Identities=13% Similarity=0.069 Sum_probs=141.7 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccce----ecCCCceEEEEeecCCccceeeccccccccccccceeE Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v 226 (497) ++. -.++|..|...+++.+.+.+.+.++++.- ...+.++++|+.... .-+.+..++..++..+.+.+++ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc-cccccccCCCccCccccccceE Confidence 111 12345556677888888887777776442 223567999986542 3456777887777667777777 Q ss_pred Eeeeeee-eeechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc---cCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 227 YEQVGKV-ANALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 227 ~~~~~ki-a~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~---G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) ++...+. +.-+.|++ +......++++ +.+...++++.++|..++. +.+.....+ .+. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~---------~~~-------- 135 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGS---------APT-------- 135 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccc---------ccc-------- Confidence 7776553 33345665 33444456777 4566789999999987652 111110000 000 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-cCCceEEEeh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~ 380 (497) +.....+.+..+...+..... ...-.++++| T Consensus 136 ------------------------------------------------~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p 167 (273) T protein:vir:10 136 ------------------------------------------------DADDAFDLIAKALKELTKANVPNVGRVVVVNA 167 (273) T ss_pred ------------------------------------------------chhHHHHHHHHHHHHhhhcCCCcCCCEEEECH Confidence 000111222222222222111 1233578899 Q ss_pred hHHHHHHHHhcccCcccccccccccccc-cccccccccccceeecCCCCcCc---EEEeeccceEEEEEeccccEEEEec Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGN-PVNGGKNIWGVPVVTTPLIPLGT---ILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~-~~~~~~~l~G~pvv~s~~~~~~~---~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) ..+..|.+...-. ......+.... -.+.-.++.|++|+.|+++|.+. .+.|- ..+.....+. ..++..+ T Consensus 168 ~~~~~L~~~~~~~----~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~--~~A~~~a~q~-~~~e~~r 240 (273) T protein:vir:10 168 EMAFWLRSSGSKL----TSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH--PSAAAYVSQI-DTVEALR 240 (273) T ss_pred HHHHHHhcchhhh----hhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEe--ccceeeeeee-ehhhccc Confidence 9999886542211 11011000011 01223479999999999999753 33333 3333343322 2444333 Q ss_pred cchhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 457 SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 457 ~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) .. ..| -..+++.+.+|+.|+||++++.++.+.+ T Consensus 241 ~~-~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CC-Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 33 233 3458888999999999999999874444 No 128 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.08 E-value=2.6e-11 Score=78.62 Aligned_cols=259 Identities=13% Similarity=0.069 Sum_probs=141.7 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhccce----ecCCCceEEEEeecCCccceeeccccccccccccceeE Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v 226 (497) ++. -.++|..|...+++.+.+.+.+.++++.- ...+.++++|+.... .-+.+..++..++..+.+.+++ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc-cccccccCCCccCccccccceE Confidence 111 12345556677888888887777776442 223567999986542 3456777887777667777777 Q ss_pred Eeeeeee-eeechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc---cCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 227 YEQVGKV-ANALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA---GGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 227 ~~~~~ki-a~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~---G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) ++...+. +.-+.|++ +......++++ +.+...++++.++|..++. +.+.....+ .+. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~---------~~~-------- 135 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGS---------APT-------- 135 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccc---------ccc-------- Confidence 7776553 33345665 33444456777 4566789999999987652 111110000 000 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-cCCceEEEeh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNP 380 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~ 380 (497) +.....+.+..+...+..... ...-.++++| T Consensus 136 ------------------------------------------------~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p 167 (273) T protein:vir:10 136 ------------------------------------------------DADDAFDLIAKALKELTKANVPNVGRVVVVNA 167 (273) T ss_pred ------------------------------------------------chhHHHHHHHHHHHHhhhcCCCcCCCEEEECH Confidence 000111222222222222111 1233578899 Q ss_pred hHHHHHHHHhcccCcccccccccccccc-cccccccccccceeecCCCCcCc---EEEeeccceEEEEEeccccEEEEec Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGN-PVNGGKNIWGVPVVTTPLIPLGT---ILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~-~~~~~~~l~G~pvv~s~~~~~~~---~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) ..+..|.+...-. ......+.... -.+.-.++.|++|+.|+++|.+. .+.|- ..+.....+. ..++..+ T Consensus 168 ~~~~~L~~~~~~~----~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~--~~A~~~a~q~-~~~e~~r 240 (273) T protein:vir:10 168 EMAFWLRSSGSKL----TSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH--PSAAAYVSQI-DTVEALR 240 (273) T ss_pred HHHHHHhcchhhh----hhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEe--ccceeeeeee-ehhhccc Confidence 9999886542211 11011000011 01223479999999999999753 33333 3333343322 2444333 Q ss_pred cchhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 457 SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 457 ~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) .. ..| -..+++.+.+|+.|+||++++.++.+.+ T Consensus 241 ~~-~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CC-Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 33 233 3458888999999999999999874444 No 129 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.99 E-value=1.8e-11 Score=79.53 Aligned_cols=298 Identities=14% Similarity=0.122 Sum_probs=152.7 Q ss_pred HHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccc Q lcl|Aclame:pro 135 MGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEA 213 (497) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg 213 (497) ....... ...+..+....+.+++.-.+....|..++.....+.+.++++.++.++. ++++++|+. + ..++.....| T Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G-~~~~~~~~~G 77 (345) T protein:vir:22 1 MASMTGG-QQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-G-RTQAAYLAPG 77 (345) T ss_pred Ccccccc-hhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-c-ceEEEeeecC Confidence 0000000 0000000111111111113445778888989899999999999999887 557889976 3 3567888888 Q ss_pred cccccc--ccccee--EEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----cCC-----Ccccc Q lcl|Aclame:pro 214 GTYPFS--SEEFAR--VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGG-----YPGVN 280 (497) Q Consensus 214 ~~~~~s--~~~~~~--v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~----G~g-----~~~~~ 280 (497) +....+ ++..++ +++.-.+++.+..-.-+=.+...++.+.+.+++++++++..|+.++. +.. ++.|. T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~ 157 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIE 157 (345) T ss_pred CCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 776543 577787 44444444443222111122224788999999999999999998762 111 11222 Q ss_pred ceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHH Q lcl|Aclame:pro 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFD 360 (497) Q Consensus 281 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (497) |+-..........+.. ............+.++. T Consensus 158 ~~~~~~~~~~~~~g~~-----------------------------------------------~t~~~~~~~~~~~ai~~ 190 (345) T protein:vir:22 158 GLGTATVIETTQNKAA-----------------------------------------------LTDQVALGKEIIAALTK 190 (345) T ss_pred cccccccccccccccc-----------------------------------------------ccccccCHHHHHHHHHH Confidence 2211110000000000 00000001112233333 Q ss_pred hhhhhhhhhcc-CCceEEEehhHHHHHHHHhcc-cCcccccccccccccccc-cccccccccceeecCCCCcCc------ Q lcl|Aclame:pro 361 AFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDA-NGQYMGGNFFGNAYGNPV-NGGKNIWGVPVVTTPLIPLGT------ 431 (497) Q Consensus 361 ~~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~-~G~~~~~~~~~~~~~~~~-~~~~~l~G~pvv~s~~~~~~~------ 431 (497) +...+....-. ..-..+++|..+..|..-+.- ...|.. .+... +.-.+++|++|+.|+++|.+. T Consensus 191 a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~-------~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~ 263 (345) T protein:vir:22 191 ARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAA-------LIDPEKGSIRNVMGFEVVEVPHLTAGGAGTARE 263 (345) T ss_pred HHHHhhhcCCCccCCEEEeChHHHHHHhcccccccccccc-------ccccccceEEEEeceEEEecccccccccCcccc Confidence 32222221111 123567899999977543321 122221 11111 123478999999999987421 Q ss_pred -----------------EEEee-------ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEE Q lcl|Aclame:pro 432 -----------------ILVGH-------FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQL 487 (497) Q Consensus 432 -----------------~~~gd-------~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~ 487 (497) ...++ |.+.+...+.-.+++++..+... +|.. .+++..=++..++||++.+. T Consensus 264 ~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~-~~~d---~I~~~~a~G~~vlRPeaa~~ 339 (345) T protein:vir:22 264 GTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN-FQAD---QIIAKYAMGHGGLRPEAAGA 339 (345) T ss_pred CcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh-HHHH---HHHHHHhcCCcccccceeEE Confidence 00111 11222333344456667666442 3332 46666669999999999999 Q ss_pred EEecCC Q lcl|Aclame:pro 488 IQLKKG 493 (497) Q Consensus 488 ~~~~~~ 493 (497) |.++-- T Consensus 340 i~~~~~ 345 (345) T protein:vir:22 340 VVFKVE 345 (345) T ss_pred EEEeeC Confidence 999988 No 130 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.98 E-value=3.5e-11 Score=77.87 Aligned_cols=296 Identities=13% Similarity=0.052 Sum_probs=153.8 Q ss_pred Hhhhhh-hhhhhhhhhcccccC-CcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccccc Q lcl|Aclame:pro 139 ADGETA-PAAIGQNPFGSTGTF-APGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGT 215 (497) Q Consensus 139 ~~~~~~-~~~~~~~~~~~~~~~-g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~~ 215 (497) ..+... .+.-.+...+...+. -.+....|..++.....+.+.++++.++.++. +.++.+|+.-. .++..+..|.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~--~~~~~~~~g~~ 78 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR--TKGYYLAPGEN 78 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecc--eeeeeeccccC Confidence 000000 000011111111111 12345778888988888888899999988765 56788897533 45677777777 Q ss_pred ccc--ccccceeEEeeeeeee-eechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----cCC-----Cccccce Q lcl|Aclame:pro 216 YPF--SSEEFARVYEQVGKVA-NALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGG-----YPGVNGL 282 (497) Q Consensus 216 ~~~--s~~~~~~v~~~~~kia-~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~----G~g-----~~~~~Gi 282 (497) ... .++..+++++..-++- .-..|.+ +-.+...++.+.+.+..++++++..|+.++. +.. .+.+.|+ T Consensus 79 l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:88 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) T ss_pred CCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCc Confidence 654 3577787777665431 1223322 1122223688889999999999999998752 111 1112232 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhh Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) .........+........ .......+.++.+. T Consensus 159 ~~~~~~~~~~~~~~~~~~------------------------------------------------~~~~~~~~~i~~a~ 190 (347) T protein:vir:88 159 GQAVVLNIGAAADLVDVE------------------------------------------------ARGKAILKGLTLAR 190 (347) T ss_pred cccccccccccccccchh------------------------------------------------hhHHHHHHHHHHHH Confidence 111110000000000000 00000111222222 Q ss_pred hhhhhhhc-cCCceEEEehhHHHHHHHHhc-ccCcccccccccccccccccccccccccceeecCCCCcCc--------- Q lcl|Aclame:pro 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKD-ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--------- 431 (497) Q Consensus 363 ~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd-~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~--------- 431 (497) ..+....- ...-.++++|..|..|.+... ....|.-.. ....+.-..++|++|+.|+++|.+. T Consensus 191 ~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~------~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~ 264 (347) T protein:vir:88 191 ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI------DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADG 264 (347) T ss_pred HHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhcccc------chhcceeeeeccceEEEeeccccccccccccccc Confidence 11111111 123356778988887754321 112222111 1112233578999999999998421 Q ss_pred ----------------EEEeeccceEEEEEec--------cccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEE Q lcl|Aclame:pro 432 ----------------ILVGHFAPSVIQTARR--------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQL 487 (497) Q Consensus 432 ----------------~~~gd~~~~~~~i~~r--------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~ 487 (497) -+-+||+.....++-+ .+++++..+... .| .+ .+++...++..++||++.+. T Consensus 265 ~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~-~~-~d--~i~~~~~~G~~~~rPe~a~~ 340 (347) T protein:vir:88 265 VAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE-FQ-AD--QIIGKYAMGHGGLRPEAAGA 340 (347) T ss_pred ccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechh-hH-HH--HhhhhhhhcCceeccceEEE Confidence 1335666554444433 334555554332 23 22 57788889999999999999 Q ss_pred EEecCCC Q lcl|Aclame:pro 488 IQLKKGA 494 (497) Q Consensus 488 ~~~~~~a 494 (497) +.++.+| T Consensus 341 ~~~~~a~ 347 (347) T protein:vir:88 341 LVFTPAA 347 (347) T ss_pred EEeCCCC Confidence 9999999 No 131 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.96 E-value=1.1e-10 Score=75.23 Aligned_cols=278 Identities=12% Similarity=0.082 Sum_probs=163.1 Q ss_pred hhhcccccCCcccccc---hhhHHHHHHHhhhhHHhhccceecCC---CceEEEEeecCCccceeeccc-cccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEA-GTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~---~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~~a~~v~Eg-~~~~~s~~~~ 223 (497) +..--..+.|.++..+ +.+.+++...+...-+.++++....+ .++.|.+... .+.+.|++.+ ..+|..+..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDG-VGIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeec-cCceeEeCCCccccceeeccc Confidence 1111112334445433 34567776666666677666554222 3567766655 4678898876 4589999999 Q ss_pred eeEEeeeeeeeeechhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhh Q lcl|Aclame:pro 224 ARVYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 224 ~~v~~~~~kia~~~~iS~ell~ds----~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~ 299 (497) +......+.++..+.++.+=|+.+ .++..--....++++...+|+.+++|+..-+..|++|.++....+....|. T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~- 158 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWS- 158 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCcc- Confidence 999999999999999986555543 247777778888999999999999999887889999987653222111110 Q ss_pred HHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhh--hhccCCceEE Q lcl|Aclame:pro 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQL--TLFQTPNAVV 377 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 377 (497) .....++++..++..+.. .+...|..++ T Consensus 159 --------------------------------------------------~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~ 188 (296) T protein:vir:10 159 --------------------------------------------------QPTTAVSDITSLLDIIETSTNGQHRATHLL 188 (296) T ss_pred --------------------------------------------------CHHHHHHHHHHHHHHHHHhhCceecceeEE Confidence 011334555555554433 3567788899 Q ss_pred EehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCC-cCcEEEeeccceEEEEEeccccEEEEec Q lcl|Aclame:pro 378 MNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 378 ~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~-~~~~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) |+|..+..|...-...|.-++.-... .+..-+|.+.|...+.... ....++.+-+.-.+.+..-+.++. .. T Consensus 189 L~p~~~~~L~~~~~~~~~t~l~~ik~------~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~--~~ 260 (296) T protein:vir:10 189 LPTTARRIMQNLVPGTSVSYGEFFRQ------NNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNA--LP 260 (296) T ss_pred eCHHHHHHHhhccCCCCccHHHHHHH------hcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcceee--ec Confidence 99999988865544444333222110 1112233344443332211 111233333332333332233322 21 Q ss_pred cchhhhhcCceEEEEEeeec-cEeecccceEEE---Eec Q lcl|Aclame:pro 457 SNGTDFVDGKVTVRAEERLG-LLVYRPSAFQLI---QLK 491 (497) Q Consensus 457 ~~~~~f~~~~v~~r~~~r~~-~~v~~~~Af~~~---~~~ 491 (497) .. ...=...+++..|++ ..+++|.||+++ ||. T Consensus 261 ~e---~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 261 AQ---PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred cc---ccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 11 112245577789995 788899999999 665 No 132 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.93 E-value=1.2e-10 Score=74.97 Aligned_cols=295 Identities=14% Similarity=0.112 Sum_probs=155.0 Q ss_pred Hhhhhhhhhh-hhhhhc-ccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccccc Q lcl|Aclame:pro 139 ADGETAPAAI-GQNPFG-STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGT 215 (497) Q Consensus 139 ~~~~~~~~~~-~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~~ 215 (497) ..+....... ++...+ .+++.-.+....|..++.....+.+.++++.++.++. +.++.+|+.. ..++..+..|.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG--~~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLG--RTKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeecc--ceeEeeeecCcC Confidence 0000001111 111111 1111111345778889999999999999999988865 5678999753 356788888887 Q ss_pred ccc--ccccceeEEeeeeee--eeechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCc-----cccc Q lcl|Aclame:pro 216 YPF--SSEEFARVYEQVGKV--ANALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYP-----GVNG 281 (497) Q Consensus 216 ~~~--s~~~~~~v~~~~~ki--a~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~----G~g~~-----~~~G 281 (497) ... .++..++.++..-++ +.+ .|.+ +=.+...++.+.+.+..++++++..|+.++. +.... .+.| T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADV-LIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAG 157 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhh-hhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 754 467888777755443 222 2221 1111223688889999999999999998752 11100 0111 Q ss_pred eeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHh Q lcl|Aclame:pro 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) ............... ............+.++.+ T Consensus 158 ~~~~~~v~i~~~~~~-----------------------------------------------~~~~~~~~~~~~d~i~~a 190 (347) T protein:vir:94 158 LGKAHVLEVGDQATL-----------------------------------------------QGDQVKLGQAIIAQLTLA 190 (347) T ss_pred CCcceeEeeeccccc-----------------------------------------------cccccccHHHHHHHHHHH Confidence 100000000000000 000000011112223322 Q ss_pred hhhhhhhhcc-CCceEEEehhHHHHHHHHhccc-CcccccccccccccccccccccccccceeecCCCCcCc-------- Q lcl|Aclame:pro 362 FVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------- 431 (497) Q Consensus 362 ~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~-G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-------- 431 (497) ...+....-. .+-.+++.|..+..|.+..+.. +.|-.. .+...+.-.++.|++|+.|+++|.+. T Consensus 191 ~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~------~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~ 264 (347) T protein:vir:94 191 RAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQAL------IDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEE 264 (347) T ss_pred HHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccc------cccccceeEEeeceEEEEcCccccccCccccccc Confidence 2222222111 1334556899998886543322 222111 11222334578999999999998421 Q ss_pred -----------------EEEeeccceEEEEEec--------cccEEEEeccchhhhhcCceEEEEEeeeccEeecccceE Q lcl|Aclame:pro 432 -----------------ILVGHFAPSVIQTARR--------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQ 486 (497) Q Consensus 432 -----------------~~~gd~~~~~~~i~~r--------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~ 486 (497) -+=+||+.....++-+ .++++++.+... ++.+ .+.+..-++..++||++.+ T Consensus 265 ~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~--~~~~--~i~~~~a~G~g~~rPe~a~ 340 (347) T protein:vir:94 265 GVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN--FQAD--QIIAKYAMGHGGLRPEACG 340 (347) T ss_pred ccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechh--hhhh--hhhhhhhhcCcccccceeE Confidence 1335666655555433 355566554332 2233 3555666899999999999 Q ss_pred EEEecCC Q lcl|Aclame:pro 487 LIQLKKG 493 (497) Q Consensus 487 ~~~~~~~ 493 (497) .+.++.| T Consensus 341 ~i~~~~a 347 (347) T protein:vir:94 341 ALVFKKA 347 (347) T ss_pred EEEecCC Confidence 9999999 No 133 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.92 E-value=7.1e-10 Score=70.74 Aligned_cols=297 Identities=11% Similarity=0.046 Sum_probs=147.5 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ....+..++...+.+++.-.+....+..++.....+.+.++++..+.++.+ +++++|+.. ..++....-|+...-+. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG--~~~~~~~~~G~~ld~~~ 78 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIG--ETELQVLSPGKSPDASP 78 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeee--eeEEeeeccCcccCCCC Confidence 112222222223333333334457788888899989999999999998865 578999863 24455555555554455 Q ss_pred ccceeEEeeeeee--eeechhhHHHHhhHHH-HHHHHHHHHHHHHHHHHHhhhhc---cCCCcc-----ccceecccccc Q lcl|Aclame:pro 221 EEFARVYEQVGKV--ANALTITDEGLRDAPE-LFNFVQGRLLEGIQRKEEVQLLA---GGGYPG-----VNGLLQRSTGF 289 (497) Q Consensus 221 ~~~~~v~~~~~ki--a~~~~iS~ell~ds~~-l~~~i~~~la~~~~~~~d~a~l~---G~g~~~-----~~Gil~~~~~~ 289 (497) +..++.++..-++ +......-+=.++..+ +.+.+..++.+++++..|+.++- ..+..+ -.++....+.. T Consensus 79 ~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~ 158 (364) T protein:vir:10 79 TEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFS 158 (364) T ss_pred cccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcce Confidence 6666666655433 2222111111122234 67888899999999999998741 111000 01111110000 Q ss_pred ccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh- Q lcl|Aclame:pro 290 TASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT- 368 (497) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 368 (497) . .......... .......+.++.+...+... T Consensus 159 i-~~~~~a~~~~-----------------------------------------------~~~~~l~~ai~~a~~~LdEkd 190 (364) T protein:vir:10 159 I-HIVGLASSFL-----------------------------------------------TSPQYMMAAIEMAMEQQTEQE 190 (364) T ss_pred e-eecccCcchh-----------------------------------------------hhHHHHHHHHHHHHHHHhhcC Confidence 0 0000000000 00000111111111111110 Q ss_pred hccCCceEEEehhHHHHHHHHhcccCccccccccc--ccccccccccccccccceeecCCCCcC---------------- Q lcl|Aclame:pro 369 LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG--NAYGNPVNGGKNIWGVPVVTTPLIPLG---------------- 430 (497) Q Consensus 369 ~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~--~~~~~~~~~~~~l~G~pvv~s~~~~~~---------------- 430 (497) .-...-+.+++|..|..|.+-. +.+ .-.+. +....-.+.-..++|+||+.|+++|.. T Consensus 191 VP~~~R~~vv~P~~y~~Ll~~~----~lv-n~d~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls 265 (364) T protein:vir:10 191 VDTSELCGLMPWTAFNCLRDAD----RIV-DKSYTIAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLS 265 (364) T ss_pred CCccccEEEeChHHHHHHhcCC----ccc-cccccccCCCccccceeEEEeceEEEeccccccccccccccccccccccc Confidence 0112236778999998876521 111 10100 011111222347899999999999841 Q ss_pred -----cE--EEeeccceEEEEEec--------cccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCC Q lcl|Aclame:pro 431 -----TI--LVGHFAPSVIQTARR--------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 431 -----~~--~~gd~~~~~~~i~~r--------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~ 495 (497) .- ..|||......++-+ .+++.++.++.. .| ...+.+..-++..++||++++.++...+++ T Consensus 266 ~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~-~~---~~~ida~~a~G~g~lRPeaa~~i~~~~~~~ 341 (364) T protein:vir:10 266 NAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK-EK---TWYIDTFLAEGAIPDRWEAVAVVTAADTAE 341 (364) T ss_pred cccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc-ee---eeeeeeehcccCcccCccceEEEEecCCCC Confidence 11 126666655555544 456666554432 22 222334455899999999999987555444 Q ss_pred CC Q lcl|Aclame:pro 496 GS 497 (497) Q Consensus 496 ~~ 497 (497) -- T Consensus 342 ~~ 343 (364) T protein:vir:10 342 LA 343 (364) T ss_pred Cc Confidence 33 No 134 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.87 E-value=2e-10 Score=73.73 Aligned_cols=298 Identities=14% Similarity=0.100 Sum_probs=149.2 Q ss_pred HHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccc Q lcl|Aclame:pro 135 MGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEA 213 (497) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg 213 (497) ....... ...+..+....+.+++.-.+....|..++.....+.+.++++.++.++. ++++++|+. + ..++..+..| T Consensus 1 ma~~~~~-~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G-~~~~~~~~~G 77 (344) T protein:vir:10 1 MANMTGG-QQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-G-RTQAAYLAPG 77 (344) T ss_pred Ccccccc-ccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-c-eeEEEeeecC Confidence 0000000 0000000011111111122344678888988899999999999999886 557889976 3 3457777778 Q ss_pred cccccc--cccceeEEeeeee--eeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----cCCC-----cccc Q lcl|Aclame:pro 214 GTYPFS--SEEFARVYEQVGK--VANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGY-----PGVN 280 (497) Q Consensus 214 ~~~~~s--~~~~~~v~~~~~k--ia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~----G~g~-----~~~~ 280 (497) +..+-+ ++.-+++++..-+ +..+..-.-+=.+...++.+.+.++..+++++..|+.++. +... ..|. T Consensus 78 ~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~ 157 (344) T protein:vir:10 78 ENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENIT 157 (344) T ss_pred CCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 877543 4677776555433 3332211111111223788899999999999999987752 1111 1122 Q ss_pred ceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHH Q lcl|Aclame:pro 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFD 360 (497) Q Consensus 281 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (497) |.-..........+... ...........+.++. T Consensus 158 g~~~~~~~~~~~~~~~~-----------------------------------------------t~~~~~~~~~~~~i~~ 190 (344) T protein:vir:10 158 GLGTATVIETTQDKTTL-----------------------------------------------TDQVALGKEIIAALTK 190 (344) T ss_pred cccccceeecccccccc-----------------------------------------------cchhhhHHHHHHHHHH Confidence 21111000000000000 0000000111222232 Q ss_pred hhhhhhhhhcc-CCceEEEehhHHHHHHHHhccc-Cccccccccccccccccc-ccccccccceeecCCCCcCc------ Q lcl|Aclame:pro 361 AFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVN-GGKNIWGVPVVTTPLIPLGT------ 431 (497) Q Consensus 361 ~~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~-G~~~~~~~~~~~~~~~~~-~~~~l~G~pvv~s~~~~~~~------ 431 (497) +...+....-. ..-..+++|..+..|..-+.-+ +.|. +.+.... .-.+++|++|+.|+++|.+. T Consensus 191 a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~-------~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~ 263 (344) T protein:vir:10 191 ARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYA-------ALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSRE 263 (344) T ss_pred HHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccc-------cccceeeeEEEEEeceEEEeccccccccCCcccc Confidence 22222222211 1234667999988775332211 2221 1111111 22468999999999998431 Q ss_pred ---------------EEEeeccceEEEEE--------eccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEE Q lcl|Aclame:pro 432 ---------------ILVGHFAPSVIQTA--------RREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLI 488 (497) Q Consensus 432 ---------------~~~gd~~~~~~~i~--------~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~ 488 (497) ....+|+..+-.++ ...+++++..+.. .+|.. .+++.+=++..++||++.+.+ T Consensus 264 ~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~-~~~~d---~i~g~~~~G~~vlRPe~a~~v 339 (344) T protein:vir:10 264 GTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA-NFQAD---QIIAKYAMGHGGLRPEAAGAV 339 (344) T ss_pred cccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch-hHHHH---HHHHHhhcccceecccceEEE Confidence 11234544322222 2334466665543 33432 456666699999999999888 Q ss_pred EecCC Q lcl|Aclame:pro 489 QLKKG 493 (497) Q Consensus 489 ~~~~~ 493 (497) +|++- T Consensus 340 ~~~~~ 344 (344) T protein:vir:10 340 VFKTK 344 (344) T ss_pred EeecC Confidence 88887 No 135 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.85 E-value=4.4e-10 Score=71.88 Aligned_cols=294 Identities=12% Similarity=0.077 Sum_probs=153.1 Q ss_pred HhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccccccc Q lcl|Aclame:pro 139 ADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~~~~ 217 (497) ..... .+.......+.+++.-.+....|..++.....+.+.++++..+.++. ++++.+|+. + ..++....-|+.+. T Consensus 1 m~~~~-~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G-~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPA-ANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-G-ASTIAGRKAGEELV 77 (334) T ss_pred CCCCc-CCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-c-ceeeeeecCCCCCC Confidence 00000 01111111122222112334778889999999999999999999886 557899976 3 35677777788887 Q ss_pred cccccceeEEeeeeeee-eechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhh----ccCCCccc--------cc Q lcl|Aclame:pro 218 FSSEEFARVYEQVGKVA-NALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLL----AGGGYPGV--------NG 281 (497) Q Consensus 218 ~s~~~~~~v~~~~~kia-~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l----~G~g~~~~--------~G 281 (497) ...++.++.++..-.+- .-..|.+ +++. .++.+.+.+.+++++++..|++++ .+.....| .| T Consensus 78 ~~~~~~~~~~l~ID~~l~~~~~Vdd--iD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 78 VQKNVSDKLNLTVDTVLYARHFFDK--FDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CCCcccCceEEEEeeeeehhhhHhh--HHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 77777777777655421 1122221 2232 368999999999999999999764 22211111 11 Q ss_pred eeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHh Q lcl|Aclame:pro 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDA 361 (497) Q Consensus 282 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 361 (497) +.......-.+.... .........++.+ T Consensus 156 ~~~~~~~~g~~~~~~----------------------------------------------------~~~~~l~~a~~~a 183 (334) T protein:vir:80 156 ILLPSTISGLAADAA----------------------------------------------------ADADVLVAAHRQG 183 (334) T ss_pred cceeecccccccchh----------------------------------------------------hhHHHHHHHHHHH Confidence 111110000000000 0000001111111 Q ss_pred hhhhhhhhc----cCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc------ Q lcl|Aclame:pro 362 FVDIQLTLF----QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------ 431 (497) Q Consensus 362 ~~~~~~~~~----~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------ 431 (497) ...+....- ...-..+++|..|..|..-+.-..+- |..... ..+.-...-.++.|+||+.|+++|... T Consensus 184 ~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d-~~~s~~-~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~ 261 (334) T protein:vir:80 184 VEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVE-FGAKEG-GNSFVGGRIAMLNGVRVVETPRFPQSAITANAL 261 (334) T ss_pred HHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccce-eccccc-cccccceeEEEEeceEEEeecCCCCcccccccc Confidence 111111111 12346788999999886532211110 100000 011112223478899999999999542 Q ss_pred -----EEEeeccceEEEEEeccc--------cEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCC Q lcl|Aclame:pro 432 -----ILVGHFAPSVIQTARREG--------VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 432 -----~~~gd~~~~~~~i~~r~~--------~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~ 495 (497) .+-|||+..+..++-+.. ++.+..++.. .|.. .+.+..-++..++||++++.++|+.+-- T Consensus 262 g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~~d---~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 262 GADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK-DFGH---YLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred ccccccccccccceEEEEEeCceEEEEEEeecceeeeechh-hHHH---HHHHHHHcCCceeccceEEEEEEeeecC Confidence 445677665544444332 3333332221 2221 1333344799999999999999988777 No 136 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.84 E-value=1.8e-10 Score=73.97 Aligned_cols=299 Identities=13% Similarity=0.074 Sum_probs=148.6 Q ss_pred Hhhhhhhhhh-hhhhhc-ccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccccc Q lcl|Aclame:pro 139 ADGETAPAAI-GQNPFG-STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGT 215 (497) Q Consensus 139 ~~~~~~~~~~-~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~~ 215 (497) .......... ++...+ .+++.-.+.+..|..++.....+.+.++++.++.+.. ++++.+|+... .++..+..|.. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~--~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR--TKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccc--eeeeeecCCCC Confidence 0000000000 111111 1111111334777888888888888899999987765 56788888643 45667777776 Q ss_pred ccc--ccccceeEEeee--eeeeeechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc-----cCCCccccceecc Q lcl|Aclame:pro 216 YPF--SSEEFARVYEQV--GKVANALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA-----GGGYPGVNGLLQR 285 (497) Q Consensus 216 ~~~--s~~~~~~v~~~~--~kia~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~-----G~g~~~~~Gil~~ 285 (497) ++. .+++.++.++.. .++... .|.+ +=.+...++.+.+.+..++++++..|+.++. +.....+.+.... T Consensus 79 l~~~~~~~~~~e~~ltiD~~~y~~~-~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:33 79 LDDKRKDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEG 157 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhH-HHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Confidence 654 345666655543 333221 1111 1111123688889999999999999998862 1111111100000 Q ss_pred ccc---cccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhh Q lcl|Aclame:pro 286 STG---FTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 286 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) .+. ........ +.............+.++.+. T Consensus 158 ~~~~~~~~~~~~~t---------------------------------------------g~~~d~~~~a~~i~~~i~~a~ 192 (347) T protein:vir:33 158 LGKPTVLTLVKPTT---------------------------------------------GSLTDPVELGKAIIAQLTIAR 192 (347) T ss_pred cccccccccccccc---------------------------------------------ccccchhhhHHHHHHHHHHHH Confidence 000 00000000 000000000011122223222 Q ss_pred hhhhhhhc-cCCceEEEehhHHHHHHHHhcc-cCcccccccccccccccccccccccccceeecCCCCcCcE-------- Q lcl|Aclame:pro 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKDA-NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI-------- 432 (497) Q Consensus 363 ~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~-~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~-------- 432 (497) ..+....- ...-..+++|..+..|.+...- +..|.... ....+.-.+++|++|+.|+++|.+.+ T Consensus 193 ~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~------~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ 266 (347) T protein:vir:33 193 ASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALL------DPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAP 266 (347) T ss_pred HHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccc------ccccceeEEEeceeEEEecccccCcccccccccc Confidence 22222221 1234577899988887643221 12221111 11112234789999999999986421 Q ss_pred --------------EEeeccceE--------EEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEe Q lcl|Aclame:pro 433 --------------LVGHFAPSV--------IQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQL 490 (497) Q Consensus 433 --------------~~gd~~~~~--------~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~ 490 (497) .-++|+..+ ...+.-..++++..+... +|- -.+++...++..++||++.+.|.+ T Consensus 267 ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~-~~~---d~i~~~~~~G~~vlrP~~av~i~~ 342 (347) T protein:vir:33 267 ADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN-YQA---DQIIAKYAMGHGGLRPEAAGAIVL 342 (347) T ss_pred ccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchh-hhh---HhhhhhhhcCCceecccceEEEec Confidence 112332221 112233444666665432 232 347777778999999999999999 Q ss_pred cCCCC Q lcl|Aclame:pro 491 KKGAT 495 (497) Q Consensus 491 ~~~a~ 495 (497) +..+. T Consensus 343 ~~~~~ 347 (347) T protein:vir:33 343 PKVSE 347 (347) T ss_pred CCCCC Confidence 99999 No 137 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.80 E-value=2.6e-10 Score=73.11 Aligned_cols=265 Identities=14% Similarity=0.078 Sum_probs=137.4 Q ss_pred hhhcccccCCcccccchhhHHHHHH-HhhhhHHhhc---cceecCC-CceEEEEeecCCccceeeccccccccccccce- Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQL-FYELSLADLI---SSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFA- 224 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~-~~~~~l~~~~---~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~- 224 (497) ++.+.......+.++. +-+++... ..-..|++++ +..|+.. ..+++|+..- .+.+.-|+||+.+|.++.+.. T Consensus 1 mAe~nlt~~~dL~~~~-sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~-tgda~dVaEGe~Iplskvt~~~ 78 (295) T protein:vir:99 1 MAEKNLNTMADLGDIK-SIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEV-TLDQTDPGEGETIPLSKVTRTK 78 (295) T ss_pred CCCcccccHhhccCce-eehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeee-ecccccccCCcccchhhheeee Confidence 2222111111222222 22223222 2223455544 6677764 4699999765 577888999999999998876 Q ss_pred --eEEeeeeeeeeechhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhH Q lcl|Aclame:pro 225 --RVYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 225 --~v~~~~~kia~~~~iS~ell~ds--~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) ..+++.+|++.-+ |.|.++.+ .+....-.+.|..+++.++|..|+.--.++..+ . .... T Consensus 79 ~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t----------~-tg~~---- 141 (295) T protein:vir:99 79 DKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK----------V-KGVG---- 141 (295) T ss_pred eeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee----------e-ehhh---- Confidence 4777778888754 99998654 356778889999999999999888632211000 0 0000 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) ....... .+..++.++ -....+.++++|| T Consensus 142 lq~a~a~-------------------------------------------~~~al~~f~--------Ee~~~~~V~FVnP 170 (295) T protein:vir:99 142 LQKALSA-------------------------------------------SWAKLATFN--------EFEGSPLVSFVSP 170 (295) T ss_pred HHHHHHH-------------------------------------------hhhhhhhcc--------cccCCceEEEEeh Confidence 0000000 000000000 0112355899999 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccccccccccc-eeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccch Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP-VVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~p-vv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) .+.+.+++-..-+ |+.. +..|...- -.++|.- |+.|..+|.|+++.---....+.-.+-.+-.+ . .. T Consensus 171 ~D~a~yl~~A~~~----~~~a--~~fG~~~L--~nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l--~--~~ 238 (295) T protein:vir:99 171 LDVANYLGDTKVG----ADAS--NVFGMTLL--KNFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDL--G--GL 238 (295) T ss_pred HHHHHHHhccccc----cchh--hhhhhhhh--hhhhccceEEEcccCCCceEEEeeccceEEEEecCCchhh--h--hh Confidence 9999987543221 2211 00111110 1378996 99999999998664222221110000000000 0 01 Q ss_pred hhhhcCceEEEEEeee-------------c---cEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 460 TDFVDGKVTVRAEERL-------------G---LLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 460 ~~f~~~~v~~r~~~r~-------------~---~~v~~~~Af~~~~~~~~a~~~ 497 (497) -+|..|.+++.+..+. . +-+-+++++++.++++.+.+- T Consensus 239 f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~ 292 (295) T protein:vir:99 239 FADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPG 292 (295) T ss_pred hhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCC Confidence 1233444444433321 1 223467799999996655444 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.80 E-value=3.2e-10 Score=72.62 Aligned_cols=293 Identities=13% Similarity=0.042 Sum_probs=148.9 Q ss_pred HHHHhhhhhhhhhhhhhhcccccCC-cccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccc Q lcl|Aclame:pro 136 GAFADGETAPAAIGQNPFGSTGTFA-PGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEA 213 (497) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg 213 (497) --+......++..+....+.+++.- .+....|..+++....+.+.++++.+..++. +.++.+|+.. ..++.....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig--~~~~~~~~~g 78 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAGYHTPG 78 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEecc--ceeEeeecCC Confidence 0000111111110000011111111 2445788889999999999999999988775 5678999864 3456666666 Q ss_pred ccc-ccccccceeEEeeeee--eeeechhhHHHHhh--H-HHHHHHHHHHHHHHHHHHHHhhhhc----cCCCcccccee Q lcl|Aclame:pro 214 GTY-PFSSEEFARVYEQVGK--VANALTITDEGLRD--A-PELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLL 283 (497) Q Consensus 214 ~~~-~~s~~~~~~v~~~~~k--ia~~~~iS~ell~d--s-~~l~~~i~~~la~~~~~~~d~a~l~----G~g~~~~~Gil 283 (497) ..+ +..+++-+++++..-+ +..+ .|. . +++ + .++.+.+.++.++++++.+|+.++. +.....|.+.. T Consensus 79 ~~l~~~~~~~~~~~~l~ID~~ky~~~-~Vd-d-iD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~ 155 (332) T protein:vir:78 79 TPIVGDAGIKANEKTLVMDDLLVSSQ-FVY-S-LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) T ss_pred CCCCCCCCCCCceEEEEEehhhhhHH-HHH-h-HHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccc Confidence 654 3334565666655443 3222 222 1 222 2 3688999999999999999987752 11111111000 Q ss_pred ccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhh Q lcl|Aclame:pro 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV 363 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (497) ........+. ....+.....+.++.+.. T Consensus 156 ~g~~~~~~~~----------------------------------------------------~~~~~~~~~~~~i~~a~~ 183 (332) T protein:vir:78 156 PGGFHVNIGA----------------------------------------------------GNTNDAQAIVDGFFEAAA 183 (332) T ss_pred ccccccccCC----------------------------------------------------ccccCHHHHHHHHHHHHH Confidence 0000000000 000011223344455554 Q ss_pred hhhhhhccCCc-eEEEehhHHHHHHHHhcccCcccccccccccccccccc--cccccccceeecCCCCcCc--------- Q lcl|Aclame:pro 364 DIQLTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG--GKNIWGVPVVTTPLIPLGT--------- 431 (497) Q Consensus 364 ~~~~~~~~~~~-~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~--~~~l~G~pvv~s~~~~~~~--------- 431 (497) .+....-...+ .++++|..|..|.+.+|.. .+-. ...+..+...++ -..+.|++|+.|+++|... T Consensus 184 ~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~--~~n~-~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~ 260 (332) T protein:vir:78 184 VLDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNR-EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAV 260 (332) T ss_pred HHhhcCCCccCCEEEeCHHHHHHHHhhcCce--eeee-eccccccceecceeeeEEeeeEEEecCccccCcccccccccc Confidence 44444333333 3556999998886644421 1100 011111111222 2578999999999998432 Q ss_pred -----EEEeeccceEEEEEecc--------ccEEEEec--cchhhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 432 -----ILVGHFAPSVIQTARRE--------GVTMQMTN--SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 432 -----~~~gd~~~~~~~i~~r~--------~~~i~~~~--~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .+-|||+...-.++-+. +++|+++. .....|. -.+++...++..++||++++.|.-. T Consensus 261 ~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~---d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 261 TGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG---DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred cccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhH---hhhhhhhhhcCceecccceEEEeeC Confidence 24455555333333332 23343322 1222232 3577777899999999999998744 No 139 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.79 E-value=2e-09 Score=68.23 Aligned_cols=301 Identities=15% Similarity=0.080 Sum_probs=162.6 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCccccc---chhhHHHHHHHhhhhHHhhc Q lcl|Aclame:pro 109 EKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILP---TFLPGIVEQLFYELSLADLI 185 (497) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~---~~~~~ii~~~~~~~~l~~~~ 185 (497) -+..+.+. .+........ .... ...-...+.|.+... .+.+.+++...+....+.++ T Consensus 1 ~~~~~~~~---------------~~~~~~~~~~--~~~~---~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i 60 (319) T protein:vir:10 1 MTTKKFDE---------------ADKSNVEMYL--IQAG---VKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVF 60 (319) T ss_pred CCCcchhH---------------HhhHHHHHHH--hhcc---chhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhc Confidence 00000000 0000000000 0000 000111122333332 23456777777777777777 Q ss_pred cceecCC---CceEEEEeecCCccceeecccc-ccccccccceeEEeeeeeeeeechhhHHHHhhH----HHHHHHHHHH Q lcl|Aclame:pro 186 SSRPVTS---PNLSYLTESAAHNNAAAVAEAG-TYPFSSEEFARVYEQVGKVANALTITDEGLRDA----PELFNFVQGR 257 (497) Q Consensus 186 ~~~~~~~---~~~~~p~~~~~~~~a~~v~Eg~-~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds----~~l~~~i~~~ 257 (497) ++.+..+ .++.|..... .+.+.|++.+. .+|..+..++......+.++..+.+|..=|+.+ .++..--... T Consensus 61 ~v~~~~~~~~~~~~~~~~~~-~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~a 139 (319) T protein:vir:10 61 PVTTELSPTDKTFEYMTFDK-VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASA 139 (319) T ss_pred ccccCCCCceEEEEeeeecc-ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHH Confidence 7653322 3566666655 47788998764 479999999999999999999998886544433 2477777888 Q ss_pred HHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccc Q lcl|Aclame:pro 258 LLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGA 337 (497) Q Consensus 258 la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (497) .++++...+|+.+++|+...+..|++|.++....+.+.... T Consensus 140 A~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~--------------------------------------- 180 (319) T protein:vir:10 140 CQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWID--------------------------------------- 180 (319) T ss_pred HHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCCC--------------------------------------- Confidence 89999999999999999888889999987654332221100 Q ss_pred ccccccccccccchhhhhhhhHHhhhhhhh--hhccCCceEEEehhHHHHHHHHhcccCccccccccccccccccccccc Q lcl|Aclame:pro 338 AGSGSGVAGSYPTAAEIAENVFDAFVDIQL--TLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKN 415 (497) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~ 415 (497) ....+....++++..++..+.. .+...|..++|+|..+..|.......|..++.-... .+..-+ T Consensus 181 --------~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~------~~~~l~ 246 (319) T protein:vir:10 181 --------VSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFKS------QNSGIE 246 (319) T ss_pred --------ccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHHH------hcCCce Confidence 0011223455666666655543 355678899999999999975555555443322111 111112 Q ss_pred ccccceeecCCCCcCc--EEEeeccceEEEEEeccccEEEEeccchhhhhcC--ceEEEEEeeec-cEeecccceEEEEe Q lcl|Aclame:pro 416 IWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG--KVTVRAEERLG-LLVYRPSAFQLIQL 490 (497) Q Consensus 416 l~G~pvv~s~~~~~~~--~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~--~v~~r~~~r~~-~~v~~~~Af~~~~~ 490 (497) |.+.|...+... .|+ +++..-+.-.+.+..-+.++. ... +.. ...+.+..|++ ..+++|.||++++= T Consensus 247 I~~~pel~~ag~-~g~~~~v~y~~~~~~~~~~v~~~~~~--~~~-----e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dG 318 (319) T protein:vir:10 247 IDSIAELEDIDG-AGTKGVLVYEKNPMNMSIEIPEAFNM--LPA-----QPKDLHFKVPCTSKCTGLTIYRPMTIVLITG 318 (319) T ss_pred EEEeeeecccCC-CcceEEEEEecCCceEEEecCcceee--eee-----eecCceEEEeeeeeeEEEEEEccceeEeeec Confidence 333333332211 111 233332332333322222222 111 111 23355577775 66778999999873 Q ss_pred c Q lcl|Aclame:pro 491 K 491 (497) Q Consensus 491 ~ 491 (497) - T Consensus 319 I 319 (319) T protein:vir:10 319 V 319 (319) T ss_pred C Confidence 3 No 140 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.78 E-value=7.9e-10 Score=70.48 Aligned_cols=293 Identities=13% Similarity=0.074 Sum_probs=145.8 Q ss_pred Hhhhhhhhhh-hhhhhc-ccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeeccccc Q lcl|Aclame:pro 139 ADGETAPAAI-GQNPFG-STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAGT 215 (497) Q Consensus 139 ~~~~~~~~~~-~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~~ 215 (497) .......... ++...+ .++..-.+.+..|...++......+.++++.++.++. ++++.+|+... .++.....|.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~--~t~~~~~~g~~ 78 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR--TKAAYLKPGEN 78 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc--eeeeeeccCCC Confidence 0000000000 000000 0000001233556777888888888889999887765 56789998643 45677777776 Q ss_pred ccc--ccccceeEEeee--eeeeeechhhHHH--HhhHHHHHHHHHHHHHHHHHHHHHhhhhcc--CC----------Cc Q lcl|Aclame:pro 216 YPF--SSEEFARVYEQV--GKVANALTITDEG--LRDAPELFNFVQGRLLEGIQRKEEVQLLAG--GG----------YP 277 (497) Q Consensus 216 ~~~--s~~~~~~v~~~~--~kia~~~~iS~el--l~ds~~l~~~i~~~la~~~~~~~d~a~l~G--~g----------~~ 277 (497) ++. .+++.++.++.. .++..+ .| ..+ .+...++.+.+.+..++++++..|+.++.- .+ .. T Consensus 79 l~~~~~~~~~~e~~ltID~~~~~~~-~V-ddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:15 79 LDDKRKDIKHTEKVIHIDGLLTADV-LI-YDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIE 156 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhH-Hh-hhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 644 345667755543 333332 22 111 111236888899999999999999987621 00 00 Q ss_pred cc--cceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhh Q lcl|Aclame:pro 278 GV--NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIA 355 (497) Q Consensus 278 ~~--~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (497) .| .++.......+.... ........++ T Consensus 157 ~~g~~~~~~~~~~~~~~~~---------------------------------------------------~~~~~~~~i~ 185 (347) T protein:vir:15 157 GLGKPTVLTLVKPTTGDLT---------------------------------------------------DPVELGKAII 185 (347) T ss_pred ccCccccccccccccccch---------------------------------------------------hhhhHHHHHH Confidence 00 000000000000000 0000001112 Q ss_pred hhhHHhhhhhhhhhc-cCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcE-- Q lcl|Aclame:pro 356 ENVFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI-- 432 (497) Q Consensus 356 ~~~~~~~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~-- 432 (497) +.++.+...+....- ...-..+++|..+..|.+-.+-.. ....+.. ....+.-.+++|++|+.|+++|.+.+ T Consensus 186 d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~----~d~~~~~-~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~ 260 (347) T protein:vir:15 186 AQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNA----ANYQALI-DHERGTIRNVMGFEVVEVPHLTAGGAGD 260 (347) T ss_pred HHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccc----ccccccc-cccceEEEEEeceEEEeccccccccccc Confidence 222222212211111 122346679999888854432221 1111100 01111224789999999999984321 Q ss_pred --------------------EEeeccce--------EEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccc Q lcl|Aclame:pro 433 --------------------LVGHFAPS--------VIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSA 484 (497) Q Consensus 433 --------------------~~gd~~~~--------~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~A 484 (497) .-++|+.. ++..+.-+.++++..+... +| .-.+++...++..++||++ T Consensus 261 ~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~-~~---~d~i~~~~~~G~~vlrP~~ 336 (347) T protein:vir:15 261 TREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN-YQ---ADQIIAKYAMGHGGLRPEA 336 (347) T ss_pred ccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch-hh---hhhhehhhhcCCceecccc Confidence 11222221 2223333445666665442 22 2347777788999999999 Q ss_pred eEEEEecCCCC Q lcl|Aclame:pro 485 FQLIQLKKGAT 495 (497) Q Consensus 485 f~~~~~~~~a~ 495 (497) .+.+.++..+. T Consensus 337 av~~~~~~~~~ 347 (347) T protein:vir:15 337 AGAIVLPKVSE 347 (347) T ss_pred EEEEecCCCCC Confidence 99999999999 No 141 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.78 E-value=4.6e-11 Score=77.28 Aligned_cols=284 Identities=13% Similarity=-0.016 Sum_probs=152.1 Q ss_pred hhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeeccccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~ 223 (497) +......+.+-.....-+-+......||+.+.+.++|+..++.....+ ....+.+++.- ++++|..=++..+.++.++ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~L-P~~~fR~lN~g~~~s~~tt 79 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGL-PSATWRLLNYGVQPSKSTT 79 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeecc-CCceeeecCCccCccccee Confidence 000000000000000111122356789999999999999999998864 44778888874 8899999999999999999 Q ss_pred eeEEeeeeeeeeechhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccc---hhhhh Q lcl|Aclame:pro 224 ARVYEQVGKVANALTITDEGLRDAP---ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS---SASSL 297 (497) Q Consensus 224 ~~v~~~~~kia~~~~iS~ell~ds~---~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~---~~~~~ 297 (497) .+++-..+-+++.+.|.+.+.+... ++...-.....+++...+...||||+...+|.++.....-.... .+... T Consensus 80 ~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qi 159 (328) T protein:vir:95 80 VQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNI 159 (328) T ss_pred EEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccce Confidence 9999999999999999999887543 34454556688999999999999999887777764322211100 00000 Q ss_pred hhHHHHHHH-HHhhhh---hcchhhhh-------------------------------hhhhhhhh--hhhhhhcccccc Q lcl|Aclame:pro 298 FGATSATVS-NVKFPA---DGTNGAFV-------------------------------GQDTVASL--KYGRVVTGAAGS 340 (497) Q Consensus 298 ~~~~~~~~~-~~~~~~---~~~~~~~~-------------------------------~~~~~~~~--~~~~~~~~~~~~ 340 (497) +........ .+.... ....-+++ ...|...+ +......-..+. T Consensus 160 idaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NI 239 (328) T protein:vir:95 160 IDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANI 239 (328) T ss_pred eecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecC Confidence 000000000 000000 00000000 00000000 001111111111 Q ss_pred cccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccc Q lcl|Aclame:pro 341 GSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP 420 (497) Q Consensus 341 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~p 420 (497) ..+.........+..+.++.++...+ .......+|+||......|++....-+........ ........++|+| T Consensus 240 d~~~l~~~~~~~~l~~lm~~a~~~ip-~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~-----~~g~~~t~~~gip 313 (328) T protein:vir:95 240 DVSNLSEPSSAANIAKLMVKALHRIP-NRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKE-----TEGEWWTSFRGVP 313 (328) T ss_pred cccccccccChhhHHHHHHHHHHHhc-cCCCCcceeehhHHHHHHHHHHHhcCcceeeeeec-----cCCcceeEECCeE Confidence 11111111122233444455544432 44556778999999999999874433332222211 1122334677999 Q ss_pred eeecCCCCcCcEEEe Q lcl|Aclame:pro 421 VVTTPLIPLGTILVG 435 (497) Q Consensus 421 vv~s~~~~~~~~~~g 435 (497) |..++++--....+- T Consensus 314 ir~~dai~~tE~~vv 328 (328) T protein:vir:95 314 IRETDALLETEARVV 328 (328) T ss_pred EEEEeeeecCccccC Confidence 999888753322111 No 142 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.77 E-value=1.2e-09 Score=69.41 Aligned_cols=285 Identities=11% Similarity=-0.008 Sum_probs=140.7 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee---cCCCceEEEEeecCCccceeecccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP---VTSPNLSYLTESAAHNNAAAVAEAGTYPF 218 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~ 218 (497) ..-.+..+..+. ++.....++|..|...+++.+.+.+.+.++++... ..+.++++|+.. .+++.-+.++..++. T Consensus 1 ~~~~~~~~~~~~-~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g--~~~~~d~~~~~~i~~ 77 (341) T protein:vir:94 1 MALGNTITGPSI-NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS--ELGVEDKATDVPVGV 77 (341) T ss_pred Ccchhhhccccc-cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC--cceeeeecCCCcccc Confidence 000000000000 11111235666677788888888878788776443 235679999753 355666778888877 Q ss_pred ccccceeEEeeeeee-eeechhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhccC--CCcc--ccceeccccccccc Q lcl|Aclame:pro 219 SSEEFARVYEQVGKV-ANALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG--GYPG--VNGLLQRSTGFTAS 292 (497) Q Consensus 219 s~~~~~~v~~~~~ki-a~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~a~l~G~--g~~~--~~Gil~~~~~~~~~ 292 (497) .+++-+++++...+. +.-+.|+++ ..+.+.++...+.+...+++++++|..++.-- +... +..+... ..... T Consensus 78 ~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~-~~~~t- 155 (341) T protein:vir:94 78 QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSS-NGAIT- 155 (341) T ss_pred ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCc-ccccc- Confidence 777777777777333 444566663 44555688888889999999999998876321 1111 1100000 00000 Q ss_pred hhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-c Q lcl|Aclame:pro 293 SASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-Q 371 (497) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 371 (497) ........+.++.+...+....- . T Consensus 156 -------------------------------------------------------~~~~~~~~~~i~~a~~~Lde~~VP~ 180 (341) T protein:vir:94 156 -------------------------------------------------------GNGQAFSFAVFLAARRLLLEADVPE 180 (341) T ss_pred -------------------------------------------------------CchhhhhHHHHHHHHHHHhhcCCCc Confidence 00000001112222211111111 1 Q ss_pred CCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEE----------------- Q lcl|Aclame:pro 372 TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV----------------- 434 (497) Q Consensus 372 ~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~----------------- 434 (497) ..-.++++|..+..|.+. .. ..-.. ..+....-.+.-.++.|++|+.|+++|.+.... T Consensus 181 ~gR~lvv~P~~~~~Ll~~--~~--~~~~~-~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i 255 (341) T protein:vir:94 181 EKIVLLISPGQESALFTI--PQ--FISKD-FINNAPIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGF 255 (341) T ss_pred cCCEEEeCHHHHHHHhhc--hh--hhhhh-ccccchhheeeeeeEeceEEEEeccccccccccccccccceecccccccc Confidence 223567899999998642 11 11111 111001111223479999999999998643210 Q ss_pred ----------eeccceEEEEEeccc-------------------cEEEEeccchhhhhcCceEEEEEeeeccEeecccce Q lcl|Aclame:pro 435 ----------GHFAPSVIQTARREG-------------------VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 435 ----------gd~~~~~~~i~~r~~-------------------~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af 485 (497) +|++...-.++-+.. +..+.+... ..| ...|++..=+|..++||++. T Consensus 256 ~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~i~~~~~~G~~~lrp~~~ 331 (341) T protein:vir:94 256 TGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFEN-REQ---VWLMVGRQAYGARLYRPLHA 331 (341) T ss_pred cccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchh-hhh---hhhhhhhhhhcccccCccee Confidence 111111111111111 000000000 011 22344555579999999998 Q ss_pred EEEEecCCCC Q lcl|Aclame:pro 486 QLIQLKKGAT 495 (497) Q Consensus 486 ~~~~~~~~a~ 495 (497) +.|...++.. T Consensus 332 v~~~~~~~~~ 341 (341) T protein:vir:94 332 VNIHTTGDTV 341 (341) T ss_pred EEEecCcCCC Confidence 8777555555 No 143 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.75 E-value=3.8e-10 Score=72.22 Aligned_cols=290 Identities=13% Similarity=0.073 Sum_probs=143.3 Q ss_pred HhhhhhhhhhhhhhhcccccCC---cccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeecccc Q lcl|Aclame:pro 139 ADGETAPAAIGQNPFGSTGTFA---PGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAEAG 214 (497) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~g---~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~ 214 (497) ... .......... +..+..+ .+.+..|.++++....+.+.++++.++.++. ++++.+|+.. ..++..+..|+ T Consensus 1 m~~-~~~~~~~t~~-g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG--~~tv~~~t~G~ 76 (347) T protein:vir:94 1 MAN-VPGQKIGTDQ-GKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMG--RTSGVYLAPGE 76 (347) T ss_pred CCC-CCcccccccc-ccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEeccc--ceeeeeecCCC Confidence 000 0000000000 1111111 2334678888888888888889999888875 5578898863 35667777777 Q ss_pred ccccc--cccceeEEeeeeeeeeechhhHHHHh---hH---HHHHHHHHHHHHHHHHHHHHhhhhc-----cCCC----c Q lcl|Aclame:pro 215 TYPFS--SEEFARVYEQVGKVANALTITDEGLR---DA---PELFNFVQGRLLEGIQRKEEVQLLA-----GGGY----P 277 (497) Q Consensus 215 ~~~~s--~~~~~~v~~~~~kia~~~~iS~ell~---ds---~~l~~~i~~~la~~~~~~~d~a~l~-----G~g~----~ 277 (497) ..+.+ ..+-.++++..-++ .+++.+++ +. .++.+.+.++.++++++..|+.++. .... + T Consensus 77 ~l~~~~~~~~~~e~~itID~~----~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~ 152 (347) T protein:vir:94 77 RLSDKRKGIKHTEKVITIDGL----LTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNE 152 (347) T ss_pred CcCCCCCCCCcceEEEEecch----hhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 66433 34555644443322 12222322 22 3688889999999999999987752 1111 1 Q ss_pred cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhh Q lcl|Aclame:pro 278 GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAEN 357 (497) Q Consensus 278 ~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 357 (497) .+.|+-..........+....... ......+. T Consensus 153 ~~~g~~~~s~~~~~~~~~~~~~~~------------------------------------------------~~~~~~~~ 184 (347) T protein:vir:94 153 NIAGLGTASVLEVGKKADLDTPAK------------------------------------------------LGEAIIGQ 184 (347) T ss_pred ccCCCcccceeeccccccccchhh------------------------------------------------hHHHHHHH Confidence 122221111100000000000000 00001111 Q ss_pred hHHhhhhhhhhhc-cCCceEEEehhHHHHHHHHhcccCccccccccccccccc-ccccccccccceeecCCCCcCc---- Q lcl|Aclame:pro 358 VFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP-VNGGKNIWGVPVVTTPLIPLGT---- 431 (497) Q Consensus 358 ~~~~~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~-~~~~~~l~G~pvv~s~~~~~~~---- 431 (497) ++.+...+....- ...-..+++|..+..|..-+.-+... +.. .+.. .+.-.+++|++|+.|+++|.+. T Consensus 185 i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~-~~~-----~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~ 258 (347) T protein:vir:94 185 LTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAAN-YAA-----LIDPETGNIRNVMGFVVVEVPHLVQGGAGET 258 (347) T ss_pred HHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhh-ccc-----cccccccceEEEeceEEEecCcccccccccc Confidence 2222222111111 11235678999888764332211111 111 1111 1223579999999999998421 Q ss_pred --------------E--------EEeeccceEEEEEec--------cccEEEEeccchhhhhcCceEEEEEeeeccEeec Q lcl|Aclame:pro 432 --------------I--------LVGHFAPSVIQTARR--------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) Q Consensus 432 --------------~--------~~gd~~~~~~~i~~r--------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~ 481 (497) . +-+||+...-.++-+ .+++++..+.. ..|. ..+++..-++..++| T Consensus 259 ~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~-~~~~---d~i~~~~~~G~~~~r 334 (347) T protein:vir:94 259 RGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDV-DAQG---DLIVGKYAMGHGGLR 334 (347) T ss_pred cccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhch-hhHH---HHhhhhhhhcCcccc Confidence 0 224444433233322 23345544333 2332 357888889999999 Q ss_pred ccceEEEEecCCC Q lcl|Aclame:pro 482 PSAFQLIQLKKGA 494 (497) Q Consensus 482 ~~Af~~~~~~~~a 494 (497) |++.+.|++++|- T Consensus 335 P~~a~~~~~~~A~ 347 (347) T protein:vir:94 335 PEAAGALVFSPAE 347 (347) T ss_pred cceeEEEEecCCC Confidence 9999999988555 No 144 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.75 E-value=1.8e-09 Score=68.58 Aligned_cols=290 Identities=12% Similarity=0.019 Sum_probs=156.5 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ....+..++...+.+++.-.+....|..++.....+.+.++++..+.++.+ +++++|+. + ..++....-|+....+. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G-~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-G-NVEAKGRRAGEELERSR 78 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-e-eeeeecccCCcCcCCCC Confidence 222222223333333333345568889999999999999999999998865 57888986 3 35677777777776666 Q ss_pred ccceeEEeeeeeeeeechhhHHH---HhhH---HHHHHHHHHHHHHHHHHHHHhhhh----ccCCCcccc--------ce Q lcl|Aclame:pro 221 EEFARVYEQVGKVANALTITDEG---LRDA---PELFNFVQGRLLEGIQRKEEVQLL----AGGGYPGVN--------GL 282 (497) Q Consensus 221 ~~~~~v~~~~~kia~~~~iS~el---l~ds---~~l~~~i~~~la~~~~~~~d~a~l----~G~g~~~~~--------Gi 282 (497) +..++..+..-.+- +++.+ +++. .++.+.+..++.+++++..|++++ .+.....|. |+ T Consensus 79 ~~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred ccccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 66677666655433 33333 3333 378899999999999999999764 333321111 11 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhh Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) ........... ..........++.+. T Consensus 155 ~~~~~~tg~~~------------------------------------------------------~~~~~~l~~a~~~a~ 180 (335) T protein:vir:63 155 LEKLDLTGLTA------------------------------------------------------KQAADKIVRMHRRVV 180 (335) T ss_pred ceeeeeccCcc------------------------------------------------------cccHHHHHHHHHHHH Confidence 11000000000 000111112222222 Q ss_pred hhhhhhhc----cCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc------- Q lcl|Aclame:pro 363 VDIQLTLF----QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------- 431 (497) Q Consensus 363 ~~~~~~~~----~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------- 431 (497) ..+....- ...-..+++|..|..|..-+.--.+- |+.. .+......+.-..++|+||+.|+++|.+. T Consensus 181 ~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~-~~~s-~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg 258 (335) T protein:vir:63 181 ETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVE-YQAT-GATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLG 258 (335) T ss_pred HHHHhccCCCcccCceEEEeChHHHHHHhccccccccc-cccc-cccccccCceeEEeeceEEEeeccCCCCCccccccc Confidence 22221111 12246888999999887642211110 0000 00111223344578999999999998532 Q ss_pred ----EEEeeccceEEEEEecc--------ccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 432 ----ILVGHFAPSVIQTARRE--------GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 ----~~~gd~~~~~~~i~~r~--------~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .+-|||......++-+. .++.++..+.. .|.. .+.+..-++..++||++.+.++++..-.=| T Consensus 259 ~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~-~~~~---~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~ 332 (335) T protein:vir:63 259 RHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-KFSW---VLDTFQMYNIGARRPDTAGAIELKGIGAFD 332 (335) T ss_pred ccCCccccccceeEEEEEecceEEEEEEeecccceeeccc-hhhH---HhHHHHHcCCcccccceEEEEEEcCCCcee Confidence 34456655444444332 23333333222 2322 233334489999999999999975433222 No 145 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.74 E-value=2.6e-09 Score=67.63 Aligned_cols=310 Identities=14% Similarity=0.083 Sum_probs=149.0 Q ss_pred HHHHHhhh-hhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecC-CCceEEEEeecCCccceeecc Q lcl|Aclame:pro 135 MGAFADGE-TAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVT-SPNLSYLTESAAHNNAAAVAE 212 (497) Q Consensus 135 ~~~~~~~~-~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~~a~~v~E 212 (497) .....-.+ ...+.-+....+.+++.-.+....|..++.....+.+.++++.++.++. ++++++|+.. ..++....- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG--~~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTG--RMTSSFHTP 78 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeee--eeEEeeecC Confidence 00000000 0000000000111111112344677888888888888999999998886 5578888863 345555555 Q ss_pred cccc---cccccccee--EEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc----cCCCcccccee Q lcl|Aclame:pro 213 AGTY---PFSSEEFAR--VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLL 283 (497) Q Consensus 213 g~~~---~~s~~~~~~--v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~----G~g~~~~~Gil 283 (497) |+.+ +..++...+ +++.-.++..+..-.-+=.+...++.+.+.++.++++++..|+.++. +.....|.+.- T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~ 158 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSAT 158 (375) T ss_pred CcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 5554 233444444 44444443332211111111223788999999999999999987752 22211111110 Q ss_pred c----cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhH Q lcl|Aclame:pro 284 Q----RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVF 359 (497) Q Consensus 284 ~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 359 (497) + ........... ......+.....+.++ T Consensus 159 ~~~~~Gg~~i~~~sg~------------------------------------------------~~~~~~ta~~~~~ai~ 190 (375) T protein:vir:10 159 NFVEPGGTQIRVGSGT------------------------------------------------NESDAFTASALVNAFY 190 (375) T ss_pred cccccCcceeeecccc------------------------------------------------ccccccCHHHHHHHHH Confidence 0 00000000000 0000000111122222 Q ss_pred Hhhhhhhhhhcc-CCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc------- Q lcl|Aclame:pro 360 DAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------- 431 (497) Q Consensus 360 ~~~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------- 431 (497) .+...+....-. ..-..+++|..|..|.+-+|.+ .+.............+.-..+.|++|+.|+.+|... T Consensus 191 ~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~--~~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g 268 (375) T protein:vir:10 191 DAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSN--GLVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYG 268 (375) T ss_pred HHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCcc--ceeeecccccceeccceEEEEeceEEEEecccccccccccccc Confidence 222222221111 1335678999998886655533 121111111111112223478999999999998321 Q ss_pred ------------------------------EEEeec---cceEEEEE--------eccccEEEEeccchhhhhcCceEEE Q lcl|Aclame:pro 432 ------------------------------ILVGHF---APSVIQTA--------RREGVTMQMTNSNGTDFVDGKVTVR 470 (497) Q Consensus 432 ------------------------------~~~gd~---~~~~~~i~--------~r~~~~i~~~~~~~~~f~~~~v~~r 470 (497) .|-+|| ++.+-.++ .-.++++++++...+ -.+-...|. T Consensus 269 ~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~-~~~q~~~i~ 347 (375) T protein:vir:10 269 GTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVS-VIYQGDVIL 347 (375) T ss_pred ccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeeccccccccchhh-heeeeeeee Confidence 233455 32222222 334556665531111 112234467 Q ss_pred EEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 471 AEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 471 ~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +..-+|..+.||++.+.|...+++.+- T Consensus 348 ~~~a~G~~~lrp~~av~l~~~~~~~~~ 374 (375) T protein:vir:10 348 GRMAMGADYLNPAAAVELYIGATAPSA 374 (375) T ss_pred eeeeeccCccCceeEEEEecCcCcccc Confidence 777789999999999998877666666 No 146 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.74 E-value=3.3e-09 Score=67.05 Aligned_cols=290 Identities=12% Similarity=0.008 Sum_probs=156.4 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ...++..++...+.+++.-.+....|..++.....+.+.++++..+.++.+ +++.+|+. + ..++.+..-|+....+. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G-~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-G-NVEAKGRRAGEELERSR 78 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-e-eeeecccccCcccCCCC Confidence 222222233333333333345668889999999999999999999998864 57899975 3 34567777777766666 Q ss_pred ccceeEEeeeeeeeeechhhHHHH---hhH---HHHHHHHHHHHHHHHHHHHHhhhh----ccCCCcccc--------ce Q lcl|Aclame:pro 221 EEFARVYEQVGKVANALTITDEGL---RDA---PELFNFVQGRLLEGIQRKEEVQLL----AGGGYPGVN--------GL 282 (497) Q Consensus 221 ~~~~~v~~~~~kia~~~~iS~ell---~ds---~~l~~~i~~~la~~~~~~~d~a~l----~G~g~~~~~--------Gi 282 (497) +..++..+..-.+- +++.++ ++. -++.+.+...+.+++++..|++++ .+.....|. |+ T Consensus 79 ~~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 79 VVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred cccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 66677666554432 333333 333 368899999999999999999765 232221111 11 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhh Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) .....+.... .........+.++.+. T Consensus 155 ~~~~~~tg~~------------------------------------------------------~~~~~~~l~~a~~~a~ 180 (335) T protein:vir:78 155 LEKLDLTGLT------------------------------------------------------AKEAAEKIVRMHRRVV 180 (335) T ss_pred ceeeeecccc------------------------------------------------------ccccHHHHHHHHHHHH Confidence 1100000000 0000111112222222 Q ss_pred hhhhhhh----ccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc------- Q lcl|Aclame:pro 363 VDIQLTL----FQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT------- 431 (497) Q Consensus 363 ~~~~~~~----~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~------- 431 (497) ..+.... ....-+.+++|..|..|..-+.--.+- |.... +..+...+.-..++|+||+.|+++|.+. T Consensus 181 ~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~-~~~s~-~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg 258 (335) T protein:vir:78 181 ETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVE-YQATG-ATNDYVKSRVAILNGVKVLETPRFATKAISAHPLG 258 (335) T ss_pred HHHHhccCCCCCCCccEEEeChHHHHHHhccccccccc-ccccc-cccccccceeEEeeceEEEeeccCCCCCCcccccc Confidence 1111111 111236889999999987643211110 00000 0011122344578999999999999542 Q ss_pred ----EEEeeccceEEEEEecc--------ccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 432 ----ILVGHFAPSVIQTARRE--------GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 ----~~~gd~~~~~~~i~~r~--------~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) .+=+||....-.++-+. +++.++.++.. .|.. .+.+..-++..++||++.+.++++..-.=| T Consensus 259 ~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~~~---~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~ 332 (335) T protein:vir:78 259 RHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-QFSW---VLDTFQMYNIGARRPDTAGAIELKGIEAFD 332 (335) T ss_pred ccCCcccccccceEEEEEecceEEEEEEEecccceeeccc-hhhH---hhhHHHHcCCcccCcceEEEEEecCCCccc Confidence 22345544333344333 23334333322 2322 233334489999999999999977654444 No 147 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.71 E-value=2e-09 Score=68.22 Aligned_cols=298 Identities=11% Similarity=0.082 Sum_probs=148.4 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ....+..++...+.+++.-.+....+..++.....+.+.++++..++++.+ +++++|+. + ...+.+..-|+..--+. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G-~s~a~y~~pG~~ldg~~ 78 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPAATS 78 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-e-eeEEeeecCCCCcCCCC Confidence 111122222222333333345567788888899999999999999999865 46888886 3 35677777777765555 Q ss_pred ccceeEEeeeeeeee-echhhHHHHhhH---HH-HHHHHHHHHHHHHHHHHHhhhhc----c----CC--Cccccceecc Q lcl|Aclame:pro 221 EEFARVYEQVGKVAN-ALTITDEGLRDA---PE-LFNFVQGRLLEGIQRKEEVQLLA----G----GG--YPGVNGLLQR 285 (497) Q Consensus 221 ~~~~~v~~~~~kia~-~~~iS~ell~ds---~~-l~~~i~~~la~~~~~~~d~a~l~----G----~g--~~~~~Gil~~ 285 (497) +..++..+..-.+-. ...|-. |+|. -+ +.+.+...+.+++++..|+.++. + +. .+.|.|+... T Consensus 79 ~~~dk~~ItIDtLL~a~~~V~d--lDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 79 TQADKNQLVIDATVIARNTVAH--LHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred cccCcEEEEeCceeeecchhhh--HHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 666666665433221 112211 2222 24 67888999999999999987652 1 11 1122333222 Q ss_pred ccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhh Q lcl|Aclame:pro 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) ............. .........++.+...+ T Consensus 157 ~s~~v~~~~~~~~--------------------------------------------------~~~~~l~~A~~~A~~~L 186 (400) T protein:vir:10 157 FSVNVEVNEGEAL--------------------------------------------------VNPQYVMAAVEFALEQQ 186 (400) T ss_pred cceeecccccccc--------------------------------------------------cCHHHHHHHHHHHHHHH Confidence 1111000000000 00001111122222221 Q ss_pred hhhhccCCceEEEehhHHHHHHHHhcccCccccccccc-cccccc-ccccccccccceeecCCCCcCc------------ Q lcl|Aclame:pro 366 QLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNP-VNGGKNIWGVPVVTTPLIPLGT------------ 431 (497) Q Consensus 366 ~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~-~~~~~~l~G~pvv~s~~~~~~~------------ 431 (497) ....-......+++|-.|+.+.+-.| +++.-.++ +..+.+ .+.-.+++|+||+.|+.+|.+. T Consensus 187 dEkdVP~~d~vvl~pp~~Ys~Ll~~d----kLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~ 262 (400) T protein:vir:10 187 LEQEVDISDVAILMPWRYFNVLRDAD----RIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNED 262 (400) T ss_pred HhcCCCccceEEEcCHHHHHHHHhCC----cccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCC Confidence 11111111234444444443332222 22222221 111111 1222478999999999998531 Q ss_pred ---E--EEeeccceEEEEEeccccE-EEEeccchhhhhc---CceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 432 ---I--LVGHFAPSVIQTARREGVT-MQMTNSNGTDFVD---GKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 432 ---~--~~gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~~---~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) . +-|||+...-.+|.+..+- ++.-+-..+.|.. -...+-+..-++..++||++.+.++.+-.++.- T Consensus 263 ~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~ 337 (400) T protein:vir:10 263 NGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGA 337 (400) T ss_pred CCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhCCcccchhheEEEEecCCcccc Confidence 1 2378887666565554222 2221111111111 111223334479999999999999988776655 No 148 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.66 E-value=1.1e-08 Score=64.22 Aligned_cols=283 Identities=16% Similarity=0.097 Sum_probs=161.3 Q ss_pred hcccccCCccccc---chhhHHHHHHHhhhhHHhhccceecC---CCceEEEEeecCCccceeecccc-cccccccccee Q lcl|Aclame:pro 153 FGSTGTFAPGILP---TFLPGIVEQLFYELSLADLISSRPVT---SPNLSYLTESAAHNNAAAVAEAG-TYPFSSEEFAR 225 (497) Q Consensus 153 ~~~~~~~g~~i~~---~~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~p~~~~~~~~a~~v~Eg~-~~~~s~~~~~~ 225 (497) .. +.+.|.+... .+.+.+++.+.+....++++++.... ...+.|..... .+.+.|++.++ .+|..+..++. T Consensus 1 ~~-~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~-~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQ-GKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTR-SGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CC-ccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeecc-ceeEEEecCccccccccccccee Confidence 11 1222333332 34567888888888888877665332 23456665544 46788998865 47999999999 Q ss_pred EEeeeeeeeeechhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds----~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) .....+.++.-+.++..=|+.+ .++..--....+++++..+|+.+++|+..-+..|++|.++......... T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~----- 153 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTT----- 153 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCc----- Confidence 9999999999888887544433 2577777888899999999999999998888999999876533221110 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh--hccCCceEEEe Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT--LFQTPNAVVMN 379 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~n 379 (497) +..........+...+++++..++.++... +...|..++|+ T Consensus 154 -------------------------------------~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~ 196 (301) T protein:vir:80 154 -------------------------------------GVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLP 196 (301) T ss_pred -------------------------------------ccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEec Confidence 000111112223445667777777666443 45577889999 Q ss_pred hhHHHHHHHHh--cccCcccccccccccccccccccccccccceeecCCCCcCc-EEEeeccceEEEEEeccccEEEEec Q lcl|Aclame:pro 380 PRDWELLRLTK--DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-ILVGHFAPSVIQTARREGVTMQMTN 456 (497) Q Consensus 380 ~~~~~~l~~lk--d~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-~~~gd~~~~~~~i~~r~~~~i~~~~ 456 (497) |..+..|.... +..|.-++.-... ....-.|.+.|...+....-.+ +++..-+.-.+.+..-+.++. .. T Consensus 197 p~~~~~L~~~~~~~~~~~tvl~~l~~------~~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~--~~ 268 (301) T protein:vir:80 197 PKQFELINKKRYSNEDSRSVLKVLQD------NAWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITR--HP 268 (301) T ss_pred HHHHHhhhhccccCCCCeeHHHHHHH------HcCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCceee--ec Confidence 99999997543 3344333322110 0111123333333322111001 111111111122221122221 11 Q ss_pred cchhhhhcCc-eEEEEEeee-ccEeecccceEEEEec Q lcl|Aclame:pro 457 SNGTDFVDGK-VTVRAEERL-GLLVYRPSAFQLIQLK 491 (497) Q Consensus 457 ~~~~~f~~~~-v~~r~~~r~-~~~v~~~~Af~~~~~~ 491 (497) . -.++. ..+-+..|+ +..+++|.||++++=- T Consensus 269 ~----e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 269 E----EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred c----eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 1 11332 223456777 5678899999998733 No 149 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.65 E-value=2.1e-09 Score=68.11 Aligned_cols=292 Identities=12% Similarity=0.055 Sum_probs=147.2 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ....+..++...+.+++.-.+....+..++.....+.+.++++..+.++.+ +++++|+. + ..++.+..-|+..--+. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G-~~~a~y~~~G~~ldg~~ 78 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPNATP 78 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-e-eeEEeeeccccccCCCC Confidence 112222222223333333334457788888888889999999999988865 57899986 3 24455555555544445 Q ss_pred ccceeEEeeeeeeeeechhhHHHH---hh---HHH-HHHHHHHHHHHHHHHHHHhhhhc---cCC------C-cccccee Q lcl|Aclame:pro 221 EEFARVYEQVGKVANALTITDEGL---RD---APE-LFNFVQGRLLEGIQRKEEVQLLA---GGG------Y-PGVNGLL 283 (497) Q Consensus 221 ~~~~~v~~~~~kia~~~~iS~ell---~d---s~~-l~~~i~~~la~~~~~~~d~a~l~---G~g------~-~~~~Gil 283 (497) +..++..+..-.+- +++..+ ++ ..+ +.+.+..++.+++++..|+.++. ..+ . ..|.+.- T Consensus 79 ~~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 79 TQADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred cccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 66666655544332 222222 22 234 56888899999999999997642 101 0 0111111 Q ss_pred ccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhh Q lcl|Aclame:pro 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFV 363 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (497) ......+. ..............+.++.+.. T Consensus 155 ~g~s~~~~--------------------------------------------------~t~~~a~~~~~~l~~ai~~a~~ 184 (402) T protein:vir:97 155 HGFSINVN--------------------------------------------------VTESEALANPQYVMAAVEYALE 184 (402) T ss_pred cccccccc--------------------------------------------------cccchhhcCHHHHHHHHHHHHH Confidence 00000000 0000000011112222333322 Q ss_pred hhhh-hhccCCceEEEehhHHHHHHHHhcccCccccccccc-cccc-ccccccccccccceeecCCCCcC---------- Q lcl|Aclame:pro 364 DIQL-TLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYG-NPVNGGKNIWGVPVVTTPLIPLG---------- 430 (497) Q Consensus 364 ~~~~-~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~-~~~~~~~~l~G~pvv~s~~~~~~---------- 430 (497) .+.. ..-...-+.+++|..|..|.+-.. +..-.+. .+.+ ...+.-..++|+||+.|+++|.+ T Consensus 185 ~LdEkdVP~~dRv~vv~P~~y~~Ll~~~r-----l~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls 259 (402) T protein:vir:97 185 QQLEQEVDISDVAIMMPWKFFNALRDADR-----IVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLS 259 (402) T ss_pred HHHhcCCCccccEEEeChHHHHHHhhccc-----ccchhhccccCCccccceeEEEeceEEEecCccccccccccccccc Confidence 2211 111123467889999988875321 1111110 0111 11223357999999999999852 Q ss_pred -----cE--EEeeccceEEEEEecccc-EEEEeccch------hhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 431 -----TI--LVGHFAPSVIQTARREGV-TMQMTNSNG------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 431 -----~~--~~gd~~~~~~~i~~r~~~-~i~~~~~~~------~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) .. +-||++.....++.+..+ +++.-+-.. ..|..=++++ +-++..++||++...+.++-.+|. T Consensus 260 ~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~---~a~G~g~~RPeaa~vv~~~~~~t~ 336 (402) T protein:vir:97 260 NEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTF---MAEGAIPDRWEAVSVVTTKRDATT 336 (402) T ss_pred cCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHH---HHhCCcccCccceEEEEEeccccc Confidence 11 227777665555544322 122211111 1121112333 337899999999999988875444 Q ss_pred C Q lcl|Aclame:pro 497 S 497 (497) Q Consensus 497 ~ 497 (497) - T Consensus 337 ~ 337 (402) T protein:vir:97 337 G 337 (402) T ss_pred c Confidence 3 No 150 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.62 E-value=6.9e-09 Score=65.34 Aligned_cols=273 Identities=14% Similarity=0.086 Sum_probs=133.2 Q ss_pred hhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCC-ce-EEEEeecCCccceeecccccccc Q lcl|Aclame:pro 141 GETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NL-SYLTESAAHNNAAAVAEAGTYPF 218 (497) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~-~~p~~~~~~~~a~~v~Eg~~~~~ 218 (497) ....+.. .....+.+.+-+....-++...+-..+.+-.-++...+..||..+ .+ .||.++- .+.+.-|+||+.+|. T Consensus 1 ~~~~~~~-~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y-~gda~dVaEGe~Ipl 78 (296) T protein:vir:98 1 MVTSRTY-PEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDV-TLAEGNVPEGEVIPL 78 (296) T ss_pred CCCcccc-CcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceee-eeccccccCCcccch Confidence 0000000 000111111111111122333322222222223333367788655 46 5565444 466889999999999 Q ss_pred cccccee---EEeeeeeeeeechhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccch Q lcl|Aclame:pro 219 SSEEFAR---VYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) Q Consensus 219 s~~~~~~---v~~~~~kia~~~~iS~ell~ds--~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~ 293 (497) ++.+... .++..+|++.-+ |.|.++.+ .+..+.-.+.|..+++.++|..|+.--.++ +..+... T Consensus 79 skvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~Lkta---------T~t~~~t 147 (296) T protein:vir:98 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG---------TGTQDAL 147 (296) T ss_pred hhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcc---------cceeeec Confidence 9988764 777778888774 99998654 356778888999999999999887532111 0000000 Q ss_pred hhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCC Q lcl|Aclame:pro 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) ....... +.+. +.. +.+.+.. ....+ T Consensus 148 ~~~lQ~A----la~~----------------~~~---------------------------l~~~fed-------ed~~~ 173 (296) T protein:vir:98 148 GAGLQGA----LASA----------------WGK---------------------------LQVLFED-------YGSER 173 (296) T ss_pred hhhHHHH----HHHH----------------hhh---------------------------hhhhccc-------cCCCc Confidence 0000000 0000 000 0000000 01135 Q ss_pred ceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEE Q lcl|Aclame:pro 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQ 453 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~ 453 (497) .+.++||.+.+.++ ++++ +-.....+. .. --.++|.-++.|..+|.|+++.---....+.-.+-.+- + T Consensus 174 ~V~FVnP~D~a~yl--g~a~---it~qt~fG~---ty--l~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~--~ 241 (296) T protein:vir:98 174 AIVFANSLDVAEYI--AKAG---ITTQTAFGL---TY--LVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNS--E 241 (296) T ss_pred eEEEEehHHHHHHh--cCCc---cchhheech---hh--hhhccccEEEEcCcCCCceEEEeeecceEEEeeccccc--c Confidence 68899999998865 3432 211111111 00 01277889999999999987643333221111110000 0 Q ss_pred EeccchhhhhcCceEEEEEeee-------------c---cEeecccceEEEEecCCC Q lcl|Aclame:pro 454 MTNSNGTDFVDGKVTVRAEERL-------------G---LLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 454 ~~~~~~~~f~~~~v~~r~~~r~-------------~---~~v~~~~Af~~~~~~~~a 494 (497) ... .-.|..|.+++.+..+. . +-+-+++++++.+++++. T Consensus 242 l~~--~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 242 LAK--EFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred hhh--hhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 110 01133344444443321 1 123467799999998777 No 151 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.61 E-value=2e-09 Score=68.25 Aligned_cols=306 Identities=11% Similarity=0.093 Sum_probs=146.3 Q ss_pred hhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC-CceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 142 ETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS-PNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 142 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ....+..++...+.+++.-.+....+..++.....+.+.++++..++++.+ +++++|+. + ..++....-|+...-+. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G-~s~~~~~~pG~~ld~~~ 78 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPAATS 78 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-e-eeEeeeecCCCCcCCCC Confidence 111111122222233333345567788888888999999999999999865 56888986 3 34566666666655555 Q ss_pred ccceeEEeeeeeeeeechhhHHH---HhhH---HH-HHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccch Q lcl|Aclame:pro 221 EEFARVYEQVGKVANALTITDEG---LRDA---PE-LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) Q Consensus 221 ~~~~~v~~~~~kia~~~~iS~el---l~ds---~~-l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~ 293 (497) +..++..+..-.+- +++-+ |++. -+ +.+.+...+.+++++..|+.++.-- ...|+-+.......+. T Consensus 79 ~~~dK~~ItID~lL----~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i---~~aa~ana~~~~~~p~ 151 (401) T protein:vir:70 79 TQADKNQLVIDATV----IARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQM---MLGGIANTQAKRTNPR 151 (401) T ss_pred cccccEEEEeCcee----ehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHH---HHhccccccccccCCC Confidence 66666655443321 12221 2232 24 5778899999999999998663200 0001100000000000 Q ss_pred hhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCC Q lcl|Aclame:pro 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) . ........................+.++.+...+....-... T Consensus 152 ~-------------------------------------~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~ 194 (401) T protein:vir:70 152 V-------------------------------------KGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS 194 (401) T ss_pred c-------------------------------------CCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc Confidence 0 000000000000000000111233334444433332222222 Q ss_pred ceEEEehhHHHHHHHHhcccCccccccccc-cccccc-ccccccccccceeecCCCCcC---------------cEE--E Q lcl|Aclame:pro 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNP-VNGGKNIWGVPVVTTPLIPLG---------------TIL--V 434 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~-~~~~~~l~G~pvv~s~~~~~~---------------~~~--~ 434 (497) -..+++|-.|+.+.+-+| +++.-.+. ...+.. .+.-.+++|+||+.|+++|.+ ..+ - T Consensus 195 r~vvl~pp~~Ys~Ll~~d----~L~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~ 270 (401) T protein:vir:70 195 DVAILMPWRYFNVLRDAD----RIVDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPL 270 (401) T ss_pred ceEEEcCHHHHHHHHhcC----cccchhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCC Confidence 355566666665544333 22222211 111111 122247899999999999853 222 2 Q ss_pred eeccceEEEEEeccccE-EEEeccchhhhhc---CceEEEEEeeeccEeecccceEEEEecCCCC-----CC Q lcl|Aclame:pro 435 GHFAPSVIQTARREGVT-MQMTNSNGTDFVD---GKVTVRAEERLGLLVYRPSAFQLIQLKKGAT-----GS 497 (497) Q Consensus 435 gd~~~~~~~i~~r~~~~-i~~~~~~~~~f~~---~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~-----~~ 497 (497) |||+...-.+|.+..+- ++.-+-..+.|.. -...+-+..-++..++||++.+.++.+-..+ |. T Consensus 271 ~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~ 342 (401) T protein:vir:70 271 PAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGT 342 (401) T ss_pred ccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCcccccccccC Confidence 77777655555544222 2221111111111 0111223444799999999999986654421 22 No 152 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.58 E-value=4.7e-08 Score=60.77 Aligned_cols=271 Identities=10% Similarity=0.028 Sum_probs=145.5 Q ss_pred hhhcccccCCcccccchhhHHH-HHHHhhhhHHh---------hccce--ecCCCceEEEEeecCCccceeecccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSR--PVTSPNLSYLTESAAHNNAAAVAEAGTYPF 218 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii-~~~~~~~~l~~---------~~~~~--~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~ 218 (497) ++ ++..+..+.|+....++ ....+.+.+.+ +.... ..++..+++|....-++.+.-+.|+..++. T Consensus 1 MA---~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~ 77 (324) T protein:vir:59 1 MA---YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVP 77 (324) T ss_pred CC---ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccch Confidence 22 12234566666555544 33333433422 22222 235667899988765567788899999999 Q ss_pred ccccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhh Q lcl|Aclame:pro 219 SSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSL 297 (497) Q Consensus 219 s~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~ 297 (497) .+.+.++.....++.+.-..++++...-+ .+....|.+.++..+.+..+..+|.- ..|++.......... T Consensus 78 ~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~-----l~g~~~~~~~~~~~~---- 148 (324) T protein:vir:59 78 QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAE-----LAGVFSNDDMKDNKL---- 148 (324) T ss_pred hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhhcccccccee---- Confidence 99988888888888888778887654433 36677799999999999888877631 112111100000000 Q ss_pred hhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEE Q lcl|Aclame:pro 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVV 377 (497) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (497) .. .. ........+.+.++....-.. ...-.+|+ T Consensus 149 -----------------------------------------dv----sa-~~~~~~s~~~l~~A~~~~GD~-~~~~~~iv 181 (324) T protein:vir:59 149 -----------------------------------------DI----SG-TADGIYSAETFVDASYKLGDH-ESLLTAIG 181 (324) T ss_pred -----------------------------------------ee----ec-cccceecHHHHHHHHHHhCCc-ccCcEEEE Confidence 00 00 000000112222222221111 22456899 Q ss_pred EehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC-------cEEEeeccceEEEEEe-ccc Q lcl|Aclame:pro 378 MNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-------TILVGHFAPSVIQTAR-REG 449 (497) Q Consensus 378 ~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~-------~~~~gd~~~~~~~i~~-r~~ 449 (497) ||+.++..|++..--+ |+... . ....-++++|+||++++.||.. .+...-|..+++.+.. +.. T Consensus 182 mhS~v~~~L~~~~li~--~~~~s----~---~~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~ 252 (324) T protein:vir:59 182 MHSATMASAVKQDLIE--FVKDS----Q---SGIRFPTYMNKRVIVDDSMPVETLEDGTKVFTSYLFGAGALGYAEGQPE 252 (324) T ss_pred EchHHHHHHHHhhhhh--hcccc----c---cCceeeeecccEEEEeCCCCccccCCCCceEEEEEEecCeEEEeecCCC Confidence 9999999999763221 11111 0 0123467899999999999842 1111222334444443 344 Q ss_pred cEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 450 VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 450 ~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.++.++.. ..++..+....++. +||..+..-.-+ .++.| T Consensus 253 v~vE~dRd~----~~g~~~l~~r~~~~---~~p~G~s~~~~~-~~~~s 292 (324) T protein:vir:59 253 VPTETARNA----LGSQDILINRKHFV---LHPRGVKFTENA-MAGTT 292 (324) T ss_pred cceecccCc----cccceEEEEeeEEE---eEeeeEEecccc-cCCCC Confidence 566666554 25667777777765 445554443211 11222 No 153 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.56 E-value=1.2e-09 Score=69.57 Aligned_cols=282 Identities=14% Similarity=0.017 Sum_probs=147.9 Q ss_pred hhhhhhhhhcccccCCccc-ccchhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeecccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGI-LPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i-~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~~ 222 (497) +......+.+- ......+ +......||+.+.+.+.|++.++.....+.+ ....+.++ -|+++|..=++..+.++.+ T Consensus 1 m~~~~~~a~TL-~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~-LP~~~fR~lN~g~~~s~~t 78 (330) T protein:vir:10 1 MATLSTNNPTM-ADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTG-LPTPTWRKLYGGVLPNKSS 78 (330) T ss_pred CCcCCCCcccH-HHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEee-cCCchhhhcCCccccccce Confidence 00000000000 0001112 2234567999999999999999887654332 22334444 3788999999999999999 Q ss_pred ceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceecccccccc---chhhh Q lcl|Aclame:pro 223 FARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTA---SSASS 296 (497) Q Consensus 223 ~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~---~~~~~ 296 (497) +.+++-..+-+++.+.|.+.+.+.. .++.........+++...+...||||+...+|.++.....-.+. ..+.. T Consensus 79 t~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~q 158 (330) T protein:vir:10 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) T ss_pred EEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhh Confidence 9999999999999999999998754 24556667778999999999999999988777777533222110 00000 Q ss_pred hhhHHHHHHHH-Hhhhh---hcchhhhhh---------------------------------hhhhhhh--hhhhhhccc Q lcl|Aclame:pro 297 LFGATSATVSN-VKFPA---DGTNGAFVG---------------------------------QDTVASL--KYGRVVTGA 337 (497) Q Consensus 297 ~~~~~~~~~~~-~~~~~---~~~~~~~~~---------------------------------~~~~~~~--~~~~~~~~~ 337 (497) .+......... +.... ....-+++. ..|...+ +......-. T Consensus 159 vIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI 238 (330) T protein:vir:10 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) T ss_pred eeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEE Confidence 00000000000 00000 000000000 0000000 011111111 Q ss_pred ccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHH-hcccCcccccccccccccccccccccc Q lcl|Aclame:pro 338 AGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT-KDANGQYMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~l-kd~~G~~~~~~~~~~~~~~~~~~~~~l 416 (497) .+...+.........+.++-++.+...++ +......+|+||......|++. .+.....+-...+. ......+ T Consensus 239 ~NIdvs~l~~~~~~~~li~lm~~A~~~ip-~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~------g~~~t~~ 311 (330) T protein:vir:10 239 CNIDVSDLATSANAQALIKYMIMAAERIP-QLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVS------GERVMTF 311 (330) T ss_pred eecccccCCCCccHHHHHHHHHHHHHhcc-CCCCCcceeeechHHHHHHHHHHhhcccceeeeeecC------CeeeEEE Confidence 11111111111122234444555554433 4455667899999999999986 34433332221111 1112367 Q ss_pred cccceeecCCCCcCcEEEe Q lcl|Aclame:pro 417 WGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 417 ~G~pvv~s~~~~~~~~~~g 435 (497) +|+||..++++-.....+- T Consensus 312 ~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 312 DGIPVQRTDALLNTESRVV 330 (330) T ss_pred CCeEEEEEeeeecCccccC Confidence 7999999888754332211 No 154 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.54 E-value=1.8e-08 Score=63.05 Aligned_cols=297 Identities=12% Similarity=0.085 Sum_probs=156.9 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccc---hhhHHHHHHHhhhhHHhhccceecCC---CceEEEEe Q lcl|Aclame:pro 127 PGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTE 200 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~---~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~ 200 (497) .......... ......-+...-...+.|.++..+ +.+.+++...+...-+.++++.+-.+ .++.|... T Consensus 1 ~~~~~~~~~~------~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~ 74 (314) T protein:vir:10 1 MAIKFDAEQA------KITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEF 74 (314) T ss_pred CccchHHHHH------HHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeee Confidence 0000000000 000000011111122234455442 34457776666666666666543322 35677766 Q ss_pred ecCCccceeecccc-ccccccccceeEEeeeeeeeeechhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhccCC Q lcl|Aclame:pro 201 SAAHNNAAAVAEAG-TYPFSSEEFARVYEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGG 275 (497) Q Consensus 201 ~~~~~~a~~v~Eg~-~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds----~~l~~~i~~~la~~~~~~~d~a~l~G~g 275 (497) .. .+.+.|++.+. .+|..+..+++.....+.++..+.+|.+=|+.+ .++..--....++++...+|+.+++|+. T Consensus 75 e~-~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~ 153 (314) T protein:vir:10 75 DG-VGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSA 153 (314) T ss_pred cc-ccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc Confidence 55 47788998764 489999999999999999999999976444332 2577777888899999999999999998 Q ss_pred CccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhh Q lcl|Aclame:pro 276 YPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIA 355 (497) Q Consensus 276 ~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (497) ..+..|++|.++.........+ .+....+ T Consensus 154 ~~g~~GLlN~p~v~~~~~~~~W---------------------------------------------------aT~~ei~ 182 (314) T protein:vir:10 154 PHGIVSVFDQPNINNVVATPNW---------------------------------------------------SVPQNAI 182 (314) T ss_pred cccceeEeecCCCccccCCCCc---------------------------------------------------ccHHHHH Confidence 8888999998764321111111 0122445 Q ss_pred hhhHHhhhhhhhh--hccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc-E Q lcl|Aclame:pro 356 ENVFDAFVDIQLT--LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-I 432 (497) Q Consensus 356 ~~~~~~~~~~~~~--~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-~ 432 (497) +++..++.++... +...|..++|+|..+..|...-+..|.-++.-... .+.+-+|-+.|-..+....-.+ + T Consensus 183 ~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~------n~~~l~I~~~~el~~ag~~g~~~~ 256 (314) T protein:vir:10 183 DDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTR------NNPGLTIRFLQFLDNYDGAGGKAA 256 (314) T ss_pred HHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHH------hCCCcEEEEcccccccCCCcceEE Confidence 5666666555543 44667788899988876654333333322211110 1112234444444432221111 1 Q ss_pred EEeeccceEEEEEeccccEEEEeccchhhhhcC--ceEEEEEeee-ccEeecccceEEEEecCCC Q lcl|Aclame:pro 433 LVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG--KVTVRAEERL-GLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~--~v~~r~~~r~-~~~v~~~~Af~~~~~~~~a 494 (497) ++.+-+.-.+.+..-+.++. .+ .+.. .+.+.+..|+ +..+++|.||++++=-+=| T Consensus 257 v~y~~~~~~~~~~vp~~~~~--l~-----~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 257 LAFEKSPLNMSIEIPEVTNV--LP-----AQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEEecCCcEEEEecCcccee--ec-----ceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 22111221222221122221 11 1222 2334457777 5677889999975533333 No 155 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.52 E-value=1.7e-09 Score=68.59 Aligned_cols=284 Identities=13% Similarity=0.004 Sum_probs=145.5 Q ss_pred hhhhhhhhhcccccCCccc-ccchhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeecccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGI-LPTFLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSEE 222 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i-~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~~ 222 (497) +......+.+- ......+ +......||+.+.+.+.|+..++.....+.+ ....+.++ -|+++|..=++..+.++.+ T Consensus 1 m~~~~~~a~TL-~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~-LP~~~fR~lN~g~~~s~~t 78 (335) T protein:vir:73 1 MALIGQTLPSL-LDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAG-IPEPVWRRYNQGVQPTKTQ 78 (335) T ss_pred CCcCCCCchhH-HHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEe-cCCchhhhcCCccccccce Confidence 00000000000 0001111 1234556999999999999999887654332 22334444 3788999999999999999 Q ss_pred ceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccc------cch Q lcl|Aclame:pro 223 FARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFT------ASS 293 (497) Q Consensus 223 ~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~------~~~ 293 (497) +.+++-..+-+++.+.|.+.+.+-. .++...-.....+++...+...||||+...+|.++...+.-.+ ... T Consensus 79 t~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~ 158 (335) T protein:vir:73 79 TVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAAS 158 (335) T ss_pred EEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCc Confidence 9999999999999999999887643 2466666777899999999999999998877777753322110 000 Q ss_pred hhhhhhHHHHHHHH-Hhhhhh-cch--hhhhh-------------------------------hhhhhhh--hhhhhhcc Q lcl|Aclame:pro 294 ASSLFGATSATVSN-VKFPAD-GTN--GAFVG-------------------------------QDTVASL--KYGRVVTG 336 (497) Q Consensus 294 ~~~~~~~~~~~~~~-~~~~~~-~~~--~~~~~-------------------------------~~~~~~~--~~~~~~~~ 336 (497) +...+......... +..... +.. -+++. ..|...+ +......- T Consensus 159 a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 159 AENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred ccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence 00000000000000 000000 000 00000 0000000 00011111 Q ss_pred ccccccccc-ccccchhhhhhhhHHhhhh-hhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccc Q lcl|Aclame:pro 337 AAGSGSGVA-GSYPTAAEIAENVFDAFVD-IQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGK 414 (497) Q Consensus 337 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~ 414 (497) ..+...+.. .......+..+.+..++.+ .....+....+|+||......|++.-.......+...... ....- T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~-----g~~~t 313 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYG-----GKKIV 313 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccC-----CceeE Confidence 111111100 0011112333444445432 2344556668899999999999986444333322221111 11112 Q ss_pred cccccceeecCCCCcCc-EEEe Q lcl|Aclame:pro 415 NIWGVPVVTTPLIPLGT-ILVG 435 (497) Q Consensus 415 ~l~G~pvv~s~~~~~~~-~~~g 435 (497) .++|+||..++++-... .++. T Consensus 314 ~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 314 SFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred EECCeEEEEEeeeecCcccccC Confidence 56789999888875432 2222 No 156 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.51 E-value=7.6e-08 Score=59.62 Aligned_cols=277 Identities=10% Similarity=0.004 Sum_probs=143.0 Q ss_pred hhhcccccCCcccccchhhHHH-HHHHhhhhHHh---------hccceecCCCceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii-~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ++. +..+..|.|+....++ +...+.+.+++ +.....-++..+++|....-++.+.-+.|+..++..+ T Consensus 1 MA~---T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~k 77 (351) T protein:vir:15 1 MAE---THLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNN 77 (351) T ss_pred CCc---eeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhe Confidence 221 2234566777666554 33333444432 1122223567799998865456677889999998888 Q ss_pred ccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhh Q lcl|Aclame:pro 221 EEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 221 ~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~ 299 (497) .+-++-....+..+.-+.++++...-+ .+....|.++++..+.+..+..+|.- .+|++......... T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gv~~~~~~~~~~------- 145 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSV-----LKGVMGVTKIANSK------- 145 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhchhhcccc------- Confidence 888888888888887788877654433 36777799999999988888877631 11221110000000 Q ss_pred HHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEe Q lcl|Aclame:pro 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 379 (497) ..... . ..........+.+.++....--.....-.+|+|| T Consensus 146 ----------------------------------~~d~t---~---~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmh 185 (351) T protein:vir:15 146 ----------------------------------VYDQT---K---VSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVN 185 (351) T ss_pred ----------------------------------eeccc---c---ccccccccCHHHHHHHHHHhccccccceEEEEEC Confidence 00000 0 0000001112333333333222212224689999 Q ss_pred hhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC-------cEEEeeccceEEEEEeccccEE Q lcl|Aclame:pro 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-------TILVGHFAPSVIQTARREGVTM 452 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~-------~~~~gd~~~~~~~i~~r~~~~i 452 (497) +.++..|++.+--+ |+ .... ....-+++.|++|++++.+|.. .+..--|..+++.+.++ ...+ T Consensus 186 S~v~~~L~~~~li~--~~--~~s~-----~~~~i~t~~G~~VivdD~~p~~~~~~~~~~ytsyl~~~GAi~~~~~-~~~v 255 (351) T protein:vir:15 186 SATYSLMKVQGLIE--TI--QPQN-----GATPFEAYNGLRIVLDDDIEIDLTDKTKPVSTSYIFAPGAVRYSTN-MRST 255 (351) T ss_pred hHHHHHHHhhhhhh--hc--cccc-----cCcccceecceEEEEcCCCccccCCCCCceeEEEEEecceeeeecC-CcCc Confidence 99999998654211 11 0000 0123468999999999999841 11122233334443333 3334 Q ss_pred EEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCC-CCCC Q lcl|Aclame:pro 453 QMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG-ATGS 497 (497) Q Consensus 453 ~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~-a~~~ 497 (497) ++.++... ..++-.+....|+ ++||..+..-.-..+ ...| T Consensus 256 e~~rd~~~--~~g~d~l~~r~~~---~~hp~G~s~~~~~~~~~~~s 296 (351) T protein:vir:15 256 ETKYDPLI--NGGQDVIVQKRVG---TIHVAGTSIKASFSPSKASF 296 (351) T ss_pred ceeecccC--CCCceEEEEeeee---eeeeeeeeecccccccCcCC Confidence 54444321 1345455554443 467777665322111 1112 No 157 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.48 E-value=6.3e-08 Score=60.04 Aligned_cols=280 Identities=11% Similarity=0.017 Sum_probs=142.9 Q ss_pred hhhcccccCCcccccchhhHHH-HHHHhhhhHHh---------hccceecCCCceEEEEeecCCccceeecccc-ccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAG-TYPFS 219 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii-~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~-~~~~s 219 (497) ++ ...+.-...+.|+.....+ ....+.+.+++ +......++..+++|....-++.+.-+.||+ .++.. T Consensus 1 Ma-~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~ 79 (330) T protein:vir:10 1 MA-NELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETG 79 (330) T ss_pred CC-CCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchh Confidence 22 2233445566666555544 33333333322 1122233577899999876556777788886 58888 Q ss_pred cccceeEEeeeeeeeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhh Q lcl|Aclame:pro 220 SEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLF 298 (497) Q Consensus 220 ~~~~~~v~~~~~kia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~ 298 (497) +.+-++-....++.+.-..++++....+ .+....+.+.+++.+.+..+..+|. -..|+++........... T Consensus 80 ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla-----~l~gvf~~~~~~~~~~~~--- 151 (330) T protein:vir:10 80 KITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIA-----TLNGIFATGTAGEKGALE--- 151 (330) T ss_pred hcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHH-----HHHhhhhhhhcccchhhh--- Confidence 8888888888888888888887764333 4677778888988888877776653 122222211000000000 Q ss_pred hHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEE Q lcl|Aclame:pro 299 GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVM 378 (497) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (497) ...... ..........+.+.++....-.. ...-.+|+| T Consensus 152 -----------------------------------~~~~~~------~~~~~a~~s~~~l~~A~~~~GD~-~~~~~~ivm 189 (330) T protein:vir:10 152 -----------------------------------ETHVSD------QSKASTGIDAGMVLDAKQLLGDS-ADQVTAIAM 189 (330) T ss_pred -----------------------------------hhheec------ccccccccCHHHHHHHHHHhccc-cccceEEEE Confidence 000000 00000000111122222111111 123568999 Q ss_pred ehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc--EEEeeccceEEEEEecc---ccEEE Q lcl|Aclame:pro 379 NPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARRE---GVTMQ 453 (497) Q Consensus 379 n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~--~~~gd~~~~~~~i~~r~---~~~i~ 453 (497) |+.++..|++.+--+ |+- ... ....-++++|++|++++.+|... +..--|..+++.+.+.. .+.++ T Consensus 190 hS~v~~~L~~~~li~--~~~-~s~------~~~~i~~~~G~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~E 260 (330) T protein:vir:10 190 HSAVYTKLQKDNLIQ--YIQ-PTT------ATINIPTYLGYRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFE 260 (330) T ss_pred cHHHHHHHHHhhhhh--hhc-ccc------cCcccccccceEEEEeCCCCCCCCceeEEEEecCceeeecccCCcccccc Confidence 999999998753211 111 100 01234688999999999998432 21122344555554322 23445 Q ss_pred EeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCC-CCCC Q lcl|Aclame:pro 454 MTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG-ATGS 497 (497) Q Consensus 454 ~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~-a~~~ 497 (497) .++.. ...+..+....++. +||-.|..-.-... ++-| T Consensus 261 tdRd~----~~g~~~l~~r~~~~---~hp~G~s~~~~~~~~~~~s 298 (330) T protein:vir:10 261 TSREA----AKGNDMIYTRRALV---MHPYGVKWTGAEVDAGNIT 298 (330) T ss_pred ccCCc----cccceEEEEeeEEE---eeeeeeeecccccccCcCC Confidence 55443 24555666666644 45666655432211 1222 No 158 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.48 E-value=1.1e-08 Score=64.29 Aligned_cols=253 Identities=10% Similarity=0.035 Sum_probs=120.7 Q ss_pred cceec-CCCceEEEEeecCCccceeecccccccc--cccccee--EEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 186 SSRPV-TSPNLSYLTESAAHNNAAAVAEAGTYPF--SSEEFAR--VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLE 260 (497) Q Consensus 186 ~~~~~-~~~~~~~p~~~~~~~~a~~v~Eg~~~~~--s~~~~~~--v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~ 260 (497) -++++ +++++++|+.- ..++....-|+.+.. .++.-++ +++.-.++..+..-.-+=.+...++.+.+.++.++ T Consensus 1 ~vr~i~~g~s~~~~~iG--~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~ 78 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMG--RTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGE 78 (324) T ss_pred CeeeeecCceEEEeeee--eeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHH Confidence 23334 35678999863 345666666665532 3344555 44444444433222211122223688999999999 Q ss_pred HHHHHHHhhhhcc----C--CCc-cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 261 GIQRKEEVQLLAG----G--GYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV 333 (497) Q Consensus 261 ~~~~~~d~a~l~G----~--g~~-~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (497) ++++..|+.++.- . .+. ...++....+......+...... T Consensus 79 aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~--------------------------------- 125 (324) T protein:vir:99 79 ALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDP--------------------------------- 125 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccccccccc--------------------------------- Confidence 9999999876411 0 000 00000000000000000000000 Q ss_pred hcccccccccccccccchhhhhhhhHHhhhhhhhhhc-cCCceEEEehhHHHHHHHHhc-ccCccccccccccccccccc Q lcl|Aclame:pro 334 VTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPRDWELLRLTKD-ANGQYMGGNFFGNAYGNPVN 411 (497) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~n~~~~~~l~~lkd-~~G~~~~~~~~~~~~~~~~~ 411 (497) ........+.++.+...+....- ...-..+++|..+..|..-+. ..+.|..... .-.+ T Consensus 126 --------------~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~------~~~G 185 (324) T protein:vir:99 126 --------------AKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPDTYSAILAALMPNAANYAALID------PETG 185 (324) T ss_pred --------------ccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHhhcccccccccccccc------eecc Confidence 00000111222222222211111 122356789998876643221 1222221111 1122 Q ss_pred ccccccccceeecCCCCcCcE-------------------------EEeeccceEEEEE--------eccccEEEEeccc Q lcl|Aclame:pro 412 GGKNIWGVPVVTTPLIPLGTI-------------------------LVGHFAPSVIQTA--------RREGVTMQMTNSN 458 (497) Q Consensus 412 ~~~~l~G~pvv~s~~~~~~~~-------------------------~~gd~~~~~~~i~--------~r~~~~i~~~~~~ 458 (497) .-.+++|++|+.|+++|.+.+ +-+|++...-.++ .-..++++..++. T Consensus 186 ~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~ 265 (324) T protein:vir:99 186 NIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRP 265 (324) T ss_pred eEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecceecceech Confidence 334689999999999985311 2344443322222 2233455555543 Q ss_pred hhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC-C Q lcl|Aclame:pro 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG-S 497 (497) Q Consensus 459 ~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~-~ 497 (497) . .|. ..+++..-++..++||++.+.+++++.++- - T Consensus 266 ~-~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 266 E-YQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred h-hHH---HhhhhhhhhcCcccccceEEEEEEccCccccc Confidence 2 333 346666668999999999999998887651 1 No 159 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.47 E-value=7.9e-08 Score=59.52 Aligned_cols=310 Identities=14% Similarity=0.094 Sum_probs=167.1 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcc--cccCCcccccc---hhhHHHHHHHhhhhHHh Q lcl|Aclame:pro 109 EKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGS--TGTFAPGILPT---FLPGIVEQLFYELSLAD 183 (497) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~i~~~---~~~~ii~~~~~~~~l~~ 183 (497) .++. .-..+....+.... ......+..... ..+.+.++..+ +.+.+++...+....+. T Consensus 1 ~~~~-----------~~~~~~~~d~~~~~------~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~ 63 (329) T protein:vir:79 1 MRGN-----------IMSKEMKYDEFEAN------VIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALR 63 (329) T ss_pred Cccc-----------hhhhhhccchhhhh------hHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhh Confidence 0000 00000000000000 000001111111 12223444433 35668887887777788 Q ss_pred hccceecC---CCceEEEEeecCCccceeeccc-cccccccccceeEEeeeeeeeeechhhHHHHhhH----HHHHHHHH Q lcl|Aclame:pro 184 LISSRPVT---SPNLSYLTESAAHNNAAAVAEA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA----PELFNFVQ 255 (497) Q Consensus 184 ~~~~~~~~---~~~~~~p~~~~~~~~a~~v~Eg-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds----~~l~~~i~ 255 (497) ++++.+.. ..++.|..... .+.+.|++.+ ..+|..+..+++-....+.++..+.++..=|+.+ .++..--. T Consensus 64 ~i~i~~~~~~~~~~~t~~~~~~-~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~ 142 (329) T protein:vir:79 64 VFPVTSELSDTDKTFEYQTFDK-VGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKA 142 (329) T ss_pred hcccccCCCCceeEEEeeeeec-ceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHH Confidence 77765432 23567776665 4778899875 5688888889998899999999888876444332 24777788 Q ss_pred HHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhc Q lcl|Aclame:pro 256 GRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT 335 (497) Q Consensus 256 ~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (497) ...++++...+|+-+++|++..+..|++|.++..+.+.++ T Consensus 143 ~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~---------------------------------------- 182 (329) T protein:vir:79 143 NAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAG---------------------------------------- 182 (329) T ss_pred HHHHHHHHHhhccEEEeecccccceeeecCCCccccccCC---------------------------------------- Confidence 8888999999999999999887889999987654322110 Q ss_pred ccccccccccccccchhhhhhhhHHhhhhhhhh--hccCCceEEEehhHHHHHHHHhcccCccccccccccccccccccc Q lcl|Aclame:pro 336 GAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT--LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGG 413 (497) Q Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~ 413 (497) .........+...+++++..++.++... +...|..++|+|..+..|.......|.-++.-... .+.. T Consensus 183 -----~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk~------~~~~ 251 (329) T protein:vir:79 183 -----WNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLDYFKQ------QNGG 251 (329) T ss_pred -----CCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHHHHHH------hCCC Confidence 0001112234445666777776666553 44567889999998888865444445433322111 1112 Q ss_pred ccccccceeecCCCC-cCcEEEeeccceEEEEEeccccEEEEeccchhhhhcC--ceEEEEEeeec-cEeecccceEEEE Q lcl|Aclame:pro 414 KNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDG--KVTVRAEERLG-LLVYRPSAFQLIQ 489 (497) Q Consensus 414 ~~l~G~pvv~s~~~~-~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~--~v~~r~~~r~~-~~v~~~~Af~~~~ 489 (497) -+|-+.|-..+.... .+.+++.+-+.-.+.+..-+.++. ... +.. .+.+.+..|++ ..+++|.||++++ T Consensus 252 l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~--l~~-----q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~d 324 (329) T protein:vir:79 252 ITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNM--LTA-----QPKDLHFKVPCTSKCTGLTIYRPLTLVLIK 324 (329) T ss_pred cEEEEcccccccCCCCceEEEEEecCCceEEEecCcceee--eec-----eecCceEEEceeeeEEEEEEECcceeeeee Confidence 234444443322211 112333333333333332223322 111 122 23345567775 6677899999976 Q ss_pred ecCCCCC Q lcl|Aclame:pro 490 LKKGATG 496 (497) Q Consensus 490 ~~~~a~~ 496 (497) -...| T Consensus 325 --GI~~~ 329 (329) T protein:vir:79 325 --GLVVG 329 (329) T ss_pred --eeeeC Confidence 22222 No 160 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.41 E-value=5.7e-09 Score=65.76 Aligned_cols=283 Identities=12% Similarity=0.000 Sum_probs=145.9 Q ss_pred hhhhhhhhhcccccCCccc-ccc-hhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeeccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGI-LPT-FLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i-~~~-~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~ 221 (497) +......+ .+-......+ +.. ....||+.+.+.+.|+..++.....++. ..+.+++. -++++|..=++..+.++. T Consensus 1 m~~~~~~~-~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~-LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTLSTTN-PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCccccCc-ccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEec-cCCchhhccCCccCcccc Confidence 00000000 0000001111 222 2457999999999999999998765443 44566666 378999999999999999 Q ss_pred cceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccc---hhh Q lcl|Aclame:pro 222 EFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS---SAS 295 (497) Q Consensus 222 ~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~---~~~ 295 (497) ++.+++-..+-+++.+.|.+.+.+.. .++...-.....+++...+...||||+...+|.++.....-.... .+. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 99999999999999999999988754 245555666788899999999999999877777664222211000 000 Q ss_pred hhhhHHHHHHH-HHhhhh---hcchhhhh-------------------------------hhhhhhhh--hhhhhhcccc Q lcl|Aclame:pro 296 SLFGATSATVS-NVKFPA---DGTNGAFV-------------------------------GQDTVASL--KYGRVVTGAA 338 (497) Q Consensus 296 ~~~~~~~~~~~-~~~~~~---~~~~~~~~-------------------------------~~~~~~~~--~~~~~~~~~~ 338 (497) ..+........ .+.... ....-+++ ...|...+ +......-.. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00000000000 000000 00000000 00000000 0000011011 Q ss_pred cccccc-cccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc-ccccccccccccccccccccc Q lcl|Aclame:pro 339 GSGSGV-AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 339 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~-~~~~~~~~~~~~~~~~~~~~l 416 (497) +...+. .....+..+..+-++.+...+ .+......+|+||......|++....-+. +....... .......+ T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~i-p~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~-----~g~~~t~~ 312 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELI-PNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-----AGKKVVAF 312 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHh-cccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeec-----CCcceeEE Confidence 110000 000111223334444444443 23446667899999999999987433322 21111111 11222357 Q ss_pred cccceeecCCCCcCcEEEe Q lcl|Aclame:pro 417 WGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 417 ~G~pvv~s~~~~~~~~~~g 435 (497) +|+||..++++-.....+- T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 7899998887754322111 No 161 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.41 E-value=5.7e-09 Score=65.76 Aligned_cols=283 Identities=12% Similarity=0.000 Sum_probs=145.9 Q ss_pred hhhhhhhhhcccccCCccc-ccc-hhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeeccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGI-LPT-FLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i-~~~-~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~ 221 (497) +......+ .+-......+ +.. ....||+.+.+.+.|+..++.....++. ..+.+++. -++++|..=++..+.++. T Consensus 1 m~~~~~~~-~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~-LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:98 1 MPTLSTTN-PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCccccCc-ccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEec-cCCchhhccCCccCcccc Confidence 00000000 0000001111 222 2457999999999999999998765443 44566666 378999999999999999 Q ss_pred cceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccc---hhh Q lcl|Aclame:pro 222 EFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS---SAS 295 (497) Q Consensus 222 ~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~---~~~ 295 (497) ++.+++-..+-+++.+.|.+.+.+.. .++...-.....+++...+...||||+...+|.++.....-.... .+. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:98 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 99999999999999999999988754 245555666788899999999999999877777664222211000 000 Q ss_pred hhhhHHHHHHH-HHhhhh---hcchhhhh-------------------------------hhhhhhhh--hhhhhhcccc Q lcl|Aclame:pro 296 SLFGATSATVS-NVKFPA---DGTNGAFV-------------------------------GQDTVASL--KYGRVVTGAA 338 (497) Q Consensus 296 ~~~~~~~~~~~-~~~~~~---~~~~~~~~-------------------------------~~~~~~~~--~~~~~~~~~~ 338 (497) ..+........ .+.... ....-+++ ...|...+ +......-.. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00000000000 000000 00000000 00000000 0000011011 Q ss_pred cccccc-cccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc-ccccccccccccccccccccc Q lcl|Aclame:pro 339 GSGSGV-AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 339 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~-~~~~~~~~~~~~~~~~~~~~l 416 (497) +...+. .....+..+..+-++.+...+ .+......+|+||......|++....-+. +....... .......+ T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~i-p~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~-----~g~~~t~~ 312 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELI-PNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-----AGKKVVAF 312 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHh-cccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeec-----CCcceeEE Confidence 110000 000111223334444444443 23446667899999999999987433322 21111111 11222357 Q ss_pred cccceeecCCCCcCcEEEe Q lcl|Aclame:pro 417 WGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 417 ~G~pvv~s~~~~~~~~~~g 435 (497) +|+||..++++-.....+- T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 7899998887754322111 No 162 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.41 E-value=5.7e-09 Score=65.76 Aligned_cols=283 Identities=12% Similarity=0.000 Sum_probs=145.9 Q ss_pred hhhhhhhhhcccccCCccc-ccc-hhhHHHHHHHhhhhHHhhccceecCCCc-eEEEEeecCCccceeeccccccccccc Q lcl|Aclame:pro 145 PAAIGQNPFGSTGTFAPGI-LPT-FLPGIVEQLFYELSLADLISSRPVTSPN-LSYLTESAAHNNAAAVAEAGTYPFSSE 221 (497) Q Consensus 145 ~~~~~~~~~~~~~~~g~~i-~~~-~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~s~~ 221 (497) +......+ .+-......+ +.. ....||+.+.+.+.|+..++.....++. ..+.+++. -++++|..=++..+.++. T Consensus 1 m~~~~~~~-~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~-LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTLSTTN-PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSG-LPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCccccCc-ccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEec-cCCchhhccCCccCcccc Confidence 00000000 0000001111 222 2457999999999999999998765443 44566666 378999999999999999 Q ss_pred cceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccc---hhh Q lcl|Aclame:pro 222 EFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTAS---SAS 295 (497) Q Consensus 222 ~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~---~~~ 295 (497) ++.+++-..+-+++.+.|.+.+.+.. .++...-.....+++...+...||||+...+|.++.....-.... .+. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 99999999999999999999988754 245555666788899999999999999877777664222211000 000 Q ss_pred hhhhHHHHHHH-HHhhhh---hcchhhhh-------------------------------hhhhhhhh--hhhhhhcccc Q lcl|Aclame:pro 296 SLFGATSATVS-NVKFPA---DGTNGAFV-------------------------------GQDTVASL--KYGRVVTGAA 338 (497) Q Consensus 296 ~~~~~~~~~~~-~~~~~~---~~~~~~~~-------------------------------~~~~~~~~--~~~~~~~~~~ 338 (497) ..+........ .+.... ....-+++ ...|...+ +......-.. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00000000000 000000 00000000 00000000 0000011011 Q ss_pred cccccc-cccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCc-ccccccccccccccccccccc Q lcl|Aclame:pro 339 GSGSGV-AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNI 416 (497) Q Consensus 339 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~-~~~~~~~~~~~~~~~~~~~~l 416 (497) +...+. .....+..+..+-++.+...+ .+......+|+||......|++....-+. +....... .......+ T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~i-p~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~-----~g~~~t~~ 312 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELI-PNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-----AGKKVVAF 312 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHh-cccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeec-----CCcceeEE Confidence 110000 000111223334444444443 23446667899999999999987433322 21111111 11222357 Q ss_pred cccceeecCCCCcCcEEEe Q lcl|Aclame:pro 417 WGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 417 ~G~pvv~s~~~~~~~~~~g 435 (497) +|+||..++++-.....+- T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 7899998887754322111 No 163 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.38 E-value=1.4e-08 Score=63.68 Aligned_cols=271 Identities=14% Similarity=0.110 Sum_probs=133.7 Q ss_pred hhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCC-ceEEEEee--cCCccceeeccccccccccccc Q lcl|Aclame:pro 147 AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTES--AAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 147 ~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~--~~~~~a~~v~Eg~~~~~s~~~~ 223 (497) ............-+....-++...+-..+.+-.-++...+..|+..+ .++.++.. ...+.+.-|+||+.+|.++.+. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 00011111111222222223333333333322333334466777654 35444432 2245678899999999999886 Q ss_pred e---eEEeeeeeeeeechhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHhhhhcc----CCCccccceeccccccccchh Q lcl|Aclame:pro 224 A---RVYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAG----GGYPGVNGLLQRSTGFTASSA 294 (497) Q Consensus 224 ~---~v~~~~~kia~~~~iS~ell~ds--~~l~~~i~~~la~~~~~~~d~a~l~G----~g~~~~~Gil~~~~~~~~~~~ 294 (497) . ..++..+|++.-+ |.|.++.+ .+..+.-.+.|..+++.++|..|+.- +|+..-+ ..+.... T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t-------~~t~~s~ 151 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRT-------NKTKLSA 151 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccc-------cceeecH Confidence 4 5788889988855 99998654 35677888899999999999887742 1110000 0000000 Q ss_pred hhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCc Q lcl|Aclame:pro 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) ....... ... +..++.... . .... T Consensus 152 ~glq~Al-------------------------------~~~----------------~~kl~~~~e--------d-~~~~ 175 (303) T protein:vir:10 152 ENLQGAL-------------------------------SKG----------------RANLSVLLD--------D-EITP 175 (303) T ss_pred HHHHHHH-------------------------------Hhh----------------hhhcccccc--------c-cccE Confidence 0000000 000 000000000 0 1134 Q ss_pred eEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEE-EEEeccccEEE Q lcl|Aclame:pro 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVI-QTARREGVTMQ 453 (497) Q Consensus 375 ~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~-~i~~r~~~~i~ 453 (497) ++++||.+.+.++. +++ ++.. .+..|...- -.++|.-|+.|..+|.|+++.---....+ .+..+..+. T Consensus 176 V~FvNP~Daa~yl~--~A~---i~~~--~t~fG~n~L--~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~-- 244 (303) T protein:vir:10 176 IAFVNPNDTAEYLA--NGF---INST--GAQFGVNLL--TPYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELS-- 244 (303) T ss_pred EEEEchHHHHHHhh--cCC---cchh--hhhhhhhhh--hhhhcceEEEeccCCCceEEEeeccceEEEEecCchhhh-- Confidence 88999999999864 432 1111 011111111 13789999999999999877533222211 122221111 Q ss_pred EeccchhhhhcCceEEEEEeee-------------c---cEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 454 MTNSNGTDFVDGKVTVRAEERL-------------G---LLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 454 ~~~~~~~~f~~~~v~~r~~~r~-------------~---~~v~~~~Af~~~~~~~~a~~~ 497 (497) ..-.|..|.+++.+..+. . +-+-+++++++.+++..-.+- T Consensus 245 ----~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~~ 300 (303) T protein:vir:10 245 ----RAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDEAGE 300 (303) T ss_pred ----hhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccccCC Confidence 112234455555444332 1 123366799999986543322 No 164 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.35 E-value=1.1e-07 Score=58.81 Aligned_cols=300 Identities=9% Similarity=0.019 Sum_probs=151.6 Q ss_pred hhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEE Q lcl|Aclame:pro 148 IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVY 227 (497) Q Consensus 148 ~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~ 227 (497) +..-..+-++......-.++...|+..-....|+.+++...+.++..++|...+-..+...-..||...+.......... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 11111122222223334555666666666677899998888887777888876654444345568877665432221111 Q ss_pred ee-eeeeeeechhhHHHHhhH-H---HHHHHHHHHHHHHHHHHHHhhhhccCCC-----c----cccceeccccccccch Q lcl|Aclame:pro 228 EQ-VGKVANALTITDEGLRDA-P---ELFNFVQGRLLEGIQRKEEVQLLAGGGY-----P----GVNGLLQRSTGFTASS 293 (497) Q Consensus 228 ~~-~~kia~~~~iS~ell~ds-~---~l~~~i~~~la~~~~~~~d~a~l~G~g~-----~----~~~Gil~~~~~~~~~~ 293 (497) -+ ..-+...+.||.-+..-+ . +...|=..+-...+.+-++.++|+|.-. . +-.||++.... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t----- 155 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKT----- 155 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhcc----- Confidence 11 122223344454433322 1 2223323333445778888999998621 0 11122211000 Q ss_pred hhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccc-cccccccccchh-hhhhhhHHhhhhhhhhhcc Q lcl|Aclame:pro 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGS-GSGVAGSYPTAA-EIAENVFDAFVDIQLTLFQ 371 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 371 (497) .......+.... .........+.. ...+.+..++..+-.+ +. T Consensus 156 -----------------------------------~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~-Gg 199 (317) T protein:vir:88 156 -----------------------------------NGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRN-GG 199 (317) T ss_pred -----------------------------------CceeccCccccccCCCccccccccccccHHHHHHHHHHHHhc-CC Confidence 000000000000 000000000000 1222333333333333 45 Q ss_pred CCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccc-cceeecCCCCcCcEEEeeccceEEEEEecccc Q lcl|Aclame:pro 372 TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-VPVVTTPLIPLGTILVGHFAPSVIQTARREGV 450 (497) Q Consensus 372 ~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G-~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~ 450 (497) .++.+++||.....|..+-...+.++..+......+..+...-+=+| +.++.+..||++++++.|++....... .++ T Consensus 200 ~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~L--r~~ 277 (317) T protein:vir:88 200 QANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYL--RPF 277 (317) T ss_pred CCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeec--ccc Confidence 67788999999999998854455555322211111111111111233 788999999999999999997654433 344 Q ss_pred EEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCC Q lcl|Aclame:pro 451 TMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 451 ~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~ 495 (497) ..+.....+ +.....++..+++.+..|.|.++++--++.- T Consensus 278 ~~e~laKtG-----d~~k~~i~~E~tLe~~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 278 FQHELAKTG-----DSEKRQLLVEYTFRVNNEKSGALIRDVVAQL 317 (317) T ss_pred eeeccCCCc-----ccceeEEEEEEEEEEcCccceeEEEEecccC Confidence 444333333 3445778889999999999999998555444 No 165 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.13 E-value=4.4e-07 Score=55.45 Aligned_cols=297 Identities=10% Similarity=-0.033 Sum_probs=141.7 Q ss_pred HHHHHhhhhhhhhhhhhhhc-ccccCCcccccchhhHHHHHHHhhhhHHhhccceec---CCCceEEEEeecCCccceee Q lcl|Aclame:pro 135 MGAFADGETAPAAIGQNPFG-STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV---TSPNLSYLTESAAHNNAAAV 210 (497) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~p~~~~~~~~a~~v 210 (497) .....++ . ..+..+ .+...-.++|..|...+++.+.+.+.+.++++.... .+.++++|+.. .+++..+ T Consensus 1 ~~~~~~~---~---~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g--~~~a~d~ 72 (381) T protein:vir:80 1 MATIQGT---G---GYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS--RAAVYDK 72 (381) T ss_pred Cceeccc---c---cccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC--cceeeee Confidence 0000000 0 000001 111113356556777788888777777777665432 35578899854 4567888 Q ss_pred ccccccccccccceeEEeeeeeee-eechhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHhhhhccC--CCcccccee-cc Q lcl|Aclame:pro 211 AEAGTYPFSSEEFARVYEQVGKVA-NALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG--GYPGVNGLL-QR 285 (497) Q Consensus 211 ~Eg~~~~~s~~~~~~v~~~~~kia-~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~a~l~G~--g~~~~~Gil-~~ 285 (497) .++..++..+++.+++++...+.- .-..|++. ..+.+.++.+.+.+.+..+++++.|+.++.-- ....+.+.. .. T Consensus 73 ~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~ 152 (381) T protein:vir:80 73 QPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSY 152 (381) T ss_pred cCCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 899888877777787777774433 33566654 33445578899999999999999999876321 000000000 00 Q ss_pred ccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhh Q lcl|Aclame:pro 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) ......... .............+.++.+...+ T Consensus 153 ~~~i~~~~~------------------------------------------------~~~~t~~~~~~t~~~i~~a~~~L 184 (381) T protein:vir:80 153 DTTLGDGTV------------------------------------------------NAHLTGTPAPLTYAALLLAKQKL 184 (381) T ss_pred ccccccccc------------------------------------------------ccccccchhhHHHHHHHHHHHHH Confidence 000000000 00000000011122333333332 Q ss_pred hhhhc-cCCceEEEehhHHHHHHHHhcccC-cccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEE Q lcl|Aclame:pro 366 QLTLF-QTPNAVVMNPRDWELLRLTKDANG-QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQ 443 (497) Q Consensus 366 ~~~~~-~~~~~~~~n~~~~~~l~~lkd~~G-~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~ 443 (497) ....- ...-.++++|..+..|.+...-.. .|.... ..-.+.-.+++|++|+.|+++|.+.+..-....+ . T Consensus 185 de~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~------~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~ag-a- 256 (381) T protein:vir:80 185 DEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVK------PVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQG-A- 256 (381) T ss_pred hhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccch------hhhceeeeEEcceEEEeecccccccccceeeecc-c- Confidence 22211 122367889999998865321111 111111 1112223479999999999999753321000000 0 Q ss_pred EEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEE----EecCCCCCC Q lcl|Aclame:pro 444 TARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLI----QLKKGATGS 497 (497) Q Consensus 444 i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~----~~~~~a~~~ 497 (497) .......+.-+.+.+ .|..+-.+++....+|..+...-..+.. -.+.+.+.. T Consensus 257 -p~~~~~~~~~~~~~g-~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~ 312 (381) T protein:vir:80 257 -PTQPTPGVLGSPYLP-DQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQ 312 (381) T ss_pred -ccccccccccccccc-ccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCc Confidence 000011112222222 3555666777777777777433222221 111111111 No 166 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=97.96 E-value=9.6e-07 Score=53.56 Aligned_cols=280 Identities=11% Similarity=0.056 Sum_probs=136.7 Q ss_pred hhhcccccCC-ccccc-chhhHHHHHHHhhhhHHhhccceec-CCCceEEEEeecCCccceeeccccccccccccce--e Q lcl|Aclame:pro 151 NPFGSTGTFA-PGILP-TFLPGIVEQLFYELSLADLISSRPV-TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFA--R 225 (497) Q Consensus 151 ~~~~~~~~~g-~~i~~-~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~--~ 225 (497) +..+..++.. .++.| .|+..|..-+.+......+.++... .+.++.||.... ++..=..+++.+.-.+++-. . T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~--~tV~dY~~~~~i~~d~ltt~~~~ 78 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGT--PVVRSRPEQGDFTFDNLDTGEIS 78 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccc--cccccccCCCCcccccCCCceEE Confidence 3333333332 34545 5777777777766655555554332 466789887654 34443444554433333333 4 Q ss_pred EEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhc--cCCCc------cccceeccccccccchhhhh Q lcl|Aclame:pro 226 VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLA--GGGYP------GVNGLLQRSTGFTASSASSL 297 (497) Q Consensus 226 v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~--G~g~~------~~~Gil~~~~~~~~~~~~~~ 297 (497) +.+.-.|+.++. |+++..+++.+|.+...++.+++++...|..+.. -+|.. .|.-+-..+..+. T Consensus 79 l~IDq~KYfaf~-VdDD~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv------- 150 (322) T protein:vir:31 79 IILRDEVYAGNA-ISKKLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFV------- 150 (322) T ss_pred EEEehhhhhccc-cchhHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCcccee------- Confidence 445556676655 7777777788999999999999999988876521 11110 0100000000000 Q ss_pred hhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCc-eE Q lcl|Aclame:pro 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AV 376 (497) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 376 (497) ..........+.++.+...+....-...+ .. T Consensus 151 ------------------------------------------------~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~v 182 (322) T protein:vir:31 151 ------------------------------------------------GTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIG 182 (322) T ss_pred ------------------------------------------------ccCCCchhhHHHHHHHHHHhccccCCCCCeEE Confidence 00000001112222222222222212122 34 Q ss_pred EEehhHHHHHHH-------HhcccCcccccccccccccccccccccccccceeecCCCCcCc--EE---------Eeecc Q lcl|Aclame:pro 377 VMNPRDWELLRL-------TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--IL---------VGHFA 438 (497) Q Consensus 377 ~~n~~~~~~l~~-------lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~--~~---------~gd~~ 438 (497) +++|.-...|.. +|| +|..... .++.......-.++.|+-|++|+.++.++ +. .|-++ T Consensus 183 VV~P~~~~~L~~i~~~~~l~~D--~rf~~i~--~sG~a~g~~~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n 258 (322) T protein:vir:31 183 IIDPSVAHHLETITNISNISNN--PRWEGIV--ESGIAPDMQFVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCN 258 (322) T ss_pred EeCchhhhhhhhhhhhhhhhcc--ccccccc--cccchhhHHHHHHHhceeeeeeccccccccccccCcccccccceeec Confidence 456777665533 333 3322110 01100000113578899999999987433 11 12222 Q ss_pred ceEEEEEecc----------ccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 439 PSVIQTARRE----------GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 439 ~~~~~i~~r~----------~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) . +..+.|-+ -.+-+..+... +.--.+|+.+|+|..+.+|+..+.|.-.++.+-- T Consensus 259 ~-f~~~~~~~~~~~~~~~~~l~~~e~~r~~~----~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 259 M-FMNVSDMGLLPFVVAWKEMPTTKSFIDDY----NDDLNTATTARWGNGLVRDENLVCVLANADKVTF 322 (322) T ss_pred c-cccccchhhhhhhhHhhhhhhhhcccCcc----ccccceeeeeeecceeecccceEEEEeccccccC Confidence 1 22222211 11112222111 2334689999999999999999998744433333 No 167 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.79 E-value=1.9e-05 Score=46.50 Aligned_cols=420 Identities=12% Similarity=0.084 Sum_probs=157.1 Q ss_pred CchHHHHHH-------HHHHHHHHHHHHH--------------------------------HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEA-------QGRQLAKSIKDIN--------------------------------ADETKTAAEKKEALAKIEP 41 (497) Q Consensus 1 m~~~~~~~~-------~~~~l~~~~~~~~--------------------------------~~~~~~~~e~~~~~~~~~~ 41 (497) +|....... +.+.+-+.++.+. .-..+.+++.++....|.+ T Consensus 168 ~~~~~~~a~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq~~aeeraRi~~I~~ 247 (652) T protein:vir:79 168 TPAVKAMACIQSKRTEEFKKMPDSIRNMITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQVLAEQKARVNGIND 247 (652) T ss_pred cchhhhhhhhhhhhhhhhhhhHHHHHHHhcccccccccccccccccccccccccCCcCchhHHHHHHHHHHHHHHHHHHH Confidence 221111100 0000000111000 0000011111111111111 Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH--HHHhhhhhhHHHhhhhhhhhhhhh-- Q lcl|Aclame:pro 42 DFKAHQAEVEAHERAQEM---LKSLGGADAAKDGLDNDIPEVEVRNLKQIRK--HLARAVIMNPELKNATSFEKGTKF-- 114 (497) Q Consensus 42 ~~~~~~~~~e~~e~~~e~---~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-- 114 (497) ....+...++... .+.+ ...++++++ .+-+.+.+........... .............+......+... T Consensus 248 l~a~Fggr~~~l~-~~~l~d~~~s~e~ar~---~il~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~ 323 (652) T protein:vir:79 248 LFAMFGGRYQTLQ-AQCLADPECSLEQARE---KLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTE 323 (652) T ss_pred HHHhhccccchHH-HHHhhccCCCHHHHHH---HHHHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccc Confidence 1111110000000 0000 000000000 0000110000000000000 000000000000000000000000 Q ss_pred -hhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHh-hhhHHhhccceecCC Q lcl|Aclame:pro 115 -DVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY-ELSLADLISSRPVTS 192 (497) Q Consensus 115 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~-~~~l~~~~~~~~~~~ 192 (497) ...... ....+..+.....++.............+....++++++..+-...-..+...... +...+..+...+++- T Consensus 324 ~~~~~~g-~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~D 402 (652) T protein:vir:79 324 RDNVYNG-MTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSD 402 (652) T ss_pred cCccccC-ccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCcc Confidence 000000 00011122222222222222222233333333456666555544444444444443 335667777766643 Q ss_pred -CceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHH-hhHHHHHHHHHHHHHHHHHHHHHhh- Q lcl|Aclame:pro 193 -PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL-RDAPELFNFVQGRLLEGIQRKEEVQ- 269 (497) Q Consensus 193 -~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell-~ds~~l~~~i~~~la~~~~~~~d~a- 269 (497) ...+..+... -+...-|.|++++.-....=...++...+++.++.||+|++ +|--++-.-|-..+.++.++.+++. T Consensus 403 Fk~~~~~~lg~-~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~v 481 (652) T protein:vir:79 403 FKIAHRVGMGG-FSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLV 481 (652) T ss_pred ccccceeecCC-CCCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHH Confidence 2234444433 46677899999998777777788999999999999999975 6766767778888888888888864 Q ss_pred --hhccCCC-c-cccceeccccccc-cchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccc Q lcl|Aclame:pro 270 --LLAGGGY-P-GVNGLLQRSTGFT-ASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV 344 (497) Q Consensus 270 --~l~G~g~-~-~~~Gil~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (497) +|.++.. . ..+.++..+.-.. .+.+............ ..+..+. T Consensus 482 y~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~~~~l~~ar~--------------------aM~~Qk~----------- 530 (652) T protein:vir:79 482 YAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQ--------------------LMRVQKE----------- 530 (652) T ss_pred HHHHhcCcccccCCceeecccccccccccccCCHHHHHHHHH--------------------HHHHhcc----------- Confidence 4444432 1 2223331111000 0000000000000000 0000000 Q ss_pred cccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccc-cceee Q lcl|Aclame:pro 345 AGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-VPVVT 423 (497) Q Consensus 345 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G-~pvv~ 423 (497) ........|..|+..|.-.....++-.+. .+-......+..+| +.| ..||. T Consensus 531 --------------------g~~~l~i~P~~llvp~~le~~a~~ll~s~--~v~~a~~~~~~~Np------~~~~~~~i~ 582 (652) T protein:vir:79 531 --------------------GERHLNIRPAFVLVPTAMESVANQVIRSS--SVKGADINAGIINP------VKDFATVIA 582 (652) T ss_pred --------------------CCccccccccEEEecchhHHHHHHHhccC--CCcccccccccccc------ccccccccc Confidence 00011223445555555444444432211 11000011111111 222 25555 Q ss_pred cCCCCcC---cEEEeeccc-----eEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEe Q lcl|Aclame:pro 424 TPLIPLG---TILVGHFAP-----SVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQL 490 (497) Q Consensus 424 s~~~~~~---~~~~gd~~~-----~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~ 490 (497) ++.+... ..|+++-.. .+| +--.++..|+.. ..|..|-+.+++...++.+++|--+++|.+- T Consensus 583 eprL~~~s~~~wylaa~~~~dtiev~y-L~G~~~P~ie~~----~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 583 EPRLDDNSQTTFYLAASKGSDTIEVAY-LNGVDTPYIDQM----EGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred ccccCCCCcccEEEecCCCCCeEEEEE-ecCCCCCeeeec----CCCCcceEEEEEEEeccCceeeccceeeecC Confidence 6555321 122322211 011 111233444321 2499999999999999999999999999775 No 168 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.61 E-value=1.1e-05 Score=47.83 Aligned_cols=280 Identities=11% Similarity=0.046 Sum_probs=146.9 Q ss_pred ccCCcccccc---hhhHHHHHHHhhhhHHhhccceec---CCCceEEEEeecCCccce--eeccc-cccccccccceeEE Q lcl|Aclame:pro 157 GTFAPGILPT---FLPGIVEQLFYELSLADLISSRPV---TSPNLSYLTESAAHNNAA--AVAEA-GTYPFSSEEFARVY 227 (497) Q Consensus 157 ~~~g~~i~~~---~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~p~~~~~~~~a~--~v~Eg-~~~~~s~~~~~~v~ 227 (497) -++..++..+ +.+.|.+...+....++++++.+. .-.++.+..... .+.+. |++-+ ..+|..+..+++-. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~-~G~a~~~~i~~~a~dip~vd~~~~~~~ 79 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADE-HGSLDDGLITVGTSTLDQVEVGFTPTR 79 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeec-cCcccccccCCcCCccceeecccceeE Confidence 1122233332 123344444444455555555432 223566665544 35566 88765 67899999999999 Q ss_pred eeeeeeeeechhhHHHHhhH-H---HHHHHHHHHHHHHHHHHHHhhhhccCCC-ccccceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 228 EQVGKVANALTITDEGLRDA-P---ELFNFVQGRLLEGIQRKEEVQLLAGGGY-PGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 228 ~~~~kia~~~~iS~ell~ds-~---~l~~~i~~~la~~~~~~~d~a~l~G~g~-~~~~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) ...+.++.-+.+|-+=|+.+ . ++..-=.....+++...+|+..++|+-. .+..|++|.++.......... T Consensus 80 ~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~----- 154 (304) T protein:vir:52 80 SYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAA----- 154 (304) T ss_pred EEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCc----- Confidence 99999999888886444433 2 3666666667778889999999999753 357899998766432211000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc--cCCceEEEeh Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF--QTPNAVVMNP 380 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~n~ 380 (497) .+......+...+++++..++.++..... ..++.++|.| T Consensus 155 ---------------------------------------a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp 195 (304) T protein:vir:52 155 ---------------------------------------QNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDS 195 (304) T ss_pred ---------------------------------------cCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCH Confidence 00011222344566666666666554333 4577788888 Q ss_pred hHHHHHHHHh-cccCcccccccccccccccccccc--cccccceeecCCCCcC--cEEEeeccceEEEEEeccccEEEEe Q lcl|Aclame:pro 381 RDWELLRLTK-DANGQYMGGNFFGNAYGNPVNGGK--NIWGVPVVTTPLIPLG--TILVGHFAPSVIQTARREGVTMQMT 455 (497) Q Consensus 381 ~~~~~l~~lk-d~~G~~~~~~~~~~~~~~~~~~~~--~l~G~pvv~s~~~~~~--~~~~gd~~~~~~~i~~r~~~~i~~~ 455 (497) ..+..|.... +..|.-++.-.... ++....+ .|-++|--....-..| ..++.+-+.-++.+. ..+.+.+. T Consensus 196 ~~~~~l~~~~~~~~~~Tvl~~l~~n---~~~~~g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~--vP~p~~~l 270 (304) T protein:vir:52 196 LDLAHLALVQRANTDTTALEFLTKH---LSAAAGRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFD--VPMSPTVL 270 (304) T ss_pred HHHHHHhhccCCCCCchHHHHHHHh---cccccCCcceEEEecccccccCCCCceEEEEEecChhheEEe--cCcccccc Confidence 8888886432 22222222111000 0000000 1222221111111111 134444444333321 22222222 Q ss_pred ccchhhhhcCceE--EEEEeeecc-EeecccceEEEEe Q lcl|Aclame:pro 456 NSNGTDFVDGKVT--VRAEERLGL-LVYRPSAFQLIQL 490 (497) Q Consensus 456 ~~~~~~f~~~~v~--~r~~~r~~~-~v~~~~Af~~~~~ 490 (497) . ...+|... +=++.|+++ .+++|.+|++++. T Consensus 271 ~----~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 271 D----AQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred c----hhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 2 13355443 346777754 5667999999999 No 169 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.35 E-value=3.8e-05 Score=44.79 Aligned_cols=285 Identities=10% Similarity=0.016 Sum_probs=129.1 Q ss_pred hhhhhhhhhc---ccccCCcccccchhhHHHHHHHhh-hhHHhhccceecCCCceEEEEeecCCccceeeccc------- Q lcl|Aclame:pro 145 PAAIGQNPFG---STGTFAPGILPTFLPGIVEQLFYE-LSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEA------- 213 (497) Q Consensus 145 ~~~~~~~~~~---~~~~~g~~i~~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg------- 213 (497) +.+ ....++ -+.+-...-+.+|...+.-.+.+. +.|++-++...-.+++..+-.... ..+.-++++ T Consensus 1 ~~~-~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 77 (322) T protein:vir:10 1 MKL-NAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLAS--MDPDAVKRKRSRQQSA 77 (322) T ss_pred Ccc-cceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeeccc--cccccccccccccccc Confidence 000 000000 000111111234444444444443 345555543333333211111111 112222222 Q ss_pred --c-ccccccccce--eEEeeeeeeeeechhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHhhhhccC-CCccccceeccc Q lcl|Aclame:pro 214 --G-TYPFSSEEFA--RVYEQVGKVANALTITDEG-LRDAPELFNFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRS 286 (497) Q Consensus 214 --~-~~~~s~~~~~--~v~~~~~kia~~~~iS~el-l~ds~~l~~~i~~~la~~~~~~~d~a~l~G~-g~~~~~Gil~~~ 286 (497) . ..|.....++ .+.+..+ .....|.+.- ++...+..+...+..+.+++++.|..|+.+- |... .|-..++ T Consensus 78 d~~~dtp~~~~~~~~r~~~~~d~--~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~~gt~ 154 (322) T protein:vir:10 78 DGTYPTPVNNKPFAKRRTNVDTY--DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKGTGQP 154 (322) T ss_pred CcccCCCccccccceEEEeeccc--ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccccccc Confidence 1 2333333333 4444444 3345666543 3445677788888999999999999887632 1110 0000000 Q ss_pred cccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhh Q lcl|Aclame:pro 287 TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ 366 (497) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (497) ....... ...........+.++.+...+. T Consensus 155 ---v~~~ss~------------------------------------------------~i~~g~~g~t~~kl~~a~~~l~ 183 (322) T protein:vir:10 155 ---VEFLATQ------------------------------------------------EIGDGTKPISFDYVTEITERFL 183 (322) T ss_pred ---cccCCCc------------------------------------------------ccccCccchhHHHHHHHHHHHH Confidence 0000000 0000000111222333333222 Q ss_pred hhhccC--CceEEEehhHHHHHHHHhcc-cCcccccccccccccccccccccccccceeecCCCCcC------------- Q lcl|Aclame:pro 367 LTLFQT--PNAVVMNPRDWELLRLTKDA-NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------------- 430 (497) Q Consensus 367 ~~~~~~--~~~~~~n~~~~~~l~~lkd~-~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------------- 430 (497) ...-.. .-.++++|..|..|.....- +-.|...... . ..+--.+++|+.|+.++.+|.. T Consensus 184 ~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l-~----~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~ 258 (322) T protein:vir:10 184 ENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDL-Q----SKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGP 258 (322) T ss_pred hcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhh-h----hcCeeeeeeeEEEEEeccCCccccccccccccCCC Confidence 222221 12466789888887543321 2222211111 0 0122347899999999999721 Q ss_pred ---cEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 431 ---TILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 431 ---~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) ...+.-|.+.++.+.....++.+++...+. .+...+++.+-+|..+++|+.++.+..+.+- T Consensus 259 ~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~---~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 259 QGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA---SFAWRIYSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred CccceeEEEEecCceeEEEeeeeeEEeeccCCc---chhhhhhhhhhhCceEeccCcEEEEEEeccC Confidence 111223334445555555566666543331 2345577888899999999999999998777 No 170 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.29 E-value=0.0001 Score=42.45 Aligned_cols=421 Identities=13% Similarity=0.073 Sum_probs=153.1 Q ss_pred CchHHHHHHHHHHHHHHH-H--------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSI-K--------------------DINADETKTAAEKKEALAKIEPDFKAHQAEVEAH--ERAQ 57 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~-~--------------------~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~--e~~~ 57 (497) ||..-+.....+.-.-.. . .......+..++.++....+...+..+....... +... T Consensus 220 ~p~~l~~~~~~~~~~p~~~~~~PaPTPaaaaPaaP~aaap~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~l~a~~l~ 299 (693) T protein:vir:95 220 MPEALKTLLAPRAQTPAAPANTPAPTPASAAPAAPVAAAPTEADIRARILAEESGRRSAITAAFGAFSTGHAELLATCLN 299 (693) T ss_pred hHHHHHHHHhhhcccccccccCcccCccCCCCCCCccCCCCcchhhHHHHHHHHHHHHHHHHHHHhccCChHHHHHHHHh Confidence 433211110000000000 0 0000000001111111111111111111000000 0000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHH-HH--HHHHHhhhhhhHHHhhhhhhhhhhh---hhhhhhhhhhhhhHHHHH Q lcl|Aclame:pro 58 EMLKSLGGADAAKDGLDNDIPEVEVRNLK-QI--RKHLARAVIMNPELKNATSFEKGTK---FDVSFNVSAKAADPGTAA 131 (497) Q Consensus 58 e~~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~ 131 (497) +....++++++.+ -+.+......... .. .-.............+......+.. ...... .....+..+.. T Consensus 300 d~~~s~d~ar~~l---L~~l~~~~~p~~~~~~~~~~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~-g~~L~elAr~~ 375 (693) T protein:vir:95 300 DMNITVDQAREKL---LAAIGADTQPAAALSAGAHIHAGNGNLVGDSVRASVLARIGRGERQADNAYN-GMTLRELARAS 375 (693) T ss_pred hcCCCHHHHHHHH---HHHHhhccCCCCCcCcCccccCCchhHHHHHHHHHHHHhcCcccccCCcccc-CCcHHHHHHHH Confidence 0001111111111 1111000000000 00 0000000000000000000000000 000000 00011112222 Q ss_pred HHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHh-hhhHHhhccceecCC-CceEEEEeecCCcccee Q lcl|Aclame:pro 132 AELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY-ELSLADLISSRPVTS-PNLSYLTESAAHNNAAA 209 (497) Q Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~-~~~l~~~~~~~~~~~-~~~~~p~~~~~~~~a~~ 209 (497) ...++..................++++++..+-...-..+...... +......+...+++- ...+..+. +.-+...- T Consensus 376 L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~l-g~~~~L~~ 454 (693) T protein:vir:95 376 LVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGL-GEFSSLRQ 454 (693) T ss_pred HHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeec-CCCCChhh Confidence 2222222222222233333334566666554433333334333332 234555566555543 22233333 22355667 Q ss_pred eccccccccccccceeEEeeeeeeeeechhhHHHH-hhHHHHHHHHHHHHHHHHHHHHHhh---hhccCCCc-cccceec Q lcl|Aclame:pro 210 VAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL-RDAPELFNFVQGRLLEGIQRKEEVQ---LLAGGGYP-GVNGLLQ 284 (497) Q Consensus 210 v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell-~ds~~l~~~i~~~la~~~~~~~d~a---~l~G~g~~-~~~Gil~ 284 (497) |.|++++.-.+..=..-++...+++.++.||++++ +|--++..-|-..+.++.++.+++. +|.++..- ..+.++. T Consensus 455 V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFh 534 (693) T protein:vir:95 455 VREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFH 534 (693) T ss_pred cCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceee Confidence 88998887666555566888999999999999976 6766666778888999998888864 33332210 0111221 Q ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhh Q lcl|Aclame:pro 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD 364 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (497) ..-....+.+ ......+.+...+.++.. T Consensus 535 adH~Nl~tga----------------------------------------------------~sals~~sl~~a~~am~~ 562 (693) T protein:vir:95 535 ADHSNLLTGA----------------------------------------------------ASALSIDSLSKAKTQMAT 562 (693) T ss_pred cccccccccc----------------------------------------------------ccccChHHHHHHHHHHHH Confidence 1000000000 000000111111111111 Q ss_pred hh--------hhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccc-cceeecCCCCc--CcE- Q lcl|Aclame:pro 365 IQ--------LTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWG-VPVVTTPLIPL--GTI- 432 (497) Q Consensus 365 ~~--------~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G-~pvv~s~~~~~--~~~- 432 (497) -. ......|..|+..+.-....+.+-.+. ++-......+..+| +.| ..||.++.+.. ++. T Consensus 563 qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~--~~~~a~~~~~~~NP------~~~~~~vi~~prL~~~s~~~W 634 (693) T protein:vir:95 563 QKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSE--SVPGADVNSGIVNP------IRAFAQVIGEPRLDDASATAW 634 (693) T ss_pred hhcchhccCCceeecccceEEecchHHHHHHHHhccc--cccccccccccccc------hhccccccccceecCCCCCce Confidence 00 122334556666666555555543332 11100111111122 223 24555555532 222 Q ss_pred -EEeeccc----eEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCC Q lcl|Aclame:pro 433 -LVGHFAP----SVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 433 -~~gd~~~----~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a 494 (497) ++.|-.. .+| +--.+...|+.. ..|..|-+.+++...++.+++|--+++| ...| T Consensus 635 yl~a~~~~dtie~~y-L~G~~~P~ie~~----~gf~~dG~~~kvr~D~G~~~iD~Rg~~k---n~GA 693 (693) T protein:vir:95 635 YMAAKKGSDTIEVAY-LDGVDTPYLEQQ----EGFTVDGVASKVRIDAGVAPLDFRGLQK---SNGA 693 (693) T ss_pred EEecCCCCCeEEEEE-ecCCCCCeEeec----CCCCcceEEEEEEEeccCceeecccccc---CCCC Confidence 2222211 111 112233444322 2499999999999999999999888887 3344 No 171 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.27 E-value=4.8e-05 Score=44.25 Aligned_cols=310 Identities=14% Similarity=0.057 Sum_probs=150.4 Q ss_pred hHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhc-ccccCCcccc---cchh-hHHHH Q lcl|Aclame:pro 99 NPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG-STGTFAPGIL---PTFL-PGIVE 173 (497) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~i~---~~~~-~~ii~ 173 (497) -.+........+ ....+.. .. ................... .++.....+| .+++ +.+++ T Consensus 1 ~~~~~~~~~l~~---~gi~~~~--~~-----------~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~ 64 (336) T protein:vir:10 1 MRDAQRIQNLAR---AGVILPR--SV-----------QNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVID 64 (336) T ss_pred CchHHHHHHHhh---cCeeecc--hh-----------hhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceee Confidence 000000000000 0000000 00 0000000000000001111 1111111222 2344 56677 Q ss_pred HHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh-HHHHhhH-- Q lcl|Aclame:pro 174 QLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT-DEGLRDA-- 247 (497) Q Consensus 174 ~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS-~ell~ds-- 247 (497) .+........++++.+++.- .+.+++... .+.+.+.+-+...|.++......+-..+.++..+.++ .|+-+-+ T Consensus 65 ~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~ 143 (336) T protein:vir:10 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAG 143 (336) T ss_pred ehhhhhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHh Confidence 77777777788888776542 345566544 4678888988889999988888888899999999999 4554433 Q ss_pred -HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhh Q lcl|Aclame:pro 248 -PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVA 326 (497) Q Consensus 248 -~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (497) .++..--+...++++...+|.-.++|++..+..|++|.+.......... T Consensus 144 g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t------------------------------ 193 (336) T protein:vir:10 144 RVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT------------------------------ 193 (336) T ss_pred CCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCC------------------------------ Confidence 2577788888888999999999999998888889999776542111100 Q ss_pred hhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-----cCCceEEEehhHHHHHHHHhcccCccccccc Q lcl|Aclame:pro 327 SLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~ 401 (497) ......+...+++++..++.++..... ..+..++|.|..+..|.. ++..|.-++... T Consensus 194 -----------------~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~l 255 (336) T protein:vir:10 194 -----------------PWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKL 255 (336) T ss_pred -----------------CcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHHH Confidence 000111223445555555555555332 236677777776666642 233232222111 Q ss_pred ccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEec----cccEEEEe----ccchhhhhcCceEEEEEe Q lcl|Aclame:pro 402 FGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR----EGVTMQMT----NSNGTDFVDGKVTVRAEE 473 (497) Q Consensus 402 ~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r----~~~~i~~~----~~~~~~f~~~~v~~r~~~ 473 (497) .. ..-++.++..+.+... -|+. ++.+... ..+.+.+. .... ....-.+.+-+.. T Consensus 256 k~-----------n~Pnl~i~t~pEl~~a---~G~~---~~l~~~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~~ 317 (336) T protein:vir:10 256 KD-----------IFPKLEFVTIPEYDTA---SGRL---VQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSA 317 (336) T ss_pred HH-----------hcCccEEEEccccccC---CCce---EEEEEEecCCCcceeeecchhhhccce-eecCceeEecccc Confidence 00 1112334443333211 1211 1222111 11121111 0000 0011234556777 Q ss_pred eeccE-eecccceEEEEec Q lcl|Aclame:pro 474 RLGLL-VYRPSAFQLIQLK 491 (497) Q Consensus 474 r~~~~-v~~~~Af~~~~~~ 491 (497) |.+|. +++|.||++++=- T Consensus 318 rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 318 GTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ceeeeeeeccchheeeecC Confidence 77554 5579999997622 No 172 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.11 E-value=6.8e-05 Score=43.42 Aligned_cols=309 Identities=14% Similarity=0.056 Sum_probs=149.8 Q ss_pred hHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhh--cccccCCcccc---cchh-hHHH Q lcl|Aclame:pro 99 NPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPF--GSTGTFAPGIL---PTFL-PGIV 172 (497) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~i~---~~~~-~~ii 172 (497) -.+........+ ....+ .+ .................... -++++.++ +| ..++ +.++ T Consensus 1 ~~~~~~~~~l~~---~gi~~--------~~-----~~~~~~~~~~~~~~da~d~~~~~~~~~~~~-~~~~l~~~i~p~~~ 63 (336) T protein:vir:36 1 MRDAQRIQNLAR---AGVIL--------PR-----SVQNVSTPLTEYAMDAADLSPHLSSTGSSG-IPNYLTTYVDPSVI 63 (336) T ss_pred CchHHHHHHHhh---cCeee--------cc-----hhhhhhhHHHHhhhhhhhccCccccCCCcc-hHHHHHHhhccceE Confidence 000000000000 00000 00 00000000000000000111 11111111 22 2233 4566 Q ss_pred HHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh-HHHHhhH- Q lcl|Aclame:pro 173 EQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT-DEGLRDA- 247 (497) Q Consensus 173 ~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS-~ell~ds- 247 (497) +.+........++++.+++.- .+.+++... .+.+.+.+-+...|.++......+-..+.++..+.++ .|+.+-+ T Consensus 64 ~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~ 142 (336) T protein:vir:36 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) T ss_pred eeecchhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHH Confidence 666777777788887776542 345566544 4678888988889999988888888899999999998 5665543 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhh Q lcl|Aclame:pro 248 --PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTV 325 (497) Q Consensus 248 --~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (497) .++..--+...++++...+|.-.++|++..+..|++|.+.......... T Consensus 143 ~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t----------------------------- 193 (336) T protein:vir:36 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT----------------------------- 193 (336) T ss_pred hCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCC----------------------------- Confidence 2577777888888999999999999998888889999765532111100 Q ss_pred hhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-----cCCceEEEehhHHHHHHHHhcccCcccccc Q lcl|Aclame:pro 326 ASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~ 400 (497) ......+...+.+++..++.++..... ..+..++|.|..+..|.. ++..|.-++.. T Consensus 194 ------------------~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~ 254 (336) T protein:vir:36 194 ------------------PWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK 254 (336) T ss_pred ------------------CcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHH Confidence 000111223455666666665555443 236677777776666642 23333222211 Q ss_pred cccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEec----cccEEEEe----ccchhhhhcCceEEEEE Q lcl|Aclame:pro 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR----EGVTMQMT----NSNGTDFVDGKVTVRAE 472 (497) Q Consensus 401 ~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r----~~~~i~~~----~~~~~~f~~~~v~~r~~ 472 (497) ... ..-++.++..+.+... -|+- ++.+... ..+.+.+. .... ....-.+.+-+. T Consensus 255 lk~-----------n~Pnl~i~t~pEl~~a---~g~~---~~l~~~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~ 316 (336) T protein:vir:36 255 LKD-----------IFPKLEFVTIPEYDTA---SGRL---VQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKS 316 (336) T ss_pred HHH-----------hcCccEEEEccccccC---CCce---EEEEEEecCCCcceeeecchhhhccce-eecCceeEeccc Confidence 100 1112334443333211 1211 1222111 11121111 0000 001123455677 Q ss_pred eeeccE-eecccceEEEEec Q lcl|Aclame:pro 473 ERLGLL-VYRPSAFQLIQLK 491 (497) Q Consensus 473 ~r~~~~-v~~~~Af~~~~~~ 491 (497) .|.+|. +++|.||++++=- T Consensus 317 ~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 317 AGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cceeeeeeeccchheeeecC Confidence 777554 5579999997622 No 173 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=96.88 E-value=3.6e-05 Score=44.97 Aligned_cols=341 Identities=14% Similarity=0.067 Sum_probs=151.9 Q ss_pred HHHHHHHhHHHHHHHHHHHH--hhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhh--- Q lcl|Aclame:pro 74 DNDIPEVEVRNLKQIRKHLA--RAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAI--- 148 (497) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 148 (497) ..++.........+....+. .......+.+..+. .+- .+ .+..... +......+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~gi----~~--------~~~~~~~----~~~~~~amd~~~~~ 62 (379) T protein:vir:10 1 MPQISKIHSSLNARQMTQMVMDSADVTLDNLKHLES--YGI----HL--------NGRKNKL----FELMQFAMDSNDIG 62 (379) T ss_pred CCCcceeeeecCccccchhhhccccccHHHHHHHHh--cCc----cc--------cchhhhh----hhhhhhhhcccccc Confidence 00000000000000000000 00000000000000 000 00 0000000 00000000000 Q ss_pred ---hhhhhcccccCCcc--cccchhhHHHHHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 149 ---GQNPFGSTGTFAPG--ILPTFLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 149 ---~~~~~~~~~~~g~~--i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ......++.+..+. .-..+++.+|+.+.....+.+++++.+.+.- .+.+++... .+.|.+++-+...|..+ T Consensus 63 ~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~-~G~A~~ygd~~d~pl~d 141 (379) T protein:vir:10 63 PIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEG-LGTAQPYTDGGNMALMS 141 (379) T ss_pred ccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeee-eeeeEEeccccCCCeee Confidence 00000011111111 1134567888888888888888888776543 455566554 47788889888889888 Q ss_pred ccceeEEeeeeeeeeechhhH-HHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCC--ccccceeccccccccchh Q lcl|Aclame:pro 221 EEFARVYEQVGKVANALTITD-EGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGY--PGVNGLLQRSTGFTASSA 294 (497) Q Consensus 221 ~~~~~v~~~~~kia~~~~iS~-ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~--~~~~Gil~~~~~~~~~~~ 294 (497) ...+..+-..+.++..+.++. |+.+-+ .++..--....++++...+|+-.++|.+. .+..|++|.+........ T Consensus 142 ~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~ 221 (379) T protein:vir:10 142 WTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAV 221 (379) T ss_pred eeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccc Confidence 777777777777887777765 443332 25788888888999999999999999543 356699997765321110 Q ss_pred hhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhcc--- Q lcl|Aclame:pro 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ--- 371 (497) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 371 (497) ..+..........+...+++++..++.++...... T Consensus 222 ------------------------------------------atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~ 259 (379) T protein:vir:10 222 ------------------------------------------PNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIK 259 (379) T ss_pred ------------------------------------------cCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeec Confidence 00111111222334455666677666665543322 Q ss_pred ---CCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEec- Q lcl|Aclame:pro 372 ---TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR- 447 (497) Q Consensus 372 ---~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r- 447 (497) .+..++|.|..+..|..- +..|.-++.... ....++.++..+.+... -|. ....|.+.+. T Consensus 260 ~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~lk-----------~n~Pnl~i~t~pEL~~a---ggg-~~~~~~~~~~~ 323 (379) T protein:vir:10 260 SNKTPITIGIPNAYENYITTP-TELGYSVAQYMR-----------ESYPNVTFVSAPELNDA---NGG-SSAIYYYADAV 323 (379) T ss_pred ccccceeEEecHHHHHhhccc-cccCccHHHHHH-----------HhcCCcEEEEccccccc---CCC-ccEEEEEeecc Confidence 122577777777766532 222322221110 01123445554444210 011 1112333322 Q ss_pred cccEEE-------Eeccch----hhhhcCceEEEEEeee-ccEeecccceEEEEecC Q lcl|Aclame:pro 448 EGVTMQ-------MTNSNG----TDFVDGKVTVRAEERL-GLLVYRPSAFQLIQLKK 492 (497) Q Consensus 448 ~~~~i~-------~~~~~~----~~f~~~~v~~r~~~r~-~~~v~~~~Af~~~~~~~ 492 (497) .+.... .-++.. -....-.+..-+..|. |..|++|.||++++ .+ T Consensus 324 ~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~-G~ 379 (379) T protein:vir:10 324 ENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQT-GA 379 (379) T ss_pred CCCccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhhheec-CC Confidence 111110 001100 0000112234555666 55566799999977 22 No 174 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.56 E-value=0.00048 Score=38.76 Aligned_cols=308 Identities=13% Similarity=0.076 Sum_probs=151.9 Q ss_pred hHHHhhhhhhh-hhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhc--ccccCCcccc---cchh-hHH Q lcl|Aclame:pro 99 NPELKNATSFE-KGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG--STGTFAPGIL---PTFL-PGI 171 (497) Q Consensus 99 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~i~---~~~~-~~i 171 (497) -.+........ .+-.+. . .. .....+. ............ ++++.++ +| ..++ +.+ T Consensus 1 ~~~~~~~~~l~~~gi~~~---~---~~---~~~~~~~--------~~~a~da~d~~~~~~t~~~~g-~~~~l~~~i~p~~ 62 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILP---R---SV---KNVSTPL--------AEYAMDAADLSPHLSSTGSSG-IPNYLTTYVDPSV 62 (336) T ss_pred CchHHHHHHHhccCeecc---h---hh---hhhhHHH--------HHHHHhhhhhccccccCCCcc-hHHHHHHhcccce Confidence 00000000000 000000 0 00 0000000 011111111111 1111111 22 2344 567 Q ss_pred HHHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhH-HHHhhH Q lcl|Aclame:pro 172 VEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITD-EGLRDA 247 (497) Q Consensus 172 i~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~-ell~ds 247 (497) ++.+........++++.+++.- .+.|++... .+.+.+.+-+...|..+...+..+-..+.++..+.++. |+-+-+ T Consensus 63 ~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~ 141 (336) T protein:vir:78 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAG 141 (336) T ss_pred eeehhhhhhhhhhcccccCCCccccEEEEeeeec-ceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHH Confidence 7777777777888888776432 456766555 47788999888999999999999999999999999995 443332 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhh Q lcl|Aclame:pro 248 ---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDT 324 (497) Q Consensus 248 ---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (497) .++..--+...++++...+|.-.++|+...+..|++|.+.......... T Consensus 142 ~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~---------------------------- 193 (336) T protein:vir:78 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT---------------------------- 193 (336) T ss_pred HhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCc---------------------------- Confidence 2577777788888899999999999998888999999765542211100 Q ss_pred hhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-----cCCceEEEehhHHHHHHHHhcccCccccc Q lcl|Aclame:pro 325 VASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~~~~ 399 (497) ......+...+++++..++.++..... -.+..++|.|..+..|.. .+..|--++. T Consensus 194 -------------------~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~ 253 (336) T protein:vir:78 194 -------------------PWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA 253 (336) T ss_pred -------------------CcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHH Confidence 000111223455555555555544432 124457777777766643 2322322211 Q ss_pred ccccccccccccccccccccceeecCCCC-cCcEEEeeccceEEEEEec----cccEEEEec---cchhhhhcCceEEEE Q lcl|Aclame:pro 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARR----EGVTMQMTN---SNGTDFVDGKVTVRA 471 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~pvv~s~~~~-~~~~~~gd~~~~~~~i~~r----~~~~i~~~~---~~~~~f~~~~v~~r~ 471 (497) ... . ..-++.++..+.+. +| |+- .+.+... ..+++.+.. ...-....-.+..-+ T Consensus 254 ~lk--------~---n~Pnl~i~t~pel~~Ag----g~~---~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~~~~~v~~ 315 (336) T protein:vir:78 254 KLK--------E---IFPKLEFVTIPEYDTAS----GRL---VQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKK 315 (336) T ss_pred HHH--------H---hcCccEEEEcccccccC----cce---EEEEEeeccCCcceeeecchhhhccceeecCceeEecc Confidence 100 0 01123344444332 11 111 1111111 112221110 000001122444566 Q ss_pred EeeeccE-eecccceEEEEec Q lcl|Aclame:pro 472 EERLGLL-VYRPSAFQLIQLK 491 (497) Q Consensus 472 ~~r~~~~-v~~~~Af~~~~~~ 491 (497) ..|.+|. +++|.||++++=- T Consensus 316 ~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 316 SAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccceeeeeeeccchheeeccC Confidence 7777555 5579999997622 No 175 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=96.49 E-value=0.00057 Score=38.35 Aligned_cols=312 Identities=13% Similarity=0.074 Sum_probs=147.4 Q ss_pred hhhhHHHhhhhhh-hhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhh-cccccCCcccc---cchh-h Q lcl|Aclame:pro 96 VIMNPELKNATSF-EKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPF-GSTGTFAPGIL---PTFL-P 169 (497) Q Consensus 96 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~i~---~~~~-~ 169 (497) .....+....... ..+..+.... ... ................ ..++.....|| .+++ + T Consensus 1 ~~~~~~~~~~~~l~~~g~~~~~~~---~~~-------------~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~ 64 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGIIFDGYS---PKS-------------ISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDR 64 (339) T ss_pred CceechHHHHHHHHhhceeeccch---hhh-------------cchhhHhhhccccccccccccccccchhhhhhhhhch Confidence 0000000000000 0000000000 000 0000000000010000 01111111232 2333 5 Q ss_pred HHHHHHHhhhhHHhhccceecCC---CceEEEEeecCCccceeecccccccccc--ccceeEEeeeeeeeeechhhHHHH Q lcl|Aclame:pro 170 GIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSS--EEFARVYEQVGKVANALTITDEGL 244 (497) Q Consensus 170 ~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~~a~~v~Eg~~~~~s~--~~~~~v~~~~~kia~~~~iS~ell 244 (497) .+++...+....+.++++.+.+. .++.|+.... .+.|.|++.+...|..+ .+|.+.++....++-... ..|+- T Consensus 65 ~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~-~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~-~~E~~ 142 (339) T protein:vir:94 65 RVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEP-VGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYG-DLEMA 142 (339) T ss_pred hheeecccccchhhhcccccCCCCcccEEEEeeeec-ccceEEcccccCCCcccccceeeEEeEEEEEEEEeec-HHHHH Confidence 56677777778888888877754 3578887766 47889999998888776 456555555555444333 33443 Q ss_pred hhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhh Q lcl|Aclame:pro 245 RDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVG 321 (497) Q Consensus 245 ~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (497) +-+ .++.+--....++++...+|+..++|+...+..|++|.+.......+. T Consensus 143 ~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s-------------------------- 196 (339) T protein:vir:94 143 TYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAAT-------------------------- 196 (339) T ss_pred HHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCC-------------------------- Confidence 332 257777788888899999999999998777889999987654322110 Q ss_pred hhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-----cCCceEEEehhHHHHHHHHhcccCcc Q lcl|Aclame:pro 322 QDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQY 396 (497) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~ 396 (497) ......+...+++++..++.++..... ..+..++|.|..+..|.. .+..|.- T Consensus 197 ----------------------~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~-~n~~~~T 253 (339) T protein:vir:94 197 ----------------------VNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNR-TNNFGLS 253 (339) T ss_pred ----------------------CCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhccc-CCcCCcc Confidence 011122344556666666666654432 124467788877776653 2333332 Q ss_pred cccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEE----eccccEEEEecc---chhhhhcCceEE Q lcl|Aclame:pro 397 MGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTA----RREGVTMQMTNS---NGTDFVDGKVTV 469 (497) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~----~r~~~~i~~~~~---~~~~f~~~~v~~ 469 (497) ++.-... ...++.++..+.+.... |+- .+.+. +...+.+.+.-. ..-....-.+.+ T Consensus 254 vl~~lk~-----------n~pnl~i~~~~el~~a~---g~~---~~~~~~~~~~~~~~~~~~p~~~~~lpvq~~~~~~~v 316 (339) T protein:vir:94 254 AGAKIAQ-----------TYPNIQFVAVPEFDTAS---GRL---VQLWVPEVNGQPTGEVAFAEKLRSHSIERYSTTTRQ 316 (339) T ss_pred HHHHHHH-----------hcCCcEEEEccccccCC---Cce---EEEEEEeccCCcceEEEcchhhhccccEEcCceEEe Confidence 3221110 11234455444432110 111 11111 111122211100 000001123445 Q ss_pred EEEeee-ccEeecccceEEEEec Q lcl|Aclame:pro 470 RAEERL-GLLVYRPSAFQLIQLK 491 (497) Q Consensus 470 r~~~r~-~~~v~~~~Af~~~~~~ 491 (497) -+..|. |..|++|.||++++=- T Consensus 317 ~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 317 KHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred cceeeeeeEEEEccceeeeeecC Confidence 677785 5566689999997622 No 176 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=96.28 E-value=0.00079 Score=37.58 Aligned_cols=268 Identities=15% Similarity=0.095 Sum_probs=109.7 Q ss_pred hhhcccccCCcccccc-hhhHHHHHHHhhhhHHhhccce---ec---CCCceEEEEeecCCccceee-----cccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT-FLPGIVEQLFYELSLADLISSR---PV---TSPNLSYLTESAAHNNAAAV-----AEAGTYPF 218 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~-~~~~ii~~~~~~~~l~~~~~~~---~~---~~~~~~~p~~~~~~~~a~~v-----~Eg~~~~~ 218 (497) |+ -.++.|+ |...+++.+++.+.+.++++.- .. .+.++++|+... ..+.+. +++..+.. T Consensus 1 Ma-------~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~--~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:99 1 MA-------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTV 71 (392) T ss_pred Cc-------cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc--ccceeeeccccccCCcccc Confidence 11 1234454 6667888888888887777532 22 255688886543 233332 33444555 Q ss_pred ccccceeEEeee--eeeeeechhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhh Q lcl|Aclame:pro 219 SSEEFARVYEQV--GKVANALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 219 s~~~~~~v~~~~--~kia~~~~iS~-ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~ 295 (497) .+.+-+.+++.. +|..+ +.|+. |...+..++...+.+...++++.++|..++.-- .+.+.+....... . T Consensus 72 ~~~~~~~~~~~id~~k~~~-~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~-~~a~~~~~~~~~~--~---- 143 (392) T protein:vir:99 72 SDFTEDSFPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEAAGAVHE--V---- 143 (392) T ss_pred cccccceEEEEEeeeeecc-eeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-hcccccccccccc--c---- Confidence 555556666655 44433 34554 455566677777777889999999998765310 0000000000000 0 Q ss_pred hhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce Q lcl|Aclame:pro 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA 375 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (497) ......+.+..+...+....-..... T Consensus 144 ------------------------------------------------------~~~~~~~~i~~a~~~L~~~~vP~~R~ 169 (392) T protein:vir:99 144 ------------------------------------------------------APDEFFKGVNGARRALNELYIPQGRV 169 (392) T ss_pred ------------------------------------------------------ChhhhHHHHHHHHHHHhhcCCCCCCE Confidence 00001111222222111111111235 Q ss_pred EEEehhHHHHHHHHhcccCcccccccccccc-c-ccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccE-- Q lcl|Aclame:pro 376 VVMNPRDWELLRLTKDANGQYMGGNFFGNAY-G-NPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVT-- 451 (497) Q Consensus 376 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~-~-~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~-- 451 (497) +++.|..+..|. ++.. +......+... . .-.+.-..+.|++|+.++++|.+..+.+..+. ..+..+.... T Consensus 170 ~vv~p~~~~~l~--~~~~--~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a--~~~at~a~v~~~ 243 (392) T protein:vir:99 170 LVVGTAVTEQIL--NDDR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA--FIMATRAPAPPM 243 (392) T ss_pred EEEcHHHHHHHh--cccc--eeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccc--cccccccccccc Confidence 777888777765 3321 11000000000 0 00122247899999999999987655433221 1111111111 Q ss_pred ---------------EEEeccchhhhhcCceEEEEEeeeccEeec---ccceEE---EEecC---------CCCCC Q lcl|Aclame:pro 452 ---------------MQMTNSNGTDFVDGKVTVRAEERLGLLVYR---PSAFQL---IQLKK---------GATGS 497 (497) Q Consensus 452 ---------------i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~---~~Af~~---~~~~~---------~a~~~ 497 (497) ..+.......+..+...+-.. .+..... ..+|.. ++... .+..+ T Consensus 244 ~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~--~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~ 317 (392) T protein:vir:99 244 GAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTY--FGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) T ss_pred cccceeEEecccceecceeecccceeecccccccee--EEEEEEeeccccceeeeeeeeeecceeeeeeeecccce Confidence 000000000011111111100 0000000 001100 00000 00000 No 177 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=96.21 E-value=0.00034 Score=39.63 Aligned_cols=343 Identities=12% Similarity=0.034 Sum_probs=135.6 Q ss_pred HHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHH-HHHHHHHHHHhhh-hhhhhhhhh Q lcl|Aclame:pro 74 DNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGT-AAAELMGAFADGE-TAPAAIGQN 151 (497) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~ 151 (497) ..++.........+..+.++......... ....+ ..-...+. .....+....... ....+.-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~l~~--~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~ 66 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAV------------AALGR--IGLVFDHAVVQDQIKALAKAGAFRSGSAMDSN 66 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHH------------HHHhc--cccccCcccchhHhhhhhhhhhhhhhcccccc Confidence 00000000000000000000000000000 00000 00000000 0000000000000 000001111 Q ss_pred hhcccccCCcccc----cchhhHHHHHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccccccccccccce Q lcl|Aclame:pro 152 PFGSTGTFAPGIL----PTFLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFA 224 (497) Q Consensus 152 ~~~~~~~~g~~i~----~~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~ 224 (497) ..+..+..+.-+| .-+.+.+++-+.+......++++.+.+.- .+.|++... .+.|.+++-+...|..+...+ T Consensus 67 ~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~-~G~A~~ygd~~D~Pl~d~~~~ 145 (382) T protein:vir:96 67 FTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEP-AGTAVEYGDHTNIPLTSWNAN 145 (382) T ss_pred cCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeec-ccceEEeecccCCCccccccc Confidence 0111111122233 23456788888888788888888776432 457776655 477889998888888776655 Q ss_pred eEEeeeeeeeeechhh-HHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCC--c-cccceeccccccccchhhhh Q lcl|Aclame:pro 225 RVYEQVGKVANALTIT-DEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGY--P-GVNGLLQRSTGFTASSASSL 297 (497) Q Consensus 225 ~v~~~~~kia~~~~iS-~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~--~-~~~Gil~~~~~~~~~~~~~~ 297 (497) ..+-..+.++....++ .|+.+-+ .++.+--.....+++...+|+-.++|+-. . +.-|++|.+.......... T Consensus 146 ~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~- 224 (382) T protein:vir:96 146 FERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPS- 224 (382) T ss_pred eeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCC- Confidence 5555556666656664 5665543 24666667777888888899999999633 2 3569998776432111000 Q ss_pred hhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc------c Q lcl|Aclame:pro 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF------Q 371 (497) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~ 371 (497) ......+...+++++..++.++..... . T Consensus 225 ----------------------------------------------~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~ 258 (382) T protein:vir:96 225 ----------------------------------------------QGWATADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) T ss_pred ----------------------------------------------CCcccccHHHHHHHHHHHHHHHHhccCCeeeecc Confidence 001112233445555555555544332 1 Q ss_pred CCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCc-CcEEEeeccceEEEEEecccc Q lcl|Aclame:pro 372 TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-GTILVGHFAPSVIQTARREGV 450 (497) Q Consensus 372 ~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~-~~~~~gd~~~~~~~i~~r~~~ 450 (497) .+..++|.|..+..|.. .++.|--++.... ....++.++..+.+.. +..--|- ....|...+.... T Consensus 259 ~~~~L~LP~~~~~~Ls~-~n~~g~Tvl~~lk-----------~n~Pnl~i~t~peL~~a~~~g~g~-~~~~~~~~~e~~~ 325 (382) T protein:vir:96 259 EKITMALATSKVDYLSV-TTPYGISVSDWIE-----------QTYPKMRIVSAPELSGVQMQGKTP-EDALVLFVEEVDA 325 (382) T ss_pred cceEEeechHHHhhccc-cCccCccHHHHHH-----------HhcCCcEEEEccccccccCCCccc-eeEEEEecchhhh Confidence 12235566655544432 1222211111000 0111233333333210 0000000 0001111111111 Q ss_pred EEEEeccchhhhhcC---------------ceEEEEEee-eccEeecccceEEEEec Q lcl|Aclame:pro 451 TMQMTNSNGTDFVDG---------------KVTVRAEER-LGLLVYRPSAFQLIQLK 491 (497) Q Consensus 451 ~i~~~~~~~~~f~~~---------------~v~~r~~~r-~~~~v~~~~Af~~~~~~ 491 (497) .+..+.+....|.+- .+..-+..| .|..|++|.||++++=- T Consensus 326 ~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 326 SVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred hcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 011111111112110 011112223 46677789999997622 No 178 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=96.14 E-value=0.00069 Score=37.90 Aligned_cols=285 Identities=10% Similarity=0.035 Sum_probs=121.0 Q ss_pred hhhcccccCCcccccchhhHHHHHHHh-hhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEee Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFY-ELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQ 229 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~-~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~ 229 (497) +..+ ...-..+-..+...+..-... +......++..+-+...-+|........--.|++| .+-.++.=...++. T Consensus 1 m~it--~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~~~~~l~~~~~~i~ 75 (302) T protein:vir:10 1 MLIN--KQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---KVVKNLKAYKYVVE 75 (302) T ss_pred Cccc--HHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---eeeccccccceeEE Confidence 0000 000000000011111111111 12344555555544444455544332112245544 44445566667899 Q ss_pred eeeeeeechhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHhhhhc---c-CCCcc--ccceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 230 VGKVANALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLA---G-GGYPG--VNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 230 ~~kia~~~~iS~ell~-ds~~l~~~i~~~la~~~~~~~d~a~l~---G-~g~~~--~~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) .++++..+.||++.+. |.-.+..-+...+.++.++.+|+.++. + .++.- .+.++...-..-... T Consensus 76 ~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~--------- 146 (302) T protein:vir:10 76 NEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDAS--------- 146 (302) T ss_pred eecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccc--------- Confidence 9999999999999875 567777888888999999999875432 2 11110 111111100000000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhh----hhhhccCCceEEE Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI----QLTLFQTPNAVVM 378 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 378 (497) ..+.. ....+. .........+...+.++... -.+....|..++. T Consensus 147 -----------~~N~g------------------~~~~~~---~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiV 194 (302) T protein:vir:10 147 -----------VSNKG------------------TAPLSN---ASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLV 194 (302) T ss_pred -----------ccccc------------------chhhhh---cccccchHHHHHHHHHHHHHhhhcccccccCCCEEEe Confidence 00000 000000 00000001111111111111 1123344666666 Q ss_pred ehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcE--EEeeccceEEE-EEeccccEEEEe Q lcl|Aclame:pro 379 NPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI--LVGHFAPSVIQ-TARREGVTMQMT 455 (497) Q Consensus 379 n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~--~~gd~~~~~~~-i~~r~~~~i~~~ 455 (497) .|.-...-+++-. .+++.. +..+|. ..-+.+|+++.+..++. ++.|.+..-.. +-.++...++.. T Consensus 195 p~~le~~A~~ll~-~~~~~~------g~~Np~-----~g~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~ 262 (302) T protein:vir:10 195 GPALEDVAKMLLT-NPKLAD------NTPNPY-----VGTAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQ 262 (302) T ss_pred cchhHHHHHHHhh-ccccCC------CCccee-----ccceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEec Confidence 6665555554421 122111 111111 11256777888876663 34454442111 123444555432 Q ss_pred ccchhhhhcCceEEEEEeeeccEeecccceE--EEEecCCCCCC Q lcl|Aclame:pro 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQ--LIQLKKGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~--~~~~~~~a~~~ 497 (497) ..|..+.+-++++..++..-+-.-+|. .+.++...++| T Consensus 263 ----~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 263 ----VNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred ----cCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 247788888888887775444333332 22223333333 No 179 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=95.55 E-value=0.0018 Score=35.59 Aligned_cols=308 Identities=13% Similarity=0.073 Sum_probs=148.4 Q ss_pred hHHHhhhhhhh-hhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhc--ccccCCcccc---cchh-hHH Q lcl|Aclame:pro 99 NPELKNATSFE-KGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG--STGTFAPGIL---PTFL-PGI 171 (497) Q Consensus 99 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~i~---~~~~-~~i 171 (497) -.+........ .+-.+. . .. .....+. ............ ++++.++ +| ..++ +++ T Consensus 1 ~~~~~~~~~l~~~gi~~~---~---~~---~~~~~~~--------~~~a~da~d~~~~~~t~~~~g-~~~~l~~~i~p~~ 62 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILP---R---SV---KNVSTPL--------AEYAMDAADLSPHLSSTGSSG-IPNYLTTYVDPSV 62 (336) T ss_pred CchHHHHHHHhccCeecc---h---hh---hhhhHHH--------HHHHHhhhhhccccccCCCcc-hHHHHHhhcCcce Confidence 00000000000 000000 0 00 0000000 011111111111 1111111 22 2333 556 Q ss_pred HHHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHH-HHhhH Q lcl|Aclame:pro 172 VEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDE-GLRDA 247 (497) Q Consensus 172 i~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~e-ll~ds 247 (497) ++.+........++++.+.+.- .+.++.... .+.+.+.+.....|..+...+..+-+.+.++..+.++.+ +-+-+ T Consensus 63 ~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~-~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~ 141 (336) T protein:vir:10 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAG 141 (336) T ss_pred eeeeechhchhhhcccccCCCcceeeEEEEeeee-eeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHH Confidence 6666666667777777665432 345555444 466778888889999998888888889999999999954 43332 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhh Q lcl|Aclame:pro 248 ---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDT 324 (497) Q Consensus 248 ---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (497) .++..--+...++++...+|.-.++|+...+..|++|.+.......... T Consensus 142 ~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~---------------------------- 193 (336) T protein:vir:10 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT---------------------------- 193 (336) T ss_pred HhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCc---------------------------- Confidence 2577777788888899999999999998888999999776542211100 Q ss_pred hhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhc-----cCCceEEEehhHHHHHHHHhcccCccccc Q lcl|Aclame:pro 325 VASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGG 399 (497) Q Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~n~~~~~~l~~lkd~~G~~~~~ 399 (497) ......+...+++++..++.++..... -.+..++|.|..+..|.. .+..|--++. T Consensus 194 -------------------~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~ 253 (336) T protein:vir:10 194 -------------------PWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA 253 (336) T ss_pred -------------------CcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHH Confidence 000112223455666666555544432 124457777777766643 2333322221 Q ss_pred ccccccccccccccccccccceeecCCCC-cCcEEEeeccceEEEEEec----cccEEEEec---cchhhhhcCceEEEE Q lcl|Aclame:pro 400 NFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARR----EGVTMQMTN---SNGTDFVDGKVTVRA 471 (497) Q Consensus 400 ~~~~~~~~~~~~~~~~l~G~pvv~s~~~~-~~~~~~gd~~~~~~~i~~r----~~~~i~~~~---~~~~~f~~~~v~~r~ 471 (497) ... . ..-++.++..+.+. +| |+- .+.+... ..+++.+.. ...-....-.+..-+ T Consensus 254 ~lk--------~---n~Pnl~i~t~pel~~Ag----g~~---~~~~~~~~~~~~t~~~~~P~~f~~lpvq~~~~~~~v~~ 315 (336) T protein:vir:10 254 KLK--------E---IFPKLEFVTIPEYDTAS----GRL---VQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKK 315 (336) T ss_pred HHH--------H---hCCccEEEEcccccccC----Cce---EEEEEecccCCcceeeecChhhhccceeecCceeEecc Confidence 110 0 01123344444432 11 111 1211111 112221110 000000122444566 Q ss_pred EeeeccE-eecccceEEEEec Q lcl|Aclame:pro 472 EERLGLL-VYRPSAFQLIQLK 491 (497) Q Consensus 472 ~~r~~~~-v~~~~Af~~~~~~ 491 (497) ..|.+|. +++|.||++++=- T Consensus 316 ~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 316 SAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ccceeeeeeeccchheeeccC Confidence 7777555 5579999997622 No 180 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=95.32 E-value=0.0023 Score=35.05 Aligned_cols=274 Identities=12% Similarity=-0.010 Sum_probs=102.9 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhc-------cceecCCCceEEEEeecCCc---cceeecccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLI-------SSRPVTSPNLSYLTESAAHN---NAAAVAEAGTYPFSS 220 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~-------~~~~~~~~~~~~p~~~~~~~---~a~~v~Eg~~~~~s~ 220 (497) ++..-. -...|...+..++.+.+.....+.. ...+..+.-+.+|....-.+ ...-+.+.+..+..+ T Consensus 1 m~lsD~----~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSDL----AVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhhh----hhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccce Confidence 000000 0011222333444444333333322 12233344456666543222 112233333343333 Q ss_pred -ccceeEEeeeeeeeeechhhHHHHh---hHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhh Q lcl|Aclame:pro 221 -EEFARVYEQVGKVANALTITDEGLR---DAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 221 -~~~~~v~~~~~kia~~~~iS~ell~---ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~ 295 (497) .+...+....+.=.++.....+.+. +.+ .+...|.+.+++...+.+=..++.+-. +++...+..+ T Consensus 77 itt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~-----~a~~~~~~~v----- 146 (325) T protein:vir:95 77 LKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVY-----SALSQVSDVV----- 146 (325) T ss_pred eccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----Hhhcccccce----- Confidence 2344444443333332222222211 222 234444444444332222111211100 0000000000 Q ss_pred hhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce Q lcl|Aclame:pro 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA 375 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (497) .... ..............+.++...+- -....-.. T Consensus 147 -------------------------------------------~dis-~~~~~~~~~~s~~~l~~A~~klG-D~~~~l~~ 181 (325) T protein:vir:95 147 -------------------------------------------YDAT-ANTDAADKLPTWNNLNNGQAKFG-DQSSQIAA 181 (325) T ss_pred -------------------------------------------eeee-cccCcccccccHHHHHHHHHHhc-ccccceeE Confidence 0000 00000000001122222222221 11233457 Q ss_pred EEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC------cEEEeeccceEEEEEeccc Q lcl|Aclame:pro 376 VVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREG 449 (497) Q Consensus 376 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------~~~~gd~~~~~~~i~~r~~ 449 (497) |+||..++..|.+++-.+...++.... ...-++.+|++|++++.+|.. .+...-|..+++.+.+..+ T Consensus 182 ~~MHS~v~~~L~~~~L~~~~~~~~~~g-------~~~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~ 254 (325) T protein:vir:95 182 WIMHSTPMHKLYGSNLTNGERLFTYGT-------VNVVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNND 254 (325) T ss_pred EEEchHHHHHHHHhhccccccccccCC-------cccccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCCCC Confidence 999999999999876665444433321 122357899999999999843 2212223334454544444 Q ss_pred cEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 450 VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 450 ~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ......+..++ ..-...+|.+.. -++||..+..-+ +..+.| T Consensus 255 ~~~~~~~~~~~--~~~~~~~~~~~t---f~lhp~G~sw~~--s~~g~s 295 (325) T protein:vir:95 255 FDANEETKNGD--ENIIRTYQAEWS---YNIGVKGFAWDK--ANGGKS 295 (325) T ss_pred ccccccccCcc--cceeeeeeeeee---EEeecceeeeec--ccccCC Confidence 33322222211 122223332221 367888887722 222223 No 181 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=94.92 E-value=0.00075 Score=37.70 Aligned_cols=351 Identities=12% Similarity=0.070 Sum_probs=141.5 Q ss_pred HHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 74 DNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPF 153 (497) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (497) ..++.........+..+...... .....+ .+........ ..+. ............. +.......-..-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~l~-~~g~---~~~~~~~~~~~~~-~~~~~~~~~a~da~~~ 72 (388) T protein:vir:99 1 MKQLSKVHQSLAGRSVRAFDMAN-GKADYR--LTDMAVRELK-KFGL---VFDHATVKRQIEL-LHEGGVATQAFDSAYV 72 (388) T ss_pred CCCccceeeecCCcccchhhhhc-CCccee--eechhhHhhh-hcce---eccCccchhhhhh-hhhhhhhhcccCcccc Confidence 00000000000000000000000 000000 0000000000 0000 0000000000000 0000000000001111 Q ss_pred cccccCCcccc----cchhhHHHHHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccccccccccccceeE Q lcl|Aclame:pro 154 GSTGTFAPGIL----PTFLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 154 ~~~~~~g~~i~----~~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v 226 (497) +....++.-+| .-+.+.+++.+.......+++++.+.+.- .+.|++... .+.+.+.+-+...|..+...+.. T Consensus 73 ~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~-~G~A~~ygd~~D~Pl~d~~~~~~ 151 (388) T protein:vir:99 73 APTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPLSSWNVNFE 151 (388) T ss_pred cccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeec-ceeEEEeecccCCCceeccceee Confidence 11111122233 23456677777777777778888776432 456666544 47788889888889888777777 Q ss_pred EeeeeeeeeechhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHhhhhccCCC---ccccceeccccccccchhhhhhh Q lcl|Aclame:pro 227 YEQVGKVANALTITDEGLRDA----PELFNFVQGRLLEGIQRKEEVQLLAGGGY---PGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 227 ~~~~~kia~~~~iS~ell~ds----~~l~~~i~~~la~~~~~~~d~a~l~G~g~---~~~~Gil~~~~~~~~~~~~~~~~ 299 (497) +-..+.++....++.+=+..+ .++...-+....+++...+|+-.++|..- .+.-|++|.+........+ T Consensus 152 ~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at---- 227 (388) T protein:vir:99 152 RRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAST---- 227 (388) T ss_pred eeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccc---- Confidence 777777777777775433332 25777778888889999999999999532 2467999876543211110 Q ss_pred HHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhcc------CC Q lcl|Aclame:pro 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQ------TP 373 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 373 (497) ...+.......+...+++++..++.++...... .+ T Consensus 228 ---------------------------------------~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~ 268 (388) T protein:vir:99 228 ---------------------------------------TPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVD 268 (388) T ss_pred ---------------------------------------cCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccc Confidence 000111122234445566666666665444331 12 Q ss_pred ceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCC-cCcEEEeeccceEEEEEeccccEE Q lcl|Aclame:pro 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARREGVTM 452 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~-~~~~~~gd~~~~~~~i~~r~~~~i 452 (497) ..++|-|..+..|.. .+..|--++.... . ...++.++..+.+. ++. - +-...+|.+.+..+... T Consensus 269 ~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk--------~---n~Pnl~i~t~pEl~~a~~--t-gg~~~~~~~~~~~~~~~ 333 (388) T protein:vir:99 269 ITLVLPMNKVDMLSV-VTDLGISVRDWLK--------Q---TYPRVRVMSAPELQGGNP--D-DGKDIAYMFLDSVDTAV 333 (388) T ss_pred eEEEechHHHHhccc-cCcCCccHHHHHH--------H---hcCCcEEEEecccccccc--c-CCceeEEEEeccccccc Confidence 245566666655532 1222221211100 0 11123333333221 100 0 00111121211111100 Q ss_pred EEeccch--------hhhh-----cC--ceEEEEEeee-ccEeecccceEEEEec Q lcl|Aclame:pro 453 QMTNSNG--------TDFV-----DG--KVTVRAEERL-GLLVYRPSAFQLIQLK 491 (497) Q Consensus 453 ~~~~~~~--------~~f~-----~~--~v~~r~~~r~-~~~v~~~~Af~~~~~~ 491 (497) .-++... -.|. .. .+..-+..|. |..+++|.||++++=- T Consensus 334 ~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 334 DGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred ccCccCcceeEEecccccccccceecCceeEeccccceeeeEEeccchhheeccC Confidence 0000000 0010 11 2223344555 5556679999997622 No 182 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=94.69 E-value=0.0037 Score=33.89 Aligned_cols=326 Identities=12% Similarity=0.062 Sum_probs=141.2 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~ 209 (497) .....+..+........ ..+.. .+.+....|.|.....+...+.+.+.+++++++++++--.....-.....+-++- T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv-~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagr 77 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLA--KLNGV-NSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASR 77 (338) T ss_pred CCHHHHHHHHHHHHHHH--HHhCC-CcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCcccccc Confidence 01111111111111100 01111 1122234577877888889999999999999999987544333322111111221 Q ss_pred ec--cc-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCcccccee Q lcl|Aclame:pro 210 VA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLL 283 (497) Q Consensus 210 v~--Eg-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil 283 (497) +. .+ +..|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-.- T Consensus 78 tdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~ 157 (338) T protein:vir:11 78 TDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRA 157 (338) T ss_pred ccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChh Confidence 11 11 1222222244555566666666677888888753 678888888888887653333344554321111000 Q ss_pred ccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccc-cccchhhhhhhh-HHh Q lcl|Aclame:pro 284 QRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAG-SYPTAAEIAENV-FDA 361 (497) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~ 361 (497) +.+. ..+.+..|..-.......+.... .......... +.......+|.+ +++ T Consensus 158 ~nPl-----------------------lqDVNkGWlQ~~Re~ap~rv~~~---~~~~~~i~i~~g~~gdy~nLDalV~d~ 211 (338) T protein:vir:11 158 ANPL-----------------------LQDVNIGWFQQYRNNAPARVLKE---GKTTGKVVVGNGADADYKNLDALVFDV 211 (338) T ss_pred hCcC-----------------------ccccchhHHHHHHhhhhhhhhhc---ccccceeeecCCCCCccccHHHHHHHH Confidence 0000 00011111111111111111110 0000000000 000112223333 333 Q ss_pred hhhhhhhhcc-CCce-EEEehhHHH--HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeec Q lcl|Aclame:pro 362 FVDIQLTLFQ-TPNA-VVMNPRDWE--LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHF 437 (497) Q Consensus 362 ~~~~~~~~~~-~~~~-~~~n~~~~~--~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~ 437 (497) ...+..+.+. .+.. .++...-.+ .+..+ ..... +---.......-..+|-|+|.+..|.+|.+.+++=-| T Consensus 212 ~~~lI~~~~~~d~dLVvivG~dLladk~~~l~-n~~~~-----ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L 285 (338) T protein:vir:11 212 VSSLIDPWHRRDPGLVVILGRELVHDKYFPMV-NKDQP-----ATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTL 285 (338) T ss_pred HhccCChHHhcCCCEEEEEchhhhHHHHhHHH-hcCCC-----hHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeec Confidence 3333333333 3333 334433221 22222 11111 1000111112224589999999999999999998887 Q ss_pred cceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCC Q lcl|Aclame:pro 438 APSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 438 ~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~ 495 (497) +.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++-.+.+. T Consensus 286 ~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 286 KNLSLYW-QIGGRRRYLKEVP----EKNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred cccEEEE-ecCcEEEEEEecc----ccccccchhhhccceeeeccccEEEeecceecC Confidence 7753322 2222222222222 134444433344566677777777777655555 No 183 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=94.44 E-value=0.0035 Score=34.06 Aligned_cols=187 Identities=15% Similarity=0.130 Sum_probs=83.6 Q ss_pred eeeechhhHHHHhh---H---HHHHHHHHHHHHHHHHHHHHhhhhc----cCCCccccceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 233 VANALTITDEGLRD---A---PELFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 233 ia~~~~iS~ell~d---s---~~l~~~i~~~la~~~~~~~d~a~l~----G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) |-+ ..+|+-++.| + -++.+...+++.++++...|+.++. +.....|..--+..+...... T Consensus 1 iD~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a--------- 70 (221) T protein:vir:17 1 MDD-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGA--------- 70 (221) T ss_pred CCc-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccc--------- Confidence 222 3445545442 2 3688899999999999999988753 211111100000000000000 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce-EEEehh Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA-VVMNPR 381 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~n~~ 381 (497) ..........+.++.+...+....-....- ++++|. T Consensus 71 -------------------------------------------~~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~ 107 (221) T protein:vir:17 71 -------------------------------------------GNTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPR 107 (221) T ss_pred -------------------------------------------cccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcH Confidence 000011112233333333333333222333 455898 Q ss_pred HHHHHHHHhccc-Cccccccccccccccccc--ccccccccceeecCCCCc--CcEEE---eeccceEEEEEeccccEEE Q lcl|Aclame:pro 382 DWELLRLTKDAN-GQYMGGNFFGNAYGNPVN--GGKNIWGVPVVTTPLIPL--GTILV---GHFAPSVIQTARREGVTMQ 453 (497) Q Consensus 382 ~~~~l~~lkd~~-G~~~~~~~~~~~~~~~~~--~~~~l~G~pvv~s~~~~~--~~~~~---gd~~~~~~~i~~r~~~~i~ 453 (497) .+..|-+..|.. ..+.+.. ..+...+ .-..+.|++|+.|+++|. |+-+. |+|.. ..... T Consensus 108 ~y~~LL~~~d~~~~n~d~~~----s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~~~~ag~~~~---~~~~~------ 174 (221) T protein:vir:17 108 QYYSLISSVDTNILNREIGN----TQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNLVTDPGDATT---SGENN------ 174 (221) T ss_pred HHHHHHHhcCcceeeeeccc----ccccccccceeeeecCcEEEEeccCCcccccccccCCccccc---ccccc------ Confidence 888876432211 1111111 1111111 234688999999999996 33222 22110 00000 Q ss_pred EeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 454 MTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 454 ~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ..+.. .|.+ .-+.++||+|+.-+++-....-. T Consensus 175 -~~yr~-~fs~----------~~glv~~~~Avgtvkl~~~~~~~ 206 (221) T protein:vir:17 175 -GSYRP-AITD----------RAGLVFHKEAADTVEVLLPPSRP 206 (221) T ss_pred -ccccc-cccc----------eEEEEEcchheeeeeeecCCCCC Confidence 00111 1211 12667888888888776555443 No 184 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=94.30 E-value=0.0048 Score=33.29 Aligned_cols=275 Identities=11% Similarity=-0.015 Sum_probs=106.7 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee-----c--CCCceEEEEeecCCccceee-cccccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-----V--TSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~~~a~~v-~Eg~~~~~s~~~ 222 (497) ++ ......+|..+....++.+++.+++-++++.-. . .+.++++++....+. ..+. +.+..+...+.+ T Consensus 1 MA----N~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v-~d~~~~~~~~~~~~~~~ 75 (423) T protein:vir:35 1 MA----NNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKS-ERTETGDITGKDKNGLF 75 (423) T ss_pred Cc----cchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCccee-ecccCcCCCCccccccc Confidence 11 111233556677788888988888888776522 1 145778886543211 1111 112222223333 Q ss_pred cee--EEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhH Q lcl|Aclame:pro 223 FAR--VYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 223 ~~~--v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) -.+ +.+.-+|...+-.=..|+..+..+++.++... .++++..+|..++..--...+..+-+ . .+ T Consensus 76 e~~v~l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a-~~ala~~vd~~l~~~l~~~a~~~vgt-~--~t---------- 141 (423) T protein:vir:35 76 SAKATGKVGKYITVAVEWTQIEEALKLNQLDQILSPI-HERMVTDLETELAHFMMNNGALSLGS-P--NT---------- 141 (423) T ss_pred cceeeEEeccceeccceeCHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccccccc-c--cC---------- Confidence 333 55566666665555566666666777777766 47788888887763110000000000 0 00 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) .+...+.+......+.....+. ..-..+++| T Consensus 142 -----------------------------------------------~~~~~~~i~~a~~~Ld~~~vP~--~~R~~Vv~p 172 (423) T protein:vir:35 142 -----------------------------------------------AIKKWADVAQTASFIKDIGIKT--GENYAIMDP 172 (423) T ss_pred -----------------------------------------------CcchHHHHHHHHHHHHHhcCCc--CCCEEEeCH Confidence 0000111111222222221111 123457889 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccc-cccccccceeecCCCCcCcE-------EEeeccce-EEEEEeccccE Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNG-GKNIWGVPVVTTPLIPLGTI-------LVGHFAPS-VIQTARREGVT 451 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~-~~~l~G~pvv~s~~~~~~~~-------~~gd~~~~-~~~i~~r~~~~ 451 (497) ..+..|.+ + +. .++............+. ...+.|+.|+.|+++|..+. .++--... ...+.+....+ T Consensus 173 ~~~a~Ll~--~-~~-~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~ 248 (423) T protein:vir:35 173 WSAQRLAD--A-QS-GLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFT 248 (423) T ss_pred HHHHHHhc--c-cc-ceeccccchhHHHhhccceeeecceEEEEcCCCccccccccccceeeccccccccccccccccce Confidence 88877642 1 11 12111111111111222 25789999999999995321 11000000 00000010000 Q ss_pred E----EEeccchhhhhcCceEEEEEeee---ccEee----cccceEEEEecCC---CCCC Q lcl|Aclame:pro 452 M----QMTNSNGTDFVDGKVTVRAEERL---GLLVY----RPSAFQLIQLKKG---ATGS 497 (497) Q Consensus 452 i----~~~~~~~~~f~~~~v~~r~~~r~---~~~v~----~~~Af~~~~~~~~---a~~~ 497 (497) + .+....+..-..|.+.|-+..-+ ...+. .|..+.+.-..++ +.|. T Consensus 249 ~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~ 308 (423) T protein:vir:35 249 VALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGD 308 (423) T ss_pred eeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEeccccccccCc Confidence 0 10000000111122222211100 00000 0111111111000 0000 No 185 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=94.08 E-value=0.0055 Score=32.99 Aligned_cols=325 Identities=13% Similarity=0.089 Sum_probs=141.4 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhc---ccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcc Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFG---STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) .....+..+........ ..+... .+.+--..|.|.....+...+.+.+.+++++++++++--.....-.....+- T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~i 78 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQA--ELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTV 78 (342) T ss_pred CChHHHHHHHHHHHHHH--HHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccc Confidence 00111111111111000 001110 0111123466777788888889999999999999987644433322111111 Q ss_pred ceeec---cccccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCcccc Q lcl|Aclame:pro 207 AAAVA---EAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVN 280 (497) Q Consensus 207 a~~v~---Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~ 280 (497) ++-+. -++..|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+ T Consensus 79 agrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 158 (342) T protein:vir:10 79 ASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATS 158 (342) T ss_pred ccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCC Confidence 22211 112223333455666666666666677888888753 678888888888887643333344554321111 Q ss_pred ceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---hcccccccccccccccchhhhhhh Q lcl|Aclame:pro 281 GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV---VTGAAGSGSGVAGSYPTAAEIAEN 357 (497) Q Consensus 281 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 357 (497) -. ..... ..+.+.. |+-.++.... +.+.........+..+ ....+|. T Consensus 159 d~---------------~~nPl--------lqDVN~G------WlQ~~Re~ap~rv~~~~~~~~~i~iG~~g-dy~NLDa 208 (342) T protein:vir:10 159 DR---------------NSNPL--------LQDVAKG------WLQKMREDAKERVMNGESTDNQVLVGKGQ-EYANLDA 208 (342) T ss_pred Ch---------------hhCcC--------ccccchH------HHHHHHhhhhhhhcccceeccceeecCCC-CcccHHH Confidence 00 00000 0001111 1111111111 0111111111111111 2223333 Q ss_pred h-HHhhhhhhhhhc-cCCceEEEehhHHH---HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcE Q lcl|Aclame:pro 358 V-FDAFVDIQLTLF-QTPNAVVMNPRDWE---LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI 432 (497) Q Consensus 358 ~-~~~~~~~~~~~~-~~~~~~~~n~~~~~---~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~ 432 (497) + +++...+..+.+ -.+...++=..+.. .+.++.- .+. +---.......-..++-|+|.+.-+++|++.+ T Consensus 209 lV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~-~~~-----ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~i 282 (342) T protein:vir:10 209 LVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQ-QNA-----PTEELAADIVISQKRIGGLKAVRVPFFPANAI 282 (342) T ss_pred HHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhc-CCC-----hHHHHHHHHHHhhhhhcCceeEEccccCCCce Confidence 3 334443333333 33444443333221 1222211 111 00001111223345899999999999999999 Q ss_pred EEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 433 LVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 433 ~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++=-|+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++-...+..- T Consensus 283 lVT~L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 283 LITKLENLAIYV-QEGTTRKHIENVP----KKDRIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred EEeeccccEEEE-ecCcEEEEEEecc----ccccccchhhhccceeeeccccEEEeecceecCCC Confidence 988877753321 2222222222222 13333333333345566667777766644444444 No 186 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=94.01 E-value=0.0057 Score=32.90 Aligned_cols=278 Identities=12% Similarity=0.016 Sum_probs=111.9 Q ss_pred hhhcccccCCcccccc---hhhHHHHHHHhhhhHHhh---------ccceecCCCceEEEEeecCCccce--eeccc--c Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT---FLPGIVEQLFYELSLADL---------ISSRPVTSPNLSYLTESAAHNNAA--AVAEA--G 214 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~---~~~~ii~~~~~~~~l~~~---------~~~~~~~~~~~~~p~~~~~~~~a~--~v~Eg--~ 214 (497) ++.+ .-...++|+ +.+-+.+...+.+.+.+= ......++..+++|....-++... +-+.. + T Consensus 1 Ma~T---~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~ 77 (349) T protein:vir:94 1 MAIT---TIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CCce---EEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 2211 112345555 333334444444444331 111233456788898765433322 11111 1 Q ss_pred cccccccc-ceeEEeeeeeeeee--chhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceecccccccc Q lcl|Aclame:pro 215 TYPFSSEE-FARVYEQVGKVANA--LTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTA 291 (497) Q Consensus 215 ~~~~s~~~-~~~v~~~~~kia~~--~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~ 291 (497) ..+..+.+ ..++-...+.--++ ..++.++-- .+....|.+++++-..+...+.+|. -.+|+++....... T Consensus 78 ~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG--~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~~~~ 150 (349) T protein:vir:94 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTS--QNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhC--chHHHHHHHHHHHHHhhHHHHHHHH-----HHHhhhcccccccc Confidence 22223322 22222222222222 233443322 2456666777666665555544442 12333322111000 Q ss_pred chhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh--- Q lcl|Aclame:pro 292 SSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT--- 368 (497) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 368 (497) .... ......... .........++.+...+-.. T Consensus 151 ~~~~--------------------------------------~~~~~~d~~------~~a~~~~~~~~~A~~~~Gdaa~G 186 (349) T protein:vir:94 151 AYHE--------------------------------------QNDMVVDVS------ATSGFDAGAFIDATQTMGDALMG 186 (349) T ss_pred cccc--------------------------------------cCceeEEec------ccCCCChhhHHHHHHHHHHHhcc Confidence 0000 000000000 00000111122222111111 Q ss_pred -hccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCc---C---c---EEEeecc Q lcl|Aclame:pro 369 -LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL---G---T---ILVGHFA 438 (497) Q Consensus 369 -~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~---~---~---~~~gd~~ 438 (497) ....-++++||...+..|++++-=+ |+ ++.. ....-++++|++|++++.||. | . ++||. T Consensus 187 d~~~~lt~i~mHS~v~~~L~~~~li~--~i-~~s~------~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~-- 255 (349) T protein:vir:94 187 NGGEVLGAIAMHSFVYAQARKAQLID--FI-RDAE------NNTMFATYQGYRVIVDDSMTVVGQDTSRKFISIIFGQ-- 255 (349) T ss_pred ccccceeEEEEchHHHHHHHhcchhh--hc-cCcc------cCcccceecCcEEEEeCCCccccCCCCceEEEEEeec-- Confidence 1123457999999999998763311 11 1110 011346899999999999984 1 2 34443 Q ss_pred ceEEEEEecc-ccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCC-------CC Q lcl|Aclame:pro 439 PSVIQTARRE-GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT-------GS 497 (497) Q Consensus 439 ~~~~~i~~r~-~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~-------~~ 497 (497) +++...+-. ...+++.++....=..++-.+....|+ ++||..|..-.-..+.. |. T Consensus 256 -GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a~v~~~~~~~~~~sP 318 (349) T protein:vir:94 256 -GAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSAVITGNGTETIARSA 318 (349) T ss_pred -ceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---EeeeeeeeecccccCCCccccccCCC Confidence 334443332 223444443321001345556665555 56677766654222210 11 No 187 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=93.90 E-value=0.006 Score=32.76 Aligned_cols=302 Identities=9% Similarity=0.030 Sum_probs=139.2 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhh----ccceecCC-CceEEEEeecCCccceeec-cccccccccccce Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADL----ISSRPVTS-PNLSYLTESAAHNNAAAVA-EAGTYPFSSEEFA 224 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~----~~~~~~~~-~~~~~p~~~~~~~~a~~v~-Eg~~~~~s~~~~~ 224 (497) ++.+.-..-...-....++.+.+.+-..++|+.. ..+.+.++ .++..|.+.....++.|.. +..-...-.-.|. T Consensus 1 mp~~~lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~ 80 (321) T protein:vir:34 1 MPFPNISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDVIS 80 (321) T ss_pred CCCchHHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhhcc Confidence 0000000000000111223344444444444333 33344444 4678888777667788864 3333334456899 Q ss_pred eEEeeeeeeeeechhhH-HHHhhHH--HHHHHHHHHH---HHHHHHHHHhhhhc-cCC--Cccccceeccccccccchhh Q lcl|Aclame:pro 225 RVYEQVGKVANALTITD-EGLRDAP--ELFNFVQGRL---LEGIQRKEEVQLLA-GGG--YPGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 225 ~v~~~~~kia~~~~iS~-ell~ds~--~l~~~i~~~l---a~~~~~~~d~a~l~-G~g--~~~~~Gil~~~~~~~~~~~~ 295 (497) ..++..+..++-+.||- |+|+.+. .+..++...+ .+.+...++..+.. |++ ..+..|+....... ..+ T Consensus 81 ~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~~---p~t 157 (321) T protein:vir:34 81 SAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGLDGAVPVD---PTV 157 (321) T ss_pred ccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhhhhhcccC---CCC Confidence 99999999999888885 5666542 3555555544 45677778777654 543 23455653322111 111 Q ss_pred hhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce Q lcl|Aclame:pro 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA 375 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (497) ..++....+. ...+.......... .....+...+.....-..-....|+. T Consensus 158 GtvGGIdra~---------------------------~~~WRn~~~d~~~~---~t~~tl~~~m~~~w~~~~Rg~~~PDl 207 (321) T protein:vir:34 158 GTYGGINRAL---------------------------WPFWRSQVEDMAAV---ATINTIQPAMTKLWSRCVRGADMPDL 207 (321) T ss_pred ceeccccccc---------------------------hhhhhhhhhhhhhc---ccHHHHHHHHHHHHHhhccCCCCccE Confidence 1111110000 00000000000000 01111122222222222233446788 Q ss_pred EEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC----CCCcCcEEEeeccceEEEEEeccccE Q lcl|Aclame:pro 376 VVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP----LIPLGTILVGHFAPSVIQTARREGVT 451 (497) Q Consensus 376 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~----~~~~~~~~~gd~~~~~~~i~~r~~~~ 451 (497) |++...-|..++.---..-||--... ...|. ..=...|.-||.++ .+|+++.+|-|-+...+.......+. T Consensus 208 ii~~~~~y~~y~~s~q~~qR~~~~~~--a~~Gf---~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~ 282 (321) T protein:vir:34 208 IMSGNDAWTTYSNSLQVLQRFTSAEE--ANLGF---RSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMV 282 (321) T ss_pred EEechHHHHHHHHhhheeeeeccccc--ccccc---eeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCcee Confidence 88888888777765444444433222 11111 12245688888887 68999999888775333222222332 Q ss_pred EEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 452 MQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 452 i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~ 491 (497) . +.+..-..+-+|-+.-.+-.+....+-+|.+=.++.-. T Consensus 283 p-i~p~r~~~~NqdA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 283 P-LSPSRRAAFNQDAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred e-cCcccccccchhHHhhhhhhhheeeeecccceeEEeeC Confidence 2 22211101123333334444455555566555555433 No 188 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=93.72 E-value=0.0066 Score=32.54 Aligned_cols=387 Identities=12% Similarity=0.060 Sum_probs=108.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADET-KTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) +-++.+++++.....+++++...++. ++..++.+.++.+..+++...+.++..+.. ....... ...... T Consensus 8 ~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~------~~~~~~~----~~~~~~ 77 (415) T protein:vir:46 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEK------DRTSENN----QQSVEV 77 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHhhhhc----cccccc Confidence 44556666666666666665433322 223333333333333333322222111100 0000000 000000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+.+ .............. .......... ........ .... ......... .++...- T Consensus 78 ~~~~-------~~~~~~~~~~~~~~----~~~~~~~~~~----~~~~~~~~--~~~~-----~~~~~~~~t--~~g~~~i 133 (415) T protein:vir:46 78 NEAR-------TYRNQANINDLGIS----IQNTKVTSQE----VRDFTEYL--ETRN-----DIQGGSLKT--DSGFVVI 133 (415) T ss_pred chhh-------hhHHHHHHHHHHHh----hhhhhhhHHH----HHHHHHHH--hhhh-----hhhhccccc--cCCcccc Confidence 0000 00000000000000 0000000000 00000000 0000 000000000 0000001 Q ss_pred CcccccchhhH-----HHHHHHhhhhHHhhccceec---CCC-ceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 160 APGILPTFLPG-----IVEQLFYELSLADLISSRPV---TSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~i~~~~~~~-----ii~~~~~~~~l~~~~~~~~~---~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ...+.+.+... .+..+...-++...-..+++ .+. ...+. + .+ .-..|.........++..-.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v---~-Eg--~~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:46 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKV---E-EL--EENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred cHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeec---c-cc--cccccccccceeeEEeeeeeeEe Confidence 11111211111 11111111111111111111 111 11111 1 11 11222222222233333333322 Q ss_pred eeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhh Q lcl|Aclame:pro 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-.-..--+...-..-...+...|...+++.+..++=...-.|...+...+....... +.........+.......... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:46 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-LEVKKAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccce-eccccccchHHHHHHHHhhhh Confidence 2111111111111111234566677777777766665544445443332222222111 122222233334444445555 Q ss_pred hhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHh Q lcl|Aclame:pro 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.|+++...+..+...++..|.+...+......+...... .++..+... . T Consensus 287 ~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~-------------------pV~~~~~~~------~ 341 (415) T protein:vir:46 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGA-------------------KIEILPDEV------L 341 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccce-------------------eeEEecccc------c Confidence 566677888888888888888887777665443322222111000 011111000 0 Q ss_pred cccCc--ccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEE---------EEeccch Q lcl|Aclame:pro 391 DANGQ--YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTM---------QMTNSNG 459 (497) Q Consensus 391 d~~G~--~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i---------~~~~~~~ 459 (497) .+.|. .+|.+ +..... ...-.|+.|-.+++.-..+.+ ....|-+..+ +++.... T Consensus 342 ~~~~~~~~~~gd-~~~~~~-----~~~~~~~~v~~~~~~~~~~~~---------~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:46 342 GQKGNNTLIIGN-LKDAIV-----LFDRSQYQASWTDYMHFGECL---------MIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred cCCCccEEEEEe-hhccEE-----EEeecceEEEeeccccCceEE---------EEEEEeccEEeccccEEEEEeeccCC Confidence 00000 11110 000000 000112333333322111111 1111211211 1111110 Q ss_pred hhhhcCceEEEE Q lcl|Aclame:pro 460 TDFVDGKVTVRA 471 (497) Q Consensus 460 ~~f~~~~v~~r~ 471 (497) ..+..++-+ T Consensus 407 ---~~~~~~~~~ 415 (415) T protein:vir:46 407 ---GEGDLGLEA 415 (415) T ss_pred ---CCCCccCCC Confidence 011111111 No 189 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=93.72 E-value=0.0066 Score=32.54 Aligned_cols=387 Identities=12% Similarity=0.060 Sum_probs=108.4 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADET-KTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) +-++.+++++.....+++++...++. ++..++.+.++.+..+++...+.++..+.. ....... ...... T Consensus 8 ~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~------~~~~~~~----~~~~~~ 77 (415) T protein:vir:47 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEK------DRTSENN----QQSVEV 77 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHhhhhc----cccccc Confidence 44556666666666666665433322 223333333333333333322222111100 0000000 000000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+.+ .............. .......... ........ .... ......... .++...- T Consensus 78 ~~~~-------~~~~~~~~~~~~~~----~~~~~~~~~~----~~~~~~~~--~~~~-----~~~~~~~~t--~~g~~~i 133 (415) T protein:vir:47 78 NEAR-------TYRNQANINDLGIS----IQNTKVTSQE----VRDFTEYL--ETRN-----DIQGGSLKT--DSGFVVI 133 (415) T ss_pred chhh-------hhHHHHHHHHHHHh----hhhhhhhHHH----HHHHHHHH--hhhh-----hhhhccccc--cCCcccc Confidence 0000 00000000000000 0000000000 00000000 0000 000000000 0000001 Q ss_pred CcccccchhhH-----HHHHHHhhhhHHhhccceec---CCC-ceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 160 APGILPTFLPG-----IVEQLFYELSLADLISSRPV---TSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~i~~~~~~~-----ii~~~~~~~~l~~~~~~~~~---~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ...+.+.+... .+..+...-++...-..+++ .+. ...+. + .+ .-..|.........++..-.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v---~-Eg--~~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:47 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKV---E-EL--EENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred cHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeec---c-cc--cccccccccceeeEEeeeeeeEe Confidence 11111211111 11111111111111111111 111 11111 1 11 11222222222233333333322 Q ss_pred eeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhh Q lcl|Aclame:pro 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-.-..--+...-..-...+...|...+++.+..++=...-.|...+...+....... +.........+.......... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:47 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-LEVKKAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccce-eccccccchHHHHHHHHhhhh Confidence 2111111111111111234566677777777766665544445443332222222111 122222233334444445555 Q ss_pred hhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHh Q lcl|Aclame:pro 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.|+++...+..+...++..|.+...+......+...... .++..+... . T Consensus 287 ~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~-------------------pV~~~~~~~------~ 341 (415) T protein:vir:47 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGA-------------------KIEILPDEV------L 341 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccce-------------------eeEEecccc------c Confidence 566677888888888888888887777665443322222111000 011111000 0 Q ss_pred cccCc--ccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEE---------EEeccch Q lcl|Aclame:pro 391 DANGQ--YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTM---------QMTNSNG 459 (497) Q Consensus 391 d~~G~--~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i---------~~~~~~~ 459 (497) .+.|. .+|.+ +..... ...-.|+.|-.+++.-..+.+ ....|-+..+ +++.... T Consensus 342 ~~~~~~~~~~gd-~~~~~~-----~~~~~~~~v~~~~~~~~~~~~---------~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:47 342 GQKGNNTLIIGN-LKDAIV-----LFDRSQYQASWTDYMHFGECL---------MIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred cCCCccEEEEEe-hhccEE-----EEeecceEEEeeccccCceEE---------EEEEEeccEEeccccEEEEEeeccCC Confidence 00000 11110 000000 000112333333322111111 1111211211 1111110 Q ss_pred hhhhcCceEEEE Q lcl|Aclame:pro 460 TDFVDGKVTVRA 471 (497) Q Consensus 460 ~~f~~~~v~~r~ 471 (497) ..+..++-+ T Consensus 407 ---~~~~~~~~~ 415 (415) T protein:vir:47 407 ---GEGDLGLEA 415 (415) T ss_pred ---CCCCccCCC Confidence 011111111 No 190 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=93.64 E-value=0.0069 Score=32.44 Aligned_cols=265 Identities=12% Similarity=-0.002 Sum_probs=95.5 Q ss_pred hhcccccCCcccccchhhHHHHHHHhhhhHHhhccceec-------CCCceEEEEee-cCCccceeeccccccccccc-c Q lcl|Aclame:pro 152 PFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPV-------TSPNLSYLTES-AAHNNAAAVAEAGTYPFSSE-E 222 (497) Q Consensus 152 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~p~~~-~~~~~a~~v~Eg~~~~~s~~-~ 222 (497) +.++..+.-...-+...+..++.+.+...+++.+....+ .+.=...+... +......-|.-.+.....+. + T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit~ 80 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIAA 80 (315) T ss_pred CceeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceeccc Confidence 222222221222234455666666665555544332211 11111111110 10000000111111111221 1 Q ss_pred ceeEEeeeeeee-eechh--hHHHHh---hHH-HHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhh Q lcl|Aclame:pro 223 FARVYEQVGKVA-NALTI--TDEGLR---DAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 223 ~~~v~~~~~kia-~~~~i--S~ell~---ds~-~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~ 295 (497) ...+.. |++ +.-++ +.+.+. +.| ....-|...+..++...+=...+.|. .+.+...+..... T Consensus 81 ~~dvaV---k~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~-----~aai~~~t~~~~~--- 149 (315) T protein:vir:96 81 DEMVSV---KVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNAL-----QGAIGSNAGMNVS--- 149 (315) T ss_pred ccceeE---EEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhhhccccccccc--- Confidence 222222 233 23333 333332 222 22222222322222221111111110 0000000000000 Q ss_pred hhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce Q lcl|Aclame:pro 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA 375 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (497) ..........+.++...+ --....-.. T Consensus 150 ----------------------------------------------------~~~a~~~~~~l~dA~~kl-GD~~~~l~~ 176 (315) T protein:vir:96 150 ----------------------------------------------------GELATEGKKVLTKGLRTM-GDKASSIAI 176 (315) T ss_pred ----------------------------------------------------ccccccCHHHHHHHHHHh-cccccCeeE Confidence 000001111122222221 111223457 Q ss_pred EEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEe Q lcl|Aclame:pro 376 VVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMT 455 (497) Q Consensus 376 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~ 455 (497) |+||-.++..|.+ +.= -.+++... .+.-...+++.+|+||++++.||.++++. |..+++.+.....+..... T Consensus 177 ~vMHS~v~~~L~~-q~L-~~~~~~~~----~~~~~~~~~~~lGkrViVdD~~P~~~~~g--l~~GAi~~~~~~~~~~~~~ 248 (315) T protein:vir:96 177 WVMDSTSYFDIVD-EAI-DNKLYEEA----GVVVYGGTPGTLGKPVLVTDQCPATKIFG--LVAGAVMITESQAPGMRSY 248 (315) T ss_pred EEEchHHHHHHHH-hhh-hhhccccc----ceeEecCcCcccccEEEEECCCCcceeee--eecceeeecCCCccccccc Confidence 9999999999986 321 12232221 11122334567799999999999876543 3444555543333211111 Q ss_pred ccchhhhhcCceEEEEEeeecc-EeecccceEEEEecCCCCCC Q lcl|Aclame:pro 456 NSNGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 456 ~~~~~~f~~~~v~~r~~~r~~~-~v~~~~Af~~~~~~~~a~~~ 497 (497) .. .++-.+....|..+ -+++|..|..- +++..| T Consensus 249 ~~------~g~e~l~~~~r~e~tf~l~p~G~sw~---~~~~~s 282 (315) T protein:vir:96 249 QI------DDQENLAIGFRAEGTANVEVLGYKWK---TKTNVN 282 (315) T ss_pred cC------CCcceeEEEEeeeeEeeeeeeeEEee---cCCCcC Confidence 11 12333445555555 46788887763 333233 No 191 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=93.40 E-value=0.0077 Score=32.17 Aligned_cols=382 Identities=15% Similarity=0.146 Sum_probs=140.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFK--AHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~--~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |-+.--.++|-+ ++|+++..-.+..++. ..+..++...+.+|++.-+.+..-++.++++.+. T Consensus 1 mnkpdliekqnr----------------laelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN 64 (393) T protein:vir:16 1 MNKPDLIEKQNR----------------LAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 64 (393) T ss_pred CCCcchhhhhhh----------------hhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhh Confidence 544433333221 2222222222222222 1233444455566666666655555555444432 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..+... .++ .++..-.+..+ ....+......+.. ....+.+|..... .+..+ .++ T Consensus 65 ~~eE~~-------KGK-~kMt~~iesq~---A~~eF~~vL~~N~G-------~S~~k~AW~A~L~------E~GVt-iTD 119 (393) T protein:vir:16 65 AQEEKP-------KGK-DKMTNFIESQN---AVTEFFDVLKKNSG-------KSEIKNAWSAKLA------ENGVT-ITD 119 (393) T ss_pred hhhhcc-------hhh-HHHHHHHhhHH---HHHHHHHHHhccCC-------chhhhhhhhhhHh------hcCcc-eec Confidence 211110 000 00000001000 00000000000000 0012222221111 11111 111 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeech Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~ 238 (497) ....+|.-.+-.|-..+....+++....+..++.--++...++. ..|.-.-.|.++.+...+|..-++.+.-++.... T Consensus 120 ~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s~--~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S 197 (393) T protein:vir:16 120 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA--NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQS 197 (393) T ss_pred cchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhhhh--hhhhhhccCCccccceeeeeeechhHHHHHHHHH Confidence 22234434444555555666777766555555443333333332 2455566677888777777777776644433333 Q ss_pred hhHHHHhh---H-HHHHHHHHHHHHHHHH-HHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 239 ITDEGLRD---A-PELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 239 iS~ell~d---s-~~l~~~i~~~la~~~~-~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) + -++..| + ..+..||..+|+.++. +..|.+++-|+|+.....+..-+....... T Consensus 198 ~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k-------------------- 256 (393) T protein:vir:16 198 L-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKK-------------------- 256 (393) T ss_pred H-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHH-------------------- Confidence 3 233333 2 3578999999999998 889999999999876555433221110000 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhH-HHHHHHHhcc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD-WELLRLTKDA 392 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~-~~~l~~lkd~ 392 (497) ..+.....|.. | ..+.+-.+....+...+ ....+....+ -+-|..|+-+ T Consensus 257 --------------~Ttkaksagkt----------p----fadaieeavdfvrptag--rrylivktedrkalldelrqa 306 (393) T protein:vir:16 257 --------------ITTKAKSAGKT----------P----FADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQA 306 (393) T ss_pred --------------HhhhhhhcCCC----------c----hhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhh Confidence 00000000000 0 00111111111111111 0011111111 1222222211 Q ss_pred cCc---ccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 393 NGQ---YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 393 ~G~---~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) ... .|-.+. +.....+ .+.-+-|+.... .-..-++-|.. |.|. -+.+ +..+.--|.+|.-.+ T Consensus 307 tananvrikndd--teiasev----gvdeiivytgsk-alkptvlvdqk---yhid-mqdl----tkvdafewktnsnmi 371 (393) T protein:vir:16 307 TANANVRIKNDD--TEIASEV----GVDEIIVYTGSK-ALKPTVLVDQK---YHID-MQDL----TKVDAFEWKTNSNMI 371 (393) T ss_pred hccCceeeeccc--hhhhhhc----Ccceeeeeeccc-cccceeeeccc---cccc-hhhh----hhhhhheeccCCceE Confidence 100 000000 0000000 000000111000 00011122222 2221 1111 112233466666666 Q ss_pred EEEeeeccEeecccceEEEEec Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~~~~~~ 491 (497) .++.-..++|---.|=+.+++. T Consensus 372 lvetltsghvetynagavitvs 393 (393) T protein:vir:16 372 LVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred EEeecccCcceeeccceeEeeC Confidence 7776666666544444444444 No 192 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=93.27 E-value=0.0082 Score=32.03 Aligned_cols=379 Identities=11% Similarity=-0.015 Sum_probs=82.4 Q ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPS---TAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDI 77 (497) Q Consensus 1 m~~---~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~ 77 (497) |=- +.+++++++++.+++..+.+ +.+++.+... .+....+.+..+..++ ++..+++++++..+..+... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~-e~r~~~e~~~-~~~~~~~~~e~~~~~~------~l~~ei~~l~e~~~~~~~~~ 72 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKT-ELRSLLEGED-SEENLKKAEGVRAKYD------KAGKEIKDLEEKRDLYEAAL 72 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhc-cchHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHH Confidence 422 22223344444333333322 1111111110 0111111111111111 11111111111111111111 Q ss_pred HHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHH-HHHHHHHHHHHhhhhhhhhhhhhhhccc Q lcl|Aclame:pro 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPG-TAAAELMGAFADGETAPAAIGQNPFGST 156 (497) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (497) +..+.......... .................................... ......+.... .......+.. T Consensus 73 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~gg~ 144 (400) T protein:vir:38 73 KGNEQSSGKKPDHP-EEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVN-------AGVKAADAAS 144 (400) T ss_pred HHHhhcccccccch-hhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHh-------hcccccCCcc Confidence 10000000000000 000000000000000000000000000000000000 00000000000 0000000000 Q ss_pred ccCCcccccchhh-----HHHHHHHhhhhHHhhccceec---CCCceEEEEeecCCccceeeccccccccccccceeEEe Q lcl|Aclame:pro 157 GTFAPGILPTFLP-----GIVEQLFYELSLADLISSRPV---TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYE 228 (497) Q Consensus 157 ~~~g~~i~~~~~~-----~ii~~~~~~~~l~~~~~~~~~---~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~ 228 (497) .. -..+.+++.. ..+..+...-++-..-..+++ +++...+..+.+ -..+.........++..-.+ T Consensus 145 ~v-P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~------~~~~~~~~~f~~i~~~~~k~ 217 (400) T protein:vir:38 145 TI-PETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELE------KNPAMAKPEFKPVNWSVETY 217 (400) T ss_pred cc-cHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccc------cccccccccceeeEeehhhe Confidence 00 0011111111 111111111111110111121 112122211111 11111111112222222222 Q ss_pred eee-eeee-echhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccC--CCccccceeccccccccchhhhhhhHHHHH Q lcl|Aclame:pro 229 QVG-KVAN-ALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGG--GYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) Q Consensus 229 ~~~-kia~-~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~--g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) ... ++.- .+.-|.--+. ..+...|...+...+..++-...=.+. +.....+|....... T Consensus 218 ~~~~~is~ell~ds~~~~~--~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--------------- 280 (400) T protein:vir:38 218 RQALPVSQESIDDSAIDLV--GLIAQNGQQIKVNTTNGAVATLLKGFTAKTISSVDDLKHINNVD--------------- 280 (400) T ss_pred eeehhhHHHHHhhhHHHHH--HHHHHHHHHHHHHHHHHhhhhccccccccccccHHHHHHHHHhh--------------- Confidence 111 1111 1111111011 124444555555555554433222222 222333343211100 Q ss_pred HHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHH Q lcl|Aclame:pro 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 384 (497) ......+.|+++...+..+...++..|.+...+......+..... - .+++++... T Consensus 281 -----~~~~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G------------------~-pv~~~~~~~- 335 (400) T protein:vir:38 281 -----LDPAYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLG------------------M-PIAVVSDDT- 335 (400) T ss_pred -----hhhhhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCcccccc------------------c-eeEEecccc- Confidence 011123567778888888888877777766554433222211100 0 011111100 Q ss_pred HHHHHhcccCc--ccccccccccccccccccccc---cccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccch Q lcl|Aclame:pro 385 LLRLTKDANGQ--YMGGNFFGNAYGNPVNGGKNI---WGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 385 ~l~~lkd~~G~--~~~~~~~~~~~~~~~~~~~~l---~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) ....|. .+|.+ +.. ...+ .|+.+-.++.....+. +....|-+..+.. . T Consensus 336 -----~~~~g~~~~~~gd-~s~--------~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~r~d~~~~~----~ 388 (400) T protein:vir:38 336 -----LGAAGEAHAFLGD-IKR--------AILFANRADFMVRWVDDQIYGQF---------LQAGMRFGVSVAD----E 388 (400) T ss_pred -----cCCCCceEEEEEe-ccc--------cEEEEeecceEEEEeccccccee---------EEEEEEeccEEec----c Confidence 000011 01100 000 0001 1333434333222221 1222222222211 0 Q ss_pred hhhhcCceEEEEEeeeccEeecccc Q lcl|Aclame:pro 460 TDFVDGKVTVRAEERLGLLVYRPSA 484 (497) Q Consensus 460 ~~f~~~~v~~r~~~r~~~~v~~~~A 484 (497) +.|. . .-+-|.| T Consensus 389 ~a~~-------~------l~~~~~a 400 (400) T protein:vir:38 389 KAGY-------F------LTYTPKA 400 (400) T ss_pred cceE-------E------EEeecCC Confidence 1111 1 1112222 No 193 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=92.91 E-value=0.0095 Score=31.67 Aligned_cols=326 Identities=10% Similarity=-0.029 Sum_probs=134.5 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhc-ccccCCcccccchhhHHHHHHHhhhhHHhhccceecCC Q lcl|Aclame:pro 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG-STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTS 192 (497) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 192 (497) .. . ......+..+........ ..+... ...+-...|.|.....+...+.+.+.+++++++++++- T Consensus 1 m~-------~-----~M~~~tr~~~~~y~~~~A--~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e 66 (358) T protein:vir:78 1 MS-------Q-----TLTVQAEQRLNKYCDALA--KAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQ 66 (358) T ss_pred Cc-------c-----cccHHHHHHHHHHHHHHH--HHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCccccccc Confidence 00 0 000011111111110000 001110 01122345677777788888888999999999999876 Q ss_pred CceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhHH------HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 193 PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP------ELFNFVQGRLLEGIQRKE 266 (497) Q Consensus 193 ~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~------~l~~~i~~~la~~~~~~~ 266 (497) -.....-.....+-++-+.. ..|.....++.-.+.+++.---+.|+-+.|+... +|...+++.+.++++.=. T Consensus 67 ~~Ge~v~lg~~g~iagrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~ 144 (358) T protein:vir:78 67 IKGQVVQVGVGQLYTGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDM 144 (358) T ss_pred ceeeEEeecCCcccceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhcc Confidence 54444322111111222221 2233334556666666666666677777777542 588888888888775433 Q ss_pred HhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccc---cccccc Q lcl|Aclame:pro 267 EVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGA---AGSGSG 343 (497) Q Consensus 267 d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 343 (497) =.--+||+.....+-. ..... ..+.+.. |+-.++........ ...... T Consensus 145 i~IGfNGts~A~~Td~---------------~~nPl--------lqDVN~G------WlQ~~Re~a~~~v~~~~~~~~~i 195 (358) T protein:vir:78 145 LRVGWNGVSAADDTDP---------------TANPL--------GQDVNKG------WHQLAREWKGGSQIIKAAAGEKI 195 (358) T ss_pred ceecccceeeccCCCh---------------hhCcC--------ccccchH------HHHHHHhhchhhhhccccccCce Confidence 2334455432111100 00000 0011111 11111111110000 000000 Q ss_pred ccc-cccchhhhhhhhH-HhhhhhhhhhccC-CceEEEehhHHHH---HHHHhcccCccccccccccccccccccccccc Q lcl|Aclame:pro 344 VAG-SYPTAAEIAENVF-DAFVDIQLTLFQT-PNAVVMNPRDWEL---LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIW 417 (497) Q Consensus 344 ~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~n~~~~~~---l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~ 417 (497) ... ........+|.+. ++...+..+.+.+ +...++=-.++.+ +.++ ...+.+ +..-....-..+|- T Consensus 196 ~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~-n~~~~p-------TE~~Aa~~i~k~iG 267 (358) T protein:vir:78 196 YFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLY-SEATKP-------SEQIAAQQLAKSIA 267 (358) T ss_pred eecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHh-hcCCCc-------HHHHHHHHHHHHhC Confidence 000 0011122333333 3333343333333 3344433333222 2222 222111 11000011114789 Q ss_pred ccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCC--- Q lcl|Aclame:pro 418 GVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA--- 494 (497) Q Consensus 418 G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a--- 494 (497) |+|.+.-+++|++.+++=-|+.+.+.. .+....-.+-+... +|++.-.=..--+..|-+++.+|.++..... T Consensus 268 Glpa~~~PfFP~~~ilVT~L~NLsIY~-Q~gs~RR~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 342 (358) T protein:vir:78 268 GRKAYIPPFFPGKRMVVTTLDNLHCYT-QRGTRKRKADDNQD----SKSFDNQYWRMEGYALGEHKAYGGFEEADIEIGA 342 (358) T ss_pred CCeEEEccccCCCceEEeeccccEEEE-ecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEEeeeeeeeCC Confidence 999999999999999988887753321 22222222222211 3333333333334555566655555443211 Q ss_pred -------CCC Q lcl|Aclame:pro 495 -------TGS 497 (497) Q Consensus 495 -------~~~ 497 (497) +++ T Consensus 343 ~pa~~~~~~~ 352 (358) T protein:vir:78 343 DPAVLAVEAA 352 (358) T ss_pred CCCccccCCc Confidence 111 No 194 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=92.31 E-value=0.012 Score=31.12 Aligned_cols=274 Identities=11% Similarity=-0.031 Sum_probs=109.4 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee-----c--CCCceEEEEeecCCccceee-cccccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-----V--TSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEE 222 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~~~a~~v-~Eg~~~~~s~~~ 222 (497) |+ ......+|..+....++.+++..++.++++... . .+.++++++...... ..+. ..+..+...+.+ T Consensus 1 Ma----N~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~-~d~~~~~~~~~~~~dl~ 75 (423) T protein:vir:10 1 MP----NNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSS-LRTPTGDISGQNKNNLI 75 (423) T ss_pred Cc----cchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceee-eccCCccccccccCccc Confidence 11 111123455566788888888888877776521 1 356777776443211 1121 122222222333 Q ss_pred ce--eEEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCccccceeccccccccchhhhhhh Q lcl|Aclame:pro 223 FA--RVYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG-GGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 223 ~~--~v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G-~g~~~~~Gil~~~~~~~~~~~~~~~~ 299 (497) -+ .+.+.-+|...+-.=+.|+..+..+++.+++.. .++++..+|..++.- .+.. +. ....++ +. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~-~~-~~gt~~--t~-------- 142 (423) T protein:vir:10 76 SGKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNG-AL-SLGSPN--TP-------- 142 (423) T ss_pred cceeEEEeeceeeeeeeechHHHhcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhcc-cc-ccccCC--cc-------- Confidence 33 466666777666655667665666677777655 688999999987631 1100 00 000000 00 Q ss_pred HHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEe Q lcl|Aclame:pro 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 379 (497) +...+.+......+.....+. ..-..+++ T Consensus 143 -------------------------------------------------~~a~~~i~~a~~~Ld~~~vP~--~~R~~Vv~ 171 (423) T protein:vir:10 143 -------------------------------------------------ITKWSDVAQTASFLKDLGVNE--GENYAVMD 171 (423) T ss_pred -------------------------------------------------cchHHHHHHHHHHHHhccCCc--CCCEEEeC Confidence 000111111222222222221 12346788 Q ss_pred hhHHHHHHHHhcccCcccccccccccccccccc-cccccccceeecCCCCcCcEE-Eee--ccceEEEE-----Eecccc Q lcl|Aclame:pro 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG-GKNIWGVPVVTTPLIPLGTIL-VGH--FAPSVIQT-----ARREGV 450 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~-~~~l~G~pvv~s~~~~~~~~~-~gd--~~~~~~~i-----~~r~~~ 450 (497) |..+..|.+ +. +.++..........-.+. ...+.|+.|+.|+++|..+.. ++- +......+ .+.... T Consensus 172 p~~~a~Ll~--~~--~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~ 247 (423) T protein:vir:10 172 PWSAQRLAD--AQ--TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQF 247 (423) T ss_pred hHHHHHHhc--cc--cceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeecceecccccccccee Confidence 888777653 11 111111111111111111 247899999999999963211 100 00000000 011111 Q ss_pred EEEEe----ccchhhhhcCceEEEE---EeeeccEee------cccceEEEE-------------ecC------------ Q lcl|Aclame:pro 451 TMQMT----NSNGTDFVDGKVTVRA---EERLGLLVY------RPSAFQLIQ-------------LKK------------ 492 (497) Q Consensus 451 ~i~~~----~~~~~~f~~~~v~~r~---~~r~~~~v~------~~~Af~~~~-------------~~~------------ 492 (497) ++... +..+..-.-|.+.|-+ ..+....++ .+.-|+.+. +.. T Consensus 248 ~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~ 327 (423) T protein:vir:10 248 TVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYN 327 (423) T ss_pred eeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCcccc Confidence 11110 0000000011111111 111111111 111122111 110 Q ss_pred CCCCC Q lcl|Aclame:pro 493 GATGS 497 (497) Q Consensus 493 ~a~~~ 497 (497) ..++| T Consensus 328 ~v~a~ 332 (423) T protein:vir:10 328 SVSRQ 332 (423) T ss_pred ccccc Confidence 00000 No 195 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=92.30 E-value=0.012 Score=31.12 Aligned_cols=332 Identities=11% Similarity=0.033 Sum_probs=134.1 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhc-ccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccce Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFG-STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~ 208 (497) .....+..+........ ..+... ...+-...|.|.....+...+.+.+.+++++++++|+--.....-.....+-++ T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iag 78 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLA--KLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIAS 78 (355) T ss_pred CChHHHHHHHHHHHHHH--HHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceee Confidence 00011111111110000 001111 011123456677778888889999999999999998764443332211111122 Q ss_pred eecc--c-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccce Q lcl|Aclame:pro 209 AVAE--A-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) Q Consensus 209 ~v~E--g-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gi 282 (497) -+.- + +-.|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-. T Consensus 79 rtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~ 158 (355) T protein:vir:18 79 TTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDR 158 (355) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCCh Confidence 2111 1 1223333445566666666666677888888753 67888888888888765333334455432111100 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhc---ccccccccccccccchhhhhhhh- Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---GAAGSGSGVAGSYPTAAEIAENV- 358 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~- 358 (497) -+ ... ..+.+..|..-.......+.+.... +.........+.. .....+|.+ T Consensus 159 ~~---------------nPl--------lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~-gdy~NLDAlV 214 (355) T protein:vir:18 159 VK---------------NPM--------LQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKN-GDYENLDALV 214 (355) T ss_pred hh---------------CcC--------ccccchhHHHHHHhcchhhhhccccccccccccceeeecCC-CCcccHHHHH Confidence 00 000 0001111111111111111111000 0000000000111 112223333 Q ss_pred HHhhhhhhhhh-ccCCceE-EEehhHHH-HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEe Q lcl|Aclame:pro 359 FDAFVDIQLTL-FQTPNAV-VMNPRDWE-LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~-~~n~~~~~-~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~g 435 (497) +++...+.... .-.+... ++...-.+ ....|-+..+.+ --........-..+|-|+|.+..+++|.+.+++= T Consensus 215 ~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT 289 (355) T protein:vir:18 215 MDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQEN-----TESLAADIIISQKRIGNLPAVRVPYFPANAVFVT 289 (355) T ss_pred HHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhccCCh-----HHHHHHHHHHHHHhhCCceeEEccccCCCceEEe Confidence 33333333333 3333333 33333221 111222222111 0001111122235899999999999999999988 Q ss_pred eccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 436 HFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 436 d~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) -|+.+.+.. .+....-.+-+... +|++.-.=..--+..|-+++.+|.++--..+... T Consensus 290 ~L~NLsIY~-Q~gs~RR~~~d~p~----r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~~~ 346 (355) T protein:vir:18 290 TLENLSIYF-MDESHRRSIDENPK----KDRVENYESMNIDYVVEAYAAGCLLENITLGDFT 346 (355) T ss_pred eccccEEEE-ecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEEeeeeecCCC Confidence 777753322 22222222221111 2222222222233344444444444322222211 No 196 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=91.97 E-value=0.013 Score=30.85 Aligned_cols=275 Identities=11% Similarity=-0.033 Sum_probs=110.4 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee-----c--CCCceEEEEeecCCccceeeccccccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-----V--TSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEF 223 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~ 223 (497) |+ + .....+|..+....++.+++..++.++++... . .+.++++++.......-+-...+..+...+.+- T Consensus 1 Ma-N---~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MP-N---NLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cc-c---chhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 11 1 11123455566788888888888877776532 1 255787776432111111111111122223322 Q ss_pred --eeEEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhcc-CCCccccceeccccccccchhhhhhhH Q lcl|Aclame:pro 224 --ARVYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG-GGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 224 --~~v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G-~g~~~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) -.+.+.-+|...+-.=..|+..+..+++.+++.. .++++..+|..++.- .+.. +..+ ..++ +. T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a-~~~~-gt~~--t~--------- 142 (423) T protein:vir:17 77 GKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNG-ALSL-GSPN--TP--------- 142 (423) T ss_pred ceeEEEeeceeeeeeeecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhcc-cccc-ccCC--cc--------- Confidence 3566777777776666667665666777766655 688999999877532 1100 0000 0000 00 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEeh Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 380 (497) +...+.+......+.....+. ..-..+++| T Consensus 143 ------------------------------------------------~~a~~~i~~a~~~Ld~~~vP~--~~R~~Vv~p 172 (423) T protein:vir:17 143 ------------------------------------------------ITKWSDVAQTASFLKDLGVNE--GENYAVMDP 172 (423) T ss_pred ------------------------------------------------cccHHHHHHHHHHHHhccCCc--CCCEEEeCh Confidence 000111111222222222221 123467888 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccc-cccccccceeecCCCCcCcEE-Eeec-----c-ce-EEEEEec---- Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNG-GKNIWGVPVVTTPLIPLGTIL-VGHF-----A-PS-VIQTARR---- 447 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~-~~~l~G~pvv~s~~~~~~~~~-~gd~-----~-~~-~~~i~~r---- 447 (497) ..+..|.+ +. +.++..........-.+. ...+.|+.|+.|+++|..+.. ++-- . .. .....+. T Consensus 173 ~~~a~Ll~--~~--~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~ 248 (423) T protein:vir:17 173 WSAQRLAD--AQ--TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFT 248 (423) T ss_pred HHHHHHhc--cc--cceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeeccccccccccccccccee Confidence 88877753 11 111111111111111111 247899999999999953311 1100 0 00 0000000 Q ss_pred cccEEEEeccchhhhhcCceEEE---EEeeeccEee------cccceEEEE-ecCCCCCC Q lcl|Aclame:pro 448 EGVTMQMTNSNGTDFVDGKVTVR---AEERLGLLVY------RPSAFQLIQ-LKKGATGS 497 (497) Q Consensus 448 ~~~~i~~~~~~~~~f~~~~v~~r---~~~r~~~~v~------~~~Af~~~~-~~~~a~~~ 497 (497) .++...+....+..-.-|.+.|- ...+....|. ++.-|++.. ..+.+.+. T Consensus 249 ~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~ 308 (423) T protein:vir:17 249 VTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGD 308 (423) T ss_pred eeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCc Confidence 00010111111100011222222 1222222211 222232211 00001111 No 197 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=91.94 E-value=0.014 Score=30.82 Aligned_cols=391 Identities=15% Similarity=0.128 Sum_probs=139.9 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKA--HQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~--~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) |- + +.+.+. ..++.++ ...+++.++.+.++..++.- .+..++...+.+|++.-+.+..-++.++++.+. T Consensus 1 mr----i--S~~~~~--K~~l~EK-~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LN 71 (400) T protein:vir:93 1 MR----I--SKRNMN--KPDLIEK-QNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 71 (400) T ss_pred Cc----c--cccccc--cchHHHH-HHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhh Confidence 10 0 000000 0000000 01122233333333333221 233344445566666666555555555444443 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ..+...... .++..-.+..+ ....+......+.. ....+.+|..... .+..+ .++ T Consensus 72 a~~E~~KGK--------~kMt~~i~sq~---A~~eF~~vL~~N~G-------~S~~k~AW~A~L~------E~GVt-iTD 126 (400) T protein:vir:93 72 AQEEKPKGK--------DKMTNFIESQN---AVTEFFDVLKKNSG-------KSEIKNAWSAKLA------ENGVT-ITD 126 (400) T ss_pred hhhhhhhhh--------HHHHHHHhhHH---HHHHHHHHHhccCC-------chhhhhhhhhhHh------hcCcc-eec Confidence 221111100 00000000000 00000000000000 0012222221111 11111 111 Q ss_pred CCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeech Q lcl|Aclame:pro 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALT 238 (497) Q Consensus 159 ~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~ 238 (497) ....+|.-.+-.|-..+....+++....+..++.--++...++. ..|.-.-.|.++.+...+|..-++.+--++.... T Consensus 127 ~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s~--~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S 204 (400) T protein:vir:93 127 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA--NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQS 204 (400) T ss_pred cchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhhhh--hhhhhhccCCccccceeeeeeechhHHHHHHHHH Confidence 22234434444555556666777766555555443333333332 2455566678888877777777776654443333 Q ss_pred hhHHHHhh---H-HHHHHHHHHHHHHHHH-HHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 239 ITDEGLRD---A-PELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 239 iS~ell~d---s-~~l~~~i~~~la~~~~-~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) + -++..+ + ..+.+||..+|+.++. +..|.+++-|+|++....+..-+...... T Consensus 205 ~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~--------------------- 262 (400) T protein:vir:93 205 L-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIK--------------------- 262 (400) T ss_pred H-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHH--------------------- Confidence 3 233333 2 3588999999999998 88999999999987655554322111000 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehh-HHHHHHHHhcc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR-DWELLRLTKDA 392 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~-~~~~l~~lkd~ 392 (497) ...+.....|.. | ..+.+-.+....+...+ ....+.... .-+-|..|+-+ T Consensus 263 -------------~~Ttkaksagkt----------p----fadaieeavdfvrptag--rrylivktedrkalldelrqa 313 (400) T protein:vir:93 263 -------------KITTKAKSAGKT----------P----FADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQA 313 (400) T ss_pred -------------HHhhhhhhcCCC----------c----hhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhh Confidence 000000000000 0 00111111111111111 011111111 12222222221 Q ss_pred cCc-ccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEE Q lcl|Aclame:pro 393 NGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) Q Consensus 393 ~G~-~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~ 471 (497) ... .+-...-.......+ .+.-+-|+.... .-..-++-|.. |.|. -+++ +..+.--|.+|.-.+.+ T Consensus 314 tanahvriknddaeiasev----gvdeiivytgsk-alkptvlvdqk---yhid-mqdl----tkvdafewktnsnmilv 380 (400) T protein:vir:93 314 TANAHVRIKNDDAEIASEV----GVDEIIVYTGSK-ALKPTVLVDQK---YHID-MQDL----TKVDAFEWKTNSNMILV 380 (400) T ss_pred ccccceEeecchhhhhhhc----Ccceeeeeeccc-cccceeeeccc---cccc-hhhh----hhhhhheeccCCceEEE Confidence 100 000000000000000 000000111000 00111122222 2221 1111 11223346666666677 Q ss_pred EeeeccEeecccceEEEEec Q lcl|Aclame:pro 472 EERLGLLVYRPSAFQLIQLK 491 (497) Q Consensus 472 ~~r~~~~v~~~~Af~~~~~~ 491 (497) +.-..++|---.|=+.+++. T Consensus 381 etltsghvetynagavitvs 400 (400) T protein:vir:93 381 ETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eecccCcceeeccceeEeeC Confidence 76666666544444444444 No 198 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=91.50 E-value=0.016 Score=30.50 Aligned_cols=333 Identities=12% Similarity=0.063 Sum_probs=134.0 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhccc-ccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccce Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGST-GTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~ 208 (497) .....+..+........ ..+..... .+-...|.|.....+...+.+.+.+++++++++|+--.....-.....+-++ T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iag 78 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVA--ELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) T ss_pred CChHHHHHHHHHHHHHH--HHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccc Confidence 00111111111111100 01111110 1112346677777888888999999999999998764443332211111122 Q ss_pred eecc--c-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccce Q lcl|Aclame:pro 209 AVAE--A-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) Q Consensus 209 ~v~E--g-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gi 282 (497) -+.- + +..|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-. T Consensus 79 rtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~ 158 (355) T protein:vir:98 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) T ss_pred cccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCCh Confidence 1111 1 1223333445556666666666677888888753 67888888888888765333334455432111100 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhc---ccccccccccccccchhhhhhhh- Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---GAAGSGSGVAGSYPTAAEIAENV- 358 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~- 358 (497) -+.+. ..+.+..|..-.......+.+.... +.........+. ......+|.+ T Consensus 159 ~~nPl-----------------------lqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~-~gdy~NLDAlV 214 (355) T protein:vir:98 159 TKNTL-----------------------LQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGK-NGDYENIDALV 214 (355) T ss_pred hhCcC-----------------------ccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCC-CCCcccHHHHH Confidence 00000 0001111111111111111111000 000000000111 1112223333 Q ss_pred HHhhhhhhhhh-ccCCceEE-EehhHH--HHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEE Q lcl|Aclame:pro 359 FDAFVDIQLTL-FQTPNAVV-MNPRDW--ELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV 434 (497) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~~-~n~~~~--~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~ 434 (497) +++...+.... .-.+...+ +...-. ..+.++ +....+ ---..........+|-|+|.+..+++|.+.+++ T Consensus 215 ~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~-n~~~~p-----tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lV 288 (355) T protein:vir:98 215 MDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLV-NKQQEN-----SESLAADIIISQKRIGNLPAVRVPYFPANAVLV 288 (355) T ss_pred HHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHh-hccCCc-----HHHHHHHHHHHhhhhCCceeEEccccCCCceEE Confidence 33343333333 33333333 333222 112222 121111 000011122234589999999999999999998 Q ss_pred eeccceEEEEEeccccEEEEeccch----hhhhcCceEEEEEeeeccEeecccceEEEEec-CCCCCC Q lcl|Aclame:pro 435 GHFAPSVIQTARREGVTMQMTNSNG----TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK-KGATGS 497 (497) Q Consensus 435 gd~~~~~~~i~~r~~~~i~~~~~~~----~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~-~~a~~~ 497 (497) =-|+.+.+.. .+....-.+-+... ..|..-.-+|.++..--+...+ .+...... +++.+| T Consensus 289 T~L~NLsIY~-Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~~ 353 (355) T protein:vir:98 289 TTLENLSIYF-MDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPES 353 (355) T ss_pred eeccccEEEE-ecCcEEEEEEeccccccccchhhhcceeeeeccccEEEee--ceeeeCCCCCccccc Confidence 8777753322 12222222211111 1122333344444444443333 22222221 122222 No 199 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=91.33 E-value=0.016 Score=30.38 Aligned_cols=324 Identities=13% Similarity=0.042 Sum_probs=144.4 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~ 209 (497) .....+..+........ ..+.. ...+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++- T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv-~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagr 77 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIA--KLNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) T ss_pred CChHHHHHHHHHHHHHH--HhcCh-hhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeee Confidence 00001111110000000 00000 1112233466777778888888899999999999987544333222111111111 Q ss_pred e--ccccccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceec Q lcl|Aclame:pro 210 V--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) Q Consensus 210 v--~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~ 284 (497) + +.+...|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-.-+ T Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~ 157 (337) T protein:vir:10 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred ecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 1 1222333334456666677777666777888888753 6788888888888876533333445543211110000 Q ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---hcccccccccccccccchhhhhhh-hHH Q lcl|Aclame:pro 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV---VTGAAGSGSGVAGSYPTAAEIAEN-VFD 360 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 360 (497) .+.. .+. +--|+-..+.... +.......+....+.......+|. +++ T Consensus 158 nPll-----------------------qDV------NkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D 208 (337) T protein:vir:10 158 NPLL-----------------------QDV------NIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMD 208 (337) T ss_pred CcCc-----------------------ccc------chhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHH Confidence 0000 000 1111111111000 000000000011111112233343 344 Q ss_pred hhhhhhhhhcc-CCceEEEehhHHHH---HHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEee Q lcl|Aclame:pro 361 AFVDIQLTLFQ-TPNAVVMNPRDWEL---LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) Q Consensus 361 ~~~~~~~~~~~-~~~~~~~n~~~~~~---l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd 436 (497) +...+..+.+. .+...++=..+..+ +.++. ..+. +---.......-..+|-|+|.+..|.+|++.+++=- T Consensus 209 ~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n-~~~~-----ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~ 282 (337) T protein:vir:10 209 IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQA-----PTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTK 282 (337) T ss_pred HHhccCChHHhcCCCEEEEEchhhhhHHhhHHhc-cCCC-----cHHHHHHHHHHHhhhhCCceeEEccccCCCceEEee Confidence 44433333333 33443332332211 11121 1111 000000111222358999999999999999999888 Q ss_pred ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 437 ~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) |+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++--..+.+ T Consensus 283 L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 283 LSNLSIYY-QEGARRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred chhcEEEE-ecCcEEEEEEEcc----ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 87754322 2222222222222 1444444444445667777888887776566666 No 200 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=91.03 E-value=0.018 Score=30.17 Aligned_cols=324 Identities=13% Similarity=0.042 Sum_probs=144.0 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~ 209 (497) .....+..+........ ..+.. ...+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++- T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv-~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagr 77 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIA--KLNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) T ss_pred CChHHHHHHHHHHHHHH--HhcCh-hhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeee Confidence 00001111110000000 00000 1111223466777778888888899999999999987544333322111111111 Q ss_pred e--ccccccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceec Q lcl|Aclame:pro 210 V--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) Q Consensus 210 v--~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~ 284 (497) + +.+...|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-.-+ T Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~ 157 (337) T protein:vir:79 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred ecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 1 1222333334456666677776666777888888753 6788888888888876533333445543211110000 Q ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---hcccccccccccccccchhhhhhh-hHH Q lcl|Aclame:pro 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV---VTGAAGSGSGVAGSYPTAAEIAEN-VFD 360 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 360 (497) .+.. .+. +--|+-..+.... +.......+....+.......+|. +++ T Consensus 158 nPll-----------------------qDV------NkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D 208 (337) T protein:vir:79 158 NPLL-----------------------QDV------NIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMD 208 (337) T ss_pred CcCc-----------------------ccc------chhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHH Confidence 0000 000 1111111111000 000000000011111112233343 344 Q ss_pred hhhhhhhhhcc-CCceEEEehhHHHH---HHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEee Q lcl|Aclame:pro 361 AFVDIQLTLFQ-TPNAVVMNPRDWEL---LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) Q Consensus 361 ~~~~~~~~~~~-~~~~~~~n~~~~~~---l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd 436 (497) +...+..+.+. .+...++=..+..+ +.++. ..+. +---.......-..+|-|+|.+..|.+|++.+++=- T Consensus 209 ~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n-~~~~-----ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~ 282 (337) T protein:vir:79 209 IVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVN-ATQA-----PTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTK 282 (337) T ss_pred HHhccCChHHhcCCCEEEEEchhhhhHHhhHHhc-cCCC-----cHHHHHHHHHHHhhhhCCceeEEccccCCCceEEee Confidence 44433333333 33443332332211 11121 1111 000000111222358999999999999999999888 Q ss_pred ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 437 ~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) |+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++--..+.+ T Consensus 283 L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 283 LSNLSIYY-QEGARRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred chhcEEEE-ecCcEEEEEEEcc----ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 87754322 2222222222222 1444444444445667777787777775555555 No 201 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=90.44 E-value=0.021 Score=29.80 Aligned_cols=324 Identities=12% Similarity=0.032 Sum_probs=143.1 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~ 209 (497) .....+..+........ ..+.. ...+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++- T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv-~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagr 77 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIA--KLNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) T ss_pred CChHHHHHHHHHHHHHH--HhcCh-hhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeee Confidence 00001111110000000 00000 1111223466777788888889999999999999887544333222111111111 Q ss_pred e--ccccccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceec Q lcl|Aclame:pro 210 V--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) Q Consensus 210 v--~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~ 284 (497) . +-+...|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-.- T Consensus 78 tdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~- 156 (337) T protein:vir:78 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQ- 156 (337) T ss_pred ecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChh- Confidence 1 1122333333455666666666666677888888753 678888888888887643333344554321111000 Q ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---hcccccccccccccccchhhhhhhh-HH Q lcl|Aclame:pro 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV---VTGAAGSGSGVAGSYPTAAEIAENV-FD 360 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 360 (497) ..... .+. +.-|+-..+.... +.......+....+.......+|.+ ++ T Consensus 157 --------------~nPll--------qDV------N~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d 208 (337) T protein:vir:78 157 --------------ANPLL--------QDV------NIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMD 208 (337) T ss_pred --------------hCcCc--------ccc------chHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHH Confidence 00000 001 1111111111000 0000000000111111122233333 33 Q ss_pred hhhhhhhhh-ccCCceEEEehhHHHH---HHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEee Q lcl|Aclame:pro 361 AFVDIQLTL-FQTPNAVVMNPRDWEL---LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) Q Consensus 361 ~~~~~~~~~-~~~~~~~~~n~~~~~~---l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd 436 (497) +...+..+. .-.+...++=..+..+ +..+. ..+.+ ---.......-..++-|+|.+.-|++|++.+++=- T Consensus 209 ~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n-~~~~p-----tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~ 282 (337) T protein:vir:78 209 IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQAP-----TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTK 282 (337) T ss_pred HHhccCChHHhcCCCEEEEEchhhhHHHHHHHHh-cCCCc-----HHHHHHHHHHHhhhhcCcceEEccccCCCceEEee Confidence 343333333 3334444443333222 22222 11110 00001111223458999999999999999999888 Q ss_pred ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCC Q lcl|Aclame:pro 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) Q Consensus 437 ~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~ 496 (497) |+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++--..+.| T Consensus 283 L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 283 LSNLSIYY-QEGARRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred chhcEEEE-ecCcEEEEEEecc----ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 87753321 2222222222222 1444444444445667778888887776666666 No 202 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=89.63 E-value=0.025 Score=29.34 Aligned_cols=297 Identities=10% Similarity=0.033 Sum_probs=110.9 Q ss_pred hhhc-ccccCCcccccchhhHHH-HHHHhhhhHHh---------hccceecCCCceEEEEeecCCccceeeccccc---c Q lcl|Aclame:pro 151 NPFG-STGTFAPGILPTFLPGIV-EQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAAAVAEAGT---Y 216 (497) Q Consensus 151 ~~~~-~~~~~g~~i~~~~~~~ii-~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~---~ 216 (497) ++.. .-+.-...+.|+.....+ +...+.+.|++ +......++..+++|....-++...-+.+... . T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 0000 000011123333322222 22222222211 11112345667899988665554444444322 2 Q ss_pred ccccccceeEEeeeeeeeee---chhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccch Q lcl|Aclame:pro 217 PFSSEEFARVYEQVGKVANA---LTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASS 293 (497) Q Consensus 217 ~~s~~~~~~v~~~~~kia~~---~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~ 293 (497) +..+.+-++.....+-.+.- ..++..+-- .+....|.+.++.-..+.....+|. -.+||++......... T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG--~dpm~~Ia~qva~yW~r~~q~~Lla-----~L~Gvf~~~~a~~~~~ 153 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG--SNPMTRIRNRFGVYWTRQWQRRIIA-----MAVGVYKSNLAGNFAT 153 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhC--chHHHHHHHHHHHHhhhhhHHHHHH-----HHHHhhccccccchhh Confidence 22333322222222222222 233333321 3556666666665555444444332 1223332211110000 Q ss_pred hhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCC Q lcl|Aclame:pro 294 ASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTP 373 (497) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 373 (497) ... ..... ......... ..+... ..............+.++...+-. ....- T Consensus 154 ~~~----------~~~~~-------------a~~~~~~~~--~~~Dis--~~t~~~~~~~s~~~~~~A~~~lGD-~~~~l 205 (367) T protein:vir:80 154 IKT----------RGRVP-------------AEVLGTAGD--MVIDIS--GQTNPADAVFNREAFVDAAFTMGD-HVGSI 205 (367) T ss_pred hhh----------hhccc-------------cccccccCc--eeeeee--ccCCCccceecHHHHHHHHHHhcc-ccccc Confidence 000 00000 000000000 000000 000000011112223333222222 22345 Q ss_pred ceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCc-----C-c---EEEeeccceEEEE Q lcl|Aclame:pro 374 NAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-----G-T---ILVGHFAPSVIQT 444 (497) Q Consensus 374 ~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~-----~-~---~~~gd~~~~~~~i 444 (497) .+++||+..+..|++++=- .|+- +. . ....-+++.|++|++++.||. + . ++||... +.. T Consensus 206 ~~i~mHS~V~~~L~~~~li--~~i~-~s-d-----~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GA---i~~ 273 (367) T protein:vir:80 206 AAIAVHSMVYKRMTNNDEI--EFIP-DS-K-----GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA---FGY 273 (367) T ss_pred cEEEEchHHHHHHHhcccc--cccc-CC-C-----CccccceecceeEEEeCCCcccccCCCceEEEEEEecce---eee Confidence 6899999999999887421 1110 00 0 012356889999999999994 2 2 3454443 332 Q ss_pred Eecc-ccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCC-------------CCC Q lcl|Aclame:pro 445 ARRE-GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA-------------TGS 497 (497) Q Consensus 445 ~~r~-~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a-------------~~~ 497 (497) .+.. ...+++.+.....=-.++-.+....| .+.||-.|....-..++ +-| T Consensus 274 ~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~s 337 (367) T protein:vir:80 274 ADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPA 337 (367) T ss_pred cccCCccceecccchhhhcCCceEEEEeeee---EEeecceeeecccccccccccccccccccccCC Confidence 2221 12234444331000023333444434 68889888775433221 001 No 203 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=89.01 E-value=0.029 Score=29.03 Aligned_cols=332 Identities=13% Similarity=0.028 Sum_probs=136.4 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcc-cccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccce Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGS-TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~ 208 (497) .....+..+........ ..+.... ..+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++ T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 78 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVA--ELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIAS 78 (357) T ss_pred CChHHHHHHHHHHHHHH--HHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccc Confidence 00111111111111100 0111110 01122346677778888888999999999999998764443332211111111 Q ss_pred eec--cc-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccce Q lcl|Aclame:pro 209 AVA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) Q Consensus 209 ~v~--Eg-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gi 282 (497) -+. -+ +..|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-. T Consensus 79 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 158 (357) T protein:vir:56 79 TTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDR 158 (357) T ss_pred cccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCCh Confidence 111 01 1122222345556666666666667888888753 67888888888888764333334455432111100 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhc---ccccccccccccccchhhhhhhh- Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---GAAGSGSGVAGSYPTAAEIAENV- 358 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~- 358 (497) ..... ..+.+..|..-.-.....+.+.... +.........+. ......+|.+ T Consensus 159 ---------------~~nPl--------lqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~-~gdy~NLDalV 214 (357) T protein:vir:56 159 ---------------SSNPM--------LQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGK-GGDYASLDALV 214 (357) T ss_pred ---------------hhCcC--------ccccchhHHHHHHhhchhhhhccccccCCccccceeeecC-CCCcccHHHHH Confidence 00000 0011111111111111111111000 000000000111 1122233333 Q ss_pred HHhhhhhhhhh-ccCCceEEEehhHHHH--HHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEe Q lcl|Aclame:pro 359 FDAFVDIQLTL-FQTPNAVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVG 435 (497) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~~~n~~~~~~--l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~g 435 (497) +++...+..+. .-.+...++=..++.+ ...|-+..+.+ ---.......-..+|-|+|.+.-+++|++.+++= T Consensus 215 ~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----TE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT 289 (357) T protein:vir:56 215 MDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQDN-----SEMLAADVIISQKRIGNLPAVRVPYFPADAMLIT 289 (357) T ss_pred HHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCCh-----HHHHHHHHHHHhhhhCCceeEEccccCCCceEEe Confidence 33444333333 3334444433333221 11121121111 0001111222245899999999999999999988 Q ss_pred eccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 436 HFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 436 d~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) -|+.+.+.. .+....-.+-+... +|++.-.=..--+..|-+++.+|.++-...+.+. T Consensus 290 ~L~NLsIY~-Q~gs~RR~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 346 (357) T protein:vir:56 290 KLENLSIYY-MDDSHRRVIEENPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFS 346 (357) T ss_pred eccccEEEE-ecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEeeeeeeccCC Confidence 877753321 22222222222111 2333332223334455555555555444333333 No 204 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=88.71 E-value=0.031 Score=28.88 Aligned_cols=278 Identities=12% Similarity=0.005 Sum_probs=111.0 Q ss_pred hhhcccccCCcccccc---hhhHHHHHHHhhhhHHh---------hccceecCCCceEEEEeecCCccce--eeccc--c Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPT---FLPGIVEQLFYELSLAD---------LISSRPVTSPNLSYLTESAAHNNAA--AVAEA--G 214 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~---~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~~~~a~--~v~Eg--~ 214 (497) ++.+ .-.-.++|+ +-+-+.+...+.+.+.+ +......++..+++|....-++... +-..+ + T Consensus 1 Ma~T---~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~ 77 (349) T protein:vir:78 1 MAIT---TIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CCce---EEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 2211 122345555 33333344444444332 1111233466788998765333221 21222 2 Q ss_pred ccccccc-cceeEEeeeeeeeeec--hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceecccccccc Q lcl|Aclame:pro 215 TYPFSSE-EFARVYEQVGKVANAL--TITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTA 291 (497) Q Consensus 215 ~~~~s~~-~~~~v~~~~~kia~~~--~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~ 291 (497) ..+..+. +..++-...+.--++. .++.++-- .+....|.+++++-..+.....+|. -.+||++....... T Consensus 78 ~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG--~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~a~~ 150 (349) T protein:vir:78 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTS--QNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhC--chHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhhcccccccc Confidence 2222232 3333333332222222 33333322 2556666777766555554444432 12233321100000 Q ss_pred chhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh--- Q lcl|Aclame:pro 292 SSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT--- 368 (497) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 368 (497) ... . ..+.. .. .. . ........++++...+-.+ T Consensus 151 ~~~------------------------~-----------~~~~t--~d-~s---~---~a~~~~~~~~dA~~~lgda~~G 186 (349) T protein:vir:78 151 AYH------------------------E-----------QNDMV--VD-VS---A---TLGFDAGAFIDATQTMGDALMG 186 (349) T ss_pred hhh------------------------h-----------cccce--ee-ec---c---ccCCChhhhhhhHHHHHHHhcc Confidence 000 0 00000 00 00 0 0000111111211111111 Q ss_pred -hccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcC------c---EEEeecc Q lcl|Aclame:pro 369 -LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------T---ILVGHFA 438 (497) Q Consensus 369 -~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~------~---~~~gd~~ 438 (497) ....-++++||...+..|++++-=+ |+ ++.. ....-++++|++|++++.||.. . ++||. T Consensus 187 d~~~~lt~i~mHS~v~~~L~~~~li~--~i-~~s~------~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~-- 255 (349) T protein:vir:78 187 NGGEVLGAIAMHSFVYAQARKAQLID--FI-RDAE------NNTMFATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQ-- 255 (349) T ss_pred ccccceeEEEEchHHHHHHHhhhhhh--hc-cCcc------cCcccceecCeEEEEeCCCccccCCCCceEEEEEeec-- Confidence 1223467999999999998663211 11 1110 0123468999999999999842 2 34443 Q ss_pred ceEEEEEeccc-cEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCC-------CCC Q lcl|Aclame:pro 439 PSVIQTARREG-VTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA-------TGS 497 (497) Q Consensus 439 ~~~~~i~~r~~-~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a-------~~~ 497 (497) +++...+-.. ..+++.+.....=..++..+....|+ ++||..|..-.-..+. .|. T Consensus 256 -GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~a~v~~~~~~~~~~sP 318 (349) T protein:vir:78 256 -GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTSAVITGNGTETIARSA 318 (349) T ss_pred -ceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---EeeeeeeeeccccccCCccccccCCC Confidence 3344433221 23444443321101345566666665 4566666654422221 111 No 205 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=88.22 E-value=0.034 Score=28.66 Aligned_cols=331 Identities=13% Similarity=0.048 Sum_probs=134.1 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcc-cccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccce Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGS-TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~ 208 (497) .....+..+........ ..+.... ..+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++ T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 78 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVA--ELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIAS 78 (357) T ss_pred CChHHHHHHHHHHHHHH--HHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccc Confidence 00111111111111100 0111110 01122346677778888888999999999999998764443332211111111 Q ss_pred eec--cc-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccce Q lcl|Aclame:pro 209 AVA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) Q Consensus 209 ~v~--Eg-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gi 282 (497) -+. -+ +-.|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-. T Consensus 79 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 158 (357) T protein:vir:60 79 TTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDR 158 (357) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCCh Confidence 111 01 1122222345556666666666677888888753 67888888888888764333334455432111100 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhc---ccccccccccccccchhhhhhhh- Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVT---GAAGSGSGVAGSYPTAAEIAENV- 358 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~- 358 (497) ..... ..+.+..|..-.-.....+.+.... +.........+. ......+|.+ T Consensus 159 ---------------~~nPl--------lqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~-~gdy~NLDalV 214 (357) T protein:vir:60 159 ---------------SSNQM--------LQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGK-GGDYASLDALV 214 (357) T ss_pred ---------------hhCcC--------ccccchhHHHHHHhhchhhhhccccccCCccccceeeecC-CCCcccHHHHH Confidence 00000 0011111111111111111111000 000000000111 1122233333 Q ss_pred HHhhhhhhhhh-ccCCceEEEehhHHH---HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEE Q lcl|Aclame:pro 359 FDAFVDIQLTL-FQTPNAVVMNPRDWE---LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV 434 (497) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~~~n~~~~~---~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~ 434 (497) +++...+..+. .-.+...++=..++. .+.++ +..+.+ ---.......-..+|-|+|.+.-+++|++.+++ T Consensus 215 ~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~-n~~~~p-----TE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llV 288 (357) T protein:vir:60 215 MDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIV-NREQDN-----SEMLAADVIISQKRIGNLPAVRVPYFPADAMLI 288 (357) T ss_pred HHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHh-hcCCCh-----HHHHHHHHHHHhhhhcCcceEEccccCCCceEE Confidence 33444333333 333444444333322 22222 111111 000111112234589999999999999999998 Q ss_pred eeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCC------CCC Q lcl|Aclame:pro 435 GHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA------TGS 497 (497) Q Consensus 435 gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a------~~~ 497 (497) =-|+.+.+.. .+....-.+-+... +|++.-.=..--+..|-+++.+|.++-...+ .+. T Consensus 289 T~L~NLsIY~-Q~gs~RR~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~~ 352 (357) T protein:vir:60 289 TKLENLSIYY-MDDSHRRVIEENPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKAT 352 (357) T ss_pred eeccccEEEE-ecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEeeeeeeccCcccccCC Confidence 8777753321 22222222222111 2222222222234444455555544422222 212 No 206 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=88.16 E-value=0.034 Score=28.63 Aligned_cols=329 Identities=11% Similarity=0.075 Sum_probs=140.9 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcccee Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~ 209 (497) .....+..+........ ..+.. ...+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++- T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv-~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagr 77 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIA--KLNGV-ERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVAST 77 (339) T ss_pred CChHHHHHHHHHHHHHH--HHhCc-ccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeec Confidence 00011111111100000 00001 1112223466777788888889999999999999987544333222110111111 Q ss_pred e--ccccccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceec Q lcl|Aclame:pro 210 V--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) Q Consensus 210 v--~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~ 284 (497) + .-++..|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-. T Consensus 78 tdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~-- 155 (339) T protein:vir:79 78 TDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDR-- 155 (339) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCCh-- Confidence 1 1112222222355555666666666667787877753 67888888888888764333334455432111100 Q ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhh-HHhhh Q lcl|Aclame:pro 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENV-FDAFV 363 (497) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 363 (497) ..... ..+.+..|..-.-.....+.... +...+......+.......+|.+ +++.. T Consensus 156 -------------~~nPl--------lqDVN~GWlQ~~Re~ap~rV~~~--g~~~s~~i~~~G~ggdy~NLDalV~d~~~ 212 (339) T protein:vir:79 156 -------------VANPM--------LQDVNKGWLQNLREQAPQRVMKE--GKAAAGKITVGGAGADYGNLDALVYDITN 212 (339) T ss_pred -------------hhCcC--------ccccchhHHHHHHhhhhhhhhcc--ceeccceeEeccCCCCcccHHHHHHHHHh Confidence 00000 00011111111110000011110 00000000110111122233333 33333 Q ss_pred hhhhhhcc-CCceEEEehhHHH---HHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccc Q lcl|Aclame:pro 364 DIQLTLFQ-TPNAVVMNPRDWE---LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAP 439 (497) Q Consensus 364 ~~~~~~~~-~~~~~~~n~~~~~---~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~ 439 (497) .+..+.+. .+...++=..++. .+.++ .....+ ---.......-..++-|+|.+.-|++|++.+++=-|+. T Consensus 213 ~lId~~~~~d~dLVvivG~dLla~k~~~l~-n~~~~p-----tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~N 286 (339) T protein:vir:79 213 HLVEPWYAEDPDLVVVCGRNLLSDKYFPLV-NRDRDP-----VQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDN 286 (339) T ss_pred ccCChHHhcCCCEEEEEchhhhhhHhhhHh-hcCCCh-----HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechh Confidence 33333333 3444443333321 22222 111111 00001112223358999999999999999999888877 Q ss_pred eEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 440 SVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 440 ~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++-.+.+.++ T Consensus 287 LsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 287 LSIYY-QEGGRRRTILDNA----KRDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred cEEEE-ecCcEEEEEEecc----ccccccchhhccceeeeeccccEEEeeeeecccCC Confidence 53321 2222222222222 13444333333346666677777776655555555 No 207 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=87.89 E-value=0.036 Score=28.51 Aligned_cols=309 Identities=14% Similarity=0.049 Sum_probs=133.8 Q ss_pred hhhhhhhhhhhhcc-cccCCccccc-chhhHHHHHHHhhhhHHhhccceecCCC---ceEEEEeecCCccceeeccc--- Q lcl|Aclame:pro 142 ETAPAAIGQNPFGS-TGTFAPGILP-TFLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTESAAHNNAAAVAEA--- 213 (497) Q Consensus 142 ~~~~~~~~~~~~~~-~~~~g~~i~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~p~~~~~~~~a~~v~Eg--- 213 (497) ....+.......++ .++.|.-+-. -+....+....+...+.+++...+++-+ ++.+-+..+-...-.-..|| T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 00111111111111 1111211111 1335666666667788899999999754 33333322211101112222 Q ss_pred --ccc-----------------------------ccccccceeEEeeeeeeeeechhhHHHHh-hH-HHHHHHHHHH-HH Q lcl|Aclame:pro 214 --GTY-----------------------------PFSSEEFARVYEQVGKVANALTITDEGLR-DA-PELFNFVQGR-LL 259 (497) Q Consensus 214 --~~~-----------------------------~~s~~~~~~v~~~~~kia~~~~iS~ell~-ds-~~l~~~i~~~-la 259 (497) .+. .....+-..+..+.++++.+..+|++++. +. +.+..-|..+ |. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 211 11123334566789999999999999876 33 4566655333 33 Q ss_pred HHHHHHH---HhhhhccCCCccccce---eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 260 EGIQRKE---EVQLLAGGGYPGVNGL---LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV 333 (497) Q Consensus 260 ~~~~~~~---d~a~l~G~g~~~~~Gi---l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (497) -+....+ -..+|++.++.--.|- ....+.-+.+.. ....+........+. .-+..+. T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t-~vt~~~l~rl~~~L~----------------~nRapk~ 223 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPS-VVSYKNLMRLDQILT----------------ENRTPTQ 223 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccc-eechhHHHHHHHHHH----------------hcccccc Confidence 3333333 3456655433210011 110000000000 000000000000000 0000000 Q ss_pred hcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccccccccc--cc Q lcl|Aclame:pro 334 VTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNP--VN 411 (497) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~--~~ 411 (497) ... +......-+... ...-+.++|+.....|+-++|-.|.+-|.+..-.+...+ .+ T Consensus 224 t~~------------------i~~s~~~dTk~i----~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~g 281 (401) T protein:vir:95 224 TTI------------------ITGSRMIDTKVI----GATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNG 281 (401) T ss_pred hhh------------------hhhhhccCcccc----ccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccc Confidence 000 000000000000 112245679988888988888888777777665544443 23 Q ss_pred ccccccccceeecCCCC--------cC---------------------cEEEeeccceEEEEEecc--cc----EEE--E Q lcl|Aclame:pro 412 GGKNIWGVPVVTTPLIP--------LG---------------------TILVGHFAPSVIQTARRE--GV----TMQ--M 454 (497) Q Consensus 412 ~~~~l~G~pvv~s~~~~--------~~---------------------~~~~gd~~~~~~~i~~r~--~~----~i~--~ 454 (497) .-..|.++++++++.+- ++ ..++|.-. |....-. +. .+. . T Consensus 282 EiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dA---f~~~~l~g~g~~~~~~~ivk~ 358 (401) T protein:vir:95 282 EVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDS---FTSIGFQTDGKSLKFTVMTKM 358 (401) T ss_pred cccccCceeEEecccceeecCCcccccccccccccccccCCCcceeeeeeEEcccc---ceecccccCCccccceeEeec Confidence 44567789999988742 21 12334322 1111111 11 121 2 Q ss_pred ecc----chhhhhcCceEEEEEeee-ccEeecccceEEEEecCCC Q lcl|Aclame:pro 455 TNS----NGTDFVDGKVTVRAEERL-GLLVYRPSAFQLIQLKKGA 494 (497) Q Consensus 455 ~~~----~~~~f~~~~v~~r~~~r~-~~~v~~~~Af~~~~~~~~a 494 (497) ..+ .++ .-||.++..+.-+ .+.+.+++-.++|.-.+-- T Consensus 359 pG~~~ad~~D--PlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~~ 401 (401) T protein:vir:95 359 PGKETADRND--PYGETGFSSIKWYYGILVKRPERLALIKTVAPL 401 (401) T ss_pred CCcCCCCCCC--cccceehhhhhhhhhhheeccceeEEEEeecCC Confidence 211 011 2456666655444 7788889988888744444 No 208 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=87.48 E-value=0.038 Score=28.34 Aligned_cols=314 Identities=15% Similarity=0.057 Sum_probs=128.4 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhh--hhcccccCCcccccchhh-HHHHHHH--hhhhHH Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQN--PFGSTGTFAPGILPTFLP-GIVEQLF--YELSLA 182 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~i~~~~~~-~ii~~~~--~~~~l~ 182 (497) +....+. ....... ...+.+.. .++.... ....+..+|+.+--+.+. +|-.+.. +...+. T Consensus 1 ~~~~~~~---~~~~~~~----------~~~~~e~~--~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~ 65 (463) T protein:vir:99 1 MTIEKNL---SDVQQKY----------ADQFQEDV--VKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) T ss_pred CCccccc---chHHHHH----------HhhhhHHH--HHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhh Confidence 0000000 0000000 00000000 0000000 011122234444333222 2222221 222445 Q ss_pred hhccceecCCCceEEEEeec--CCccceeeccccccccccccceeEEeeeeeeeeechhhHHH-HhhH-HHHHHHHHHHH Q lcl|Aclame:pro 183 DLISSRPVTSPNLSYLTESA--AHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRL 258 (497) Q Consensus 183 ~~~~~~~~~~~~~~~p~~~~--~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~el-l~ds-~~l~~~i~~~l 258 (497) .-+...++.+.--+|-.+.. ..+.+.+++|++..+.+++++...+...|-|+....+|.-+ |.++ .+.+....++- T Consensus 66 ~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~da 145 (463) T protein:vir:99 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) T ss_pred hhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHH Confidence 55667777665445544443 33457899999999999999999999999999988888754 4454 47888888898 Q ss_pred HHHHHHHHHhhhhccCCCccc----cceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 259 LEGIQRKEEVQLLAGGGYPGV----NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVV 334 (497) Q Consensus 259 a~~~~~~~d~a~l~G~g~~~~----~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (497) .-.++..++.+.++|+..-.| .|+- . +.. ..+....+. T Consensus 146 i~~ia~tiE~a~FyGds~l~~~~~~~gle-F--------------DGl-----------------------~~lId~env 187 (463) T protein:vir:99 146 IAVVAKTIEWASFYGDASLTSEVEGEGLE-F--------------DGL-----------------------AKLIDKNNV 187 (463) T ss_pred HHHHHHHHHHHHhhhhhccCCCcCccccc-h--------------hhh-----------------------hhhcCCCCe Confidence 999999999999999853222 1110 0 000 000000000 Q ss_pred cccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccc-cccccccccc Q lcl|Aclame:pro 335 TGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNPVNGG 413 (497) Q Consensus 335 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~~~~~ 413 (497) ....+... ....+..+. ......|..++-.+|+..+.+.|..---..-|.+..+..+ ...|.++... T Consensus 188 iDarG~~L-----------s~~~ln~Aa-~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f 255 (463) T protein:vir:99 188 INAKGNQL-----------TEKHLNEAA-VRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGF 255 (463) T ss_pred eecCCCcc-----------cHHHHhhhh-hhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccce Confidence 00000000 000011111 1112345556667777777777653211111111111110 0111111111 Q ss_pred ccccc-----------cceeecCCCCcCcEEEeeccceEEEEEeccccEEEEecc-chhhh-h--cCceEEEEEeeeccE Q lcl|Aclame:pro 414 KNIWG-----------VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNS-NGTDF-V--DGKVTVRAEERLGLL 478 (497) Q Consensus 414 ~~l~G-----------~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~-~~~~f-~--~~~v~~r~~~r~~~~ 478 (497) -+-.| .|-+..... ..+.++|... .++..++.. .+..| . .....|++...-+.. T Consensus 256 ~s~~G~I~L~~s~~m~~~~il~~~~---~~~p~ap~~~--------~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~g 324 (463) T protein:vir:99 256 YSSRGFIKLHGSTVMENELILDESL---QPLPNAPQPA--------KVTATVETKQKGAFENEEDRAGLSYKVVVNSDDA 324 (463) T ss_pred eeeeeeeeeCCceecCCcccccchh---hcCCCCccCc--------eeEEEEeeccCCCCCCcccccceEEEEEEECCCC Confidence 11111 111110000 0111122211 112222221 11112 1 223456666555555 Q ss_pred eecccceEEEE-----------ecCCCCCC Q lcl|Aclame:pro 479 VYRPSAFQLIQ-----------LKKGATGS 497 (497) Q Consensus 479 v~~~~Af~~~~-----------~~~~a~~~ 497 (497) =-.|+.++-.| +...+.++ T Consensus 325 eS~pS~ivtaT~a~~~~gv~l~It~~a~~~ 354 (463) T protein:vir:99 325 QSAPSEEVTATVSNVDDGVKLSINVNAMYQ 354 (463) T ss_pred CcccchheeeeeeeccceEEEEEEecCCcc Confidence 55555553333 22222222 No 209 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=87.48 E-value=0.038 Score=28.34 Aligned_cols=314 Identities=15% Similarity=0.057 Sum_probs=128.4 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhh--hhcccccCCcccccchhh-HHHHHHH--hhhhHH Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQN--PFGSTGTFAPGILPTFLP-GIVEQLF--YELSLA 182 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~i~~~~~~-~ii~~~~--~~~~l~ 182 (497) +....+. ....... ...+.+.. .++.... ....+..+|+.+--+.+. +|-.+.. +...+. T Consensus 1 ~~~~~~~---~~~~~~~----------~~~~~e~~--~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~ 65 (463) T protein:vir:95 1 MTIEKNL---SDVQQKY----------ADQFQEDV--VKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) T ss_pred CCccccc---chHHHHH----------HhhhhHHH--HHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhh Confidence 0000000 0000000 00000000 0000000 011122234444333222 2222221 222445 Q ss_pred hhccceecCCCceEEEEeec--CCccceeeccccccccccccceeEEeeeeeeeeechhhHHH-HhhH-HHHHHHHHHHH Q lcl|Aclame:pro 183 DLISSRPVTSPNLSYLTESA--AHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRL 258 (497) Q Consensus 183 ~~~~~~~~~~~~~~~p~~~~--~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~el-l~ds-~~l~~~i~~~l 258 (497) .-+...++.+.--+|-.+.. ..+.+.+++|++..+.+++++...+...|-|+....+|.-+ |.++ .+.+....++- T Consensus 66 ~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~da 145 (463) T protein:vir:95 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) T ss_pred hhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHH Confidence 55667777665445544443 33457899999999999999999999999999988888754 4454 47888888898 Q ss_pred HHHHHHHHHhhhhccCCCccc----cceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 259 LEGIQRKEEVQLLAGGGYPGV----NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVV 334 (497) Q Consensus 259 a~~~~~~~d~a~l~G~g~~~~----~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (497) .-.++..++.+.++|+..-.| .|+- . +.. ..+....+. T Consensus 146 i~~ia~tiE~a~FyGds~l~~~~~~~gle-F--------------DGl-----------------------~~lId~env 187 (463) T protein:vir:95 146 IAVVAKTIEWASFYGDASLTSEVEGEGLE-F--------------DGL-----------------------AKLIDKNNV 187 (463) T ss_pred HHHHHHHHHHHHhhhhhccCCCcCccccc-h--------------hhh-----------------------hhhcCCCCe Confidence 999999999999999853222 1110 0 000 000000000 Q ss_pred cccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccc-cccccccccc Q lcl|Aclame:pro 335 TGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNPVNGG 413 (497) Q Consensus 335 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~~~~~ 413 (497) ....+... ....+..+. ......|..++-.+|+..+.+.|..---..-|.+..+..+ ...|.++... T Consensus 188 iDarG~~L-----------s~~~ln~Aa-~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f 255 (463) T protein:vir:95 188 INAKGNQL-----------TEKHLNEAA-VRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGF 255 (463) T ss_pred eecCCCcc-----------cHHHHhhhh-hhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccce Confidence 00000000 000011111 1112345556667777777777653211111111111110 0111111111 Q ss_pred ccccc-----------cceeecCCCCcCcEEEeeccceEEEEEeccccEEEEecc-chhhh-h--cCceEEEEEeeeccE Q lcl|Aclame:pro 414 KNIWG-----------VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNS-NGTDF-V--DGKVTVRAEERLGLL 478 (497) Q Consensus 414 ~~l~G-----------~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~-~~~~f-~--~~~v~~r~~~r~~~~ 478 (497) -+-.| .|-+..... ..+.++|... .++..++.. .+..| . .....|++...-+.. T Consensus 256 ~s~~G~I~L~~s~~m~~~~il~~~~---~~~p~ap~~~--------~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~g 324 (463) T protein:vir:95 256 YSSRGFIKLHGSTVMENELILDESL---QPLPNAPQPA--------KVTATVETKQKGAFENEEDRAGLSYKVVVNSDDA 324 (463) T ss_pred eeeeeeeeeCCceecCCcccccchh---hcCCCCccCc--------eeEEEEeeccCCCCCCcccccceEEEEEEECCCC Confidence 11111 111110000 0111122211 112222221 11112 1 223456666555555 Q ss_pred eecccceEEEE-----------ecCCCCCC Q lcl|Aclame:pro 479 VYRPSAFQLIQ-----------LKKGATGS 497 (497) Q Consensus 479 v~~~~Af~~~~-----------~~~~a~~~ 497 (497) =-.|+.++-.| +...+.++ T Consensus 325 eS~pS~ivtaT~a~~~~gv~l~It~~a~~~ 354 (463) T protein:vir:95 325 QSAPSEEVTATVSNVDDGVKLSINVNAMYQ 354 (463) T ss_pred CcccchheeeeeeeccceEEEEEEecCCcc Confidence 55555553333 22222222 No 210 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=86.44 E-value=0.046 Score=27.94 Aligned_cols=333 Identities=12% Similarity=0.022 Sum_probs=135.4 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcc-cccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccce Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGS-TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~ 208 (497) .....+..+........ ..+.... ..+-...|.|.....+...+.+.+.+++++++++++--.....-.....+-++ T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 78 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVA--ELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIAS 78 (357) T ss_pred CChHHHHHHHHHHHHHH--HHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccc Confidence 00111111111111100 0111110 01122346677778888888999999999999998764443332211111111 Q ss_pred eec--cc-cccccccccceeEEeeeeeeeeechhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCccccce Q lcl|Aclame:pro 209 AVA--EA-GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) Q Consensus 209 ~v~--Eg-~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gi 282 (497) -+. -+ +..|..-..++.-.+.+++.---+.|+-+.|+.. ++|...+++.+.++++.=.=.--+||+.....+-. T Consensus 79 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 158 (357) T protein:vir:20 79 TTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDR 158 (357) T ss_pred cccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCCh Confidence 111 01 1122222345556666666666667888888753 67888888888888764333334455432111100 Q ss_pred eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccc--ccccccccchhhhhhhh-H Q lcl|Aclame:pro 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSG--SGVAGSYPTAAEIAENV-F 359 (497) Q Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-~ 359 (497) ..... ..+.+..|..-.-.....+.+.......+.. .....+.......+|.+ + T Consensus 159 ---------------~~nPl--------lqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~ 215 (357) T protein:vir:20 159 ---------------SSNPM--------LQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVM 215 (357) T ss_pred ---------------hhCcC--------ccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHH Confidence 00000 0011111111111111111111000000000 00001111122233333 3 Q ss_pred Hhhhhhhhhh-ccCCceEEEehhHHHH--HHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEee Q lcl|Aclame:pro 360 DAFVDIQLTL-FQTPNAVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) Q Consensus 360 ~~~~~~~~~~-~~~~~~~~~n~~~~~~--l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd 436 (497) ++...+..+. .-.+...++=-.++.+ ...|-+..+.+ ---.......-..+|-|+|.+.-+++|++.+++=- T Consensus 216 D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~p-----tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~ 290 (357) T protein:vir:20 216 DATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQDN-----SEMLAADVIISQKRIGNLPAVRVPYFPADAMLITK 290 (357) T ss_pred HHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCCh-----HHHHHHHHHHHhhhhCCceeEEccccCCCceEEee Confidence 3444333333 3334444433333221 11121121111 00011112222458999999999999999999888 Q ss_pred ccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 437 ~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) |+.+.+.. .+....-.+-+... +|++.-.=..--+..|-+++.+|.++-...+... T Consensus 291 L~NLsIY~-Q~gs~RR~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~ 346 (357) T protein:vir:20 291 LENLSIYY-MDDSHRRVIEENPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFS 346 (357) T ss_pred ccccEEEE-ecCcEEEEEEeccc----cccccchhhhcceeeeeccccEEEeeeeeecccc Confidence 77753321 22222222222111 2333332223334455555555555432222221 No 211 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=85.93 E-value=0.049 Score=27.76 Aligned_cols=264 Identities=11% Similarity=-0.036 Sum_probs=109.7 Q ss_pred hhhcccccCCccc-ccchhhHHHHHHHhhhhHHhhcccee-----cCCCceEEEEeecCCccceeeccccccccccccce Q lcl|Aclame:pro 151 NPFGSTGTFAPGI-LPTFLPGIVEQLFYELSLADLISSRP-----VTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFA 224 (497) Q Consensus 151 ~~~~~~~~~g~~i-~~~~~~~ii~~~~~~~~l~~~~~~~~-----~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~ 224 (497) ++ .....++ |.-+...+++.+++.+++.++++.-. -.+.++++|+... .-+.++......+.+-+ T Consensus 1 m~----~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~-----~~v~dg~~~~~~~~te~ 71 (418) T protein:vir:10 1 MA----VQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR-----VKSASGRTLVKQPMVDQ 71 (418) T ss_pred CC----ccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc-----eeecccCCccccccccc Confidence 11 1112334 33456788899998888877775521 1245788887432 12334544444455555 Q ss_pred eEEee--eeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHH Q lcl|Aclame:pro 225 RVYEQ--VGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) Q Consensus 225 ~v~~~--~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) .+++. -+|...+..=..|...+..++...+.+...++++..+|..++.- -.+.+ +..+ + T Consensus 72 ~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l-~~~a~----~~~g--t------------ 132 (418) T protein:vir:10 72 TIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALT-LKKAF----HSSG--T------------ 132 (418) T ss_pred eEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhcc----cccc--c------------ Confidence 54444 44444444444555556667777777788899999999876520 00000 0000 0 Q ss_pred HHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccC-C-ceEEEeh Q lcl|Aclame:pro 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQT-P-NAVVMNP 380 (497) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~n~ 380 (497) . ...+ ...+.+..+...+....--. . -..+++| T Consensus 133 -----------------------------------~-------gt~~---~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P 167 (418) T protein:vir:10 133 -----------------------------------P-------GVRP---GAFIDFANAGAKQTTYAVPQDGMRHAVLDP 167 (418) T ss_pred -----------------------------------C-------CcCc---chHHHHHHHHHHHHhcCCCCCCceEEEeCH Confidence 0 0000 01122222222222211111 1 2456888 Q ss_pred hHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCc-------EEE-eeccceEEEEEeccccEE Q lcl|Aclame:pro 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT-------ILV-GHFAPSVIQTARREGVTM 452 (497) Q Consensus 381 ~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~-------~~~-gd~~~~~~~i~~r~~~~i 452 (497) ..+..| +++... .+..... ....-.+.-.++.|+.|+.|+++|..+ ..+ |-.. .+..+... . T Consensus 168 ~~~~~L--~~~~~~--~~~~~~~-~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~-~~~~~~~~----~ 237 (418) T protein:vir:10 168 FTCASL--SDEVTK--LFKESMV-EQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVV-NGDTVGFD----G 237 (418) T ss_pred HHHHHH--hhhccc--ccccccc-chhhheeeeeeeeceEEEEecCCCcccccccccceeeecccc-cceeEEEe----e Confidence 877655 344332 2222111 111112223479999999999999522 111 1111 11111100 0 Q ss_pred EEeccchhhhhcCceEEEE---EeeeccEee-cccceEEEEec-CCCCCC Q lcl|Aclame:pro 453 QMTNSNGTDFVDGKVTVRA---EERLGLLVY-RPSAFQLIQLK-KGATGS 497 (497) Q Consensus 453 ~~~~~~~~~f~~~~v~~r~---~~r~~~~v~-~~~Af~~~~~~-~~a~~~ 497 (497) ......+..-.-|.+.|-. ..++...+. ++.-|+...-. +.+.+. T Consensus 238 ~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~ 287 (418) T protein:vir:10 238 GTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGA 287 (418) T ss_pred cceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCc Confidence 0000000000001112211 001111110 22233222211 111111 No 212 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=84.90 E-value=0.057 Score=27.41 Aligned_cols=326 Identities=9% Similarity=-0.009 Sum_probs=131.0 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCC Q lcl|Aclame:pro 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP 193 (497) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 193 (497) ... ......+..+........ ..+.. ...+....|.|.....+...+.+.+.+++++++++++-- T Consensus 1 m~~------------~m~~~tr~~~~~y~~~~A--~~ngv-~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~ 65 (341) T protein:vir:27 1 MSQ------------ILTQSAREYMDNFAQQLA--KSYGV-SNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQI 65 (341) T ss_pred Ccc------------cccHHHHHHHHHHHHHHH--HHcCc-ccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccce Confidence 000 000001111111110000 00111 111223446777778888999999999999999998754 Q ss_pred ceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhh------HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 194 NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRD------APELFNFVQGRLLEGIQRKEE 267 (497) Q Consensus 194 ~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~d------s~~l~~~i~~~la~~~~~~~d 267 (497) .....-.....+-++-+. .+..|. ++.++...+.+++.---+.|+-+.|+. -+++...+++.+.++++.=.= T Consensus 66 ~Ge~v~lg~~g~iagrtd-t~R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i 143 (341) T protein:vir:27 66 EGQVVDVGVSGLYTGRKA-GGRFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIM 143 (341) T ss_pred eeeEeecccccceeeccC-CCceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhh Confidence 433332211111111111 122222 235566666666665566667666653 156888888888888765333 Q ss_pred hhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccccccc Q lcl|Aclame:pro 268 VQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGS 347 (497) Q Consensus 268 ~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 347 (497) .--+||+.....+-.-..+. ..+. +--|+...+.... ......+....+ T Consensus 144 ~IGfnGts~A~~Td~~anPl-----------------------lqDV------NkGWlQ~~Re~a~--~rVl~~~~~~~g 192 (341) T protein:vir:27 144 RIGWNGVSAEADTDPSANPL-----------------------GQDV------NEGWIAFVKNRKA--SQVVDVDVYFDE 192 (341) T ss_pred hhcccceeeccCCChhhccc-----------------------cccc------chhHHHHHHhhcc--cceeccceeecc Confidence 34445543211110000000 0000 1111111111110 000001111111 Q ss_pred ccchhhhhhh-hHHhhhhhhhhhcc-CCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecC Q lcl|Aclame:pro 348 YPTAAEIAEN-VFDAFVDIQLTLFQ-TPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTP 425 (497) Q Consensus 348 ~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~ 425 (497) .......+|. +.++...+..+.+. .+...++=..+..+ +..-+.+-.....+.......-..+|.|+|.+..+ T Consensus 193 ~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla-----~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~P 267 (341) T protein:vir:27 193 TNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIG-----AAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPP 267 (341) T ss_pred CCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhh-----hhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEcc Confidence 1222233443 34444443333333 23333332222211 11111111111111111111113589999999999 Q ss_pred CCCcCcEEEeeccceEEEEEeccccEEEEecc-chhhhhc-CceEEEEEeeeccEeecccceEEEEecCCC--CCC Q lcl|Aclame:pro 426 LIPLGTILVGHFAPSVIQTARREGVTMQMTNS-NGTDFVD-GKVTVRAEERLGLLVYRPSAFQLIQLKKGA--TGS 497 (497) Q Consensus 426 ~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~-~~~~f~~-~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a--~~~ 497 (497) .+|.+.+++=-|+.+.+.. .+....-.+-+. ..+.++. +. +|.++. ......-.|..+++.+.| ++| T Consensus 268 ffP~~~~lVT~L~NLsIY~-Q~gs~RR~~~d~p~r~rie~yes-~YvVEd---yg~~~~~~~~~vkl~~~~~~~~~ 338 (341) T protein:vir:27 268 FLPDNAMVVTIPENLQVLT-QHGTAQRKAKHESDRKRSKTHTG-AWKVTQ---WVCWKRSPLTTQKKSTSALNHRS 338 (341) T ss_pred ccCCCceEEeeccceEEEE-ecCcEEEEEEeccccccccchhh-hheeeh---hhhhhhccccccccCcccccccc Confidence 9999999988777753322 122222222111 1111111 11 343332 233333344444444333 455 No 213 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=84.81 E-value=0.058 Score=27.38 Aligned_cols=397 Identities=11% Similarity=-0.035 Sum_probs=86.3 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEK---KEALAKIEPDFKAHQAEVEAHERAQEM-----LKSLGGADAAKDG 72 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~---~~~~~~~~~~~~~~~~~~e~~e~~~e~-----~~~~~~~~a~~~~ 72 (497) -..+.+++++++...+++++...+..+...++ ++.+.++.++++..++.++..++..+. .........+... T Consensus 7 k~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~ 86 (437) T protein:vir:10 7 KKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPELEE 86 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33344444555554444554443333222222 222222333333322222221111000 0000000000000 Q ss_pred HHHHHHHHhHHHH-HHHHHHHHhhh---hhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHH-HHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 73 LDNDIPEVEVRNL-KQIRKHLARAV---IMNPELKNATSFEKGTKFDVSFNVSAKAADPGTA-AAELMGAFADGETAPAA 147 (497) Q Consensus 73 ~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 147 (497) ........+.... ........... .................................. ....+...... T Consensus 87 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~------ 160 (437) T protein:vir:10 87 NSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIA------ 160 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcc------ Confidence 0000000000000 00000000000 0000000000000000000000000000000000 00000000000 Q ss_pred hhhhhhccccc-CCcccccchhhHHHHHHHhhhhHH---hhccceec---CCCceEEEEeecCCccceeecccccccccc Q lcl|Aclame:pro 148 IGQNPFGSTGT-FAPGILPTFLPGIVEQLFYELSLA---DLISSRPV---TSPNLSYLTESAAHNNAAAVAEAGTYPFSS 220 (497) Q Consensus 148 ~~~~~~~~~~~-~g~~i~~~~~~~ii~~~~~~~~l~---~~~~~~~~---~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~ 220 (497) ........+. ....+. .. .. ...++....+. .....+++ .++...+..+ .....|........ T Consensus 161 -~~~~g~lvp~~~~~~i~-~~-~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e------~~~~~e~~~~~~~~ 230 (437) T protein:vir:10 161 -LKDGKVIIPETILTPEK-EV-HQ-FPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTE------YGQTTKNATPVITP 230 (437) T ss_pred -cccccccchHHHHHHHH-Hh-hh-hhhhhhcceeEeeccCceeeEEeeccccccccccc------ccccccccccccee Confidence 0000000000 000000 00 00 01111111111 11111111 1111111111 11122221111122 Q ss_pred ccceeEEeeee-eeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHH----HHhhhhccCCCccccceeccccccccchhh Q lcl|Aclame:pro 221 EEFARVYEQVG-KVANALTITDEGLRDAPELFNFVQGRLLEGIQRK----EEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) Q Consensus 221 ~~~~~v~~~~~-kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~----~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~ 295 (497) .+|..-.+... ++.-- .+.+....-...+...|...++..+..+ .....-.++++.....+.... T Consensus 231 v~~~~~k~~~~~~is~e-ll~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~--------- 300 (437) T protein:vir:10 231 ILWDLKTYTGGYVFSQE-LISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVL--------- 300 (437) T ss_pred eeeehhheeeehhhhHH-HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHH--------- Confidence 22221111110 11110 0111111111223334444444443333 322222333333333332110 Q ss_pred hhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCce Q lcl|Aclame:pro 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNA 375 (497) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 375 (497) ...+...+.....|+++...+..+...++..|.+...+......+.........+...............+ T Consensus 301 ---------~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 371 (437) T protein:vir:10 301 ---------NVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNI 371 (437) T ss_pred ---------HhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEE Confidence 00111222344568888888888888888888777655444332221111100000000000000000112 Q ss_pred EEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCc-CcEEEeeccceEEEEEeccccE--- Q lcl|Aclame:pro 376 VVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL-GTILVGHFAPSVIQTARREGVT--- 451 (497) Q Consensus 376 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~-~~~~~gd~~~~~~~i~~r~~~~--- 451 (497) ++.+...+..+ -+ -.|+-|..++.... .+.+.+-+.. ...+.+-..+. T Consensus 372 ~~gd~~~~~~~---~~------------------------r~~~~~~~~~~~~~~~~~~~~~~r~-d~~~~~~~a~~~l~ 423 (437) T protein:vir:10 372 VVAPLKKAVIN---FK------------------------LTEITGQFQDTYDIWYKQLGIFLRQ-NVVQASKDLIVNLT 423 (437) T ss_pred EEeeccccEEE---Ee------------------------eeceEEEEecccccccceeeEEEEE-ccEEecccceEEEE Confidence 22222111000 00 00222222211111 1111111110 01111111111 Q ss_pred -----EEEeccchhhhhcCceEE Q lcl|Aclame:pro 452 -----MQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 452 -----i~~~~~~~~~f~~~~v~~ 469 (497) +.++.... + T Consensus 424 ~~~~~~~~~~~~~---------~ 437 (437) T protein:vir:10 424 GKLKAVTVVQSTA---------V 437 (437) T ss_pred eeccccccCCCCC---------C Confidence 11111111 1 No 214 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=83.50 E-value=0.068 Score=26.99 Aligned_cols=321 Identities=12% Similarity=-0.037 Sum_probs=128.5 Q ss_pred HHHHHHHHHHhhhhhhhhhhhhhhcc---cccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCcc Q lcl|Aclame:pro 130 AAAELMGAFADGETAPAAIGQNPFGS---TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN 206 (497) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~ 206 (497) .....+..+........ ..+.... ..+--..|.|.....+...+.+.+.++++++++++..-...+.-....... T Consensus 1 M~~~tr~~~~~y~~~~A--~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~ 78 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAA--EYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRH 78 (343) T ss_pred CChHHHHHHHHHHHHHH--HHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccc Confidence 00111111111111100 0111110 111124467777788888888989999999999886432233221111111 Q ss_pred ceeecc-ccccccccccceeEEeeeeeeeeechhhHHHHhhH---HH-HHHHHHHHHHHHHHHHHHhhhhccCCCccccc Q lcl|Aclame:pro 207 AAAVAE-AGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PE-LFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNG 281 (497) Q Consensus 207 a~~v~E-g~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds---~~-l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~G 281 (497) ++-... +...... ..+.-.+.+++.---+.|+-+.|+.. ++ |...+++.+.++++.=.=.--+||+.....+ T Consensus 79 t~r~~t~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T- 155 (343) T protein:vir:98 79 YGAHDRRTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDT- 155 (343) T ss_pred cCccccCCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCC- Confidence 111111 1111111 11111344444444556777777653 56 7788888887777543323344554322111 Q ss_pred eeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccc---cccccccccccchhhhhhhh Q lcl|Aclame:pro 282 LLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA---GSGSGVAGSYPTAAEIAENV 358 (497) Q Consensus 282 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 358 (497) + .+ ...+.+.. |+-..+......... ........+.......+|.+ T Consensus 156 --~------nP-----------------llqDVN~G------WLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDal 204 (343) T protein:vir:98 156 --S------DP-----------------NLADVNKG------WIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDEL 204 (343) T ss_pred --C------Cc-----------------chhhcchH------HHHHHHhcchhhhhccceeccceeEecCCCCcccHHHH Confidence 0 00 00011111 111111111100000 00000000111112223332 Q ss_pred HHhhhhhhhhh-ccCCceEEEehhHHHHHH--HHhcccCcccccccccccc--cccccccccccccceeecCCCCcCcEE Q lcl|Aclame:pro 359 FDAFVDIQLTL-FQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAY--GNPVNGGKNIWGVPVVTTPLIPLGTIL 433 (497) Q Consensus 359 ~~~~~~~~~~~-~~~~~~~~~n~~~~~~l~--~lkd~~G~~~~~~~~~~~~--~~~~~~~~~l~G~pvv~s~~~~~~~~~ 433 (497) .........+. .-.+...++--.++.+-. .+-...++ ..+.. .....-..++-|+|.+.-|++|++.++ T Consensus 205 V~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~------~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~ll 278 (343) T protein:vir:98 205 AYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGL------IATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAI 278 (343) T ss_pred HHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCC------ChHHHHHHHHHHHHHhhCCCeeEEccccCCCceE Confidence 22222233333 333444444333332211 11111111 11111 111223357899999999999999999 Q ss_pred EeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 434 VGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 434 ~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +=-|+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++....+-+. T Consensus 279 VT~L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 337 (343) T protein:vir:98 279 VTSLSNLSIYT-QEGSMRRGMKDDD----DKKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLSS 337 (343) T ss_pred EeeccccEEEE-ecCcEEEEEEecc----ccccccchhhhcceeeeeccccEEEeeeeeeeecC Confidence 88887753321 2222222222222 13333333333345566677766665543322222 No 215 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=80.07 E-value=0.098 Score=26.11 Aligned_cols=278 Identities=14% Similarity=0.079 Sum_probs=136.8 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhH-Hhhcc-ceecC-CCceEEEEeecCCccceeeccccccccccccceeEE Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSL-ADLIS-SRPVT-SPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVY 227 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l-~~~~~-~~~~~-~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~ 227 (497) +..+ +..-.+|..++..+.|.+.....-| ....+ +...+ +.++.+|.. +++...--.|.+...-.....++|+ T Consensus 1 ~~~T--SNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti--Gs~~~~~~~E~~~~~~~~i~TGEIt 76 (313) T protein:vir:95 1 MQLT--SNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI--GSVTLQEAEEDTPLIYNPIETGEIT 76 (313) T ss_pred Cccc--ccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc--CceeeeccccCCCeeecccccceEE Confidence 2112 2223567777666655554433211 12222 34443 345777753 2344444455555556667889999 Q ss_pred eeeeeeeeec-hhhHHHHhhHHH---HHHHHHHHHHHHHHHHHHhhhhc-cC----CCccccceeccccccccchhhhhh Q lcl|Aclame:pro 228 EQVGKVANAL-TITDEGLRDAPE---LFNFVQGRLLEGIQRKEEVQLLA-GG----GYPGVNGLLQRSTGFTASSASSLF 298 (497) Q Consensus 228 ~~~~kia~~~-~iS~ell~ds~~---l~~~i~~~la~~~~~~~d~a~l~-G~----g~~~~~Gil~~~~~~~~~~~~~~~ 298 (497) +....+++-. .||+.|-+|+-. +.+.+..+-+++|....+..||. |. |.+.|.-|...+-.+..+.+. T Consensus 77 ~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~--- 153 (313) T protein:vir:95 77 FQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETN--- 153 (313) T ss_pred EEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCC--- Confidence 9999998854 899999999844 55566666677787777777664 22 111222221111111100000 Q ss_pred hHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEE Q lcl|Aclame:pro 299 GATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVM 378 (497) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (497) ..-...-+-.+..++.+...+ ...-++++ T Consensus 154 -------------------------------------------------~~~~~~~~~~~~~~~~~a~~P--~~G~v~Iv 182 (313) T protein:vir:95 154 -------------------------------------------------GVFALKHLIAMRLAFDKANVP--AEGRVFIV 182 (313) T ss_pred -------------------------------------------------ceehhhHHHHhhhhhhhccCC--ccceEEEE Confidence 000001112222333333222 23446777 Q ss_pred ehhHHHHHHHHhc------ccCcccccccccccccccccccccccccceeecCCCC---------cCcEEEeeccce--- Q lcl|Aclame:pro 379 NPRDWELLRLTKD------ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP---------LGTILVGHFAPS--- 440 (497) Q Consensus 379 n~~~~~~l~~lkd------~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~---------~~~~~~gd~~~~--- 440 (497) .|.....|..+.+ .+|++|......- ...--..+.|.-+.+|+-+. .+..++|++=+. T Consensus 183 DP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~----~~~Fi~~~YG~Di~~SN~L~~AN~~D~~tT~~G~~~NlFM~i~D 258 (313) T protein:vir:95 183 DPVAEATLNGLVTITHDVTDFGKMILESGMAR----GQRFIMNLYGWDILTSNRLHVANYNDGTTTGNGYVGNLFMCILD 258 (313) T ss_pred cchhhhhhhhhheeecccccccceeeeccCCc----hhHHHHHHhhhhhhhhhhhhhccccccccccCceeeeeeeeeec Confidence 8888887776643 4566654332111 11122356677777776543 233555543211 Q ss_pred -----EEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCC Q lcl|Aclame:pro 441 -----VIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGAT 495 (497) Q Consensus 441 -----~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~ 495 (497) ++.-|.|+ .+-+..... +-..+... +..|.++.+.+-+-.+.+--.++|- T Consensus 259 ~~~~P~~~AWr~M-P~s~~~~~~--~~~~~~~~--~~~R~G~Gi~R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 259 DQTKPIMGAWRRM-PKSEGERNK--DRARDEHV--VRCRYGFGIQRLDTLGLLATSATAY 313 (313) T ss_pred ccccceeeeeccc-ccccccccc--ccccccce--eeeeecccceeecceeEEEeccccC Confidence 11122221 111111111 11133444 4558888888877777765555555 No 216 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=78.94 E-value=0.11 Score=25.85 Aligned_cols=286 Identities=10% Similarity=-0.012 Sum_probs=110.8 Q ss_pred hhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHh Q lcl|Aclame:pro 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY 177 (497) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~ 177 (497) .....+++..+.+ ..... | ..-+.......-.+....+++.+.. T Consensus 1 ~~~~~~~~~~~~~----~~~~~------------------~--------------~~~~~~~nt~~l~~k~~~~LD~~~~ 44 (319) T protein:vir:94 1 MNKTIKNATGMLK----LNLQH------------------F--------------ANKSVEPGQTLLKNKHVGILERVTA 44 (319) T ss_pred CCcccccccceeE----eehhh------------------h--------------hccCCCcchHHHHHHHHHHHHHHHH Confidence 0111110000000 00000 0 0000011111112233444444444 Q ss_pred hhhHHh--hcc--ceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhHHHH--H Q lcl|Aclame:pro 178 ELSLAD--LIS--SRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPEL--F 251 (497) Q Consensus 178 ~~~l~~--~~~--~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~~l--~ 251 (497) ...+-. .++ ..-.++++++||+.....-..+--..|-.....+.++...++.-.+.-.+..=.-+.-+.+..+ . T Consensus 45 ~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~ 124 (319) T protein:vir:94 45 VNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDIN 124 (319) T ss_pred HhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHH Confidence 333221 122 2334677899999876322222112222222223344445554444444331111111111122 1 Q ss_pred HHHHHHHHHHHHHHHHhhhhc---c-CCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhh Q lcl|Aclame:pro 252 NFVQGRLLEGIQRKEEVQLLA---G-GGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 252 ~~i~~~la~~~~~~~d~a~l~---G-~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) ..+.......++-.+|.-.+. + .|.... T Consensus 125 ~i~~~~~~~~v~PEiDay~~skla~~a~~~~~------------------------------------------------ 156 (319) T protein:vir:94 125 YVVARQGAEVVAPYLDNLRFATLARNKAKHLT------------------------------------------------ 156 (319) T ss_pred HHHHHHHHHHhhhhhhHHHHHHHHhhcccccc------------------------------------------------ Confidence 222333334444445543221 1 010000 Q ss_pred hhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccccccc Q lcl|Aclame:pro 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYG 407 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~ 407 (497) ...+.....+.+..+...+........-.++++|..+..|.+-.. ......... .+ T Consensus 157 -------------------~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~----f~~~~~~~~-~~ 212 (319) T protein:vir:94 157 -------------------VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVI----ALPQGDTRQ-QV 212 (319) T ss_pred -------------------cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhh----hhccccccc-cc Confidence 000111223334444444433332223356678888777644321 111111111 11 Q ss_pred ccccccccccccceeecCC--CCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccce Q lcl|Aclame:pro 408 NPVNGGKNIWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 408 ~~~~~~~~l~G~pvv~s~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af 485 (497) .-.+.-+.|.|++|+.++. +..-.+++|.-+. .... ..--.++........| -..++....+|..|.+|++. T Consensus 213 ~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A--~~~~-~k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~ 286 (319) T protein:vir:94 213 LGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEV--LASP-IQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQK 286 (319) T ss_pred eeeeeceeecCeEEEEecccccccceEEEEcCCe--eeee-eeeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccc Confidence 1122235789999988643 3344466665543 2111 1111222221111112 35688888899999999976 Q ss_pred EEEEecCCCCCC Q lcl|Aclame:pro 486 QLIQLKKGATGS 497 (497) Q Consensus 486 ~~~~~~~~a~~~ 497 (497) .......++..+ T Consensus 287 ~Iy~~~~~~~~~ 298 (319) T protein:vir:94 287 YIFTIGGTEVAT 298 (319) T ss_pred eEEEeecCCccc Confidence 666655555555 No 217 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=78.94 E-value=0.11 Score=25.85 Aligned_cols=286 Identities=10% Similarity=-0.012 Sum_probs=110.8 Q ss_pred hhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHh Q lcl|Aclame:pro 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY 177 (497) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~ 177 (497) .....+++..+.+ ..... | ..-+.......-.+....+++.+.. T Consensus 1 ~~~~~~~~~~~~~----~~~~~------------------~--------------~~~~~~~nt~~l~~k~~~~LD~~~~ 44 (319) T protein:vir:97 1 MNKTIKNATGMLK----LNLQH------------------F--------------ANKSVEPGQTLLKNKHVGILERVTA 44 (319) T ss_pred CCcccccccceeE----eehhh------------------h--------------hccCCCcchHHHHHHHHHHHHHHHH Confidence 0111110000000 00000 0 0000011111112233444444444 Q ss_pred hhhHHh--hcc--ceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhHHHH--H Q lcl|Aclame:pro 178 ELSLAD--LIS--SRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPEL--F 251 (497) Q Consensus 178 ~~~l~~--~~~--~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~~l--~ 251 (497) ...+-. .++ ..-.++++++||+.....-..+--..|-.....+.++...++.-.+.-.+..=.-+.-+.+..+ . T Consensus 45 ~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~ 124 (319) T protein:vir:97 45 VNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDIN 124 (319) T ss_pred HhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHH Confidence 333221 122 2334677899999876322222112222222223344445554444444331111111111122 1 Q ss_pred HHHHHHHHHHHHHHHHhhhhc---c-CCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhh Q lcl|Aclame:pro 252 NFVQGRLLEGIQRKEEVQLLA---G-GGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 252 ~~i~~~la~~~~~~~d~a~l~---G-~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) ..+.......++-.+|.-.+. + .|.... T Consensus 125 ~i~~~~~~~~v~PEiDay~~skla~~a~~~~~------------------------------------------------ 156 (319) T protein:vir:97 125 YVVARQGAEVVAPYLDNLRFATLARNKAKHLT------------------------------------------------ 156 (319) T ss_pred HHHHHHHHHHhhhhhhHHHHHHHHhhcccccc------------------------------------------------ Confidence 222333334444445543221 1 010000 Q ss_pred hhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccccccc Q lcl|Aclame:pro 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYG 407 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~ 407 (497) ...+.....+.+..+...+........-.++++|..+..|.+-.. ......... .+ T Consensus 157 -------------------~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~----f~~~~~~~~-~~ 212 (319) T protein:vir:97 157 -------------------VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVI----ALPQGDTRQ-QV 212 (319) T ss_pred -------------------cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhh----hhccccccc-cc Confidence 000111223334444444433332223356678888777644321 111111111 11 Q ss_pred ccccccccccccceeecCC--CCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccce Q lcl|Aclame:pro 408 NPVNGGKNIWGVPVVTTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF 485 (497) Q Consensus 408 ~~~~~~~~l~G~pvv~s~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af 485 (497) .-.+.-+.|.|++|+.++. +..-.+++|.-+. .... ..--.++........| -..++....+|..|.+|++. T Consensus 213 ~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A--~~~~-~k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~ 286 (319) T protein:vir:97 213 LGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEV--LASP-IQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQK 286 (319) T ss_pred eeeeeceeecCeEEEEecccccccceEEEEcCCe--eeee-eeeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccc Confidence 1122235789999988643 3344466665543 2111 1111222221111112 35688888899999999976 Q ss_pred EEEEecCCCCCC Q lcl|Aclame:pro 486 QLIQLKKGATGS 497 (497) Q Consensus 486 ~~~~~~~~a~~~ 497 (497) .......++..+ T Consensus 287 ~Iy~~~~~~~~~ 298 (319) T protein:vir:97 287 YIFTIGGTEVAT 298 (319) T ss_pred eEEEeecCCccc Confidence 666655555555 No 218 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=78.13 E-value=0.12 Score=25.68 Aligned_cols=316 Identities=11% Similarity=0.019 Sum_probs=128.4 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhcc---cccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecC Q lcl|Aclame:pro 127 PGTAAAELMGAFADGETAPAAIGQNPFGS---TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA 203 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~ 203 (497) .. +..+........ ..+.... +.+--..|.|.....+...+.+.+.+++++++++++--.....-.... T Consensus 1 mt------r~~~~~y~~~~A--~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 72 (336) T protein:vir:37 1 MN------KQAYYALAAALA--KHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATE 72 (336) T ss_pred Cc------HHHHHHHHHHHH--HHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccC Confidence 00 000100000000 0111110 111124567777788888889999999999999987544333222111 Q ss_pred CccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhHHHHH----HHHHHHHHHHHHHHHHhhh--hccCCCc Q lcl|Aclame:pro 204 HNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPELF----NFVQGRLLEGIQRKEEVQL--LAGGGYP 277 (497) Q Consensus 204 ~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~~l~----~~i~~~la~~~~~~~d~a~--l~G~g~~ 277 (497) .+-++-..-+. .......+.-.+.+++.---+.|+-+.|+....+. ..+...+.+.+ ++|.-. +||+... T Consensus 73 g~iagrtdt~r--~r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~i--ALD~i~IGfnG~s~A 148 (336) T protein:vir:37 73 KGVTGRKQTGR--NLATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQV--ALDILQIGWNGQSVA 148 (336) T ss_pred cccccccCCCC--CccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHH--hcchhhhcccceeec Confidence 11111111111 11122344455555555556678888887653333 33333333333 345443 3443211 Q ss_pred cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---hcccccc-cccccccccchhh Q lcl|Aclame:pro 278 GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV---VTGAAGS-GSGVAGSYPTAAE 353 (497) Q Consensus 278 ~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~~~ 353 (497) ..+- + -...+.+.. |+-..+.... +...... +-....+...... T Consensus 149 ~~Td--n------------------------PllqDVNkG------WlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~ 196 (336) T protein:vir:37 149 TNTT--K------------------------TDLSDVNKG------WLKLLQEQRAANFMTESTKSSGKITIFGDNADYA 196 (336) T ss_pred cCCC--C------------------------ccccccchh------HHHHHHhccchhhcccccccCCceEEecCCCCcc Confidence 1000 0 000011111 1111111111 0000000 0001111111222 Q ss_pred hhhh-hHHhhhhhhhhhccCCceEEEehhHHHHHH--HHhcccCcccccccccccccc--cccccccccccceeecCCCC Q lcl|Aclame:pro 354 IAEN-VFDAFVDIQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGN--PVNGGKNIWGVPVVTTPLIP 428 (497) Q Consensus 354 ~~~~-~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~~~~~~~~~~~~~--~~~~~~~l~G~pvv~s~~~~ 428 (497) .+|. ++++...+.....-.+...++=-.+..+-. .|-..+|. .+ +..-. ...-..++-|+|.+..|.+| T Consensus 197 NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~-----~P-tE~~Aa~~~~~~k~iGGlpa~~~PffP 270 (336) T protein:vir:37 197 NLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGL-----TP-TEKAALGSHNLMGSFGGMNAITPPNFP 270 (336) T ss_pred cHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCC-----CH-HHHHHHHHHHHHHhhCCceEEEccccC Confidence 3333 344443333333333444443333322211 11111111 00 10000 12234578999999999999 Q ss_pred cCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 429 LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 429 ~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++.+++=-|+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++.....-+- T Consensus 271 ~~~~lVT~L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 334 (336) T protein:vir:37 271 ARAAAVTTLKNLSVYT-EAESVRRSLRNDE----DKKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNG 334 (336) T ss_pred CCceEEeeccccEEEE-ecCcEEEEEEEcc----ccccccchhhhcceeeeeccccEEEeeeeeeeccc Confidence 9999988887753322 2222222222211 13333333333345566677777766654444433 No 219 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=78.08 E-value=0.12 Score=25.67 Aligned_cols=316 Identities=11% Similarity=0.025 Sum_probs=128.5 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhc---ccccCCcccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecC Q lcl|Aclame:pro 127 PGTAAAELMGAFADGETAPAAIGQNPFG---STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA 203 (497) Q Consensus 127 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~ 203 (497) .. +..+........ ..+... .+.+--..|.|.....+...+.+.+.+++++++++++--.....-.... T Consensus 1 mt------r~~~~~y~~~~A--~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 72 (336) T protein:vir:37 1 MN------KQAYYALAAALA--KHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATE 72 (336) T ss_pred Cc------HHHHHHHHHHHH--HHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccC Confidence 00 000100000000 011111 0111124567777888889999999999999999987544333222111 Q ss_pred CccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhhHH---H-HHHHHHHHHHHHHHHHHHhhh--hccCCCc Q lcl|Aclame:pro 204 HNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP---E-LFNFVQGRLLEGIQRKEEVQL--LAGGGYP 277 (497) Q Consensus 204 ~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~---~-l~~~i~~~la~~~~~~~d~a~--l~G~g~~ 277 (497) .+-++-..- +..| .++.++.-.+.+++.---+.|+-+.|+... + +...+...+.+.+ ++|.-. +||+... T Consensus 73 g~iagrtdt-~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~i--ALD~i~IGfnG~s~A 148 (336) T protein:vir:37 73 KGVTGRKQT-GRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQV--ALDILQIGWNGQSVA 148 (336) T ss_pred cccccccCC-Cccc-cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHH--hhchhhhcccceeec Confidence 111111111 1222 224555666666666666778888887653 3 2233333334443 345443 3444221 Q ss_pred cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhh---hcccccccc-cccccccchhh Q lcl|Aclame:pro 278 GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV---VTGAAGSGS-GVAGSYPTAAE 353 (497) Q Consensus 278 ~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~ 353 (497) ..+- . -...+.+.. |+-..+.... +.......+ ....+...... T Consensus 149 ~~Td------------------n--------PllqDVNkG------WlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~ 196 (336) T protein:vir:37 149 DNTT------------------K--------ADLSDVNKG------WLKLLQEQRAANFMTESTKSSGKITIFGDNADYA 196 (336) T ss_pred cCCC------------------C--------Ccccccchh------HHHHHHhccchhhcccccccCCceEEecCCCCcc Confidence 1100 0 000011111 1112211111 000000000 01111111222 Q ss_pred hhhh-hHHhhhhhhhhhccCCceEEEehhHHHHHH--HHhcccCcccccccccccccc--cccccccccccceeecCCCC Q lcl|Aclame:pro 354 IAEN-VFDAFVDIQLTLFQTPNAVVMNPRDWELLR--LTKDANGQYMGGNFFGNAYGN--PVNGGKNIWGVPVVTTPLIP 428 (497) Q Consensus 354 ~~~~-~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~--~lkd~~G~~~~~~~~~~~~~~--~~~~~~~l~G~pvv~s~~~~ 428 (497) .+|. ++++...+.....-.+...++=-.+..+-. .|-..+|. .+ +..-. ...-..++-|+|.+..|.+| T Consensus 197 NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~-----~P-tE~~Aa~~~~~~k~iGGlpa~~~PffP 270 (336) T protein:vir:37 197 NLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGL-----TP-TEKAALGSHNLMGSFGGMNAITPPNFP 270 (336) T ss_pred cHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhhhcCC-----CH-HHHHHHHHHHHHHhhCCceeEEccccC Confidence 3333 344443333333333444443333322211 11111111 00 00000 12234578999999999999 Q ss_pred cCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 429 LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 429 ~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ++.+++=-|+.+.+.. .+....-.+-+.. .+|++.-.=..--+..|-+++.+|.++-....-+- T Consensus 271 ~~~~lVT~L~NLsIY~-Q~gs~RR~~~d~p----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 334 (336) T protein:vir:37 271 ARAAAVTTLKNLSVYT-EAESVRRSLRNDE----DKKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNG 334 (336) T ss_pred CCceEEeechhcEEEE-ecCcEEEEEEEcc----ccccccchhhhcceeeeeccccEEEeeeeeeeecC Confidence 9999988887753322 2222222222211 13333333333345555666666665543333333 No 220 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=76.62 E-value=0.13 Score=25.38 Aligned_cols=311 Identities=16% Similarity=0.107 Sum_probs=124.0 Q ss_pred hhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHH---hhhhHHhh Q lcl|Aclame:pro 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADL 184 (497) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~---~~~~l~~~ 184 (497) +........ .......... ....+.+.- .-.....+..+++.+-.+.+..-|..+. +...+..- T Consensus 1 ~~~~~~~~~---~~~~~~~~~~--e~~~KS~~t--------g~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~ 67 (462) T protein:vir:96 1 MHKDTNLTA---EQNKYADKFQ--EEVMKSYQT--------GYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYRE 67 (462) T ss_pred Cccccccch---hhhhhhchhh--HHHHHHHhc--------CCCcCCccccccchhhhhhhhhhhheeeecccchhhhhh Confidence 000000000 0000000000 000000100 0000111222334443333322222221 12244555 Q ss_pred ccceecCCCceEEEEeec--CCccceeeccccccccccccceeEEeeeeeeeeechhhHHH-HhhH-HHHHHHHHHHHHH Q lcl|Aclame:pro 185 ISSRPVTSPNLSYLTESA--AHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRLLE 260 (497) Q Consensus 185 ~~~~~~~~~~~~~p~~~~--~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~el-l~ds-~~l~~~i~~~la~ 260 (497) +...++.+.--+|-.+.. ..+.+.+++|++..+.+++++...+...|=++.-..+|-.. |..+ .+.+....++-.- T Consensus 68 i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~ 147 (462) T protein:vir:96 68 ISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIA 147 (462) T ss_pred cCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHH Confidence 666777665444544443 33457899999999999999999999999999977777654 3333 4677888888888 Q ss_pred HHHHHHHhhhhccCCCccccce---eccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccc Q lcl|Aclame:pro 261 GIQRKEEVQLLAGGGYPGVNGL---LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGA 337 (497) Q Consensus 261 ~~~~~~d~a~l~G~g~~~~~Gi---l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (497) .++..++.+.++|+..-.|.+. +..-+. ..+....+.... T Consensus 148 ~~a~tiE~a~Fygds~l~~~~~~~gleFDGl-------------------------------------~~lI~~~NViDa 190 (462) T protein:vir:96 148 VVAKTIEWASFYGDASLTADPTGQGLEFDGL-------------------------------------AKLIDKDNVIDA 190 (462) T ss_pred HHHHHHHHHHhhhhcccCCCccccccchhhh-------------------------------------hhhcCCCceeec Confidence 9999999999999864333221 000000 000000000000 Q ss_pred ccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccc-ccccccccccc-- Q lcl|Aclame:pro 338 AGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNPVNGGK-- 414 (497) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~~~~~~-- 414 (497) .+... -..++.-........|..++-.+|+..+.+.|..---..-|.+.++..+ ...|.++...- T Consensus 191 rG~~L------------s~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~ 258 (462) T protein:vir:96 191 KGESL------------TETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSS 258 (462) T ss_pred CCCCc------------cHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccceeee Confidence 00000 0011111111122344455556666666666552211111111111100 01111111111 Q ss_pred ---------cccccceeec------CCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhh----cCceEEEEEeee Q lcl|Aclame:pro 415 ---------NIWGVPVVTT------PLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV----DGKVTVRAEERL 475 (497) Q Consensus 415 ---------~l~G~pvv~s------~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~----~~~v~~r~~~r~ 475 (497) ++++.|-+.. +.+|+-. .++..+..-....|. .....|++...- T Consensus 259 ~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~-----------------~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs 321 (462) T protein:vir:96 259 RGFIKLHGSTVMENELILDESLQPLPNAPQPA-----------------TVKATVETGKKGLFTDEHDRAELTYKVVVNS 321 (462) T ss_pred eeeeeeCCceecCcccccccccccCCCCCCCC-----------------ceeEEEEeCCCCCCCCccCceeEEEEEEEEC Confidence 1111111111 1111100 000000000000110 123344444444 Q ss_pred ccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 476 GLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 476 ~~~v~~~~Af~~~~~~~~a~~~ 497 (497) +..=--|+.++-++..+..+|. T Consensus 322 ~dgeS~PS~~VtaTva~~~~gv 343 (462) T protein:vir:96 322 DDAQSAPSEAVTATVNNATDGV 343 (462) T ss_pred CCCccccceeeEeeeecccccc Confidence 3333346666666655555554 No 221 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=76.34 E-value=0.14 Score=25.32 Aligned_cols=273 Identities=10% Similarity=-0.021 Sum_probs=106.6 Q ss_pred hhhcccccCCcccccchhhHHHHHHHhhhhHHhhcccee-----c--CCCceEEEEeecCC--ccceeeccccccccccc Q lcl|Aclame:pro 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRP-----V--TSPNLSYLTESAAH--NNAAAVAEAGTYPFSSE 221 (497) Q Consensus 151 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~--~~a~~v~Eg~~~~~s~~ 221 (497) |+ .+...++|.-+...+++.+++.+++.++++.-. . .+.++++|+..... ..+.+-..+.. ..+. T Consensus 1 MA----Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~--~~~l 74 (423) T protein:vir:10 1 MA----NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKS--KNSL 74 (423) T ss_pred Cc----cccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCccc--cccc Confidence 11 112235556677788999999988888876532 2 25677777643211 11111111111 1112 Q ss_pred c--ceeEEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhh Q lcl|Aclame:pro 222 E--FARVYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 222 ~--~~~v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~ 299 (497) . --.+.+.-+|...+-.=+.|+..+..+++++++.. .++++..+|..+...-....+.. ...++. . T Consensus 75 ~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~-vgt~~t--~-------- 142 (423) T protein:vir:10 75 ISAKATGEVGNYITVAVEYRQIEEALKLNQLDQILVPI-NERMVTDLETELALFMMKHGALS-LGSPNT--P-------- 142 (423) T ss_pred ccceEEEEecceeeeeeeeChHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhccccc-cccccc--c-------- Confidence 1 13566666666666655667665666777766555 78999999988753211100100 000000 0 Q ss_pred HHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEe Q lcl|Aclame:pro 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 379 (497) ...++.+......+.....+. ..-..+++ T Consensus 143 -------------------------------------------------~~a~~~~a~a~~~L~~~~vP~--~~R~~Vv~ 171 (423) T protein:vir:10 143 -------------------------------------------------IKKWSDVAQTASFLKDLGINS--GENYAVMD 171 (423) T ss_pred -------------------------------------------------cccHHHHHHHHHHHhhccCCc--CCCEEEeC Confidence 000011111111122221221 12346788 Q ss_pred hhHHHHHHHHhcccCccccccccccccccccc-ccccccccceeecCCCCc---CcEEE-eeccceEEEEE-----ec-- Q lcl|Aclame:pro 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVN-GGKNIWGVPVVTTPLIPL---GTILV-GHFAPSVIQTA-----RR-- 447 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~-~~~~l~G~pvv~s~~~~~---~~~~~-gd~~~~~~~i~-----~r-- 447 (497) |..+..|.+ +............ .....+ -...+.|+.++.|+++|. |+.-+ +-- .+++.+- +- T Consensus 172 p~~~a~Ll~--~~~~~~~~~~~~~--~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~-~~~~~vt~a~~~~~~~ 246 (423) T protein:vir:10 172 PWAAQRLAD--AQSGLHVSEQLVR--TAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTV-KGTPEVNYDSVKDSYA 246 (423) T ss_pred HHHHHHHhh--hhhhhccccccch--HHHHhcccceeecceEEEEecCCcccccccccceeee-eeeeEEEecccccccc Confidence 888777642 1111111111111 111111 224789999999999983 32110 000 0111110 00 Q ss_pred cccEEEEecc--chhhhhcCceEE---EEEeeeccEee------cccceEEEEe-cCCCCCC Q lcl|Aclame:pro 448 EGVTMQMTNS--NGTDFVDGKVTV---RAEERLGLLVY------RPSAFQLIQL-KKGATGS 497 (497) Q Consensus 448 ~~~~i~~~~~--~~~~f~~~~v~~---r~~~r~~~~v~------~~~Af~~~~~-~~~a~~~ 497 (497) ...+...... .+..-.-|.+.| ....++...++ ++.-|+...= .+.+.|. T Consensus 247 ~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~ 308 (423) T protein:vir:10 247 FTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGD 308 (423) T ss_pred cccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCc Confidence 0000000000 000000111111 11222222211 1111221110 0001111 No 222 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=64.22 E-value=0.3 Score=23.42 Aligned_cols=296 Identities=9% Similarity=-0.045 Sum_probs=109.6 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHHHHHHHhhhhHH--hhcc--cee Q lcl|Aclame:pro 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLA--DLIS--SRP 189 (497) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~--~~~~--~~~ 189 (497) ....+-....... .+.+... +. ...++.. -..-+-..+.+.-.+....+++.......+- .+++ ... T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~~-~~-~~~~~~~--~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~ 71 (329) T protein:vir:10 1 MDGIFITGVKTMN-----KEIKNAT-GK-LKLNLQH--FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIF 71 (329) T ss_pred CCceEEechhhhh-----hhhhccc-ce-eEEehhh--hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceee Confidence 0000000000000 0000000 00 0000000 0000000111111122223333333222111 1222 244 Q ss_pred cCCCceEEEEeecCCccceee-ccccccccccccceeEEeeeeeeeeechhhHHHHhhHHHH--HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 190 VTSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAPEL--FNFVQGRLLEGIQRKE 266 (497) Q Consensus 190 ~~~~~~~~p~~~~~~~~a~~v-~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~ds~~l--~~~i~~~la~~~~~~~ 266 (497) ..+++++||+.... +-..+- ..|-.....+.++...+|.-.+.-.+..=.-+.-+.+..+ ...+.......++-.+ T Consensus 72 ~~g~tVkIp~i~~~-gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEi 150 (329) T protein:vir:10 72 MQGRSFTVIKGDVT-ELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYL 150 (329) T ss_pred ccCcEEEEeeeccc-ccccccCCCCccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHH Confidence 56788999998653 222222 2222222233445555555555544432111111111112 2223333444555555 Q ss_pred Hhhhhc---cC-CCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccc Q lcl|Aclame:pro 267 EVQLLA---GG-GYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGS 342 (497) Q Consensus 267 d~a~l~---G~-g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 342 (497) |...+. +. |+.-. T Consensus 151 Day~~skla~~a~~~~~--------------------------------------------------------------- 167 (329) T protein:vir:10 151 DNLRFATLARNKAKHLT--------------------------------------------------------------- 167 (329) T ss_pred HHHHHHHHHhhcccccc--------------------------------------------------------------- Confidence 543221 10 00000 Q ss_pred cccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccCccccccccccccccccccccccccccee Q lcl|Aclame:pro 343 GVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVV 422 (497) Q Consensus 343 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv 422 (497) ...+.....+.+..+...+.......+-.+++.|..+..|.+. .+.+........ +.-.+.-..|.|++|+ T Consensus 168 ----~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~----~~f~~~~~~~~~-~~~~g~Vg~idG~~Ii 238 (329) T protein:vir:10 168 ----VGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKF----VIELPQGDNRQQ-VLGKGVQGELDGFTIV 238 (329) T ss_pred ----cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhh----hhhhcccccccc-ceeeeeeeeecCeEEE Confidence 0001111233344444444433222233566788888777542 112211111111 1112223478999999 Q ss_pred ecCC--CCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEE-EEecCCCCCC Q lcl|Aclame:pro 423 TTPL--IPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQL-IQLKKGATGS 497 (497) Q Consensus 423 ~s~~--~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~-~~~~~~a~~~ 497 (497) .++. ++.-.+++|.-+.. ..+.-...+.+-...+. ++-..++....+|..|.+|++... ...+.+.+.| T Consensus 239 ~vps~~~k~in~ii~~~~A~-~~~~K~~~~~~~~p~~~-----~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~ 310 (329) T protein:vir:10 239 KVPSKMLQGVEAMAVIGEVM-ASPIQANEAKLNSNVPG-----MFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETN 310 (329) T ss_pred EecCCcccceeEEEEcCCce-eeeeeeeeeeeeCCCCc-----cchheeeeeeeeeeEEEccccCEEEEecccCcccC Confidence 8654 33334566655431 11211112222211111 123478888899999999996553 3345555544 No 223 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=57.98 E-value=0.38 Score=22.86 Aligned_cols=106 Identities=11% Similarity=-0.030 Sum_probs=56.3 Q ss_pred EEehhHHHHHHHHhc-------ccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEec-- Q lcl|Aclame:pro 377 VMNPRDWELLRLTKD-------ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-- 447 (497) Q Consensus 377 ~~n~~~~~~l~~lkd-------~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r-- 447 (497) ++....|+.+...-. .+-.+++.+ .-+-+++|+..+.|+++|.++.++.|-.+. -.+.|. T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG----------~lpV~~~GltWl~tpnlpg~~a~vlDst~l-GgmaDE~l 69 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTG----------SLPVSAYGLTWVTSRHITGTDPWLFDVEQL-GGMADEKL 69 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEec----------CcceeeeceeeeecCCCCCCccceeehhhh-cccccccc Confidence 111111111111000 111233322 122258899999999999998888776553 233332 Q ss_pred --------cccEEEEeccchhhhhcCceEEEEEeeeccEeecccceEEEEecCC Q lcl|Aclame:pro 448 --------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) Q Consensus 448 --------~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~~~~~ 493 (497) .+.-|+++....+.=.+|+..+|+..-.--.|.-|.|.++|+=.-- T Consensus 70 ~~Pgya~~~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 70 LSPEFAPAGNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CCCcccCCCCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 2223343333332234788888886655666778999999984333 No 224 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=57.88 E-value=0.42 Score=22.62 Aligned_cols=312 Identities=16% Similarity=0.116 Sum_probs=111.0 Q ss_pred HHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHH Q lcl|Aclame:pro 92 LARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGI 171 (497) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i 171 (497) ... .++.+. ....... +..+.+..+ -.....+..+|+.+--+.+..- T Consensus 1 ~~~-------~~n~~~---------------~~~~~~e---~~~Ks~ttg--------y~~~p~~q~~~~AlRrEsL~~~ 47 (464) T protein:vir:80 1 MTE-------KKNTER---------------QLTSVQE---EVIKGFTTG--------YGITPESQTDAAALRREFLDDQ 47 (464) T ss_pred CCc-------chhhHh---------------hcCcccH---HHHHHHHhC--------CccCcccccCcchhhhhhhhhh Confidence 000 000000 0000000 000011000 0011122233444443333322 Q ss_pred HHHHH---hhhhHHhhccceecCCCceEEEEeec--CCccceeeccccccccccccceeEEeeeeeeeeechhhH--HHH Q lcl|Aclame:pro 172 VEQLF---YELSLADLISSRPVTSPNLSYLTESA--AHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITD--EGL 244 (497) Q Consensus 172 i~~~~---~~~~l~~~~~~~~~~~~~~~~p~~~~--~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~--ell 244 (497) |..+. ....+..-+...++.+.--+|-.+.. ..+.+.+++|++..+.++|++...+...|-+..--.+|- .|. T Consensus 48 i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lv 127 (464) T protein:vir:80 48 ITMLTWADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLV 127 (464) T ss_pred hheeeecccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhh Confidence 22221 12244555667777665445544443 234578999999999999999999999886666433343 334 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccc-----cceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhh Q lcl|Aclame:pro 245 RDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGV-----NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAF 319 (497) Q Consensus 245 ~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~-----~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (497) +.-.+-.....++-.-.++..++.+.++|+..-.| .|+- .- ... T Consensus 128 n~~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gle-FD--------------Gl~---------------- 176 (464) T protein:vir:80 128 NNIEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLE-FD--------------GLA---------------- 176 (464) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccc-hh--------------hhH---------------- Confidence 43335555666677788999999999999853221 1110 00 000 Q ss_pred hhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHH-HHHhcccCcccc Q lcl|Aclame:pro 320 VGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELL-RLTKDANGQYMG 398 (497) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l-~~lkd~~G~~~~ 398 (497) .+....+.....+.... ..++..........|..++-.+|+..+.+.+ ...-+.+=+.+. T Consensus 177 -------~lI~~~NViDarG~~Ls------------~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~~ 237 (464) T protein:vir:80 177 -------KLIDKHNVLDAKGASLT------------EALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVIS 237 (464) T ss_pred -------hhcCCCceeecCCCCcC------------HHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEEc Confidence 00000000000000000 0011111111123444455556666555554 222111111111 Q ss_pred cccccccccccccc-----------cccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhh-cC- Q lcl|Aclame:pro 399 GNFFGNAYGNPVNG-----------GKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFV-DG- 465 (497) Q Consensus 399 ~~~~~~~~~~~~~~-----------~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~-~~- 465 (497) ........|.++.. +.++++.|-+..++ ...+.+.+.. ..++..+++.....|- ++ T Consensus 238 ~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~---~~~~~~apaa--------psvt~tv~~~~~g~f~~~~~ 306 (464) T protein:vir:80 238 DNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDEN---RMQLPNAPQK--------ATVKATLEAGTKGKFRDEDL 306 (464) T ss_pred CCCCcceeeeecccccccccceeccCccccCcccccccc---cccCCCCcCC--------ceeEEEecCCcccCCccccc Confidence 00000001111111 11122221111111 1111111111 1111112111111121 11 Q ss_pred --ceEEEEEeeeccEeecccceEEEEecCCCCCC Q lcl|Aclame:pro 466 --KVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) Q Consensus 466 --~v~~r~~~r~~~~v~~~~Af~~~~~~~~a~~~ 497 (497) ..-|++...-+-.=--|..++-.++....++= T Consensus 307 ~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V 340 (464) T protein:vir:80 307 TIDTEYKVVVVSDDAESAPSDVASVVIDDKKKQV 340 (464) T ss_pred cceeEEEEEEECCCCccccceeeeeeecCcccEE Confidence 12233333332222223332222222111111 No 225 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=54.00 E-value=0.51 Score=22.17 Aligned_cols=397 Identities=12% Similarity=0.046 Sum_probs=110.2 Q ss_pred CchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDI-NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |..+.++.++..+..++++.. .+++.++..++.+.++.+..+++...+..+..+ ............ .+. T Consensus 8 ~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~------~~~~~~~~~~~~----~~~ 77 (415) T protein:vir:81 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLK------EKDGTSENNQQS----VEV 77 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHhhhhhcccc----ccc Confidence 555666666666665555543 333333344444444444444433222221111 000000000000 000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+..... .... ................. ...... ........ .. ..... ..+. ... T Consensus 78 ~~~~~~~-~~~~------~~~~~~~~~~~~~~~~~--------~~~~~~--~~~~~~~~----~~-~~~~~-~~gg-~~i 133 (415) T protein:vir:81 78 NEARTYR-NQAN------INDLGISIQNTKVTSQE--------VRDFTE--YLETRNDI----QG-GSLKT-DSGF-VVI 133 (415) T ss_pred chhhhHH-HHHH------HHHHhhhhhhhhhHHHH--------HHHHHH--HHhhhhhh----hh-ccccc-cccc-ccc Confidence 0000000 0000 00000000000000000 000000 00000000 00 00000 0000 000 Q ss_pred CcccccchhhH-----HHHHHHhhhhHHhhcccee---cCCC-ceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 160 APGILPTFLPG-----IVEQLFYELSLADLISSRP---VTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~i~~~~~~~-----ii~~~~~~~~l~~~~~~~~---~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ...+.+.+... .+..+...-++......++ .++. ...+. + . ++ -..|.........++..-.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v---~-E-~~-~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:81 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKV---E-E-LE-ENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred chHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceee---c-c-cc-ccCcccccceeeEEeeeeeeEe Confidence 01111111111 1111111111111111111 1111 11111 1 1 11 1222222222333433333332 Q ss_pred eeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhh Q lcl|Aclame:pro 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-.-..--+......-...+...|...++.++..++=...-.|.+.+...+........+ ........+.......... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:81 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccc-cccccchhHHHHHHHhhhh Confidence 211111111111111123456666677777777666554444554433222222222222 2222233334444444455 Q ss_pred hhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHh Q lcl|Aclame:pro 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.|+++...+..++..++..|.+...+......+............ ...........+++.|...+ T Consensus 287 ~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~--~~~~~~~~~~~~~~Gd~~~~------- 357 (415) T protein:vir:81 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILP--DEVLGQKGNNTLIIGNLKDA------- 357 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEec--ccccCCCCccEEEEEehhcc------- Confidence 556677788898888888888887777665544322221111000000000 00000000000111111110 Q ss_pred cccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEecccc-EEEEeccchhhhhcCceEE Q lcl|Aclame:pro 391 DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 391 d~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~ 469 (497) ++. ..-.|+.|..+++.-..+.+.+.... ...+.+-..+ .++.+.... ..+..++ T Consensus 358 ------~~~--------------~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~~~~~---~~~~~~~ 413 (415) T protein:vir:81 358 ------IVL--------------FDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYDDSER---GEGDLGL 413 (415) T ss_pred ------EEE--------------EeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEeccCC---CCCcccc Confidence 000 00112333333322211111111000 0001111111 111111111 0111111 Q ss_pred EE Q lcl|Aclame:pro 470 RA 471 (497) Q Consensus 470 r~ 471 (497) -+ T Consensus 414 ~~ 415 (415) T protein:vir:81 414 EA 415 (415) T ss_pred CC Confidence 11 No 226 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=54.00 E-value=0.51 Score=22.17 Aligned_cols=397 Identities=12% Similarity=0.046 Sum_probs=110.2 Q ss_pred CchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDI-NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |..+.++.++..+..++++.. .+++.++..++.+.++.+..+++...+..+..+ ............ .+. T Consensus 8 ~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~------~~~~~~~~~~~~----~~~ 77 (415) T protein:vir:98 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLK------EKDGTSENNQQS----VEV 77 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHhhhhhcccc----ccc Confidence 555666666666665555543 333333344444444444444433222221111 000000000000 000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+..... .... ................. ...... ........ .. ..... ..+. ... T Consensus 78 ~~~~~~~-~~~~------~~~~~~~~~~~~~~~~~--------~~~~~~--~~~~~~~~----~~-~~~~~-~~gg-~~i 133 (415) T protein:vir:98 78 NEARTYR-NQAN------INDLGISIQNTKVTSQE--------VRDFTE--YLETRNDI----QG-GSLKT-DSGF-VVI 133 (415) T ss_pred chhhhHH-HHHH------HHHHhhhhhhhhhHHHH--------HHHHHH--HHhhhhhh----hh-ccccc-cccc-ccc Confidence 0000000 0000 00000000000000000 000000 00000000 00 00000 0000 000 Q ss_pred CcccccchhhH-----HHHHHHhhhhHHhhcccee---cCCC-ceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 160 APGILPTFLPG-----IVEQLFYELSLADLISSRP---VTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~i~~~~~~~-----ii~~~~~~~~l~~~~~~~~---~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ...+.+.+... .+..+...-++......++ .++. ...+. + . ++ -..|.........++..-.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v---~-E-~~-~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:98 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKV---E-E-LE-ENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred chHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceee---c-c-cc-ccCcccccceeeEEeeeeeeEe Confidence 01111111111 1111111111111111111 1111 11111 1 1 11 1222222222333433333332 Q ss_pred eeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhh Q lcl|Aclame:pro 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-.-..--+......-...+...|...++.++..++=...-.|.+.+...+........+ ........+.......... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:98 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccc-cccccchhHHHHHHHhhhh Confidence 211111111111111123456666677777777666554444554433222222222222 2222233334444444455 Q ss_pred hhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHh Q lcl|Aclame:pro 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.|+++...+..++..++..|.+...+......+............ ...........+++.|...+ T Consensus 287 ~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~--~~~~~~~~~~~~~~Gd~~~~------- 357 (415) T protein:vir:98 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILP--DEVLGQKGNNTLIIGNLKDA------- 357 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEec--ccccCCCCccEEEEEehhcc------- Confidence 556677788898888888888887777665544322221111000000000 00000000000111111110 Q ss_pred cccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEecccc-EEEEeccchhhhhcCceEE Q lcl|Aclame:pro 391 DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 391 d~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~ 469 (497) ++. ..-.|+.|..+++.-..+.+.+.... ...+.+-..+ .++.+.... ..+..++ T Consensus 358 ------~~~--------------~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~~~~~---~~~~~~~ 413 (415) T protein:vir:98 358 ------IVL--------------FDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYDDSER---GEGDLGL 413 (415) T ss_pred ------EEE--------------EeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEeccCC---CCCcccc Confidence 000 00112333333322211111111000 0001111111 111111111 0111111 Q ss_pred EE Q lcl|Aclame:pro 470 RA 471 (497) Q Consensus 470 r~ 471 (497) -+ T Consensus 414 ~~ 415 (415) T protein:vir:98 414 EA 415 (415) T ss_pred CC Confidence 11 No 227 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=54.00 E-value=0.51 Score=22.17 Aligned_cols=397 Identities=12% Similarity=0.046 Sum_probs=110.2 Q ss_pred CchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDI-NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) |..+.++.++..+..++++.. .+++.++..++.+.++.+..+++...+..+..+ ............ .+. T Consensus 8 ~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~------~~~~~~~~~~~~----~~~ 77 (415) T protein:vir:79 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLK------EKDGTSENNQQS----VEV 77 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHhhhhhcccc----ccc Confidence 555666666666665555543 333333344444444444444433222221111 000000000000 000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .+..... .... ................. ...... ........ .. ..... ..+. ... T Consensus 78 ~~~~~~~-~~~~------~~~~~~~~~~~~~~~~~--------~~~~~~--~~~~~~~~----~~-~~~~~-~~gg-~~i 133 (415) T protein:vir:79 78 NEARTYR-NQAN------INDLGISIQNTKVTSQE--------VRDFTE--YLETRNDI----QG-GSLKT-DSGF-VVI 133 (415) T ss_pred chhhhHH-HHHH------HHHHhhhhhhhhhHHHH--------HHHHHH--HHhhhhhh----hh-ccccc-cccc-ccc Confidence 0000000 0000 00000000000000000 000000 00000000 00 00000 0000 000 Q ss_pred CcccccchhhH-----HHHHHHhhhhHHhhcccee---cCCC-ceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 160 APGILPTFLPG-----IVEQLFYELSLADLISSRP---VTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~i~~~~~~~-----ii~~~~~~~~l~~~~~~~~---~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) ...+.+.+... .+..+...-++......++ .++. ...+. + . ++ -..|.........++..-.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v---~-E-~~-~~~~~~~~~~~~v~~~~~k~~~ 207 (415) T protein:vir:79 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKV---E-E-LE-ENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred chHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceee---c-c-cc-ccCcccccceeeEEeeeeeeEe Confidence 01111111111 1111111111111111111 1111 11111 1 1 11 1222222222333433333332 Q ss_pred eeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhh Q lcl|Aclame:pro 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) Q Consensus 231 ~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) .-.-..--+......-...+...|...++.++..++=...-.|.+.+...+........+ ........+.......... T Consensus 208 ~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~ 286 (415) T protein:vir:79 208 YFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAINLNVK 286 (415) T ss_pred eehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccc-cccccchhHHHHHHHhhhh Confidence 211111111111111123456666677777777666554444554433222222222222 2222233334444444455 Q ss_pred hhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHh Q lcl|Aclame:pro 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lk 390 (497) .....+.|+++...+..++..++..|.+...+......+............ ...........+++.|...+ T Consensus 287 ~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~--~~~~~~~~~~~~~~Gd~~~~------- 357 (415) T protein:vir:79 287 PNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILP--DEVLGQKGNNTLIIGNLKDA------- 357 (415) T ss_pred hccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEec--ccccCCCCccEEEEEehhcc------- Confidence 556677788898888888888887777665544322221111000000000 00000000000111111110 Q ss_pred cccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEecccc-EEEEeccchhhhhcCceEE Q lcl|Aclame:pro 391 DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGV-TMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 391 d~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~-~i~~~~~~~~~f~~~~v~~ 469 (497) ++. ..-.|+.|..+++.-..+.+.+.... ...+.+-..+ .++.+.... ..+..++ T Consensus 358 ------~~~--------------~~~~~~~v~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~~~~~---~~~~~~~ 413 (415) T protein:vir:79 358 ------IVL--------------FDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYDDSER---GEGDLGL 413 (415) T ss_pred ------EEE--------------EeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEeccCC---CCCcccc Confidence 000 00112333333322211111111000 0001111111 111111111 0111111 Q ss_pred EE Q lcl|Aclame:pro 470 RA 471 (497) Q Consensus 470 r~ 471 (497) -+ T Consensus 414 ~~ 415 (415) T protein:vir:79 414 EA 415 (415) T ss_pred CC Confidence 11 No 228 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=48.83 E-value=0.66 Score=21.58 Aligned_cols=308 Identities=14% Similarity=0.134 Sum_probs=116.1 Q ss_pred HHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHH Q lcl|Aclame:pro 92 LARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGI 171 (497) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i 171 (497) ...-+ +.+......+..-... ....+.+..|... ...+..+ ..+...-+|..++..| T Consensus 1 mtnfi------------esqnavteffdvlkkn----sgkseiknawnak------laengvt-itdttfqlprklvesi 57 (318) T protein:vir:94 1 MTNFI------------ESQNAVTEFFDVLKKN----SGKSEIKNAWNAK------LAENGVT-ITDTTFQLPRKLVESI 57 (318) T ss_pred Cccch------------hhhhhHHHHHHHHhcc----cChhhhhhhhhhh------hhhCCce-eecchhhhHHHHHHhh Confidence 00000 0000000000000000 0000111111100 0111111 1111122444444455 Q ss_pred HHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHH--HhhH-H Q lcl|Aclame:pro 172 VEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG--LRDA-P 248 (497) Q Consensus 172 i~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~el--l~ds-~ 248 (497) -..+...+++.....+.+++.--++... ..++.+.....|+++.+...+++--++.|--++.+-.+.... |+.+ . T Consensus 58 ntallntnpvfkvfhvtnvgallvsrsf--dssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsys 135 (318) T protein:vir:94 58 NTALLNTNPVFKVFHVTNVGALLVSRSF--DSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYS 135 (318) T ss_pred hhhhccCCcceeeeeehhhhheeeeccc--cccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHH Confidence 5555666666666555555443222222 223456677788888887777776677666555555555444 3444 5 Q ss_pred HHHHHHHHHHHHHHHHHH-HhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhh Q lcl|Aclame:pro 249 ELFNFVQGRLLEGIQRKE-EVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVAS 327 (497) Q Consensus 249 ~l~~~i~~~la~~~~~~~-d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (497) .+...|..+|..++..++ |-+++-|+|+...+.|...+...... . T Consensus 136 elynlivaeltqaivnkivdlalvegdgtngfksidkeadvkkik----------------------------------k 181 (318) T protein:vir:94 136 ELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIK----------------------------------K 181 (318) T ss_pred HHHHHHHHHHHHHHHhhhhheeeeecCCcchhhhhchhhhHHHHH----------------------------------H Confidence 688889999999988776 67788999998887776543211000 0 Q ss_pred hhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhH-HHHHHHHhcccCc---cccccccc Q lcl|Aclame:pro 328 LKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD-WELLRLTKDANGQ---YMGGNFFG 403 (497) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~-~~~l~~lkd~~G~---~~~~~~~~ 403 (497) +.+.....|.. | ..+.+-.+....+...+ ....+....+ -+-|..|+-+... .|-.+. T Consensus 182 ittkaksagkt----------p----fadaieeavdfvrptag--rrylivktedrkalldelrqatananvrikndd-- 243 (318) T protein:vir:94 182 ITTKAKSAGKT----------P----FADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANANVRIKNDD-- 243 (318) T ss_pred hhhhhhhcCCC----------c----hhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhhhcccceEEeccc-- Confidence 00000000000 0 00111111111111111 0011111111 1222222211100 000000 Q ss_pred cccccccccccccccc-ceee-cCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeec Q lcl|Aclame:pro 404 NAYGNPVNGGKNIWGV-PVVT-TPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) Q Consensus 404 ~~~~~~~~~~~~l~G~-pvv~-s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~ 481 (497) +... .--|+ .+++ +..-....-++-|.+ |.|. -+++ ...+.-.|.+|.-.+.++.-..++|-- T Consensus 244 teia-------sevgvdeiivytgskavkptvlvdqk---yhid-mqdl----tkvdafewktnsnmilvetltsghvet 308 (318) T protein:vir:94 244 TEIA-------SEVGVDEIIVYTGSKAVKPTVLVDQK---YHID-MQDL----TKVDAFEWKTNSNMILVETLTSGHVET 308 (318) T ss_pred hhhh-------hhcCcceeEEeeccccccceeEeccc---eecc-hhhh----hhhhceeeccCCceEEEEecccCccee Confidence 0000 00010 0000 000011111222322 3221 1111 112223466666666677666666654 Q ss_pred ccceEEEEec Q lcl|Aclame:pro 482 PSAFQLIQLK 491 (497) Q Consensus 482 ~~Af~~~~~~ 491 (497) -.|=+.+++. T Consensus 309 ynagavitvs 318 (318) T protein:vir:94 309 YNAGAVITVS 318 (318) T ss_pred ecCceeEEeC Confidence 4444444444 No 229 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=47.41 E-value=0.7 Score=21.42 Aligned_cols=371 Identities=11% Similarity=0.028 Sum_probs=90.1 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) .....+++++.+.+.+++.+...+......+ ++++.++++...+.++...+. .++...... ...... T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee----~~~~~~~i~~~~~~~e~~~~~------~~~~~~~~~---~~~~~~ 74 (397) T protein:vir:49 8 HDLWVAQGDKVENLNEKLNVAMLDDSVSAEE----LQAIKNERDTAKMKRDMFKEQ------YTEARANEV---ANMSEE 74 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCHHH----HHHHHHHHHHHHHHHHHHHHH------HHHHHHHhh---hccccc Confidence 4445555555555555544443332222222 223333333322222211111 000000000 000000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) ........ .........+........................+.. ....... T Consensus 75 ~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~-----------------------~vP~~~~ 126 (397) T protein:vir:49 75 EKKPLTKS-----EEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGL-----------------------TIPQDIQ 126 (397) T ss_pred cccccccc-----hhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcc-----------------------cccHhHH Confidence 00000000 0000000111111111100000000000000000000 0000000 Q ss_pred cccccchhh-HHHHHHHhhhhHHhhc---cceecC--CCceEEEEeecCCccceeeccccccccccccceeEEeeee-ee Q lcl|Aclame:pro 161 PGILPTFLP-GIVEQLFYELSLADLI---SSRPVT--SPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVG-KV 233 (497) Q Consensus 161 ~~i~~~~~~-~ii~~~~~~~~l~~~~---~~~~~~--~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~-ki 233 (497) ..|...... ..+..+...-++.... ...+.. .+...+.- .+ .-+.|.........++..-.+... ++ T Consensus 127 ~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~----E~--~~~~~~~~~~~~~i~~~~~k~~~~~~i 200 (397) T protein:vir:49 127 TAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDD----EA--GKIADVDDPKLSLIKYTIKRYAGISTV 200 (397) T ss_pred HHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeec----Cc--cccccccccceeeEEeeeeeEEeeehh Confidence 001000000 0111110000111000 011111 11111111 11 112222222223333333333222 11 Q ss_pred ee-echhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhh Q lcl|Aclame:pro 234 AN-ALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPA 312 (497) Q Consensus 234 a~-~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (497) .- ++.-|..-+ ...+...|...+++.+..++=...=.+...+...++... ......+.... T Consensus 201 S~ell~ds~~~l--~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~~~~~d~i----------------~~~~~~l~~~~ 262 (397) T protein:vir:49 201 TNSLLADSAENI--LAWLSGWIAKKVVVTRNKAILEAIAALPTKPTLTKWDDI----------------IDLEAKVDPAI 262 (397) T ss_pred HHHHHhhhHHHH--HHHHHHHHHHHHHHHHHHHHHhhccccccccccccHHHH----------------HHHHHhhhhhh Confidence 11 111121111 123444555555555555543322223322222333221 01111222233 Q ss_pred hcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcc Q lcl|Aclame:pro 313 DGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDA 392 (497) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~ 392 (497) .....|+++...+..++..++..|.+...+......+...... .+++. +..+ +..-... T Consensus 263 ~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~------------------PV~~~-~~~~--~~~~~~~ 321 (397) T protein:vir:49 263 KQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGF------------------AVKEV-ADRW--LANGTGG 321 (397) T ss_pred cCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecce------------------eeEEe-cccc--cccccCC Confidence 4456678888888888888777776665443322211111000 00100 0000 0000001 Q ss_pred cCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccc--eEEEEEeccccEE---------EEeccchhh Q lcl|Aclame:pro 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAP--SVIQTARREGVTM---------QMTNSNGTD 461 (497) Q Consensus 393 ~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~--~~~~i~~r~~~~i---------~~~~~~~~~ 461 (497) ++..+|.+. ..... -..-.|+.+..++.... +|.+ ..|....|-+..+ ++....... T Consensus 322 ~~~i~~gd~-~~~~~-----~~~~~~~~i~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 389 (397) T protein:vir:49 322 AMPLYFGDL-KQAVT-----LFDRQHMSLLSTNIGGG------AFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 389 (397) T ss_pred ceeEEEeec-cceEE-----EEeecceEEEEeccccc------hhhcCceeEEEEeeeCcEEecccceEEEEeecccCCC Confidence 111121110 00000 00012445544433110 0110 1111111211111 111100000 Q ss_pred hhcCceEE Q lcl|Aclame:pro 462 FVDGKVTV 469 (497) Q Consensus 462 f~~~~v~~ 469 (497) =....+++ T Consensus 390 ~~~~~~~~ 397 (397) T protein:vir:49 390 GNLGSTAV 397 (397) T ss_pred CCcccccC Confidence 00111111 No 230 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=45.40 E-value=0.77 Score=21.20 Aligned_cols=371 Identities=12% Similarity=0.066 Sum_probs=105.7 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.+++++++.+..+++++.++...+..+..+...+..++++ ++.++.+++.+.+++++...... T Consensus 21 ~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~-------------~l~~~~~~l~~~~~~~e~~~~~~ 87 (418) T protein:vir:10 21 EQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVD-------------ELLIKQGELQARLLEAEQKLARG 87 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHhhc Confidence 44444444444444444444433333322222222222222221 22222233333333333222221 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .........+..........+.+.......... .............. ........+..--- T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~----------~~~~~~~~~g~lvp 148 (418) T protein:vir:10 88 GGSAELETPKTLGQLVTESEEMKGMDGSARKSV---------RVRVDRKSIMNVPA----------TVGSGVSGSNSLVV 148 (418) T ss_pred ccccccchhhhhhHHhhhHHHHHHHHHHHhhhh---------hhhhHHHHHHHhhh----------hccCCCCCCccccc Confidence 111111111111111111111111111000000 00000000000000 00000000000000 Q ss_pred cccccchhh-----HHHHHHHhhhhHHhhccceec-C--CCceEEEEeecCCccceeeccccccccccccceeEEeeeee Q lcl|Aclame:pro 161 PGILPTFLP-----GIVEQLFYELSLADLISSRPV-T--SPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGK 232 (497) Q Consensus 161 ~~i~~~~~~-----~ii~~~~~~~~l~~~~~~~~~-~--~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~k 232 (497) ..+.+++.. ..+..+...-++-..--.++. + +....+. + .+ .-+.|.. ......++..-.+...- T Consensus 149 ~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v---~-E~--~~~~~~~-~~f~~v~~~~~k~~~~~ 221 (418) T protein:vir:10 149 ADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAV---A-EG--AQKPTSD-LKFNLKNQPVRTIAHLF 221 (418) T ss_pred hhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeee---c-cC--ccccccc-cceeeEEEeeeeEEEee Confidence 111111111 111111111111111001111 1 1222221 1 11 1133432 22333444444433322 Q ss_pred -eeeechhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHh---------hhhccCCCccccceeccccccccchhhhhhhHH Q lcl|Aclame:pro 233 -VANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEV---------QLLAGGGYPGVNGLLQRSTGFTASSASSLFGAT 301 (497) Q Consensus 233 -ia~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~---------a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~ 301 (497) |.-.+ + +.. .+- ..+...|...++.++-.++=. -+++..+... .... .......... T Consensus 222 ~is~el-l-~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~------~~~~---~~~~~~~~~i 289 (418) T protein:vir:10 222 KASRQI-L-DDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFM------PSIT---LANATPIDKI 289 (418) T ss_pred hhhHHH-H-HhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccc------cccc---ccccccHHHH Confidence 22221 1 112 233 356677777777777776622 2333333221 1111 1111122233 Q ss_pred HHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehh Q lcl|Aclame:pro 302 SATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPR 381 (497) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 381 (497) ................|+++...+..+...++..|.+..........+... . - .++.++. T Consensus 290 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~-G------------------~-pV~~~~~ 349 (418) T protein:vir:10 290 RLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLW-N------------------L-PVVETQA 349 (418) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceec-c------------------e-eeEEcCC Confidence 333334444444455678888888888887777666554322111111000 0 0 0111110 Q ss_pred HHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCC----cCcEEE---eeccceEEEEEeccccE-EE Q lcl|Aclame:pro 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP----LGTILV---GHFAPSVIQTARREGVT-MQ 453 (497) Q Consensus 382 ~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~----~~~~~~---gd~~~~~~~i~~r~~~~-i~ 453 (497) + ..|..++.+. ..... -..-.|+.+..++... .+.+.+ ..+. ..+.+-..+. ++ T Consensus 350 -------~--p~~~~~~gd~-s~~~~-----~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d---~~~~~~~a~~~~~ 411 (418) T protein:vir:10 350 -------M--TANEFLVGAF-SMAAQ-----IFDRMEIEVLLSTENVDDFEKNMVSIRAEERLA---LAVYRPESFVTGA 411 (418) T ss_pred -------C--CCCcEEEeec-cceEE-----EEEecceEEEEecccchhhhcCceEEEEEEeec---cEEecccceEEEE Confidence 0 1122222111 00000 0001244444443321 122111 1111 1111111111 22 Q ss_pred EeccchhhhhcC Q lcl|Aclame:pro 454 MTNSNGTDFVDG 465 (497) Q Consensus 454 ~~~~~~~~f~~~ 465 (497) +..-. .+ T Consensus 412 ~~~~~-----~g 418 (418) T protein:vir:10 412 LVEQA-----GG 418 (418) T ss_pred eccCC-----CC Confidence 22222 12 No 231 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=45.37 E-value=0.77 Score=21.20 Aligned_cols=387 Identities=12% Similarity=0.071 Sum_probs=112.7 Q ss_pred CchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDI-NADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPE 79 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~ 79 (497) +-.+.++.++..+..++++.. .++..++...+.++++.+..+++...+.++..+. ..................... T Consensus 8 ~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 84 (415) T protein:vir:94 8 QSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKE---KDGTSENNQQSVEVNEASTYR 84 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHhhhhccccccccchhhHH Confidence 445566666666666655553 3333344445555555555555443322211111 000000000000000000000 Q ss_pred HhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccC Q lcl|Aclame:pro 80 VEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTF 159 (497) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (497) .. ..................+.+.... .. ........ .. ....++...- T Consensus 85 ~~-~~~~~~~~~~~~~~~~~~e~~~~~~---------~~----------~~~~~~~~---------~~--~~~~~g~~~i 133 (415) T protein:vir:94 85 NQ-ANINDLGISIQNTKVTSQEVRDFTE---------YL----------ETRNDIQG---------GS--LKTDSGFVVI 133 (415) T ss_pred HH-HHHHHHHhhhhhhhhhHHHHHHHHH---------Hh----------hhhhhhhh---------hc--cccccccccC Confidence 00 0000000000000000000000000 00 00000000 00 0000000000 Q ss_pred CcccccchhhH-----HHHHHHhhhhHHhhcccee---cCCC-ceEEEEeecCCccceeeccccccccccccceeEEeee Q lcl|Aclame:pro 160 APGILPTFLPG-----IVEQLFYELSLADLISSRP---VTSP-NLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) Q Consensus 160 g~~i~~~~~~~-----ii~~~~~~~~l~~~~~~~~---~~~~-~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~ 230 (497) -..+.+.+... .+..+...-++-.....++ .++. ...+.- .+ .-..|.........++..-.+.. T Consensus 134 P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~----Eg--~~~~~~~~~~~~~i~~~~~k~~~ 207 (415) T protein:vir:94 134 PEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVE----EL--EENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred cHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceecc----cc--ccccccccccceeeEeeheeeee Confidence 01111111111 1111111111111111111 1111 111111 11 11222221112222222222222 Q ss_pred e-eeee-echhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHH Q lcl|Aclame:pro 231 G-KVAN-ALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNV 308 (497) Q Consensus 231 ~-kia~-~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (497) . ++.- .+.-|.. .-...+...|...++.++..++-...-.|.+.+...+..........+ ....+.+........ T Consensus 208 ~~~is~ell~ds~~--~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~ 284 (415) T protein:vir:94 208 YFRISREAIEDAKV--NVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVK-KAKSLDDIKDAINLN 284 (415) T ss_pred echhhHHHHhhchH--HHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccc-cccchHHHHHHHHhh Confidence 1 1111 1122221 112345666777777777777665555565544433333322222222 222333444444454 Q ss_pred hhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHH Q lcl|Aclame:pro 309 KFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL 388 (497) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~ 388 (497) .......+.|+++...+..+...++..|.+...+......+........+... ...........+++.+...+ T Consensus 285 ~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~--~~~~~~~~~~~i~~gd~~~~----- 357 (415) T protein:vir:94 285 VKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILP--DEVLGQKGNNTLIIGNLKDA----- 357 (415) T ss_pred hhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEec--ccccCCCCccEEEEEehhcc----- Confidence 55555677788898888888888887777665444322221111000000000 00000000000111111100 Q ss_pred HhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEE---------Eeccch Q lcl|Aclame:pro 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQ---------MTNSNG 459 (497) Q Consensus 389 lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~---------~~~~~~ 459 (497) ++. ..-.|+.|-.++..-..+. +....|.+..+. .+.... T Consensus 358 --------~~~--------------~~~~~~~v~~~~~~~~~~~---------~r~~~r~d~~~~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:94 358 --------IVL--------------FDRSQYQASWTDYMHFGEC---------LMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred --------EEE--------------EeecceEEEEeccccCceE---------EEEEEEeccEEeccccEEEEEEeccCC Confidence 000 0011233333332211111 111112222221 111110 Q ss_pred hhhhcCceEEEE Q lcl|Aclame:pro 460 TDFVDGKVTVRA 471 (497) Q Consensus 460 ~~f~~~~v~~r~ 471 (497) ..+..++-+ T Consensus 407 ---~~~~~~~~~ 415 (415) T protein:vir:94 407 ---GEGDLGLEA 415 (415) T ss_pred ---CCCccccCC Confidence 011111111 No 232 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=42.10 E-value=0.9 Score=20.84 Aligned_cols=329 Identities=13% Similarity=0.031 Sum_probs=118.8 Q ss_pred HHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhh-hhhhhhcccccCCcccccchhh- Q lcl|Aclame:pro 92 LARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAA-IGQNPFGSTGTFAPGILPTFLP- 169 (497) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~i~~~~~~- 169 (497) .-...+. +.+... ..+.......+........+....+....... ..-.....+-.+|+.+--+.+. T Consensus 1 ~~~~~~~----~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~ 69 (514) T protein:vir:10 1 MYTQDKT----KDIMKK-------SFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNR 69 (514) T ss_pred CCccchh----hHHHhh-------hhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhcc Confidence 0000000 000000 00000000001000011111111110000000 0000111222334444333222 Q ss_pred HHHHHHH--hhhhHHhhccceecCCCceEEEEee--cCCccceeeccccccccccccceeEEeeeeeeeeechhhHHH-H Q lcl|Aclame:pro 170 GIVEQLF--YELSLADLISSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-L 244 (497) Q Consensus 170 ~ii~~~~--~~~~l~~~~~~~~~~~~~~~~p~~~--~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~el-l 244 (497) ++..+.. +...+..-+...++.+.-..|-... +..+.+.+++|++-.+.+++++...++..+-++.-..+|.-+ + T Consensus 70 ~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l 149 (514) T protein:vir:10 70 DLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQR 149 (514) T ss_pred ceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhh Confidence 2221111 2224455566666666544444433 333457899999999999999999999999998877666544 3 Q ss_pred hhH-HHHHHHHHHHHHHHHHHHHHhhhhccCCCcc---------ccceeccccccccchhhhhhhHHHHHHHHHhhhhhc Q lcl|Aclame:pro 245 RDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG---------VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) Q Consensus 245 ~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~g~~~---------~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) .++ .+.+....++-.-.++..++.+.++|+..-. ..||.+..... T Consensus 150 ~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~------------------------- 204 (514) T protein:vir:10 150 ANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPE------------------------- 204 (514) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCC------------------------- Confidence 344 5777788888888999999999999985321 12222211100 Q ss_pred chhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhcccC Q lcl|Aclame:pro 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~G 394 (497) +....-+.. .-..++.-.......+|..++-.+|+..+.+.+..--...- T Consensus 205 ------------------NvIDarG~~------------Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~q 254 (514) T protein:vir:10 205 ------------------NHIDLRGGR------------LSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQ 254 (514) T ss_pred ------------------CeEecCCCC------------ccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcc Confidence 000000000 00011111111222234445555555555554432211111 Q ss_pred ccccccc------------ccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhh Q lcl|Aclame:pro 395 QYMGGNF------------FGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDF 462 (497) Q Consensus 395 ~~~~~~~------------~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f 462 (497) |-+.... ..+..|.-...++++++.+-......+. +++.. ....+++.++...+..| T Consensus 255 RV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~-----~~~Ap------~~~~va~svT~~~~g~~ 323 (514) T protein:vir:10 255 RVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPV-----SPTAP------TAPQLSATVTPDGGGLW 323 (514) T ss_pred eEEeecCccceeeeeeccceeEeccceeecCCeeecccccCccCCcc-----CCcCC------CCCcceEEEecCccccc Confidence 1111100 0000010011111222221111111110 11100 00111122211111001 Q ss_pred h-------------cC----ceEEEEEeeeccEeecccce-----------EEEEecCCCCCC Q lcl|Aclame:pro 463 V-------------DG----KVTVRAEERLGLLVYRPSAF-----------QLIQLKKGATGS 497 (497) Q Consensus 463 ~-------------~~----~v~~r~~~r~~~~v~~~~Af-----------~~~~~~~~a~~~ 497 (497) . .. ...|++...-+..=-.|+.+ +.|++...+-++ T Consensus 324 ~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~ 386 (514) T protein:vir:10 324 HEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQN 386 (514) T ss_pred CcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcc Confidence 0 00 11234433333333334444 233333223333 No 233 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=40.31 E-value=0.98 Score=20.64 Aligned_cols=311 Identities=13% Similarity=0.102 Sum_probs=112.0 Q ss_pred HHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccchhhHH Q lcl|Aclame:pro 92 LARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGI 171 (497) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i 171 (497) .......+... ..+......+.. ..+.+.+|..... .+..+ .++....+|.-.+-.| T Consensus 1 mtn~iesq~A~---------~eF~~vL~~N~G-------~S~~k~AW~A~L~------E~GVt-iTD~~~~LP~~lv~sI 57 (318) T protein:vir:86 1 MTNFIESQNAV---------TEFFDVLKKNSG-------KSEIKNAWNAKLA------ENGVT-ITDTTFQLPRKLVESI 57 (318) T ss_pred CcchhhhhHHH---------HHHHHHHhccCC-------chhhhhhhhhhhh------hcCce-eeccchhccHHHHHHH Confidence 00000000000 000000000000 0011222211110 11111 1112223443344445 Q ss_pred HHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhhHHHHhh---H- Q lcl|Aclame:pro 172 VEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRD---A- 247 (497) Q Consensus 172 i~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~d---s- 247 (497) -..+....+++....+...+.--++...++. ..|.-.-.|.++.+...+|..-++.+--++....+ -++..| + T Consensus 58 ~~A~~n~n~v~~vfHVT~~~~~~V~~s~~s~--AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sY 134 (318) T protein:vir:86 58 NTALLNTNPVFKVFHVTNVGALLVSRSFDSS--AEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSY 134 (318) T ss_pred HHhhhccCcceeeeeeccchhhhhhhhhhhh--hhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhH Confidence 5555666777766555555443333333332 23455566778877777777777766444333333 233333 2 Q ss_pred HHHHHHHHHHHHHHHH-HHHHhhhhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhh Q lcl|Aclame:pro 248 PELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVA 326 (497) Q Consensus 248 ~~l~~~i~~~la~~~~-~~~d~a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (497) ..+.+||..+|+.++. +..|.+++-|+|.+....+..-+....... T Consensus 135 sel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k--------------------------------- 181 (318) T protein:vir:86 135 SELYNLIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKK--------------------------------- 181 (318) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHH--------------------------------- Confidence 3588999999999998 888999999999876555433221110000 Q ss_pred hhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHH-HHHHHHhcccCc-ccccccccc Q lcl|Aclame:pro 327 SLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW-ELLRLTKDANGQ-YMGGNFFGN 404 (497) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~-~~l~~lkd~~G~-~~~~~~~~~ 404 (497) ..+.....|.. |. ...+-.+....+...+ ....+....+. +-|..|+-+... .+-...-.+ T Consensus 182 -~Ttkaksagtt----------pf----anaieeavdfvrptag--rrylivkaedrkalldelrqatanahvriknddt 244 (318) T protein:vir:86 182 -ITTKAKSAGTT----------PF----ANAIEEAVDFVRPTAG--RRYLIVKAEDRKALLDELRQATANAHVRIKNDDT 244 (318) T ss_pred -HhhhhhccCCC----------ch----hhHHHHHHhhhccCCC--ceEEEEeecchHHHHHHHHhhcccceeEEeccch Confidence 00000000000 00 0000001111111100 01111121211 222222211100 000000000 Q ss_pred cccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecccc Q lcl|Aclame:pro 405 AYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSA 484 (497) Q Consensus 405 ~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~~A 484 (497) .....+ -+.-+-|+.... .-..-++-|.+ |.|. -+++ ...+.-.|.+|.-.+.++.-..++|---.| T Consensus 245 eiasev----gvdeiivytgsk-alkptvlvdqk---yhid-mqdl----tkvdafewktnsnmilvetltsghvetyna 311 (318) T protein:vir:86 245 EIASEV----GVDEIIVYTGSK-ALKPTVLVDQK---YHID-MQDL----TKVDAFEWKTNSNMILVETLTSGHVETYNA 311 (318) T ss_pred hhhhhc----Ccceeeeeeccc-cccceeeeccc---eecc-hhhh----hhhhcceeccCCceEEEeecccCcceeecC Confidence 000000 000000111000 00111222322 3221 1111 112223466666666777666666654444 Q ss_pred eEEEEec Q lcl|Aclame:pro 485 FQLIQLK 491 (497) Q Consensus 485 f~~~~~~ 491 (497) =+.+++. T Consensus 312 gavitvs 318 (318) T protein:vir:86 312 GAVITVS 318 (318) T ss_pred ceeEEeC Confidence 4444444 No 234 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=38.52 E-value=1.1 Score=20.44 Aligned_cols=352 Identities=15% Similarity=0.049 Sum_probs=119.5 Q ss_pred HHHHHHHHHHHHHHHHHHHH-hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 61 KSLGGADAAKDGLDNDIPEV-EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFA 139 (497) Q Consensus 61 ~~~~~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (497) -. ....+.+..++..+ +.+...++ ....+..-....+....+.. .+.+.... T Consensus 1 ~~----~~~~e~l~~kw~p~l~~~~~~~i-----------------~~~~~~~v~a~l~enq~~~~------~~~~~~l~ 53 (470) T protein:vir:10 1 MQ----MFNSEYLQEKWAPILDYDGLDPI-----------------KDSHRRSVTAVLLENQEKEL------REERNFLS 53 (470) T ss_pred CC----cchhHHHHHhhhhhhcCCccchh-----------------cchhhhhhhhhhhhhhHHHH------hhccchhh Confidence 00 00011111111110 10000000 00000000000000000000 00000000 Q ss_pred hhhhhhh---hhhhhhhcccccCCcccccchhhHHHHHHH---hhhhHHhhccceecCCCceEEE--E--eecCCcc--- Q lcl|Aclame:pro 140 DGETAPA---AIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLSYL--T--ESAAHNN--- 206 (497) Q Consensus 140 ~~~~~~~---~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~p--~--~~~~~~~--- 206 (497) +...... ........++.+++ +..+.+.++...| ....-.+++.++||++++.-+- | ....+++ T Consensus 54 e~~~~~~~~~~~~~~i~~st~t~~---v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~Eaf 130 (470) T protein:vir:10 54 EAPNVNTNSGATAGFSADATAAGP---VAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEAL 130 (470) T ss_pred hhhhcccccccccccccccccccc---ccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCcccee Confidence 0000000 00000111111111 2234455555554 4455678999999998764332 1 1110000 Q ss_pred -----ceeec---------------------------------------------------------c------cccccc Q lcl|Aclame:pro 207 -----AAAVA---------------------------------------------------------E------AGTYPF 218 (497) Q Consensus 207 -----a~~v~---------------------------------------------------------E------g~~~~~ 218 (497) ..|-+ | +..+++ T Consensus 131 fnEA~T~fSG~~~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~E 210 (470) T protein:vir:10 131 FNEADTAFSGQPDGLDDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQ 210 (470) T ss_pred eecCCcccCcccccccccccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccce Confidence 00100 1 122344 Q ss_pred ccccceeEEeeeeeeeeechhhHHHHhh--HH---HHHHHHHHHHHHHHHHHHHhhhhccCC----Cccccceecccccc Q lcl|Aclame:pro 219 SSEEFARVYEQVGKVANALTITDEGLRD--AP---ELFNFVQGRLLEGIQRKEEVQLLAGGG----YPGVNGLLQRSTGF 289 (497) Q Consensus 219 s~~~~~~v~~~~~kia~~~~iS~ell~d--s~---~l~~~i~~~la~~~~~~~d~a~l~G~g----~~~~~Gil~~~~~~ 289 (497) ...++++++..++.-+-...+|-||.+| +. |.++.|.+-|+..|..-|++.||.--- .++..|+.+ .+.+ T Consensus 211 MaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~-~Gv~ 289 (470) T protein:vir:10 211 MAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAA-AGTF 289 (470) T ss_pred eeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccc-cceE Confidence 4455666666666666678899999998 42 579999999999999999998874210 011111100 0000 Q ss_pred ccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhh Q lcl|Aclame:pro 290 TASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTL 369 (497) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (497) .... .....|. . ...+ ...+.-...+-.-.+... T Consensus 290 Dl~~-------------------~~~gr~~--~------e~~~-------------------~l~~~i~~ean~i~~~t~ 323 (470) T protein:vir:10 290 DLDT-------------------DSNGRWS--V------EKFK-------------------GLIFQIERDANAIAQRTR 323 (470) T ss_pred Eeec-------------------ccchhHH--H------HHHH-------------------HHHHHHHHHHHHHHHhhc Confidence 0000 0000000 0 0000 000111122223335566 Q ss_pred ccCCceEEEehhHHHHHHHHhcccCcccccccc-cccc--ccccccccccc-ccceeecCCCCcC------cEEEeeccc Q lcl|Aclame:pro 370 FQTPNAVVMNPRDWELLRLTKDANGQYMGGNFF-GNAY--GNPVNGGKNIW-GVPVVTTPLIPLG------TILVGHFAP 439 (497) Q Consensus 370 ~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~-~~~~--~~~~~~~~~l~-G~pvv~s~~~~~~------~~~~gd~~~ 439 (497) +...+.+++++.....|.. .|-.-+.+.. .... .........|. |++|+.++++..+ -+++|.=-. T Consensus 324 r~~~n~~i~S~~Va~~La~----sG~l~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~ 399 (470) T protein:vir:10 324 RGKGNMILCSADVASALTM----AGVLDYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGS 399 (470) T ss_pred cccceEEEEchhHHhHhhh----ccccccccccccccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecC Confidence 7778888888888877742 1200000000 0000 00011112333 4788888764432 233332100 Q ss_pred eEE--EEEeccccEEEEecc-chhhhhcCceEEEEEeeeccEeecccceEEEE-ecCCCCCC Q lcl|Aclame:pro 440 SVI--QTARREGVTMQMTNS-NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQ-LKKGATGS 497 (497) Q Consensus 440 ~~~--~i~~r~~~~i~~~~~-~~~~f~~~~v~~r~~~r~~~~v~~~~Af~~~~-~~~~a~~~ 497 (497) ..+ .++-..=+.++..+. +-..|+- -++ +..|++..+ +|=.-..=+ .+....++ T Consensus 400 ~~~~~glfy~PYv~l~~~~~~dp~sfqP-~~g--~~tRY~l~~-NP~~~~~~~~~~~i~~~~ 457 (470) T protein:vir:10 400 SPYDAGLFYCPYVPLQMVRAVGQDTFQP-KIG--FKTRYGLVE-NPFSQGTTQGLGTLTRNS 457 (470) T ss_pred cceecceeeccccccccCCCCCCccccc-eee--eeeeeceee-cCcccCCCcccccccCCC Confidence 000 000000000110000 1112322 222 333444433 222110000 00122233 No 235 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=36.05 E-value=1.2 Score=20.16 Aligned_cols=360 Identities=12% Similarity=0.039 Sum_probs=99.8 Q ss_pred CchHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQL----EAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDND 76 (497) Q Consensus 1 m~~~~~~----~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~ 76 (497) +..+-+. .++.+.|.+...+ +.. ..|.++..+++..+++.+.+.++.... ..+..+............ T Consensus 6 l~~l~e~r~~~~~e~~~L~~~~~~---~~l--t~e~~~~~~~l~~e~~~l~~~i~~~~~---~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:62 6 LSANFEARERATAELRTLTDEFAG---KEM--TDEAREKEERLITAVSDYDARIKRGIE---AIKAIDPVTSLLSGLQGS 77 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhc---ccc--cHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHhhcccc Confidence 2333333 3444555443322 111 112223333333343333333321111 111111111100000000 Q ss_pred HHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccc Q lcl|Aclame:pro 77 IPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST 156 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (497) .. ..... .........+....... .... ..... . .....++. T Consensus 78 ---~~-----~~~~~-----~~~~~~~~~r~~~~~~~---------r~~~---~~~~~----------~---~~t~~~~g 119 (390) T protein:vir:62 78 ---GS-----GAQRS-----ADVDDDATLRAGNLGEA---------RSFE---FAPEK----------R---DGTKAGNP 119 (390) T ss_pred ---cc-----cchhh-----cchHHHHHHhhhhhhhh---------HHHH---hhhhh----------h---cccccCCC Confidence 00 00000 00000000000000000 0000 00000 0 00000000 Q ss_pred ccCCcccccchhhH------HHHHHHhhhhHHhhcc--ceecC--CCceEEEEeecCCccceeeccccccccccccceeE Q lcl|Aclame:pro 157 GTFAPGILPTFLPG------IVEQLFYELSLADLIS--SRPVT--SPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) Q Consensus 157 ~~~g~~i~~~~~~~------ii~~~~~~~~l~~~~~--~~~~~--~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v 226 (497) +.....+....+.+ ++..+-...+... .+ .++.. +....+.-+ + .-+.|.. ......++..- T Consensus 120 ~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~-~~~~~~p~~~~~~~a~wv~E----~--~~~~~~~-~~f~~i~~~~~ 191 (390) T protein:vir:62 120 NVLSRTLYGQLIAQAVERSAIMRGGATTFTTSD-ANPLDFTVITGRSSASIVGE----T--AEIPESY-PATAQRSMGGF 191 (390) T ss_pred ccccccchHHHHHHHHhhhhhhhhcceeeecCC-CceeEEEEEcCCcceeeecc----c--ccccccc-cceeeeEeeee Confidence 00111111111111 1111111111111 01 12221 122222211 1 1223332 22344444444 Q ss_pred EeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHh------hhhccCCCccccceeccccccccchhhhhhhH Q lcl|Aclame:pro 227 YEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEV------QLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) Q Consensus 227 ~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~------a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~ 300 (497) .+...-.-..-.+-+...+--..+...|...++..+..++=. -|++..+..... .... ........+ T Consensus 192 k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~--~~~~-----~~~~~~~~~ 264 (390) T protein:vir:62 192 KYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATAT--FLAT-----DTDSKVSDA 264 (390) T ss_pred eEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccc--eecc-----cccccchHH Confidence 444322222222222222212345556666666666655532 244433221111 0000 011111122 Q ss_pred HHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccC-CceEEEe Q lcl|Aclame:pro 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQT-PNAVVMN 379 (497) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~n 379 (497) .......+...+.....|+++...+..+...++..+.+...+......+........... ...+ ..+++.+ T Consensus 265 l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~--------~~~p~~~i~~gd 336 (390) T protein:vir:62 265 LIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETD--------DGMPADKILFAD 336 (390) T ss_pred HHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEe--------cCCCCccEEEee Confidence 222223333333445568888888888888888877776665544333221111100000 0000 1111111 Q ss_pred hhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCC--CcCcE-EEeeccceEEEEEeccccEEE-Ee Q lcl|Aclame:pro 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PLGTI-LVGHFAPSVIQTARREGVTMQ-MT 455 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~--~~~~~-~~gd~~~~~~~i~~r~~~~i~-~~ 455 (497) .+ +|+.. .-.++-|-.+... ..+.+ +.+..... ..+.+...+.+. +. T Consensus 337 ~s-------------~~~i~---------------~~~~~~v~~~~~~~~~~~~~~~~~~~r~d-~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 337 LS-------------KYRVR---------------FAGSLRVDRSVDAKFSTDQIVYRFLQRAD-GLLVDARGAKVLTVT 387 (390) T ss_pred cc-------------ceeEE---------------eecceEEEeeccccccCCcEEEEEEEEeC-cEeechhheEEEEee Confidence 11 01100 0112223322221 11222 22221111 122333333332 22 Q ss_pred ccc Q lcl|Aclame:pro 456 NSN 458 (497) Q Consensus 456 ~~~ 458 (497) .-+ T Consensus 388 ~~a 390 (390) T protein:vir:62 388 PGA 390 (390) T ss_pred cCC Confidence 222 No 236 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=35.88 E-value=1.2 Score=20.14 Aligned_cols=140 Identities=11% Similarity=0.048 Sum_probs=10.4 Q ss_pred CchHHH------HHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQ------LEAQGRQLAKSIKDINAD----ETKTAAEKKEAL-----------AKIEPDFKAHQAEVEAHERAQEM 59 (497) Q Consensus 1 m~~~~~------~~~~~~~l~~~~~~~~~~----~~~~~~e~~~~~-----------~~~~~~~~~~~~~~e~~e~~~e~ 59 (497) +|.... .....+++.+.+...... ......++...+ .....+.+..+...+......+. T Consensus 544 ~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~ 623 (705) T protein:vir:88 544 GGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEA 623 (705) T ss_pred ccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111110 101111111100000000 000000000000 00000000111111100000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhh-hhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHH Q lcl|Aclame:pro 60 LKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARA-VIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAF 138 (497) Q Consensus 60 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (497) ..+..+++....+++ .+..+ .....+..... .....+.... .......... .... .............. T Consensus 624 q~~q~E~q~~q~e~e--~~~~~---~~~~~~e~~~~~a~~~~~~~~~---e~e~~~~e~e-~~~e-~~q~~~~~~~~~~~ 693 (705) T protein:vir:88 624 QMKQVEAQIRLAEIE--LKKQE---AVLQQREMALKEAELQLERDRF---TWERARNEAE-YHLE-ATQARAAYIGDGKV 693 (705) T ss_pred HHHHHHHHHHHHHHH--HHHHH---HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH-HHHH-HHHHHHHHHHHHhH Confidence 000000000000000 00000 00000000000 0000000000 0000000000 0000 00000000000000 Q ss_pred Hhhhhhhhhhhh Q lcl|Aclame:pro 139 ADGETAPAAIGQ 150 (497) Q Consensus 139 ~~~~~~~~~~~~ 150 (497) ..........++ T Consensus 694 ~~~~k~~~~~rr 705 (705) T protein:vir:88 694 PETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHHhcC Confidence 000011111111 No 237 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=35.12 E-value=1.2 Score=20.05 Aligned_cols=378 Identities=10% Similarity=-0.007 Sum_probs=83.6 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKI--EPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~--~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~ 78 (497) .....+++++..++.+..+++.. +..+++..+++. .++.....+ +.+++..+++.+.+++++++.... T Consensus 9 ~~~~~el~~~l~eL~e~~~~l~~----~~~el~~~~ee~~~~e~~~~~~~------~~~~l~~~i~~l~~~i~~~~~~~~ 78 (397) T protein:vir:96 9 NKQIKERSSEIDKLLSQRSDLEK----QENDLERALEEAKTDEEISTVSD------SADDLEKQVKDLDEKIAELQKEKQ 78 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhhhhHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHH Confidence 22233444444444333333332 222222222211 111111111 122233333333333333332222 Q ss_pred HHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccccc Q lcl|Aclame:pro 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) ........... ........ .+......................... +............ .......... T Consensus 79 ~l~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~-~~~~~~vp~~ 147 (397) T protein:vir:96 79 DLEDELAKAAD-PTDQKPKD-GEKRKMKKFKVTEEELAEKRSAINAFV--------KSKGAEKRDGFTS-VEGGALIPQE 147 (397) T ss_pred HHHHHHHhhhh-hhhhhhHH-HHHHHHHHHhhhhHHHHHHHHHHHHHH--------Hhhhhhhhhcccc-cccccchhHH Confidence 22111110000 00000000 000000000000000000000000000 0000000000000 0000000000 Q ss_pred CCccccc-chhhHHHHHHHhhhhHHhh---ccceecCCCceEEEEeecCCccceeeccccc--cccccccceeEEeeeee Q lcl|Aclame:pro 159 FAPGILP-TFLPGIVEQLFYELSLADL---ISSRPVTSPNLSYLTESAAHNNAAAVAEAGT--YPFSSEEFARVYEQVGK 232 (497) Q Consensus 159 ~g~~i~~-~~~~~ii~~~~~~~~l~~~---~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~--~~~s~~~~~~v~~~~~k 232 (497) ....+.. .....+..... ..++... .++...++....+..+.. -..|... ...-+.+...+.--. + T Consensus 148 ~~~~i~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~E~~------~~~~~~~~~~~~i~~~~~~~~~~~-~ 219 (397) T protein:vir:96 148 LLQPQLEPKDIVDLSKYVR-SVPVNSASGKFPVISKSGSKMATVQQLE------KNPQLANPKMVEIDYSVATRRGYI-P 219 (397) T ss_pred HHHHHHHhhhhhhHHHhhh-hccccccceeEEEEeccCCccccccccc------cccccccccccceeecHhHhhcch-h Confidence 0000110 00001111111 0011100 111111111111111111 0111111 111111111110000 0 Q ss_pred eee-echhhHHHHhhHHHHHHHHHHHHHHHHHHHHHhhhh--ccCCCccccceeccccccccchhhhhhhHHHHHHHHHh Q lcl|Aclame:pro 233 VAN-ALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLL--AGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVK 309 (497) Q Consensus 233 ia~-~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~a~l--~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (497) +.- .+.-|..-+. ..+...|...++......+=...= ...|......|..... ... T Consensus 220 ~s~ell~ds~~~l~--~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~d~~~~~~~-------------------~~~ 278 (397) T protein:vir:96 220 ISQEMIDDASYDVT--GLIADEIQDQSLNTKNADIAAVLKTATAKSVVGVDGLKDLIN-------------------KEI 278 (397) T ss_pred hHHHHHhhhHHHHH--HHHHHHHHHHHHHHHHHHHhhcccccccccccchHHHHHHHH-------------------Hhh Confidence 000 0111111011 123334444444444333322211 2223333444432110 001 Q ss_pred hhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHH Q lcl|Aclame:pro 310 FPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT 389 (497) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~l 389 (497) .......|+++...+..+...++..|.+...+......+...... .+++.+.... .. T Consensus 279 -~~~~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~------------------pv~~~~~~~~----~~ 335 (397) T protein:vir:96 279 -KKVYDVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGK------------------EVVVLDDDVI----GK 335 (397) T ss_pred -hhhcCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCccccccc------------------ceEEeccccc----CC Confidence 111234678888888888888877777665544333222111000 0111110000 00 Q ss_pred hcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 390 KDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 390 kd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) -..+...+|.+ +..... .....|+-|..++.....+. +....|.+..+. +.+.|..-.+.. T Consensus 336 ~~~~~~~~~gd-~~~~~~-----~~~~~~~~~~~~~~~~~~~~---------~~~~~r~d~~~~----~~~a~~~~~~~~ 396 (397) T protein:vir:96 336 SVGNVVGFIGD-AKAFAS-----FFDRKQVSVSWVDNNIYGQL---------LAGIIRYDVKAT----DKKAGFYVTFTI 396 (397) T ss_pred CCCceEEEEee-hhcceE-----eEeecceEEEEeccccccee---------EEEEEEEccEEe----cccceEEEEeec Confidence 00000011110 000000 00011233333332211111 112222222221 111222111111 Q ss_pred EEEeeec Q lcl|Aclame:pro 470 RAEERLG 476 (497) Q Consensus 470 r~~~r~~ 476 (497) . T Consensus 397 ------a 397 (397) T protein:vir:96 397 ------G 397 (397) T ss_pred ------C Confidence 1 No 238 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=33.89 E-value=1.3 Score=19.91 Aligned_cols=380 Identities=10% Similarity=0.025 Sum_probs=102.5 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) .-.+..++........++++... ..++..+..+...++..+++...+.+.. + .......+.+++...+...... T Consensus 127 ~a~I~~vke~~~~e~~~~~~~~a-~~ee~~e~~~k~~el~a~l~~~~~~~~~--~---~~e~~~~l~a~~~~~~~~~~~~ 200 (517) T protein:vir:97 127 NAVVTYFREEKKKEENKMTFDQN-LMQELLDAKKLAADLNAKLKERENGGDN--A---ALKTVSELAANLMKQRESEKIL 200 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhHHHHHHHHHHHHHHHHH--H---HHhhhhhhhhhHHHHHHhhhhc Confidence 22222222211111111111110 0011111111111111111111111110 0 0111111111111111000000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) .. ..... ........................ ....... .. ...........+............... T Consensus 201 ~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~-~~----~~~~~~~~~~~p~~~~~~i~~~~~~~~ 267 (517) T protein:vir:97 201 GV-EALKV-TPEATEFLKTREAEVAYMSASLTK------DPKAAWT-AE----LKERGISGMPAPAGILKRIQDAVNDEG 267 (517) T ss_pred cc-ccccc-cchhhHHHHHHHHHHHHHHhcccc------cccceee-ee----cccccccccccchHHHHHHHHhhhhhc Confidence 00 00000 000000000000000000000000 0000000 00 000000000000000000000000000 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) . +++... ..++..+......+..... ++..+. .. .|.. ......++...++...--...-.+. T Consensus 268 ~---------i~~~~~-~~~i~~~~~~~~~~~~~a~-~~~eG~-~k----p~s~-~tf~~~~~~~~~ia~~~~~S~qll~ 330 (517) T protein:vir:97 268 S---------LLPFIR-HENLPTLVVGGDNALTQGT-GHTTGT-DK----TESN-ITLQTRVLTPQYVYKYIKLPKIVMN 330 (517) T ss_pred c---------ceeeee-eccccceeeecccccceee-eeecCC-cc----cccc-cceeeEEeeHhhhhhhhhhhHHHHH Confidence 0 000000 0111111001111111111 111111 00 1110 1111111111111111011112344 Q ss_pred HHHHhh---H-HHHHHHHHHHHHHHHHHHHHhh---hhccCCCccccceeccccccccchhhhhhhHHHHHHHHHhhhhh Q lcl|Aclame:pro 241 DEGLRD---A-PELFNFVQGRLLEGIQRKEEVQ---LLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) Q Consensus 241 ~ell~d---s-~~l~~~i~~~la~~~~~~~d~a---~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) +..++| - ..+.+.|...|++....++=.- -.++.|.-. +.+.....+...+..... .......... .. T Consensus 331 Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~---~a~~~~~~~~~~~~~~~d-~i~~l~~a~~-~a 405 (517) T protein:vir:97 331 SNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYP---VVGDAWATNVTGTTNIQE-LLEKLSVATP-KA 405 (517) T ss_pred HhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccc---cccccccccccccchHHH-HHHHHHHHhh-hc Confidence 443331 2 3455666666666655444221 122222111 111111111111111111 1111111111 12 Q ss_pred cchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHHhccc Q lcl|Aclame:pro 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDAN 393 (497) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~lkd~~ 393 (497) ....|+++...+..++..|+..|.+.+++......+...... + T Consensus 406 ~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~-----------------------~-------------- 448 (517) T protein:vir:97 406 ADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGF-----------------------N-------------- 448 (517) T ss_pred cCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCc-----------------------c-------------- Confidence 245688999999999999998888876543322211110000 0 Q ss_pred Cccccccccccccccccccccccc-ccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceE---E Q lcl|Aclame:pro 394 GQYMGGNFFGNAYGNPVNGGKNIW-GVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVT---V 469 (497) Q Consensus 394 G~~~~~~~~~~~~~~~~~~~~~l~-G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~---~ 469 (497) ... +.++ |-..+.+ ..+..+++++....+..+||. +....|..+++. + T Consensus 449 ----------------~~~-~~~~~~~~~~~~---~~~y~i~~~~g~~~~~~fd~~--------~n~~~f~~~~~~~g~i 500 (517) T protein:vir:97 449 ----------------RLV-QSVAVDEKTAVS---LSGYVTNGSRGMEFEQGTILV--------ENNKEYLFEMPISGSL 500 (517) T ss_pred ----------------ccc-cccccCceeEee---ccccEEEeecceeeeeeeecc--------cCceeEeeeeeecccc Confidence 000 0001 1001110 113344555554334445443 223347788776 9 Q ss_pred EEEeeeccEeecccceE Q lcl|Aclame:pro 470 RAEERLGLLVYRPSAFQ 486 (497) Q Consensus 470 r~~~r~~~~v~~~~Af~ 486 (497) |+++|+-+.|++|-.-- T Consensus 501 ~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 501 EYKGTTAYGTYTPPVAG 517 (517) T ss_pred ccccceEEEEEcCCCCC Confidence 99999999999998655 No 239 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=32.84 E-value=1.4 Score=19.79 Aligned_cols=354 Identities=13% Similarity=0.081 Sum_probs=129.0 Q ss_pred HHHHHHHHHHHHH-hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 68 AAKDGLDNDIPEV-EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPA 146 (497) Q Consensus 68 a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (497) =.++.+..++..+ +.+...++... .+..+...-. +...+... ..............+.+........ T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~-~~~~i~~~~~------en~~~~~~-----~~~~~~~~~~~~~~~~~l~e~~~~~ 68 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGA-SKQAIIAKIF------ENQEQDIL-----TAPEYRDEKISEAFGSFLTEAEIGG 68 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhh-hhHHHHHHHH------HHHHHHhh-----hcccccchHHHHHHhhhcchhccCC Confidence 0111222222211 11111111000 0000000000 00000000 0000000000011111111000000 Q ss_pred h---hhhhhhcccccCCcccccchhhHHHHHHHh---hhhHHhhccceecCCCce-------EEEEeecC---------- Q lcl|Aclame:pro 147 A---IGQNPFGSTGTFAPGILPTFLPGIVEQLFY---ELSLADLISSRPVTSPNL-------SYLTESAA---------- 203 (497) Q Consensus 147 ~---~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~---~~~l~~~~~~~~~~~~~~-------~~p~~~~~---------- 203 (497) . .......+++++ -+..+.+.++..+++ ...-.+++.++||++++. .|+..... T Consensus 69 ~~~~~~t~i~~~~~t~---~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~ 145 (519) T protein:vir:10 69 DHGYDATNIAAGQTSG---AVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPM 145 (519) T ss_pred ccccCccccccccccc---cccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccc Confidence 0 000001111111 123455666666643 445578999999987642 22211100 Q ss_pred -Ccccee---------------------------------------------------------------------eccc Q lcl|Aclame:pro 204 -HNNAAA---------------------------------------------------------------------VAEA 213 (497) Q Consensus 204 -~~~a~~---------------------------------------------------------------------v~Eg 213 (497) ...+.| +++| T Consensus 146 nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~g 225 (519) T protein:vir:10 146 YAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEG 225 (519) T ss_pred cccccccCccccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccc Confidence 000000 0111 Q ss_pred -----------------cccccccccceeEEeeeeeeeeechhhHHHHhh--HH---HHHHHHHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 214 -----------------GTYPFSSEEFARVYEQVGKVANALTITDEGLRD--AP---ELFNFVQGRLLEGIQRKEEVQLL 271 (497) Q Consensus 214 -----------------~~~~~s~~~~~~v~~~~~kia~~~~iS~ell~d--s~---~l~~~i~~~la~~~~~~~d~a~l 271 (497) ..+++...++++++..++.-+-...+|-||.+| +. |.++.|.+-|+..|..-|++.|| T Consensus 226 msTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii 305 (519) T protein:vir:10 226 MATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVI 305 (519) T ss_pred cccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 123455566777777777777788999999998 42 58999999999999999999988 Q ss_pred ccCCC---cccc----------ceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccc Q lcl|Aclame:pro 272 AGGGY---PGVN----------GLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAA 338 (497) Q Consensus 272 ~G~g~---~~~~----------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 338 (497) .=-.. -+.. |++..........+.. .. .++. T Consensus 306 ~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw-~~-----------------------e~~k------------ 349 (519) T protein:vir:10 306 DWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARW-AG-----------------------ESFK------------ 349 (519) T ss_pred hhhhhhhhcceeecccCcccccceeecccccccccchH-HH-----------------------HHHH------------ Confidence 41100 0111 3332211111000000 00 0000 Q ss_pred cccccccccccchhhhhhhhHHhhhhh-hhhhccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccc----- Q lcl|Aclame:pro 339 GSGSGVAGSYPTAAEIAENVFDAFVDI-QLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG----- 412 (497) Q Consensus 339 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~----- 412 (497) ....-+-...... +...+...+.+++++.....|... |-.++.+......+...+. T Consensus 350 --------------~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~----g~~~~~~~~~~~~~~~~d~~~~~~ 411 (519) T protein:vir:10 350 --------------ALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAV----DTSVSYAAQGLGQGFNVDTTKAVF 411 (519) T ss_pred --------------HHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhc----cchhccccccccccccccCCCceE Confidence 0000000111111 123334457777888877777543 2111111111111111111 Q ss_pred ccccc-ccceeecCCCCcCcEEEeeccc------eEEEEEeccccEEEEeccchhhhhcCceEEEEEeeeccEeecc--- Q lcl|Aclame:pro 413 GKNIW-GVPVVTTPLIPLGTILVGHFAP------SVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP--- 482 (497) Q Consensus 413 ~~~l~-G~pvv~s~~~~~~~~~~gd~~~------~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~r~~~r~~~~v~~~--- 482 (497) ...|. |++|+.+++.+..-+++|.=-. .||.=+ ..+........ ..|+- -++| ..|++..+ +| T Consensus 412 ~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPY--v~l~~~~~~dp-~sfqP-~~g~--~tRY~l~~-NP~~~ 484 (519) T protein:vir:10 412 AGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPY--VALTPLRGSDP-KNFQP-VMGF--KTRYGIGI-NPFAD 484 (519) T ss_pred EEEecCceEEEecCCCCcceEEEEEecCcccccceeeccc--cccccccccCC-ccccc-eeee--eeeeceee-cCccc Confidence 12333 5899999998876666543110 011111 11111111111 22432 3333 34554433 22 Q ss_pred ------cceEEEEecCCCCCC Q lcl|Aclame:pro 483 ------SAFQLIQLKKGATGS 497 (497) Q Consensus 483 ------~Af~~~~~~~~a~~~ 497 (497) .+-+.--+..-|.+| T Consensus 485 ~~~~~~~~~i~~g~~~~a~~~ 505 (519) T protein:vir:10 485 PAAQAPTKRIQNGMPDIVNSL 505 (519) T ss_pred ccccCccceeccCchhhhccc Confidence 111111111122232 No 240 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=32.55 E-value=1.4 Score=19.75 Aligned_cols=366 Identities=11% Similarity=0.006 Sum_probs=96.2 Q ss_pred CchHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLE----AQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDND 76 (497) Q Consensus 1 m~~~~~~~----~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~ 76 (497) +..+-+.+ ++.+.+.+...+. +... +.++..+++..+++.+.+.++... +..++......... T Consensus 6 l~~l~e~r~~~~~e~~~l~~~~~~~--~~~~---e~~~~~~~l~~e~~~l~~~i~~~~---e~~~~~~~~~~~~~----- 72 (392) T protein:vir:13 6 LSANFEARERATAELRSLTDEFAGK--EMTA---EAREKEERLLTAVADFDGRIKRGI---DAIKATDAVTSLLS----- 72 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcc--cccH---HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHhc----- Confidence 45554444 3444444333221 1111 122223333344444333332111 11111111111100 Q ss_pred HHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhccc Q lcl|Aclame:pro 77 IPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGST 156 (497) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (497) ..+......... .........+....+.. . ....... .. .....++. T Consensus 73 --~~~~~~~~~~~~------~~~~~~~~~r~g~~~~~---------~-------~~~~~~~------~~---~~t~~~~g 119 (392) T protein:vir:13 73 --GLQGSGSGAQRS------ADHDDDAVLRAGNLGEA---------R-------SFEFAPE------KR---DGTKAGNP 119 (392) T ss_pred --ccCCcccchhhh------hhHHHHHHHhccchhhh---------H-------HHHhhhh------hh---cccccCCC Confidence 000000000000 00000000000000000 0 0000000 00 00000111 Q ss_pred ccCCcccccchhhHHHHH---HHhhhhHHhh--cc--ceec-CC-CceEEEEeecCCccceeeccccccccccccceeEE Q lcl|Aclame:pro 157 GTFAPGILPTFLPGIVEQ---LFYELSLADL--IS--SRPV-TS-PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVY 227 (497) Q Consensus 157 ~~~g~~i~~~~~~~ii~~---~~~~~~l~~~--~~--~~~~-~~-~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~ 227 (497) +.....++..++..+... ++....+... .. .++. ++ ....+ + + .++ -+.|.. ......++.... T Consensus 120 ~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~-v--~-E~~--~~~~~~-~~f~~v~~~~~k 192 (392) T protein:vir:13 120 NVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGI-V--G-ETA--EIPESY-PATTQRSMGGFK 192 (392) T ss_pred ccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceee-e--c-ccc--cccccc-cceeeEEeeeee Confidence 111122222222221111 1111111111 11 1122 11 22222 1 1 111 123332 223445555444 Q ss_pred eeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHh--------hhhccCCCccccceeccccccccchhhhhhh Q lcl|Aclame:pro 228 EQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEV--------QLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) Q Consensus 228 ~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~--------a~l~G~g~~~~~Gil~~~~~~~~~~~~~~~~ 299 (497) +...-.-..-.+-....+-...+...|...++..+..++=. -|++..+..... ..+......... T Consensus 193 ~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~-------~~~~~~~~~~~d 265 (392) T protein:vir:13 193 YGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAA-------FGEADADSKVSD 265 (392) T ss_pred EEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccccc-------ccccccccccHH Confidence 44332222112211111111234555556666655555521 134333211110 001111111122 Q ss_pred HHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEe Q lcl|Aclame:pro 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 379 (497) ........+.........|+++...+..+...++..|.+...+......+........+... ......+++.+ T Consensus 266 ~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~-------~~~~~~i~~Gd 338 (392) T protein:vir:13 266 ALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDD-------GMPADKVLFAD 338 (392) T ss_pred HHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcC-------CCCCCcEEEee Confidence 22223333333344556688888888888888887777666554443332211110000000 00000011111 Q ss_pred hhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccch Q lcl|Aclame:pro 380 PRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) Q Consensus 380 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~ 459 (497) ++ +|++. ...|+-+-.+... + ..-++..|....|.+..+ T Consensus 339 f~-------------~~~i~---------------~~~~~~i~~~~~~-----~-~~~~~~~~r~~~r~d~~~------- 377 (392) T protein:vir:13 339 LS-------------KYRVR---------------FAGSLRVDRSVDA-----K-FSTDQIVYRFLQRADGLL------- 377 (392) T ss_pred cc-------------ceeEE---------------eecceEEEeeccc-----c-ccCCcEEEEEEEEeccEE------- Confidence 10 00000 0001111111110 0 011111121212221111 Q ss_pred hhhhcCceEEEEEeeeccEeecccceE Q lcl|Aclame:pro 460 TDFVDGKVTVRAEERLGLLVYRPSAFQ 486 (497) Q Consensus 460 ~~f~~~~v~~r~~~r~~~~v~~~~Af~ 486 (497) ..--.+.+..-.+-+ T Consensus 378 ------------~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 378 ------------VDARGAKVLTVTPAA 392 (392) T ss_pred ------------ecccceEEEEeeccC Confidence 111111111111111 No 241 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=32.20 E-value=1.4 Score=19.71 Aligned_cols=386 Identities=10% Similarity=0.021 Sum_probs=102.2 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) Q Consensus 1 m~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~~~~~~~~~~~ 80 (497) |..+.++..++++..+++++..++...++.+... ++.++++.+.+.++..+.. ..++ .+............ T Consensus 5 lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~---~l~~~~~~l~~~~~~~~~~---~~~~---~~~~~~~~~~~~~~ 75 (401) T protein:vir:44 5 IKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKG---KLAGQVETLNGKLSELENL---KSDL---EKELLELKRPARGA 75 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH---HHHH---HHHHHHhhcccccc Confidence 9999999999988888888888766555443322 2233333332222211111 1111 11111100000000 Q ss_pred hHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCC Q lcl|Aclame:pro 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 160 (497) +.......++......+.. ........+... ............-+.....+ ..+..+...............++ T Consensus 76 ~~~~~~e~~~a~~~~lr~~-~~~~~~~~e~~a-~~~~~~~~GG~~iP~~~~~~----ii~~~~~~~~l~~~~~~~~~~~~ 149 (401) T protein:vir:44 76 QNKVAAEHKDAFVGFLRKG-REDGLRDLERKA-LQVGTDEDGGYAVPEELDRS----ILSLLKDEVVMRQEATVITVGGS 149 (401) T ss_pred ccchhHHHHHHHHHHHhhh-hhhhhHHHHHHH-hhcCCCCCCceeccHhHHHH----HHHHHHhhhhhhhhceeeecCCC Confidence 0000000000000000000 000000000000 00000000000000000000 10100000000000000000011 Q ss_pred cccccchhhHHHHHHHhhhhHHhhccceecCCCceEEEEeecCCccceeeccccccccccccceeEEeeeeeeeeechhh Q lcl|Aclame:pro 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) Q Consensus 161 ~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS 240 (497) .+.+|. ..++....+.-+.. -+.+.........++..-.+...--...-.+. T Consensus 150 ~~~~~~----------------------~~~~~~a~wv~E~~------~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ 201 (401) T protein:vir:44 150 DYKKLV----------------------NLGGTASGWVGETD------TRSQTATSRLGLIEPFMGEIYGNPQATQKMLD 201 (401) T ss_pred ceEEEE----------------------ecCCccceeecccc------ccCccccccceeeeeehhheeeehhhhHHHHh Confidence 111111 01111111111100 01111100111111111111111000000111 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHH--------hhhhccCCCccccceec--cccc-cccchhhhhhhHHHHHHHHHh Q lcl|Aclame:pro 241 DEGLRDAPELFNFVQGRLLEGIQRKEE--------VQLLAGGGYPGVNGLLQ--RSTG-FTASSASSLFGATSATVSNVK 309 (497) Q Consensus 241 ~ell~ds~~l~~~i~~~la~~~~~~~d--------~a~l~G~g~~~~~Gil~--~~~~-~~~~~~~~~~~~~~~~~~~~~ 309 (497) +...+=-..+...|...++..+..++= .-|++..+.....+... .... .+.........+.......+. T Consensus 202 ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~ 281 (401) T protein:vir:44 202 DAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLR 281 (401) T ss_pred cchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcc Confidence 111111133455556666665555442 12333222111111100 0000 111111111222222333333 Q ss_pred hhhhcchhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhhhccCCceEEEehhHHHHHHHH Q lcl|Aclame:pro 310 FPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLT 389 (497) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~l~~l 389 (497) ......+.|+++...+..+...++..|.+...+......+...... .+++ ++.. . . T Consensus 282 ~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~------------------PVv~-~~~~----p-~ 337 (401) T protein:vir:44 282 KAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGY------------------GIAE-NEQM----P-D 337 (401) T ss_pred hhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecce------------------eeEE-ecCc----C-C Confidence 3334555688888888888888888777766554333222111100 0111 1000 0 0 Q ss_pred hcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEeccccEEEEeccchhhhhcCceEE Q lcl|Aclame:pro 390 KDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTV 469 (497) Q Consensus 390 kd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~~~~i~~~~~~~~~f~~~~v~~ 469 (497) .++.+..++-+++..... . ..-.|+-+..+++...+... |....|-+..+.. ... | T Consensus 338 ~~~~~~~i~~Gd~~~~~~--i---~~~~~~~~~~~~~~~~~~v~--------~~a~~r~d~~~~~----~~a-------~ 393 (401) T protein:vir:44 338 IAADAKAIAFGNFKRGYT--I---VDRIGTRILRDPYTNKPFVG--------FYTTKRTGGMLVD----SQA-------I 393 (401) T ss_pred ccCCccEEEEeehhccEE--E---EEecceEEeeeccccCCcEE--------EEEEEEeccEEec----ccc-------e Confidence 011111111011100000 0 00113333333333222211 1111122222210 001 1 Q ss_pred EEEeeeccEeecccc Q lcl|Aclame:pro 470 RAEERLGLLVYRPSA 484 (497) Q Consensus 470 r~~~r~~~~v~~~~A 484 (497) +... -+.| T Consensus 394 ~~l~-------~~aa 401 (401) T protein:vir:44 394 KLLK-------IAAA 401 (401) T ss_pred EEEE-------eecC Confidence 1100 0111 No 242 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=32.00 E-value=1.5 Score=19.69 Aligned_cols=301 Identities=13% Similarity=0.080 Sum_probs=103.8 Q ss_pred hhhHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCCcccccc-hhhHHHHHH--HhhhhHHhhccceecCCCceEEEEe Q lcl|Aclame:pro 124 AADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT-FLPGIVEQL--FYELSLADLISSRPVTSPNLSYLTE 200 (497) Q Consensus 124 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-~~~~ii~~~--~~~~~l~~~~~~~~~~~~~~~~p~~ 200 (497) ....+-.+. ..+ ...+ .+. .+..|+.+--+ +.+++..+. .+...+..-+...++.+.-..|-.+ T Consensus 1 ~~~~~~~~~--~~a------~~~a--l~~---a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~ 67 (470) T protein:vir:10 1 MPYEHLKHL--DEA------TLKA--LNA---AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVV 67 (470) T ss_pred CChhHhhhh--hHH------HHHH--HHH---hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhh Confidence 000000000 000 0000 001 11112222211 111111111 1122344555666666654455433 Q ss_pred ecC--CccceeeccccccccccccceeEEeeeeeeeeechhhHHHH---hhH-HHHHHHHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 201 SAA--HNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGL---RDA-PELFNFVQGRLLEGIQRKEEVQLLAGG 274 (497) Q Consensus 201 ~~~--~~~a~~v~Eg~~~~~s~~~~~~v~~~~~kia~~~~iS~ell---~ds-~~l~~~i~~~la~~~~~~~d~a~l~G~ 274 (497) ... ..+.+.+.|++-.+.++|++...+...|-++.-..+|.-.+ +.. .+++..+.++-.-.++.+++.+.++|+ T Consensus 68 ~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGD 147 (470) T protein:vir:10 68 TARHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGD 147 (470) T ss_pred ccccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhc Confidence 331 22334569999999999999999999999999989997643 333 478888888888899999999999996 Q ss_pred CC------c-----cccceeccccccccchhhhhhhHHHHHHHHHhhhhhcchhhhhhhhhhhhhhhhhhhccccccccc Q lcl|Aclame:pro 275 GY------P-----GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSG 343 (497) Q Consensus 275 g~------~-----~~~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 343 (497) .. + +..||.+... ........ ...+... T Consensus 148 s~l~s~~~g~~~gleFDGl~~lId---~~~~~NVi------------DarG~~L-------------------------- 186 (470) T protein:vir:10 148 NLLGDDVPGSPNNLQQDGIINIIK---RGAPQNVL------------DAGGRPL-------------------------- 186 (470) T ss_pred cccccccCcccCceeccchhhhcc---CCCCcccc------------ccCCCCc-------------------------- Confidence 41 1 1223222110 00000000 0000000 Q ss_pred ccccccchhhhhhhhHHhhhhhh-hhhccCCceEEEehhHHHHHHHHhcccCccccccccc-ccccccccccccccc-cc Q lcl|Aclame:pro 344 VAGSYPTAAEIAENVFDAFVDIQ-LTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG-NAYGNPVNGGKNIWG-VP 420 (497) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~-~~~~~~~~~~~~l~G-~p 420 (497) ....+..+..... ...|..++-.+|+..+.+.|..--...-|.+.++..+ ...|.++...-+..| +. T Consensus 187 ----------s~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~ 256 (470) T protein:vir:10 187 ----------SIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAGLLGADAQSYIGVRGEHS 256 (470) T ss_pred ----------cHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCceeeeeeccceeeeeeeee Confidence 0000111111110 1223334444555555544443222222222111100 001111111100000 00 Q ss_pred eeecCCCCcCcEEEeeccceEEEEEec-------cccEEEEeccch----------hhhhcC---ceEEEEEeeeccEee Q lcl|Aclame:pro 421 VVTTPLIPLGTILVGHFAPSVIQTARR-------EGVTMQMTNSNG----------TDFVDG---KVTVRAEERLGLLVY 480 (497) Q Consensus 421 vv~s~~~~~~~~~~gd~~~~~~~i~~r-------~~~~i~~~~~~~----------~~f~~~---~v~~r~~~r~~~~v~ 480 (497) .-.+..| .|+.+.-=.++++ ..++..++.... ..|.-- ...|.+--+.|=. T Consensus 257 L~~s~~m-------~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds-- 327 (470) T protein:vir:10 257 LYPSQFL-------GDFHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANFYGES-- 327 (470) T ss_pred ecccccc-------cchhhcCcccCCcccCCcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEecCCC-- Confidence 0000000 0000000000000 000111000000 001100 1222222222111 Q ss_pred cccce-EEEEecCC------------------------CCCC Q lcl|Aclame:pro 481 RPSAF-QLIQLKKG------------------------ATGS 497 (497) Q Consensus 481 ~~~Af-~~~~~~~~------------------------a~~~ 497 (497) .+.++ +..+..++ ++++ T Consensus 328 ~s~~v~vt~t~~~v~kgv~ltI~~~~~v~yv~IYRk~~~s~~ 369 (470) T protein:vir:10 328 AAKYIDVYIDSTEAGKGVRFQFHGLVNVKWLDVYRKDPGSQE 369 (470) T ss_pred CcceEEEEEeeehhcceeEEEEecCCCCcEEEEEeecCCCCc Confidence 12222 11111111 1111 No 243 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=31.57 E-value=1.5 Score=19.64 Aligned_cols=442 Identities=10% Similarity=-0.027 Sum_probs=95.7 Q ss_pred Cch--------HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPS--------TAQLEAQGRQLAKSIKDINADETKTAA--EKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAK 70 (497) Q Consensus 1 m~~--------~~~~~~~~~~l~~~~~~~~~~~~~~~~--e~~~~~~~~~~~~~~~~~~~e~~e~~~e~~~~~~~~~a~~ 70 (497) |.+ +.+|++++.++.++++.+.++...+.. ...+...+...+++.++++++..+. +.++++++.... T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~---~~~~~~~~~~~~ 77 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVED---LDEQIRELESEI 77 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH Confidence 443 233333333333333333222111100 0011111111122222222222111 111111111111 Q ss_pred HH---HHHHHHHHhHHHHHHHHHHHHhhhhhhHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 71 DG---LDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAA 147 (497) Q Consensus 71 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (497) .. ..............................................................+............ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (477) T protein:vir:84 78 ERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDL 157 (477) T ss_pred HHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccc Confidence 10 000000000000000000000000000111111100000000000000000000000000111111111111111 Q ss_pred hhhhhhcccccCCcccccchhhH-----HHHHHHhhhhHHhhcc--cee-cCCCc-eEEEEeecCCccceeecccccccc Q lcl|Aclame:pro 148 IGQNPFGSTGTFAPGILPTFLPG-----IVEQLFYELSLADLIS--SRP-VTSPN-LSYLTESAAHNNAAAVAEAGTYPF 218 (497) Q Consensus 148 ~~~~~~~~~~~~g~~i~~~~~~~-----ii~~~~~~~~l~~~~~--~~~-~~~~~-~~~p~~~~~~~~a~~v~Eg~~~~~ 218 (497) ......++......+++.+++.. ++..+...-++-.... .+| +.++. .-+....+......-..+.. ... T Consensus 158 ~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~-~~f 236 (477) T protein:vir:84 158 DRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVD-LTD 236 (477) T ss_pred cccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccc-cce Confidence 11111121111112222222221 2222222212222211 122 22222 22222211111111112221 112 Q ss_pred ccccceeEEeeeeeeeeechhhHHHHhhHHHHHHHHHHHHHHHHHHHHHh---------hhhccCCCccccceecccccc Q lcl|Aclame:pro 219 SSEEFARVYEQVGKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEV---------QLLAGGGYPGVNGLLQRSTGF 289 (497) Q Consensus 219 s~~~~~~v~~~~~kia~~~~iS~ell~ds~~l~~~i~~~la~~~~~~~d~---------a~l~G~g~~~~~Gil~~~~~~ 289 (497) ...++..-.+.....-..--+.+....--..+...|...++.++..++=. -|++..|.+.. ....... T Consensus 237 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~---~~~~~~~ 313 (477) T protein:vir:84 237 GFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQV---TATSAGS 313 (477) T ss_pred eeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccc---ccccccc Confidence 22333333332222111111111111111234445555555555444321 23333332211 1111111 Q ss_pred ccchhhhhhhHHHHHHHHHhhhhhc-chhhhhhhhhhhhhhhhhhhcccccccccccccccchhhhhhhhHHhhhhhhhh Q lcl|Aclame:pro 290 TASSASSLFGATSATVSNVKFPADG-TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT 368 (497) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) +.................+...... ...|+++...+..+...++..+.+...+......+....... +... .. T Consensus 314 t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~-~~~~-----~~ 387 (477) T protein:vir:84 314 ALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEV-ASQR-----VV 387 (477) T ss_pred chhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCccccccccccccc-cccc-----cc Confidence 1111111222222222222222222 234777777777788888887777665544322211110000 0000 00 Q ss_pred hccCCceEEEehhHHHHHHHHhcccCcccccccccccccccccccccccccceeecCCCCcCcEEEeeccceEEEEEecc Q lcl|Aclame:pro 369 LFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARRE 448 (497) Q Consensus 369 ~~~~~~~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~~~~~~~l~G~pvv~s~~~~~~~~~~gd~~~~~~~i~~r~ 448 (497) +....-.++.++.....+-...| ...++|... ... .. .-.|+.+..++..-++...+ .|.- |. T Consensus 388 ~~l~G~pVv~s~~~p~~~~~~~d-~~~i~~gd~-~~~---~i----~~~~~~~~~~~~~~~~~~~~-~~~v--~~----- 450 (477) T protein:vir:84 388 GQMHGLPVVTDPTLPTTLGTGTD-QDVIHVLRA-SDL---AL----FESSVRMRALQETRAENLSV-LLQV--YG----- 450 (477) T ss_pred chhcccceEecCcccccccccCC-cceEEEEEe-ceE---EE----EeeceeEEecccccccccee-eeee--hh----- Confidence 00000001111100000000000 000111100 000 00 00123333333221111000 0000 00 Q ss_pred ccEEEEeccchhhhhcCceEEE---EEeeeccEeecccceE Q lcl|Aclame:pro 449 GVTMQMTNSNGTDFVDGKVTVR---AEERLGLLVYRPSAFQ 486 (497) Q Consensus 449 ~~~i~~~~~~~~~f~~~~v~~r---~~~r~~~~v~~~~Af~ 486 (497) | .+..++| ++..+.+.=.-.--|+ T Consensus 451 -------------~-~~~~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 451 -------------Y-LAFTAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred -------------h-hhhhhhccccceEEeecccccccccC Confidence 0 0001111 0000111100001111 Done!