Query lcl|Aclame:protein:vir:107593|NCBI_annot:major capsid protein, HK97 family|genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Match_columns 392 No_of_seqs 119 out of 1013 Neff 9.9 Searched_HMMs 1612 Date Mon Dec 2 04:29:43 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_26 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_26_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:102873 Length: 392 100.0 1.9E-87 1.2E-90 496.0 41.8 392 1-392 1-392 (392) 2 protein:vir:105004 Length: 392 100.0 1.9E-87 1.2E-90 496.0 41.8 392 1-392 1-392 (392) 3 protein:vir:102082 Length: 392 100.0 1.9E-87 1.2E-90 496.0 41.8 392 1-392 1-392 (392) 4 protein:vir:107593 Length: 392 100.0 1.9E-87 1.2E-90 496.0 41.8 392 1-392 1-392 (392) 5 protein:vir:81160 Length: 371 100.0 1.5E-75 9.3E-79 430.8 39.2 368 1-384 1-371 (371) 6 protein:vir:1268 Length: 397 # 100.0 1.8E-74 1.1E-77 424.9 39.5 379 1-384 5-397 (397) 7 protein:vir:102119 Length: 404 100.0 6.9E-71 4.3E-74 405.2 39.7 384 1-391 1-404 (404) 8 protein:vir:1025 Length: 408 # 100.0 7.4E-71 4.6E-74 405.1 39.0 382 1-392 1-401 (408) 9 protein:vir:7409 Length: 408 # 100.0 1E-69 6.5E-73 398.8 39.6 382 1-392 1-403 (408) 10 protein:vir:3991 Length: 404 # 100.0 5.7E-69 3.6E-72 394.7 39.5 382 1-392 1-403 (404) 11 protein:vir:4953 Length: 397 # 100.0 6.4E-69 4E-72 394.4 39.2 376 1-392 1-393 (397) 12 protein:vir:4997 Length: 397 # 100.0 9.2E-68 5.7E-71 388.1 39.8 376 1-392 1-396 (397) 13 protein:vir:485 Length: 407 # 100.0 7E-67 4.4E-70 383.3 38.6 372 1-391 1-407 (407) 14 protein:vir:4830 Length: 397 # 100.0 1.4E-66 8.8E-70 381.6 40.1 377 1-392 1-393 (397) 15 protein:vir:3845 Length: 395 # 100.0 1.3E-66 8.4E-70 381.7 38.6 376 1-392 1-394 (395) 16 protein:vir:4700 Length: 415 # 100.0 6.1E-66 3.8E-69 378.1 39.4 378 1-392 7-409 (415) 17 protein:vir:4600 Length: 415 # 100.0 6.1E-66 3.8E-69 378.1 39.4 378 1-392 7-409 (415) 18 protein:vir:4456 Length: 401 # 100.0 6.6E-66 4.1E-69 377.9 39.0 365 1-384 1-401 (401) 19 protein:vir:79987 Length: 415 100.0 1E-65 6.2E-69 376.9 39.3 378 1-392 7-409 (415) 20 protein:vir:81100 Length: 415 100.0 1E-65 6.2E-69 376.9 39.3 378 1-392 7-409 (415) 21 protein:vir:98339 Length: 415 100.0 1E-65 6.2E-69 376.9 39.3 378 1-392 7-409 (415) 22 protein:vir:9410 Length: 415 # 100.0 1.4E-65 8.4E-69 376.2 39.6 378 1-392 7-409 (415) 23 protein:vir:9704 Length: 394 # 100.0 2.7E-65 1.6E-68 374.6 38.6 372 1-391 2-394 (394) 24 protein:vir:100172 Length: 394 100.0 1.1E-64 7E-68 371.2 37.2 367 1-392 1-391 (394) 25 protein:vir:100884 Length: 389 100.0 2.4E-64 1.5E-67 369.3 39.0 366 1-391 1-389 (389) 26 protein:vir:3870 Length: 400 # 100.0 2E-64 1.2E-67 369.8 37.9 372 1-385 10-400 (400) 27 protein:vir:100247 Length: 425 100.0 2.4E-64 1.5E-67 369.3 35.7 357 1-385 21-425 (425) 28 protein:vir:1084 Length: 437 # 100.0 1.6E-63 1E-66 364.8 35.6 383 1-392 1-435 (437) 29 protein:vir:1383 Length: 421 # 100.0 3.4E-63 2.1E-66 363.1 36.9 370 1-392 1-400 (421) 30 protein:vir:4511 Length: 409 # 100.0 1.3E-62 7.8E-66 359.9 39.1 371 1-387 1-409 (409) 31 protein:vir:6242 Length: 390 # 100.0 1.9E-62 1.2E-65 358.9 37.4 365 1-385 1-390 (390) 32 protein:vir:1328 Length: 392 # 100.0 1.3E-61 8.3E-65 354.3 38.1 364 1-385 1-392 (392) 33 protein:vir:962 Length: 397 # 100.0 9.2E-62 5.7E-65 355.2 36.4 369 1-384 1-397 (397) 34 protein:vir:1886 Length: 385 # 100.0 5.6E-61 3.5E-64 350.9 35.7 360 1-385 1-385 (385) 35 protein:vir:191 Length: 385 # 100.0 5.6E-61 3.5E-64 350.9 35.7 360 1-385 1-385 (385) 36 protein:vir:105038 Length: 428 100.0 1.5E-60 9E-64 348.6 37.0 371 1-383 1-428 (428) 37 protein:vir:4856 Length: 293 # 100.0 3.1E-61 1.9E-64 352.3 28.1 287 102-392 1-289 (293) 38 protein:vir:100135 Length: 418 100.0 1.2E-59 7.4E-63 343.6 36.3 369 1-387 20-418 (418) 39 protein:vir:1433 Length: 435 # 100.0 1.6E-59 1E-62 342.9 36.8 371 1-385 1-435 (435) 40 protein:vir:7855 Length: 497 # 100.0 1.1E-59 6.9E-63 343.8 35.3 373 1-388 7-497 (497) 41 protein:vir:101650 Length: 497 100.0 1.1E-59 6.9E-63 343.8 35.3 373 1-388 7-497 (497) 42 protein:vir:80376 Length: 435 100.0 2.9E-59 1.8E-62 341.5 37.0 372 1-385 1-435 (435) 43 protein:vir:81070 Length: 390 100.0 7.3E-59 4.5E-62 339.3 38.0 361 1-382 1-390 (390) 44 protein:vir:97053 Length: 390 100.0 1E-58 6.4E-62 338.5 37.7 361 1-382 1-390 (390) 45 protein:vir:10364 Length: 390 100.0 1.4E-58 9E-62 337.7 38.1 361 1-382 1-390 (390) 46 protein:vir:6212 Length: 434 # 100.0 2.2E-58 1.4E-61 336.7 37.2 376 1-389 1-434 (434) 47 protein:vir:4339 Length: 395 # 100.0 1.4E-58 8.7E-62 337.7 36.1 359 1-384 1-395 (395) 48 protein:vir:81227 Length: 413 100.0 5.1E-58 3.1E-61 334.7 38.4 375 1-390 1-413 (413) 49 protein:vir:101607 Length: 379 100.0 1.3E-57 8E-61 332.5 34.7 359 1-384 1-379 (379) 50 protein:vir:8102 Length: 543 # 100.0 3E-57 1.9E-60 330.4 36.3 370 1-385 140-543 (543) 51 protein:vir:2685 Length: 387 # 100.0 1.4E-56 8.9E-60 326.7 33.3 361 1-390 1-387 (387) 52 protein:vir:96978 Length: 387 100.0 1.4E-56 8.9E-60 326.7 33.3 361 1-390 1-387 (387) 53 protein:vir:94424 Length: 387 100.0 1.4E-56 8.9E-60 326.7 33.3 361 1-390 1-387 (387) 54 protein:vir:93881 Length: 387 100.0 2.8E-56 1.8E-59 325.1 34.1 361 1-390 1-387 (387) 55 protein:vir:9361 Length: 402 # 100.0 1.8E-56 1.1E-59 326.2 32.3 361 1-390 16-402 (402) 56 protein:vir:104256 Length: 458 100.0 9.1E-56 5.7E-59 322.3 35.2 367 1-384 12-458 (458) 57 protein:vir:95376 Length: 425 100.0 4.8E-55 3E-58 318.4 36.2 365 1-392 8-424 (425) 58 protein:vir:94673 Length: 419 100.0 1.6E-54 9.9E-58 315.5 38.6 373 1-386 1-419 (419) 59 protein:vir:98635 Length: 377 100.0 3.6E-56 2.3E-59 324.5 29.1 339 1-384 1-377 (377) 60 protein:vir:93616 Length: 645 100.0 2.2E-54 1.4E-57 314.7 35.5 374 1-392 195-645 (645) 61 protein:vir:8420 Length: 477 # 100.0 3.4E-54 2.1E-57 313.7 34.0 380 1-389 1-477 (477) 62 protein:vir:78640 Length: 352 100.0 1.7E-53 1.1E-56 309.9 31.6 338 1-390 1-352 (352) 63 protein:vir:80684 Length: 315 100.0 2.1E-54 1.3E-57 314.9 26.5 279 106-392 1-314 (315) 64 protein:vir:9643 Length: 377 # 100.0 2.2E-52 1.3E-55 303.8 33.0 331 1-384 1-377 (377) 65 protein:vir:4092 Length: 390 # 100.0 4.6E-52 2.8E-55 302.1 34.2 344 1-392 1-378 (390) 66 protein:vir:78350 Length: 383 100.0 1.4E-52 8.6E-56 304.9 29.3 350 1-392 1-383 (383) 67 protein:vir:9574 Length: 300 # 100.0 4.5E-53 2.8E-56 307.6 26.2 270 107-384 1-300 (300) 68 protein:vir:7771 Length: 330 # 100.0 6.5E-53 4E-56 306.7 26.7 284 98-392 1-329 (330) 69 protein:vir:95963 Length: 395 100.0 2.1E-51 1.3E-54 298.5 34.2 345 1-392 1-385 (395) 70 protein:vir:101291 Length: 381 100.0 7.9E-52 4.9E-55 300.7 30.9 334 1-392 1-376 (381) 71 protein:vir:9509 Length: 381 # 100.0 7.9E-52 4.9E-55 300.7 30.9 334 1-392 1-376 (381) 72 protein:vir:97148 Length: 324 100.0 3.9E-52 2.4E-55 302.4 27.5 296 70-392 1-323 (324) 73 protein:vir:8187 Length: 311 # 100.0 4.3E-52 2.7E-55 302.2 26.7 272 108-385 1-311 (311) 74 protein:vir:1638 Length: 298 # 100.0 4.1E-52 2.5E-55 302.3 26.5 266 110-383 1-298 (298) 75 protein:vir:9759 Length: 303 # 100.0 7.3E-52 4.5E-55 300.9 26.7 270 108-384 1-303 (303) 76 protein:vir:80128 Length: 466 100.0 1.1E-50 6.5E-54 294.6 32.7 370 1-392 8-456 (466) 77 protein:vir:41 Length: 299 # N 100.0 9.6E-52 6E-55 300.3 26.3 272 101-385 1-299 (299) 78 protein:vir:100632 Length: 381 100.0 7.9E-51 4.9E-54 295.3 30.8 335 1-392 1-379 (381) 79 protein:vir:105905 Length: 304 100.0 1.1E-51 6.5E-55 300.1 25.9 270 98-383 1-304 (304) 80 protein:vir:94142 Length: 304 100.0 1.1E-51 6.5E-55 300.1 25.9 270 98-383 1-304 (304) 81 protein:vir:94771 Length: 298 100.0 2.3E-51 1.4E-54 298.2 26.0 266 110-383 1-298 (298) 82 protein:vir:103955 Length: 324 100.0 3.1E-51 2E-54 297.5 26.6 296 70-392 1-323 (324) 83 protein:vir:99749 Length: 324 100.0 3.9E-51 2.4E-54 296.9 27.0 296 70-392 1-323 (324) 84 protein:vir:2430 Length: 318 # 100.0 2.3E-51 1.4E-54 298.2 25.4 286 84-389 1-318 (318) 85 protein:vir:78830 Length: 324 100.0 6.3E-51 3.9E-54 295.8 27.3 296 67-392 1-323 (324) 86 protein:vir:96392 Length: 324 100.0 6.3E-51 3.9E-54 295.8 27.3 296 67-392 1-323 (324) 87 protein:vir:4226 Length: 326 # 100.0 4.3E-51 2.6E-54 296.7 26.0 288 84-387 1-326 (326) 88 protein:vir:9309 Length: 324 # 100.0 9.4E-51 5.8E-54 294.9 27.9 295 70-392 1-322 (324) 89 protein:vir:5739 Length: 366 # 100.0 3.6E-51 2.2E-54 297.1 25.4 325 48-383 1-366 (366) 90 protein:vir:78223 Length: 333 100.0 7.4E-51 4.6E-54 295.4 26.8 283 97-385 1-333 (333) 91 protein:vir:78523 Length: 338 100.0 1E-50 6.3E-54 294.7 27.2 286 97-387 1-338 (338) 92 protein:vir:104085 Length: 320 100.0 4.4E-51 2.7E-54 296.7 25.0 283 84-387 1-320 (320) 93 protein:vir:95763 Length: 297 100.0 1.7E-50 1E-53 293.5 25.8 271 98-387 1-297 (297) 94 protein:vir:2504 Length: 305 # 100.0 1.7E-50 1E-53 293.5 25.3 270 106-391 1-305 (305) 95 protein:vir:2344 Length: 397 # 100.0 1.4E-50 9E-54 293.8 24.9 283 97-392 1-319 (397) 96 protein:vir:96223 Length: 324 100.0 5.1E-50 3.1E-53 290.8 26.9 295 70-392 1-322 (324) 97 protein:vir:99920 Length: 311 100.0 1.5E-49 9E-53 288.3 25.3 271 107-384 1-311 (311) 98 protein:vir:96762 Length: 632 100.0 1.3E-48 7.9E-52 283.2 28.6 356 1-383 216-632 (632) 99 protein:vir:97397 Length: 517 100.0 2E-38 1.2E-41 227.3 26.6 360 1-392 124-517 (517) 100 protein:vir:4159 Length: 315 # 100.0 4.2E-37 2.6E-40 220.0 21.7 282 78-383 1-315 (315) 101 protein:vir:4197 Length: 314 # 100.0 1.3E-36 8.3E-40 217.3 23.1 281 90-387 1-314 (314) 102 protein:vir:4074 Length: 480 # 100.0 2.7E-36 1.7E-39 215.6 23.2 351 1-387 111-480 (480) 103 protein:vir:3158 Length: 321 # 100.0 4.2E-31 2.6E-34 187.1 24.2 287 89-391 1-321 (321) 104 protein:vir:9820 Length: 272 # 99.9 4.9E-27 3E-30 164.8 23.0 265 106-387 1-272 (272) 105 protein:vir:3033 Length: 272 # 99.9 4.9E-27 3E-30 164.8 23.0 265 106-387 1-272 (272) 106 protein:vir:3613 Length: 272 # 99.7 1.5E-18 9.1E-22 118.3 19.7 266 106-384 1-272 (272) 107 protein:vir:94933 Length: 330 99.7 5.3E-18 3.3E-21 115.3 19.9 296 80-384 1-330 (330) 108 protein:vir:93742 Length: 274 99.7 2.9E-17 1.8E-20 111.2 21.2 264 106-389 1-274 (274) 109 protein:vir:93858 Length: 400 99.7 6.4E-17 4E-20 109.4 22.3 354 1-382 8-400 (400) 110 protein:vir:79928 Length: 393 99.6 2.1E-16 1.3E-19 106.5 22.0 349 1-392 1-387 (393) 111 protein:vir:96123 Length: 274 99.6 1.1E-16 6.9E-20 108.0 20.4 264 106-388 1-274 (274) 112 protein:vir:80930 Length: 278 99.6 1.4E-16 8.6E-20 107.5 20.5 264 106-385 1-278 (278) 113 protein:vir:105334 Length: 276 99.6 3E-16 1.9E-19 105.7 20.4 267 106-392 1-275 (276) 114 protein:vir:96833 Length: 275 99.6 2.9E-16 1.8E-19 105.8 20.1 263 104-388 1-275 (275) 115 protein:vir:97433 Length: 274 99.6 2.3E-15 1.4E-18 100.9 21.5 262 106-389 1-274 (274) 116 protein:vir:94494 Length: 274 99.6 2.3E-15 1.4E-18 100.9 21.5 262 106-389 1-274 (274) 117 protein:vir:95107 Length: 270 99.5 4.3E-15 2.7E-18 99.3 20.1 265 106-389 1-270 (270) 118 protein:vir:1239 Length: 274 # 99.5 3E-14 1.9E-17 94.7 20.3 262 106-389 1-274 (274) 119 protein:vir:96262 Length: 274 99.4 6.3E-14 3.9E-17 93.0 20.8 262 106-389 1-274 (274) 120 protein:vir:95898 Length: 274 99.4 6.3E-14 3.9E-17 93.0 20.8 262 106-389 1-274 (274) 121 protein:vir:97255 Length: 310 99.3 1.6E-12 9.7E-16 85.3 22.0 272 106-383 1-310 (310) 122 protein:vir:739 Length: 231 # 99.2 4.7E-13 2.9E-16 88.2 15.0 227 140-384 1-231 (231) 123 protein:vir:8324 Length: 410 # 99.2 2.7E-12 1.7E-15 84.0 18.4 354 1-382 8-410 (410) 124 protein:vir:99424 Length: 360 99.2 2.1E-11 1.3E-14 79.1 21.8 288 84-392 1-360 (360) 125 protein:vir:108211 Length: 318 99.0 1.1E-10 6.6E-14 75.3 16.1 271 102-390 1-318 (318) 126 protein:vir:7990 Length: 273 # 98.8 9.9E-10 6.1E-13 70.0 17.4 257 106-384 1-273 (273) 127 protein:vir:105822 Length: 273 98.8 1.6E-09 9.9E-13 68.8 17.9 257 106-384 1-273 (273) 128 protein:vir:102605 Length: 273 98.8 1.6E-09 9.9E-13 68.8 17.9 257 106-384 1-273 (273) 129 protein:vir:94576 Length: 347 98.7 2E-09 1.2E-12 68.3 15.0 280 84-384 1-347 (347) 130 protein:vir:8885 Length: 347 # 98.7 2.5E-09 1.6E-12 67.7 15.4 282 95-385 1-347 (347) 131 protein:vir:2201 Length: 345 # 98.7 6.5E-09 4E-12 65.5 16.6 279 95-384 1-345 (345) 132 protein:vir:80213 Length: 334 98.6 6.1E-09 3.8E-12 65.6 16.3 280 98-386 1-334 (334) 133 protein:vir:94622 Length: 341 98.5 1.6E-08 1E-11 63.3 15.1 284 98-388 1-341 (341) 134 protein:vir:10450 Length: 344 98.5 3.5E-08 2.1E-11 61.5 16.0 285 80-384 1-344 (344) 135 protein:vir:6324 Length: 335 # 98.5 6.3E-08 3.9E-11 60.0 17.2 285 84-391 1-335 (335) 136 protein:vir:1583 Length: 351 # 98.5 6.1E-08 3.8E-11 60.1 17.1 269 106-392 1-299 (351) 137 protein:vir:78935 Length: 335 98.4 7.1E-08 4.4E-11 59.8 16.9 285 84-391 1-335 (335) 138 protein:vir:94711 Length: 347 98.4 3.5E-08 2.2E-11 61.5 15.0 281 96-385 1-347 (347) 139 protein:vir:3364 Length: 347 # 98.4 1.1E-07 6.8E-11 58.7 17.4 281 84-386 1-347 (347) 140 protein:vir:1541 Length: 347 # 98.4 2E-07 1.2E-10 57.4 18.5 282 80-386 1-347 (347) 141 protein:vir:100057 Length: 375 98.4 2.6E-07 1.6E-10 56.7 18.6 284 96-389 1-375 (375) 142 protein:vir:95318 Length: 328 98.3 9.7E-08 6E-11 59.0 15.2 215 101-325 1-328 (328) 143 protein:vir:5974 Length: 324 # 98.3 5.7E-07 3.6E-10 54.8 19.0 266 106-392 1-295 (324) 144 protein:vir:78739 Length: 332 98.2 2.8E-07 1.7E-10 56.5 15.4 280 81-382 1-332 (332) 145 protein:vir:80180 Length: 381 98.2 5E-07 3.1E-10 55.1 16.7 282 80-392 1-315 (381) 146 protein:vir:99675 Length: 324 98.2 5.3E-07 3.3E-10 55.0 16.3 241 139-392 1-305 (324) 147 protein:vir:107687 Length: 319 98.2 9.4E-07 5.8E-10 53.6 17.1 283 71-381 1-319 (319) 148 protein:vir:103323 Length: 364 98.1 4E-06 2.5E-09 50.2 21.6 280 100-392 1-344 (364) 149 protein:vir:102944 Length: 330 98.1 1.2E-06 7.3E-10 53.1 17.0 267 106-392 1-301 (330) 150 protein:vir:103285 Length: 296 98.1 6E-07 3.8E-10 54.7 14.5 265 106-384 1-296 (296) 151 protein:vir:3136 Length: 322 # 98.0 1.6E-06 9.7E-10 52.4 15.9 271 106-389 1-322 (322) 152 protein:vir:9875 Length: 296 # 98.0 3.1E-07 1.9E-10 56.3 11.6 263 100-385 1-296 (296) 153 protein:vir:102655 Length: 322 98.0 4.9E-06 3E-09 49.7 17.6 280 95-385 1-322 (322) 154 protein:vir:106647 Length: 303 97.9 6.9E-07 4.3E-10 54.4 12.1 266 100-391 1-303 (303) 155 protein:vir:103759 Length: 330 97.8 2.1E-06 1.3E-09 51.7 13.6 215 101-325 1-330 (330) 156 protein:vir:97031 Length: 402 97.8 3.6E-06 2.2E-09 50.4 14.6 284 100-392 1-342 (402) 157 protein:vir:80068 Length: 301 97.8 6.4E-06 4E-09 49.0 15.9 259 108-381 1-301 (301) 158 protein:vir:107826 Length: 331 97.8 1.3E-05 8.2E-09 47.3 17.1 216 101-325 1-331 (331) 159 protein:vir:107388 Length: 331 97.8 1.3E-05 8.2E-09 47.3 17.1 216 101-325 1-331 (331) 160 protein:vir:98525 Length: 331 97.8 1.3E-05 8.2E-09 47.3 17.1 216 101-325 1-331 (331) 161 protein:vir:105645 Length: 400 97.8 4.7E-06 2.9E-09 49.8 14.5 287 100-392 1-341 (400) 162 protein:vir:7324 Length: 335 # 97.7 4.4E-06 2.7E-09 49.9 13.8 216 101-327 1-335 (335) 163 protein:vir:104342 Length: 314 97.7 6.9E-06 4.3E-09 48.9 14.7 283 68-384 1-314 (314) 164 protein:vir:9927 Length: 295 # 97.7 2.8E-06 1.7E-09 51.1 12.5 262 106-392 1-294 (295) 165 protein:vir:8843 Length: 317 # 97.4 4.3E-05 2.6E-08 44.5 15.7 265 104-386 1-317 (317) 166 protein:vir:7019 Length: 401 # 97.4 2.2E-05 1.4E-08 46.1 14.0 286 100-392 1-345 (401) 167 protein:vir:79642 Length: 329 97.4 5.1E-05 3.2E-08 44.1 15.7 287 84-384 1-329 (329) 168 protein:vir:93966 Length: 400 97.2 0.00013 7.9E-08 42.0 17.1 352 1-382 8-400 (400) 169 protein:vir:1663 Length: 393 # 97.0 0.00016 9.8E-08 41.4 14.9 349 1-382 1-393 (393) 170 protein:vir:1829 Length: 355 # 96.9 0.00028 1.7E-07 40.1 17.3 289 89-392 1-353 (355) 171 protein:vir:98566 Length: 355 96.3 0.00081 5E-07 37.5 17.5 287 89-392 1-354 (355) 172 protein:vir:95131 Length: 325 96.2 0.00093 5.8E-07 37.2 18.7 268 87-392 1-298 (325) 173 protein:vir:5694 Length: 357 # 96.1 0.00098 6.1E-07 37.1 16.7 289 89-392 1-351 (357) 174 protein:vir:78777 Length: 358 96.0 0.0011 6.9E-07 36.8 17.2 285 85-392 1-349 (358) 175 protein:vir:97331 Length: 319 96.0 0.0012 7.3E-07 36.7 21.8 286 80-392 1-302 (319) 176 protein:vir:94800 Length: 319 96.0 0.0012 7.3E-07 36.7 21.8 286 80-392 1-302 (319) 177 protein:vir:99075 Length: 392 95.7 0.0016 9.9E-07 35.9 18.0 274 106-392 1-323 (392) 178 protein:vir:2016 Length: 357 # 95.7 0.0016 1E-06 35.8 16.7 289 89-392 1-351 (357) 179 protein:vir:79548 Length: 652 95.6 0.0018 1.1E-06 35.6 22.7 357 1-381 188-652 (652) 180 protein:vir:5255 Length: 304 # 95.6 0.0016 9.6E-07 36.0 12.7 263 111-381 1-304 (304) 181 protein:vir:6061 Length: 357 # 95.4 0.0021 1.3E-06 35.3 16.8 289 89-392 1-356 (357) 182 protein:vir:1153 Length: 338 # 94.8 0.0034 2.1E-06 34.1 17.0 282 89-385 1-338 (338) 183 protein:vir:79171 Length: 337 94.6 0.0039 2.4E-06 33.8 17.4 283 89-386 1-337 (337) 184 protein:vir:79157 Length: 339 94.6 0.0039 2.4E-06 33.8 16.8 284 89-387 1-339 (339) 185 protein:vir:78186 Length: 337 94.5 0.0043 2.7E-06 33.6 16.7 283 89-386 1-337 (337) 186 protein:vir:104011 Length: 337 94.5 0.0044 2.7E-06 33.5 17.6 283 89-386 1-337 (337) 187 protein:vir:95512 Length: 693 93.9 0.0061 3.8E-06 32.7 22.7 360 1-385 260-693 (693) 188 protein:vir:270 Length: 341 # 91.0 0.018 1.1E-05 30.1 15.1 281 85-392 1-340 (341) 189 protein:vir:100331 Length: 342 90.9 0.018 1.1E-05 30.1 16.6 284 84-391 1-342 (342) 190 protein:vir:107120 Length: 329 90.6 0.02 1.2E-05 29.9 21.7 297 62-392 1-313 (329) 191 protein:vir:98856 Length: 343 90.4 0.021 1.3E-05 29.8 17.3 285 89-392 1-340 (343) 192 protein:vir:95603 Length: 463 90.2 0.022 1.4E-05 29.7 11.2 274 70-392 1-342 (463) 193 protein:vir:99311 Length: 463 90.2 0.022 1.4E-05 29.7 11.2 274 70-392 1-342 (463) 194 protein:vir:108303 Length: 418 90.0 0.023 1.4E-05 29.6 17.8 266 106-392 1-325 (418) 195 protein:vir:80446 Length: 367 89.9 0.024 1.5E-05 29.5 17.3 267 100-392 1-330 (367) 196 protein:vir:861 Length: 318 # 88.2 0.034 2.1E-05 28.6 13.4 288 70-382 1-318 (318) 197 protein:vir:78387 Length: 349 87.1 0.041 2.6E-05 28.2 20.0 270 106-392 1-320 (349) 198 protein:vir:96792 Length: 315 83.7 0.066 4.1E-05 27.1 16.0 260 106-392 1-285 (315) 199 protein:vir:94870 Length: 318 83.6 0.067 4.2E-05 27.0 13.0 285 70-382 1-318 (318) 200 protein:vir:3643 Length: 336 # 83.4 0.069 4.3E-05 27.0 11.4 286 70-381 1-336 (336) 201 protein:vir:95451 Length: 313 79.1 0.11 6.7E-05 25.9 14.2 268 107-386 1-313 (313) 202 protein:vir:101557 Length: 336 78.8 0.11 6.9E-05 25.8 11.1 286 70-381 1-336 (336) 203 protein:vir:94989 Length: 349 76.0 0.14 8.7E-05 25.3 19.5 270 106-392 1-320 (349) 204 protein:vir:78558 Length: 336 70.0 0.21 0.00013 24.2 12.5 288 70-381 1-336 (336) 205 protein:vir:1781 Length: 221 # 67.3 0.25 0.00016 23.8 12.9 175 190-392 1-209 (221) 206 protein:vir:96666 Length: 462 66.6 0.27 0.00016 23.7 13.9 297 59-392 1-359 (462) 207 protein:vir:95875 Length: 401 62.6 0.33 0.00021 23.2 15.4 285 100-387 1-401 (401) 208 protein:vir:80835 Length: 464 61.1 0.36 0.00022 23.0 7.8 287 67-392 1-299 (464) 209 protein:vir:3783 Length: 336 # 53.4 0.53 0.00033 22.1 16.1 281 88-387 1-336 (336) 210 protein:vir:102823 Length: 470 46.9 0.72 0.00045 21.4 8.5 280 68-392 1-302 (470) 211 protein:vir:98480 Length: 348 46.5 0.73 0.00045 21.3 16.7 268 107-383 1-348 (348) 212 protein:vir:3746 Length: 336 # 45.8 0.76 0.00047 21.2 16.1 279 88-387 1-336 (336) 213 protein:vir:80491 Length: 467 45.8 0.76 0.00047 21.2 8.9 297 68-392 1-358 (467) 214 protein:vir:105374 Length: 423 43.0 0.86 0.00054 20.9 18.3 274 106-392 1-335 (423) 215 protein:vir:94070 Length: 339 42.2 0.89 0.00055 20.8 13.7 293 45-381 1-339 (339) 216 protein:vir:106734 Length: 336 38.6 1.1 0.00066 20.4 11.6 288 70-381 1-336 (336) 217 protein:vir:103886 Length: 302 36.2 1.2 0.00074 20.2 16.7 257 107-388 1-302 (302) 218 protein:vir:2736 Length: 348 # 33.3 1.4 0.00085 19.8 19.0 272 106-385 1-348 (348) 219 protein:vir:63741 Length: 468 32.9 1.4 0.00087 19.8 11.3 306 59-392 1-359 (468) 220 protein:vir:3525 Length: 423 # 32.0 1.5 0.0009 19.7 18.0 273 106-392 1-322 (423) 221 protein:vir:93696 Length: 364 27.6 1.8 0.0011 19.1 14.8 271 106-389 1-364 (364) 222 protein:vir:174 Length: 423 # 26.3 2 0.0012 19.0 17.9 273 106-392 1-335 (423) 223 protein:vir:96079 Length: 382 25.5 2 0.0013 18.9 8.3 322 13-381 1-382 (382) 224 protein:vir:79008 Length: 299 22.9 2.4 0.0015 18.5 20.6 268 106-386 1-299 (299) 225 protein:vir:100851 Length: 514 22.0 2.5 0.0016 18.4 10.7 296 70-392 1-333 (514) No 1 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1.9e-87 Score=495.99 Aligned_cols=392 Identities=100% Similarity=1.357 Sum_probs=374.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |+|+|+|++++++++.+|+++++++++.++++++.+|+++|+++|++.+++.+.+.+..........+......++++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH Confidence 99999999999999999999999999999999999999999999999999998888888888888888889999999999 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCC Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~ 160 (392) .+++++.......+.......+.+++..+++.+||++||+++.+.|++.+++.++|++++++.++++.++.++++...++ T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 99999999888888888888888888888888999999999999999999999999999999999999999999998988 Q ss_pred ccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Q lcl|Aclame:pro 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) Q Consensus 161 ~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~ 240 (392) +.++|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|.+||.++|++++++++|.++++|.++.++.+. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~ 240 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) T ss_pred ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc Confidence 99999999999998777999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCC Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~ 320 (392) .+++++++++...++..|+.+++|||||++|.+|+++||++|+|||++++..+.+.+++|.|||+++++..+++.+..++ T Consensus 241 ~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~ 320 (392) T protein:vir:10 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) T ss_pred cCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCC Confidence 99999999998899999999999999999999999999999999999999999999999999999888988888888889 Q ss_pred cceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+++||||+++|.+++|.+++++++++.+.+|++|++.||+++|+|+++.+|+||+++++++++|+++||| T Consensus 321 ~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988899999999999999999999999999999999999999999 No 2 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1.9e-87 Score=495.99 Aligned_cols=392 Identities=100% Similarity=1.357 Sum_probs=374.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |+|+|+|++++++++.+|+++++++++.++++++.+|+++|+++|++.+++.+.+.+..........+......++++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH Confidence 99999999999999999999999999999999999999999999999999998888888888888888889999999999 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCC Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~ 160 (392) .+++++.......+.......+.+++..+++.+||++||+++.+.|++.+++.++|++++++.++++.++.++++...++ T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 99999999888888888888888888888888999999999999999999999999999999999999999999998988 Q ss_pred ccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Q lcl|Aclame:pro 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) Q Consensus 161 ~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~ 240 (392) +.++|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|.+||.++|++++++++|.++++|.++.++.+. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~ 240 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) T ss_pred ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc Confidence 99999999999998777999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCC Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~ 320 (392) .+++++++++...++..|+.+++|||||++|.+|+++||++|+|||++++..+.+.+++|.|||+++++..+++.+..++ T Consensus 241 ~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~ 320 (392) T protein:vir:10 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) T ss_pred cCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCC Confidence 99999999998899999999999999999999999999999999999999999999999999999888988888888889 Q ss_pred cceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+++||||+++|.+++|.+++++++++.+.+|++|++.||+++|+|+++.+|+||+++++++++|+++||| T Consensus 321 ~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988899999999999999999999999999999999999999999 No 3 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1.9e-87 Score=495.99 Aligned_cols=392 Identities=100% Similarity=1.357 Sum_probs=374.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |+|+|+|++++++++.+|+++++++++.++++++.+|+++|+++|++.+++.+.+.+..........+......++++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH Confidence 99999999999999999999999999999999999999999999999999998888888888888888889999999999 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCC Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~ 160 (392) .+++++.......+.......+.+++..+++.+||++||+++.+.|++.+++.++|++++++.++++.++.++++...++ T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 99999999888888888888888888888888999999999999999999999999999999999999999999998988 Q ss_pred ccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Q lcl|Aclame:pro 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) Q Consensus 161 ~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~ 240 (392) +.++|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|.+||.++|++++++++|.++++|.++.++.+. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~ 240 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) T ss_pred ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc Confidence 99999999999998777999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCC Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~ 320 (392) .+++++++++...++..|+.+++|||||++|.+|+++||++|+|||++++..+.+.+++|.|||+++++..+++.+..++ T Consensus 241 ~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~ 320 (392) T protein:vir:10 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) T ss_pred cCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCC Confidence 99999999998899999999999999999999999999999999999999999999999999999888988888888889 Q ss_pred cceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+++||||+++|.+++|.+++++++++.+.+|++|++.||+++|+|+++.+|+||+++++++++|+++||| T Consensus 321 ~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988899999999999999999999999999999999999999999 No 4 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1.9e-87 Score=495.99 Aligned_cols=392 Identities=100% Similarity=1.357 Sum_probs=374.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |+|+|+|++++++++.+|+++++++++.++++++.+|+++|+++|++.+++.+.+.+..........+......++++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHH Confidence 99999999999999999999999999999999999999999999999999998888888888888888889999999999 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCC Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~ 160 (392) .+++++.......+.......+.+++..+++.+||++||+++.+.|++.+++.++|++++++.++++.++.++++...++ T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:10 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) T ss_pred HHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 99999999888888888888888888888888999999999999999999999999999999999999999999998988 Q ss_pred ccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Q lcl|Aclame:pro 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) Q Consensus 161 ~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~ 240 (392) +.++|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|.+||.++|++++++++|.++++|.++.++.+. T Consensus 161 ~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~ 240 (392) T protein:vir:10 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) T ss_pred ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCc Confidence 99999999999998777999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCC Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~ 320 (392) .+++++++++...++..|+.+++|||||++|.+|+++||++|+|||++++..+.+.+++|.|||+++++..+++.+..++ T Consensus 241 ~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~ 320 (392) T protein:vir:10 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) T ss_pred cCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCC Confidence 99999999998899999999999999999999999999999999999999999999999999999888988888888889 Q ss_pred cceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+++||||+++|.+++|.+++++++++.+.+|++|++.||+++|+|+++.+|+||+++++++++|+++||| T Consensus 321 ~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988899999999999999999999999999999999999999999 No 5 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1.5e-75 Score=430.79 Aligned_cols=368 Identities=43% Similarity=0.683 Sum_probs=327.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |+|+|++++++++++.+|+++++++++.++++++.+|++.++++|+.++...+...+...... ..........+++++| T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 79 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKE-PLKPTVQVKENEVEAF 79 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-ccccchhhHHHHHHHH Confidence 999999999999999999999999999999999999999999999988877766555433322 2222333445666777 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCC Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~ 160 (392) .++++++ +.+++..++..+||++||+++.+.|++.+++.++|+++++++++++.++.+++++..+. T Consensus 80 ~~~l~~~--------------~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~ 145 (371) T protein:vir:81 80 VNHIRTR--------------FRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQ 145 (371) T ss_pred HHHHHHH--------------HHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC Confidence 7776653 23556677888899999999999999999999999999999999998899999998888 Q ss_pred ccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Q lcl|Aclame:pro 161 IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAI 240 (392) Q Consensus 161 ~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~ 240 (392) +.++|++|++++++++.++|++|++++++++++++||+|+++|+.++|++||.++|++++++++|.++++|+++.++.+. T Consensus 146 ~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~ 225 (371) T protein:vir:81 146 TGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAI 225 (371) T ss_pred cceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 89999999999988778999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccc---ccc Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS---KGT 317 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~---~~~ 317 (392) ..++++..++...+++.++.+++|+|||++|.+|+++||++|+|||+|++..+.+.+++| +||+++++...+. .+. T Consensus 226 ~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~ 304 (371) T protein:vir:81 226 ADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLG-LPVVIVSNKVLANRVDGGT 304 (371) T ss_pred ccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecc-eeEEEecccccCccccccc Confidence 999999999888899999999999999999999999999999999999999999999998 4677766544332 245 Q ss_pred cCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 318 ~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) .++...++||||+++|.+++|.+++++++++..++|++|++.||++.|+|+++.+|+||++++++++ T Consensus 305 ~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 305 GAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred cCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 5678889999999999999999999999999888999999999999999999999999999999988 No 6 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.8e-74 Score=424.94 Aligned_cols=379 Identities=49% Similarity=0.817 Sum_probs=330.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------------cccccc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERN--------------NGREVE 66 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~--------------~~~~~~ 66 (392) |+|+|++++++++++.+++++++++++.++++++.+|++.++++++.+.+..+...+... ...... T Consensus 5 m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (397) T protein:vir:12 5 MSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQG 84 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccccc Confidence 889999999999999999999999999999999999999999999876654443322211 011112 Q ss_pred ccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 67 TRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ........++++++.+++++.....+.+... ...+.+++..++..+||++||+++.+.|++.+++.++|++++++.+++ T Consensus 85 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 163 (397) T protein:vir:12 85 QGNEERQQQYSKAFLKGLRGKRLTDEERDLL-DSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVT 163 (397) T ss_pred chhhHHHHHHHHHHHHHHhccCCcHHHHHHH-hhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeecc Confidence 2223344567888888888887766665443 344667788888889999999999999999999999999999999999 Q ss_pred CCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 147 TRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNV 226 (392) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~ 226 (392) +.++.+++++..+.+.++|++|+++.++++.++|++|++++++++++++||+|+++|+.++|++||.++|++++++++|. T Consensus 164 ~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~ 243 (397) T protein:vir:12 164 TRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNN 243 (397) T ss_pred CCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHH Confidence 98999999999999999999999999987789999999999999999999999999999999999999999999999999 Q ss_pred HHhhccccccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEE Q lcl|Aclame:pro 227 LILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVV 306 (392) Q Consensus 227 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~ 306 (392) ++++|.|+..+.+..++++++++++..++++++.+++|+|||++|.+|+++||++|+|||+|++.++.+.++||. ||++ T Consensus 244 ~il~G~g~~~~~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~-pv~~ 322 (397) T protein:vir:12 244 LILAAIASLKKVDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGR-PVVP 322 (397) T ss_pred HHHhccccccccccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccce-eeEE Confidence 999999999999999999999999888999999999999999999999999999999999999999999999995 6766 Q ss_pred ecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 307 VSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) +++..+ +...+...++||||+++|.+++|++++++++++.+..|++|++.||+++|+|+++.+|+||+++++++- T Consensus 323 ~~~~~~---~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 323 FTNRVL---KTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred eccccc---ccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 654332 234567789999999999999999999999999888999999999999999999999999999998866 No 7 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=6.9e-71 Score=405.25 Aligned_cols=384 Identities=34% Similarity=0.492 Sum_probs=316.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-cchhhHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDK--VAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN-VDGEMEYR 77 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~--~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 77 (392) |+|+|++++++++++.+++++++++.+ .++++++.+|+++|++++++.++..+................ ........ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNG 80 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHH Confidence 999999999999999999999987644 367899999999999999987777666555433222222111 11222222 Q ss_pred HHHHHHHhcchhhHHH-HHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEe Q lcl|Aclame:pro 78 DVFMKALRNKPLNAEE-REFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK 156 (392) Q Consensus 78 ~a~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 156 (392) ..+.+..+........ +.......+.+++..++.++||++||+++.+.|++.+++.++|++++++.++++.++.+.+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~ 160 (404) T protein:vir:10 81 ALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEK 160 (404) T ss_pred HHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEE Confidence 3333333333332222 222234556778888888899999999999999999999999999999999999999999999 Q ss_pred ecCCccccccccccccccc-cccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 157 NSDMIPFAEITEMGEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL 235 (392) Q Consensus 157 ~~~~~~~~~~~E~~~~~~~-~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~ 235 (392) ..+.+.++|++|++..+++ ..++|++|++++++++++++||+|+|+|+.++|.+||.++|+++++.++|.+++.|.|+. T Consensus 161 ~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~ 240 (404) T protein:vir:10 161 RSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGD 240 (404) T ss_pred ecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 9999999999999988865 358999999999999999999999999999999999999999999999999999998864 Q ss_pred cc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceec Q lcl|Aclame:pro 236 TK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAG 300 (392) Q Consensus 236 ~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g 300 (392) .+ .+...++++..++...+++.+.++++|+|||++|.+|+++||++|+|+|.|++.++.+.++|| T Consensus 241 ~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 320 (404) T protein:vir:10 241 EHATGIMTANKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLG 320 (404) T ss_pred CcccceeeccccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccc Confidence 32 234568888888888899999999999999999999999999999999999999999999999 Q ss_pred ccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEE Q lcl|Aclame:pro 301 TNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE 380 (392) Q Consensus 301 ~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~ 380 (392) . ||+++++..++ ...+..+++||||+++|.+++|+++++.++++.+..|++|++.||+++|+|+++.+|+||++++ T Consensus 321 ~-PV~~~~~~~~~---~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 396 (404) T protein:vir:10 321 L-PVIELPNDLLL---STESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAE 396 (404) T ss_pred e-eeEEecccccC---CCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE Confidence 5 66655543332 2345678999999999999999999999999888889999999999999999999999999999 Q ss_pred ecccCCCCCCC Q lcl|Aclame:pro 381 IDLSAPVEQPQ 391 (392) Q Consensus 381 ~~~~a~~~~~~ 391 (392) +++++.. + T Consensus 397 ~~~aa~~---~ 404 (404) T protein:vir:10 397 IPVESVQ---A 404 (404) T ss_pred eecccCC---C Confidence 9977653 3 No 8 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=7.4e-71 Score=405.07 Aligned_cols=382 Identities=34% Similarity=0.511 Sum_probs=304.5 Q ss_pred CC--HHHHHHHHHHHHHHHHHHHHhhh-------h--hHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhcccc---- Q lcl|Aclame:pro 1 MS--KELRELLAKLEGKKEEVRSLMGE-------D--KVAEAEQMMEEVRSLQKKIDLQRSLDEAE-TEERNNGRE---- 64 (392) Q Consensus 1 M~--kel~el~~~~~~~~~e~~~~~~~-------~--~~~~~~~~~~ei~~l~~~i~~~~~~~~~~-~~~~~~~~~---- 64 (392) |. +.|+||++++.++.++++++.++ + ..++++.+.+++..+.++++.++...+.. .+....... T Consensus 1 m~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) T protein:vir:10 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 33 23667777776666666555432 1 23455566666666666665544332222 222111111 Q ss_pred -ccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhccee Q lcl|Aclame:pro 65 -VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) Q Consensus 65 -~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~ 143 (392) ..........++.++|..++++.... ....+.+++..++..+||++||+++++.|++.+++.++|+++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 153 (408) T protein:vir:10 81 PLNKSENELKDKFVKDFVNMVRNPMAF-------MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) T ss_pred ccccchhhhHHHHHHHHHHHhhcchhh-------hhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhccee Confidence 11122223345556666666654321 2234567788888889999999999999999999999999999999 Q ss_pred eccCCcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 PVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) Q Consensus 144 ~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~ 222 (392) ++++..+.++++...+. ..+.|++|++++++++.++|++|++++++++++++||+|+|+|+.++|.+||.++|+++++. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 233 (408) T protein:vir:10 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) T ss_pred eccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 99999999988876544 66889999999998778999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhcccccccc-chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecc Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~ 301 (392) +++.+|+.|+|++++. +..++++++++++..++++|+.+++|+|||++|.+|+++||++|+|||++++.++.+.++|| T Consensus 234 ~~~~~il~g~g~~~~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G- 312 (408) T protein:vir:10 234 TRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKG- 312 (408) T ss_pred HHHHHHhhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecc- Confidence 9999999999988764 66789999999888899999999999999999999999999999999999999999999998 Q ss_pred cceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEe Q lcl|Aclame:pro 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) Q Consensus 302 ~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~ 381 (392) +||+++++..+|+. .++...++||||+++|.+++|+++++.++++.+..|.+|++.||+++|+|+++.+|+||+++++ T Consensus 313 ~PV~~~~~~~~~~~--~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~ 390 (408) T protein:vir:10 313 KQVIVVADRWLPNT--GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) T ss_pred eeeEEecccccCcc--CCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEe Confidence 57778777776654 3566789999999999999999999999999888999999999999999999999999999999 Q ss_pred cccCCCCCCCC Q lcl|Aclame:pro 382 DLSAPVEQPQG 392 (392) Q Consensus 382 ~~~a~~~~~~~ 392 (392) ++++|...--+ T Consensus 391 ~~~~~~~~~~~ 401 (408) T protein:vir:10 391 SAIADQVGNFK 401 (408) T ss_pred eccccCCCCCC Confidence 98765433222 No 9 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=1e-69 Score=398.76 Aligned_cols=382 Identities=34% Similarity=0.521 Sum_probs=301.8 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHhhh-------h--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccc---- Q lcl|Aclame:pro 1 MSK--ELRELLAKLEGKKEEVRSLMGE-------D--KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERN-NGRE---- 64 (392) Q Consensus 1 M~k--el~el~~~~~~~~~e~~~~~~~-------~--~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~-~~~~---- 64 (392) |+. .|+||++++.++.++++++.++ + ..++++.+.++++.+.++++.++...+....... .... T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 553 3566666666666665554332 1 1234556666666666666655544333222221 1111 Q ss_pred -ccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhccee Q lcl|Aclame:pro 65 -VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) Q Consensus 65 -~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~ 143 (392) ..........++.+++...++.... .....+.+++..++...||++||+++.+.|++.+++.++|+++|++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~ 153 (408) T protein:vir:74 81 PLNKSENELKDKFVKDFVNMVRNPMA-------FLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) T ss_pred cccchhhhhHHHHHHHHHHHHhcchh-------hhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhccee Confidence 1112222333444455554443321 12234667778888889999999999999999999999999999999 Q ss_pred eccCCcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 PVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) Q Consensus 144 ~~~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~ 222 (392) ++++.++.+++++..+ +..+.|++|+++.++++.++|++|++++++++++++||+|+++|+.++|.+||.++|+++++. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 233 (408) T protein:vir:74 154 SVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVV 233 (408) T ss_pred eccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 9999999998887655 355679999999998778999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhcccccccc-chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecc Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~ 301 (392) ++|.++++|+|+..+. +..++++++++++..++.+|+.+++|+|||++|.+|+++||++|+|||++++..+.+.+++|. T Consensus 234 ~~d~~il~G~G~~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~ 313 (408) T protein:vir:74 234 TRNQAIIAAMGTVPKKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK 313 (408) T ss_pred HHHHHHhhcccccccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecce Confidence 9999999999987765 556789999988889999999999999999999999999999999999999999999899884 Q ss_pred cceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEe Q lcl|Aclame:pro 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) Q Consensus 302 ~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~ 381 (392) ||+++++..+|.. .++..+++||||+++|.+++|+++++.++++.+..|.+|++.+|+++|+|+++++|+||+++++ T Consensus 314 -pV~~~~~~~~~~~--~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 390 (408) T protein:vir:74 314 -QVIVVADRWLPNS--GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) T ss_pred -eeEEecCcccccc--cCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEe Confidence 6877777676654 4567789999999999999999999999999888999999999999999999999999999998 Q ss_pred ccc--CCCCCCCC Q lcl|Aclame:pro 382 DLS--APVEQPQG 392 (392) Q Consensus 382 ~~~--a~~~~~~~ 392 (392) ++. +|.++|++ T Consensus 391 ~~~~~~~~~~~~~ 403 (408) T protein:vir:74 391 TAIADQVGNFKTT 403 (408) T ss_pred ecccCCCCCCCCC Confidence 774 44455555 No 10 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=5.7e-69 Score=394.71 Aligned_cols=382 Identities=33% Similarity=0.518 Sum_probs=306.5 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHhhh-------h--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhccc----- Q lcl|Aclame:pro 1 MSK--ELRELLAKLEGKKEEVRSLMGE-------D--KVAEAEQMMEEVRSLQKKIDLQRSLDEAETE-ERNNGR----- 63 (392) Q Consensus 1 M~k--el~el~~~~~~~~~e~~~~~~~-------~--~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~-~~~~~~----- 63 (392) |.- .|+||+++++++.++++++.++ . ..++.+.+.+++..+..+++.+++..+.... ...... T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 442 3667777777777766665432 1 2334555555666666666554443332222 111111 Q ss_pred cccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhccee Q lcl|Aclame:pro 64 EVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) Q Consensus 64 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~ 143 (392) ...........+++++|..++++.... ....+.+++..+++++||++||+++.+.|++.+++.++|+++|++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNMVRNPMAF-------LNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (404) T ss_pred ccccchhhhHHHHHHHHHHHHhcchhh-------hhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhccee Confidence 112222334556667777777664321 2234667788888889999999999999999999999999999999 Q ss_pred eccCCcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 PVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) Q Consensus 144 ~~~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~ 222 (392) ++++..+.+++++..+ ...+.|++|+++.++++.++|++|++++++++++++||+++++|+.++|.+||.++|+++++. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~ 233 (404) T protein:vir:39 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVV 233 (404) T ss_pred eccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHH Confidence 9999889988887654 366899999999998778999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhcccccccc-chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecc Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~ 301 (392) ++|.++++|+|+..+. +...++++.++++..++..++.+++|+|||++|.+|+++||++|+|||++++..+.+.+++|. T Consensus 234 ~~d~~il~g~g~~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~ 313 (404) T protein:vir:39 234 TRNQAIIAAMGTVPKKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK 313 (404) T ss_pred HHHHHHHhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecce Confidence 9999999999987764 556789999988888999999999999999999999999999999999999999989899985 Q ss_pred cceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEe Q lcl|Aclame:pro 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) Q Consensus 302 ~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~ 381 (392) ||+++++..+|+. ..+...++||||+++|.+++|+++++.++++.+.+|++|++.+|++.|+|+++.+|+||+++++ T Consensus 314 -pV~~~~~~~~~~~--~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 390 (404) T protein:vir:39 314 -KVIVVADRWLPNS--GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSF 390 (404) T ss_pred -eEEEecccccCcc--CCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEe Confidence 7778777666653 4566789999999999999999999999999888999999999999999999999999999998 Q ss_pred cccCC--CCCCCC Q lcl|Aclame:pro 382 DLSAP--VEQPQG 392 (392) Q Consensus 382 ~~~a~--~~~~~~ 392 (392) +++++ .+.|+| T Consensus 391 ~~~a~~~~~~~~~ 403 (404) T protein:vir:39 391 TAIADQVGNFTAG 403 (404) T ss_pred eccccCCCCCCCC Confidence 88655 446677 No 11 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=6.4e-69 Score=394.45 Aligned_cols=376 Identities=34% Similarity=0.486 Sum_probs=300.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhh-------h--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------c Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGE-------D--KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGRE------V 65 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~-------~--~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~------~ 65 (392) |+ +++||+++++++.++++++.++ + ..++++++.++++.++++++.+++..+........... . T Consensus 1 Mk-~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:49 1 MK-TSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPL 79 (397) T ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 98 4667777777666666554332 1 23456777777777777777655544433322221111 1 Q ss_pred cccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec Q lcl|Aclame:pro 66 ETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) Q Consensus 66 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~ 145 (392) ..........++++|.++++++.. ....+...++.++||++||+++.+.|++.+++.++|+++|++.++ T Consensus 80 ~~~~~~~~~~~~~~~~~~l~~~~~-----------~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 148 (397) T protein:vir:49 80 TKSEEEVKAGFVKDFKNLVRGRYQ-----------NLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENV 148 (397) T ss_pred ccchhHHHHHHHHHHHHHHhcchh-----------HHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeec Confidence 111222233445555555554421 122334556777899999999999999999999999999999999 Q ss_pred cCCcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 146 RTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTR 224 (392) Q Consensus 146 ~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~ 224 (392) ++.++.+++++..+ .+.+.|++|++++++++.++|++|++++++++++++||+|+++|+.++|.+||.++|++++++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 228 (397) T protein:vir:49 149 TTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTR 228 (397) T ss_pred ccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHH Confidence 99899988887654 46689999999999877899999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcccccccc-chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccc Q lcl|Aclame:pro 225 NVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNP 303 (392) Q Consensus 225 d~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~p 303 (392) |.++++|+|+.++. +...+|++.+++ ..++.++..+++|+|||++|.+|++|||++|+|||+|++..+.+.+++|. | T Consensus 229 d~ai~~G~g~~~~~~~~~~~d~i~~~~-~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~-P 306 (397) T protein:vir:49 229 NKAILEAIAALPTKPTLTKWDDIIDLE-AKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGF-A 306 (397) T ss_pred HHHHHhhccccccccccccHHHHHHHH-HhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecce-e Confidence 99999999987654 456788888866 47788899999999999999999999999999999999999888899985 7 Q ss_pred eEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 304 v~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) |+++++..+++. .++..+++||||+++|.+++|++++++++++.+++|.+|++.||++.|+|+++.+|+||+++++++ T Consensus 307 V~~~~~~~~~~~--~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 307 VKEVADRWLANG--TGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 384 (397) T ss_pred eEEecccccccc--cCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeec Confidence 777777666653 356778999999999999999999999999988899999999999999999999999999999998 Q ss_pred cCCCCCCCC Q lcl|Aclame:pro 384 SAPVEQPQG 392 (392) Q Consensus 384 ~a~~~~~~~ 392 (392) ++..+.+.+ T Consensus 385 ~~~~~~~~~ 393 (397) T protein:vir:49 385 IADQKGNLG 393 (397) T ss_pred ccCCCCCcc Confidence 766555555 No 12 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=9.2e-68 Score=388.09 Aligned_cols=376 Identities=35% Similarity=0.492 Sum_probs=298.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHH---hhh------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------cc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSL---MGE------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGR------EV 65 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~---~~~------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~------~~ 65 (392) |++ +++|+++++++.++++++ +++ ...+++++++++++.++++++.+++..+.......... .. T Consensus 1 Mk~-~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:49 1 MKT-SNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPL 79 (397) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 983 556666655555555443 322 12345677777777777777665554443333221111 11 Q ss_pred cccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec Q lcl|Aclame:pro 66 ETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) Q Consensus 66 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~ 145 (392) .........++++++.+++++.... ..+....+++++||++||+++...|++.+++.++|++++++.++ T Consensus 80 ~~~~~~~~~~~~~~~~~~l~~~~~~-----------~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 148 (397) T protein:vir:49 80 TKNEEEVKANFVKDFKNLVRGRYQN-----------LLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENV 148 (397) T ss_pred cchhhHHHHHHHHHHHHHhhcchhh-----------HHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeec Confidence 1222233445566666666654321 22345566778899999999999999999999999999999999 Q ss_pred cCCcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 146 RTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTR 224 (392) Q Consensus 146 ~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~ 224 (392) +++++.+++++..+ .+.+.|++|++++++++.++|++|++++++++++++||+++|+|+.++|.+||.++|++++++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 228 (397) T protein:vir:49 149 TTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTR 228 (397) T ss_pred cCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 99999998887654 46788999999999877789999999999999999999999999999999999999999999999 Q ss_pred HHHHhhcccccccc-chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccc Q lcl|Aclame:pro 225 NVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNP 303 (392) Q Consensus 225 d~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~p 303 (392) |.++++|.|++++. +..+||++.+++ ..++.++..+++|+|||++|.+|++|||++|+|||.|++..+.+.+++|. | T Consensus 229 d~ail~G~g~~~~~~~~~~~d~i~~~~-~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~-p 306 (397) T protein:vir:49 229 NKAILEAIGTLPNKPTLAKWDDIIDLQ-AKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGF-V 306 (397) T ss_pred HHHHHhccccccccccccCHHHHHHHH-HhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecce-e Confidence 99999999988765 567889999876 57788899999999999999999999999999999999999988999985 6 Q ss_pred eEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 304 v~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) |+++++..+|+ ..++..+++||||+++|.+++|++++++++++.+.+|++|++.||+++|+|+++.+|+||+++++++ T Consensus 307 V~~~~~~~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 307 VKEISDRFLPN--GTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKA 384 (397) T ss_pred eEEeccccccc--ccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEecc Confidence 77776666664 3456778999999999999999999999999988899999999999999999999999999999777 Q ss_pred c---CCCCCCCC Q lcl|Aclame:pro 384 S---APVEQPQG 392 (392) Q Consensus 384 ~---a~~~~~~~ 392 (392) + +|++.+.| T Consensus 385 ~~~~~~~~~~~~ 396 (397) T protein:vir:49 385 IADQKAKLSTAG 396 (397) T ss_pred cccccCcccccC Confidence 4 34444444 No 13 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=7e-67 Score=383.25 Aligned_cols=372 Identities=16% Similarity=0.185 Sum_probs=288.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhh------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccccccccccchh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGE------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER-NNGREVETRNVDGE 73 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 73 (392) |.+ ++++++.++++.+++..+.++ +..++...+.++++.++++++.+++..+...... .............. T Consensus 1 l~~-~k~l~~~i~e~~~~~~~~k~~~~~~~~~~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (407) T protein:vir:48 1 MAD-VKDVEQVAQELQRKFDDFKEKNDKRIDAIEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNKVA 79 (407) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Confidence 773 555555555544444333211 1113445566666666666666555443322221 22222333333455 Q ss_pred hHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeE Q lcl|Aclame:pro 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) Q Consensus 74 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~ 153 (392) .+++++|.++++.+.... ....+.+++..++..+||++||+++.++|++.+++.++|+++|++++++++ .+. T Consensus 80 ~e~~~a~~~~l~~g~~~~------~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~--~~~ 151 (407) T protein:vir:48 80 SEHKEAFIGFMRKGREDG------LRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGS--DYK 151 (407) T ss_pred hHHHHHHHHHHhccchhh------hhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCC--ceE Confidence 677888888887653221 223456788888888999999999999999999999999999999887654 456 Q ss_pred EEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE 233 (392) Q Consensus 154 ~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~ 233 (392) +++..+++.++|++|++..++++.++|+++++++++++++++||+|+|+|+.++|++||.++|+++++.++|.++++|.| T Consensus 152 ~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G 231 (407) T protein:vir:48 152 KLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDG 231 (407) T ss_pred EEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCC Confidence 66777888999999999999877789999999999999999999999999999999999999999999999999999977 Q ss_pred ccccc----------------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCcee Q lcl|Aclame:pro 234 KLTKQ----------------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYI 285 (392) Q Consensus 234 ~~~~~----------------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l 285 (392) +..+. +..++|++++++. .++.+|+.+++|+||+++|..|++|||++|||| T Consensus 232 ~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~-~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l 310 (407) T protein:vir:48 232 SKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIY-TLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYL 310 (407) T ss_pred CCccceeeecccccccccccccccccccccccccccChHHHHHHHH-hhchhhhcCCEEEEcHHHHHHHHHhhccCCcee Confidence 64332 1235788888765 689999999999999999999999999999999 Q ss_pred ecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEe Q lcl|Aclame:pro 286 LQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQR 365 (392) Q Consensus 286 ~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r 365 (392) |+|++..+.+.++||. ||+++++ +|. ..++..+++||||+++|.+++|.++++..++ +|.+|++.||+++| T Consensus 311 ~~~~~~~g~~~~l~G~-PV~~~~~--~p~--~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~----~~~~~~~~~~~~~r 381 (407) T protein:vir:48 311 WRPGIELGQPSSLAGY-GIVENEQ--MPD--IAADAKAIAFGNFKRGYTIVDRIGTRILRDP----YTNKPFVGFYTTKR 381 (407) T ss_pred eccCcCCCCCceecce-eeEEecC--cCC--ccCCccEEEEEeccccEEEEEeeceEEEeec----cccCCcEEEEEEEE Confidence 9999999999999985 6665443 443 3455678999999998999999999987754 46789999999999 Q ss_pred eCcEEecccceEEEEecccCCCCCCC Q lcl|Aclame:pro 366 DDVQMWDNEAAVYGEIDLSAPVEQPQ 391 (392) Q Consensus 366 ~~~~v~~~~af~~l~~~~~a~~~~~~ 391 (392) +|+++++|+||++++++++++....+ T Consensus 382 ~d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 382 TGGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred eccEEecccceEEEEeeccCCCCCCC Confidence 99999999999999999887765555 No 14 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.4e-66 Score=381.58 Aligned_cols=377 Identities=33% Similarity=0.483 Sum_probs=293.9 Q ss_pred CC--HHHHHHHHHHHHHHHHHHHHhhh------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------ccccc Q lcl|Aclame:pro 1 MS--KELRELLAKLEGKKEEVRSLMGE------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN------GREVE 66 (392) Q Consensus 1 M~--kel~el~~~~~~~~~e~~~~~~~------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~------~~~~~ 66 (392) |+ ++|++.++++.+..+++++.++. ...+++++++++++.++++++.++...+........ ..... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 77 34444444444444444433321 234567777888888877777665544433222111 11122 Q ss_pred ccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 67 TRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ........++.+.+..++++.... .......+++++||++||+++.+.|++.+++.++|++++++.+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 149 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGRYQN-----------LLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVT 149 (397) T ss_pred chhhHHHHHHHHHHHHHHhhhhhH-----------HHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeecc Confidence 222333344555555555543211 122334556678999999999999999999999999999999999 Q ss_pred CCcceeEEEeec-CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 147 TRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRN 225 (392) Q Consensus 147 ~~~~~~~~~~~~-~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d 225 (392) +.++.++++... ..+.+.|++|++..++++.++|++|++++++++++++||+++++|+.++|.+||.++|+++++.++| T Consensus 150 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d 229 (397) T protein:vir:48 150 TLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRN 229 (397) T ss_pred CCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 999988887654 4456899999999998777999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccccccc-cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccce Q lcl|Aclame:pro 226 VLILGVIEKLTK-QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPV 304 (392) Q Consensus 226 ~~~~~~~~~~~~-~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv 304 (392) .++++|+|+.++ ++..++|++++++ ..++..+..+++|+|||++|..|++|||++|+|||++++..+.+.+++| +|| T Consensus 230 ~~il~G~g~~~~~~~~~~~d~i~~~~-~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G-~PV 307 (397) T protein:vir:48 230 KAILEAIATLPTKPTLTKWDDIIDLQ-AKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDG-FAV 307 (397) T ss_pred HHHhhcccccccccccccHHHHHHHH-HHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecc-cee Confidence 999999998776 4567789999865 5778889999999999999999999999999999999999998899998 467 Q ss_pred EEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 305 VVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) +++++..+++. ..+..+++||||+++|.+++|++++++++++.+.+|.+|++.||+++|+|+++.+|+||++++++++ T Consensus 308 ~~~~~~~~~~~--~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 308 KEVADRWLANA--SSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred EEecccccCCc--CCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 77776666543 4567789999999999999999999999999888999999999999999999999999999999876 Q ss_pred CCCCCCCC Q lcl|Aclame:pro 385 APVEQPQG 392 (392) Q Consensus 385 a~~~~~~~ 392 (392) +..+...+ T Consensus 386 ~~~~~~~~ 393 (397) T protein:vir:48 386 ADQKGNLG 393 (397) T ss_pred ccCCCCcc Confidence 44333333 No 15 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.3e-66 Score=381.71 Aligned_cols=376 Identities=32% Similarity=0.488 Sum_probs=279.3 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHHh----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc---ccccc---- Q lcl|Aclame:pro 1 MSK-ELRELLAKLEGKKEEVRSLM----GEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGR---EVETR---- 68 (392) Q Consensus 1 M~k-el~el~~~~~~~~~e~~~~~----~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~---~~~~~---- 68 (392) |+. +|++.++++.+..+++.+.+ .+++.++.+...++++.++++++..+...+...+...... ..... T Consensus 1 M~~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:38 1 MNINQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKP 80 (395) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 993 45544444444333333222 2223333444555666666666654443332222111100 00000 Q ss_pred -ccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 69 -NVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 69 -~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) .........+.+...++ ............+.++||++||+++.+.|++.+++.++|+++|++.++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 148 (395) T protein:vir:38 81 LPVKDGKPDAQAMKNQFV------------KDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTT 148 (395) T ss_pred cchhhhhHHHHHHHHHHH------------HHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccC Confidence 00000111111111111 01111112233455678999999999999999999999999999999999 Q ss_pred CcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNV 226 (392) Q Consensus 148 ~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~ 226 (392) ..+.++++...+ .+.+.|++|++.+++++.++|++|++++++++++++||+++++|+.++|.+||.++|+++++.++|. T Consensus 149 ~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ 228 (395) T protein:vir:38 149 SHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNA 228 (395) T ss_pred CcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 889988876654 4567899999999987779999999999999999999999999999999999999999999999999 Q ss_pred HHhhccccccc-cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceE Q lcl|Aclame:pro 227 LILGVIEKLTK-QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) Q Consensus 227 ~~~~~~~~~~~-~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~ 305 (392) ++++|.|+..+ .+...+++++++++..++..++.+++|+|||++|.+|+++||++|+|||++++.++.+.+++|. ||+ T Consensus 229 ~il~g~g~~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~-pV~ 307 (395) T protein:vir:38 229 KILEVMGKAPKKPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGK-PVI 307 (395) T ss_pred HHhhcccccccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccc-eeE Confidence 99999988665 4667899999998888999999999999999999999999999999999999999999999984 677 Q ss_pred EecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 306 VVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) ++++..+|. .++...++||||+++|.+++|++++++++++.+.+|++|++.||++.|+|+++.+|+||+++++++++ T Consensus 308 ~~~~~~~~~---~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (395) T protein:vir:38 308 RIADKWLPD---VSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVA 384 (395) T ss_pred EecccccCc---CCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 766544332 34567799999999999999999999999998888999999999999999999999999999988753 Q ss_pred CCC---CCCC Q lcl|Aclame:pro 386 PVE---QPQG 392 (392) Q Consensus 386 ~~~---~~~~ 392 (392) ..+ +..| T Consensus 385 ~~~~~~~~~~ 394 (395) T protein:vir:38 385 NQAQGTAGTG 394 (395) T ss_pred CCCCCccCCC Confidence 222 1222 No 16 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=6.1e-66 Score=378.12 Aligned_cols=378 Identities=26% Similarity=0.402 Sum_probs=293.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccccccccch--hh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNG----REVETRNVDG--EM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~----~~~~~~~~~~--~~ 74 (392) |.++|+++++++.++.++++.++++++.++++++++|+++|+++|+.+++..+...+..... .......... .. T Consensus 7 m~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) T protein:vir:47 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQ 86 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHH Confidence 77788999999999999999999999999999999999999999987665544333222111 1111111111 11 Q ss_pred HHHHHHHHHHhcchhhHHHHHHHHhhh---hhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLNAEEREFLEDDL---EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~~~~~~~~~~~~---~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) ..........+.......++....... ........+..+||++||+++.+.|++.+++.++|++++++++++++.+. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) T protein:vir:47 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) T ss_pred HHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCcee Confidence 111222222222222222222221111 11122233455788999999999999999999999999999999999999 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +++++..+...++|++|+++.++++.++|++|++++++++++++||+++++|+.++|.+||.++|++++++++|.++++| T Consensus 167 ~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g 246 (415) T protein:vir:47 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV 246 (415) T ss_pred EEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99998888889999999999998778899999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLTK----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~~----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) +++..+ .+..+++++++++.. +...++.+++|||||++|.+|+++||++|+|||+|++.++.+ T Consensus 247 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~ 325 (415) T protein:vir:47 247 ITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccCCccccccccccccceeccccccchHHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC Confidence 776422 234578899988764 455667788999999999999999999999999999999989 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||+++++... ..++...++||||+++|.+++|.++++.+++ |.++++.+|+++|+|+++.+|+| T Consensus 326 ~~l~G~-pV~~~~~~~~----~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~~~a 395 (415) T protein:vir:47 326 QRLLGA-KIEILPDEVL----GQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILDYKS 395 (415) T ss_pred ccccce-eeEEeccccc----cCCCccEEEEEehhccEEEEeecceEEEeec-----cccCceEEEEEEEeccEEecccc Confidence 999985 5666554322 2245667999999999999999999998875 45677899999999999999999 Q ss_pred eEEEEecccCCCCCCCC Q lcl|Aclame:pro 376 AVYGEIDLSAPVEQPQG 392 (392) Q Consensus 376 f~~l~~~~~a~~~~~~~ 392 (392) |+++++++++. +.| T Consensus 396 ~~~~~~~~~~~---~~~ 409 (415) T protein:vir:47 396 AIVIEYDDSER---GEG 409 (415) T ss_pred EEEEEeeccCC---CCC Confidence 99999985544 677 No 17 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=6.1e-66 Score=378.12 Aligned_cols=378 Identities=26% Similarity=0.402 Sum_probs=293.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccccccccch--hh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNG----REVETRNVDG--EM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~----~~~~~~~~~~--~~ 74 (392) |.++|+++++++.++.++++.++++++.++++++++|+++|+++|+.+++..+...+..... .......... .. T Consensus 7 m~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) T protein:vir:46 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQ 86 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHH Confidence 77788999999999999999999999999999999999999999987665544333222111 1111111111 11 Q ss_pred HHHHHHHHHHhcchhhHHHHHHHHhhh---hhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLNAEEREFLEDDL---EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~~~~~~~~~~~~---~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) ..........+.......++....... ........+..+||++||+++.+.|++.+++.++|++++++++++++.+. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) T protein:vir:46 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) T ss_pred HHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCcee Confidence 111222222222222222222221111 11122233455788999999999999999999999999999999999999 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +++++..+...++|++|+++.++++.++|++|++++++++++++||+++++|+.++|.+||.++|++++++++|.++++| T Consensus 167 ~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g 246 (415) T protein:vir:46 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV 246 (415) T ss_pred EEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99998888889999999999998778899999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLTK----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~~----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) +++..+ .+..+++++++++.. +...++.+++|||||++|.+|+++||++|+|||+|++.++.+ T Consensus 247 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~ 325 (415) T protein:vir:46 247 ITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccCCccccccccccccceeccccccchHHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC Confidence 776422 234578899988764 455667788999999999999999999999999999999989 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||+++++... ..++...++||||+++|.+++|.++++.+++ |.++++.+|+++|+|+++.+|+| T Consensus 326 ~~l~G~-pV~~~~~~~~----~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~~~a 395 (415) T protein:vir:46 326 QRLLGA-KIEILPDEVL----GQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILDYKS 395 (415) T ss_pred ccccce-eeEEeccccc----cCCCccEEEEEehhccEEEEeecceEEEeec-----cccCceEEEEEEEeccEEecccc Confidence 999985 5666554322 2245667999999999999999999998875 45677899999999999999999 Q ss_pred eEEEEecccCCCCCCCC Q lcl|Aclame:pro 376 AVYGEIDLSAPVEQPQG 392 (392) Q Consensus 376 f~~l~~~~~a~~~~~~~ 392 (392) |+++++++++. +.| T Consensus 396 ~~~~~~~~~~~---~~~ 409 (415) T protein:vir:46 396 AIVIEYDDSER---GEG 409 (415) T ss_pred EEEEEeeccCC---CCC Confidence 99999985544 677 No 18 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=6.6e-66 Score=377.93 Aligned_cols=365 Identities=16% Similarity=0.195 Sum_probs=282.3 Q ss_pred CCHHHHHHHHHHHHHH---HHHHHHhhhhhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccccccccccch Q lcl|Aclame:pro 1 MSKELRELLAKLEGKK---EEVRSLMGEDKVA----EAEQMMEEVRSLQKKIDLQRSLDEAETEER-NNGREVETRNVDG 72 (392) Q Consensus 1 M~kel~el~~~~~~~~---~e~~~~~~~~~~~----~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 72 (392) |.-+++++++.++++. +++++..++ ..+ +..++.++++.++.+++.++...+...+.. ...+......... T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~-~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDK-RVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKV 79 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 6655555555444444 444433221 112 233455666666666665554443332222 1222222233445 Q ss_pred hhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCccee Q lcl|Aclame:pro 73 EMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSR 152 (392) Q Consensus 73 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~ 152 (392) ..+++++|..+++.+... .....+.+++..+++..||++||+++.+.|++.+++.++|+++|+++++++.. + T Consensus 80 ~~e~~~a~~~~lr~~~~~------~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~ 151 (401) T protein:vir:44 80 AAEHKDAFVGFLRKGRED------GLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSD--Y 151 (401) T ss_pred hHHHHHHHHHHHhhhhhh------hhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc--e Confidence 567788888888755322 12234667788888889999999999999999999999999999998887654 4 Q ss_pred EEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 153 VLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) Q Consensus 153 ~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~ 232 (392) .++...+++.++|++|++..++++.++|++|++++++++++++||+|+|+|+.++|++||.++|+++++.++|.++++|. T Consensus 152 ~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~ 231 (401) T protein:vir:44 152 KKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGD 231 (401) T ss_pred EEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 55667778889999999998887778999999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 233 EKLTKQ----------------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) Q Consensus 233 ~~~~~~----------------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~ 284 (392) |+..+. +..+|+++++++. .++..|+.+++|+||+++|.+|+++||++||| T Consensus 232 G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~-~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~ 310 (401) T protein:vir:44 232 GTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIY-TLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNY 310 (401) T ss_pred CCCccceeeccccccccccccccccccccccccccccCHHHHHHHHH-hcchhhhcCCEEEEcHHHHHHHHHhhccCCce Confidence 764332 1234888888765 67889999999999999999999999999999 Q ss_pred eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEE Q lcl|Aclame:pro 285 ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQ 364 (392) Q Consensus 285 l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~ 364 (392) ||+|++..+.+.+++|. ||+++++ +|. ..++..+++||||+++|.+++|.++++..++ +|.+|++.||+++ T Consensus 311 l~~~~~~~g~~~~l~G~-PVv~~~~--~p~--~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~----~~~~~~v~~~a~~ 381 (401) T protein:vir:44 311 LWRPGLELGQPSSLAGY-GIAENEQ--MPD--IAADAKAIAFGNFKRGYTIVDRIGTRILRDP----YTNKPFVGFYTTK 381 (401) T ss_pred eecCCcCCCCCceecce-eeEEecC--cCC--ccCCccEEEEeehhccEEEEEecceEEeeec----cccCCcEEEEEEE Confidence 99999999989999985 5655443 343 3455667899999999999999999987654 4678999999999 Q ss_pred eeCcEEecccceEEEEeccc Q lcl|Aclame:pro 365 RDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 365 r~~~~v~~~~af~~l~~~~~ 384 (392) |+|+++++|+||++|+++++ T Consensus 382 r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 382 RTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EeccEEecccceEEEEeecC Confidence 99999999999999999988 No 19 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1e-65 Score=376.92 Aligned_cols=378 Identities=26% Similarity=0.399 Sum_probs=292.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----cccccccccch--hh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN----GREVETRNVDG--EM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~ 74 (392) |.++|++++++++++.++++.++++++.++++++.+|+++|+++|+.+++..+........ ........... .. T Consensus 7 l~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) T protein:vir:79 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQ 86 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHH Confidence 6667788888888888888888888888999999999999999998766554433222111 11111111111 11 Q ss_pred HHHHHHHHHHhcchhhHHHHHHHHhhhh---hhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLNAEEREFLEDDLE---QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~---~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) .....+...++.......++........ .......+..+||++||+++.+.|++.+++.++|++++++.+++++++. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) T protein:vir:79 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) T ss_pred HHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCcee Confidence 1112222223333222222222211111 1112233455788999999999999999999999999999999999999 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +++++..++..++|++|+++.++++.++|++|++++++++++++||+++++|+.++|++||.++|+++++.++|.+++.| T Consensus 167 ~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g 246 (415) T protein:vir:79 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV 246 (415) T ss_pred EEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999999999999998777899999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLTK----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~~----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) +++..+ .+..+|+++++++. .+...+..+++|||||++|.+|+++||++|+|||+|++.++.+ T Consensus 247 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:79 247 ITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccCccccccccccccccccccccccchhHHHHHHH-hhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC Confidence 775422 24467899999875 4566677889999999999999999999999999999999888 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||+++++...+ ..++.+++||||+++|.+++|.++++.+++ |.++++.+|+++|+|+++.+|+| T Consensus 326 ~~l~G~-pV~~~~~~~~~----~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~~~a 395 (415) T protein:vir:79 326 QRLLGA-KIEILPDEVLG----QKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILDYKS 395 (415) T ss_pred ceecce-eeEEecccccC----CCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEecccc Confidence 999985 67665543222 245667999999998999999999999876 34567789999999999999999 Q ss_pred eEEEEecccCCCCCCCC Q lcl|Aclame:pro 376 AVYGEIDLSAPVEQPQG 392 (392) Q Consensus 376 f~~l~~~~~a~~~~~~~ 392 (392) |+++++++++. |.| T Consensus 396 ~~~~~~~~~~~---~~~ 409 (415) T protein:vir:79 396 AIVIEYDDSER---GEG 409 (415) T ss_pred EEEEEEeccCC---CCC Confidence 99999995554 777 No 20 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1e-65 Score=376.92 Aligned_cols=378 Identities=26% Similarity=0.399 Sum_probs=292.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----cccccccccch--hh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN----GREVETRNVDG--EM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~ 74 (392) |.++|++++++++++.++++.++++++.++++++.+|+++|+++|+.+++..+........ ........... .. T Consensus 7 l~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) T protein:vir:81 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQ 86 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHH Confidence 6667788888888888888888888888999999999999999998766554433222111 11111111111 11 Q ss_pred HHHHHHHHHHhcchhhHHHHHHHHhhhh---hhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLNAEEREFLEDDLE---QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~---~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) .....+...++.......++........ .......+..+||++||+++.+.|++.+++.++|++++++.+++++++. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) T protein:vir:81 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) T ss_pred HHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCcee Confidence 1112222223333222222222211111 1112233455788999999999999999999999999999999999999 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +++++..++..++|++|+++.++++.++|++|++++++++++++||+++++|+.++|++||.++|+++++.++|.+++.| T Consensus 167 ~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g 246 (415) T protein:vir:81 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV 246 (415) T ss_pred EEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999999999999998777899999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLTK----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~~----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) +++..+ .+..+|+++++++. .+...+..+++|||||++|.+|+++||++|+|||+|++.++.+ T Consensus 247 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:81 247 ITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccCccccccccccccccccccccccchhHHHHHHH-hhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC Confidence 775422 24467899999875 4566677889999999999999999999999999999999888 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||+++++...+ ..++.+++||||+++|.+++|.++++.+++ |.++++.+|+++|+|+++.+|+| T Consensus 326 ~~l~G~-pV~~~~~~~~~----~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~~~a 395 (415) T protein:vir:81 326 QRLLGA-KIEILPDEVLG----QKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILDYKS 395 (415) T ss_pred ceecce-eeEEecccccC----CCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEecccc Confidence 999985 67665543222 245667999999998999999999999876 34567789999999999999999 Q ss_pred eEEEEecccCCCCCCCC Q lcl|Aclame:pro 376 AVYGEIDLSAPVEQPQG 392 (392) Q Consensus 376 f~~l~~~~~a~~~~~~~ 392 (392) |+++++++++. |.| T Consensus 396 ~~~~~~~~~~~---~~~ 409 (415) T protein:vir:81 396 AIVIEYDDSER---GEG 409 (415) T ss_pred EEEEEEeccCC---CCC Confidence 99999995554 777 No 21 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1e-65 Score=376.92 Aligned_cols=378 Identities=26% Similarity=0.399 Sum_probs=292.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----cccccccccch--hh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN----GREVETRNVDG--EM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~ 74 (392) |.++|++++++++++.++++.++++++.++++++.+|+++|+++|+.+++..+........ ........... .. T Consensus 7 l~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) T protein:vir:98 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQ 86 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHH Confidence 6667788888888888888888888888999999999999999998766554433222111 11111111111 11 Q ss_pred HHHHHHHHHHhcchhhHHHHHHHHhhhh---hhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLNAEEREFLEDDLE---QRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~---~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) .....+...++.......++........ .......+..+||++||+++.+.|++.+++.++|++++++.+++++++. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) T protein:vir:98 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) T ss_pred HHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCcee Confidence 1112222223333222222222211111 1112233455788999999999999999999999999999999999999 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +++++..++..++|++|+++.++++.++|++|++++++++++++||+++++|+.++|++||.++|+++++.++|.+++.| T Consensus 167 ~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g 246 (415) T protein:vir:98 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV 246 (415) T ss_pred EEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999999999999998777899999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLTK----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~~----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) +++..+ .+..+|+++++++. .+...+..+++|||||++|.+|+++||++|+|||+|++.++.+ T Consensus 247 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:98 247 ITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN-LNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccCccccccccccccccccccccccchhHHHHHHH-hhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCC Confidence 775422 24467899999875 4566677889999999999999999999999999999999888 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||+++++...+ ..++.+++||||+++|.+++|.++++.+++ |.++++.+|+++|+|+++.+|+| T Consensus 326 ~~l~G~-pV~~~~~~~~~----~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~~~~~r~d~~v~~~~a 395 (415) T protein:vir:98 326 QRLLGA-KIEILPDEVLG----QKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILDYKS 395 (415) T ss_pred ceecce-eeEEecccccC----CCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEecccc Confidence 999985 67665543222 245667999999998999999999999876 34567789999999999999999 Q ss_pred eEEEEecccCCCCCCCC Q lcl|Aclame:pro 376 AVYGEIDLSAPVEQPQG 392 (392) Q Consensus 376 f~~l~~~~~a~~~~~~~ 392 (392) |+++++++++. |.| T Consensus 396 ~~~~~~~~~~~---~~~ 409 (415) T protein:vir:98 396 AIVIEYDDSER---GEG 409 (415) T ss_pred EEEEEEeccCC---CCC Confidence 99999995554 777 No 22 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1.4e-65 Score=376.20 Aligned_cols=378 Identities=26% Similarity=0.399 Sum_probs=294.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccccccccc--cchhh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER----NNGREVETRN--VDGEM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~----~~~~~~~~~~--~~~~~ 74 (392) |.++|++++++++++.++++.++++++.++++++.+|++.|+++|+.+.+..+...+.. .......... ..... T Consensus 7 l~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (415) T protein:vir:94 7 LQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQ 86 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHH Confidence 77788889999999999999999999999999999999999999987655433322211 1111111111 11111 Q ss_pred HHHHHHHHHHhcchhhHHHHHHHHhh---hhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLNAEEREFLEDD---LEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~~~~~~~~~~~---~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) .....+....+.......++...... .........+..+||++||+++.+.|++.+++.++|++++++.+++++++. T Consensus 87 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 166 (415) T protein:vir:94 87 ANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGK 166 (415) T ss_pred HHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCcee Confidence 11222233333333333332222111 111222334456789999999999999999999999999999999999999 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +++++..+++.+.|++|++++++.+.++|++|++++++++++++||+|+++|+.++|++||.++|+++++.++|.++++| T Consensus 167 ~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g 246 (415) T protein:vir:94 167 YPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDV 246 (415) T ss_pred EEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 99999999999999999999998777899999999999999999999999999999999999999999999999999998 Q ss_pred cccccc----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLTK----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~~----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) +++..+ .+...|+++++++.. +...+..+++|||||++|.+|+++||++|+|||.|++.++.+ T Consensus 247 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:94 247 ITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccCccccccccccccccccccccccchHHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCC Confidence 776432 244678999998765 455667788999999999999999999999999999999988 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||+++++...+ ..++.+++||||+++|.+++|.++++.+++ |.++++.+|+++|+|+++.+|+| T Consensus 326 ~~l~G~-pV~~~~~~~~~----~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~~r~~~r~d~~~~~~~a 395 (415) T protein:vir:94 326 QRLLGA-KIEILPDEVLG----QKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGECLMIAVRQDCRILDYKS 395 (415) T ss_pred ceecce-eeEEecccccC----CCCccEEEEEehhccEEEEeecceEEEEec-----cccCceEEEEEEEeccEEecccc Confidence 999995 57665543222 234567999999999999999999998875 45677899999999999999999 Q ss_pred eEEEEecccCCCCCCCC Q lcl|Aclame:pro 376 AVYGEIDLSAPVEQPQG 392 (392) Q Consensus 376 f~~l~~~~~a~~~~~~~ 392 (392) |+++++++++. |.| T Consensus 396 ~~~~~~~~~~~---~~~ 409 (415) T protein:vir:94 396 AIVIEYDDSER---GEG 409 (415) T ss_pred EEEEEEeccCC---CCC Confidence 99999885544 777 No 23 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=2.7e-65 Score=374.61 Aligned_cols=372 Identities=23% Similarity=0.324 Sum_probs=286.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---ccccccccc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMG-------EDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN---GREVETRNV 70 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~-------~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~---~~~~~~~~~ 70 (392) |.++|++++++++++.+++..+.+ +++.++++++++|+++++++++++++..+........ ......... T Consensus 2 ~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~ 81 (394) T protein:vir:97 2 FEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVT 81 (394) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 556899999888888777665543 3455678888889999998888776544333222111 111122222 Q ss_pred chhhHHHHHHHHHHhcchhhHH---------H-HH-HHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhh Q lcl|Aclame:pro 71 DGEMEYRDVFMKALRNKPLNAE---------E-RE-FLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY 139 (392) Q Consensus 71 ~~~~~~~~a~~~~~~~~~~~~~---------~-~~-~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l 139 (392) ....++++.+..+++....... + .. ..............+..+||++||+++.+.|++.+++.++|+++ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~ 161 (394) T protein:vir:97 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhh Confidence 3344445555554443221100 0 00 00111112223345667799999999999999999999999999 Q ss_pred cceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 140 VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKK 219 (392) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~ 219 (392) |++.+++++++.+++... +++.+.|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|++||.++|+++ T Consensus 162 ~~~~~~~~~~~~~~~~~~-~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~ 240 (394) T protein:vir:97 162 TTVYQAKKASGKYPVLQR-ATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) T ss_pred ceeeeccCcceEEEEEec-CCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHH Confidence 999999988888777653 4467899999999998778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCccccee Q lcl|Aclame:pro 220 SKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFA 299 (392) Q Consensus 220 ~~~~~d~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~ 299 (392) ++.++|.++++|.++.++.+..+++++++++...++.+ .+++|||||++|.+|++|||++|+|||+|++.++.+.++| T Consensus 241 ~~~~~~~~i~~g~~~~~~~~~~~~~~~~~~~~~~~~~~--~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~ 318 (394) T protein:vir:97 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPA--YNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLL 318 (394) T ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhh--hCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceec Confidence 99999999999999999999999999999887766554 3688999999999999999999999999999998888999 Q ss_pred cccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEE Q lcl|Aclame:pro 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) Q Consensus 300 g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l 379 (392) |. ||+++++. ..+...++||||+++|.+++|++++++++++. .+...+|+++|+|+++.+|+||+++ T Consensus 319 G~-pv~~~~~~-------~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~r~d~~v~~~~a~~~~ 385 (394) T protein:vir:97 319 GK-PVFVLSDE-------VLGANKAFIGDFKRGVLFADRKDLGLRWADNE-----IYGQYLQAVLRFGVSKVDDKAGYYV 385 (394) T ss_pred cc-eeEEeccc-------ccCCccEEEeeccccEEEEEecceEEEEeccc-----ccceeEEEEEEEccEEecccceEEE Confidence 95 66665432 33455689999999999999999999987753 3346899999999999999999999 Q ss_pred EecccCCCCCCC Q lcl|Aclame:pro 380 EIDLSAPVEQPQ 391 (392) Q Consensus 380 ~~~~~a~~~~~~ 391 (392) ++++++. |= T Consensus 386 ~~~~~~~---p~ 394 (394) T protein:vir:97 386 TFTPEPL---PL 394 (394) T ss_pred Eeccccc---CC Confidence 9875422 22 No 24 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.1e-64 Score=371.16 Aligned_cols=367 Identities=25% Similarity=0.362 Sum_probs=281.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhh------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------c Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGED------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNG-----------R 63 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~------~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~-----------~ 63 (392) |+| |++++++++++.+++++.+++. ..++.+++.++++.+..+++.+++..+...+..... + T Consensus 1 M~~-l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 79 (394) T protein:vir:10 1 MDK-LQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQP 79 (394) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcc Confidence 995 8999999999988888876532 123445555555555555554443332221111110 0 Q ss_pred cccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhccee Q lcl|Aclame:pro 64 EVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) Q Consensus 64 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~ 143 (392) .............++++..++++.... ...+....++++||++||+++.+.|++.+++.++|+++|++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~l~~~~~~-----------~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 148 (394) T protein:vir:10 80 NGTDLKKKPIDAKKKAINDFIHSHGKV-----------IDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT 148 (394) T ss_pred cccchhhhHHHHHHHHHHHHHhccchh-----------hhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee Confidence 111111222234445666666654321 223455567788999999999999999999999999999999 Q ss_pred eccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 PVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVT 223 (392) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~ 223 (392) +++++++.++++...+ ..+.|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|++||.++|+++++.+ T Consensus 149 ~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~ 227 (394) T protein:vir:10 149 PVTTPKGTYPILKRAT-DRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNT 227 (394) T ss_pred eccCCceEEEEEecCC-CccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 9999888888876543 568899999999987789999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccccccc---chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccC----Cccc Q lcl|Aclame:pro 224 RNVLILGVIEKLTKQ---AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQ----KNKK 296 (392) Q Consensus 224 ~d~~~~~~~~~~~~~---~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~----~~~~ 296 (392) +|.++++++++..+. +..++|++.+++...++.++ +++|||||++|.+|++|||++|||||+|++.. +.+. T Consensus 228 ~~~~il~g~g~~~~~~~~~~~~~d~l~~~~~~~~~~~~--~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~ 305 (394) T protein:vir:10 228 YNAMIAPVLQSFTAKATTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKG 305 (394) T ss_pred HHHHHhhcccccccccccccccHHHHHHHHHhhhhhhc--cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCccc Confidence 999999999876654 45668888888777777665 58899999999999999999999999987654 4456 Q ss_pred ceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccce Q lcl|Aclame:pro 297 LFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAA 376 (392) Q Consensus 297 ~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af 376 (392) ++||. ||+++++..++. .+++.+++||||+++|.+++++++++.++++. .|. ..+++++|+|+++++|+|| T Consensus 306 ~L~G~-PV~~~~~~~~~~---~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~--~~~---~~~~~~~r~d~~~~~~~ai 376 (394) T protein:vir:10 306 TVLGV-PVYVVGDALLGS---AAGDQKAFVGDLKRGVLFADRQQVTLAWEDSK--IYG---RYLGAAFRFGVKQADSNAG 376 (394) T ss_pred ccccc-eeEEecccccCC---CCCceEEEEeeccccEEEEeecceEEEEeccc--ccc---eeEEEEEEeccEEeccccE Confidence 78885 677766655443 35567899999999999999999999988754 344 4689999999999999999 Q ss_pred EEEEecccCCCCCCCC Q lcl|Aclame:pro 377 VYGEIDLSAPVEQPQG 392 (392) Q Consensus 377 ~~l~~~~~a~~~~~~~ 392 (392) ++++++++++ .+++| T Consensus 377 ~~~~~~~~~~-~~~~~ 391 (394) T protein:vir:10 377 YFVTNTDAAS-GSTSG 391 (394) T ss_pred EEEEeecccC-CCCCC Confidence 9999888755 45555 No 25 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=2.4e-64 Score=369.34 Aligned_cols=366 Identities=25% Similarity=0.352 Sum_probs=281.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhh------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------cc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGED------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNG----------RE 64 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~------~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~----------~~ 64 (392) |+| |++++++++++.+++++.+++. ..++.+++.++++.+.++++.+++..+......... .. T Consensus 1 mee-L~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (389) T protein:vir:10 1 MDK-LQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKK 79 (389) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 884 8888888888888887776531 123455555566655555554443333222211110 01 Q ss_pred ccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceee Q lcl|Aclame:pro 65 VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) Q Consensus 65 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~ 144 (392) ...........+++++..++++.. ...+++..++..+||++||+++...|++.+++.++|+++|++.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~lr~~~------------~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~ 147 (389) T protein:vir:10 80 GTDLSKKPIDAKKKAINDFIHSHG------------KVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTP 147 (389) T ss_pred ccccchhHHHHHHHHHHHHhhcch------------hhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceee Confidence 111111222233445555554432 23455666777889999999999999999999999999999999 Q ss_pred ccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTR 224 (392) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~ 224 (392) ++++++.+++....+ ..+.|++|+++.++.+.++|++|++.++++++++++|+|+|+|+.++|.+||.++|+++++.++ T Consensus 148 ~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~ 226 (389) T protein:vir:10 148 VTTPKGTYPILKRAT-DRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTY 226 (389) T ss_pred ccCCeeEEEEEecCC-CccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 999888888876654 5678999999999777899999999999999999999999999999999999999999999999 Q ss_pred HHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccC----Ccccc Q lcl|Aclame:pro 225 NVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQ----KNKKL 297 (392) Q Consensus 225 d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~----~~~~~ 297 (392) |.+|++++++..+ .+...+|++.++++..++.++ +++|+|||++|.+|+++||++|+|||+|++.. +.+.+ T Consensus 227 ~~~i~~g~~~~~~~~~~~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~ 304 (389) T protein:vir:10 227 NAMIAPVLQSFTAKKTTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGT 304 (389) T ss_pred HHHHhhhhcccccccccccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccc Confidence 9999999887554 345678999988877777766 67899999999999999999999999987654 34467 Q ss_pred eecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceE Q lcl|Aclame:pro 298 FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAV 377 (392) Q Consensus 298 ~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~ 377 (392) +|| +||+++++..++.. +++.+++||||+++|.+++|+++++.++++. .|. ..+|+++|+|+++.+|+||+ T Consensus 305 l~G-~pV~~~~~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~~---~~~~~~~r~d~~~~~~~a~~ 375 (389) T protein:vir:10 305 ILG-VPVYVVGDTLLGSL---AGDQKAFVGDLKRGVLFTDRQQVTLAWEDSK--IYG---KYLGAAFRFGVQKADSKAGY 375 (389) T ss_pred ccc-ceeEEecccccCCC---CCceEEEEeeccccEEEEeecceEEEeeccc--ccc---ceEEEEEEeccEEecccceE Confidence 888 56777776555543 4556899999999999999999999998853 343 47899999999999999999 Q ss_pred EEEecccCCCCCCC Q lcl|Aclame:pro 378 YGEIDLSAPVEQPQ 391 (392) Q Consensus 378 ~l~~~~~a~~~~~~ 391 (392) ++++++++.+++-+ T Consensus 376 ~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 376 FVTNTDVPGSALGK 389 (389) T ss_pred EEEeeccCCCCCCC Confidence 99999887777777 No 26 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=2e-64 Score=369.79 Aligned_cols=372 Identities=24% Similarity=0.324 Sum_probs=270.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--ccccccchhh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKV----AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGRE--VETRNVDGEM 74 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~----~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 74 (392) |.++|+|++++++++.+|+|+++++++. ++.+.+.++++.++.+++.+++..+...+....... .......... T Consensus 10 ~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (400) T protein:vir:38 10 VKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEH 89 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhh Confidence 5566677777777788888877654321 123333444444444444333322222111110000 0000000111 Q ss_pred HHHHHHHHHHhcchhh------------HHHHHHHHhhhhhhhhc-cccccccceecchhhhhHHHHhHHhhhhhhhhcc Q lcl|Aclame:pro 75 EYRDVFMKALRNKPLN------------AEEREFLEDDLEQRAMS-GLTGEDGGLVIPQDIQTQINELARSFDALEQYVT 141 (392) Q Consensus 75 ~~~~a~~~~~~~~~~~------------~~~~~~~~~~~~~~a~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~ 141 (392) ..+..+....+..... ................. ..+..+||++||+++.+.|++.+++.++|+++++ T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~ 169 (400) T protein:vir:38 90 SYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTN 169 (400) T ss_pred hHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcce Confidence 1111111111111000 00000000111112222 2356678999999999999999999999999999 Q ss_pred eeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 142 VEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSK 221 (392) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~ 221 (392) +.+++++++.++++... .+.++|++|+++.++.+.++|++|++.+++++++++||+|+|+|+.++|++||.++|+++++ T Consensus 170 ~~~~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~ 248 (400) T protein:vir:38 170 VFQASTQKGTYPTVANA-TTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKV 248 (400) T ss_pred eEeccCcceEEEEEecC-CCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 99999888888877644 45788999999999877899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccccccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecc Q lcl|Aclame:pro 222 VTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) Q Consensus 222 ~~~d~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~ 301 (392) .+++.+++.|+++.++.+..+++++.+++...++.++ +++|+|||++|.+|++|||++|+|||+|++..+.+.+++|. T Consensus 249 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~ 326 (400) T protein:vir:38 249 NTTNGAVATLLKGFTAKTISSVDDLKHINNVDLDPAY--SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGM 326 (400) T ss_pred HHHHHhhhhccccccccccccHHHHHHHHHhhhhhhh--CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccc Confidence 9999999999999999999999999998876666554 68999999999999999999999999999999989999985 Q ss_pred cceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEe Q lcl|Aclame:pro 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) Q Consensus 302 ~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~ 381 (392) ||+++++... ..+++..++||||+++|.+++|.++++.++++. .| ...+|+++|+|+++.+|+||++|++ T Consensus 327 -pv~~~~~~~~----~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~--~~---~~~~~~~~r~d~~~~~~~a~~~l~~ 396 (400) T protein:vir:38 327 -PIAVVSDDTL----GAAGEAHAFLGDIKRAILFANRADFMVRWVDDQ--IY---GQFLQAGMRFGVSVADEKAGYFLTY 396 (400) T ss_pred -eeEEeccccc----CCCCceEEEEEeccccEEEEeecceEEEEeccc--cc---ceeEEEEEEeccEEecccceEEEEe Confidence 6766554322 234567799999999999999999999998864 23 4589999999999999999999999 Q ss_pred cccC Q lcl|Aclame:pro 382 DLSA 385 (392) Q Consensus 382 ~~~a 385 (392) +++| T Consensus 397 ~~~a 400 (400) T protein:vir:38 397 TPKA 400 (400) T ss_pred ecCC Confidence 8877 No 27 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=2.4e-64 Score=369.33 Aligned_cols=357 Identities=16% Similarity=0.213 Sum_probs=266.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhh----------hhHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHH-Hhh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGE----------DKVAEAEQ---------MMEEVRSLQKKIDLQRSLDEAETE-ERN 60 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~----------~~~~~~~~---------~~~ei~~l~~~i~~~~~~~~~~~~-~~~ 60 (392) |.+.|.|+|++... |+++++++ ++.+++++ ..++++.++.+++.++...+.... ... T Consensus 21 ~~~~l~e~ra~~~~---e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~ 97 (425) T protein:vir:10 21 VPRGIISVRAEGPT---EVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAA 97 (425) T ss_pred hhHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 55555555544332 22222111 11111111 122233334444433222211111 111 Q ss_pred ccccccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhc Q lcl|Aclame:pro 61 NGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYV 140 (392) Q Consensus 61 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~ 140 (392) ..............+++++|..+++.+ ...++++.++.++||++||+++.+.|++.+++.++|+++| T Consensus 98 ~~~~~~~~~~~~~~~~~~af~~~l~~~-------------e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~ 164 (425) T protein:vir:10 98 AQMGANGVKPLRDPEYTEAFKAHVKRG-------------DVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLC 164 (425) T ss_pred hhcccccccccccHHHHHHHHHHhhhh-------------hhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhc Confidence 111122223334456666776666543 2345667778889999999999999999999999999999 Q ss_pred ceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 141 TVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKS 220 (392) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~ 220 (392) ++++++++... +++..+++.+.|++|++..|+++.++|++|++++++++++++||+|+++|+.++|++||.++|++++ T Consensus 165 ~~~~~~~~~~~--~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai 242 (425) T protein:vir:10 165 RVQPVSKAGFS--KLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEF 242 (425) T ss_pred eeeeccCCceE--EEEEcCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHH Confidence 99988766554 4556777889999999999987778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcccccccc----------------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHH Q lcl|Aclame:pro 221 KVTRNVLILGVIEKLTKQ----------------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFN 272 (392) Q Consensus 221 ~~~~d~~~~~~~~~~~~~----------------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~ 272 (392) +.++|.++++|.|+..+. +...++++++++. .+++.|+.+++|+|||++|. T Consensus 243 ~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~-~l~~~~~~~a~~vmn~~~~~ 321 (425) T protein:vir:10 243 AKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVY-DLPSAFTGNARFAMNRNTQR 321 (425) T ss_pred HHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHh-hhhhhhccCCEEEEchHHHH Confidence 999999999997754321 2336788888664 78899999999999999999 Q ss_pred HHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhh Q lcl|Aclame:pro 273 YLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKA 352 (392) Q Consensus 273 ~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~ 352 (392) +|+++||++|+|||+|++..+.+.++||. ||+++++ +|. ..++..+++||||+++|.+++|.++++..++ + T Consensus 322 ~L~~lkD~~G~~l~~~~~~~g~~~~l~G~-PV~~~~~--~p~--~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~----~ 392 (425) T protein:vir:10 322 QVRKLKDGQGNYLWQPSYVAGQPATLAGY-PVTEVPD--MPD--VAANSTPILFGDFQQTYLIIDRIGVRVLRDP----Y 392 (425) T ss_pred HHHHhhcCCCceeeccCccCCCCceecce-eeEEecC--cCC--ccCCccEEEEEehhccEEEEEecceEEEecc----c Confidence 99999999999999999999999999985 6766543 443 2345678999999999999999999886654 4 Q ss_pred hhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 353 FTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 353 f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) |.+|++.|+++.|+|+++++|+||++++++++- T Consensus 393 ~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 393 TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 678999999999999999999999999999888 No 28 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.6e-63 Score=364.83 Aligned_cols=383 Identities=23% Similarity=0.316 Sum_probs=272.5 Q ss_pred CC-----HHHHHHHHHHHHHHHHHHHHhhhhh--HH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---- Q lcl|Aclame:pro 1 MS-----KELRELLAKLEGKKEEVRSLMGEDK--VA-------EAEQMMEEVRSLQKKIDLQRSLDEAETEERNNG---- 62 (392) Q Consensus 1 M~-----kel~el~~~~~~~~~e~~~~~~~~~--~~-------~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~---- 62 (392) |+ ++|++++++++++.+|++.+.+..+ .+ +++++.+++++++++++..+.......+..... T Consensus 1 Mki~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~ 80 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLV 80 (437) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65 3455566666666777776654321 12 233444555555555443332221111100000 Q ss_pred --------ccccccccchh-hH-------HHHHHHHHH-hcc------------hhhHHHH---HHHHhhhhhhhhcccc Q lcl|Aclame:pro 63 --------REVETRNVDGE-ME-------YRDVFMKAL-RNK------------PLNAEER---EFLEDDLEQRAMSGLT 110 (392) Q Consensus 63 --------~~~~~~~~~~~-~~-------~~~a~~~~~-~~~------------~~~~~~~---~~~~~~~~~~a~~~~~ 110 (392) ........... .+ ......... +.. ....... .......+.++....+ T Consensus 81 ~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~ 160 (437) T protein:vir:10 81 APELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIA 160 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcc Confidence 00000000000 00 000000000 000 0000000 0001122344566677 Q ss_pred ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhhe Q lcl|Aclame:pro 111 GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDR 190 (392) Q Consensus 111 ~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i 190 (392) ..+||++||+++...|.. +++.++|+.++++.+++++.+.+++.... ...++|++|++..++++.++|++|++.++++ T Consensus 161 ~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~ 238 (437) T protein:vir:10 161 LKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESVTTTTGKLPIFNNS-TDLLTAHTEYGQTTKNATPVITPILWDLKTY 238 (437) T ss_pred cccccccchHHHHHHHHH-hhhhhhhhhcceeEeeccCceeeEEeecc-ccccccccccccccccccccceeeeeehhhe Confidence 788999999999876654 57889999999999998888887766543 4678999999999987889999999999999 Q ss_pred eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--chhhHHHHHHHHHHHhhhcccCCceEEEcH Q lcl|Aclame:pro 191 AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--AIKSLDDIKDVLNVKLDPAISPNAILLTNQ 268 (392) Q Consensus 191 ~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~--~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~ 268 (392) +++++||+|+|+|+.++|.+||.++|+++++.+++.++++|+|++.+. +..+++++.+++...++++|+.+++|+||| T Consensus 239 ~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 318 (437) T protein:vir:10 239 TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQ 318 (437) T ss_pred eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHhhhhhhhhcCCEEEEcH Confidence 999999999999999999999999999999999999999998876554 445688899988888999999999999999 Q ss_pred HHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEecc Q lcl|Aclame:pro 269 DGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDV 348 (392) Q Consensus 269 ~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~ 348 (392) ++|.+|++|||++|+|||+|++..+.+.++|| +||+++++..+|.. .+++.+++||||+++|.+++|.++++.+++. T Consensus 319 ~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~--~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~ 395 (437) T protein:vir:10 319 SAYNLFDMATDAMGRPLLQPNVTAATGYTLLG-KTVVIVDDKLFPSA--SAGDVNIVVAPLKKAVINFKLTEITGQFQDT 395 (437) T ss_pred HHHHHHHHhhccCCCeeeccCccCCCCccccc-ceeEEecccccCCc--CCCceEEEEeeccccEEEEeeeceEEEEecc Confidence 99999999999999999999999998999998 57777777666543 4567789999999999999999999988764 Q ss_pred chhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 349 ~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) |..+...+++.+|+|+++++|+||++|+.+.++.++++++ T Consensus 396 ----~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~ 435 (437) T protein:vir:10 396 ----YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQST 435 (437) T ss_pred ----cccccceeeEEEEEccEEecccceEEEEeeccccccCCCC Confidence 3445578899999999999999999999886655555555 No 29 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=3.4e-63 Score=363.08 Aligned_cols=370 Identities=23% Similarity=0.332 Sum_probs=281.8 Q ss_pred CC--HHHHHHHHHHHHHHHHHHH-------HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcccc--- Q lcl|Aclame:pro 1 MS--KELRELLAKLEGKKEEVRS-------LMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER----NNGRE--- 64 (392) Q Consensus 1 M~--kel~el~~~~~~~~~e~~~-------~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~----~~~~~--- 64 (392) |+ |+|+++++++.++.+++++ +.++.+.++++++.+|+++++++++.+++..+...... ..... T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 55 5677777777766655444 33444556788888888888888886655433322211 11111 Q ss_pred cccccc----chhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhc Q lcl|Aclame:pro 65 VETRNV----DGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYV 140 (392) Q Consensus 65 ~~~~~~----~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~ 140 (392) ...... ........++.+++++.....+ .++ ..+.++||++||+++.+.|++.+++.++|+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~ra--~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~ 148 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEE----------ERD--IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHC 148 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHH----------Hhh--ccccCCcceecchhhHHHHHHHHHhhhhhhhhc Confidence 111111 1122333445555554433222 222 234567899999999999999999999999999 Q ss_pred ceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 141 TVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKS 220 (392) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~ 220 (392) ++++++++++.+++........++|++|+++++++ .++|++|++++++++++++||+|+|+|+.++|++||.++|++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s-~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~ 227 (421) T protein:vir:13 149 HVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKA-MLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFA 227 (421) T ss_pred eeeeccCCceEEEEeecCCccceeecccccccccc-ccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHH Confidence 99999998889888887777788999999998875 69999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcccc-ccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCccccee Q lcl|Aclame:pro 221 KVTRNVLILGVIEK-LTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFA 299 (392) Q Consensus 221 ~~~~d~~~~~~~~~-~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~ 299 (392) ..++|..+++...+ .+..+..++|++++++. .+..+++.+++|||||++|.+|++|||++|+|||++ +..+.+.++| T Consensus 228 ~~~~~~~i~~~~~g~~~~~~~~~~d~i~~~~~-~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~-~~~~~~~tl~ 305 (421) T protein:vir:13 228 VNTENAEIVKQAKAVLAEETINDYAGLVKTIN-SLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE-LSDGGDLVFK 305 (421) T ss_pred HHHhhhhHhhhhhhccccccccchHHHHHHHH-HhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC-cCCCCCceec Confidence 99999999886544 34566678999999876 567888899999999999999999999999999975 6677788899 Q ss_pred cccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEE Q lcl|Aclame:pro 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) Q Consensus 300 g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l 379 (392) |. ||+++++.. . ...+...++||||+++|++++|++++++++++. +|.+|++.||++.|+|+++++|+||+.+ T Consensus 306 G~-pV~~~~~~~--~--~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~d~~~~~~~a~~~~ 378 (421) T protein:vir:13 306 GR-PVIELEESI--F--DVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA--GYTKNETIARIIERFDVNSPLDKSSDAE 378 (421) T ss_pred ce-eeEEecccc--c--cCCCceEEEEEeccccEEEEEecceEEEeeccc--ccccCeeEEEEEeeecceeecchhhhee Confidence 85 676665432 2 224577899999999999999999999999874 6999999999999999999999998766 Q ss_pred Eecc---------cCCCCCCCC Q lcl|Aclame:pro 380 EIDL---------SAPVEQPQG 392 (392) Q Consensus 380 ~~~~---------~a~~~~~~~ 392 (392) .+.. +++++.+.| T Consensus 379 ~~~~~~a~v~~~~~~~~~~~~~ 400 (421) T protein:vir:13 379 KIRKFGVIVKLQEVLKSSPRSG 400 (421) T ss_pred eecccceeeccccccCCCCcCC Confidence 5443 222222222 No 30 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=1.3e-62 Score=359.94 Aligned_cols=371 Identities=18% Similarity=0.180 Sum_probs=288.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhh--------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------ccc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMG--------EDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGR------EVE 66 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~--------~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~------~~~ 66 (392) |+ |+||+++++++.++++++++ +++.++++++++|++.++++|++.+++.+.......... ... T Consensus 1 M~--l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 78 (409) T protein:vir:45 1 MK--LHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDP 78 (409) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCC Confidence 99 56788888888888877654 355678899999999999999877665554433221111 111 Q ss_pred ccccchhhHHHHHHHHHHhcchh--hHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceee Q lcl|Aclame:pro 67 TRNVDGEMEYRDVFMKALRNKPL--NAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~~~~--~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~ 144 (392) ........++.++|.++++++.. ...++. ...+.++...+++.+||++||+++.+.|++.+++.++|+++|++++ T Consensus 79 ~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~---~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~ 155 (409) T protein:vir:45 79 ENNSQQDEKRAQVFDKWMRHGASELTSEERK---ALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILT 155 (409) T ss_pred CCcchhhHHHHHHHHHHHHhhhhhccHHHHH---HHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeee Confidence 11222344556677877776432 222222 2235567777788889999999999999999999999999999999 Q ss_pred ccCCcceeEEEeec-CCccccccccccccccccccceeeEEechhhee-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 145 VRTRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRA-GILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) Q Consensus 145 ~~~~~~~~~~~~~~-~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~-~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~ 222 (392) +++..... ++... ....+.|++|++..+++ .++|.++++.++|++ ++++||+|+++|+.++|++||.++|+++++. T Consensus 156 ~~~~~~~~-~~~~~~~~~~~~~v~E~~~~~~~-~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~ 233 (409) T protein:vir:45 156 TSDGRTME-WATADGTSEVGVLLGENEEAGEE-DTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGR 233 (409) T ss_pred cCCCceEE-EEeeccCcccccccccccccccc-ccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHH Confidence 87654333 34333 34567899999999875 589999999999985 6899999999999999999999999999999 Q ss_pred HHHHHHhhccccccc------------------cchhhHHHHHHHHHHHhhhcccCCceE--EEcHHHHHHHHHhhccCC Q lcl|Aclame:pro 223 TRNVLILGVIEKLTK------------------QAIKSLDDIKDVLNVKLDPAISPNAIL--LTNQDGFNYLDKLKDKDG 282 (392) Q Consensus 223 ~~d~~~~~~~~~~~~------------------~~~~~~d~~~~~~~~~~~~~~~~~a~~--v~~~~~~~~L~~lkd~~g 282 (392) +++.++++|.|+... .+..+++++++++. .++.+|+.++.| +||+.+|.+|++|||++| T Consensus 234 ~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~-~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G 312 (409) T protein:vir:45 234 GEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKH-SIDPAYRRGPKFRLAFNDNTLKLISEMEDGQG 312 (409) T ss_pred HHHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHH-hhhhhhccCCeEEEEECHHHHHHHHHhhcCCC Confidence 999999998775421 12346788888765 778888888865 779999999999999999 Q ss_pred ceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEE Q lcl|Aclame:pro 283 KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) Q Consensus 283 ~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~ 362 (392) +|||++++..+.+.++||. ||++++ .+|.. .++...++||||+++ .+..+.+++++.+.+ .+|.+|++.||+ T Consensus 313 ~~i~~~~~~~~~~~~l~G~-PV~~~~--~~p~~--~~~~~~i~~Gd~~~~-~i~~~~~~~~~~~~d--~~~~~~~~~~~~ 384 (409) T protein:vir:45 313 RPLWLPDIVGVAPASVLNV-PYVIDQ--EIDDI--GAGKKFMFCGDFDRF-IIRRVRYMILKRLVE--RYAEYDQTGFLA 384 (409) T ss_pred ceeeccCcCCCCCceecce-eeEEec--CcCCc--cCCccEEEEeehhhh-heeeccceEEEEeec--ccccCCcEEEEE Confidence 9999999999999999995 666543 34432 345667899999985 578899999988764 467899999999 Q ss_pred EEeeCcEEecccceEEEEecccCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~ 387 (392) +.|+|+++++|+||+++++++++.+ T Consensus 385 ~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 385 FHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EEEeccEeechhheEEEEeccCCCC Confidence 9999999999999999999988776 No 31 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.9e-62 Score=358.91 Aligned_cols=365 Identities=16% Similarity=0.127 Sum_probs=269.7 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHHhhh--------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MSK-ELRELLAKLEGKKEEVRSLMGE--------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) Q Consensus 1 M~k-el~el~~~~~~~~~e~~~~~~~--------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) |++ +|++++++++++.+|++++.++ ++.+++++++.|++.++++|++..+..+...+.............. T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSG 80 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 885 6999999999999999988543 4556788899999999999886554433322211111111111111 Q ss_pred hhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcce Q lcl|Aclame:pro 72 GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGS 151 (392) Q Consensus 72 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 151 (392) ...........++|.+... ..+.... ... .....+..+|++++|+.+...|++.+++.++|++++++.++++. +. T Consensus 81 ~~~~~~~~~~~~~r~~~~~-~~r~~~~-~~~--~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~ 155 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLG-EARSFEF-APE--KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDA-NP 155 (390) T ss_pred chhhcchHHHHHHhhhhhh-hhHHHHh-hhh--hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCC-ce Confidence 0111111112233333211 1111111 111 11222333445555555555566777788888999999877543 45 Q ss_pred eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 152 RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 152 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) +.+|+.++.+.+.|++|+++++++ +++|+++++++++++++++||+|+|+|+.+++++||.++|+++++.++|.++++| T Consensus 156 ~~~p~~~~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G 234 (390) T protein:vir:62 156 LDFTVITGRSSASIVGETAEIPES-YPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITG 234 (390) T ss_pred eEEEEEcCCcceeeeccccccccc-ccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 677888888999999999999976 6999999999999999999999999999999999999999999999999999998 Q ss_pred ccccc----------------ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 232 IEKLT----------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 232 ~~~~~----------------~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) .|... ..+..+++++++++. .++++|+.+++||||+++|..|++|||++|+|||+|++..+.+ T Consensus 235 ~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~ 313 (390) T protein:vir:62 235 TGQPRGILTDASPATATFLATDTDSKVSDALIDLFH-EVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAP 313 (390) T ss_pred CCccccccccccccccceecccccccchHHHHHHHH-hhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCcc Confidence 76421 112346788888664 6788999999999999999999999999999999999999988 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||++++ .+| ...++||||++ |.+++++++++..+.+ .+|.+|++.||+++|+|+++++|+| T Consensus 314 ~~l~G~-Pv~~~~--~~p-------~~~i~~gd~s~-~~i~~~~~~~v~~~~~--~~~~~~~~~~~~~~r~d~~~~~~~A 380 (390) T protein:vir:62 314 SLFNGK-VVETDD--GMP-------ADKILFADLSK-YRVRFAGSLRVDRSVD--AKFSTDQIVYRFLQRADGLLVDARG 380 (390) T ss_pred ceeccc-ceEEec--CCC-------CccEEEeeccc-eeEEeecceEEEeecc--ccccCCcEEEEEEEEeCcEeechhh Confidence 888885 666543 223 34589999997 6789999999999875 4799999999999999999999999 Q ss_pred eEEEEecccC Q lcl|Aclame:pro 376 AVYGEIDLSA 385 (392) Q Consensus 376 f~~l~~~~~a 385 (392) |++|+++++| T Consensus 381 ~~~l~~~~~a 390 (390) T protein:vir:62 381 AKVLTVTPGA 390 (390) T ss_pred eEEEEeecCC Confidence 9999999888 No 32 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.3e-61 Score=354.32 Aligned_cols=364 Identities=17% Similarity=0.127 Sum_probs=269.2 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHHhhh--------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MSK-ELRELLAKLEGKKEEVRSLMGE--------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) Q Consensus 1 M~k-el~el~~~~~~~~~e~~~~~~~--------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) |+. .|++|+++++++.+|++++.++ ++.+++++++.|++.++++|++..+..+.................. T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSG 80 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc Confidence 995 6899999999999999987653 3455678888889888888875433322221111111111111111 Q ss_pred hhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHh-HHhhhhhhhhcceeeccCCcc Q lcl|Aclame:pro 72 GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINEL-ARSFDALEQYVTVEPVRTRSG 150 (392) Q Consensus 72 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~-~~~~~~l~~l~~~~~~~~~~~ 150 (392) ...........+++.+... +.+.... ..+ ...++.+.+|.++|+++...++.. +...++++.+++++++++ .+ T Consensus 81 ~~~~~~~~~~~~~r~g~~~-~~~~~~~-~~~---~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~-~~ 154 (392) T protein:vir:13 81 AQRSADHDDDAVLRAGNLG-EARSFEF-APE---KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSD-AN 154 (392) T ss_pred hhhhhhHHHHHHHhccchh-hhHHHHh-hhh---hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCC-Cc Confidence 1111112223334433221 1111111 111 122344445556666666676655 555667788888876644 34 Q ss_pred eeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 151 SRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG 230 (392) Q Consensus 151 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~ 230 (392) .+.++...+.+.++|++|++++|++ .++|+++++++++++++++||+|+|+|+.++|++||.++|+++++.++|.++++ T Consensus 155 ~~~~~~~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~ 233 (392) T protein:vir:13 155 PMDFTVITGRATAGIVGETAEIPES-YPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLT 233 (392) T ss_pred eeEEEEEcCCcceeeeccccccccc-ccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 5777888888999999999999986 699999999999999999999999999999999999999999999999999999 Q ss_pred cccccccc------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccC Q lcl|Aclame:pro 231 VIEKLTKQ------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQ 292 (392) Q Consensus 231 ~~~~~~~~------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~ 292 (392) |.|+..+. +...|+++++++ ..++..|+.+++|||||++|.+|+++||++|+|||+|+++. T Consensus 234 G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~-~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~ 312 (392) T protein:vir:13 234 GTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLF-HEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTV 312 (392) T ss_pred ccCCccccccccccccccccccccccccccHHHHHHHH-HhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCC Confidence 87764332 234578888865 47788899999999999999999999999999999999999 Q ss_pred CcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEec Q lcl|Aclame:pro 293 KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD 372 (392) Q Consensus 293 ~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~ 372 (392) +.+.+++|. ||++.+ .+| ...++||||++ |.+++++++++..+.+ .+|.+|++.||++.|+|+++++ T Consensus 313 g~~~~l~G~-Pv~~~~--~~~-------~~~i~~Gdf~~-~~i~~~~~~~i~~~~~--~~~~~~~~~~r~~~r~d~~~~~ 379 (392) T protein:vir:13 313 GAPDTFNGK-VVETDD--GMP-------ADKVLFADLSK-YRVRFAGSLRVDRSVD--AKFSTDQIVYRFLQRADGLLVD 379 (392) T ss_pred CCCceecce-eeEEcC--CCC-------CCcEEEeeccc-eeEEeecceEEEeecc--ccccCCcEEEEEEEEeccEEec Confidence 999999984 665533 233 34689999987 6789999999998775 4799999999999999999999 Q ss_pred ccceEEEEecccC Q lcl|Aclame:pro 373 NEAAVYGEIDLSA 385 (392) Q Consensus 373 ~~af~~l~~~~~a 385 (392) |+||+.++++++| T Consensus 380 ~~A~~~~~~~~aa 392 (392) T protein:vir:13 380 ARGAKVLTVTPAA 392 (392) T ss_pred ccceEEEEeeccC Confidence 9999999999888 No 33 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=9.2e-62 Score=355.20 Aligned_cols=369 Identities=23% Similarity=0.367 Sum_probs=260.3 Q ss_pred CCHH-------HHHHHHHHHHHH----------HHHHHHhhhhh-HH-------HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSKE-------LRELLAKLEGKK----------EEVRSLMGEDK-VA-------EAEQMMEEVRSLQKKIDLQRSLDEAE 55 (392) Q Consensus 1 M~ke-------l~el~~~~~~~~----------~e~~~~~~~~~-~~-------~~~~~~~ei~~l~~~i~~~~~~~~~~ 55 (392) |.++ +++++++++++. +++++.+++.+ .+ +++.+.++++.++++++..++..+.. T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3322 222222222222 22222222211 12 23334444444444444333322222 Q ss_pred HHHhhccccccccccchhhHHHHHHHHHHhcchhhHHHHHHH---HhhhhhhhhccccccccceecchhhhhHHHHhHHh Q lcl|Aclame:pro 56 TEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFL---EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS 132 (392) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~---~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~ 132 (392) ......... ...........................+... ............+..+||+++|+++...|++ +++ T Consensus 81 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~ 157 (397) T protein:vir:96 81 EDELAKAAD--PTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKD 157 (397) T ss_pred HHHHHhhhh--hhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhh Confidence 211111111 1111111111111111111111111111111 1112223344556678899999999999987 577 Q ss_pred hhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHH Q lcl|Aclame:pro 133 FDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYV 212 (392) Q Consensus 133 ~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v 212 (392) ...|+++|++.++++.++.++++... +..++|++|+++.++.+.++|++|+++++++++++++|+++|+|+.+++.+|| T Consensus 158 ~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i 236 (397) T protein:vir:96 158 IVDLSKYVRSVPVNSASGKFPVISKS-GSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLI 236 (397) T ss_pred hhhHHHhhhhccccccceeEEEEecc-CCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHH Confidence 88999999999999988888877654 46788999999999877899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccC Q lcl|Aclame:pro 213 TKWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQ 292 (392) Q Consensus 213 ~~~l~~~~~~~~d~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~ 292 (392) .++|+++++.+++.++++|+|..++.+..++|++.+++...++..+ +++|||||++|..|++|||++|+|||+|++.. T Consensus 237 ~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~d~~~~~~~~~~~~~~--~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~ 314 (397) T protein:vir:96 237 ADEIQDQSLNTKNADIAAVLKTATAKSVVGVDGLKDLINKEIKKVY--DVKLFISASMYSELDKLKDKNGRYLLQDSITA 314 (397) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccchHHHHHHHHHhhhhhc--CcEEEEcHHHHHHHHHhhccCCCeEeccCccC Confidence 9999999999999999999999999999999999998876666543 78999999999999999999999999999999 Q ss_pred CcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEec Q lcl|Aclame:pro 293 KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD 372 (392) Q Consensus 293 ~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~ 372 (392) +.+.++||. ||+++++..+ +..++..+++||||+++|.+++|+++++.++++. . +...+|+++|+|+++++ T Consensus 315 ~~~~~l~G~-pv~~~~~~~~---~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~--~---~~~~~~~~~r~d~~~~~ 385 (397) T protein:vir:96 315 ASGKQLLGK-EVVVLDDDVI---GKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN--I---YGQLLAGIIRYDVKATD 385 (397) T ss_pred CCccccccc-ceEEeccccc---CCCCCceEEEEeehhcceEeEeecceEEEEeccc--c---cceeEEEEEEEccEEec Confidence 999999995 6666554332 2346678899999999999999999999998753 3 35678999999999999 Q ss_pred ccceEEEEeccc Q lcl|Aclame:pro 373 NEAAVYGEIDLS 384 (392) Q Consensus 373 ~~af~~l~~~~~ 384 (392) |+||++++++++ T Consensus 386 ~~a~~~~~~~~a 397 (397) T protein:vir:96 386 KKAGFYVTFTIG 397 (397) T ss_pred ccceEEEEeecC Confidence 999999999988 No 34 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=5.6e-61 Score=350.91 Aligned_cols=360 Identities=15% Similarity=0.159 Sum_probs=267.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccccccccccch---h Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER----NNGREVETRNVDG---E 73 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~----~~~~~~~~~~~~~---~ 73 (392) |+| |++|+++++++.++++.+.++.+ ++++++.++.+.++++++.+.+..+...+.. ............. . T Consensus 1 M~~-l~el~~~~~~~~~e~~~l~~~~~-~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:18 1 MSE-LALIQKAIEESQQKMTQLFDAQK-AEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 996 99999999999999999876543 4566666677777666654433222211111 0001111111111 1 Q ss_pred hHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeE Q lcl|Aclame:pro 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) Q Consensus 74 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~ 153 (392) ......+.+..+..... .......+++. .+...+|.++|+++...|++.+++.++|++++++.++++....+ T Consensus 79 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~-~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~- 150 (385) T protein:vir:18 79 ERAAEELIKSWDGKQGT------FGAKTFNKSLG-SDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY- 150 (385) T ss_pred HHHHHHHHHHHHHhhcc------chhhHHHhhhc-cccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE- Confidence 11111122222211111 01112223333 34455677788899999999999999999999999987665444 Q ss_pred EEeec-CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 154 LEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) Q Consensus 154 ~~~~~-~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~ 232 (392) ++.. ....+.|++|++.++++ +++|+++++++++++++++||+++|+|+ .++++||.++|+++++.++|.++++|. T Consensus 151 -~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~ 227 (385) T protein:vir:18 151 -VREEVFTNNADVVAEKALKPES-DITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGD 227 (385) T ss_pred -EEEecCCcceeeeccCcccccc-ccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 4443 45678999999998875 6999999999999999999999999987 579999999999999999999999987 Q ss_pred cccccc-----------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 233 EKLTKQ-----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 233 ~~~~~~-----------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) |+..+. +...++++++++ ..+...+..+++|+|||++|.+|+++||++|+|||.+ ...+.+ T Consensus 228 g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~-~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~-~~~~~~ 305 (385) T protein:vir:18 228 GTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAI-YQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGG-PQAFTS 305 (385) T ss_pred CCCCcccccccccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccC-cccCCC Confidence 664431 234577788876 4678888899999999999999999999999999975 456778 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||++++ .+| .+.++||||+++|.++++.+++++++++...+|++|++.||+++|+|+++.+|+| T Consensus 306 ~~l~G~-pV~~~~--~~p-------~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a 375 (385) T protein:vir:18 306 NIMWGL-PVVPTK--AQA-------AGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTA 375 (385) T ss_pred ceecce-eeEEcC--cCC-------CCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc Confidence 889985 665532 333 3358999999989999999999999998888899999999999999999999999 Q ss_pred eEEEEecccC Q lcl|Aclame:pro 376 AVYGEIDLSA 385 (392) Q Consensus 376 f~~l~~~~~a 385 (392) |+++++++++ T Consensus 376 ~~~~~~~aa~ 385 (385) T protein:vir:18 376 IIKGTFSSGS 385 (385) T ss_pred eEEEEeccCC Confidence 9999999888 No 35 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=5.6e-61 Score=350.91 Aligned_cols=360 Identities=15% Similarity=0.159 Sum_probs=267.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hccccccccccch---h Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER----NNGREVETRNVDG---E 73 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~----~~~~~~~~~~~~~---~ 73 (392) |+| |++|+++++++.++++.+.++.+ ++++++.++.+.++++++.+.+..+...+.. ............. . T Consensus 1 M~~-l~el~~~~~~~~~e~~~l~~~~~-~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:19 1 MSE-LALIQKAIEESQQKMTQLFDAQK-AEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 996 99999999999999999876543 4566666677777666654433222211111 0001111111111 1 Q ss_pred hHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeE Q lcl|Aclame:pro 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) Q Consensus 74 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~ 153 (392) ......+.+..+..... .......+++. .+...+|.++|+++...|++.+++.++|++++++.++++....+ T Consensus 79 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~-~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~- 150 (385) T protein:vir:19 79 ERAAEELIKSWDGKQGT------FGAKTFNKSLG-SDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY- 150 (385) T ss_pred HHHHHHHHHHHHHhhcc------chhhHHHhhhc-cccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE- Confidence 11111122222211111 01112223333 34455677788899999999999999999999999987665444 Q ss_pred EEeec-CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 154 LEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI 232 (392) Q Consensus 154 ~~~~~-~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~ 232 (392) ++.. ....+.|++|++.++++ +++|+++++++++++++++||+++|+|+ .++++||.++|+++++.++|.++++|. T Consensus 151 -~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~ 227 (385) T protein:vir:19 151 -VREEVFTNNADVVAEKALKPES-DITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGD 227 (385) T ss_pred -EEEecCCcceeeeccCcccccc-ccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 4443 45678999999998875 6999999999999999999999999987 579999999999999999999999987 Q ss_pred cccccc-----------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc Q lcl|Aclame:pro 233 EKLTKQ-----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK 295 (392) Q Consensus 233 ~~~~~~-----------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~ 295 (392) |+..+. +...++++++++ ..+...+..+++|+|||++|.+|+++||++|+|||.+ ...+.+ T Consensus 228 g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~-~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~-~~~~~~ 305 (385) T protein:vir:19 228 GTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAI-YQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGG-PQAFTS 305 (385) T ss_pred CCCCcccccccccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccC-cccCCC Confidence 664431 234577788876 4678888899999999999999999999999999975 456778 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) .+++|. ||++++ .+| .+.++||||+++|.++++.+++++++++...+|++|++.||+++|+|+++.+|+| T Consensus 306 ~~l~G~-pV~~~~--~~p-------~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a 375 (385) T protein:vir:19 306 NIMWGL-PVVPTK--AQA-------AGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTA 375 (385) T ss_pred ceecce-eeEEcC--cCC-------CCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc Confidence 889985 665532 333 3358999999989999999999999998888899999999999999999999999 Q ss_pred eEEEEecccC Q lcl|Aclame:pro 376 AVYGEIDLSA 385 (392) Q Consensus 376 f~~l~~~~~a 385 (392) |+++++++++ T Consensus 376 ~~~~~~~aa~ 385 (385) T protein:vir:19 376 IIKGTFSSGS 385 (385) T ss_pred eEEEEeccCC Confidence 9999999888 No 36 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.5e-60 Score=348.63 Aligned_cols=371 Identities=17% Similarity=0.239 Sum_probs=265.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhh---------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---- Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMG---------EDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVET---- 67 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~---------~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~---- 67 (392) |++ |++|++++.++.++++++.+ +++.++++++++|+++|+.+|+++++..+.............. T Consensus 1 M~k-l~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~ 79 (428) T protein:vir:10 1 MPQ-IEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAV 79 (428) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhcccc Confidence 996 88888888888888776543 3566788899999999999999877665554433221111110 Q ss_pred ------cccchhhHHHHHHHHHHhcchhhHHHHHHH---HhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhh Q lcl|Aclame:pro 68 ------RNVDGEMEYRDVFMKALRNKPLNAEEREFL---EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ 138 (392) Q Consensus 68 ------~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~---~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~ 138 (392) .........+.+.......+.......... ......++.. .+.+.||++||+++.++|++.+++.++|++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~gg~liP~~~~~~ii~~l~~~~~l~~ 158 (428) T protein:vir:10 80 IVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAIS-TAAGSGGVLIPQNIHSEVIELLRDRTIVRK 158 (428) T ss_pred ccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhc-ccccCCccccchhHHHHHHHHHhhhchhhh Confidence 001011111111111111111111111111 1111222222 334578999999999999999999999999 Q ss_pred h-cceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 139 Y-VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLG 217 (392) Q Consensus 139 l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~ 217 (392) + +++ +++.++.+.+|+.++++.++|++|++..+++ +++|++|++.+++++++++||+|+|+|+.++|++||.++|+ T Consensus 159 ~~~~~--~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~ 235 (428) T protein:vir:10 159 LGARS--IPLPNGNMSLPRLAGGATASYTGENQDAKVS-EARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDIL 235 (428) T ss_pred hccee--eecCCcceEEEEEeCCcceeeeccCcccccc-ccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHH Confidence 8 444 4445566778888888999999999999975 69999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcccccccc-------------------chhhHHHH---HHHH--HHHhhhcccCCceEEEcHHHHHH Q lcl|Aclame:pro 218 KKSKVTRNVLILGVIEKLTKQ-------------------AIKSLDDI---KDVL--NVKLDPAISPNAILLTNQDGFNY 273 (392) Q Consensus 218 ~~~~~~~d~~~~~~~~~~~~~-------------------~~~~~d~~---~~~~--~~~~~~~~~~~a~~v~~~~~~~~ 273 (392) ++++.++|.++++|.|++..+ ...+++.+ ++.+ .......+..+++|+|||.+|.+ T Consensus 236 ~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~ 315 (428) T protein:vir:10 236 TAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMK 315 (428) T ss_pred HHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHH Confidence 999999999999998764211 11122222 2222 12234455667899999999999 Q ss_pred HHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch--- Q lcl|Aclame:pro 274 LDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--- 350 (392) Q Consensus 274 L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~--- 350 (392) |+++||++|+|||++. .+.+++|. ||+++ +..+.+.+...+..+++||||+. |.+++++++++.++++.. T Consensus 316 L~~lkd~~G~~i~~~~----~~g~l~G~-pv~~~-~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~i~i~~~~~~~~~~ 388 (428) T protein:vir:10 316 LFGLRDGNGNKVYPEM----AQGMLKGY-PIQRT-SAIPANLGEGGKESEIYFADFND-VVIGEDGNMKVDFSKEASYID 388 (428) T ss_pred HHHhhccCCceeccCC----CCCeeece-eeEEe-ccccccccCCCccceEEEEecce-EEEEEecceEEEeeccccccc Confidence 9999999999999653 23468885 56553 33333344556678899999996 568899999999998743 Q ss_pred ------hhhhcCceeEEEEEeeCcEEecccceEEEE-ecc Q lcl|Aclame:pro 351 ------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDL 383 (392) Q Consensus 351 ------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~ 383 (392) .+|++|++.||+++|+|+++.+|+||++++ +++ T Consensus 389 ~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 389 TDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 579999999999999999999999999997 666 No 37 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=3.1e-61 Score=352.31 Aligned_cols=287 Identities=41% Similarity=0.612 Sum_probs=255.1 Q ss_pred hhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeec-CCccccccccccccccccccce Q lcl|Aclame:pro 102 EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKF 180 (392) Q Consensus 102 ~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~E~~~~~~~~~~~~ 180 (392) .-+++..++.++||++||+++.+.|++.+++.++|+++++++++++..+.++++... ....++|++|++++++++.++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 556677788889999999999999999999999999999999999999999888765 4567899999999998777999 Q ss_pred eeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-chhhHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 181 SNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAIS 259 (392) Q Consensus 181 ~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~ 259 (392) ++++++++|+++++++|+|+++|+.++|++||.++|+++++.++|++++.++++.++. +..++|++.+++ ..++.+++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~d~i~~~~-~~l~~~~~ 159 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKPTLTKWDDIIDLE-AKVDPAIK 159 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccccccCHHHHHHHH-Hhhhhhhc Confidence 9999999999999999999999999999999999999999999999999999886554 667899999966 46788899 Q ss_pred CCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeecc Q lcl|Aclame:pro 260 PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE 339 (392) Q Consensus 260 ~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~ 339 (392) .+++|+|||++|..|+++||++|||||++++.++.+.+++|. ||+++++..+|+. ..+..+++||||+++|.+++|+ T Consensus 160 ~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~-Pv~~~~~~~~~~~--~~~~~~~~~gd~~~~~~~~~~~ 236 (293) T protein:vir:48 160 QTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGF-AVKEISDRWLPNA--SSGVMPLYFGDLKQAVTLFDRQ 236 (293) T ss_pred CCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecce-eeEEecccccCCc--cCCceEEEEEeccceEEEEEec Confidence 999999999999999999999999999999999999999985 6777777666653 3567789999999999999999 Q ss_pred ceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 340 ~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +++++.+++.+++|++|++.||+++|+|+++.+|+||+++++++++......| T Consensus 237 ~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~ 289 (293) T protein:vir:48 237 QMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIG 289 (293) T ss_pred ceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCcccc Confidence 99999999888899999999999999999999999999999886543222222 No 38 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.2e-59 Score=343.62 Aligned_cols=369 Identities=18% Similarity=0.169 Sum_probs=259.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhh--HHHHHH----HHHHHHHHHHHHHHHHHHHHH-HHHHhhcccccccccc--- Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDK--VAEAEQ----MMEEVRSLQKKIDLQRSLDEA-ETEERNNGREVETRNV--- 70 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~--~~~~~~----~~~ei~~l~~~i~~~~~~~~~-~~~~~~~~~~~~~~~~--- 70 (392) ..+++++++++++++.++++++.++.. .++... ..+++.++..+++.++...+. +............... T Consensus 20 l~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~ 99 (418) T protein:vir:10 20 PEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELETPKTL 99 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Confidence 334577777777777788777655421 222222 222333344444333322211 1111111111111110 Q ss_pred ---chhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 71 ---DGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 71 ---~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) .......+.+....+.......+. .......+....+...+|++||+++...|++.+++.++|++++++.++++ T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 176 (418) T protein:vir:10 100 GQLVTESEEMKGMDGSARKSVRVRVDR---KSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSS 176 (418) T ss_pred hHHhhhHHHHHHHHHHHhhhhhhhhHH---HHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccC Confidence 011122223333333222111111 11112223344556678899999999999999999999999999999887 Q ss_pred CcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVL 227 (392) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~ 227 (392) ++..++... ..++.+.|++|+++++++ +++|++|++.+++++++++||+++|+|+. +|++||.++|+++++.++|.+ T Consensus 177 ~~~~~~~~~-~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d~a 253 (418) T protein:vir:10 177 SSIEYTVET-GFTNNAAAVAEGAQKPTS-DLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEEGQ 253 (418) T ss_pred CceeEEEEe-cCCCceeeeccCcccccc-ccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHH Confidence 655554432 335778999999999875 69999999999999999999999999874 899999999999999999999 Q ss_pred Hhhccccccc-----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccc Q lcl|Aclame:pro 228 ILGVIEKLTK-----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP 290 (392) Q Consensus 228 ~~~~~~~~~~-----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~ 290 (392) +++|.|+... .+...++++++++. .+...+..+++|+|||.+|..|++++|++|+|||. +. T Consensus 254 ~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~-~~ 331 (418) T protein:vir:10 254 ILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALL-QAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVG-NP 331 (418) T ss_pred HhccCCCCccccccccccccccccccccccccHHHHHHHHH-hhccccCCCCEEEEcHHHHHHHHHhhcCCCceecc-cc Confidence 9998876431 12245778888765 56777788889999999999999999999999995 56 Q ss_pred cCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEE Q lcl|Aclame:pro 291 TQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQM 370 (392) Q Consensus 291 ~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v 370 (392) ..+.+.+++|. ||++.+ .+| .+.++||||+++|.++++.++++.++++.+.+|++|++.||+++|+|+++ T Consensus 332 ~~~~~~~l~G~-pV~~~~--~~p-------~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~ 401 (418) T protein:vir:10 332 VNGTTPRLWNL-PVVETQ--AMT-------ANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAV 401 (418) T ss_pred ccCCCceecce-eeEEcC--CCC-------CCcEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEE Confidence 66777888885 665532 333 23579999999889999999999999998888999999999999999999 Q ss_pred ecccceEEEEecccCCC Q lcl|Aclame:pro 371 WDNEAAVYGEIDLSAPV 387 (392) Q Consensus 371 ~~~~af~~l~~~~~a~~ 387 (392) ++|+||+++++++++.- T Consensus 402 ~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 402 YRPESFVTGALVEQAGG 418 (418) T ss_pred ecccceEEEEeccCCCC Confidence 99999999998755443 No 39 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.6e-59 Score=342.89 Aligned_cols=371 Identities=16% Similarity=0.256 Sum_probs=261.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHh---------hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLM---------GEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGR-------- 63 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~---------~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~-------- 63 (392) |+ |++|++++.++.+++++++ ++++.++++++++|+++|+.+|++++++.+.......... T Consensus 1 M~--i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~ 78 (435) T protein:vir:14 1 MN--VNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAA 78 (435) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhh Confidence 99 4566666666666665543 3456678999999999999999988766554433211000 Q ss_pred --cccccccchhhHH-HHHHHHHHhc---ch--hhHHHHHHH---HhhhhhhhhccccccccceecchhhhhHHHHhHHh Q lcl|Aclame:pro 64 --EVETRNVDGEMEY-RDVFMKALRN---KP--LNAEEREFL---EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS 132 (392) Q Consensus 64 --~~~~~~~~~~~~~-~~a~~~~~~~---~~--~~~~~~~~~---~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~ 132 (392) ...........+. ...+..+.+. .. .....+... ......++++.++...||++||+++.+.|++.+++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~ 158 (435) T protein:vir:14 79 PAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP 158 (435) T ss_pred ccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhh Confidence 0000111111111 1122222221 10 111111111 11223344566677789999999999999999999 Q ss_pred hhhhhhh-cceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH--HHHH Q lcl|Aclame:pro 133 FDALEQY-VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD--QNIL 209 (392) Q Consensus 133 ~~~l~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~ 209 (392) .++|+++ ++.+++ .++.+.+|+.++++.++|++|++.++++ +++|++|++.+++++++++||+|+|+|+. ++|+ T Consensus 159 ~~~i~~~~~~~~~~--~~~~~~~p~~~~~~~a~~v~E~~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~ 235 (435) T protein:vir:14 159 KSVVRKLGARTLPL--SNGNITIPRLKGGAIVGYIGADTDIPTT-QQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVD 235 (435) T ss_pred hchhhhhcceeeec--CCCceEEEEEeCCcceeeeccCcccccc-ccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHH Confidence 9999987 555444 4556777888888999999999999875 69999999999999999999999999985 4699 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccc-ccc---------------hhhH----HHHHHHHHHHhhh--cccCCceEEEc Q lcl|Aclame:pro 210 KYVTKWLGKKSKVTRNVLILGVIEKLT-KQA---------------IKSL----DDIKDVLNVKLDP--AISPNAILLTN 267 (392) Q Consensus 210 ~~v~~~l~~~~~~~~d~~~~~~~~~~~-~~~---------------~~~~----d~~~~~~~~~~~~--~~~~~a~~v~~ 267 (392) +||.++|++++++++|.++++|.|++. +.+ ..++ .++.+++. .+.. .+..+++|+|| T Consensus 236 ~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~v~n 314 (435) T protein:vir:14 236 QIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVIL-ALENADANLTQPGWIMA 314 (435) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHH-HhhhccccccCCEEEEc Confidence 999999999999999999999987642 111 1122 23333332 2232 24567899999 Q ss_pred HHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccc-ccccCCcceEEEEehhhceeeeeccceEEEEe Q lcl|Aclame:pro 268 QDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS-KGTTAKKAPLIIGDLKEAIVLFKREDMELAST 346 (392) Q Consensus 268 ~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~-~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~ 346 (392) |.+|..|+++||++|+|||. .. .+.+++|. ||++++ .+|. .+...+...++||||+++ .+++|+++++.++ T Consensus 315 ~~~~~~L~~lkd~~G~~l~~-~~---~~g~l~G~-Pv~~~~--~~p~~~~~~~~~~~i~~gd~s~~-~i~~~~~~~~~~~ 386 (435) T protein:vir:14 315 PRTFRFLEGLRDGNGNKVYP-EL---ANGMLKGY-PVGKTT--QVPINLGETGKESEIYFTDFGDV-FIGEEETLEIDYS 386 (435) T ss_pred HHHHHHHHHhhccCCceecc-CC---CCCeeecc-eeEeec--cccccccCCCccceEEEeecccE-EEEEecccEEEEe Confidence 99999999999999999994 22 23478884 665533 3444 444556678999999984 5899999999999 Q ss_pred ccch---------hhhhcCceeEEEEEeeCcEEecccceEEEE-ecccC Q lcl|Aclame:pro 347 DVGG---------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSA 385 (392) Q Consensus 347 ~~~~---------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a 385 (392) ++.. .+|++|++.||+++|+|+++++|+||++|+ +++.+ T Consensus 387 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 387 KEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred ccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 8754 569999999999999999999999999998 33333 No 40 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1.1e-59 Score=343.78 Aligned_cols=373 Identities=16% Similarity=0.129 Sum_probs=236.0 Q ss_pred CCH-------HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHH-------------HHHHHHHHHHHHHH-HHHHHh Q lcl|Aclame:pro 1 MSK-------ELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVR-------------SLQKKIDLQRSLDE-AETEER 59 (392) Q Consensus 1 M~k-------el~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~-------------~l~~~i~~~~~~~~-~~~~~~ 59 (392) |.+ ++++++++..++.+|+++++++. .++++++.++++ .+.++++.++.... .+.... T Consensus 7 l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~ 85 (497) T protein:vir:78 7 LEAQGRQLAKSIKDINADETKTAAEKKEALAKI-EPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNL 85 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 333 33333333333444444433221 111122211111 11111111110000 000000 Q ss_pred hccccccccccchhhHHHH--HHHHH------Hhc-----------chhhHHHHHHHHhhhhhhhhccccccccceecch Q lcl|Aclame:pro 60 NNGREVETRNVDGEMEYRD--VFMKA------LRN-----------KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ 120 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~--a~~~~------~~~-----------~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~ 120 (392) ...............+... .+... ... ...............+.+.+..++.+.||++||+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~ 165 (497) T protein:vir:78 86 KQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILP 165 (497) T ss_pred hhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccch Confidence 0000000000000000000 00000 000 0000001111122223445556677789999999 Q ss_pred hhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHH Q lcl|Aclame:pro 121 DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRS 199 (392) Q Consensus 121 ~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e 199 (392) ++...|++.+++.++|++++++++++++. +.+|+..+ .+.++|++|++.++++ +++|++|++.+++++++++||+| T Consensus 166 ~~~~~ii~~~~~~~~i~~l~~~~~~~~~~--~~~~~~~~~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~a~~~~iS~e 242 (497) T protein:vir:78 166 TFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTITDE 242 (497) T ss_pred hhhHHHHHHHHhhhhHHhhccccccCCCc--eEEEEEcCCCCcceeeccCcccccc-cccceeeEeeeeeeEeecHhHHH Confidence 99999999999999999999998887754 45555544 4678999999999975 69999999999999999999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh-------------------------------------- Q lcl|Aclame:pro 200 LLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIK-------------------------------------- 241 (392) Q Consensus 200 ~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~~-------------------------------------- 241 (392) +|+|+. ++++||.++|+++++.++|.+|++|.|+..+.+.. T Consensus 243 ll~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (497) T protein:vir:78 243 GLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVG 321 (497) T ss_pred HHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhh Confidence 999975 69999999999999999999999998764322110 Q ss_pred --------------------------------hHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccc Q lcl|Aclame:pro 242 --------------------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) Q Consensus 242 --------------------------------~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~ 289 (392) ..+++..++.......+.+..+|+|||.+|..|+++||++|+|||++. T Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~ 401 (497) T protein:vir:78 322 QDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) T ss_pred hhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCc Confidence 001111222222234455666899999999999999999999999875 Q ss_pred ccC------CcccceecccceEEecCcccccccccCCcceEEEEehhhc-eeeeeccceEEEEeccchhhhhcCceeEEE Q lcl|Aclame:pro 290 PTQ------KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEA-IVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) Q Consensus 290 ~~~------~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~-~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~ 362 (392) ... +.+.++|| +||+++. .+| .+.++||||+.+ |.+++|.+++|.++++...+|++|++.||+ T Consensus 402 ~~~~~~~~~~~~~~l~G-~pV~~t~--~~~-------~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~ 471 (497) T protein:vir:78 402 FGNAYGNPVNGGKNIWG-VPVVTTP--LIP-------LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) T ss_pred ccccccccccCCceeec-eeeEecC--CCC-------CCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEE Confidence 432 23457888 5665532 233 234789999984 668899999999999988899999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~ 388 (392) +.|+|+.|++|+||++|++++++.+. T Consensus 472 ~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred EEeecceeeccccEEEEEecCCccCC Confidence 99999999999999999998877666 No 41 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1.1e-59 Score=343.78 Aligned_cols=373 Identities=16% Similarity=0.129 Sum_probs=236.0 Q ss_pred CCH-------HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHH-------------HHHHHHHHHHHHHH-HHHHHh Q lcl|Aclame:pro 1 MSK-------ELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVR-------------SLQKKIDLQRSLDE-AETEER 59 (392) Q Consensus 1 M~k-------el~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~-------------~l~~~i~~~~~~~~-~~~~~~ 59 (392) |.+ ++++++++..++.+|+++++++. .++++++.++++ .+.++++.++.... .+.... T Consensus 7 l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~ 85 (497) T protein:vir:10 7 LEAQGRQLAKSIKDINADETKTAAEKKEALAKI-EPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNL 85 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 333 33333333333444444433221 111122211111 11111111110000 000000 Q ss_pred hccccccccccchhhHHHH--HHHHH------Hhc-----------chhhHHHHHHHHhhhhhhhhccccccccceecch Q lcl|Aclame:pro 60 NNGREVETRNVDGEMEYRD--VFMKA------LRN-----------KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ 120 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~--a~~~~------~~~-----------~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~ 120 (392) ...............+... .+... ... ...............+.+.+..++.+.||++||+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~ 165 (497) T protein:vir:10 86 KQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILP 165 (497) T ss_pred hhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccch Confidence 0000000000000000000 00000 000 0000001111122223445556677789999999 Q ss_pred hhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHH Q lcl|Aclame:pro 121 DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRS 199 (392) Q Consensus 121 ~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e 199 (392) ++...|++.+++.++|++++++++++++. +.+|+..+ .+.++|++|++.++++ +++|++|++.+++++++++||+| T Consensus 166 ~~~~~ii~~~~~~~~i~~l~~~~~~~~~~--~~~~~~~~~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~a~~~~iS~e 242 (497) T protein:vir:10 166 TFLPGIVEQLFYELSLADLISSRPVTSPN--LSYLTESAAHNNAAAVAEAGTYPFS-SEEFARVYEQVGKVANALTITDE 242 (497) T ss_pred hhhHHHHHHHHhhhhHHhhccccccCCCc--eEEEEEcCCCCcceeeccCcccccc-cccceeeEeeeeeeEeecHhHHH Confidence 99999999999999999999998887754 45555544 4678999999999975 69999999999999999999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh-------------------------------------- Q lcl|Aclame:pro 200 LLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIK-------------------------------------- 241 (392) Q Consensus 200 ~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~~-------------------------------------- 241 (392) +|+|+. ++++||.++|+++++.++|.+|++|.|+..+.+.. T Consensus 243 ll~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (497) T protein:vir:10 243 GLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVG 321 (497) T ss_pred HHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhh Confidence 999975 69999999999999999999999998764322110 Q ss_pred --------------------------------hHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccc Q lcl|Aclame:pro 242 --------------------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) Q Consensus 242 --------------------------------~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~ 289 (392) ..+++..++.......+.+..+|+|||.+|..|+++||++|+|||++. T Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~ 401 (497) T protein:vir:10 322 QDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNF 401 (497) T ss_pred hhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCc Confidence 001111222222234455666899999999999999999999999875 Q ss_pred ccC------CcccceecccceEEecCcccccccccCCcceEEEEehhhc-eeeeeccceEEEEeccchhhhhcCceeEEE Q lcl|Aclame:pro 290 PTQ------KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEA-IVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) Q Consensus 290 ~~~------~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~-~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~ 362 (392) ... +.+.++|| +||+++. .+| .+.++||||+.+ |.+++|.+++|.++++...+|++|++.||+ T Consensus 402 ~~~~~~~~~~~~~~l~G-~pV~~t~--~~~-------~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~ 471 (497) T protein:vir:10 402 FGNAYGNPVNGGKNIWG-VPVVTTP--LIP-------LGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRA 471 (497) T ss_pred ccccccccccCCceeec-eeeEecC--CCC-------CCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEE Confidence 432 23457888 5665532 233 234789999984 668899999999999988899999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~ 388 (392) +.|+|+.|++|+||++|++++++.+. T Consensus 472 ~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 472 EERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred EEeecceeeccccEEEEEecCCccCC Confidence 99999999999999999998877666 No 42 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=2.9e-59 Score=341.48 Aligned_cols=372 Identities=15% Similarity=0.230 Sum_probs=262.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHh---------hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------c Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLM---------GEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGR-------E 64 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~---------~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~-------~ 64 (392) |+ |+||++++.++.+++++++ ++++.++++++++|+++|+.+|+++++..+...+...... . T Consensus 1 M~--l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~ 78 (435) T protein:vir:80 1 MN--VNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTA 78 (435) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhcc Confidence 99 4566666666666665543 3456778899999999999999988765554332211000 0 Q ss_pred cccc---ccchhhHHHH-HHHHHHhcc---h--hh-HHHHHHH--HhhhhhhhhccccccccceecchhhhhHHHHhHHh Q lcl|Aclame:pro 65 VETR---NVDGEMEYRD-VFMKALRNK---P--LN-AEEREFL--EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS 132 (392) Q Consensus 65 ~~~~---~~~~~~~~~~-a~~~~~~~~---~--~~-~~~~~~~--~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~ 132 (392) .... ......+.+. .+.++.+.. . .. ....... .......+.+.++...||++||+++.+.|++.+++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~ 158 (435) T protein:vir:80 79 SAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP 158 (435) T ss_pred ccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhh Confidence 0000 0111111111 122222211 0 01 1111111 11122234556677789999999999999999999 Q ss_pred hhhhhhh-cceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH--HHHH Q lcl|Aclame:pro 133 FDALEQY-VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD--QNIL 209 (392) Q Consensus 133 ~~~l~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~ 209 (392) .++|+++ ++++++ .++.+.+|+..+++.+.|++|++.++++ +++|++|++.+++++++++||+|+|+|+. ++|+ T Consensus 159 ~~~i~~~~~~~v~~--~~~~~~~p~~~~~~~a~~v~E~~~~~~~-~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~ 235 (435) T protein:vir:80 159 KSVVRKLGARTLPL--SNGNITIPRLKGGAIVGYIGADTDIPTT-QQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVD 235 (435) T ss_pred hchhhhccceeeec--CCCceEEEEEeCCcceeeeccCcccccc-ccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHH Confidence 9999998 555444 4556777888888999999999999975 59999999999999999999999999985 4799 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccccccc-cc---------------hhh----HHHHHHHHHHHh-hhcccCCceEEEcH Q lcl|Aclame:pro 210 KYVTKWLGKKSKVTRNVLILGVIEKLTK-QA---------------IKS----LDDIKDVLNVKL-DPAISPNAILLTNQ 268 (392) Q Consensus 210 ~~v~~~l~~~~~~~~d~~~~~~~~~~~~-~~---------------~~~----~d~~~~~~~~~~-~~~~~~~a~~v~~~ 268 (392) +||.++|+++++.++|.++++|.|+... .+ ..+ +.++.+++.... ...++.+++|+||| T Consensus 236 ~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~ 315 (435) T protein:vir:80 236 QIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAP 315 (435) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcH Confidence 9999999999999999999999876421 11 111 223444332221 12355678999999 Q ss_pred HHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccc-ccccCCcceEEEEehhhceeeeeccceEEEEec Q lcl|Aclame:pro 269 DGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS-KGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) Q Consensus 269 ~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~-~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~ 347 (392) .+|.+|+++||++|+|+|.. . .+.+++|. ||++++ .+|. .+...+...++||||++ |.+++|+++++++++ T Consensus 316 ~~~~~L~~lkd~~G~~l~~~-~---~~~~l~G~-pv~~~~--~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~i~~~~ 387 (435) T protein:vir:80 316 RTFRFLEGLRDGNGNKVYPE-L---ANGMLKGY-PVGKTT--QVPINLGEAGKESEIYFTDFGD-VFIGEEETLEIDYSK 387 (435) T ss_pred HHHHHHHhhhccCCceeccC-C---CCCeEeee-eeEEec--cccccccCCCCcceEEEEEccc-EEEEeecceEEEEec Confidence 99999999999999999943 2 23478885 665532 3443 44555677899999998 458899999999999 Q ss_pred cch---------hhhhcCceeEEEEEeeCcEEecccceEEEE-ecccC Q lcl|Aclame:pro 348 VGG---------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSA 385 (392) Q Consensus 348 ~~~---------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a 385 (392) +.. .+|++|++.||++.|+|+++++|+||++|+ +++.+ T Consensus 388 ~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 388 EATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred cccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 864 569999999999999999999999999998 55555 No 43 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=7.3e-59 Score=339.31 Aligned_cols=361 Identities=15% Similarity=0.164 Sum_probs=266.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhh---------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGE---------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVD 71 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~---------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) |++.++++++++.++.++++++++. +..+.++++.+|++.++++|++.++..................... T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchh Confidence 9987777888888888888876542 3445678888899999988887665444333222111111111100 Q ss_pred --hhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 72 --GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 72 --~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) ...+....+.......... .. ....... .....++...+|+++|+++...|++.+++.++|++++++.+++++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 156 (390) T protein:vir:81 81 MFVASEQFQASAGRWNDRSAR--AT-MNIKAAL-NTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSAL 156 (390) T ss_pred hhhhhHHHHHHHHHHhhhhhh--hh-hHHHHHH-HhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCc Confidence 0111111111111111100 00 0111111 2233445567788889999999999999999999999998887654 Q ss_pred ceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 150 GSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLI 228 (392) Q Consensus 150 ~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~ 228 (392) +.+++..+ ...+.|++|+++++++ +++|+++++++++++++++||+++|+|+. ++++||.++|++++++++|.++ T Consensus 157 --~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~~~~d~a~ 232 (390) T protein:vir:81 157 --IEYVQETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEI 232 (390) T ss_pred --eEEEEEecCCcceeeecCCcccccc-cceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 44455444 3578999999999876 58999999999999999999999999984 7999999999999999999999 Q ss_pred hhccccccc-----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc Q lcl|Aclame:pro 229 LGVIEKLTK-----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT 291 (392) Q Consensus 229 ~~~~~~~~~-----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~ 291 (392) ++|.|++.. .+...++++++++ ..+...+..+++|+|||++|..|+++||++|+|||++.. T Consensus 233 l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~- 310 (390) T protein:vir:81 233 LRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAM-LQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNAR- 310 (390) T ss_pred HhcCCCCCcccceeecccccccccccccchhHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc- Confidence 998776431 1234567777766 466788888899999999999999999999999998754 Q ss_pred CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEe Q lcl|Aclame:pro 292 QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) Q Consensus 292 ~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~ 371 (392) .+.+.+++|. ||++++ .+| .+.++||||+++|.+++|.++++.++++. .+|++|++.||+++|+|+++. T Consensus 311 ~~~~~~l~G~-pv~~~~--~~p-------~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~v~~r~~~r~d~~v~ 379 (390) T protein:vir:81 311 GTLTPTLWGL-PVVATQ--AMA-------PGEFLVGAFDLAAQIFDQWDARVEIGYVG-EDFQRNMITVLAEERLALVVY 379 (390) T ss_pred cccCceecce-eeEEcC--CCC-------CCcEEEEehhceEEEEEecceEEEEeccc-chhhcCcEEEEEEEeeccEEe Confidence 4556688885 565432 233 33589999999889999999999998864 579999999999999999999 Q ss_pred cccceEEEEec Q lcl|Aclame:pro 372 DNEAAVYGEID 382 (392) Q Consensus 372 ~~~af~~l~~~ 382 (392) +|+||+++++. T Consensus 380 ~~~a~v~~t~a 390 (390) T protein:vir:81 380 RPEALISGSFA 390 (390) T ss_pred cccceEEEEeC Confidence 99999999988 No 44 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1e-58 Score=338.48 Aligned_cols=361 Identities=14% Similarity=0.171 Sum_probs=267.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhh---------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--c Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMG---------EDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETR--N 69 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~---------~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~--~ 69 (392) |++..++++++++++.++++++.+ ++..++++++.++++.++++++..++..+............... . T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchh Confidence 998777888888888888887644 24455678888888888888887654443332222111111110 0 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) .....+....+......... ..........+....++...+|+++|+++...|++.+++.++|++++++.+++++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~ 156 (390) T protein:vir:97 81 MFVASEQFQASTGRWNDRSA----RATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSAL 156 (390) T ss_pred hhhhhHHHHHHHHHhhhhhh----hhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCc Confidence 00111111222221111111 11111122223344556677889999999999999999999999999999987655 Q ss_pred ceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 150 GSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLI 228 (392) Q Consensus 150 ~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~ 228 (392) .. +++..+ ...+.|++|+++++++ .++|+++++++++++++++||+|+++|+ .++++||.++|++++++++|.++ T Consensus 157 ~~--~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~ 232 (390) T protein:vir:97 157 IE--YVQETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEI 232 (390) T ss_pred eE--EEEEecCCcceeeecCCcccccc-ccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 44 444433 4678999999999875 6899999999999999999999999998 57999999999999999999999 Q ss_pred hhccccccc-----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc Q lcl|Aclame:pro 229 LGVIEKLTK-----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT 291 (392) Q Consensus 229 ~~~~~~~~~-----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~ 291 (392) ++|.|+... .+...++++.+++ ..++..+..+++|+|||++|.+|+++||++|+|||.+.. T Consensus 233 l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~- 310 (390) T protein:vir:97 233 LRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAM-LQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNAR- 310 (390) T ss_pred hhcCCCCccccceeeccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc- Confidence 998765431 1234466777765 467888889999999999999999999999999998754 Q ss_pred CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEe Q lcl|Aclame:pro 292 QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) Q Consensus 292 ~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~ 371 (392) .+.+.+++|. ||++.+ .+| .+.++||||+++|.++++.++++.++++. .+|++|++.||+++|+|+++. T Consensus 311 ~~~~~~l~G~-pV~~~~--~~~-------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~f~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:97 311 GTLTPTLWGL-PVVATQ--AMA-------PGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred CCCCceecce-eeEEcC--CCC-------CCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEe Confidence 4556788885 565532 233 33589999999899999999999998764 479999999999999999999 Q ss_pred cccceEEEEec Q lcl|Aclame:pro 372 DNEAAVYGEID 382 (392) Q Consensus 372 ~~~af~~l~~~ 382 (392) +|+||+++++. T Consensus 380 ~~~a~v~~~~a 390 (390) T protein:vir:97 380 RPEALITGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999988 No 45 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.4e-58 Score=337.68 Aligned_cols=361 Identities=15% Similarity=0.161 Sum_probs=265.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhh---------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGE---------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNV- 70 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~---------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~- 70 (392) |++.+++++++++++.++++++.++ +..++++++.+|++.|++++++.++..+................. T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhh Confidence 9998888999999999999877543 234567788888888888888766544443332211111111111 Q ss_pred -chhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 71 -DGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 71 -~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) ....+...++........... .. .......++.. .+...+|.++|+++...|++.+++.++|+++|++.+++++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~-~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 156 (390) T protein:vir:10 81 LFVASEQFQASAGRWNDRSARA--TM-NIKAALNTAST-DAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSAL 156 (390) T ss_pred hhhhhHHHHHHHHhhhhhhhhh--hh-HHHHHHHhhhc-ccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 111111122222222211111 11 11112223333 33445566677788899999999999999999999887765 Q ss_pred ceeEEEeecC-CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 150 GSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLI 228 (392) Q Consensus 150 ~~~~~~~~~~-~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~ 228 (392) .+++ +..+ ...+.|++|+++++++ +++|++|++++++++++++||+++|+|+. ++.+||.++|+++++.++|.++ T Consensus 157 ~~~~--~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~i 232 (390) T protein:vir:10 157 IEYV--QETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEI 232 (390) T ss_pred eEEE--EEecCCcceeeecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 5554 4443 3578999999999876 58999999999999999999999999975 8999999999999999999999 Q ss_pred hhccccccc-----------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc Q lcl|Aclame:pro 229 LGVIEKLTK-----------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT 291 (392) Q Consensus 229 ~~~~~~~~~-----------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~ 291 (392) ++|.|++.. .+...++++.+++ ..+...+.++++|+|||++|.+|+++||++|+|||++... T Consensus 233 l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~ 311 (390) T protein:vir:10 233 LRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAM-LQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARG 311 (390) T ss_pred hhcCCCCccccccccccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcC Confidence 998775431 1223466777765 5678888999999999999999999999999999998654 Q ss_pred CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEe Q lcl|Aclame:pro 292 QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) Q Consensus 292 ~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~ 371 (392) . .+.+++|. ||++++ .+| .+.++||||+++|.+++|++++++++++. .+|++|++.||++.|+|++++ T Consensus 312 ~-~~~~l~G~-pv~~~~--~~p-------~~~~~~gdf~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:10 312 T-LTPTLWGL-PVVATQ--AMA-------PGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred c-CCceecce-eeEEcC--CCC-------CCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEe Confidence 4 45588885 565532 333 23579999999999999999999998764 579999999999999999999 Q ss_pred cccceEEEEec Q lcl|Aclame:pro 372 DNEAAVYGEID 382 (392) Q Consensus 372 ~~~af~~l~~~ 382 (392) +|+||+++++. T Consensus 380 ~~~a~~~~~~a 390 (390) T protein:vir:10 380 RPEALISGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999988 No 46 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2.2e-58 Score=336.67 Aligned_cols=376 Identities=16% Similarity=0.171 Sum_probs=252.5 Q ss_pred CC-HH-HHHHHHHHHHHHHHHHHHhhhhh--HH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cc--- Q lcl|Aclame:pro 1 MS-KE-LRELLAKLEGKKEEVRSLMGEDK--VA-------EAEQMMEEVRSLQKKIDLQRSLDEAETEERNNG--RE--- 64 (392) Q Consensus 1 M~-ke-l~el~~~~~~~~~e~~~~~~~~~--~~-------~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~--~~--- 64 (392) |+ +| ++++.+++++++.+++++.++++ .+ +++++.++++.++++++++++..+.+.+..... .. T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~ 80 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhc Confidence 88 33 34455556666667766654332 12 344444555555555544443332222211110 00 Q ss_pred ------ccccccchhhHHHHHHHHHHhcchh--------hHHHHHHHH-------hhhhhhhhccccccccceecchhhh Q lcl|Aclame:pro 65 ------VETRNVDGEMEYRDVFMKALRNKPL--------NAEEREFLE-------DDLEQRAMSGLTGEDGGLVIPQDIQ 123 (392) Q Consensus 65 ------~~~~~~~~~~~~~~a~~~~~~~~~~--------~~~~~~~~~-------~~~~~~a~~~~~~~~gg~~iP~~~~ 123 (392) ..........+++..+...+..... ..+.+.... ...+.++.+. ++.+||++||+++. T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~-~t~~GG~lvP~~~~ 159 (434) T protein:vir:62 81 DPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGL-VTGNGSVTIPDFLS 159 (434) T ss_pred chhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcc-cccccceecchhhH Confidence 0001111122334444333322110 111111111 1112334433 34578999999999 Q ss_pred hHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccc---ccccccccccccceeeEEechhheeeehhhHHHH Q lcl|Aclame:pro 124 TQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEI---TEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL 200 (392) Q Consensus 124 ~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~---~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~ 200 (392) +.|++.+++.++|++++++.++++ . +.+|+....+.+.|. +|++..++ ++++|++|++++++++++++||+|+ T Consensus 160 ~~Ii~~l~~~~~i~~~~~~~~~~~-~--~~~p~~~~~~~a~~~~~~~e~~~~~~-~~~~f~~v~~~~~k~~~~~~iS~el 235 (434) T protein:vir:62 160 KEIITYAQEENFLRRLGTGVKTKE-N--IKYPVLVKKAEAQGHKNERTNNEMPE-TDIEFDEIELSPTEFDALATVTKKL 235 (434) T ss_pred HHHHHhhhhhhhhhhhcceeccCC-c--eEEEEEecCCcccceecccccccccc-cccceeeEEeeheeeEeehhhHHHH Confidence 999999999999999999877653 3 344444444455554 44556665 4699999999999999999999999 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------chhhHHHHHHHHHHHhhhcccCCceEEE Q lcl|Aclame:pro 201 LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--------------AIKSLDDIKDVLNVKLDPAISPNAILLT 266 (392) Q Consensus 201 l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~--------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~ 266 (392) |+|+.++|++||.++|+++++.++|.++++|.|+..+. +...+|++++++. .++.+|+.+++||| T Consensus 236 l~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l~~-~l~~~~~~~a~~v~ 314 (434) T protein:vir:62 236 LARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDALVKMKN-TPVKEVRKKARWVL 314 (434) T ss_pred HhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhHHHHHHh-hcchhhhcCCEEEE Confidence 99999999999999999999999999999998865432 2245788888765 77889999999999 Q ss_pred cHHHHHHHHHhhccCCceeecccc--cCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeecc-ceEE Q lcl|Aclame:pro 267 NQDGFNYLDKLKDKDGKYILQSDP--TQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE-DMEL 343 (392) Q Consensus 267 ~~~~~~~L~~lkd~~g~~l~~~~~--~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~-~~~~ 343 (392) ||.+|.+|++|||++|+|||+|.. ..+.+.+++| +||+++++ ++.. ...+...++||||++++ +++|. .+++ T Consensus 315 n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G-~pV~~~~~--~~~~-~~~~~~~i~~Gdfs~~~-i~~~~g~~~i 389 (434) T protein:vir:62 315 NTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLG-FPVEEEDA--IDIP-DSPDTPVFYFGDFSKFY-IQDVIGSLEV 389 (434) T ss_pred cHHHHHHHHHhhccCCCEeeccCCCccCCCCceecc-eeeEEecC--ccCc-cCCCceEEEEeeccceE-EEEeeceeEE Confidence 999999999999999999998743 4466778888 56766543 3322 23345668899999754 77765 5778 Q ss_pred EEeccchhhhhcCceeEEEEEeeCcEEec-ccceEEEEecccCCCCC Q lcl|Aclame:pro 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWD-NEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 344 ~~~~~~~~~f~~~~~~~~~~~r~~~~v~~-~~af~~l~~~~~a~~~~ 389 (392) +.+.+ .+|.+|+|.||++.|+|+++++ |.++.+++++..+|+.. T Consensus 390 ~~~~~--~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 390 QKLVE--LFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred Eeehh--hhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 88765 4778999999999999999886 99998887665555333 No 47 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.4e-58 Score=337.75 Aligned_cols=359 Identities=18% Similarity=0.213 Sum_probs=258.2 Q ss_pred CC---HHHHHHHHHHHHHHHHHHHHhhhh--hHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cc Q lcl|Aclame:pro 1 MS---KELRELLAKLEGKKEEVRSLMGED--KVAEA----EQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVE---TR 68 (392) Q Consensus 1 M~---kel~el~~~~~~~~~e~~~~~~~~--~~~~~----~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~---~~ 68 (392) |+ |+|+|++++++++.++++++.++- +.++. +.+.++++++..+++.++...+............. .. T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEA 80 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccch Confidence 55 578999999999988888775532 22222 22233333344444333322211111111100000 00 Q ss_pred ccc-----hhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhccee Q lcl|Aclame:pro 69 NVD-----GEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) Q Consensus 69 ~~~-----~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~ 143 (392) ... ......+.+...++.... ...+ +....++..++|+++|++++..|++.+++.++|+++|++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~ 150 (395) T protein:vir:43 81 PKTAGQMVAESLKEQGVTSSLRGSHR---------VSMP-RSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPG 150 (395) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhhh---------hhhh-hhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccce Confidence 000 011111222222222111 1112 2233455667888999999999999999999999999999 Q ss_pred eccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 PVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVT 223 (392) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~ 223 (392) +++++.+.++... .....+.|++|+++++++ +++|+++++++++++++++||+++|+|+. ++++||.++|+++++.+ T Consensus 151 ~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~ 227 (395) T protein:vir:43 151 TTESNSVEYVRET-GFVNNAAPVSEGTQKPYS-DLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLV 227 (395) T ss_pred ecCCCceEEEEEe-cCCCceeeecCCcccccc-ccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHH Confidence 9987655554432 334678999999999875 69999999999999999999999999875 79999999999999999 Q ss_pred HHHHHhhcccccccc-------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 224 RNVLILGVIEKLTKQ-------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) Q Consensus 224 ~d~~~~~~~~~~~~~-------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~ 284 (392) +|.++++|.|+..+. +...++++.+++ ..+...+..+++|+|||++|..|++++|++|+| T Consensus 228 ~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~ 306 (395) T protein:vir:43 228 EECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAI-LQAQLAEFPASGIVLNPIDWALIELNKDAENRY 306 (395) T ss_pred HHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHH-HhhccccCCCcEEEEcHHHHHHHHHhhccCCce Confidence 999999987764431 123467777765 466888888999999999999999999999999 Q ss_pred eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEE Q lcl|Aclame:pro 285 ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQ 364 (392) Q Consensus 285 l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~ 364 (392) ||.+ ..++.+.+++|. ||++.+ .+| .+.++||||+++|.+++|.+++++++++.+.+|++|++.||+++ T Consensus 307 i~~~-~~~~~~~~l~G~-pVv~~~--~~~-------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~ 375 (395) T protein:vir:43 307 IIGS-PQNGTTPTLWRL-PVVETQ--AIT-------QDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEE 375 (395) T ss_pred eccc-cccCCCceecce-eeEEcC--CCC-------CCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEE Confidence 9975 556667788985 665532 333 33579999999899999999999999998889999999999999 Q ss_pred eeCcEEecccceEEEEeccc Q lcl|Aclame:pro 365 RDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 365 r~~~~v~~~~af~~l~~~~~ 384 (392) |+|+++.+|+||++++++++ T Consensus 376 r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 376 RLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred eeccEEecccceEEEEeccC Confidence 99999999999999999988 No 48 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=5.1e-58 Score=334.69 Aligned_cols=375 Identities=13% Similarity=0.076 Sum_probs=250.0 Q ss_pred CCHHH-HHHHHHHHHHHHHHHHHhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcccccccccc--- Q lcl|Aclame:pro 1 MSKEL-RELLAKLEGKKEEVRSLMGED--KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER----NNGREVETRNV--- 70 (392) Q Consensus 1 M~kel-~el~~~~~~~~~e~~~~~~~~--~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~----~~~~~~~~~~~--- 70 (392) |-||. +...++..++.+|++++.++- ..+..+++.++++.+.+.++..+.......... ........... T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEF 80 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhh Confidence 66554 444444555566666655431 122333344444444444433332221111111 00000000000 Q ss_pred --chhhHHHHHHHHHHhcchhhHH-HHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 71 --DGEMEYRDVFMKALRNKPLNAE-EREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 71 --~~~~~~~~a~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) ....+....... ......... ...........+....++...+|+++|+++.+.|++.+++.++|++++++.++++ T Consensus 81 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 159 (413) T protein:vir:81 81 FAKRAGDQIKQQAG-GAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTN 159 (413) T ss_pred hhhhhhhHHHHHHH-HHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccC Confidence 000000000000 000000000 0000111122233445566788999999999999999999999999999999988 Q ss_pred CcceeEEEeec--CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNS--DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRN 225 (392) Q Consensus 148 ~~~~~~~~~~~--~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d 225 (392) ....+++.... ....++|++|++++++++.++|++|++++++++++++||+++|+|+. .|.+||.++|+++++.++| T Consensus 160 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d 238 (413) T protein:vir:81 160 TTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEE 238 (413) T ss_pred CceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 77777665433 33568999999999987667899999999999999999999999986 5999999999999999999 Q ss_pred HHHhhcccccccc----------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccc Q lcl|Aclame:pro 226 VLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) Q Consensus 226 ~~~~~~~~~~~~~----------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~ 289 (392) .++++|.|+..+. +...++++..++.............|+|||++|.+|++|||++|+|||.+. T Consensus 239 ~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~ 318 (413) T protein:vir:81 239 RQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGV 318 (413) T ss_pred HHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceecccc Confidence 9999998765442 223455555655443333333444699999999999999999999999876 Q ss_pred ccCC-------cccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEE Q lcl|Aclame:pro 290 PTQK-------NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) Q Consensus 290 ~~~~-------~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~ 362 (392) ...+ .+.++||. ||++++ .+| .+.++||||+++|.+++|++++++++++.+.+|++|++.||+ T Consensus 319 ~~~~~~~~~~~~~~~l~G~-pv~~s~--~~~-------~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~ 388 (413) T protein:vir:81 319 FQGQYGSGGIMLDPAPWGL-RTVQSQ--VVP-------VGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRA 388 (413) T ss_pred ccccccccccccCceecce-eeEEcC--CCC-------cccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEE Confidence 5433 23468875 665533 222 346899999999999999999999999988889999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQP 390 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~~~ 390 (392) ++|+|+++.+|+||++++++++ ++| T Consensus 389 ~~r~d~~~~~~~a~~~l~~~~~---~~p 413 (413) T protein:vir:81 389 EERVGLMVTFPEAIVQLDVAEV---VTP 413 (413) T ss_pred EEeeccEEecccceEEEEecCC---CCC Confidence 9999999999999999997643 223 No 49 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.3e-57 Score=332.46 Aligned_cols=359 Identities=12% Similarity=0.108 Sum_probs=245.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhh---hhhHHHHHHH--------HHHHHHHHHHHHHHHHH-HHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMG---EDKVAEAEQM--------MEEVRSLQKKIDLQRSL-DEAETEERNNGREVETR 68 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~---~~~~~~~~~~--------~~ei~~l~~~i~~~~~~-~~~~~~~~~~~~~~~~~ 68 (392) |+ +.|+.++++++.+++++..+ ++..+..+.+ .+++++++.++..+.+. .+.+.+.......... T Consensus 1 m~--~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~- 77 (379) T protein:vir:10 1 ME--ALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDK- 77 (379) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc- Confidence 87 23444444444444443321 1111111111 11122233333222211 1111111111111110 Q ss_pred ccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCC Q lcl|Aclame:pro 69 NVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTR 148 (392) Q Consensus 69 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~ 148 (392) ...+.+.+....+.. ...+.. ...........+++++++.+||+++...|++.+++.++|+++|++.+++++ T Consensus 78 ----~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~ 149 (379) T protein:vir:10 78 ----SDSLVKSITENFNDI---KEVRNG-KSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGG 149 (379) T ss_pred ----chhHHHHHHHHHHhH---HHHHhh-hhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCC Confidence 111111111111110 011111 111111122234455566679999999999999999999999999999887 Q ss_pred cceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 149 SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLI 228 (392) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~ 228 (392) +..++.....+.+.+.|++|++.+|++ +++|++|++++++++++++||+|+|+|+. +|.+||.++|+++++.++|.++ T Consensus 150 ~~~~~~~~~~~~~~~~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~ 227 (379) T protein:vir:10 150 TYTFVRENGAGEGAIGAQVEGATKGQK-DYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAF 227 (379) T ss_pred ceEEEEeecCCCcccccccCCcccccc-ccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 777766666667788999999999975 69999999999999999999999999985 6999999999999999999999 Q ss_pred hhccccccc------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc--CCcccceec Q lcl|Aclame:pro 229 LGVIEKLTK------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT--QKNKKLFAG 300 (392) Q Consensus 229 ~~~~~~~~~------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~--~~~~~~~~g 300 (392) +.|+++.+. .+...+|++.+++. .+..++..+++|||||.+|..|+++||++|+|||+|++. .+.+.++|| T Consensus 228 ~~g~~~~~~~~~~~~~~~~~~d~i~~~~~-~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G 306 (379) T protein:vir:10 228 NAVLAANATASTEIITNKNKVEMLINEIA-KQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRING 306 (379) T ss_pred hcccccccccccccccCcccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecc Confidence 998876543 23445778888764 556777888899999999999999999999999998764 455567888 Q ss_pred ccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEE Q lcl|Aclame:pro 301 TNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE 380 (392) Q Consensus 301 ~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~ 380 (392) . ||++. ..++ .+.++||||++++ ++.|+++++.++++...+|++|++.||+++|+|++|.+|+||++++ T Consensus 307 ~-pvv~s--~~~~-------ag~~~~gdf~~~~-~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~ 375 (379) T protein:vir:10 307 I-PLFRA--TWLA-------ANKYYVGDWTRVT-KVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGD 375 (379) T ss_pred e-eeEec--CCCC-------CCceEEeecccEE-EEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEE Confidence 5 56542 2232 2358999999854 6678899999999888889999999999999999999999999999 Q ss_pred eccc Q lcl|Aclame:pro 381 IDLS 384 (392) Q Consensus 381 ~~~~ 384 (392) +++- T Consensus 376 ~~~~ 379 (379) T protein:vir:10 376 FTAV 379 (379) T ss_pred ecCC Confidence 9877 No 50 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3e-57 Score=330.43 Aligned_cols=370 Identities=15% Similarity=0.165 Sum_probs=258.9 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHHhhhh----------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MS-KELRELLAKLEGKKEEVRSLMGED----------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) Q Consensus 1 M~-kel~el~~~~~~~~~e~~~~~~~~----------~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) .. .++++++.+.....++++++.++. ..++++++..++..++..+....+..+....... .... T Consensus 140 ~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~-----~~~~ 214 (543) T protein:vir:81 140 LEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLA-----RQCL 214 (543) T ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----hhhh Confidence 00 134556666666666665543321 1223344444444444433332222111111110 0011 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHH-HhHHhhhhhhhhcceeeccCC Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQIN-ELARSFDALEQYVTVEPVRTR 148 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii-~~~~~~~~l~~l~~~~~~~~~ 148 (392) .........++.+.++.............. .........+..+||++||+++...|| ..+++.++|+.++++.++ T Consensus 215 ~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~-~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~--- 290 (543) T protein:vir:81 215 ATSSPAYLRAWSKMARNPHAAILTEEEKRA-INEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA--- 290 (543) T ss_pred hhhhhhhhhHHHHHHHhhHHHHhhhhhhhh-hhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC--- Confidence 112233444554444433221111111111 122223345667899999999998865 667888999999887544 Q ss_pred cceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 149 SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLI 228 (392) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~ 228 (392) ++.+.+++..+++.+.|++|++.++++ .++|++|++++++++++++||+++++|+ ++|.+||.+.|+++++.++|.++ T Consensus 291 ~g~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ai 368 (543) T protein:vir:81 291 TGDVWHGVSSAAVQWSWDAEFEEVSDD-SPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTL 368 (543) T ss_pred CcceEEEEecCCcceeecccCcccccc-ccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHH Confidence 456777888889999999999999864 6999999999999999999999999997 69999999999999999999999 Q ss_pred hhccccccc-------------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccc Q lcl|Aclame:pro 229 LGVIEKLTK-------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSD 289 (392) Q Consensus 229 ~~~~~~~~~-------------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~ 289 (392) ++|.|+... .+..+++++++++ ..++.+|..+++|+|||++|..|+++||++|+|||.+ T Consensus 369 l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~- 446 (543) T protein:vir:81 369 TTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVY-EQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTT- 446 (543) T ss_pred hccCCCCcccccchhhcccccccccccccccccHHHHHHHH-HhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccC- Confidence 998775421 1123567777765 4678899999999999999999999999999999986 Q ss_pred ccCCcccceecccceEEecCccccc-ccccCCcceEEEEehhhceeeeeccceEEEEeccch--hhhhcCceeEEEEEee Q lcl|Aclame:pro 290 PTQKNKKLFAGTNPVVVVSNRFLKS-KGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--KAFTRNTLDLRAIQRD 366 (392) Q Consensus 290 ~~~~~~~~~~g~~pv~~~~~~~~~~-~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~--~~f~~~~~~~~~~~r~ 366 (392) +..+.+.+++|. ||+++++..... .+...+..+++||||+. |.+++++++++.++++.. ..|.+|++.|++++|+ T Consensus 447 ~~~g~~~~l~G~-pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~ 524 (543) T protein:vir:81 447 IGNGEPSQLLGR-PVGEAEAMDANWNTSASADNFVLLYGNFQN-YVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRM 524 (543) T ss_pred cCCCCCccccce-eeEEeccccccccccccCCcceEEEeeccc-eeEEeecccEEEEeccccccchhhcCceEEEEEEee Confidence 445667788884 676665432222 23445677899999985 678999999999988753 3577899999999999 Q ss_pred CcEEecccceEEEEecccC Q lcl|Aclame:pro 367 DVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 367 ~~~v~~~~af~~l~~~~~a 385 (392) |+++.+|+||++++++++| T Consensus 525 d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 525 GADVVNPNAFRLLNVETAS 543 (543) T ss_pred ccEeecccceEEEEecccC Confidence 9999999999999999988 No 51 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.4e-56 Score=326.73 Aligned_cols=361 Identities=13% Similarity=0.127 Sum_probs=250.5 Q ss_pred CCHHHHHHHHH-------HHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhh--cccccccc Q lcl|Aclame:pro 1 MSKELRELLAK-------LEGKKEEVRSLMGEDK--VAEAEQMMEEVRSLQKKIDLQRSLDEAE-TEERN--NGREVETR 68 (392) Q Consensus 1 M~kel~el~~~-------~~~~~~e~~~~~~~~~--~~~~~~~~~ei~~l~~~i~~~~~~~~~~-~~~~~--~~~~~~~~ 68 (392) |++ |.||+++ ++++.++++.+..+++ .+++.++.++++.|+++++.+++..+.. .+... ........ T Consensus 1 Mk~-l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:26 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 763 3444444 4444444444333322 2456667777777777776654432221 11111 11111111 Q ss_pred ccchhhHHHHHHHHHHhcchhhHHH-HHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 69 NVDGEMEYRDVFMKALRNKPLNAEE-REFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 69 ~~~~~~~~~~a~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) .........+++..++|........ ........+.+++..+++++||++||+++.+.|++.+++.++|++++++.++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 159 (387) T protein:vir:26 80 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC Confidence 1122223334555555554332222 222233445567777888889999999999999999999999999999988764 Q ss_pred CcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVL 227 (392) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~ 227 (392) . .++.. ..+...++|++|++..+++ .++|++|++.+++++++++||+|+|+||.++|++||.++|+++++.+++.. T Consensus 160 ~--~~p~~-~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:26 160 L--EIPRV-SYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred c--eeeee-eccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 34433 3345678999999999876 599999999999999999999999999999999999999999999997654 Q ss_pred -Hhhcccccc------------ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 228 -ILGVIEKLT------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 228 -~~~~~~~~~------------~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +..|.++.. ..+...+|+++++++ .++.+|+.+++|+||+.+|..+.++++..|+|+|. +. T Consensus 236 ~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~ 309 (387) T protein:vir:26 236 ALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TP 309 (387) T ss_pred HhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cC Confidence 444443221 123345889999775 78899999999999999998877777777788875 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) |.+++| +||+++++ ..+++||||+++|..+ .++.+..+.+ ..+|++.|+++.|+|+++++|+ T Consensus 310 ~~~llG-~PV~~~~~-----------~~~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~ 371 (387) T protein:vir:26 310 AEKVFG-KPVVFTDA-----------AVKPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDS 371 (387) T ss_pred Cccccc-cceEEecC-----------CCceeeechhhhhhhh--hhhhheeccc----ccCCceEEEEEEEeCcEeechh Confidence 668888 47766542 2357999999877543 4555554443 2468999999999999999999 Q ss_pred ceEEEEecccCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQP 390 (392) Q Consensus 375 af~~l~~~~~a~~~~~ 390 (392) ||++|++|+++..+|. T Consensus 372 A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 372 AFRIAKAKENTGPLPS 387 (387) T ss_pred heEEEEeecCCCCCCC Confidence 9999999887666655 No 52 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.4e-56 Score=326.73 Aligned_cols=361 Identities=13% Similarity=0.127 Sum_probs=250.5 Q ss_pred CCHHHHHHHHH-------HHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhh--cccccccc Q lcl|Aclame:pro 1 MSKELRELLAK-------LEGKKEEVRSLMGEDK--VAEAEQMMEEVRSLQKKIDLQRSLDEAE-TEERN--NGREVETR 68 (392) Q Consensus 1 M~kel~el~~~-------~~~~~~e~~~~~~~~~--~~~~~~~~~ei~~l~~~i~~~~~~~~~~-~~~~~--~~~~~~~~ 68 (392) |++ |.||+++ ++++.++++.+..+++ .+++.++.++++.|+++++.+++..+.. .+... ........ T Consensus 1 Mk~-l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:96 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 763 3444444 4444444444333322 2456667777777777776654432221 11111 11111111 Q ss_pred ccchhhHHHHHHHHHHhcchhhHHH-HHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 69 NVDGEMEYRDVFMKALRNKPLNAEE-REFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 69 ~~~~~~~~~~a~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) .........+++..++|........ ........+.+++..+++++||++||+++.+.|++.+++.++|++++++.++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 159 (387) T protein:vir:96 80 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC Confidence 1122223334555555554332222 222233445567777888889999999999999999999999999999988764 Q ss_pred CcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVL 227 (392) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~ 227 (392) . .++.. ..+...++|++|++..+++ .++|++|++.+++++++++||+|+|+||.++|++||.++|+++++.+++.. T Consensus 160 ~--~~p~~-~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:96 160 L--EIPRV-SYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred c--eeeee-eccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 34433 3345678999999999876 599999999999999999999999999999999999999999999997654 Q ss_pred -Hhhcccccc------------ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 228 -ILGVIEKLT------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 228 -~~~~~~~~~------------~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +..|.++.. ..+...+|+++++++ .++.+|+.+++|+||+.+|..+.++++..|+|+|. +. T Consensus 236 ~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~ 309 (387) T protein:vir:96 236 ALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TP 309 (387) T ss_pred HhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cC Confidence 444443221 123345889999775 78899999999999999998877777777788875 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) |.+++| +||+++++ ..+++||||+++|..+ .++.+..+.+ ..+|++.|+++.|+|+++++|+ T Consensus 310 ~~~llG-~PV~~~~~-----------~~~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~ 371 (387) T protein:vir:96 310 AEKVFG-KPVVFTDA-----------AVKPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDS 371 (387) T ss_pred Cccccc-cceEEecC-----------CCceeeechhhhhhhh--hhhhheeccc----ccCCceEEEEEEEeCcEeechh Confidence 668888 47766542 2357999999877543 4555554443 2468999999999999999999 Q ss_pred ceEEEEecccCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQP 390 (392) Q Consensus 375 af~~l~~~~~a~~~~~ 390 (392) ||++|++|+++..+|. T Consensus 372 A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 372 AFRIAKAKENTGPLPS 387 (387) T ss_pred heEEEEeecCCCCCCC Confidence 9999999887666655 No 53 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.4e-56 Score=326.73 Aligned_cols=361 Identities=13% Similarity=0.127 Sum_probs=250.5 Q ss_pred CCHHHHHHHHH-------HHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhh--cccccccc Q lcl|Aclame:pro 1 MSKELRELLAK-------LEGKKEEVRSLMGEDK--VAEAEQMMEEVRSLQKKIDLQRSLDEAE-TEERN--NGREVETR 68 (392) Q Consensus 1 M~kel~el~~~-------~~~~~~e~~~~~~~~~--~~~~~~~~~ei~~l~~~i~~~~~~~~~~-~~~~~--~~~~~~~~ 68 (392) |++ |.||+++ ++++.++++.+..+++ .+++.++.++++.|+++++.+++..+.. .+... ........ T Consensus 1 Mk~-l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:94 1 MPT-LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 763 3444444 4444444444333322 2456667777777777776654432221 11111 11111111 Q ss_pred ccchhhHHHHHHHHHHhcchhhHHH-HHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 69 NVDGEMEYRDVFMKALRNKPLNAEE-REFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 69 ~~~~~~~~~~a~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) .........+++..++|........ ........+.+++..+++++||++||+++.+.|++.+++.++|++++++.++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 159 (387) T protein:vir:94 80 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC Confidence 1122223334555555554332222 222233445567777888889999999999999999999999999999988764 Q ss_pred CcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVL 227 (392) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~ 227 (392) . .++.. ..+...++|++|++..+++ .++|++|++.+++++++++||+|+|+||.++|++||.++|+++++.+++.. T Consensus 160 ~--~~p~~-~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:94 160 L--EIPRV-SYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred c--eeeee-eccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 34433 3345678999999999876 599999999999999999999999999999999999999999999997654 Q ss_pred -Hhhcccccc------------ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 228 -ILGVIEKLT------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 228 -~~~~~~~~~------------~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +..|.++.. ..+...+|+++++++ .++.+|+.+++|+||+.+|..+.++++..|+|+|. +. T Consensus 236 ~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~ 309 (387) T protein:vir:94 236 ALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TP 309 (387) T ss_pred HhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cC Confidence 444443221 123345889999775 78899999999999999998877777777788875 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) |.+++| +||+++++ ..+++||||+++|..+ .++.+..+.+ ..+|++.|+++.|+|+++++|+ T Consensus 310 ~~~llG-~PV~~~~~-----------~~~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~ 371 (387) T protein:vir:94 310 AEKVFG-KPVVFTDA-----------AVKPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDS 371 (387) T ss_pred Cccccc-cceEEecC-----------CCceeeechhhhhhhh--hhhhheeccc----ccCCceEEEEEEEeCcEeechh Confidence 668888 47766542 2357999999877543 4555554443 2468999999999999999999 Q ss_pred ceEEEEecccCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQP 390 (392) Q Consensus 375 af~~l~~~~~a~~~~~ 390 (392) ||++|++|+++..+|. T Consensus 372 A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 372 AFRIAKAKENTGPLPS 387 (387) T ss_pred heEEEEeecCCCCCCC Confidence 9999999887666655 No 54 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=2.8e-56 Score=325.11 Aligned_cols=361 Identities=13% Similarity=0.125 Sum_probs=252.4 Q ss_pred CC------HHHHHHHHHHHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhhc--cccccccc Q lcl|Aclame:pro 1 MS------KELRELLAKLEGKKEEVRSLMGEDK--VAEAEQMMEEVRSLQKKIDLQRSLDEA-ETEERNN--GREVETRN 69 (392) Q Consensus 1 M~------kel~el~~~~~~~~~e~~~~~~~~~--~~~~~~~~~ei~~l~~~i~~~~~~~~~-~~~~~~~--~~~~~~~~ 69 (392) |+ +++.+++++++++.++++.+..+++ .++++++.++++.|+.+++.+++..+. +.+.... ........ T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 65 2344555555666666665554332 345677777788888888765543322 2121111 11111111 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHH-HHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCC Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTR 148 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~ 148 (392) ........+++..++|........+. ......+.+++..++.++||++||+++.+.|++.+++.++|+++|++.++++. T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~ 160 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL 160 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCc Confidence 12222223445555555433322222 22334456778888888999999999999999999999999999999887643 Q ss_pred cceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 149 SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVL- 227 (392) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~- 227 (392) .++.. ..+...++|++|++..+++ .++|++|++++++++++++||+|+|+||.++|++||.++|+++++.+++.. T Consensus 161 --~~p~~-~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~ 236 (387) T protein:vir:93 161 --EIPRV-SYTLDDDDFITDVETAKEL-KLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDA 236 (387) T ss_pred --eEEEE-eecCCccccccCccccccc-ccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhH Confidence 33332 3445678999999998875 699999999999999999999999999999999999999999999998765 Q ss_pred Hhhccccccc------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHH-HHhhccCCceeecccccCCc Q lcl|Aclame:pro 228 ILGVIEKLTK------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYL-DKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 228 ~~~~~~~~~~------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L-~~lkd~~g~~l~~~~~~~~~ 294 (392) +..|.|++.+ .+...||+++++++ .++.+|+.+++|+||+.+|..+ ++++|++| ++|. +. T Consensus 237 ~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~~-~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~-~~~~-----~~ 309 (387) T protein:vir:93 237 LAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFFD-----TP 309 (387) T ss_pred hhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCC-cccc-----cC Confidence 4444443221 23345889988765 7899999999999999998765 45566555 5553 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) |.+++|. ||+++++ ..+++||||+++|.. +.++.+..+.+ +.++++.|+++.|+|+++++|+ T Consensus 310 ~~~llG~-PV~~~~~-----------~~~~~~GDf~~~~~~--~~~~~~~~~~~----~~~~~~~~~~~~r~d~~v~~~e 371 (387) T protein:vir:93 310 AEKVFGK-PVVFTDA-----------AVKPIVGDFNYFGIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDS 371 (387) T ss_pred Ccccccc-ceEEecC-----------CCceeeeehhhhhee--hhhheeeeccc----ccCCceeEEEEeeeCceeechh Confidence 5688884 6766442 235789999987654 45566655443 4578999999999999999999 Q ss_pred ceEEEEecccCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQP 390 (392) Q Consensus 375 af~~l~~~~~a~~~~~ 390 (392) ||+++++++++..+|. T Consensus 372 A~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 372 AFRIAKAKENTGSLPS 387 (387) T ss_pred heEEEEeecCCCCCCC Confidence 9999999987766666 No 55 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.8e-56 Score=326.19 Aligned_cols=361 Identities=13% Similarity=0.125 Sum_probs=248.3 Q ss_pred CCHHHHHHHH-------HHHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcccccccc Q lcl|Aclame:pro 1 MSKELRELLA-------KLEGKKEEVRSLMGEDK--VAEAEQMMEEVRSLQKKIDLQRSLDEAETEER---NNGREVETR 68 (392) Q Consensus 1 M~kel~el~~-------~~~~~~~e~~~~~~~~~--~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~---~~~~~~~~~ 68 (392) |+ .|.||++ +++++.++++.+..+++ .++++++.+++++|+.+++.+++..+...+.. ......... T Consensus 16 mk-~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 94 (402) T protein:vir:93 16 MP-TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 94 (402) T ss_pred Ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 65 2344444 44444444444333222 24566667777777777766544332221111 111111111 Q ss_pred ccchhhHHHHHHHHHHhcchhhHHHHH-HHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccC Q lcl|Aclame:pro 69 NVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) Q Consensus 69 ~~~~~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 147 (392) .........+++..+++.......... ......+.+++..+++++||++||+++...|++.+++.++|+++|+++++++ T Consensus 95 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~ 174 (402) T protein:vir:93 95 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 174 (402) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC Confidence 222223333445555554433222222 2223345566777888889999999999999999999999999999988764 Q ss_pred CcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVL 227 (392) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~ 227 (392) . .++.. ..+...+.|++|++..+++ .++|++|++.+++++++++||+|+|+||.+++++||.++|+++++.+++.. T Consensus 175 ~--~~p~~-~~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~ 250 (402) T protein:vir:93 175 L--EIPRV-SYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKD 250 (402) T ss_pred c--eeeee-eccCCcccccccccccccc-ccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 33332 2345678999999999876 599999999999999999999999999999999999999999999997664 Q ss_pred -Hhhccccccc------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 228 -ILGVIEKLTK------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 228 -~~~~~~~~~~------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +..|.+++.+ .+...+|+++++++ .++.+|+.+++|+||+.+|..++++++.+|+++|. +. T Consensus 251 ~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~-----~~ 324 (402) T protein:vir:93 251 ALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TP 324 (402) T ss_pred HhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cC Confidence 4444433221 23345788998775 78899999999999999988877666667778775 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) |.+++| +||+++++ ..+++||||+++|.+++ ++.+..+.+ ..++++.|+++.|+|+++++|+ T Consensus 325 ~~~llG-~PV~~t~~-----------~~~i~~GDf~~~~~~~~--~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~~~~ 386 (402) T protein:vir:93 325 AEKVFG-KPVVFTDA-----------AVKPIVGDFNYFGINYD--GTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDS 386 (402) T ss_pred Cccccc-cceEEecC-----------CCceeeechhhhhhhhh--hhhhhhhhc----ccCCceEEEEEEEeCcEEechh Confidence 678898 57776542 23579999998876554 344444332 2368999999999999999999 Q ss_pred ceEEEEecccCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQP 390 (392) Q Consensus 375 af~~l~~~~~a~~~~~ 390 (392) ||++|++|+++..+|. T Consensus 387 A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 387 AFRIAKAKENTGPLPS 402 (402) T ss_pred heEEEEeecCCCCCCC Confidence 9999999987665555 No 56 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=9.1e-56 Score=322.34 Aligned_cols=367 Identities=14% Similarity=0.085 Sum_probs=240.0 Q ss_pred CC--------------HHH----HHHHHHHHHHHHHHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MS--------------KEL----RELLAKLEGKKEEVRSLMG---EDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEER 59 (392) Q Consensus 1 M~--------------kel----~el~~~~~~~~~e~~~~~~---~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~ 59 (392) |. ++. +..+++.++...+++++.. ++..++++...+|++.+.++++...+......+.. T Consensus 12 ~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~~~a~~~e~~ 91 (458) T protein:vir:10 12 LGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNELFAQTVEKQ 91 (458) T ss_pred hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 000 0001111111111111111 11222233333344444333322221111100000 Q ss_pred ---------------------hcccccccccc------chhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhh-hccccc Q lcl|Aclame:pro 60 ---------------------NNGREVETRNV------DGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRA-MSGLTG 111 (392) Q Consensus 60 ---------------------~~~~~~~~~~~------~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a-~~~~~~ 111 (392) ........... .......+++...++........+. .....+ ....+. T Consensus 92 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~----~~~~~a~~~~~~~ 167 (458) T protein:vir:10 92 QETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHG----QRHLKAVNQSSSV 167 (458) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhh----hhhhhhhhhcccC Confidence 00000000000 0000011122222222111111111 011112 223344 Q ss_pred cccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccc-----cccceeeEEec Q lcl|Aclame:pro 112 EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPET-----DNPKFSNVQYA 186 (392) Q Consensus 112 ~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~-----~~~~~~~v~~~ 186 (392) ..||++||+++.+.|++.+++.++|+++|++.+++++. +.+++..+.+.+.|++|++..+++ +.++|++++++ T Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~ 245 (458) T protein:vir:10 168 EVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKI--LTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFS 245 (458) T ss_pred ccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcc--eEEEEecCCcceeecccccccccccccccccccceeeEee Confidence 57899999999999999999999999999999887654 445667778889999999887754 35689999999 Q ss_pred hhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc----------------------hhhHH Q lcl|Aclame:pro 187 VKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA----------------------IKSLD 244 (392) Q Consensus 187 ~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~----------------------~~~~d 244 (392) +++++++++||+++|+|+.++|.+||.++|+++++.++|.++++|.|++.+.+ ..+|+ T Consensus 246 ~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (458) T protein:vir:10 246 TYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAK 325 (458) T ss_pred eeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccccccccHH Confidence 99999999999999999999999999999999999999999999887644321 23578 Q ss_pred HHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccc----cCCcccceecccceEEecCcccccccccCC Q lcl|Aclame:pro 245 DIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP----TQKNKKLFAGTNPVVVVSNRFLKSKGTTAK 320 (392) Q Consensus 245 ~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~----~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~ 320 (392) +++++++ .++.+|..+++|+|||++|..|+++||++|+|||.+.. ..+.+.++|| +||+++ ..+|.. .+ T Consensus 326 ~i~~~~~-~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G-~pv~~~--~~~p~~---~~ 398 (458) T protein:vir:10 326 TISKLRR-KLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYG-LPVVVS--EYFPAK---AN 398 (458) T ss_pred HHHHHHH-hhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecc-eeeEEc--cccccc---cC Confidence 8998764 67888899999999999999999999999999997644 3455667888 566553 345543 34 Q ss_pred cceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) ...++||||+++|.++++.++++..+++ +.+|++.||++.|+|+.+.+|+||++.++.++ T Consensus 399 ~~~~~~~~f~~~~~~~~~~~~~v~~d~~----~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 399 SAEFAVIVYKDNFVMPRQRAVTVERERQ----AGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CcceEEEEecccEEEEEeeceEEEeecc----cCCCceEEEEEEEecceEecccceEEEeeccC Confidence 4568999999889999999999987653 46899999999999999999999999888877 No 57 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=4.8e-55 Score=318.37 Aligned_cols=365 Identities=17% Similarity=0.191 Sum_probs=237.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHH----------Hhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHH---Hh Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRS----------LMGEDK-VAEAEQMMEEVRSLQKKIDLQRSLDEA-------ETE---ER 59 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~----------~~~~~~-~~~~~~~~~ei~~l~~~i~~~~~~~~~-------~~~---~~ 59 (392) |++|+++++++++++.++..+ .+++.+ .++...+.+++..++++.+.+.+.... ..+ .. T Consensus 8 ~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~~~l~~~ 87 (425) T protein:vir:95 8 LTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELEQI 87 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 344555555544444432222 111111 112223333333332222211111111 000 00 Q ss_pred hccccccccccchhhHHH-------HHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHh Q lcl|Aclame:pro 60 NNGREVETRNVDGEMEYR-------DVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS 132 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~-------~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~ 132 (392) .................. ..+...++..... ................+.++||++||+++.+.|++.+++ T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~ 164 (425) T protein:vir:95 88 NSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYY---KRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGD 164 (425) T ss_pred hhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhh---hhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHh Confidence 000000000000000000 0111111111100 000001111222233455678999999999999999999 Q ss_pred hhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHH Q lcl|Aclame:pro 133 FDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYV 212 (392) Q Consensus 133 ~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v 212 (392) .++|++++++.++++ .+.+|+..+.+.++|++|+++.++++.++|++|++++++++++++||+|+|+|+.++|++|| T Consensus 165 ~~~i~~~~~~~~~~g---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i 241 (425) T protein:vir:95 165 YTTLYPLVDKIRVKG---TTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYV 241 (425) T ss_pred hhhHHHhhceeecCc---eeEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHH Confidence 999999999988753 45677888889999999999998877789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccccc--c----------------cchhhHHHHHHHHHHHhhhccc--CCceEEEcHHHH- Q lcl|Aclame:pro 213 TKWLGKKSKVTRNVLILGVIEKLT--K----------------QAIKSLDDIKDVLNVKLDPAIS--PNAILLTNQDGF- 271 (392) Q Consensus 213 ~~~l~~~~~~~~d~~~~~~~~~~~--~----------------~~~~~~d~~~~~~~~~~~~~~~--~~a~~v~~~~~~- 271 (392) .++|+++++.++|.++++|.|+.+ + .+..+++++.+++. .+..++. .+++|+||+.++ T Consensus 242 ~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~ 320 (425) T protein:vir:95 242 TKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIG-LIDTGDDSVGEIVAVMKRSTYY 320 (425) T ss_pred HHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHH-hhhhhccccCceEEEEeChHHH Confidence 999999999999999999987532 1 12335677777654 3455443 567899999984 Q ss_pred ---HHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEecc Q lcl|Aclame:pro 272 ---NYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDV 348 (392) Q Consensus 272 ---~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~ 348 (392) ..|+++||++|+|||+++ .+...++||. ||++. ..+| ...++||||++ |.+++|+++++.++++ T Consensus 321 ~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~-pvv~~--~~~~-------~~~i~~Gd~~~-~~~~~~~~~~i~~~~~ 387 (425) T protein:vir:95 321 NRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGL-RVVFN--NFLD-------DDTVLFGEFEQ-YTLVERENITIDSSTH 387 (425) T ss_pred HHHHHHHhhcCCCCceeeccC--CCCCccccce-eeEEc--CcCC-------CccEEEEeccc-EEEEeecceEEEeecc Confidence 346788999999999854 3445678885 55542 2333 33589999997 6788999999999986 Q ss_pred chhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 349 ~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) . +|.+|++.||++.|+|+++++|+||+++++++. ++| T Consensus 388 ~--~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~-----~~g 424 (425) T protein:vir:95 388 V--KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDP-----VQG 424 (425) T ss_pred c--ccccCceEEEEEEeeCcEeecccceEEEEecCc-----CCC Confidence 4 799999999999999999999999999997753 223 No 58 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=1.6e-54 Score=315.52 Aligned_cols=373 Identities=16% Similarity=0.165 Sum_probs=255.5 Q ss_pred CC--HHHHHHHHHHHHHHHHHHHHhhhh--hHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccch Q lcl|Aclame:pro 1 MS--KELRELLAKLEGKKEEVRSLMGED--KVAEA----EQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDG 72 (392) Q Consensus 1 M~--kel~el~~~~~~~~~e~~~~~~~~--~~~~~----~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) |. +.|+|++++++++.++.....++. ..++. ++++++++.+..+++..+...+................... T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 88 458888888888777665543321 12222 33444444454444444443333322222111111111110 Q ss_pred -----hhHHHHHHHHHHhc----chhhHHHHHHHHhhhhhhhhccc-cccccceecchhhhhHHHHhHHhhhhhhhhcce Q lcl|Aclame:pro 73 -----EMEYRDVFMKALRN----KPLNAEEREFLEDDLEQRAMSGL-TGEDGGLVIPQDIQTQINELARSFDALEQYVTV 142 (392) Q Consensus 73 -----~~~~~~a~~~~~~~----~~~~~~~~~~~~~~~~~~a~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~ 142 (392) ....+....+.... .......+..............+ +..+++.++|..+...|+..+...+.|++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~ 160 (419) T protein:vir:94 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) T ss_pred ccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhccee Confidence 00011111111111 11111111111111122222222 234455667777777778888888899999999 Q ss_pred eeccCCcceeEEE------eecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHH Q lcl|Aclame:pro 143 EPVRTRSGSRVLE------KNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWL 216 (392) Q Consensus 143 ~~~~~~~~~~~~~------~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l 216 (392) .+++++...++.. ...+.+.++|++|++.++++ +++|+++++++++++++++||+|+++|+ .+|++||.++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~l 238 (419) T protein:vir:94 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) T ss_pred eeccCCceeeeeeccccccccccCcccceecCCcccccc-ccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHH Confidence 9887765554432 22345568899999999875 6999999999999999999999999987 57999999999 Q ss_pred HHHHHHHHHHHHhhcccccccc---------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHH Q lcl|Aclame:pro 217 GKKSKVTRNVLILGVIEKLTKQ---------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD 275 (392) Q Consensus 217 ~~~~~~~~d~~~~~~~~~~~~~---------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~ 275 (392) +++++.++|.++++|.|+..+. ....++++.+++. .+...+..+++|+|||++|..|+ T Consensus 239 a~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~-~~~~~~~~~~~~v~n~~~~~~l~ 317 (419) T protein:vir:94 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKT-VAEIAGFPPDGVVVHPQDWESIE 317 (419) T ss_pred HHHHHHHHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHH Confidence 9999999999999987764332 1234678888775 45566677889999999999999 Q ss_pred HhhccCCc-eeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhh Q lcl|Aclame:pro 276 KLKDKDGK-YILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) Q Consensus 276 ~lkd~~g~-~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~ 354 (392) +++|++|+ |+|+++..++.+.+++|. ||++++ .+| ...++||||+++|.+++|.++++.++++.+.+|. T Consensus 318 ~~k~~~~~~~~~~~~~~~~~~~~l~G~-pV~~~~--~~~-------~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~ 387 (419) T protein:vir:94 318 LDQAPGSGVFRVIANVQGEATPRIWGL-NVVSTV--AIA-------QGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) T ss_pred HHhhcCCCceeecCCcccCCCccccce-eeEEcC--CCC-------CccEEEeeccceEEEEEecceEEEEeccccchhh Confidence 99998665 578888888888999985 665543 222 3458999999988999999999999999888899 Q ss_pred cCceeEEEEEeeCcEEecccceEEEEecccCC Q lcl|Aclame:pro 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) Q Consensus 355 ~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~ 386 (392) +|++.||+++|+|+++++|+||+++++++++- T Consensus 388 ~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred cCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 99999999999999999999999999885443 No 59 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=3.6e-56 Score=324.52 Aligned_cols=339 Identities=12% Similarity=0.062 Sum_probs=245.5 Q ss_pred CCH---HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHH Q lcl|Aclame:pro 1 MSK---ELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYR 77 (392) Q Consensus 1 M~k---el~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |.. ++.++++++.++.+.+......+ ++.+...+..+.+.+++....+... + T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~~~e--e~~~~~~~~~~~~~~~~~~~~~~e~-----------------------~ 55 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGATSE--EQEKLFEAAFTTMGDEILAKNEEEM-----------------------E 55 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhhhhH--HHHHHHHHHHHhHHHHHHHHHHHHH-----------------------H Confidence 553 34444444444433333221111 1122233334444444433221111 1 Q ss_pred HHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEee Q lcl|Aclame:pro 78 DVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN 157 (392) Q Consensus 78 ~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ 157 (392) +++........+..+++.+... ....++.++||++||+++.+.|++.+.+.++|+++|++.++++ ...+++. T Consensus 56 ~~~~~~~~~~~lt~ee~~~~~~-----~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~---~~~~~~~ 127 (377) T protein:vir:98 56 RMFDLRDKNRELTAEEIKFFND-----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTA 127 (377) T ss_pred HHHHhccCCcccCHHHHHHHHH-----HHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc---ceEEEEe Confidence 1111111111222222222211 2334567789999999999999999999999999999988753 3456777 Q ss_pred cCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 158 SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK 237 (392) Q Consensus 158 ~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~ 237 (392) .+.+.+.|++|+++.+++++++|+++++++++++++++||+++|+|+.+++++||+++|+++++.+++.+|++|.|+..| T Consensus 128 ~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP 207 (377) T protein:vir:98 128 ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQP 207 (377) T ss_pred cCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcc Confidence 88889999999988776678999999999999999999999999999999999999999999999999999999987655 Q ss_pred cchh---------------------hHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccc------ Q lcl|Aclame:pro 238 QAIK---------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP------ 290 (392) Q Consensus 238 ~~~~---------------------~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~------ 290 (392) .+.. ..+.+.++ ...++..|+.+++|+||+.++..++++||.+|+|+|..++ T Consensus 208 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~ 286 (377) T protein:vir:98 208 VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL-SDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWAL 286 (377) T ss_pred eeeeecccccccccccccccccccchhhhHhhh-hhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhc Confidence 4321 12344443 4567888899999999999999999999999999995433 Q ss_pred --------cCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEE Q lcl|Aclame:pro 291 --------TQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) Q Consensus 291 --------~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~ 362 (392) ..|.+.+++|. |+.++.+..+|. ..++||||++ |.+++|++++++.+++. +|.+|++.|++ T Consensus 287 ~p~~~~~~~~G~~~t~lg~-p~~vv~s~~~p~-------~~i~fgdf~~-Y~i~~r~~~~i~~~~~~--~~~~d~~~f~~ 355 (377) T protein:vir:98 287 EAQFTSRNQFGEYVTVLPH-GITILESLAVET-------GKAIAFVANR-YDAFMATASTIEEYDQT--FAMEDLQLYLT 355 (377) T ss_pred cccccccCCCCccccccCC-CceEEecCCCCc-------ccEEEEEecc-eeEEeecceEEEeechh--hhhcCceEEEE Confidence 23445577774 665656555553 3589999998 78999999999999864 79999999999 Q ss_pred EEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~ 384 (392) .+|+|+++++|+||++|+++.. T Consensus 356 ~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 356 KNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEcCEEeccCcEEEEEEecC Confidence 9999999999999999999987 No 60 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2.2e-54 Score=314.74 Aligned_cols=374 Identities=11% Similarity=0.096 Sum_probs=256.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhh----------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-- Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMG----------EDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETR-- 68 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~----------~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~-- 68 (392) |.+.|+++++++.++.++++++++ .++.++++.+.+|+++++.+|++.+++.+............... T Consensus 195 ~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~~ 274 (645) T protein:vir:93 195 IGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNGNV 274 (645) T ss_pred hhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Confidence 667788888888888777766543 34556788899999999999988876554433322111110000 Q ss_pred -----cc--c-hhhHHHH-HHHHHHhc----c-hhhHHHHHHHH--------hhhhh----hhhccccccccceecchhh Q lcl|Aclame:pro 69 -----NV--D-GEMEYRD-VFMKALRN----K-PLNAEEREFLE--------DDLEQ----RAMSGLTGEDGGLVIPQDI 122 (392) Q Consensus 69 -----~~--~-~~~~~~~-a~~~~~~~----~-~~~~~~~~~~~--------~~~~~----~a~~~~~~~~gg~~iP~~~ 122 (392) .. . .....+. .|.+..+. . ........... ..... +..+..+...||+++|+++ T Consensus 275 ~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~ 354 (645) T protein:vir:93 275 AAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEY 354 (645) T ss_pred ccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhh Confidence 00 0 0111111 12222211 1 11100000000 00011 1223333445889999999 Q ss_pred hhHHHHhHHhhhhhhhhcceeecc--CCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHH Q lcl|Aclame:pro 123 QTQINELARSFDALEQYVTVEPVR--TRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL 200 (392) Q Consensus 123 ~~~ii~~~~~~~~l~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~ 200 (392) ..+|++.+++.++++.++.....+ +..+.+.+|+..+++.++|++|++.++++ +++|++|+++++|+++++++|+|+ T Consensus 355 ~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s-~~~f~~v~l~~~kla~~~~iS~el 433 (645) T protein:vir:93 355 AQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLT-KFDFESITFSHAKVSAIAVLTEEL 433 (645) T ss_pred HHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCcccccc-ccceeEEEEeeEEEEEeehhHHHH Confidence 999999999999999986553222 22346778888889999999999999875 699999999999999999999999 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----c-----------cchhhHHHHHHHHHHHhhhc-ccCCceE Q lcl|Aclame:pro 201 LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----K-----------QAIKSLDDIKDVLNVKLDPA-ISPNAIL 264 (392) Q Consensus 201 l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~----~-----------~~~~~~d~~~~~~~~~~~~~-~~~~a~~ 264 (392) |+|+.+++++||.++|+++++.++|.++++|.++.. + .+...++++..++....... ...+++| T Consensus 434 l~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~ 513 (645) T protein:vir:93 434 IRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQFVAANLQPTGAVW 513 (645) T ss_pred HhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHHHhcCCCccccEE Confidence 999999999999999999999999999998765431 1 11233456665554333333 3456799 Q ss_pred EEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEE Q lcl|Aclame:pro 265 LTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) Q Consensus 265 v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~ 344 (392) ||||.++.+|+++||++|+|+| |+.. ..+.+++| +||++.+ .+|+ .++||||++ +.++++.++.+. T Consensus 514 vmn~~~~~~L~~lkd~~G~~~~-~~~~-~~~~tL~G-~PV~~s~--~vp~--------~~~~gd~s~-~~ig~~~~v~i~ 579 (645) T protein:vir:93 514 LMSSTNALALSMRKNALGQKEY-PDMT-LLGGSFQG-LPVIVSQ--YVGD--------QLVLVNAPD-IYLADDGGVAVD 579 (645) T ss_pred EEcHHHHHHHHhccccCCceee-cCCC-CCCceeec-eeeEEec--cCCc--------ceeEecccc-EEEEEecceEEE Confidence 9999999999999999999998 4443 33457888 5676543 2332 267889986 457777888777 Q ss_pred Eeccch--------------------hhhhcCceeEEEEEeeCcEEecccceEEEE-ecccCCCCCCCC Q lcl|Aclame:pro 345 STDVGG--------------------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQPQG 392 (392) Q Consensus 345 ~~~~~~--------------------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~~~~~~~ 392 (392) ++++.. +.|++|++.||+++|+|++++||+||++|+ +++.+.-- | T Consensus 580 ~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~---~ 645 (645) T protein:vir:93 580 MSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASG---G 645 (645) T ss_pred eecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccC---C Confidence 765432 349999999999999999999999999998 66654422 2 No 61 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=3.4e-54 Score=313.73 Aligned_cols=380 Identities=16% Similarity=0.136 Sum_probs=235.5 Q ss_pred CCHHHHHHHHHHHHHH-------HHHHHHhhhhhHH--------HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKK-------EEVRSLMGEDKVA--------EAEQMMEEVRSLQKKIDLQ-------RSLDEAETEE 58 (392) Q Consensus 1 M~kel~el~~~~~~~~-------~e~~~~~~~~~~~--------~~~~~~~ei~~l~~~i~~~-------~~~~~~~~~~ 58 (392) |+|+|+++++++.+++ +++++++++.+.+ +..+..+++.+++++++.+ +++.+...+. T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~ 80 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERS 80 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 7777666665544444 4445544422111 1111122233333333333 2222221111 Q ss_pred hhccc--------cccccccchh-----hHHHHHHHHHHhcch----------------hhHHHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 59 RNNGR--------EVETRNVDGE-----MEYRDVFMKALRNKP----------------LNAEEREFLEDDLEQRAMSGL 109 (392) Q Consensus 59 ~~~~~--------~~~~~~~~~~-----~~~~~a~~~~~~~~~----------------~~~~~~~~~~~~~~~~a~~~~ 109 (392) ..... .......... ..+...+....+... .....+.......+.++.. + T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 159 (477) T protein:vir:84 81 GKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLD-R 159 (477) T ss_pred hcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhcccc-c Confidence 10000 0000000000 000000011111100 0001111111122233333 4 Q ss_pred cccccceecchhh-hhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCc-cccccccccccc----cccccceeeE Q lcl|Aclame:pro 110 TGEDGGLVIPQDI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMI-PFAEITEMGEIP----ETDNPKFSNV 183 (392) Q Consensus 110 ~~~~gg~~iP~~~-~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~E~~~~~----~~~~~~~~~v 183 (392) +...||++||+++ .+.|++.+++.++|++++++.++++..+.+.+|+..+++ .+.|++|++..+ +.++++|+++ T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeE Confidence 5556788887764 678999999999999999999999988888888765554 467899986432 2346899999 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccc---------------h------- Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-KQA---------------I------- 240 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-~~~---------------~------- 240 (392) ++++++++++++||+|+|+|+.+++++||.++|+++++.++|.++++|.|+.. +.+ . T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~ 319 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQ 319 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHH Confidence 99999999999999999999999999999999999999999999999988542 111 1 Q ss_pred hhHHHHHHHHHHHhhhccc-CCceEEEcHHHHHHHHHhhccCCceeecccc-------------cCCcccceecccceEE Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAIS-PNAILLTNQDGFNYLDKLKDKDGKYILQSDP-------------TQKNKKLFAGTNPVVV 306 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~-~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~-------------~~~~~~~~~g~~pv~~ 306 (392) ..++++++++. .+...+. +.++|+|||++|..|+++||++|||||+|+. ..+.+.+++|. ||++ T Consensus 320 ~~~~~i~~~~~-~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~-pVv~ 397 (477) T protein:vir:84 320 IIYQKIADAIQ-RVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGL-PVVT 397 (477) T ss_pred HHHHHHHHHHh-hccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhccc-ceEe Confidence 12333444443 3344444 4558999999999999999999999999863 23334567774 6655 Q ss_pred ecCcccc-cccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEE-ecccceEEEEeccc Q lcl|Aclame:pro 307 VSNRFLK-SKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQM-WDNEAAVYGEIDLS 384 (392) Q Consensus 307 ~~~~~~~-~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v-~~~~af~~l~~~~~ 384 (392) + ..+| +.+...+...++||||+++ .++. .++++.++++. ++.++++.|+++.++++.. ++|+||+.++.++. T Consensus 398 s--~~~p~~~~~~~d~~~i~~gd~~~~-~i~~-~~~~~~~~~~~--~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 398 D--PTLPTTLGTGTDQDVIHVLRASDL-ALFE-SSVRMRALQET--RAENLSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred c--CcccccccccCCcceEEEEEeceE-EEEe-eceeEEecccc--ccccceeeeeehhhhhhhhhccccceEEeecccc Confidence 3 3444 4455566778999999864 4554 57888888875 4457788888888888744 56999998887664 Q ss_pred -CCCCC Q lcl|Aclame:pro 385 -APVEQ 389 (392) Q Consensus 385 -a~~~~ 389 (392) +|+=. T Consensus 472 ~~~~~~ 477 (477) T protein:vir:84 472 TAPTFA 477 (477) T ss_pred cccccC Confidence 33322 No 62 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=1.7e-53 Score=309.88 Aligned_cols=338 Identities=12% Similarity=0.117 Sum_probs=235.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |+ +++++.++++.+.+++. .+.++++.++.+.+. ... ................+++ T Consensus 1 ~e-ei~~l~~~~~~l~~~~~------------~l~~~~d~~e~e~~~-------~~~----~~~~~~~~~~~~~~~~~~~ 56 (352) T protein:vir:78 1 ME-DIKQLETEKAGLQQRFN------------IVERQVQDIEEKEKA-------KVK----DKGEAYQSLNDNEKLVKAK 56 (352) T ss_pred Ch-hHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHH-------Hhh----hccccccccchhhhHHHHH Confidence 44 34444444443333222 222222222211110 000 0000011111112222344 Q ss_pred HHHHhcchhhHHH-HHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC Q lcl|Aclame:pro 81 MKALRNKPLNAEE-REFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) Q Consensus 81 ~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 159 (392) ..+++........ +.......+.++++.+++++||++||+++.++|++.+++.++|++++++.++++. .++. ...+ T Consensus 57 ~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~--~~p~-~~~~ 133 (352) T protein:vir:78 57 AEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EIPR-VSYT 133 (352) T ss_pred HHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCc--eEEE-EecC Confidence 5555544333222 2223344555777888888999999999999999999999999999999876542 3332 2344 Q ss_pred CccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH-HHhhccccccc- Q lcl|Aclame:pro 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNV-LILGVIEKLTK- 237 (392) Q Consensus 160 ~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~-~~~~~~~~~~~- 237 (392) .+.+.|++|++..+++ +++|++|++++++++++++||+|+|+|+.++|++||.++|+++++.+++. .+..|.++..+ T Consensus 134 ~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~ 212 (352) T protein:vir:78 134 LDDDDFITDVETAKEL-KLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEH 212 (352) T ss_pred CCcccccccccccccc-cccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccc Confidence 4678999999999876 69999999999999999999999999999999999999999999988655 44455443221 Q ss_pred -----------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEE Q lcl|Aclame:pro 238 -----------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVV 306 (392) Q Consensus 238 -----------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~ 306 (392) ++...||++++++. .++.+|+.+++|+||+.++.+|++++|.+|+|||. +.|.++|| +||++ T Consensus 213 g~l~~~~~~~~t~~~~~d~i~~~~~-~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~-----~~~~~llG-~PV~~ 285 (352) T protein:vir:78 213 MSFYNGSVKEVEGANMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFG-KPVVF 285 (352) T ss_pred cceeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEehHHHHHHHHHHhccCCcccc-----cCCccccc-cceEE Confidence 23345889998875 78999999999999999999999999999999985 34568887 57776 Q ss_pred ecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCC Q lcl|Aclame:pro 307 VSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~ 386 (392) +++ ...++||||+++|.. +.++.+..+.+ ..++++.|+++.|+|+++++|+||+.+++++++. T Consensus 286 ~~~-----------~~~~~~Gdf~~~~~~--~~~~~~~~~~~----~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~ 348 (352) T protein:vir:78 286 TDA-----------AVKPIVGDFNYFGIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTG 348 (352) T ss_pred ecC-----------CCceeEeehhhhhhh--hhhheeeeecc----ccCCeeEEEEEeeeCceeechhheEEEEeecccC Confidence 442 235789999987644 45566555443 3478999999999999999999999999998877 Q ss_pred CCCC Q lcl|Aclame:pro 387 VEQP 390 (392) Q Consensus 387 ~~~~ 390 (392) ..|. T Consensus 349 ~~~~ 352 (352) T protein:vir:78 349 SLPS 352 (352) T ss_pred CCCC Confidence 6666 No 63 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=2.1e-54 Score=314.89 Aligned_cols=279 Identities=11% Similarity=0.038 Sum_probs=228.6 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEe Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~ 185 (392) |..++...||++||+++..+|++.+++.|+|++++++++++++ .+.+|+..+++.++|++|+++++++ +++|+++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~--~~~ip~~~~~~~a~wv~Eg~~~~~s-~~~f~~v~l 77 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG--PVKGAVFSGVPRAKIVGEGEVKPSA-SVDVSAFTA 77 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCC--ceEEEEEeCCcceEEeeCCcccccc-ccceeeeEe Confidence 6677788999999999999999999999999999999888754 4566777888999999999999875 699999999 Q ss_pred chhheeeehhhHHHHHhhhHHH----HHHHHHHHHHHHHHHHHHHHHhhccccccc------------------cchhhH Q lcl|Aclame:pro 186 AVKDRAGILPLSRSLLQDSDQN----ILKYVTKWLGKKSKVTRNVLILGVIEKLTK------------------QAIKSL 243 (392) Q Consensus 186 ~~~~i~~~~~iS~e~l~ds~~~----l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~------------------~~~~~~ 243 (392) +++|++++++||+|+++|+..+ |+++|.++|++++++++|.++++|++..+. .+...+ T Consensus 78 ~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (315) T protein:vir:80 78 QPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSAT 157 (315) T ss_pred eeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccch Confidence 9999999999999999887765 789999999999999999999988653211 122346 Q ss_pred HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCc-----eeecccccCCcccceecccceEEecCcccccc--c Q lcl|Aclame:pro 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGK-----YILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK--G 316 (392) Q Consensus 244 d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~-----~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~--~ 316 (392) +++.+++.......+..+++|+|||+++..|++++|.+|+ |+| ++...+.+.+++| +||++.+. +|.. . T Consensus 158 ~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~-~~~~~g~~~tl~G-~PV~~~~~--~~~~~~~ 233 (315) T protein:vir:80 158 ADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRG-LNVGASST--VSGAPEM 233 (315) T ss_pred HHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccc-cccccCCCceecc-eeeEecCc--CCccccc Confidence 7787776554455566777899999999999999877665 555 4556667788888 56765443 3332 2 Q ss_pred ccCCcceEEEEehhhceeeeeccceEEEEeccch------hhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCC Q lcl|Aclame:pro 317 TTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQP 390 (392) Q Consensus 317 ~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~ 390 (392) .......++||||++ +.+..+++++++++++.+ +.|++|++.||+++|+|+++++|+||++|+.++++..++| T Consensus 234 ~~~~~~~~~~GDfs~-~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~ 312 (315) T protein:vir:80 234 SPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) T ss_pred ccccccEEEEeeccc-EEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCC Confidence 233456789999997 557889999999987643 4699999999999999999999999999999999888888 Q ss_pred CC Q lcl|Aclame:pro 391 QG 392 (392) Q Consensus 391 ~~ 392 (392) ++ T Consensus 313 ~~ 314 (315) T protein:vir:80 313 AE 314 (315) T ss_pred CC Confidence 88 No 64 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2.2e-52 Score=303.82 Aligned_cols=331 Identities=12% Similarity=0.071 Sum_probs=234.6 Q ss_pred CCH---HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHH Q lcl|Aclame:pro 1 MSK---ELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYR 77 (392) Q Consensus 1 M~k---el~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |.. ++.++.+++.++.+.+.....+ .++.+++.+.++.+.+++.+..+... + T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~--e~~~~~~~~~~~~~~~~~~~~~~~e~-----------------------~ 55 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATP--EEQEKLFEAAFTTMGDEILAKNEEEM-----------------------E 55 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccH--HHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------H Confidence 553 3444444444444433322111 12233344444555555543322111 1 Q ss_pred HHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEee Q lcl|Aclame:pro 78 DVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN 157 (392) Q Consensus 78 ~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ 157 (392) +++........+..+++.+... ....++..+||++||+++.+.|++.+.+.++|+++|++.++++ ...+++. T Consensus 56 ~~~~~~~~~~~lt~ee~~~~~~-----~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~---~~~i~~~ 127 (377) T protein:vir:96 56 RMFDLRDKNRELTAEEIKFFND-----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTA 127 (377) T ss_pred HHHHhccCCcccCHHHHHHHHH-----HHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEe Confidence 1111111111222223222221 2234567789999999999999999999999999999988753 4567777 Q ss_pred cCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 158 SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK 237 (392) Q Consensus 158 ~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~ 237 (392) .+.+.+.|++|+++.+++++++|+++++++++++++++||+++|+|+.+++++||.++|+++++.+++.+|++|.|+..| T Consensus 128 ~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P 207 (377) T protein:vir:96 128 ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQP 207 (377) T ss_pred cCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcc Confidence 88889999999988876678999999999999999999999999999999999999999999999999999999886554 Q ss_pred cch--------------------------------hhHHHHHHHHHHHhhh-----------cccCCceEEEcHHHHHHH Q lcl|Aclame:pro 238 QAI--------------------------------KSLDDIKDVLNVKLDP-----------AISPNAILLTNQDGFNYL 274 (392) Q Consensus 238 ~~~--------------------------------~~~d~~~~~~~~~~~~-----------~~~~~a~~v~~~~~~~~L 274 (392) .+. ...+.+++.+.. +.. ...++++|+|||.++..+ T Consensus 208 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~ 286 (377) T protein:vir:96 208 VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVP-VMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) T ss_pred eeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHH-HHHhhccccccccccccCceEEEEchhhHHhc Confidence 322 123344443322 222 223567899999997644 Q ss_pred HHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhh Q lcl|Aclame:pro 275 DKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) Q Consensus 275 ~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~ 354 (392) .|+|.|++ .+|.+.+++| +|+.++.+..+|. ..++||||++ |.+++|.+++++.+++. +|. T Consensus 287 ------~~~~~~~~--~~G~~~~~l~-~p~~v~~s~~~p~-------~~i~fgdf~~-Y~i~~r~~~~i~~~~~~--~~~ 347 (377) T protein:vir:96 287 ------EAKFTSRN--QFGEYVTVLP-HGITILESLAVET-------GKAIAFVANR-YDAFMATASTIEEYDQT--FAM 347 (377) T ss_pred ------cccccccC--CCCCceeccC-CCceEEecCCCCc-------ccEEEEEcCc-EEEEEecccEEEeehhh--hhh Confidence 47778876 3566777777 5666666666553 2489999998 88999999999999874 799 Q ss_pred cCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 355 ~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) +|++.||+.+|+|+++++|+||++|+++-. T Consensus 348 ~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 348 EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred cCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999999999999999999999999877 No 65 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=4.6e-52 Score=302.05 Aligned_cols=344 Identities=13% Similarity=0.075 Sum_probs=232.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhh-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDK-V-AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~-~-~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) |+ +|+++++++..+.+++...+.++. . ++.+++.+..+.+..++.... ... ...+.+. T Consensus 1 ik-~L~e~~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~--~~~-----------------~~~~~~~ 60 (390) T protein:vir:40 1 MN-NLDKKDSETLNISTAFLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQA--RKE-----------------VNREMND 60 (390) T ss_pred Cc-hHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHH--HHH-----------------HHHHHHH Confidence 65 466776666655554443333221 1 111111111111111111000 000 0000011 Q ss_pred HHHHHHhcc-hhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEee Q lcl|Aclame:pro 79 VFMKALRNK-PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN 157 (392) Q Consensus 79 a~~~~~~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ 157 (392) ......++. .+..+++... ......++.++||++||+++.+.|++.+++.++|+++|++++++++. ..+++. T Consensus 61 ~~~~~~~~~~~l~~~~r~~~-----~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~--~~i~~~ 133 (390) T protein:vir:40 61 NNVLASRGANALTSDESKYY-----NEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATT--EWIISV 133 (390) T ss_pred HHHHHhcCchhccHHHHHHH-----HHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCce--eEEEEE Confidence 111111111 1122222211 12234456678999999999999999999999999999999887654 445666 Q ss_pred cCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 158 SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK 237 (392) Q Consensus 158 ~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~ 237 (392) .+.+.+.|++|+++.++.++++|+++++++++++++++||+|+|+|+.+++++||.++|+++++.++|.++++|.|+..+ T Consensus 134 ~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P 213 (390) T protein:vir:40 134 GDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQP 213 (390) T ss_pred cCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCcc Confidence 77788999999998887678999999999999999999999999999999999999999999999999999999876544 Q ss_pred cch-------------------hhHHHHHHHHHHH---h---hhcccCCceEEEcHHHH-H---HHHHhhccCCceeecc Q lcl|Aclame:pro 238 QAI-------------------KSLDDIKDVLNVK---L---DPAISPNAILLTNQDGF-N---YLDKLKDKDGKYILQS 288 (392) Q Consensus 238 ~~~-------------------~~~d~~~~~~~~~---~---~~~~~~~a~~v~~~~~~-~---~L~~lkd~~g~~l~~~ 288 (392) .+. .++++..+++... + ...+..+++|+|||.++ . .+++++|.+|+|+|.. T Consensus 214 ~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~ 293 (390) T protein:vir:40 214 IGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI 293 (390) T ss_pred ceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc Confidence 321 1222333322211 1 11235678999999884 3 4558999999999854 Q ss_pred cccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCc Q lcl|Aclame:pro 289 DPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV 368 (392) Q Consensus 289 ~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~ 368 (392) . .+| +||++ +..+|. ..++||||++ |.+++|+++++.++++. +|.+|++.||++.|+|+ T Consensus 294 ~--------~~g-~pvv~--~~~~p~-------~~i~~Gd~s~-~~i~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~dg 352 (390) T protein:vir:40 294 L--------PVP-LEIVQ--SVAVPV-------GKAVAGRAKD-YFMGIGSEQVIRTSTEY--RLLDDETLYYAKQYANG 352 (390) T ss_pred C--------CCc-eeEEE--cCCCCC-------CcEEEEeece-EEEEeecceEEEecchh--hhhcCcEEEEEEEEeCC Confidence 2 245 45544 334442 3489999997 67899999999998864 79999999999999999 Q ss_pred EEecccceEEEEeccc--CCCCCCCC Q lcl|Aclame:pro 369 QMWDNEAAVYGEIDLS--APVEQPQG 392 (392) Q Consensus 369 ~v~~~~af~~l~~~~~--a~~~~~~~ 392 (392) ++++|+||++|++++. .|+.+|.+ T Consensus 353 ~v~~~~A~~~l~~~~~~~~~~~~~~~ 378 (390) T protein:vir:40 353 RPKDNSSFLVFDITGLEGSPAIDVNV 378 (390) T ss_pred EEecccceEEEEeeccCCCCCCCcce Confidence 9999999999999997 44677777 No 66 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.4e-52 Score=304.90 Aligned_cols=350 Identities=13% Similarity=0.105 Sum_probs=243.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAE--AEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~--~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) |+++|++..+.+++..+++.+.+..++.++ .+++.+.++.+++++....+ ... .+..+ T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~------------------~~~~~ 60 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQEIQNKAYVEMVDAMAADIMEQAK--KEA------------------RQEAD 60 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHHHHHHHHHHHHHH--HHH------------------HHHHH Confidence 999888888888877777766555433221 22223223333333221110 000 01111 Q ss_pred HHHHHHhcc-hhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEee Q lcl|Aclame:pro 79 VFMKALRNK-PLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN 157 (392) Q Consensus 79 a~~~~~~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~ 157 (392) ++...-++. .+..+++.+. +++..+++++||++||+++.+.|++.+++.++|+++|++.++++ ...+++. T Consensus 61 ~~~~~~~g~~~lt~~e~~~~------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~---~~~i~~~ 131 (383) T protein:vir:78 61 AYISASRTDKNITNEEIKFF------NDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL---RTKFLKS 131 (383) T ss_pred HHHHhcCChhhhhHHHHHHH------HHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC---ceEEEEE Confidence 111111111 1222222211 23456677889999999999999999999999999999988754 3456777 Q ss_pred cCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 158 SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK 237 (392) Q Consensus 158 ~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~ 237 (392) .+.+.+.|++|+++.++.++++|+++++++++++++++||+++|+|+.+++++||.++|+++++.++|.+|++|.|+..+ T Consensus 132 ~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP 211 (383) T protein:vir:78 132 ETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKP 211 (383) T ss_pred cCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCc Confidence 88888999999888776678999999999999999999999999999999999999999999999999999999886544 Q ss_pred cchh-----------------------hHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhc--c-CCceeecccc- Q lcl|Aclame:pro 238 QAIK-----------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKD--K-DGKYILQSDP- 290 (392) Q Consensus 238 ~~~~-----------------------~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd--~-~g~~l~~~~~- 290 (392) .+.. +++++.. +...+. .++.++.|+||..++..+++++. . .+.+.|+|.. T Consensus 212 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 289 (383) T protein:vir:78 212 IGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKT-TVNELT-DVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYT 289 (383) T ss_pred eeeeeccCCcccccccccccccccchhhhhhhHH-HHHHHH-HHHhccchhcccchhhhcCceEEEEcCcchhhhccchh Confidence 3221 1223332 223333 34455556666666666655542 1 1122333332 Q ss_pred ---cCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeC Q lcl|Aclame:pro 291 ---TQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) Q Consensus 291 ---~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~ 367 (392) .+|.+.+++|. |+.++++..+|. ..++||||++ |.+++|.+++++.+++. +|.+|++.||+.+|+| T Consensus 290 ~~~~~G~~~t~l~~-~~~iv~s~~~p~-------~~iifgdfs~-Y~i~~r~~~~i~~~~~~--~f~~d~~~f~~~~r~d 358 (383) T protein:vir:78 290 SLNANGVYVTALPF-NLNIIESLFVPE-------KKAISYVAER-YDALIGGPLDIGTYDQT--LAIEDLNLYAAKQFAY 358 (383) T ss_pred ccCCCCceeeecCC-CceEEecCCCCc-------ccEEEeeccc-eEEEecccceEEecchh--hhhcCceEEEEEEEEc Confidence 23444466664 555555555553 3489999998 78999999999998864 7999999999999999 Q ss_pred cEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 368 VQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 368 ~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +++++|+||++++++.+.+.++|+| T Consensus 359 G~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 359 GKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred CEEecCCeEEEEEEEecCCCCCCCC Confidence 9999999999999999999999999 No 67 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=4.5e-53 Score=307.59 Aligned_cols=270 Identities=11% Similarity=0.046 Sum_probs=223.5 Q ss_pred ccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEec Q lcl|Aclame:pro 107 SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYA 186 (392) Q Consensus 107 ~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~ 186 (392) +..+..++|++||++++.+|++.+++.++|++++++++++++.. .+|+.++++.++|++|+++++++ .++|++++++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~--~~p~~~~~~~a~wv~Eg~~~~~s-~~~f~~v~l~ 77 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ--REFVFDFDSDIDIVAENGKKTHG-GVSLDPVTIV 77 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCce--EEEEEecCcceEEeeCCcccccc-cccceeeEee Confidence 44555667888999999999999999999999999998887554 45666777889999999999875 6999999999 Q ss_pred hhheeeehhhHHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---------------------ccchhh Q lcl|Aclame:pro 187 VKDRAGILPLSRSLL---QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT---------------------KQAIKS 242 (392) Q Consensus 187 ~~~i~~~~~iS~e~l---~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~---------------------~~~~~~ 242 (392) ++|++++++||+|++ .|+.++++++|.++|++++++++|.+++.|.+... ..+... T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (300) T protein:vir:95 78 PLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNP 157 (300) T ss_pred eEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccch Confidence 999999999999999 46778999999999999999999999998843211 112334 Q ss_pred HHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcc Q lcl|Aclame:pro 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKA 322 (392) Q Consensus 243 ~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~ 322 (392) ++++.+++. .+...+..+++|+|||+++.+|++|||++|+|||.+....+.+.+++|. ||++.+ .++.. ...... T Consensus 158 ~~~i~~~~~-~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~-Pv~~s~--~v~~~-~~~~~~ 232 (300) T protein:vir:95 158 DESMEDAVG-MIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGL-AVDKNR--TVSYS-QTDPKN 232 (300) T ss_pred HHHHHHHHH-HhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecce-eeEEec--CCCCC-CCCCcc Confidence 667777654 5566677888899999999999999999999999888888888999995 666544 23332 223445 Q ss_pred eEEEEehhhceeeeeccceEEEEeccch------hhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 323 PLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 323 ~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) .+++|||++++.+..|++++++++++.+ ++|++|++.+|+++|+|+++.+|+||++|+.++. T Consensus 233 ~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 233 TAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred EEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 6788999998888899999999987654 3599999999999999999999999999986655 No 68 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=6.5e-53 Score=306.69 Aligned_cols=284 Identities=12% Similarity=0.086 Sum_probs=230.0 Q ss_pred HhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccc Q lcl|Aclame:pro 98 EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDN 177 (392) Q Consensus 98 ~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 177 (392) ....+.++....++.++|.++|+++.++|++.+++.++|++++++.+++++. +.+|+..+++.+.|++|+++++++ + T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~ 77 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTG--ISIPHWTGAVSASWTGEAERKPIT-K 77 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEcCCcceeEecCCCccccc-c Confidence 2333445555555666777788889999999999999999999998887654 556667788899999999999876 6 Q ss_pred cceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc------------------- Q lcl|Aclame:pro 178 PKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ------------------- 238 (392) Q Consensus 178 ~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~------------------- 238 (392) ++|++++++++|++++++||+|+|+|+.++++++|.++|++++++++|.++++|.|+..+. T Consensus 78 ~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~ 157 (330) T protein:vir:77 78 GSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLT 157 (330) T ss_pred ceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeeccccc Confidence 9999999999999999999999999999999999999999999999999999987764331 Q ss_pred -----chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc-----ccceecccceEEec Q lcl|Aclame:pro 239 -----AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN-----KKLFAGTNPVVVVS 308 (392) Q Consensus 239 -----~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~-----~~~~~g~~pv~~~~ 308 (392) ....++++.+++. .+...+..+++|+|||++|..|+++||++|+|||+++...+. +.+++| +||++++ T Consensus 158 ~~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G-~PV~~~~ 235 (330) T protein:vir:77 158 TASGPQGNAYLAVNNALS-LLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILG-RPTYVAD 235 (330) T ss_pred ccccccchhHHHHHHHHH-hhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecc-eeeEEec Confidence 1123567777654 556677888899999999999999999999999998766553 357788 5666544 Q ss_pred CcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch----------------hhhhcCceeEEEEEeeCcEEec Q lcl|Aclame:pro 309 NRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG----------------KAFTRNTLDLRAIQRDDVQMWD 372 (392) Q Consensus 309 ~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~----------------~~f~~~~~~~~~~~r~~~~v~~ 372 (392) .+|+ +...+...++||||+++ .++++++++++++++.+ +.|++|++.||+++|+|+++.+ T Consensus 236 --~~p~-~~~~~~~~~~~gd~s~~-~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~ 311 (330) T protein:vir:77 236 --NVVN-GTVGNRVVGVMGDFSQV-IWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVND 311 (330) T ss_pred --cccC-CCCCCccEEEEEecceE-EEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEec Confidence 3443 33445678999999985 58899999999877642 4599999999999999999999 Q ss_pred ccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 373 NEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 373 ~~af~~l~~~~~a~~~~~~~ 392 (392) |+||++|+.++ +.++|.- T Consensus 312 ~~a~~~i~~~~--~~~~~~~ 329 (330) T protein:vir:77 312 KDAFVKLTDQV--AGTDPEE 329 (330) T ss_pred ccceEEEEecc--CCcCCCC Confidence 99999987665 4444555 No 69 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=2.1e-51 Score=298.47 Aligned_cols=345 Identities=14% Similarity=0.126 Sum_probs=235.8 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHH Q lcl|Aclame:pro 1 MSK--ELRELLAKLEGKKEEVRSLMGEDKV--AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEY 76 (392) Q Consensus 1 M~k--el~el~~~~~~~~~e~~~~~~~~~~--~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) |.. ++++..+.+++..+++..+++++.. ++..++.+.++++..++...... + ...+. T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~---e----------------~~~~~ 61 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASDEEQSKAFGAMFDALSNDLQEEITA---E----------------INNRV 61 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH---H----------------HHHHH Confidence 663 2344444444555554444333221 11222232233333322211100 0 00111 Q ss_pred HHHHHHHHhc-chhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEE Q lcl|Aclame:pro 77 RDVFMKALRN-KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLE 155 (392) Q Consensus 77 ~~a~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~ 155 (392) ...+....++ ..+..+++.+. .++..++..+||++||+++.+.|++.+++.++|+++|++.++++ ...++ T Consensus 62 ~~~~~~~~r~~~~l~~ee~~~~------~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~---~~~i~ 132 (395) T protein:vir:95 62 VDNGILAKRSQDPLTSEERKFF------NDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI---KTRVI 132 (395) T ss_pred HHHHHHhhcCccccchHHHHHH------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEE Confidence 1111111121 12223332222 23445677889999999999999999999999999999988754 45677 Q ss_pred eecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 156 KNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL 235 (392) Q Consensus 156 ~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~ 235 (392) ...+.+.+.|++|+++.++.++++|++|++++++++++++||+++|+|+.+++++||.++|+++++.++|.+|++|.|+. T Consensus 133 ~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~ 212 (395) T protein:vir:95 133 KADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAA 212 (395) T ss_pred EecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCC Confidence 78888899999998887666679999999999999999999999999999999999999999999999999999998875 Q ss_pred c--ccch-------------------hhHHHHHHHHH------HHh-------hhcccCCceEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 236 T--KQAI-------------------KSLDDIKDVLN------VKL-------DPAISPNAILLTNQDGFNYLDKLKDKD 281 (392) Q Consensus 236 ~--~~~~-------------------~~~d~~~~~~~------~~~-------~~~~~~~a~~v~~~~~~~~L~~lkd~~ 281 (392) . |.+. .+++++...+. ..+ ...+..++.|+|||.++. |.. T Consensus 213 ~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~ 286 (395) T protein:vir:95 213 KTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQ 286 (395) T ss_pred CcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcC Confidence 2 2221 11222222111 111 224556788999999865 567 Q ss_pred CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEE Q lcl|Aclame:pro 282 GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLR 361 (392) Q Consensus 282 g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~ 361 (392) |+|+|+| ..|.+.+++|. |+.++.+..+|. ..++||||++ |.+++|++++++.+++ .+|.+|++.|| T Consensus 287 g~~~~~~--~~G~~~~~lg~-g~~v~~~~~~p~-------~~i~fgdfs~-y~i~~r~~~~i~~~~~--~~~~~d~~~f~ 353 (395) T protein:vir:95 287 ARYTYLT--ANGGFVTVLPY-NVTIITSEFVPE-------GKLVAFVTDR-YNAVRGGGLTVKKFDQ--TLALEDAVLFT 353 (395) T ss_pred Ccceecc--CCCcceeccCC-cceEEEcCCCCC-------CcEEEEeccc-EEEEEecceEEEeccc--hhhhCCcEEEE Confidence 9999997 46677788774 454445555553 2489999998 7899999999999986 47899999999 Q ss_pred EEEeeCcEEecccceEEEEeccc-CCCCCCCC Q lcl|Aclame:pro 362 AIQRDDVQMWDNEAAVYGEIDLS-APVEQPQG 392 (392) Q Consensus 362 ~~~r~~~~v~~~~af~~l~~~~~-a~~~~~~~ 392 (392) +.+|+|+++++++||++|+++.+ +|+.++++ T Consensus 354 ~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~ 385 (395) T protein:vir:95 354 AKTFAYGQPDDNKASAVYDLKVASAPRRQTSA 385 (395) T ss_pred EEEEECCEEeccccEEEEEeeccCCCCCCCCC Confidence 99999999999999999998853 44444444 No 70 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=7.9e-52 Score=300.74 Aligned_cols=334 Identities=14% Similarity=0.100 Sum_probs=228.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHH-HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVA-EAEQMMEE-VRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~-~~~~~~~e-i~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) |..++.+ ++.+.+.|....++.++.. +..++.++ ++.+.+++... ...++++ T Consensus 1 m~ik~~~---~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~e~~~ 54 (381) T protein:vir:10 1 MTINLSE---TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----------------------AKAEAER 54 (381) T ss_pred CchhhHH---HHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHH-----------------------HHHHHHH Confidence 8854332 2222222222222221111 11111111 11111111110 0112222 Q ss_pred HHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeec Q lcl|Aclame:pro 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) Q Consensus 79 a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~ 158 (392) ++.....+..+..+++.+. .+...+++++||++||+++.+.|++.+++.++|+++|++.++++ ...+++.. T Consensus 55 ~~~~~~~~~~lt~~e~~~~------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~---~~~i~~~~ 125 (381) T protein:vir:10 55 VSSLPKSAQSLSANQRSFF------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) T ss_pred HHHhccCcccccHHHHHHH------HHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc---ceEEEEec Confidence 2222112222233333222 23445677789999999999999999999999999999988754 45677788 Q ss_pred CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) Q Consensus 159 ~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~ 238 (392) +.+.+.|++|+++.++.++++|++|++++++++++++||+++|+|+.++|++||.++|+++++.+++.+|++|.|+..|. T Consensus 126 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~ 205 (381) T protein:vir:10 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) T ss_pred CCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCce Confidence 88999999999887766679999999999999999999999999999999999999999999999999999998876553 Q ss_pred chh-----------------------h-------HHHHHHHHHHHh-------hhcccCCceEEEcHHHHHHHHHhh--- Q lcl|Aclame:pro 239 AIK-----------------------S-------LDDIKDVLNVKL-------DPAISPNAILLTNQDGFNYLDKLK--- 278 (392) Q Consensus 239 ~~~-----------------------~-------~d~~~~~~~~~~-------~~~~~~~a~~v~~~~~~~~L~~lk--- 278 (392) +.. + ++.+...+ ..+ ...|..+++|+|||.++..|++++ T Consensus 206 Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~ 284 (381) T protein:vir:10 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVF-KYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL 284 (381) T ss_pred eeeeccCcccccccccccccccccccccccchhhHHHHHHHH-HhhccccccccccccCceEEEEccccHHhhccccccC Confidence 221 1 11222221 122 234677889999999999998766 Q ss_pred ccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCce Q lcl|Aclame:pro 279 DKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTL 358 (392) Q Consensus 279 d~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~ 358 (392) +++|+|+|..+ .|+.++.+..+|. ..++||||++ |.+++|++++++.+++. +|.+|++ T Consensus 285 ~~~G~~v~~l~------------~g~~vv~s~~~p~-------~~iifgDfs~-Y~i~~r~~~~i~~~~~~--~~~~d~~ 342 (381) T protein:vir:10 285 NANGVYVTALP------------FNLNVIESTVQEA-------GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMD 342 (381) T ss_pred CCCCceeecCC------------CCceEEecCCCCc-------CcEEEEeccc-EEEEEecccEEEeechh--HhhcCCe Confidence 66789887532 1333344444442 3489999997 78999999999999875 7999999 Q ss_pred eEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 359 DLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 359 ~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) .||+.+|+|+++++|+||++++++......++.+ T Consensus 343 ~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~ 376 (381) T protein:vir:10 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) T ss_pred EEEEEEEEcCEEecCceEEEEEEEecCCCcCccc Confidence 9999999999999999999999888644444444 No 71 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=7.9e-52 Score=300.74 Aligned_cols=334 Identities=14% Similarity=0.100 Sum_probs=228.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHH-HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVA-EAEQMMEE-VRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~-~~~~~~~e-i~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) |..++.+ ++.+.+.|....++.++.. +..++.++ ++.+.+++... ...++++ T Consensus 1 m~ik~~~---~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~e~~~ 54 (381) T protein:vir:95 1 MTINLSE---TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----------------------AKAEAER 54 (381) T ss_pred CchhhHH---HHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHH-----------------------HHHHHHH Confidence 8854332 2222222222222221111 11111111 11111111110 0112222 Q ss_pred HHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeec Q lcl|Aclame:pro 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) Q Consensus 79 a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~ 158 (392) ++.....+..+..+++.+. .+...+++++||++||+++.+.|++.+++.++|+++|++.++++ ...+++.. T Consensus 55 ~~~~~~~~~~lt~~e~~~~------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~---~~~i~~~~ 125 (381) T protein:vir:95 55 VSSLPKSAQSLSANQRSFF------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) T ss_pred HHHhccCcccccHHHHHHH------HHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc---ceEEEEec Confidence 2222112222233333222 23445677789999999999999999999999999999988754 45677788 Q ss_pred CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) Q Consensus 159 ~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~ 238 (392) +.+.+.|++|+++.++.++++|++|++++++++++++||+++|+|+.++|++||.++|+++++.+++.+|++|.|+..|. T Consensus 126 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~ 205 (381) T protein:vir:95 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) T ss_pred CCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCce Confidence 88999999999887766679999999999999999999999999999999999999999999999999999998876553 Q ss_pred chh-----------------------h-------HHHHHHHHHHHh-------hhcccCCceEEEcHHHHHHHHHhh--- Q lcl|Aclame:pro 239 AIK-----------------------S-------LDDIKDVLNVKL-------DPAISPNAILLTNQDGFNYLDKLK--- 278 (392) Q Consensus 239 ~~~-----------------------~-------~d~~~~~~~~~~-------~~~~~~~a~~v~~~~~~~~L~~lk--- 278 (392) +.. + ++.+...+ ..+ ...|..+++|+|||.++..|++++ T Consensus 206 Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~ 284 (381) T protein:vir:95 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVF-KYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL 284 (381) T ss_pred eeeeccCcccccccccccccccccccccccchhhHHHHHHHH-HhhccccccccccccCceEEEEccccHHhhccccccC Confidence 221 1 11222221 122 234677889999999999998766 Q ss_pred ccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCce Q lcl|Aclame:pro 279 DKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTL 358 (392) Q Consensus 279 d~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~ 358 (392) +++|+|+|..+ .|+.++.+..+|. ..++||||++ |.+++|++++++.+++. +|.+|++ T Consensus 285 ~~~G~~v~~l~------------~g~~vv~s~~~p~-------~~iifgDfs~-Y~i~~r~~~~i~~~~~~--~~~~d~~ 342 (381) T protein:vir:95 285 NANGVYVTALP------------FNLNVIESTVQEA-------GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMD 342 (381) T ss_pred CCCCceeecCC------------CCceEEecCCCCc-------CcEEEEeccc-EEEEEecccEEEeechh--HhhcCCe Confidence 66789887532 1333344444442 3489999997 78999999999999875 7999999 Q ss_pred eEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 359 DLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 359 ~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) .||+.+|+|+++++|+||++++++......++.+ T Consensus 343 ~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~ 376 (381) T protein:vir:95 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) T ss_pred EEEEEEEEcCEEecCceEEEEEEEecCCCcCccc Confidence 9999999999999999999999888644444444 No 72 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=3.9e-52 Score=302.41 Aligned_cols=296 Identities=14% Similarity=0.096 Sum_probs=232.3 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) .+...+.+....++ ........+.++.+.....+||++||+++.+.|++.+++.++|++++++.+++++. T Consensus 1 ~~~~~~~~~~~~~f----------~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:97 1 MEQTQKLKLNLQHF----------ASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) T ss_pred CccchhHHHHHHHH----------HHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCc Confidence 11111111111100 11111222344555566677899999999999999999999999999999887654 Q ss_pred ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) Q Consensus 150 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~ 229 (392) +.+|+..+.+.+.|++|+++++++ +++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+++ T Consensus 71 --~~ip~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l 147 (324) T protein:vir:97 71 --KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred --eEEEEEecCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 555667778889999999999875 69999999999999999999999999999999999999999999999999999 Q ss_pred hccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 230 ~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +|.++... .+..+++++.+++ ..+..+++.+++|+|||++|..|++++|++|+|+|.+ +. T Consensus 148 ~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~----~~ 222 (324) T protein:vir:97 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) T ss_pred ccCCCCccCccccccccccceeccccCCHHHHHHHH-HhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CC Confidence 98775432 1334688888876 4677888888999999999999999999999999964 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEE Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~ 362 (392) +.+++|. ||++.. ....+...++||||+++ .++++++++++++++.. +.|++|++.||+ T Consensus 223 ~~tl~G~-PV~~~~-------~~~~~~~~~~~gd~~~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:97 223 SDTLDGL-PVVNLK-------SSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred Cccccce-eeEeec-------CCCCCcceEEEEecccE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 5678885 565432 22344567899999975 58899999999988743 569999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++|+|+++.+|+||++|+.+....+.||+- T Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) T protein:vir:97 294 TMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred EEEeccEEecccceEEEEeccCCCCCCCCC Confidence 999999999999999999776655555555 No 73 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=4.3e-52 Score=302.19 Aligned_cols=272 Identities=13% Similarity=0.063 Sum_probs=220.5 Q ss_pred cccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEech Q lcl|Aclame:pro 108 GLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAV 187 (392) Q Consensus 108 ~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~ 187 (392) +.+.++||++||+++.+.|++.+++.++|++++++++++++ ...+|+..+++.++|++|+++++++ +++|+++++.+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~--~~~~p~~~~~~~a~wv~Eg~~~~~~-~~~f~~v~l~~ 77 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG--EQQYMTLTAPPRGEVVGEGAQKSES-TATFAPVTAIP 77 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCC--ceEEEEEeCCceeEEeecCcccccc-cceeeEEEEee Confidence 55677899999999999999999999999999999888765 4556777888899999999999975 69999999999 Q ss_pred hheeeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------------chhhH Q lcl|Aclame:pro 188 KDRAGILPLSRSLLQ---DSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ---------------------AIKSL 243 (392) Q Consensus 188 ~~i~~~~~iS~e~l~---ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~---------------------~~~~~ 243 (392) +|+++++++|+|+|+ |+..+|+++|.+++++++++++|.++++|.+..+.. +...+ T Consensus 78 ~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~ 157 (311) T protein:vir:81 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATP 157 (311) T ss_pred EEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchH Confidence 999999999999995 566789999999999999999999999986422110 11223 Q ss_pred HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccc-------- Q lcl|Aclame:pro 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK-------- 315 (392) Q Consensus 244 d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~-------- 315 (392) +..+..+...+........+|+|||.+|.+|++|||++|+|+|.+....+.+.+++|. ||++.+ ..+... T Consensus 158 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~-Pv~~~~-~i~~~~~~~~~~~~ 235 (311) T protein:vir:81 158 DLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGL-NAAVSD-TVRGGPEAVTASTG 235 (311) T ss_pred HHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecce-eEEecc-cccccccccccccc Confidence 3334334344455455556799999999999999999999999998888889999985 565432 222111 Q ss_pred --cccCCcceEEEEehhhceeeeeccceEEEEeccch-----hhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 316 --GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 316 --~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~-----~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) ....+...++||||++ |.+..+++++++++++.+ ++|++|++.||+++|+|++|.+|+||++|+..+.| T Consensus 236 ~~~~~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 236 VYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred hhcccCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 2234556789999998 568889999999987643 46999999999999999999999999999977766 No 74 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=4.1e-52 Score=302.32 Aligned_cols=266 Identities=12% Similarity=0.052 Sum_probs=220.7 Q ss_pred cccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhh Q lcl|Aclame:pro 110 TGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKD 189 (392) Q Consensus 110 ~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~ 189 (392) -..+||+++|+++..+|++.+++.++|+++|++++++++. ..+|+.++.+.++|++|+++++++ +++|++++++++| T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~~~f~~v~l~~~k 77 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCc--eEEEEEecCcceEEecCCcccccc-ccceeEEEEeeee Confidence 3456789999999999999999999999999999887654 455667778899999999999976 5899999999999 Q ss_pred eeeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c---------------------cchhhH Q lcl|Aclame:pro 190 RAGILPLSRSLLQ---DSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--K---------------------QAIKSL 243 (392) Q Consensus 190 i~~~~~iS~e~l~---ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~--~---------------------~~~~~~ 243 (392) +++++++|+|+|. |+..+|++||.++|++++++++|.++++|.+..+ + .+...+ T Consensus 78 ~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:16 78 VEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHH Confidence 9999999999995 5668999999999999999999999998843211 0 011224 Q ss_pred HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcce Q lcl|Aclame:pro 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) Q Consensus 244 d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~ 323 (392) +++.+++. .+..++..+++|+|||++|..|+++||++|+|+|++....+.+.+++|. ||++.+ .+|.. ...+... T Consensus 158 ~~i~~~~~-~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~-PV~~~~--~v~~~-~~~~~~~ 232 (298) T protein:vir:16 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGL-PVDVNK--TVSDM-SLTQRDR 232 (298) T ss_pred HHHHHHHH-HhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecce-eeEEec--ccccc-cCCCccE Confidence 56666554 5566777888999999999999999999999999998888888999985 666543 33332 2344567 Q ss_pred EEEEehhhceeeeeccceEEEEeccch------hhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 324 LIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 324 ~~~Gd~~~~~~~~~~~~~~~~~~~~~~------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) ++||||++++.+..|++++++++++.+ ++|++|++.+|+++|+|+++.+|+||++|+..+ T Consensus 233 ~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 899999998888899999999987643 469999999999999999999999999998555 No 75 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=7.3e-52 Score=300.93 Aligned_cols=270 Identities=14% Similarity=0.069 Sum_probs=223.8 Q ss_pred cccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEech Q lcl|Aclame:pro 108 GLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAV 187 (392) Q Consensus 108 ~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~ 187 (392) +++.++||++||++++.+|++.+++.++|+++|++.+++++. ..+|+..+++.+.|++|+++++++ +++|+++++++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~E~~~~~~s-~~~f~~v~l~~ 77 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNG--SKEFTFTLDSDIDVVAENGKKTHG-GLSLEPVTIVP 77 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEecCcceEEeecCcccccc-ccceeeEEeee Confidence 556778999999999999999999999999999999887654 455667778889999999999875 69999999999 Q ss_pred hheeeehhhHHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cchhh Q lcl|Aclame:pro 188 KDRAGILPLSRSLL---QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------------QAIKS 242 (392) Q Consensus 188 ~~i~~~~~iS~e~l---~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~----------------------~~~~~ 242 (392) +|+++++++|+|++ .|+.++|.++|.+++++++++++|.++++|.+..+. .+... T Consensus 78 ~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (303) T protein:vir:97 78 IKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDA 157 (303) T ss_pred EEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccch Confidence 99999999999999 467789999999999999999999999988532110 12345 Q ss_pred HHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCC-cccceecccceEEecCccccc-ccccCC Q lcl|Aclame:pro 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK-NKKLFAGTNPVVVVSNRFLKS-KGTTAK 320 (392) Q Consensus 243 ~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~-~~~~~~g~~pv~~~~~~~~~~-~~~~~~ 320 (392) ++++.+++. .+..++..++.|+|||+++.+|+++||++|+|+|+|+...+ .+.+++| +||++.+ .+|. .....+ T Consensus 158 ~~~i~~~~~-~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G-~Pv~~s~--~v~~~~~~~~~ 233 (303) T protein:vir:97 158 DANIEAAVN-LIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSING-LKSSVNT--TVGAGADEAES 233 (303) T ss_pred HHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecc-eeeEEec--ccCCccccCCC Confidence 778888765 45556677788999999999999999999999999986544 4567888 5676644 2333 334445 Q ss_pred cceEEEEehhhceeeeeccceEEEEeccch------hhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) ...++||||+..|.++.|++++++++++.+ ++|++|++.||+++|+|++|++|+||++|+.... T Consensus 234 ~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 234 KDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred ccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 677899999988889999999999988643 4699999999999999999999999999985433 No 76 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=1.1e-50 Score=294.58 Aligned_cols=370 Identities=16% Similarity=0.182 Sum_probs=229.8 Q ss_pred CCHHH-------HHHHHHH---HHHHHHHHHHhhhhh----H----HHHHHHHHHHHHHHHHHHHH-------HHHHHHH Q lcl|Aclame:pro 1 MSKEL-------RELLAKL---EGKKEEVRSLMGEDK----V----AEAEQMMEEVRSLQKKIDLQ-------RSLDEAE 55 (392) Q Consensus 1 M~kel-------~el~~~~---~~~~~e~~~~~~~~~----~----~~~~~~~~ei~~l~~~i~~~-------~~~~~~~ 55 (392) +.++| .++++++ +++.+++...+++.+ . ++++++.++++++++++..+ +...+.. T Consensus 8 l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e~ 87 (466) T protein:vir:80 8 LAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQL 87 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2222222 222223322222111 0 12233333333333333222 1111111 Q ss_pred HHHhhcccc-ccccc------cchhhHHHHHHHHHHhcchhh-----HHHHHHHHhhhhhhhhccccccccceecchhhh Q lcl|Aclame:pro 56 TEERNNGRE-VETRN------VDGEMEYRDVFMKALRNKPLN-----AEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQ 123 (392) Q Consensus 56 ~~~~~~~~~-~~~~~------~~~~~~~~~a~~~~~~~~~~~-----~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~ 123 (392) ......... ..... ........+.+.+.+...... ...+... ...........+.++|+++||+++. T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~vP~~~~ 166 (466) T protein:vir:80 88 NNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFL-AQVRTLAQQKRAVSGAELTIPDVML 166 (466) T ss_pred HHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHH-HHHHHHhhhhhhhccccccccHHHH Confidence 111000000 00000 000011111111111111000 0001100 0111111222334456789999999 Q ss_pred hHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhh Q lcl|Aclame:pro 124 TQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQD 203 (392) Q Consensus 124 ~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~d 203 (392) +.|++.+++.++|++++++.++++ ...++.....+.+.|++|+++++++ +++|++|++.+++++++++||+++|+| T Consensus 167 ~~i~~~l~~~~~l~~~~~v~~~~g---~~~~~~~~~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~d 242 (466) T protein:vir:80 167 ELLRDNMHRYSKLISKVRLRPLKG---TARQNIAGAIPEGVWTEAVANLNEL-SLSFSQIEVDGYKVGGFIPIPNSTLED 242 (466) T ss_pred HHHHHhhhhhhhhhhheeeeecCc---eeEeeeecCCcceeecccccccccc-cccccceeecceeeeeehhhhHHHHhc Confidence 999999999999999999988764 3456667777889999999999876 599999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh----------------------hHH----------------- Q lcl|Aclame:pro 204 SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIK----------------------SLD----------------- 244 (392) Q Consensus 204 s~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~~----------------------~~d----------------- 244 (392) +.+++++||.++|+++++.++|.+|++|.|+..+.|.. .+. T Consensus 243 s~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (466) T protein:vir:80 243 SDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFS 322 (466) T ss_pred chHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHH Confidence 99999999999999999999999999998876543321 011 Q ss_pred HHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhh---ccCCceeecccccCCcccceecccceEEecCcccccccccCCc Q lcl|Aclame:pro 245 DIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLK---DKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) Q Consensus 245 ~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lk---d~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~ 321 (392) +++..+.........+..+|+||+.++..|..++ +++|.+++.++ + ...++|. ||++.+ .+|. T Consensus 323 ~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~--~--~~~i~G~-pvv~s~--~~~~------- 388 (466) T protein:vir:80 323 ELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN--N--TMPIVGG-DIVILD--FIPD------- 388 (466) T ss_pred HHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCC--C--ccccccc-ceeecC--ccCc------- Confidence 1111111122333345567999999999999888 66778877653 2 2247775 565432 3332 Q ss_pred ceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 322 APLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 322 ~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +.++||||+. |.+++|.++++..+++. +|.+|++.||+++|+|+++++|+||++++++...|.+++.+ T Consensus 389 ~~~~~g~~~~-y~i~~r~~~~i~~~~~~--~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~ 456 (466) T protein:vir:80 389 NDIIGGYGSL-YLLAERADIKLAQSEHV--RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITF 456 (466) T ss_pred cceeeecccc-EEEEeecceEEEechhh--hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceee Confidence 3489999986 67999999999998864 79999999999999999999999999999999888888777 No 77 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=9.6e-52 Score=300.28 Aligned_cols=272 Identities=13% Similarity=0.089 Sum_probs=229.1 Q ss_pred hhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccce Q lcl|Aclame:pro 101 LEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKF 180 (392) Q Consensus 101 ~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~ 180 (392) .-..+....+.+.||.+||++++++|++.+++.++|+++++++++++....+ +... ++.++|++|+++.+++ +++| T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~--~~~~-~~~a~~v~E~~~~~~~-~~~f 76 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEF--TFMS-GVGAFWVDEAERIQTS-KPTF 76 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEE--EEEc-CCceeeeecCcccccc-ccce Confidence 2334455666677889999999999999999999999999999987665444 4444 5778999999999876 5999 Q ss_pred eeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------cchhhHHH Q lcl|Aclame:pro 181 SNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------QAIKSLDD 245 (392) Q Consensus 181 ~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---------------~~~~~~d~ 245 (392) ++|++.+++++++++||+|+++|+.++++++|.+.|++++++++|.++++|.++..+ .+...+++ T Consensus 77 ~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~ 156 (299) T protein:vir:41 77 TKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDD 156 (299) T ss_pred eEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHH Confidence 999999999999999999999999999999999999999999999999998776433 12346889 Q ss_pred HHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEE Q lcl|Aclame:pro 246 IKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLI 325 (392) Q Consensus 246 ~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~ 325 (392) +.+++. .+...+..+++|+|||++|.+|+++||++|+|||++.+..+. .+++| +||++.++ +|. ..+...++ T Consensus 157 l~~~~~-~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-~~l~G-~PV~~~~~--~~~---~~~~~~~~ 228 (299) T protein:vir:41 157 LNEAIG-LIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV-DDVLG-LPIAYTPK--YTF---GDKDISEL 228 (299) T ss_pred HHHHHH-hhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-ceecc-eeeEEecc--cCC---CCCceEEE Confidence 999875 577788888999999999999999999999999998876554 57887 56766443 332 23566799 Q ss_pred EEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 326 IGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 326 ~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) ||||++ +.++++++++++++++.+ +.|++|++.||+++|+|+++.+|+||++++.+++- T Consensus 229 ~gdfs~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 229 VGDWNQ-AYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 999987 468999999999988643 35899999999999999999999999999988877 No 78 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=7.9e-51 Score=295.26 Aligned_cols=335 Identities=14% Similarity=0.100 Sum_probs=226.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAE-QMMEE-VRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~-~~~~e-i~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) |..++.+ ++++.+.++...+++++.++.+ ++.++ .+.+.++.....+ .++++ T Consensus 1 m~~kl~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~e~~~ 54 (381) T protein:vir:10 1 MTINLSE---TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAK-----------------------AEAER 54 (381) T ss_pred CchhHHH---HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHHHH-----------------------HHHHH Confidence 8865432 2222222222222222211111 11111 1111111111100 11122 Q ss_pred HHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeec Q lcl|Aclame:pro 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) Q Consensus 79 a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~ 158 (392) ++........+..+++... .++..+++.+||++||+++.+.|++.+++.|+|+++|++.++++ ...+++.. T Consensus 55 ~~~~~~~~~~l~~~e~~~~------~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~---~~~i~~~~ 125 (381) T protein:vir:10 55 VSSLPKSAQTLSANQRNFF------MDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) T ss_pred HHHhcccccccCHHHHHHH------HHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc---ceEEEeec Confidence 2221111122223332221 23456677889999999999999999999999999999988753 35567778 Q ss_pred CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) Q Consensus 159 ~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~ 238 (392) +.+.+.|++|.++.+++++++|+++++++++++++++||+++|+|+.+++++||.++|+++++.+++.+|++|+|+..|. T Consensus 126 ~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~ 205 (381) T protein:vir:10 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) T ss_pred CCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCce Confidence 88889999998887766789999999999999999999999999999999999999999999999999999999876554 Q ss_pred chh-----------------------hHHHHHHHH------HHH-------hhhcccCCceEEEcHHHHHHHHHhh---c Q lcl|Aclame:pro 239 AIK-----------------------SLDDIKDVL------NVK-------LDPAISPNAILLTNQDGFNYLDKLK---D 279 (392) Q Consensus 239 ~~~-----------------------~~d~~~~~~------~~~-------~~~~~~~~a~~v~~~~~~~~L~~lk---d 279 (392) +.. ++.++.... ... ....+..++.|+|||.++..|++++ + T Consensus 206 Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~ 285 (381) T protein:vir:10 206 GLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) T ss_pred eeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCC Confidence 321 111111111 011 1224667889999999999888655 8 Q ss_pred cCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCcee Q lcl|Aclame:pro 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) Q Consensus 280 ~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) ++|+|+|..+ +|. +| +.+..+|. ..++||||++ |.+++|.+++++.+++. +|.+|++. T Consensus 286 ~~G~~v~~lp---------~g~-~v--v~~~~~p~-------~~i~fGDfs~-Y~i~~r~~~~i~~~~~~--~~~~d~~~ 343 (381) T protein:vir:10 286 ANGVYVTALP---------FNL-NV--IESTVQEA-------GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDL 343 (381) T ss_pred CCCceeecCC---------CCc-ee--EEcCCCCc-------CcEEEEEccc-EEEEEecccEEEeechh--hhhcCceE Confidence 8999998642 232 23 33445543 3489999997 78999999999999875 79999999 Q ss_pred EEEEEeeCcEEecccceEEEEeccc---CCCCCCCC Q lcl|Aclame:pro 360 LRAIQRDDVQMWDNEAAVYGEIDLS---APVEQPQG 392 (392) Q Consensus 360 ~~~~~r~~~~v~~~~af~~l~~~~~---a~~~~~~~ 392 (392) ||+..|+|+++++|+||++++++.. ++++.|.- T Consensus 344 f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~ 379 (381) T protein:vir:10 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEE 379 (381) T ss_pred EEEEEEEcCEEecCCcEEEEEEeecCCccccccccc Confidence 9999999999999999999887753 33333333 No 79 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=1.1e-51 Score=300.06 Aligned_cols=270 Identities=14% Similarity=0.116 Sum_probs=225.3 Q ss_pred HhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccc Q lcl|Aclame:pro 98 EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDN 177 (392) Q Consensus 98 ~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 177 (392) .......+.+..++..||++||+++.+.|++.+++.++|++++++.+++++. +.+|+..+.+.+.|++|+++++++ . T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~ 77 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQK--KKFTYLAKGVGAYWVSETERIQTS-K 77 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEeCCcceEEeecCcccccc-c Confidence 2222334556667778899999999999999999999999999999887644 556667778899999999999875 6 Q ss_pred cceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------- Q lcl|Aclame:pro 178 PKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-------------------- 237 (392) Q Consensus 178 ~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~-------------------- 237 (392) ++|++++++++|++++++||+|+++|+.++|++||.++|++++++++|.++++|.|+..+ T Consensus 78 ~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 157 (304) T protein:vir:10 78 PEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVT 157 (304) T ss_pred ceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998775332 Q ss_pred cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccc Q lcl|Aclame:pro 238 QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) Q Consensus 238 ~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~ 317 (392) .+...++++.+++ ..+..++..+++|+|||++|..|+++||++|+|+|+++ +.+++| +||+++++ +|. T Consensus 158 ~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G-~PV~~~~~--~~~--- 225 (304) T protein:vir:10 158 DTNNLYVDLSALM-ATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMG-LPLSYTGA--DVY--- 225 (304) T ss_pred cccchHHHHHHHH-HHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----Cccccc-eeeEEecc--ccc--- Confidence 1234578888865 46788888899999999999999999999999999764 457888 46665443 332 Q ss_pred cCCcceEEEEehhhceeeeeccceEEEEeccch--------------hhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 318 ~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~--------------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) ..++..++||||+++ .+++|++++++++++.. +.|++|++.||+++|+|+++.+|+||++|+.+- T Consensus 226 ~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 226 DKKKSLALMGDWDYA-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhhE-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234567899999984 58999999999988643 469999999999999999999999999998554 No 80 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=1.1e-51 Score=300.06 Aligned_cols=270 Identities=14% Similarity=0.116 Sum_probs=225.3 Q ss_pred HhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccc Q lcl|Aclame:pro 98 EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDN 177 (392) Q Consensus 98 ~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 177 (392) .......+.+..++..||++||+++.+.|++.+++.++|++++++.+++++. +.+|+..+.+.+.|++|+++++++ . T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~ 77 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQK--KKFTYLAKGVGAYWVSETERIQTS-K 77 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEeCCcceEEeecCcccccc-c Confidence 2222334556667778899999999999999999999999999999887644 556667778899999999999875 6 Q ss_pred cceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------- Q lcl|Aclame:pro 178 PKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-------------------- 237 (392) Q Consensus 178 ~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~-------------------- 237 (392) ++|++++++++|++++++||+|+++|+.++|++||.++|++++++++|.++++|.|+..+ T Consensus 78 ~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 157 (304) T protein:vir:94 78 PEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVT 157 (304) T ss_pred ceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998775332 Q ss_pred cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccc Q lcl|Aclame:pro 238 QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) Q Consensus 238 ~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~ 317 (392) .+...++++.+++ ..+..++..+++|+|||++|..|+++||++|+|+|+++ +.+++| +||+++++ +|. T Consensus 158 ~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G-~PV~~~~~--~~~--- 225 (304) T protein:vir:94 158 DTNNLYVDLSALM-ATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMG-LPLSYTGA--DVY--- 225 (304) T ss_pred cccchHHHHHHHH-HHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----Cccccc-eeeEEecc--ccc--- Confidence 1234578888865 46788888899999999999999999999999999764 457888 46665443 332 Q ss_pred cCCcceEEEEehhhceeeeeccceEEEEeccch--------------hhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 318 ~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~--------------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) ..++..++||||+++ .+++|++++++++++.. +.|++|++.||+++|+|+++.+|+||++|+.+- T Consensus 226 ~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 226 DKKKSLALMGDWDYA-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhhE-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234567899999984 58999999999988643 469999999999999999999999999998554 No 81 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=2.3e-51 Score=298.23 Aligned_cols=266 Identities=12% Similarity=0.065 Sum_probs=220.4 Q ss_pred cccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhh Q lcl|Aclame:pro 110 TGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKD 189 (392) Q Consensus 110 ~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~ 189 (392) -..+||++||+++.++|++.+++.++|++++++.+++++. +.+|+..+++.++|++|+++++++ +++|++++++++| T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k 77 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCc--eEEEEEecCcceEEeeCCcccccc-ccceeEEEEeeeE Confidence 2336789999999999999999999999999999887654 455667778889999999999975 6999999999999 Q ss_pred eeeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----c-------------------cchhhH Q lcl|Aclame:pro 190 RAGILPLSRSLLQ---DSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----K-------------------QAIKSL 243 (392) Q Consensus 190 i~~~~~iS~e~l~---ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~----~-------------------~~~~~~ 243 (392) +++++++|+|+|. |+..+|.++|.++|++++++++|.++++|.+..+ . .+...+ T Consensus 78 ~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:94 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred EEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHH Confidence 9999999999996 5567899999999999999999999998743110 0 011235 Q ss_pred HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcce Q lcl|Aclame:pro 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) Q Consensus 244 d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~ 323 (392) +++.+++. .+..++..+++|+|||++|.+|+++||++|+|+|++....+.+.+++|. ||++.+ .+|.. ...+... T Consensus 158 ~~i~~~~~-~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~-PV~~~~--~v~~~-~~~~~~~ 232 (298) T protein:vir:94 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGL-PVDVNK--TVSDM-SLTQRDR 232 (298) T ss_pred HHHHHHHH-hhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecce-eeEEec--ccccc-cCCCccE Confidence 56666554 5677777888999999999999999999999999998888989999995 565543 33332 2344567 Q ss_pred EEEEehhhceeeeeccceEEEEeccch------hhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 324 LIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 324 ~~~Gd~~~~~~~~~~~~~~~~~~~~~~------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) +++|||++++.+..|++++++++++.+ ++|++|++.+|+++|+|+++.+|+||++++-.+ T Consensus 233 ~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 899999998888899999999987643 369999999999999999999999999998554 No 82 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=3.1e-51 Score=297.46 Aligned_cols=296 Identities=14% Similarity=0.097 Sum_probs=229.3 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) .+...+.+....++. .........++.+.....++|.+||+++.+.|++.+++.++|++++++.+++++. T Consensus 1 ~~~~~~~~~~~~~f~----------~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:10 1 MEQTQKLKLNLQHFA----------SNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) T ss_pred CCCchHHHHHHHHHH----------HHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 111111110111111 0111112223444555566778999999999999999999999999999887654 Q ss_pred ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) Q Consensus 150 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~ 229 (392) +.+|+..+++.+.|++|+++++++ +++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.+++ T Consensus 71 --~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l 147 (324) T protein:vir:10 71 --KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred --eEEEEEeCCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 455666778889999999999975 58999999999999999999999999999999999999999999999999999 Q ss_pred hccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 230 ~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +|.++... .+..+++++.+++ ..+..++...++|+|||++|..|++++|++|+|+|.+ +. T Consensus 148 ~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) T protein:vir:10 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) T ss_pred hcCCCCccCccccccccccceeccccCCHHHHHHHH-HhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecC----CC Confidence 88765432 1344678888876 4678888888899999999999999999999999864 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEE Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~ 362 (392) +.+++|. ||++.. ....+...+++|||+++ .++++++++++++++.. +.|++|++.||+ T Consensus 223 ~~~l~G~-PV~~~~-------~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 293 (324) T protein:vir:10 223 SDTLDGL-PVVNLK-------SSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred Cccccce-eEEeec-------CCCCCcceEEEEecccE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 5678885 565432 22345667999999985 57899999999988743 469999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++|+|+++.+|+||++|+.+...-..||+. T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:10 294 TMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred EEEEccEEecccceEEEEeccCCCCCCCCC Confidence 999999999999999998655444445555 No 83 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=3.9e-51 Score=296.94 Aligned_cols=296 Identities=14% Similarity=0.096 Sum_probs=229.1 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) .+...+.+-...++. .........++.+.....++|.+||+++.+.|++.+++.++|++++++.+++++. T Consensus 1 ~~k~~~~~~~~~~~~----------~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:99 1 MEQTQKLKLNLQHFA----------SNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTE 70 (324) T ss_pred CCCchHhhHHHHHHH----------HHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 111111110011110 1111122233444555566777999999999999999999999999999887654 Q ss_pred ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) Q Consensus 150 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~ 229 (392) +.+|+..+++.+.|++|++.++++ .++|++++++++|+++++++|+|+++|+.+++++||.++|++++++++|.+++ T Consensus 71 --~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l 147 (324) T protein:vir:99 71 --KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred --eEEEEEecCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 556667778899999999999976 58999999999999999999999999999999999999999999999999999 Q ss_pred hccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 230 ~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) +|.++... .+..+++++.+++ ..+..++..+++|+|||++|..|++++|++|+|+|.+ +. T Consensus 148 ~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) T protein:vir:99 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) T ss_pred hcCCCCccCccccccccccceeccccCCHHHHHHHH-HhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CC Confidence 88765432 1344578888876 4678888888899999999999999999999999864 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEE Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~ 362 (392) +.+++|. ||++.. ....+...+++|||+++ .++++++++++++++.. +.|++|++.||+ T Consensus 223 ~~~l~G~-PVv~~~-------~~~~~~~~~i~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~ 293 (324) T protein:vir:99 223 SDTLDGL-PVVNLK-------SSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred Cccccce-eEEeec-------CCCCCcceEEEEecccE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 5678885 565432 12344567999999975 58899999999988743 459999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++|+|+++.+|+||++|+.+...-..||+. T Consensus 294 ~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~ 323 (324) T protein:vir:99 294 TMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred EEEEccEEecccceEEEEeccCCCCCCCCC Confidence 999999999999999998655444444444 No 84 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=2.3e-51 Score=298.20 Aligned_cols=286 Identities=13% Similarity=0.050 Sum_probs=228.5 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) ++.+.. -..+.+.+...+..++|.+||+++.++|++.+++.++|++++++.++++.. +.+|+.++.+.+ T Consensus 1 ~~~~~~---------~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~ip~~~~~~~a 69 (318) T protein:vir:24 1 MAAGTA---------FAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTG--QKIPHWVGDVSA 69 (318) T ss_pred CCCCCC---------CCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCc--eEEEEEeCCcce Confidence 322211 112334445555667778899999999999999999999999998887654 556667778899 Q ss_pred cccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----- 238 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~----- 238 (392) +|++|+++++++ +++|++++++++|+++++++|+|+|+|+.++++++|.++|++++++++|.++++|.++..+. T Consensus 70 ~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~ 148 (318) T protein:vir:24 70 QWIGEGDMKPIT-KGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQT 148 (318) T ss_pred EEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccc Confidence 999999999875 69999999999999999999999999999999999999999999999999999987754321 Q ss_pred -----------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceeccc----c Q lcl|Aclame:pro 239 -----------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTN----P 303 (392) Q Consensus 239 -----------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~----p 303 (392) ....+++....+...+...+..+++|+|||++|..|+++||++|+|||+++...+.+..+.|.+ | T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~p 228 (318) T protein:vir:24 149 TKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARP 228 (318) T ss_pred cccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEe Confidence 1122333333334567788889999999999999999999999999999988887776544432 3 Q ss_pred eEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEEeeCcEEe Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMW 371 (392) Q Consensus 304 v~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~r~~~~v~ 371 (392) |.+. .....++..++||||++ +.++++++++++++++.+ +.|++|++.||+++|+|+++. T Consensus 229 v~~~-------~~~~~~~~~~~~gdfs~-~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 300 (318) T protein:vir:24 229 TILS-------DHVVEGTTVGFMGDFSQ-LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCN 300 (318) T ss_pred eEEe-------CCCCCCccEEEEeecce-EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 3332 22334566789999997 458899999999988643 459999999999999999999 Q ss_pred cccceEEEEecccCCCCC Q lcl|Aclame:pro 372 DNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 372 ~~~af~~l~~~~~a~~~~ 389 (392) +|+||++|+.++++...- T Consensus 301 ~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 301 DAEAFVALTNVVSGGGEG 318 (318) T ss_pred cccceEEEEeeccCCCCC Confidence 999999999888877655 No 85 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=6.3e-51 Score=295.81 Aligned_cols=296 Identities=14% Similarity=0.093 Sum_probs=228.9 Q ss_pred ccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 67 TRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) .+..+...+....|... .......++.+..+...+|++||+++.+.|++.+++.++|++++++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~-------------~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~ 67 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASN-------------NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) T ss_pred CCcchhhhHHHHHHHHH-------------hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc Confidence 00000000001111111 11112233445556677889999999999999999999999999999887 Q ss_pred CCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 147 TRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNV 226 (392) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~ 226 (392) ++. +.+|+..+.+.++|++|+++++++ +++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|. T Consensus 68 ~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~ 144 (324) T protein:vir:78 68 GTE--KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) T ss_pred CCc--eEEEEEecCcceeEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHH Confidence 654 455667778899999999999975 69999999999999999999999999999999999999999999999999 Q ss_pred HHhhccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc Q lcl|Aclame:pro 227 LILGVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT 291 (392) Q Consensus 227 ~~~~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~ 291 (392) +++.|.++... .+..+++++.+++. .+..++...++|+|||++|.+|++++|++|+|+|.+ T Consensus 145 a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~-~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~--- 220 (324) T protein:vir:78 145 AGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD--- 220 (324) T ss_pred HHhccCCCCCcCccccccccccceeccccccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC--- Confidence 99988765432 23346888888764 678888889999999999999999999999999864 Q ss_pred CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCcee Q lcl|Aclame:pro 292 QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLD 359 (392) Q Consensus 292 ~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~ 359 (392) +.+.+++|. ||++.. ....+...+++|||+++ .++++++++++++++.. +.|++|++. T Consensus 221 -~~~~~l~G~-PV~~~~-------~~~~~~~~~~~gd~~~~-~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~ 290 (324) T protein:vir:78 221 -RNSDSLDGL-PVVNLK-------SSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred -CCCCcccce-eeEeeC-------CCCCCcceEEEEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEE Confidence 446678885 555432 22345667899999984 58899999999988743 469999999 Q ss_pred EEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 360 LRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 360 ~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ||+++|+|+++.+|+||++|+........||-- T Consensus 291 ~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:78 291 LRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred EEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 999999999999999999988644433333322 No 86 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=6.3e-51 Score=295.81 Aligned_cols=296 Identities=14% Similarity=0.093 Sum_probs=228.9 Q ss_pred ccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 67 TRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) .+..+...+....|... .......++.+..+...+|++||+++.+.|++.+++.++|++++++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~-------------~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~ 67 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASN-------------NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) T ss_pred CCcchhhhHHHHHHHHH-------------hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc Confidence 00000000001111111 11112233445556677889999999999999999999999999999887 Q ss_pred CCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 147 TRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNV 226 (392) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~ 226 (392) ++. +.+|+..+.+.++|++|+++++++ +++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|. T Consensus 68 ~~~--~~~p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~ 144 (324) T protein:vir:96 68 GTE--KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) T ss_pred CCc--eEEEEEecCcceeEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHH Confidence 654 455667778899999999999975 69999999999999999999999999999999999999999999999999 Q ss_pred HHhhccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc Q lcl|Aclame:pro 227 LILGVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT 291 (392) Q Consensus 227 ~~~~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~ 291 (392) +++.|.++... .+..+++++.+++. .+..++...++|+|||++|.+|++++|++|+|+|.+ T Consensus 145 a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~-~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~--- 220 (324) T protein:vir:96 145 AGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD--- 220 (324) T ss_pred HHhccCCCCCcCccccccccccceeccccccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC--- Confidence 99988765432 23346888888764 678888889999999999999999999999999864 Q ss_pred CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCcee Q lcl|Aclame:pro 292 QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLD 359 (392) Q Consensus 292 ~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~ 359 (392) +.+.+++|. ||++.. ....+...+++|||+++ .++++++++++++++.. +.|++|++. T Consensus 221 -~~~~~l~G~-PV~~~~-------~~~~~~~~~~~gd~~~~-~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~ 290 (324) T protein:vir:96 221 -RNSDSLDGL-PVVNLK-------SSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred -CCCCcccce-eeEeeC-------CCCCCcceEEEEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEE Confidence 446678885 555432 22345667899999984 58899999999988743 469999999 Q ss_pred EEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 360 LRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 360 ~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ||+++|+|+++.+|+||++|+........||-- T Consensus 291 ~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 291 LRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred EEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 999999999999999999988644433333322 No 87 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=4.3e-51 Score=296.73 Aligned_cols=288 Identities=12% Similarity=0.060 Sum_probs=225.8 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) +.-......+ .....+.++++.++...|| ++|+++.++|++.+++.++|+++++++++++.. +.+|+..+++.+ T Consensus 1 ~~~~~~r~~~---~~~~~e~~a~~~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~~p~~~~~~~a 74 (326) T protein:vir:42 1 MAVNPDRTTP---FLGVNDPKVAQTGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTG--QKIPHWTGDVSA 74 (326) T ss_pred CCCCccchhh---hcCcchhhheeccccCCcc-eechhhHHHHHHHHHhcchhhhhcceeeccCCc--eEEEEEeCCcce Confidence 1111111111 1223355666665555444 689999999999999999999999999887654 555667788899 Q ss_pred cccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA---- 239 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~---- 239 (392) +|++|+++++++ +++|++++++++++++++++|+|+++|+.+++++||.++|++++++++|.++++|.|+..+.+ T Consensus 75 ~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~ 153 (326) T protein:vir:42 75 SWIGEGDMKPIT-KGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQT 153 (326) T ss_pred EEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccc Confidence 999999999976 699999999999999999999999999999999999999999999999999999887543221 Q ss_pred ----------------hhhH-HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCccc-----c Q lcl|Aclame:pro 240 ----------------IKSL-DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKK-----L 297 (392) Q Consensus 240 ----------------~~~~-d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~-----~ 297 (392) ...+ +..+......+...+..+++|+|||++|.+|++|||++|+|||++....+.+. + T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~ 233 (326) T protein:vir:42 154 TKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGR 233 (326) T ss_pred ccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCce Confidence 1112 22222233456777788899999999999999999999999999877766554 4 Q ss_pred eecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEEe Q lcl|Aclame:pro 298 FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQR 365 (392) Q Consensus 298 ~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~r 365 (392) ++|. ||++.+ .+| .+...++||||++++ ++++++++++++++.+ +.|++|++.||+++| T Consensus 234 l~G~-pv~~~~--~~~-----~~~~~~~~Gd~s~~~-~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~ 304 (326) T protein:vir:42 234 IVAR-PTILSD--HVA-----SGTVVGYQGDFRQLV-WGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAE 304 (326) T ss_pred eeee-eEEEcC--CCC-----CCceEEEEeecceEE-EEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEE Confidence 6664 565533 222 345667899999865 7899999999887643 449999999999999 Q ss_pred eCcEEecccceEEEEecccCCC Q lcl|Aclame:pro 366 DDVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 366 ~~~~v~~~~af~~l~~~~~a~~ 387 (392) +|+++.+|+||++|+.++++++ T Consensus 305 ~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 305 YAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred eccEEecccceEEEeeccccCC Confidence 9999999999999998888877 No 88 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=9.4e-51 Score=294.85 Aligned_cols=295 Identities=14% Similarity=0.102 Sum_probs=227.6 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) .+...+.+..+.+ +..........++.+..+...++.+||+++.++|++.+++.++|++++++.+++++. T Consensus 1 ~~~~~~~~~~~~~----------f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~ 70 (324) T protein:vir:93 1 MEQTQKLKLNLQH----------FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) T ss_pred CchhHHHHHHHHH----------HHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 1111111111111 111111222234455555566778999999999999999999999999999887654 Q ss_pred ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) Q Consensus 150 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~ 229 (392) +.+|+..+++.++|++|++.++++ .++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+++ T Consensus 71 --~~ip~~~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l 147 (324) T protein:vir:93 71 --KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred --eEEEEEecCcceeeecCCcccccc-ccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 445667778889999999999976 58999999999999999999999999999999999999999999999999999 Q ss_pred hccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 230 ~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) .|.++... .+..+++++.+++. .+..++...++|+|||++|..|++++|++|+|+|.+ +. T Consensus 148 ~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~----~~ 222 (324) T protein:vir:93 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) T ss_pred cCCCCCCcCccccccccccceeccccccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CC Confidence 88765422 13346888888765 667788888899999999999999999999999864 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEE Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~ 362 (392) +.+++|. ||++..+ ...+...+++|||+++ .++++++++++++++.. +.|++|++.||+ T Consensus 223 ~~~l~G~-PVv~~~~-------~~~~~~~i~~gdfs~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~ 293 (324) T protein:vir:93 223 SDSLDGL-PVVNLKS-------SNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred CCcccce-eeEeecC-------CCCCcceEEEEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 6678885 5654321 2344567899999974 58899999999998753 569999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++|+|+++.+|+||++|+....-. +.+.| T Consensus 294 ~~r~d~~v~~~~a~~~l~~a~~~~-~~~~~ 322 (324) T protein:vir:93 294 TMHVALHIADDKAFAKLVPADKRT-DSVPG 322 (324) T ss_pred EEEeccEEecccceEEEecccccC-CCCCC Confidence 999999999999999987443333 23344 No 89 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=3.6e-51 Score=297.15 Aligned_cols=325 Identities=16% Similarity=0.185 Sum_probs=221.0 Q ss_pred HHHHHHHHHHHhhccccc---cccccchhhHHHH-HHHHHHhcchhhHHHHHH---HHhhhhhhhhccccccccceecch Q lcl|Aclame:pro 48 QRSLDEAETEERNNGREV---ETRNVDGEMEYRD-VFMKALRNKPLNAEEREF---LEDDLEQRAMSGLTGEDGGLVIPQ 120 (392) Q Consensus 48 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-a~~~~~~~~~~~~~~~~~---~~~~~~~~a~~~~~~~~gg~~iP~ 120 (392) +.+......+........ ............+ +.......+......+.. .......++... +..+||++||+ T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~-~~~~Gg~lvP~ 79 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAIST-AAGSGGALIPQ 79 (366) T ss_pred CcccccccccccccccccccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccc-cccCCccccch Confidence 000000000000000000 0000000000001 110011111111111111 111112233333 44578999999 Q ss_pred hhhhHHHHhHHhhhhhhhh-cceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHH Q lcl|Aclame:pro 121 DIQTQINELARSFDALEQY-VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRS 199 (392) Q Consensus 121 ~~~~~ii~~~~~~~~l~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e 199 (392) ++.++|++.+++.++|+.+ ++++++ .++.+.+|+.++++.++|++|+++++++ +++|++|+++++|++++++||+| T Consensus 80 ~~~~~ii~~l~~~s~l~~lg~~~v~~--~~g~~~~p~~t~~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~~~~~~iS~e 156 (366) T protein:vir:57 80 NMQNEVIELLRDRTVVRILGARSIPL--PNGNLSMPRLSGGATAGYVGEGKDVVAT-GATFDDVKLSAKTMIALVPVSNQ 156 (366) T ss_pred hHHHHHHHHHhhhcchhhhceeeeec--CCCceEEEEEeCCcceeeeccCcccccc-ccceeEEEEeeEEEEEeehhhHH Confidence 9999999999999999988 666554 4567778888888999999999999976 69999999999999999999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccc-----------------h---hhHHHHHHHHHH--Hhhh Q lcl|Aclame:pro 200 LLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-KQA-----------------I---KSLDDIKDVLNV--KLDP 256 (392) Q Consensus 200 ~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-~~~-----------------~---~~~d~~~~~~~~--~~~~ 256 (392) +|+|+.++++++|+++|++++++++|.+++.|.|+.. +.+ . ...+..++.+.. .... T Consensus 157 ll~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~ 236 (366) T protein:vir:57 157 LIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSN 236 (366) T ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccc Confidence 9999999999999999999999999999999987532 111 1 112223333322 2345 Q ss_pred cccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeee Q lcl|Aclame:pro 257 AISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLF 336 (392) Q Consensus 257 ~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~ 336 (392) .+..+++|+|||.+|.+|+++||++|+|+|.+ . .+.+++|. ||+++ +..+.+.+...+...++||||++ |.+. T Consensus 237 ~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~-~---~~g~l~G~-Pvv~s-~~ip~~~~~~~~~~~i~~gdfs~-~~i~ 309 (366) T protein:vir:57 237 SNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPE-M---SQGILKGY-PIQRT-SAIPANLGDDGNESEIYFCDFND-VVIG 309 (366) T ss_pred cccccCEEEecHHHHHHHHhhhccCCceeccC-C---CCCeecce-eeEEc-cccccccccCCCccEEEEEecce-EEEE Confidence 56778999999999999999999999999953 2 23468874 66543 33333344556677899999997 5699 Q ss_pred eccceEEEEeccch---------hhhhcCceeEEEEEeeCcEEecccceEEEE-ecc Q lcl|Aclame:pro 337 KREDMELASTDVGG---------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDL 383 (392) Q Consensus 337 ~~~~~~~~~~~~~~---------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~ 383 (392) ++++++++++++.. +.|++|++.+|+++|+|+++.||+||++++ +++ T Consensus 310 ~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 310 EDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 99999999988632 569999999999999999999999999998 788 No 90 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=7.4e-51 Score=295.42 Aligned_cols=283 Identities=11% Similarity=0.021 Sum_probs=220.9 Q ss_pred HHhhhhhhhhcccccc------ccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccc Q lcl|Aclame:pro 97 LEDDLEQRAMSGLTGE------DGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMG 170 (392) Q Consensus 97 ~~~~~~~~a~~~~~~~------~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~ 170 (392) +-...+.++...++.. .++.++|+++.++|++.+++.++|++++++++++++ .+.+++..+.+.+.|++|+. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~--~~~~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYG--ETIIPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCC--ceEEEEEeCCceeEeecCcc Confidence 1122233333333322 334589999999999999999999999999888764 45567777777788877763 Q ss_pred --------ccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---- Q lcl|Aclame:pro 171 --------EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ---- 238 (392) Q Consensus 171 --------~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~---- 238 (392) .+++ +.++|++++++++|++++++||+|+++|+.+++++||+++|++++++++|.++++|.|..++. T Consensus 79 ~~~~~e~~~~~~-~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g 157 (333) T protein:vir:78 79 SNEQREGGLKPL-SGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQG 157 (333) T ss_pred cccccccccccc-cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccc Confidence 3444 578999999999999999999999999999999999999999999999999999988754321 Q ss_pred -------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHH---hhccCCceeecccccCCccc Q lcl|Aclame:pro 239 -------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK---LKDKDGKYILQSDPTQKNKK 296 (392) Q Consensus 239 -------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~---lkd~~g~~l~~~~~~~~~~~ 296 (392) +..+++++++++.......+...++|+|||.+|..|++ ++|++|+|+|.+....+.+. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~ 237 (333) T protein:vir:78 158 IDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTG 237 (333) T ss_pred ccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCc Confidence 22357788887654444445666789999999987764 78999999999988888899 Q ss_pred ceecccceEEecCcccccc-cccCCcceEEEEehhhceeeeeccceEEEEeccch---------hhhhcCceeEEEEEee Q lcl|Aclame:pro 297 LFAGTNPVVVVSNRFLKSK-GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG---------KAFTRNTLDLRAIQRD 366 (392) Q Consensus 297 ~~~g~~pv~~~~~~~~~~~-~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~---------~~f~~~~~~~~~~~r~ 366 (392) +++|. ||++.+ ..+.+. ........++||||++ |.+.++++++++++++.. +.|++|++.||+++|+ T Consensus 238 ~l~G~-Pv~~~~-~i~~~~~~~~~~~~~~~~gD~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~ 314 (333) T protein:vir:78 238 DVLGL-PAQFGR-AVGGDLGAAVDSKTRIIGGDFSQ-LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTF 314 (333) T ss_pred eeece-eeEEcc-ccCCCccccCCCccEEEEEeccc-EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEE Confidence 99985 565433 233232 2334466799999998 558899999999998742 4699999999999999 Q ss_pred CcEEecccceEEEEecccC Q lcl|Aclame:pro 367 DVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 367 ~~~v~~~~af~~l~~~~~a 385 (392) |+++.+|+||++|+.++++ T Consensus 315 d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 315 GWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred ccEEecccceEEEeccCCC Confidence 9999999999999755444 No 91 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=1e-50 Score=294.68 Aligned_cols=286 Identities=13% Similarity=0.028 Sum_probs=220.7 Q ss_pred HHhhhhhhhhcc------ccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEee------cCCcccc Q lcl|Aclame:pro 97 LEDDLEQRAMSG------LTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN------SDMIPFA 164 (392) Q Consensus 97 ~~~~~~~~a~~~------~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~------~~~~~~~ 164 (392) +-...+.++... ...+.++.+||+++.++|++.+++.++|+++|++.+++++...++.... .+...+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 111112222222 2223445689999999999999999999999999999877666555432 1345577 Q ss_pred ccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc------ Q lcl|Aclame:pro 165 EITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ------ 238 (392) Q Consensus 165 ~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~------ 238 (392) |++|+++++++ +++|++|+++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|.|+.++. T Consensus 81 ~~~Eg~~~~~~-~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~ 159 (338) T protein:vir:78 81 EQREGGTKPLS-GTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (338) T ss_pred ccccccccccc-ccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc Confidence 88999999875 58999999999999999999999999999999999999999999999999999988753311 Q ss_pred -----------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHH---HhhccCCceeecccccCCcccce Q lcl|Aclame:pro 239 -----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD---KLKDKDGKYILQSDPTQKNKKLF 298 (392) Q Consensus 239 -----------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~---~lkd~~g~~l~~~~~~~~~~~~~ 298 (392) ....++++.+++...........++|+|||.++..|+ +++|++|+|||.+....+.+.++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l 239 (338) T protein:vir:78 160 TNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDL 239 (338) T ss_pred cccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCcee Confidence 1224566666654443444556668999999988774 57899999999998888889999 Q ss_pred ecccceEEecCccccc-c-cccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEE Q lcl|Aclame:pro 299 AGTNPVVVVSNRFLKS-K-GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQ 364 (392) Q Consensus 299 ~g~~pv~~~~~~~~~~-~-~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~ 364 (392) +|. ||++.+ .+|+ . ........++||||++ |.++++++++++++++.+ +.|++|++.+|+++ T Consensus 240 ~G~-PV~~~~--~ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 315 (338) T protein:vir:78 240 LGL-PVQFGK--AVGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEV 315 (338) T ss_pred eee-eEEEcc--ccCccccccCCcccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEE Confidence 996 565533 3343 2 2233456789999987 568999999999988642 56999999999999 Q ss_pred eeCcEEecccceEEEEecccCCC Q lcl|Aclame:pro 365 RDDVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 365 r~~~~v~~~~af~~l~~~~~a~~ 387 (392) |+|+++.+|+||++|+..+++.+ T Consensus 316 r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 316 TFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EeccEeecccceEEEecccCCCC Confidence 99999999999999996555444 No 92 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=4.4e-51 Score=296.68 Aligned_cols=283 Identities=12% Similarity=0.054 Sum_probs=223.5 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) +..+. ....+.+.+...+...+|.+||+++.++|++.+++.++|+++++++++++.. +.+++..+++.+ T Consensus 1 ~~~~~---------~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a 69 (320) T protein:vir:10 1 MAAGT---------AFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTG--QKIPHWIGDVSA 69 (320) T ss_pred CCCCc---------cCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEeCCcce Confidence 11110 0122444455555666777899999999999999999999999999887654 455667778889 Q ss_pred cccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA---- 239 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~---- 239 (392) .|++|++++|++ +++|++++++++|++++++||+|+|+|+.++++++|.++|++++++++|+++++|.++..+.. T Consensus 70 ~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~ 148 (320) T protein:vir:10 70 QWIGEGDMKPIT-KGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQT 148 (320) T ss_pred EEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccc Confidence 999999999975 699999999999999999999999999999999999999999999999999999877543211 Q ss_pred ----------hh------hHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCccc-----ce Q lcl|Aclame:pro 240 ----------IK------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKK-----LF 298 (392) Q Consensus 240 ----------~~------~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~-----~~ 298 (392) .. .+++.+..+...+...+..+++|+|||++|.+|+++||++|+|||++....+.+. ++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i 228 (320) T protein:vir:10 149 TKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRI 228 (320) T ss_pred cccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCcee Confidence 11 1222333344567888899999999999999999999999999999876665543 45 Q ss_pred ecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEEee Q lcl|Aclame:pro 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRD 366 (392) Q Consensus 299 ~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~r~ 366 (392) +|. ||++.+ .+ ..+...++||||+++ .++.+++++++++++.+ +.|++|++.||+++|+ T Consensus 229 ~g~-pv~~~~--~~-----~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~ 299 (320) T protein:vir:10 229 VSR-PTILSD--HV-----ADGTTVGYMGDFRNV-IWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEY 299 (320) T ss_pred eee-eeEecC--CC-----CCCceEEEEeecceE-EEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEee Confidence 553 454432 22 234556889999975 58899999999988754 4599999999999999 Q ss_pred CcEEecccceEEEEecccCCC Q lcl|Aclame:pro 367 DVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 367 ~~~v~~~~af~~l~~~~~a~~ 387 (392) |+++.+|+||++|+..+++++ T Consensus 300 d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 300 AFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred ccEEecccceEEEEeccCCCC Confidence 999999999999986665444 No 93 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.7e-50 Score=293.49 Aligned_cols=271 Identities=13% Similarity=0.081 Sum_probs=224.1 Q ss_pred HhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccc Q lcl|Aclame:pro 98 EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDN 177 (392) Q Consensus 98 ~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 177 (392) ......++.+..+.+++|.+||+++.++|++.+++.++|+++++++++++.. ...+++..+...+.|++|+++++++ . T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~-~ 78 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQ-EKTVYVQTDGISAYWVNETEKIKTD-K 78 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCc-cEEEEEEcCCceeEEeecCcccccc-c Confidence 1111224455566677888999999999999999999999999999987654 3455677888899999999999876 5 Q ss_pred cceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------chhhH Q lcl|Aclame:pro 178 PKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ--------------AIKSL 243 (392) Q Consensus 178 ~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~--------------~~~~~ 243 (392) ++|+++++++++++++++||+|+++|+.++++++|.++|++++++++|.++++|.++.++. +..++ T Consensus 79 ~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~ 158 (297) T protein:vir:95 79 PEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINY 158 (297) T ss_pred cceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCH Confidence 8999999999999999999999999999999999999999999999999999987764432 33468 Q ss_pred HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcce Q lcl|Aclame:pro 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) Q Consensus 244 d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~ 323 (392) +++++++. .+..++..+++|+|||++|.+|++++|++|+|+|++. +.+++|. ||++.. ....+.+. T Consensus 159 ~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~-----~~~l~G~-Pv~~~~-------~~~~~~~~ 224 (297) T protein:vir:95 159 DNILKLQD-ALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA-----ANTIDGI-TTVDLK-------SARFEKGD 224 (297) T ss_pred HHHHHHHH-HhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC-----CCcccce-eeEeec-------CCCCCCce Confidence 89998765 5677788888999999999999999999999999753 4567775 565432 12345567 Q ss_pred EEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEEeeCcEEecccceEEEEecccCCC Q lcl|Aclame:pro 324 LIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 324 ~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~ 387 (392) ++||||++ +.++++++++++++++.+ +.|++|++.+|+++|+|+++.+|+||++|+ .++|+ T Consensus 225 ~~~gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~--~at~~ 297 (297) T protein:vir:95 225 LLAGDFDN-LIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLT--PAERV 297 (297) T ss_pred EEEEeccc-EEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEe--ecCCC Confidence 89999997 458899999999988753 459999999999999999999999999876 44444 No 94 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.7e-50 Score=293.48 Aligned_cols=270 Identities=16% Similarity=0.139 Sum_probs=217.5 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccc----cccccee Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPE----TDNPKFS 181 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~~~ 181 (392) +...+.++||++||+++.+.|++.+++.++|++++++++++++ .+.+|+..+++.+.|++|++..++ .++++|+ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~--~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~ 78 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTK--TTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCC--cEEEEEEeCCcceEEeeccccccccccccccccee Confidence 7777888899999999999999999999999999999888765 455666777889999999986543 3468999 Q ss_pred eEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------cchh Q lcl|Aclame:pro 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--------------------QAIK 241 (392) Q Consensus 182 ~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~--------------------~~~~ 241 (392) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|.|.... .+.. T Consensus 79 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) T protein:vir:25 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) T ss_pred eEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccch Confidence 99999999999999999999999999999999999999999999999998775321 1111 Q ss_pred hHHHHHHHHHH---HhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccccccc Q lcl|Aclame:pro 242 SLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT 318 (392) Q Consensus 242 ~~d~~~~~~~~---~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~ 318 (392) .++++.+.+.. .........+.|+|||.+|..|+++||++|+|||+|+ +++| +||++.++ ++. . T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~-------~l~G-~Pv~~~~~--~~~---~ 225 (305) T protein:vir:25 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD-------SFAG-FRTFFNRN--GAW---D 225 (305) T ss_pred hhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC-------cccc-cceEEcCc--cCC---C Confidence 23334433322 2233334445699999999999999999999999753 6777 46666443 332 3 Q ss_pred CCcceEEEEehhhceeeeeccceEEEEeccch--------hhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCC Q lcl|Aclame:pro 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG--------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQP 390 (392) Q Consensus 319 ~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~--------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~ 390 (392) .+...++||||++ |.++++++++++++++.. +.|++|++.+|++.|+|+.+.+|+||++++....++++|. T Consensus 226 ~~~~~~~~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa 304 (305) T protein:vir:25 226 ADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) T ss_pred CCccEEEEEecce-EEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCC Confidence 4556799999998 568999999999987642 4699999999999999999999999999998877765555 Q ss_pred C Q lcl|Aclame:pro 391 Q 391 (392) Q Consensus 391 ~ 391 (392) + T Consensus 305 ~ 305 (305) T protein:vir:25 305 A 305 (305) T ss_pred C Confidence 5 No 95 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1.4e-50 Score=293.83 Aligned_cols=283 Identities=11% Similarity=0.021 Sum_probs=225.0 Q ss_pred HHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 97 LEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 97 ~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) +....+.+.+...+...+|.++|+++...|++.+++.++|++++++.+++++. +.+|+..+++.+.|++|+++++++ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~--~~ip~~~~~~~a~wv~Eg~~~~~s- 77 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATG--IVIPHWTGDVSAQWIGEGDMKPIT- 77 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEcCCcceEEecCCcccccc- Confidence 23333334444444444455677788899999999999999999999887654 555667778899999999999875 Q ss_pred ccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------cchhh Q lcl|Aclame:pro 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--------------QAIKS 242 (392) Q Consensus 177 ~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~--------------~~~~~ 242 (392) +++|++|+++++|++++++||+|+|+|+.++++++|+++|++++++++|.++++|.++..+ .+... T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~ 157 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAY 157 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccch Confidence 6999999999999999999999999999999999999999999999999999998776432 23345 Q ss_pred HHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcc-----cceecccceEEecCcccccccc Q lcl|Aclame:pro 243 LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNK-----KLFAGTNPVVVVSNRFLKSKGT 317 (392) Q Consensus 243 ~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~-----~~~~g~~pv~~~~~~~~~~~~~ 317 (392) ++++++++. .+...+..+++|+||++++..|+++||++|+|||+++...+.+ .+++|. ||++.++ +| T Consensus 158 ~~~~~~~~~-~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~-Pv~~s~~--~~---- 229 (397) T protein:vir:23 158 QGLGVSGLT-KLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGR-PTILSDH--VA---- 229 (397) T ss_pred hHHHHHHHH-hhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeee-eEEEeCC--CC---- Confidence 666777654 5677788899999999999999999999999999998766654 367774 5655432 22 Q ss_pred cCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 318 TAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 318 ~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) .+...++||||+++ .+.++++++++++++.+ +.|++|++.||+++|+|+++++|+||++++.++.. T Consensus 230 -~g~~~~~~gDfs~~-~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~ 307 (397) T protein:vir:23 230 -EGDVVGYAGDFSQI-IWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVL 307 (397) T ss_pred -CCceEEEEeecceE-EEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccc Confidence 34556799999985 47889999999987643 45999999999999999999999999999986642 Q ss_pred CCC-----CCCC Q lcl|Aclame:pro 386 PVE-----QPQG 392 (392) Q Consensus 386 ~~~-----~~~~ 392 (392) ... ++.| T Consensus 308 ~~~~~~~~~~~~ 319 (397) T protein:vir:23 308 TTYALDLDGASA 319 (397) T ss_pred ceeeecccccCc Confidence 221 2222 No 96 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=5.1e-50 Score=290.84 Aligned_cols=295 Identities=14% Similarity=0.100 Sum_probs=225.2 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCc Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~ 149 (392) .+...+.+-...++.. ........++.+......+|.+||+++.++|++.+++.++|++++++.+++++. T Consensus 1 ~~~~~~~~~~~~~f~~----------~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~ 70 (324) T protein:vir:96 1 MEQTQKLKLNLQHFAS----------NNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) T ss_pred CCcchhhhHHHHHHHH----------hhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 1111111101111100 011112223344444556777999999999999999999999999999988654 Q ss_pred ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) Q Consensus 150 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~ 229 (392) +.+|+..+.+.+.|++|++.++++ +++|+++++++++++++++||+|+++|+.++++++|.++|++++++++|.+++ T Consensus 71 --~~~p~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l 147 (324) T protein:vir:96 71 --KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred --eEEEEEecCcceeeecCCcccccc-ccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 445666777889999999999875 69999999999999999999999999999999999999999999999999999 Q ss_pred hccccccc---------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCc Q lcl|Aclame:pro 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) Q Consensus 230 ~~~~~~~~---------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~ 294 (392) .|.++... .+..+++++++++. .+...+..+++|+|||++|.+|++++|++|+|+|.+ +. T Consensus 148 ~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~----~~ 222 (324) T protein:vir:96 148 LNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) T ss_pred hcCCCCCcCccccccccccceecccccchHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CC Confidence 88665432 12346889998765 567778888899999999999999999999999864 34 Q ss_pred ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch------------hhhhcCceeEEE Q lcl|Aclame:pro 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) Q Consensus 295 ~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~------------~~f~~~~~~~~~ 362 (392) +.+++|. ||++.. ....+...++||||++ +.++++++++++++++.. +.|++|++.||+ T Consensus 223 ~~~l~G~-PV~~~~-------~~~~~~~~~~~gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~ 293 (324) T protein:vir:96 223 SDSLDGL-PVVNLK-------SSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred CCcccce-eeEeec-------CCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 6678885 555422 2234456799999997 458899999999988743 569999999999 Q ss_pred EEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++|+|+++.+|+||++|+......+. .+| T Consensus 294 ~~r~d~~v~~~~a~~~l~~a~~~~~~-~~~ 322 (324) T protein:vir:96 294 TMHVALHIADDKAFAKLVPADKRTDS-VPG 322 (324) T ss_pred EEEeccEEecccceEEEecccccCCC-CCC Confidence 99999999999999988844433333 333 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.5e-49 Score=288.33 Aligned_cols=271 Identities=14% Similarity=0.065 Sum_probs=213.5 Q ss_pred ccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEec Q lcl|Aclame:pro 107 SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYA 186 (392) Q Consensus 107 ~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~ 186 (392) +.+..++||++||+++.++|++.+++.++|++++++++++++. ..+|+..+++.++|++|+++++++ +++|++++++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~--~~~p~~~~~~~a~wv~Eg~~~~~~-~~~f~~v~l~ 77 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGN--EDIITFNGRPKAEFVGEGQQKSST-TGEFDFVTST 77 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCc--eEEEEEeCCceeEEeecCcccccc-cceeeEEEEe Confidence 3345567889999999999999999999999999998887644 456667778899999999999975 6999999999 Q ss_pred hhheeeehhhHHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------------chhh Q lcl|Aclame:pro 187 VKDRAGILPLSRSLL---QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ---------------------AIKS 242 (392) Q Consensus 187 ~~~i~~~~~iS~e~l---~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~---------------------~~~~ 242 (392) ++|+++++++|+|++ .|+.++|.+||.+++++++++++|.++++|.++.+.. +... T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~ 157 (311) T protein:vir:99 78 PKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIAN 157 (311) T ss_pred eEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccch Confidence 999999999999999 4778899999999999999999999999987643211 1111 Q ss_pred -HHHHHHHHHHHhhhc--ccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccccc--- Q lcl|Aclame:pro 243 -LDDIKDVLNVKLDPA--ISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKG--- 316 (392) Q Consensus 243 -~d~~~~~~~~~~~~~--~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~--- 316 (392) ++++..++ ..+... ....+.|+|||++|..|++|||++|||||++....+.+.+++|. ||++.+ ..+.... T Consensus 158 ~~~~i~~~~-~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~-Pv~~s~-~i~~~~~~~~ 234 (311) T protein:vir:99 158 PDLAIEAAV-GLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGI-DASVSD-TVNGGDEADP 234 (311) T ss_pred hHHHHHHHH-HHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecce-eeEeec-cccccccccc Confidence 22333333 222222 22334599999999999999999999999998888888888885 565533 2221111 Q ss_pred -----ccCCcceEEEEehhhceeeeeccceEEEEeccch-----hhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 317 -----TTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 317 -----~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~-----~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) ..++...+++|||++++.+..+++++++.+++.+ +.|++|++.||+++|+|+++.+| +|++++.++| T Consensus 235 ~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 235 DDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 1234566899999998888999999999987643 46999999999999999999986 5776665554 No 98 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=1.3e-48 Score=283.16 Aligned_cols=356 Identities=16% Similarity=0.206 Sum_probs=224.0 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHH-------------hhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 1 MS-KELRELLAKLEGKKEEVRSL-------------MGED-KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREV 65 (392) Q Consensus 1 M~-kel~el~~~~~~~~~e~~~~-------------~~~~-~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~ 65 (392) .. ..-.+..++-.++..++.++ +..+ .++++++.. ++.+...... ...+............ T Consensus 216 ~~~a~~~~~~~~E~~r~~eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~~--ld~l~~~~~a--~~~~~~a~~~~~~~~~ 291 (632) T protein:vir:96 216 ASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALV--LERMNPGQPG--NFEKPGAGDLPGKPAI 291 (632) T ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhHHHHHhccccHHHHHHHH--HHHHhhhhhh--hhhhhhhhhhhhhhhh Confidence 00 00001111111111222211 1111 111111110 0001000000 0000000000000000 Q ss_pred cc-cccc-hhhHH-HHHHHHHHhcch-----hhH---------------HHH--HHHHhhhhhhhhccccccccceecch Q lcl|Aclame:pro 66 ET-RNVD-GEMEY-RDVFMKALRNKP-----LNA---------------EER--EFLEDDLEQRAMSGLTGEDGGLVIPQ 120 (392) Q Consensus 66 ~~-~~~~-~~~~~-~~a~~~~~~~~~-----~~~---------------~~~--~~~~~~~~~~a~~~~~~~~gg~~iP~ 120 (392) .. .... ...+. ...+.+.++... ... ..+ .......+.+++..++.++||++||+ T Consensus 292 ~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~ 371 (632) T protein:vir:96 292 HSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVAT 371 (632) T ss_pred hhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhccccccccccccc Confidence 00 0000 00000 000111111000 000 000 01122345677788888899999998 Q ss_pred hh-hhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHH Q lcl|Aclame:pro 121 DI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRS 199 (392) Q Consensus 121 ~~-~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e 199 (392) ++ ...|++.+++.++++++ ....+++.++.+.+|+.++++.++|++|+++++++ +++|+++++++++++++++||++ T Consensus 372 ~~~~~~iie~lr~~s~i~~l-~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s-~~~f~~i~l~~~k~~~~v~iS~e 449 (632) T protein:vir:96 372 ELLSEEFIDILRNKAIIGQM-GARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRK 449 (632) T ss_pred ccchHHHHHHHhhcchhhhh-cceEeecCCcceEEEEEeCCceeEeecCCcccccc-ccceeeEEeeeeEEEEehhhHHH Confidence 86 57899999999999987 33345666778888999999999999999999975 69999999999999999999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-c---------------cchhhHHHHHHHHHHHhhhcc--cCC Q lcl|Aclame:pro 200 LLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-K---------------QAIKSLDDIKDVLNVKLDPAI--SPN 261 (392) Q Consensus 200 ~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-~---------------~~~~~~d~~~~~~~~~~~~~~--~~~ 261 (392) +|+|+.++++++|.++|+++++.++|.++++|.|+.. + .+..+++++.++.. .+...+ ..+ T Consensus 450 ll~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~-~i~~~~~~~~~ 528 (632) T protein:vir:96 450 LRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KISTFNADAGR 528 (632) T ss_pred HHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHH-HHhhcccccCc Confidence 9999999999999999999999999999999987532 1 12335677777654 444433 456 Q ss_pred ceEEEcHHHHHHHHH--hhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeecc Q lcl|Aclame:pro 262 AILLTNQDGFNYLDK--LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE 339 (392) Q Consensus 262 a~~v~~~~~~~~L~~--lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~ 339 (392) ++|+||+.++..|++ ++|++|+|||++ .+++| +||++.+ .+| ...++||||++ |++.+++ T Consensus 529 ~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------~~l~G-~pv~~s~--~ip-------~~~~~~gd~s~-~~i~~~~ 590 (632) T protein:vir:96 529 LAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNG-YRAEASN--QIP-------ADTWIFGDWSQ-IVIAMWG 590 (632) T ss_pred cEEEEchhHHHHHHHHhccCCCCceeecC-------Ceecc-cceEecc--ccc-------cCcEEEeecce-EEEEEec Confidence 789999999877765 789999999974 35666 5665532 223 23489999997 4588999 Q ss_pred ceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 340 ~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) ++++.++++. +|.+|++.|++++|+|+++++|++|+.++.++ T Consensus 591 ~~~i~~~~~~--~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 591 VLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ceEEEEcccc--ccccCceEEEEEeecCceeechhhhhheeecC Confidence 9999999975 67899999999999999999999999999887 No 99 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=2e-38 Score=227.25 Aligned_cols=360 Identities=15% Similarity=0.154 Sum_probs=205.7 Q ss_pred CCH--HHHHHHHHHHHHHHHHHH---Hhhh--hhHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHhhcccccc-- Q lcl|Aclame:pro 1 MSK--ELRELLAKLEGKKEEVRS---LMGE--DKVAEAEQMMEEVRSLQKKIDL-----QRSLDEAETEERNNGREVE-- 66 (392) Q Consensus 1 M~k--el~el~~~~~~~~~e~~~---~~~~--~~~~~~~~~~~ei~~l~~~i~~-----~~~~~~~~~~~~~~~~~~~-- 66 (392) ++. ++..+++.......++++ .+++ +..++.+.+.+++.++..+++. +..+.....+......... T Consensus 124 a~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~ 203 (517) T protein:vir:97 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE 203 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccc Confidence 221 122222222111111111 0110 0111122222222222222211 1111100000000000000 Q ss_pred cc-ccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec Q lcl|Aclame:pro 67 TR-NVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV 145 (392) Q Consensus 67 ~~-~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~ 145 (392) .. ......++.......... ......................+|+++|+.+...|...+...++++.++++.++ T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i 278 (517) T protein:vir:97 204 ALKVTPEATEFLKTREAEVAY-----MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL 278 (517) T ss_pred cccccchhhHHHHHHHHHHHH-----HHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeeccc Confidence 00 000111111111000000 000000000011111222334468899999999999999999998888776554 Q ss_pred cCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHH----HHHHHHHHHHHHHH Q lcl|Aclame:pro 146 RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN----ILKYVTKWLGKKSK 221 (392) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~----l~~~v~~~l~~~~~ 221 (392) +. ...+.......+.|+.||+.+|++ +++|.++++.++++++++++|+++|+|+.++ |++||.++|++.++ T Consensus 279 ~~----~~~~~~~~~~~a~~~~eG~~kp~s-~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~ 353 (517) T protein:vir:97 279 PT----LVVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVI 353 (517) T ss_pred cc----eeeecccccceeeeeecCCccccc-ccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHH Confidence 32 333445566677899999988875 6899999999999999999999999998877 99999999999999 Q ss_pred HHHHHHHhhcccccccc-------------chhhHHHHHHHHHHHhhhcc--cCCceEEEcHHHHHHHHHhhccCCceee Q lcl|Aclame:pro 222 VTRNVLILGVIEKLTKQ-------------AIKSLDDIKDVLNVKLDPAI--SPNAILLTNQDGFNYLDKLKDKDGKYIL 286 (392) Q Consensus 222 ~~~d~~~~~~~~~~~~~-------------~~~~~d~~~~~~~~~~~~~~--~~~a~~v~~~~~~~~L~~lkd~~g~~l~ 286 (392) .+++.++++|.|+.... +....+.+.+++. .+..++ ..+++|||||.+|.+|++|||++||||| T Consensus 354 ~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~-~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~G~Yl~ 432 (517) T protein:vir:97 354 MAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLE-KLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVF 432 (517) T ss_pred HHHHHHHhcccCCCcccccccccccccccccccccchHHHHHH-HHHHHhhhccCCEEEECHHHHHHHHHhhcCCCCeec Confidence 99999999997754221 1122333444332 222222 2478899999999999999999999999 Q ss_pred cccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEee Q lcl|Aclame:pro 287 QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRD 366 (392) Q Consensus 287 ~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~ 366 (392) ++....+.+.++||..++ +|. ...+.. .+++++ .|.+..+.++.+..+. .+..|+..|+.++|+ T Consensus 433 ~~~~~~~~~~~l~G~~~~-------~~~--~~~~~~--~~~~~~-~y~i~~~~g~~~~~~f----d~~~n~~~f~~~~~~ 496 (517) T protein:vir:97 433 PVGVSNQTIATHFGFNRL-------VQS--VAVDEK--TAVSLS-GYVTNGSRGMEFEQGT----ILVENNKEYLFEMPI 496 (517) T ss_pred cCcCCcccccccCCcccc-------ccc--cccCce--eEeecc-ccEEEeecceeeeeee----ecccCceeEeeeeee Confidence 998888888888884322 111 112222 233444 4567777777653321 234788999999999 Q ss_pred CcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 367 DVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 367 ~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++.|+.|++|+++.+.+. ++| T Consensus 497 ~g~i~~~~r~a~~~~~p~-----~~~ 517 (517) T protein:vir:97 497 SGSLEYKGTTAYGTYTPP-----VAG 517 (517) T ss_pred ccccccccceEEEEEcCC-----CCC Confidence 999999999998886633 233 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=4.2e-37 Score=220.03 Aligned_cols=282 Identities=11% Similarity=0.036 Sum_probs=201.5 Q ss_pred HHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCCcceeEEEe Q lcl|Aclame:pro 78 DVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTRSGSRVLEK 156 (392) Q Consensus 78 ~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~~~~~~~~ 156 (392) -.-...++++.. ....++++. +..+||+++|.+. ..+++.+.+.|++++++++++. .+....+.... T Consensus 1 ~~~~~~~~~~~~----------~~~~k~~t~-~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g 68 (315) T protein:vir:41 1 MLTIEDIRGGKP----------FEIVPKIDV-PDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLS 68 (315) T ss_pred CcccchhhcCCh----------hhhhhhcCC-cCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccc Confidence 000111222211 112234443 4557899999887 4688999999999999998643 33332222111 Q ss_pred ec--CCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHH--HHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 157 NS--DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQ--NILKYVTKWLGKKSKVTRNVLILGVI 232 (392) Q Consensus 157 ~~--~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~--~l~~~v~~~l~~~~~~~~d~~~~~~~ 232 (392) .. ......|.+|..+.++ +.++|+++.+.+++++..+.||+++|+|+.. +|++||..++++++++.++.++++|. T Consensus 69 ~~~~~~~g~~~~~~~~~~~~-~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGd 147 (315) T protein:vir:41 69 LVLDVGPGRDETGQKLAPPE-STAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGD 147 (315) T ss_pred cCcccccccccccCcCCCCC-CccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccC Confidence 01 1123457777766665 5799999999999999999999999999964 99999999999999999999999997 Q ss_pred ccccc------cc-------------------hhhHHHHHHHHHHHhhhcccC---CceEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 233 EKLTK------QA-------------------IKSLDDIKDVLNVKLDPAISP---NAILLTNQDGFNYLDKLKDKDGKY 284 (392) Q Consensus 233 ~~~~~------~~-------------------~~~~d~~~~~~~~~~~~~~~~---~a~~v~~~~~~~~L~~lkd~~g~~ 284 (392) ++.+. .+ ....+.+++ +...++..|+. +++|+||++++..++++||++|+| T Consensus 148 g~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~-l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~ 226 (315) T protein:vir:41 148 TSSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDT-MIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETG 226 (315) T ss_pred CcCcCccccccccceecccccccccccccccccccHHHHHH-HHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCc Confidence 64211 00 012344445 45688998874 568999999999999999999999 Q ss_pred eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEE Q lcl|Aclame:pro 285 ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQ 364 (392) Q Consensus 285 l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~ 364 (392) +|+|....+.+.+++|. ||..++ .++.. ..+...++||||++ +.+.++..++++.++.. ..+.+.|.... T Consensus 227 lw~~~~~~g~~~tl~G~-PV~~~~--~m~~~--~~~~~~ilf~d~~n-l~~~~~~~i~i~~~~~a----~~~~~~~~~~~ 296 (315) T protein:vir:41 227 LGDQALTGANSILYDGR-PVQYVP--ALEAL--NDGKSRALFVVPTQ-LVYGFWRNIKVVPDYDA----EMRLTKYVASL 296 (315) T ss_pred cccchhhcCCCceeccc-ceEecc--ccccc--CCCCccEEEecccc-eEEEeccccEEEeeecC----CCCceEEEEEE Confidence 99999999999999984 565433 33332 34566799999987 45788889988876643 34668889999 Q ss_pred eeCcEEecccceEEEEecc Q lcl|Aclame:pro 365 RDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 365 r~~~~v~~~~af~~l~~~~ 383 (392) |+|+.+..+++.+...+|- T Consensus 297 r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 297 RTDNHYEDEEGAVSATITV 315 (315) T ss_pred EeceeEEeccceeEeeeeC Confidence 9999988877755444444 No 101 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.3e-36 Score=217.25 Aligned_cols=281 Identities=14% Similarity=0.030 Sum_probs=208.8 Q ss_pred hHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCCcceeEEEeecCC---ccccc Q lcl|Aclame:pro 90 NAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDM---IPFAE 165 (392) Q Consensus 90 ~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~~~~~~~~~~~~---~~~~~ 165 (392) -.+.+ +.....++++. +..+||+++|.++. .+++.+++.++++++++++. +.+....++... .+. ....| T Consensus 1 ~~~~~---~~~~~~k~it~-~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~-~g~~~~~~~~~ 74 (314) T protein:vir:41 1 MDFLN---KPFQITPKIDV-PDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRIS-LGVELEPGRNT 74 (314) T ss_pred Cchhh---hHHHhhccccc-ccCCCceeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccc-cCccccccccc Confidence 01111 11123344443 45578999999984 79999999999999999864 344444443321 121 23445 Q ss_pred cccccccccccccceeeEEechhheeeehhhHHHHHhhhHH--HHHHHHHHHHHHHHHHHHHHHHhhccccccc------ Q lcl|Aclame:pro 166 ITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQ--NILKYVTKWLGKKSKVTRNVLILGVIEKLTK------ 237 (392) Q Consensus 166 ~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~--~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~------ 237 (392) .+|..+.++ ++++|+++++.++++...++||+++|+|+.. +|+++|...++++++..++..+++|.++.++ T Consensus 75 ~~~~~~~~~-~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~ 153 (314) T protein:vir:41 75 SGTKVAPTA-DEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYR 153 (314) T ss_pred ccCCccCCc-ccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchh Confidence 666666665 5799999999999999999999999999975 9999999999999999999999999875321 Q ss_pred --c----------------chhhHHHHHHHHHHHhhhcccC---CceEEEcHHHHHHHHHhhccCCceeecccccCCccc Q lcl|Aclame:pro 238 --Q----------------AIKSLDDIKDVLNVKLDPAISP---NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKK 296 (392) Q Consensus 238 --~----------------~~~~~d~~~~~~~~~~~~~~~~---~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~ 296 (392) . +....++++..+...+++.|+. +++|+||++++.+++++++.+|+++|++....+.+. T Consensus 154 ~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~ 233 (314) T protein:vir:41 154 INDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGL 233 (314) T ss_pred cchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCc Confidence 1 1112344444455788998875 458999999999999999999999999999999998 Q ss_pred ceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccce Q lcl|Aclame:pro 297 LFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAA 376 (392) Q Consensus 297 ~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af 376 (392) +++|. ||+.++ .++. ..+++.+++||||+++ .+..+..+++..... ...+++.|.+..|+|+.+..++|. T Consensus 234 ~l~G~-PV~~~~--~~~~--~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~----a~~~~~~~~~~~r~d~~~~~~~aa 303 (314) T protein:vir:41 234 QYDGI-PIQYVP--ALDA--LGDDKARALLTVPTNL-VYGFWRNIRIEPKRD----AAMRRTEYIASLRADCNYEDENAA 303 (314) T ss_pred eecce-eeEecc--cccc--cCCCCceEEEechhhe-EEEeeceeEEeeccc----CcCCeEEEEEEEEeceEEEEcCcE Confidence 88885 555433 3333 3456789999999975 456777777665443 357899999999999999999999 Q ss_pred EEEEecccCCC Q lcl|Aclame:pro 377 VYGEIDLSAPV 387 (392) Q Consensus 377 ~~l~~~~~a~~ 387 (392) ++..++-+..- T Consensus 304 ~~~~~~~~~~~ 314 (314) T protein:vir:41 304 VAAVIDMSSGG 314 (314) T ss_pred EEEEeeccCCC Confidence 88887766554 No 102 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=2.7e-36 Score=215.61 Aligned_cols=351 Identities=12% Similarity=0.062 Sum_probs=186.3 Q ss_pred CCHH--HHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHH Q lcl|Aclame:pro 1 MSKE--LRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) Q Consensus 1 M~ke--l~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) +++. +...++......+.+.....++...+......+..++.++++.+.+..+........... ...+........+ T Consensus 111 a~~~a~v~~vks~~~~~e~~~~~~e~~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~-~~~~~~~~~~e~r 189 (480) T protein:vir:40 111 SNKGAKVTKVREENKGEQEQMGANETQEIMKQAIEAGVKVRELEAKVEELNKEREELKKEREASIP-SEKPEDAERKFMR 189 (480) T ss_pred cchhhhhhhhhhhhhhhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhcc-ccchhhhhhHHHH Confidence 3321 222221111000000000000000111111111122222222221111111111111000 0001111111111 Q ss_pred HHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeec Q lcl|Aclame:pro 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) Q Consensus 79 a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~ 158 (392) .+....++.... ........ +...+....++. +|+.+...+.......+++...+.... . T Consensus 190 ~~~~~~~~~~e~----~~~~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~ 249 (480) T protein:vir:40 190 ELGSKMAEMPEQ----GFLREFAN--GADLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAE-------------D 249 (480) T ss_pred HHHHHhccchhh----hhhhhhhh--hccccccccccc-cccchhhheeechhhhhhhhhcceeee-------------c Confidence 111222221111 11111111 112222333444 455554444444444455444333221 1 Q ss_pred CCcccccccccccccccccc-ceeeEEec---hhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 159 DMIPFAEITEMGEIPETDNP-KFSNVQYA---VKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK 234 (392) Q Consensus 159 ~~~~~~~~~E~~~~~~~~~~-~~~~v~~~---~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~ 234 (392) +.....|++|....+.+..+ ++.+..+. +++++++++.|.++|+|+. +|++||.++|++.++.+++.+|++|.++ T Consensus 250 g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~ 328 (480) T protein:vir:40 250 GVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVD 328 (480) T ss_pred cccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 22334566665433322222 23344444 5789999999999999976 7999999999999999999999999543 Q ss_pred ccc------------cchhhHHHHHHHHHHHhhhcccCCc-eEEEcHHHHHHHHHhhccCCceeecccccCCcccceecc Q lcl|Aclame:pro 235 LTK------------QAIKSLDDIKDVLNVKLDPAISPNA-ILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) Q Consensus 235 ~~~------------~~~~~~d~~~~~~~~~~~~~~~~~a-~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~ 301 (392) ... +...+.+++++.+...++..|+.++ .|||||.+|++|++|||++|+|||+|+.+.+.+.++||. T Consensus 329 g~~~~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~ 408 (480) T protein:vir:40 329 GSNGFYGLKTATDGWTKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGA 408 (480) T ss_pred CccccccceeecccccccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceeccc Confidence 321 1233456667656667888888777 699999999999999999999999999999999999995 Q ss_pred cceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEe Q lcl|Aclame:pro 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) Q Consensus 302 ~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~ 381 (392) ||++.+ ..+|. ....+|.++.++.+++++ ++. .+ +..+..++..|+.+.|+++.+.+|++|.++++ T Consensus 409 -pvv~~~-~~~~~-------~~~~~~~~~~~~~~~d~~-~~~--~~--~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~ 474 (480) T protein:vir:40 409 -VNLETR-VWMPK-------DEVAVYNHDEYVLIGDLN-VEN--YN--DFDLRYNVEQWLSETLVGGSIRGKNRSAYLKK 474 (480) T ss_pred -ceeeee-ccccC-------CcceeeeCCccEEEEecc-cce--ec--ccccccchhhhhhhhhhceeeEccccEEEEEe Confidence 554432 22322 123455555666666653 322 11 12345788899999999999999999999999 Q ss_pred cccCCC Q lcl|Aclame:pro 382 DLSAPV 387 (392) Q Consensus 382 ~~~a~~ 387 (392) +..--+ T Consensus 475 ~~~~~~ 480 (480) T protein:vir:40 475 KGSLGV 480 (480) T ss_pred ccCcCC Confidence 999887 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=99.96 E-value=4.2e-31 Score=187.12 Aligned_cols=287 Identities=12% Similarity=0.061 Sum_probs=200.2 Q ss_pred hhHHH-HHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccc Q lcl|Aclame:pro 89 LNAEE-REFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) Q Consensus 89 ~~~~~-~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (392) .+... ...... ..+++....+..++|++||+++...|++.+.+.++++++++++++.+..+.++ ....+....|++ T Consensus 1 ~~~k~~~~~l~~-~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~--~~~~~~~~~~~~ 77 (321) T protein:vir:31 1 MASRTINNDLSR-ITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIP--TLNIGERHRRPQ 77 (321) T ss_pred CchHHHHHHHHH-HHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeee--eeccCCcccccc Confidence 11111 111111 22233333345677889999999999999999999999999999987776654 334445556776 Q ss_pred c-ccccccccccceeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc------ Q lcl|Aclame:pro 168 E-MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ------ 238 (392) Q Consensus 168 E-~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~------ 238 (392) + ++.....++++|+++++.++++...++||+++|+|+. ++|+++|.+.++++++..++..+++|.+...++ T Consensus 78 ~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~ 157 (321) T protein:vir:31 78 DEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQND 157 (321) T ss_pred cccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccch Confidence 4 3333334578999999999999999999999999985 589999999999999999999999998754331 Q ss_pred -----------------chhhHHHHHHHHHHHhhhcccC--CceEEEcHHHHHHHHH-hhccCCceeecccccCCcccce Q lcl|Aclame:pro 239 -----------------AIKSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDK-LKDKDGKYILQSDPTQKNKKLF 298 (392) Q Consensus 239 -----------------~~~~~d~~~~~~~~~~~~~~~~--~a~~v~~~~~~~~L~~-lkd~~g~~l~~~~~~~~~~~~~ 298 (392) +..+++.+.++ ...+++.|+. +.+|+||++++..+++ +++. +.++|.+....+.+.++ T Consensus 158 G~l~~a~~~~~~~~~~~~~~~~d~l~~l-~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~~~~~~tl 235 (321) T protein:vir:31 158 GFITVAEGDVETIDAADDILDNDLVIRT-IAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIMGEADVNP 235 (321) T ss_pred hhhhhhccccccccccccccCHHHHHHH-HHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhhccccccc Confidence 11234556664 4578888874 5689999999987765 5655 45899988888888887 Q ss_pred ecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhh-hhcCceeEEEEEeeCcEEecccceE Q lcl|Aclame:pro 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKA-FTRNTLDLRAIQRDDVQMWDNEAAV 377 (392) Q Consensus 299 ~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~-f~~~~~~~~~~~r~~~~v~~~~af~ 377 (392) +|. ||+.+ +++|. ..++|+||++++ ++.+.+++++........ +..+.+......++|+.+.+++|++ T Consensus 236 ~G~-pvv~~--~~mP~-------~~il~t~~~nl~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a 304 (321) T protein:vir:31 236 FSF-PIIGS--GLWPD-------DKAMFTDPQNLI-YALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVV 304 (321) T ss_pred cce-eEEEc--CCCCC-------CcEEEeccccEE-EEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEE Confidence 774 56543 34553 358999999864 556677777776553221 1233444455567899999999999 Q ss_pred EEE-ecccC--CCCCCC Q lcl|Aclame:pro 378 YGE-IDLSA--PVEQPQ 391 (392) Q Consensus 378 ~l~-~~~~a--~~~~~~ 391 (392) .++ ++-+- -..+|. T Consensus 305 ~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 305 LAEGLGDPLEHLEEETS 321 (321) T ss_pred EEecCCcchhcccCCCC Confidence 987 44321 111122 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.93 E-value=4.9e-27 Score=164.84 Aligned_cols=265 Identities=12% Similarity=0.095 Sum_probs=198.2 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCCc-ceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |..+++..+..++|+.++..+++.+.+.+.+.+++.+.. ..+.. .++.+|+....+.+.|++||++.+. ++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPM-TQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccc-cccccceE Confidence 444455567789999999999999999988888776532 12222 2455676666778999999998886 47999999 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccchhhHHHHHHHHHHHhhhcccCC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQAIKSLDDIKDVLNVKLDPAISPN 261 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~~ 261 (392) ++.+++++..+++|++...++.+++.+.+.+++++.+++.+|..++....+.. ..+..+++++.+++. .+...+... T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~t~d~i~da~~-~l~~~~~~~ 158 (272) T protein:vir:98 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATATVDGVSKALD-IFNDEDDAE 158 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHH-HHhccCCCc Confidence 99999999999999999999999999999999999999999999988765433 345678899999875 455556677 Q ss_pred ceEEEcHHHHHHHHHhhccC---CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeec Q lcl|Aclame:pro 262 AILLTNQDGFNYLDKLKDKD---GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKR 338 (392) Q Consensus 262 a~~v~~~~~~~~L~~lkd~~---g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~ 338 (392) .+|+|||.++..|++.+..+ ...........+...+++|. ||++.. .+|.. ..++| +.. ++.++.+ T Consensus 159 ~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~-~Vi~s~--~~p~~------t~~~~-~~~-a~~~~~~ 227 (272) T protein:vir:98 159 TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGV-QIVRSR--KCPKG------TAYMV-RKG-ALRIMLK 227 (272) T ss_pred cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCe-eEEEcC--CCCcc------eEEEE-cCC-eEEEEec Confidence 89999999999998764221 11122223334445677875 565533 23321 23444 433 5667778 Q ss_pred cceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCC Q lcl|Aclame:pro 339 EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 339 ~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~ 387 (392) .+++++.++.. .++...+++..|+++++.+|++++++++++++.- T Consensus 228 ~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 228 RNTMVETDRDI----TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred CCceeeecccc----ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 88888876653 3567899999999999999999999999977765 No 105 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.93 E-value=4.9e-27 Score=164.84 Aligned_cols=265 Identities=12% Similarity=0.095 Sum_probs=198.2 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCCc-ceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |..+++..+..++|+.++..+++.+.+.+.+.+++.+.. ..+.. .++.+|+....+.+.|++||++.+. ++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPM-TQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccc-cccccceE Confidence 444455567789999999999999999988888776532 12222 2455676666778999999998886 47999999 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccchhhHHHHHHHHHHHhhhcccCC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQAIKSLDDIKDVLNVKLDPAISPN 261 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~~ 261 (392) ++.+++++..+++|++...++.+++.+.+.+++++.+++.+|..++....+.. ..+..+++++.+++. .+...+... T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~t~d~i~da~~-~l~~~~~~~ 158 (272) T protein:vir:30 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATATVDGVSKALD-IFNDEDDAE 158 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHH-HHhccCCCc Confidence 99999999999999999999999999999999999999999999988765433 345678899999875 455556677 Q ss_pred ceEEEcHHHHHHHHHhhccC---CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeec Q lcl|Aclame:pro 262 AILLTNQDGFNYLDKLKDKD---GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKR 338 (392) Q Consensus 262 a~~v~~~~~~~~L~~lkd~~---g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~ 338 (392) .+|+|||.++..|++.+..+ ...........+...+++|. ||++.. .+|.. ..++| +.. ++.++.+ T Consensus 159 ~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~-~Vi~s~--~~p~~------t~~~~-~~~-a~~~~~~ 227 (272) T protein:vir:30 159 TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGV-QIVRSR--KCPKG------TAYMV-RKG-ALRIMLK 227 (272) T ss_pred cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCe-eEEEcC--CCCcc------eEEEE-cCC-eEEEEec Confidence 89999999999998764221 11122223334445677875 565533 23321 23444 433 5667778 Q ss_pred cceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCC Q lcl|Aclame:pro 339 EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPV 387 (392) Q Consensus 339 ~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~ 387 (392) .+++++.++.. .++...+++..|+++++.+|++++++++++++.- T Consensus 228 ~~~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 228 RNTMVETDRDI----TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred CCceeeecccc----ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 88888876653 3567899999999999999999999999977765 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.71 E-value=1.5e-18 Score=118.34 Aligned_cols=266 Identities=13% Similarity=0.079 Sum_probs=183.5 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCCc-ceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |..+.+.-+-.++|+.+...+.+.+.+...+.+++..... .+.. .++.+|.....+.+.++.|+++.+. +..+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~-~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISL-DKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccCh-hhcCCcce Confidence 4433444566778999998898988888888888766442 2211 2344555554456778999988875 46788899 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccchhhHHHHHHHHHHHhhhcccCC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--KQAIKSLDDIKDVLNVKLDPAISPN 261 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~~ 261 (392) ++..++.+..+.++++...++..++.+.+.++++..+++.+|+.++....+.. ..+..++|.+.+++.. +....... T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~~~~~~d~i~~A~~~-lgd~~~~~ 158 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVSTKANVDGVQAALDI-FNDEDAQA 158 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHHHHHHHHHH-hhhcCCCc Confidence 99999999999999999888888999999999999999999999877664432 3455678899888754 44444556 Q ss_pred ceEEEcHHHHHHHHHhhccCCc--eeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeecc Q lcl|Aclame:pro 262 AILLTNQDGFNYLDKLKDKDGK--YILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE 339 (392) Q Consensus 262 a~~v~~~~~~~~L~~lkd~~g~--~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~ 339 (392) .+++|||.++..|++...-... ....+...++.-.+++|. +|++ ++. +|..+ .-...++++ +.++..+... T Consensus 159 ~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~-~Vv~-s~~-~p~~~--~~~~~~~~~--~gA~~~~~~~ 231 (272) T protein:vir:36 159 YVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA-QIVR-SKK-LAEGS--ALMFKIVSN--SPALKLVLKR 231 (272) T ss_pred eEEEEcHHHHHHHhcccccccccccccccceeeeccceecCe-eEEE-eCC-CCCCc--eeEEEEEec--ccceeeeecC Confidence 7899999999999764322111 111111122333567775 4544 333 33321 112234554 3355566677 Q ss_pred ceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 340 DMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 340 ~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) +++++..+... +....+++..+++.++.+|+++++++++.. T Consensus 232 ~~~vE~~R~~~----~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 232 GVQVETDRDIV----TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred Ccccccccchh----hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 88888766542 344678999999999999999999998877 No 107 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.69 E-value=5.3e-18 Score=115.31 Aligned_cols=296 Identities=8% Similarity=0.029 Sum_probs=198.1 Q ss_pred HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC Q lcl|Aclame:pro 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 159 (392) +. -+....+....|..... ..+.++...|-..++.+.|.++...|++.+.+.+.|+.++....+.++...+ .+... T Consensus 1 ~~-~~~~~~~~~~~~~~~~~-~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~--~r~~~ 76 (330) T protein:vir:94 1 MV-RICTPPLRGRWRTLTHQ-FPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAY--NRENV 76 (330) T ss_pred Cc-eecCCccccceeehhcc-ccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCccee--eeeec Confidence 00 01111221111111111 1344566666777888999999999999999999999998877676655454 45556 Q ss_pred CccccccccccccccccccceeeEEechhheeeehhhHHHHHh--hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc- Q lcl|Aclame:pro 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT- 236 (392) Q Consensus 160 ~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~--ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~- 236 (392) -+.++|...+...+++...+|.+++.+.+.+.+.+.|.+.+.+ ....++..+-.+...++++++.+..+++|..++. T Consensus 77 lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~ 156 (330) T protein:vir:94 77 LGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNS 156 (330) T ss_pred CCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc Confidence 6788899888777765434799999999999999999999954 4566888899999999999999999999753311 Q ss_pred ------------------ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccC--Cccc Q lcl|Aclame:pro 237 ------------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQ--KNKK 296 (392) Q Consensus 237 ------------------~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~--~~~~ 296 (392) .++..+.|++..++. .+......+.+|+||++.+.+|+.+....|+|-..|.... |.+. T Consensus 157 F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~-~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v 235 (330) T protein:vir:94 157 FQGMMGLVAASQTISAGANGGTLTFELLDQLLD-LVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQI 235 (330) T ss_pred ccchhhcCCcccEEecCCCCCCCCHHHHHHHHH-HhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEE Confidence 123345566666553 3333334577899999999999999998887655443322 3344 Q ss_pred ceecccceEEecCcccccc--cccCCcceEEEEehh-----hceeeee---ccceEEEEeccchhhhhcCceeEEEEEee Q lcl|Aclame:pro 297 LFAGTNPVVVVSNRFLKSK--GTTAKKAPLIIGDLK-----EAIVLFK---REDMELASTDVGGKAFTRNTLDLRAIQRD 366 (392) Q Consensus 297 ~~~g~~pv~~~~~~~~~~~--~~~~~~~~~~~Gd~~-----~~~~~~~---~~~~~~~~~~~~~~~f~~~~~~~~~~~r~ 366 (392) ..|++-|++. ++.++.+. ++..+...|++..|. +++.... ..+++++.-.+.. .++.+.+++++++ T Consensus 236 ~~~~GvPi~~-~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~---~k~v~~~~v~~y~ 311 (330) T protein:vir:94 236 PTYRGVPWFV-NDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKE---NADETITRVKMYC 311 (330) T ss_pred eeeCCeEEEe-cccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCcc---ccceeeEEEEEee Confidence 4566666543 33333322 234566777777764 2322221 2367775543222 2566889999999 Q ss_pred CcEEecccceEEEE-eccc Q lcl|Aclame:pro 367 DVQMWDNEAAVYGE-IDLS 384 (392) Q Consensus 367 ~~~v~~~~af~~l~-~~~~ 384 (392) +.++.+|+|+.+|+ ++.. T Consensus 312 ~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 312 GFANFSQLGLAAIKGLIPG 330 (330) T ss_pred eeEEechhheeeeccccCC Confidence 99999999999986 4444 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.66 E-value=2.9e-17 Score=111.22 Aligned_cols=264 Identities=12% Similarity=0.085 Sum_probs=182.2 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCCcc-eeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |....+.-+-.++|+.+...+.+.+.+...+.+++.... ..+..| ++.+|.....+.+.++.|++..+. ++.+.+.. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~-~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccc-ccccccee Confidence 444445556778999999999999888888888876632 222223 455566554457789999888775 46889999 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+....++++...++..++.+.+.++++..+++.+|+.++....+++. .....++.+++++.. +...... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~~ 158 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDLE 158 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hhhccCC Confidence 999999999999999999998889999999999999999999988877655442 233568899988754 4444446 Q ss_pred CceEEEcHHHHHHHHHhhccCCceee-----cccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceee Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLKDKDGKYIL-----QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVL 335 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lkd~~g~~l~-----~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~ 335 (392) ..+++|||..+..|++- ..-+++- .+....+.-.+++|. +|++. +. +|. ...++++ . .++.. T Consensus 159 ~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~-~Vi~s-~~-~p~------~t~~l~~-~-gai~~ 225 (274) T protein:vir:93 159 PMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGA-IIVRT-NK-LEA------GTAILAK-K-GAVKL 225 (274) T ss_pred ccEEEeCHHHHHHHHhh--hhhcccccccccccceeecccceecCe-eEEEc-CC-CCc------ceEEEEe-C-CeEEE Confidence 67899999999999753 2111110 111123344567775 45443 32 332 1234444 3 34556 Q ss_pred eeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 336 FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 336 ~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) +.+.++.++..+.. .+....+++..++++++.+|+++++++++.+ ..+- T Consensus 226 ~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~-s~~~ 274 (274) T protein:vir:93 226 ILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSG-SLEM 274 (274) T ss_pred EecCCcccccccch----hhcccEEEEEEEEEEEEEcCCceEEEeeCcc-ccCC Confidence 66777777776653 2345789999999999999999999985432 2222 No 109 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.65 E-value=6.4e-17 Score=109.36 Aligned_cols=354 Identities=12% Similarity=0.126 Sum_probs=207.0 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHHhhh----------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MSK-ELRELLAKLEGKKEEVRSLMGE----------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) Q Consensus 1 M~k-el~el~~~~~~~~~e~~~~~~~----------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) |+| .|.|+.+.+...+++.-++-.+ +++-..+.+++-+.++..+|...+ .+. .......+. ... T Consensus 8 ~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e--~el-n~~~E~~Kg--k~~ 82 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIE--NEL-NAQEEKPKG--KDK 82 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHh--hhh-hhhhhhccc--chh Confidence 776 4667666666666555443221 122234445555555554443211 111 110000000 001 Q ss_pred cchhhHHHHH---HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 70 VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 70 ~~~~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ....-+-++| |.+.+.....+...+.........++.+ ..+.-..+|.-+...|-+.+....+++++..+..++ T Consensus 83 mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt---~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p 159 (400) T protein:vir:93 83 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) T ss_pred HHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccc---cCCchhhcchHHHHHHHHhhhccCCcccceeeecCC Confidence 1111122233 3333333333333333444444444433 233334679999999999999999999998888775 Q ss_pred CCcceeEEEeecCCcccccccc-ccccccccccceeeEEechhheeeehhhHHHHHhh--hHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 147 TRSGSRVLEKNSDMIPFAEITE-MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQD--SDQNILKYVTKWLGKKSKV- 222 (392) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~E-~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~d--s~~~l~~~v~~~l~~~~~~- 222 (392) + +...+......-+|+.- |.++++ +..+|..-++.|.-++.+..+.+-..++ +.-.|..||..+|...+.. T Consensus 160 ~----l~V~~~~dt~~qa~gHk~G~~K~e-q~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k 234 (400) T protein:vir:93 160 A----LLVSRSFDSANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 234 (400) T ss_pred c----eeeecchhhhcccceeccCCcccc-eeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHH Confidence 4 22222233333566554 445554 5679999999999999888885444433 2346899999999999996 Q ss_pred HHHHHHhhcccccccc---------------------chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQ---------------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKD 281 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~---------------------~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~ 281 (392) +.+.+++-|.|.++-. +...+.+++.-+.....+-...+-.+||+|..|+.|+.|+|++ T Consensus 235 ~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~ 314 (400) T protein:vir:93 235 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQAT 314 (400) T ss_pred HhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCc Confidence 5699998887766521 2223444444433333444445556899999999999999999 Q ss_pred CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEE Q lcl|Aclame:pro 282 GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLR 361 (392) Q Consensus 282 g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~ 361 (392) |.+.|.......+..+-+|..-+++..- .++....+++ |= ++.+ .-.+++- . .+..+.+|+-.|. T Consensus 315 ~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr-------~~~~kp~V~V-De--k~~i-~~~~~~t-~---~sf~~~tNs~~il 379 (400) T protein:vir:93 315 ANANVRIKNDDTEIASEVGVDEIIVYTG-------SKALKPTVLV-DQ--KYHI-DMQDLTK-V---DAFEWKTNSNMIL 379 (400) T ss_pred ceeeeeeccccchhhhhcccceeeeecc-------CCCCCceeee-eh--hhhc-cccCcee-c---cceeeeeccceEE Confidence 9999976666666667777654443322 2333333444 43 2323 2233321 1 1223467788889 Q ss_pred EEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 362 AIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 362 ~~~r~~~~v~~~~af~~l~~~ 382 (392) ++..++|.+.-|++-+++++. T Consensus 380 vetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 380 VETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeeccceecccceeeEeeC Confidence 999999999999999988877 No 110 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.62 E-value=2.1e-16 Score=106.55 Aligned_cols=349 Identities=15% Similarity=0.175 Sum_probs=199.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVF 80 (392) Q Consensus 1 M~kel~el~~~~~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 80 (392) |.-=|+++++. +- .+...+|.++|+.++++.+.+.+.+.+..... +.+.+.-..| T Consensus 1 ~~~~~~~~~~~--------------~~---~~~~~~e~k~lr~~me~~et~~e~~~~~~~~~--------~~e~el~E~f 55 (393) T protein:vir:79 1 MENWLKQLKES--------------GF---TETQVQEQKSLRTRMERGETLAEADANKLALN--------EEETQILESF 55 (393) T ss_pred CchHHHHHHhc--------------cC---chhHHHHHHHHHHHhhhhhhhhhhhhhhhhcc--------hhHHHHHHHH Confidence 44333333221 10 12334456666766666555544443322110 1111222334 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCC Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDM 160 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~ 160 (392) .+.+.+. ....+ .+....-++.+|..+||+.+++.+.+..........|+.......+......+ -+. T Consensus 56 ~Kmm~G~-~p~~e---------V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~--~g~ 123 (393) T protein:vir:79 56 AKMMEGE-TPTNE---------VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPS--IGI 123 (393) T ss_pred HHHhcCC-Cchhh---------eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccc--hhe Confidence 4433322 21111 11111234567889999999999999766665555555555553222222112 224 Q ss_pred cccccccccccccccc--ccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Q lcl|Aclame:pro 161 IPFAEITEMGEIPETD--NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK- 237 (392) Q Consensus 161 ~~~~~~~E~~~~~~~~--~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~- 237 (392) -.+.-++||++.++.+ ..+++.|+++.+|.+..+.+|.|+++||..++..+..+...+++++..+..++++..+.+. T Consensus 124 ~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ght 203 (393) T protein:vir:79 124 MRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHT 203 (393) T ss_pred eeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccce Confidence 5677899999988643 2468899999999999999999999999999999999999999999998877776433221 Q ss_pred ----------------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHh---hccCCceeecccc-- Q lcl|Aclame:pro 238 ----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKL---KDKDGKYILQSDP-- 290 (392) Q Consensus 238 ----------------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~l---kd~~g~~l~~~~~-- 290 (392) .+....+|+++.+. ++.+....+++++|||-.|+.+.|= -...-.++-.-+. T Consensus 204 vfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~-av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~ 282 (393) T protein:vir:79 204 VFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLII-AVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKG 282 (393) T ss_pred eeeccccCccceeecCCccccccccccHHHHHHHHH-HHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccc Confidence 23356789999775 4566667888899999999988652 2222222210000 Q ss_pred ----cCCcccceecccc--eEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEE Q lcl|Aclame:pro 291 ----TQKNKKLFAGTNP--VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQ 364 (392) Q Consensus 291 ----~~~~~~~~~g~~p--v~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~ 364 (392) ..-+|..+-|..| ..++-.+++|-... .....++.-|-...-....+.+++++..+. -..|...++... T Consensus 283 ~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k-~~rFd~~~Vd~NnvgvlLV~D~i~tdq~dd----k~rdiq~iKl~E 357 (393) T protein:vir:79 283 APSSMALGPDSIQGRLPFNFNVNLSPFIPLDKK-SRRFDVYAVDRNNVGVLLVRDDLKTDQWDE----KARGLQNIKMIE 357 (393) T ss_pred cchhhhhchhhhccccccceeEEEecccccccc-cceeeEEEeecCCceEEEEecCcceecccc----ccccceeeeeee Confidence 0112333334323 12333556665432 222233333333322233455666555443 246788999999 Q ss_pred eeCcEEecc-cceEEEE-ecccCCCCCCCC Q lcl|Aclame:pro 365 RDDVQMWDN-EAAVYGE-IDLSAPVEQPQG 392 (392) Q Consensus 365 r~~~~v~~~-~af~~l~-~~~~a~~~~~~~ 392 (392) |+|++|.+. +|+...+ ++-+.....|-- T Consensus 358 RYG~gvLn~gkaiavakNI~~~k~y~~P~~ 387 (393) T protein:vir:79 358 RYGIGILNEGKAIAVAKNISMDKSYAEPML 387 (393) T ss_pred eeceeeeeCCceEEEEecceeecccccchh Confidence 999998874 5554443 444433333322 No 111 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.62 E-value=1.1e-16 Score=108.04 Aligned_cols=264 Identities=14% Similarity=0.092 Sum_probs=181.4 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCC-cceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) +...++.-+..++|+.+...+.+.+.+...+.++++...- .+. ..++.+|.....+.+..+.|+...+.. +.+.+.. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~-~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVD-QIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchh-hccccee Confidence 4443444567889999999999998888877777655321 111 123444554444566678888877754 6788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+..+.++++...++..++.+.+.++++..+++.+|..++....+.+. ....+++.+++++.. +...... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~~ 158 (274) T protein:vir:96 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDK-FNDEDLE 158 (274) T ss_pred EEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccccHHHHHHHHHH-hcccCCC Confidence 888899888899999998888888999999999999999999988776554332 234568899988764 4444446 Q ss_pred CceEEEcHHHHHHHHHhhccCCceee-----cccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceee Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLKDKDGKYIL-----QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVL 335 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lkd~~g~~l~-----~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~ 335 (392) ...++|||..+..|++...- +++- ......+.-.+++|. +|++ ++. +|.. ..++||. .++.. T Consensus 159 ~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~g~ig~~~G~-~Vi~-s~~-~p~~------t~~l~~~--gA~~~ 225 (274) T protein:vir:96 159 PMVLFVNPLDAGGLRTSASD--NFTRPTQLGDNIIVKGAFGEALGA-VIVR-SNK-LNKG------EALLAKK--GAVKL 225 (274) T ss_pred ceEEEeCHHHHHHHHhcccc--cccccccccccceeecccceecCe-eEEE-cCC-CCcc------eEEEEeC--cceee Confidence 67899999999999875311 1111 011112334566775 4544 333 3321 2456653 35556 Q ss_pred eeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCC Q lcl|Aclame:pro 336 FKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) Q Consensus 336 ~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~ 388 (392) +.+.+++++..+.. .+....+++.++++.++.+|+++++++...+..+- T Consensus 226 ~~~~~~~vE~~Rd~----~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 226 ITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred eecCCcccccccch----hhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 67777777765543 23456888999999999999999999988877766 No 112 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.62 E-value=1.4e-16 Score=107.52 Aligned_cols=264 Identities=14% Similarity=0.062 Sum_probs=174.0 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCCcc-eeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |...++.-+..++|+.+...+.+.+.+...+.+++.... ..+..| ++.+|.....+.+.++.|++..+.. +.+.++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~-~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYS-ALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCccc-cccccee Confidence 333344557788999999999999888877777765432 222222 3445554444566789998877754 6788999 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----c----chhhHHHHHHHHHHHh Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----Q----AIKSLDDIKDVLNVKL 254 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~-----~----~~~~~d~~~~~~~~~~ 254 (392) ++..++.+..+.++++...++..++.+.+.++++..+++..|+.++....+... . ....++.+.++..+.- T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l~ 159 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAIE 159 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhhc Confidence 998888888899999999988889999999999999999999987776533221 1 1123455555544322 Q ss_pred hhcccCCceEEEcHHHHHHHHHhhccCC---ceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhh Q lcl|Aclame:pro 255 DPAISPNAILLTNQDGFNYLDKLKDKDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE 331 (392) Q Consensus 255 ~~~~~~~a~~v~~~~~~~~L~~lkd~~g---~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~ 331 (392) ........+++|||..+..|++....+. ..+-.+...++.-.+++|. +|++.+ . +|. ...++|+. . T Consensus 160 ~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~-~Vi~s~-~-~p~------~t~~l~~~--g 228 (278) T protein:vir:80 160 DESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGW-EIVRTK-K-LAD------GNALAVKA--G 228 (278) T ss_pred ccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecce-eEEEcC-C-CCc------ceEEEEec--c Confidence 2333334568899999999976532211 0111111223444567775 454433 2 332 22455652 3 Q ss_pred ceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 332 AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) ++..+...+++++.++... +....+++.++++.++.+|++++++++.+.. T Consensus 229 Ai~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 229 ALKTFLKRNLLAESGRDMD----HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ceeeeecCCcccccccchh----hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 5556677778877766432 3456888999999999999999999977666 No 113 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.60 E-value=3e-16 Score=105.66 Aligned_cols=267 Identities=11% Similarity=0.060 Sum_probs=182.7 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cC-CcceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |....+.-+..++|+.+...+.+.+.+...+.+++.+..- .+ ...++.+|....-..+.++.|+.+.+. ++.+.+.. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~-~~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPV-DKIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCc-ccccccee Confidence 3333444566778999999999999998888888765431 11 122344454444456778999988875 46788999 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT---KQAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~---~~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ....++.+..+.++++....+..|+...+.+.++..+++..|..++..+.+.+ .....+++.+.++.... ...... T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t~d~i~~A~~~l-gd~~~~ 158 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGTLAGLEAAIDTF-DDEDLE 158 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHh-ccccCc Confidence 99999999999999999998888889999999999999999998776544432 22345788888877543 333335 Q ss_pred CceEEEcHHHHHHHHHhhccCCce---eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeee Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLKDKDGKY---ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFK 337 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lkd~~g~~---l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~ 337 (392) ..+++|||..+..|+++.+.+... .-.+....|.-.+++|. +|++. +. +|. ...++|+. .++..+. T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~-~Vi~s-~~-~p~------~t~~l~~~--gAi~~~~ 227 (276) T protein:vir:10 159 PMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGA-VIVRS-KK-LDE------GEAILAKR--GAVKLIT 227 (276) T ss_pred ccEEEEcHHHHHHHHHhccccccccccccccceeccccceecce-eEEEc-CC-CCc------ceEEEEec--cceeeee Confidence 567899999999998764322100 00111123334567775 45443 22 232 22356663 3566677 Q ss_pred ccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+++++.++... +....+++..+++.++.+|..++++++.. -+.|.| T Consensus 228 ~~~~~vE~dRd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~---~~~~~~ 275 (276) T protein:vir:10 228 KRDFFLETDRDPS----TKTTALYSDKHYVAYLYDESKAVKVTKGA---GTTDSG 275 (276) T ss_pred cCCceeecccchh----hcccEEEEeeEEEEEEEcCcceEEEecCC---cCCcCC Confidence 8888888877543 34578888999999999999999998553 344555 No 114 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.59 E-value=2.9e-16 Score=105.75 Aligned_cols=263 Identities=13% Similarity=0.067 Sum_probs=180.4 Q ss_pred hhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCCcc-eeEEEeecCCcccccccccccccccccccee Q lcl|Aclame:pro 104 RAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFS 181 (392) Q Consensus 104 ~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~ 181 (392) .++.. .+.-+-.++|+.+...+.+.+.+...+.+++.+..- .+..| ++.+|.....+.+.++.|+++.+.. +.+.+ T Consensus 1 ~~~~~-~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~-~lt~~ 78 (275) T protein:vir:96 1 MALEN-MTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPID-LIETK 78 (275) T ss_pred CCCcc-cchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchh-hcccc Confidence 22222 223345678999999999999998888888765432 22112 3445544444567788998887754 67888 Q ss_pred eEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAI 258 (392) Q Consensus 182 ~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~ 258 (392) ..++..++.+..+.++++....+..++.....++++..+++.+|..++...++.+. ....++|.+.+++... .... T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~~d~i~dA~~~l-gd~~ 157 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITKLAGLQTAIDKF-NDED 157 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHh-cccc Confidence 88999999999999999998887778888899999999999999988776555432 3445789999987644 3333 Q ss_pred cCCceEEEcHHHHHHHHHhhcc-------CCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhh Q lcl|Aclame:pro 259 SPNAILLTNQDGFNYLDKLKDK-------DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE 331 (392) Q Consensus 259 ~~~a~~v~~~~~~~~L~~lkd~-------~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~ 331 (392) .....++|||..+..|++...- .|..+ ..++.-.+++|.+ |++ ++. +|. ...++||. . T Consensus 158 ~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~----~~~G~ig~~~G~~-Vi~-s~~-~p~------~t~~i~~~--g 222 (275) T protein:vir:96 158 LEPMVLFVNPLDAGKLRASATDNFTRATLLGDNV----IVKGAFGEALGAI-IVR-SNK-IKE------GEAILAKR--G 222 (275) T ss_pred CCccEEEeCHHHHHHHHhcccccccccccccccc----eeccccceecCee-EEE-eCC-CCc------ceEEEEec--c Confidence 4556799999999999876321 11111 1233445677754 444 332 222 22466664 3 Q ss_pred ceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCC Q lcl|Aclame:pro 332 AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~ 388 (392) ++..+.+.+++++.++... +....+++.++++.++++|+++++++++++.--. T Consensus 223 A~~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 223 AVKLITKRDFFLETERHAS----HKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred ceeeeecCCcccccccchh----hcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 5566677778887766542 3457888999999999999999999987543322 No 115 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.55 E-value=2.3e-15 Score=100.86 Aligned_cols=262 Identities=13% Similarity=0.092 Sum_probs=178.7 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCC-cceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |....+.-+..++|+.+...+.+.+.+...+.+++..-. ..+. ..++.+|.......+..+.|+++.+. +..+.+.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-ccccccee Confidence 444444556788999999999988887777777766532 1221 12344554443345667888887764 46788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+....++++....+..++.+.+.+.++..+++..|..++....+++. .....++.+++++.. +...... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~~ 158 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDLE 158 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHH-hhccCCC Confidence 999999988899999998888888899999999999999999988776554332 334568899998764 4444446 Q ss_pred CceEEEcHHHHHHHHHhh------ccC-CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhce Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLK------DKD-GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAI 333 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lk------d~~-g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~ 333 (392) ..+++|||..+..|++-. .+. |..+ ..++.-.+++|. +|++. +. +|. ...++||. .++ T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~----~~~G~ig~~~G~-~Vi~s-~~-~p~------~t~~l~~~--gA~ 223 (274) T protein:vir:97 159 PMVLFVNPLDAGKLRGDASTNFTRATELGDDI----IVKGAFGEALGA-IIVRT-NK-LEA------GTAILAKK--GAV 223 (274) T ss_pred ceEEEeCHHHHHHHHhhhhhhccccCcccccc----eeccccceecCe-eEEEc-CC-CCc------ceEEEEeC--cce Confidence 678899999999997531 111 1221 123344567775 45443 32 332 22456653 356 Q ss_pred eeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 334 VLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) ..+.+.++.++..+... +....+++..++++++.+|.++++++++.+ ..+- T Consensus 224 ~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~-~~~~ 274 (274) T protein:vir:97 224 KLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSG-SLEM 274 (274) T ss_pred EeeecCCceeccccchh----hcccEEEEEEEEEEEEEcCCceEEEecCcc-cccC Confidence 66777888887776542 335688888999999999999999995533 2222 No 116 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.55 E-value=2.3e-15 Score=100.86 Aligned_cols=262 Identities=13% Similarity=0.092 Sum_probs=178.7 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCC-cceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |....+.-+..++|+.+...+.+.+.+...+.+++..-. ..+. ..++.+|.......+..+.|+++.+. +..+.+.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-ccccccee Confidence 444444556788999999999988887777777766532 1221 12344554443345667888887764 46788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+....++++....+..++.+.+.+.++..+++..|..++....+++. .....++.+++++.. +...... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~~ 158 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDLE 158 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHH-hhccCCC Confidence 999999988899999998888888899999999999999999988776554332 334568899998764 4444446 Q ss_pred CceEEEcHHHHHHHHHhh------ccC-CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhce Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLK------DKD-GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAI 333 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lk------d~~-g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~ 333 (392) ..+++|||..+..|++-. .+. |..+ ..++.-.+++|. +|++. +. +|. ...++||. .++ T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~----~~~G~ig~~~G~-~Vi~s-~~-~p~------~t~~l~~~--gA~ 223 (274) T protein:vir:94 159 PMVLFVNPLDAGKLRGDASTNFTRATELGDDI----IVKGAFGEALGA-IIVRT-NK-LEA------GTAILAKK--GAV 223 (274) T ss_pred ceEEEeCHHHHHHHHhhhhhhccccCcccccc----eeccccceecCe-eEEEc-CC-CCc------ceEEEEeC--cce Confidence 678899999999997531 111 1221 123344567775 45443 32 332 22456653 356 Q ss_pred eeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 334 VLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) ..+.+.++.++..+... +....+++..++++++.+|.++++++++.+ ..+- T Consensus 224 ~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~-~~~~ 274 (274) T protein:vir:94 224 KLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSG-SLEM 274 (274) T ss_pred EeeecCCceeccccchh----hcccEEEEEEEEEEEEEcCCceEEEecCcc-cccC Confidence 66777888887776542 335688888999999999999999995533 2222 No 117 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.52 E-value=4.3e-15 Score=99.35 Aligned_cols=265 Identities=12% Similarity=0.117 Sum_probs=176.9 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cC-CcceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |. .+.-+-.++|+.+.+.+.+...+...+.+++.+-.. .+ +..++.+|.....+.+.-+.|+.+.+. ...+.++. T Consensus 1 Ma--~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~-~~lt~~~~ 77 (270) T protein:vir:95 1 MT--QTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDT-TQMSMTTT 77 (270) T ss_pred CC--ceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccch-hhcccchh Confidence 22 222345668999999999998888888888766332 11 112334444444456667888888775 46788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--cchhhHHHHHHHHHHHhhhcccCC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--QAIKSLDDIKDVLNVKLDPAISPN 261 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~--~~~~~~d~~~~~~~~~~~~~~~~~ 261 (392) ....++.+..+.++++....+.-+....+.++++..+++++|+.++..+.+... ....+++++++++.. +....... T Consensus 78 ~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~~~~t~~~~~dA~~~-lgd~~~~~ 156 (270) T protein:vir:95 78 KVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTATVSADATGILDAIEV-FNSENDED 156 (270) T ss_pred eeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccccCCC Confidence 888999999999999988766556788888999999999999988776554432 345678888888764 44444556 Q ss_pred ceEEEcHHHHHHHHHhhccCC-ceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccc Q lcl|Aclame:pro 262 AILLTNQDGFNYLDKLKDKDG-KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRED 340 (392) Q Consensus 262 a~~v~~~~~~~~L~~lkd~~g-~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~ 340 (392) .+++|||.++..|++...-.+ ++- .....++.-.+++|.+ |++.++ .++. ...++|+ +.++..+...+ T Consensus 157 ~~i~vhs~~~~~Lrk~~~~~~~~~~-~~~~~~G~ig~~~G~~-Viv~s~-~~~~------~~~~l~~--~gAi~~~~~~~ 225 (270) T protein:vir:95 157 YVLYVNPKDYNKLVKSLFKVGGNVQ-DRAISKGDLVEIVGVS-DIVKSK-RVSE------NTAFLQR--YGAMEIVNKKK 225 (270) T ss_pred cEEEEcHHHHHHHHhhhcccccccc-cchhcccccceeccee-EEEeCC-CCCc------eeEEEEe--ccceeeeecCC Confidence 679999999999986431111 111 1112234455667754 444332 2221 2345665 34677777888 Q ss_pred eEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 341 MELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 341 ~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) +.++.++... +....+.+.++++.++.+|..+++++++.+.-+.- T Consensus 226 ~~vEtdRd~~----~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 226 PEAYTDFDIL----KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred ceeeeccchh----hcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 8888776542 34568888899999999999999999874333222 No 118 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.45 E-value=3e-14 Score=94.68 Aligned_cols=262 Identities=13% Similarity=0.093 Sum_probs=175.8 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCCcc-eeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) +....+.-+-.++|+.+...+.+.+.+...+.+++..-.- .+..| ++.+|.....+.+..+.|+...+. ++.+.... T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccch-hhccccee Confidence 3443444566789999988888888877777777665321 22112 344444443345667888877764 46788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+..+.++++....+..++.+.+.+.++..+++..|..++....++.. ....+++.+++++.. +...... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~~d~i~dA~~~-lgd~~~~ 158 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDLE 158 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hcccccc Confidence 888888899999999888777778888999999999999999988776554332 234578899988764 4444445 Q ss_pred CceEEEcHHHHHHHHHhh------ccC-CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhce Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLK------DKD-GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAI 333 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lk------d~~-g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~ 333 (392) ..+++|||..+..|++.. +++ |..+ ...+.-.+++|. +|++. +. +|. ...++||.- ++ T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~----~~~G~ig~~~G~-~Vi~s-~~-~p~------~t~~l~~~g--A~ 223 (274) T protein:vir:12 159 PMVLFINPLDAGKLRGDASTNFTRATELGDDI----IVKGAFGEALGA-IIVRS-NK-LEA------GTAILAKKG--AV 223 (274) T ss_pred ccEEEeCHHHHHHHHhhhhhhccccccccccc----eecccceeecCe-eEEEe-CC-CCc------ceEEEEecc--ce Confidence 667899999999997631 122 1111 123334557775 45443 32 332 124677643 45 Q ss_pred eeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 334 VLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) ..+.+.+++++.++... +....+++.+++++++.+|+.++++++ +++..+- T Consensus 224 ~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~-~~~~~~~ 274 (274) T protein:vir:12 224 KLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITK-GSGSLEM 274 (274) T ss_pred eeeecCCceeccccchh----hcccEEEeeeEEEEEEEcCCceEEEEc-CCccccC Confidence 56667888888777543 344688899999999999999999984 3333333 No 119 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.43 E-value=6.3e-14 Score=92.96 Aligned_cols=262 Identities=11% Similarity=0.070 Sum_probs=174.7 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCC-cceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) +....+.-+-.++|+.+...+.+.+.....+.+++.+-. ..+. ..++.+|.....+.+..+.|+...+. ++.+.... T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 333344445677899999899988888777777764432 1111 12344444443345667888877764 46788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+..+.++++....+..++...+.++++..+++..|..++....++.. .+..+++.+.+++.. +...... T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~-lgd~~~~ 158 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDK-FNDEDLE 158 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hcccccc Confidence 888888888899999988877778899999999999999999988776654432 234568888888764 4434445 Q ss_pred CceEEEcHHHHHHHHHhh------ccC-CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhce Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLK------DKD-GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAI 333 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lk------d~~-g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~ 333 (392) ..+++|||..+..|++.. +++ |..+ ..++.-.+++|.+ |+++ +.. |. ...++||.. ++ T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~----~~~G~ig~~~G~~-Vi~s-~~~-~~------~t~~l~~~g--A~ 223 (274) T protein:vir:96 159 PMVLFISPLDAGKLRGDATTNFTRATELGDDV----IVKGAFGEALGAV-IVRS-NKL-EA------GTAILAKKG--AV 223 (274) T ss_pred ccEEEeCHHHHHHHHhhccccccccccccccc----eeccccceecCeE-EEEe-CCC-CC------ceEEEEecc--ce Confidence 668899999999997631 121 1111 1233445677754 5443 322 21 234677754 45 Q ss_pred eeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 334 VLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) ..+.+.++.++.++.. .+....+++.+++++++++|++++++++..- .-+- T Consensus 224 ~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~-~~~~ 274 (274) T protein:vir:96 224 KLITKRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKITKGSG-SLEM 274 (274) T ss_pred eeeecCCccccccccc----ccccCEEEEeEEEEEEEEcCCcEEEEEcCCc-cccC Confidence 5666778888776654 2456788899999999999999999983321 1111 No 120 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.43 E-value=6.3e-14 Score=92.96 Aligned_cols=262 Identities=11% Similarity=0.070 Sum_probs=174.7 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee-ccCC-cceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP-VRTR-SGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) +....+.-+-.++|+.+...+.+.+.....+.+++.+-. ..+. ..++.+|.....+.+..+.|+...+. ++.+.... T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 333344445677899999899988888777777764432 1111 12344444443345667888877764 46788888 Q ss_pred EechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccC Q lcl|Aclame:pro 184 QYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISP 260 (392) Q Consensus 184 ~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 260 (392) ++..++.+..+.++++....+..++...+.++++..+++..|..++....++.. .+..+++.+.+++.. +...... T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~-lgd~~~~ 158 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDK-FNDEDLE 158 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hcccccc Confidence 888888888899999988877778899999999999999999988776654432 234568888888764 4434445 Q ss_pred CceEEEcHHHHHHHHHhh------ccC-CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhce Q lcl|Aclame:pro 261 NAILLTNQDGFNYLDKLK------DKD-GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAI 333 (392) Q Consensus 261 ~a~~v~~~~~~~~L~~lk------d~~-g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~ 333 (392) ..+++|||..+..|++.. +++ |..+ ..++.-.+++|.+ |+++ +.. |. ...++||.. ++ T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~----~~~G~ig~~~G~~-Vi~s-~~~-~~------~t~~l~~~g--A~ 223 (274) T protein:vir:95 159 PMVLFISPLDAGKLRGDATTNFTRATELGDDV----IVKGAFGEALGAV-IVRS-NKL-EA------GTAILAKKG--AV 223 (274) T ss_pred ccEEEeCHHHHHHHHhhccccccccccccccc----eeccccceecCeE-EEEe-CCC-CC------ceEEEEecc--ce Confidence 668899999999997631 121 1111 1233445677754 5443 322 21 234677754 45 Q ss_pred eeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 334 VLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) ..+.+.++.++.++.. .+....+++.+++++++++|++++++++..- .-+- T Consensus 224 ~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~-~~~~ 274 (274) T protein:vir:95 224 KLITKRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKITKGSG-SLEM 274 (274) T ss_pred eeeecCCccccccccc----ccccCEEEEeEEEEEEEEcCCcEEEEEcCCc-cccC Confidence 5666778888776654 2456788899999999999999999983321 1111 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.33 E-value=1.6e-12 Score=85.31 Aligned_cols=272 Identities=8% Similarity=0.054 Sum_probs=171.9 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc---cccccccccccccccceee Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF---AEITEMGEIPETDNPKFSN 182 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~~~E~~~~~~~~~~~~~~ 182 (392) |...+-..++.+.+..+...||+.+.+.+.|+.++.-..+.++...+.....-.+..+ .|..-....++ +..+|.+ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~-~~~t~~~ 79 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGK-AAATFTK 79 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCccc-cccccce Confidence 3334445566778888889999999999999999888777776666655443333222 22111112233 3568999 Q ss_pred EEechhheeeehhhHHHHHhh--h-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------ccch Q lcl|Aclame:pro 183 VQYAVKDRAGILPLSRSLLQD--S-DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------------------KQAI 240 (392) Q Consensus 183 v~~~~~~i~~~~~iS~e~l~d--s-~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------------------~~~~ 240 (392) ++...+.+.+.+.|-+.+.+- + ..+...+-.+.-.++++.+.+..+++|..++. .++. T Consensus 80 ~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~ 159 (310) T protein:vir:97 80 VNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSA 159 (310) T ss_pred eeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCC Confidence 999999999999999876542 2 34555565677788999999999999754321 1233 Q ss_pred hhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHh-hccCCceeecccc-cCCcccceecccceEEecCccccc--cc Q lcl|Aclame:pro 241 KSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQSDP-TQKNKKLFAGTNPVVVVSNRFLKS--KG 316 (392) Q Consensus 241 ~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~l-kd~~g~~l~~~~~-~~~~~~~~~g~~pv~~~~~~~~~~--~~ 316 (392) .+.|++..++. .+......+.+++|||+++.+|+.+ +..+++.++.+.. ..|.+...|++-|++.. +.++.+ .+ T Consensus 160 ~t~d~LDeLl~-~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~-d~ip~~~~~~ 237 (310) T protein:vir:97 160 ISFAILDELMD-LVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRN-DYIPTNQTKG 237 (310) T ss_pred CCHHHHHHHHH-HHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEe-CccCCCcccc Confidence 45666666554 3333344667899999998877754 3444555544322 23444446666666543 333222 12 Q ss_pred ccCCcceEEEEehhh-----ceeee---eccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEE-ecc Q lcl|Aclame:pro 317 TTAKKAPLIIGDLKE-----AIVLF---KREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDL 383 (392) Q Consensus 317 ~~~~~~~~~~Gd~~~-----~~~~~---~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~ 383 (392) ...+...|+...|.. ++... ...++++..-.+.. .++...++++++++.++.+|+|+.+|. +.. T Consensus 238 ~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~---~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 238 GTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESE---DSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred ccCCceeEEEEeeCccccccceeccccCCccceeEEeCCccc---CCcceeEEEEEeeeEEEecccceeeeccccC Confidence 345566777655542 32111 12356666644322 256678999999999999999999996 555 No 122 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.25 E-value=4.7e-13 Score=88.15 Aligned_cols=227 Identities=11% Similarity=0.059 Sum_probs=155.9 Q ss_pred cceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 140 VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKK 219 (392) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~ 219 (392) -+-+.+ + -++.+|.. -+.+.-+.||.+++. ...+++..+...++.+..+.|+++....+.-|......+.++.. T Consensus 1 ~~~~~~-G--dtit~P~~--iGda~~v~eG~~i~~-~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINL-A--NLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLS 74 (231) T ss_pred CccccC-C--ceEEeccc--ccchhhhcCCCcCCh-hhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHH Confidence 111111 1 23444443 346678999998885 46888999999999999999999998877777889999999999 Q ss_pred HHHHHHHHHhhcccccc--ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCc--eeecccccCCcc Q lcl|Aclame:pro 220 SKVTRNVLILGVIEKLT--KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGK--YILQSDPTQKNK 295 (392) Q Consensus 220 ~~~~~d~~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~--~l~~~~~~~~~~ 295 (392) +++++|..++....+.+ ..+..+++.+.+++.. +......+.+++|||.++..||+..+.... .+-..-..+|.- T Consensus 75 iA~kvD~di~~~~~~a~l~~~~~~t~d~i~~A~~~-fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~i 153 (231) T protein:vir:73 75 LANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDI-FNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTY 153 (231) T ss_pred HHHhhhHHHHHhhccccccccccccHHHHHHHHHH-hccccccceEEEEcchHHHhhhhccchhhhhhhhccceeeeccc Confidence 99999999887665544 3455788999888764 444455667899999999999985433211 111111224455 Q ss_pred cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~a 375 (392) ..++|. +|+++ +. +|..+. -...+++. +.++.++...+++++.++... .....+.+.++++.++.+|.. T Consensus 154 G~i~G~-~Vi~S-~~-~~~~~~--~~~~~i~~--~gAl~~~~k~~~~vEtdRd~~----~k~~~i~~~~~y~v~l~~~~~ 222 (231) T protein:vir:73 154 ADVLGA-QIVRS-KK-LAEGSA--LMFKIVSN--SPALKLVLKRGVQVETDRDIV----TKTTVITADEHYAAYLYDLTK 222 (231) T ss_pred ceEcce-EEEEc-CC-CCCCce--eeeeEEee--ccceeeeecccceeecccccc----ccccEEEEeEEEEEEEEcCcc Confidence 567775 44443 32 232211 11122222 346778888898888877542 445788999999999999999 Q ss_pred eEEEEeccc Q lcl|Aclame:pro 376 AVYGEIDLS 384 (392) Q Consensus 376 f~~l~~~~~ 384 (392) +++++++.. T Consensus 223 vv~~t~~g~ 231 (231) T protein:vir:73 223 VVNITFTGV 231 (231) T ss_pred EEEEEeecC Confidence 999999977 No 123 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.23 E-value=2.7e-12 Score=83.99 Aligned_cols=354 Identities=15% Similarity=0.087 Sum_probs=166.1 Q ss_pred CCHHHHHHHHHHHH--------------HHHHHHHHh---hh------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSKELRELLAKLEG--------------KKEEVRSLM---GE------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETE 57 (392) Q Consensus 1 M~kel~el~~~~~~--------------~~~e~~~~~---~~------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~ 57 (392) =++-++.+...+.. ..+..|+.. .. +..+.-.....++.++..+... ....... T Consensus 8 ~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~~~~~~~~~~E~Rs---~~~~i~~ 84 (410) T protein:vir:83 8 SDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQAQEVNRIAFETRS---KGQAVDA 84 (410) T ss_pred hhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhhhHHHHHHHHHHHHHHH---HHHHHHh Confidence 11112222222210 001111110 00 1111111122223322222211 1111111 Q ss_pred HhhccccccccccchhhHHHHH--HHHHHhcchhhHHHHHHHHhhhh-hhhhccccccccceecchhhhhHHHHhHHhhh Q lcl|Aclame:pro 58 ERNNGREVETRNVDGEMEYRDV--FMKALRNKPLNAEEREFLEDDLE-QRAMSGLTGEDGGLVIPQDIQTQINELARSFD 134 (392) Q Consensus 58 ~~~~~~~~~~~~~~~~~~~~~a--~~~~~~~~~~~~~~~~~~~~~~~-~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~ 134 (392) .. ......+.....++|++ +.+.+-+....... ..+...+ .++.....+.+...+||.++....++.+.+.. T Consensus 85 ~~---~~~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~--A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r 159 (410) T protein:vir:83 85 AI---SAMRGSPVGTEVEYRSAGEYMLDMWNSAQGNAS--AADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSAR 159 (410) T ss_pred hh---ccCcCCCCCCCcccccHHHHHHHHhccCCchHH--HHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhcc Confidence 11 11122222233355544 44444322111111 1111122 12333333334455678789999999999999 Q ss_pred hhhhhcceeeccCCcceeEEEeecCCccccc-------cccccccccccccceeeEEechhheeeehhhHHHHHhhhHHH Q lcl|Aclame:pro 135 ALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE-------ITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN 207 (392) Q Consensus 135 ~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~ 207 (392) +|.+++...|.++.+..|++ .+..+.++- -.||+..+ ..+.+|+.-+...++++++..+||+.++-|.+. T Consensus 160 ~i~slf~tLP~~g~T~eY~v--~t~~~tV~~q~~~~kqa~EGd~L~-~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~ 236 (410) T protein:vir:83 160 PLVSTLGTLPLNNATFYRPI--VSQRPAVGLQGVAGGASDEKTELD-SQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPS 236 (410) T ss_pred chhhhhhhCCCCCCeeEEee--eccccccccccccccccccccccc-ccceeeeeccceeehhcCcccccceeeecCChh Confidence 99999998888877666644 343333322 22444443 335566666667999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhc----cccccccchhhHHHHHHHHHHHh---hhc--ccCCceEEEcHHHHHHHHHhh Q lcl|Aclame:pro 208 ILKYVTKWLGKKSKVTRNVLILGV----IEKLTKQAIKSLDDIKDVLNVKL---DPA--ISPNAILLTNQDGFNYLDKLK 278 (392) Q Consensus 208 l~~~v~~~l~~~~~~~~d~~~~~~----~~~~~~~~~~~~d~~~~~~~~~~---~~~--~~~~a~~v~~~~~~~~L~~lk 278 (392) ..+...+.|..+++.+-+...-.. .......+..+.+.++.++.... ..+ ...-..+.++|+.+..+.++- T Consensus 237 ~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f 316 (410) T protein:vir:83 237 ALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLF 316 (410) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhcccee Confidence 889999999888877665532211 11122233334455444433222 222 112236789999976554321 Q ss_pred ccCCceeeccc------c-cCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchh Q lcl|Aclame:pro 279 DKDGKYILQSD------P-TQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGK 351 (392) Q Consensus 279 d~~g~~l~~~~------~-~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~ 351 (392) -++++.|... . ..+-.+.+++ .||++..+ +..+..+|-|.. ++..++..+-.++..+..- T Consensus 317 -~~~~~~~~dt~Gfg~~~lg~gi~G~~~~-ipVvm~~~---------a~AgTA~f~~~~-Ai~~~eS~~gp~qL~d~~i- 383 (410) T protein:vir:83 317 -APVNPTNAHSTGFEAGRFGQGVMGSISG-IPVVMSAA---------LGSGDAYLFSTA-AIECFEQRVGTLQVVEPSV- 383 (410) T ss_pred -eccCCCCcccccccccccccchhhhhcc-cceEEecC---------CCcCeeeEeccc-eeeeeecCCceeEeeCCch- Confidence 1122222111 1 1122233444 46665332 122233454653 5666666652333333211 Q ss_pred hhhcCceeEEEEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 352 AFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 352 ~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~ 382 (392) .|...=.. .++.+.+..+++++-|.=. T Consensus 384 ---~nLt~~yS-gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 384 ---FGLQVAYA-GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred ---hhhhhhhe-eeeeeccccccceeeeccC Confidence 11111111 5567888888888755422 No 124 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.20 E-value=2.1e-11 Score=79.14 Aligned_cols=288 Identities=11% Similarity=0.040 Sum_probs=157.3 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) +.+.......++...........+... . +|.+++++....+++.+.+.++++++++++++.+.++.+..... +.-.. T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~-l-~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~-G~r~~ 77 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAE-L-DGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGV-PRLSG 77 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccc-c-CceeecHHHHHHHHHHHhhccchhhhcceeeccccccccccccc-ceeec Confidence 111111122222222233333333222 2 34556777778999999999999999999998887777543322 11111 Q ss_pred cccccccccccccccceeeEEec-hhheeeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYA-VKDRAGILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK- 237 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~-~~~i~~~~~iS~e~l~ds----~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~- 237 (392) --..|++..++...++...|.+. .+++.....+..+.+.+. ...+++.|.+.++++++.-++...+.|...... T Consensus 78 r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~ 157 (360) T protein:vir:99 78 HTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNL 157 (360) T ss_pred cccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhccc Confidence 11223333332223344455552 345555556666666653 235679999999999999877766665433210 Q ss_pred ---cchh--------------------------------------------------------hHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 238 ---QAIK--------------------------------------------------------SLDDIKDVLNVKLDPAI 258 (392) Q Consensus 238 ---~~~~--------------------------------------------------------~~d~~~~~~~~~~~~~~ 258 (392) .+.. ....+...+...++..| T Consensus 158 ~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~ky 237 (360) T protein:vir:99 158 QSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRY 237 (360) T ss_pred ccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhh Confidence 0000 01223444556778888 Q ss_pred cCC----ceEEEcHHHHHH-HHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhce Q lcl|Aclame:pro 259 SPN----AILLTNQDGFNY-LDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAI 333 (392) Q Consensus 259 ~~~----a~~v~~~~~~~~-L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~ 333 (392) +.+ -+|+||+.+... .+.|.+-+. +|--.-+.+++....+| +|+..+. .+|. ..++|-++++. T Consensus 238 r~~~~~~~~~~~s~~~~~~yr~~L~~R~t-~LGd~~l~g~~~~~~~G-ipi~~v~--~~pd-------~~~mlT~p~NL- 305 (360) T protein:vir:99 238 RESDAYSPVLMTSPNQVQSYTMSLTERED-PLGSAVIFGDSDITPFS-YDLVGVN--GFPD-------EYMMFTDPNNL- 305 (360) T ss_pred hcCcccceEEEccCchHHHHHHHHhccCc-ccchhheecccccccce-eeeEEcC--CCCC-------CceEEeccCce- Confidence 753 389999998544 344543332 22111122233345567 4554433 3332 35889999885 Q ss_pred eeeeccceEEEEeccchhhhhcCc--eeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 334 VLFKREDMELASTDVGGKAFTRNT--LDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~f~~~~--~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+.+..+++..+.+... ..... +.......+|+.+.+++|.+.++= ...|.+ T Consensus 306 i~g~~~~iri~~~~e~~~-~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~-----~~~~~~ 360 (360) T protein:vir:99 306 AFGLYEEMELDQSTDTDK-VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTD-----LETPTA 360 (360) T ss_pred eEEeeeeeEEeecccchh-hhhhceeeeEEEEEEeeEEEEecccEEEEec-----CCCCCC Confidence 466678888876555432 22223 223345668889999999998872 222233 No 125 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.96 E-value=1.1e-10 Score=75.28 Aligned_cols=271 Identities=13% Similarity=0.096 Sum_probs=149.7 Q ss_pred hhhhhcccccccccee-------cchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC---Cccccccccccc Q lcl|Aclame:pro 102 EQRAMSGLTGEDGGLV-------IPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD---MIPFAEITEMGE 171 (392) Q Consensus 102 ~~~a~~~~~~~~gg~~-------iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~---~~~~~~~~E~~~ 171 (392) ...-....+..+|+.+ -|+-+...|.+.+...-..-.|.+...- ...+.+.+..... ...+..++|+++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a-~~~~~v~f~~~~p~~~~~d~e~VaEggE 79 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGA-NPNGVVAYNEGNPSFLEDDVADVAEFGE 79 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccc-cccceeEEEecccccccCcHhhccCccc Confidence 0000111111223322 2555555666666544433333443222 1233444433321 246778999999 Q ss_pred cccccccceeeEEe-chhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cchh--- Q lcl|Aclame:pro 172 IPETDNPKFSNVQY-AVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------QAIK--- 241 (392) Q Consensus 172 ~~~~~~~~~~~v~~-~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~------~~~~--- 241 (392) +|.. .++++.-.+ ..+|.+.-+.||+|++.....+...-..+.+++.+.+..|...+..+..... .++. T Consensus 80 iP~~-~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~ 158 (318) T protein:vir:10 80 IPVS-AGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGG 158 (318) T ss_pred cccc-CCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcc Confidence 9965 467766555 5679999999999999999999999999999999999998876654422110 0100 Q ss_pred -hHHHHHHHH---H-------------HHhhhcccCCceEEEcHHHHHHHHH------hhccCCceeec-ccccCCcccc Q lcl|Aclame:pro 242 -SLDDIKDVL---N-------------VKLDPAISPNAILLTNQDGFNYLDK------LKDKDGKYILQ-SDPTQKNKKL 297 (392) Q Consensus 242 -~~d~~~~~~---~-------------~~~~~~~~~~a~~v~~~~~~~~L~~------lkd~~g~~l~~-~~~~~~~~~~ 297 (392) ...++.+++ . ..++-.|.+ ..+||||..|..|++ +-..++.+++. +.+++.-+.. T Consensus 159 ~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~p-dtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~ 237 (318) T protein:vir:10 159 KVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIP-DTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGS 237 (318) T ss_pred cccccchhhhhhhhhhhhhhhhhhhhhhhhccCccc-eeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccce Confidence 011222221 1 111223333 458999999999943 22334444432 2334444556 Q ss_pred eecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccc--hhhh-hcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 298 FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVG--GKAF-TRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 298 ~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~--~~~f-~~~~~~~~~~~r~~~~v~~~~ 374 (392) ++|.. ++.++..|... .++.+=...-.+.+.++++.+-.... +.+. .+..+.+|+..+....|.+|+ T Consensus 238 ~lGl~---vi~s~~~p~~~-------alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~Pk 307 (318) T protein:vir:10 238 VMGLN---VIRSRTFPIDR-------VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPK 307 (318) T ss_pred eeceE---EeecCccCCCe-------eEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcc Confidence 77754 44456666432 23333222222335556655443321 1111 234477888888999999999 Q ss_pred ceEEEEecccCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQP 390 (392) Q Consensus 375 af~~l~~~~~a~~~~~ 390 (392) |+++|+ -.-|| T Consensus 308 A~~~it-----gi~~~ 318 (318) T protein:vir:10 308 AALWLT-----GIVTP 318 (318) T ss_pred eeEEEe-----eccCC Confidence 999998 22223 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.84 E-value=9.9e-10 Score=69.95 Aligned_cols=257 Identities=12% Similarity=0.034 Sum_probs=143.1 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceee--ccCCcceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |.. ..++|+.+...+++.+++.+.+.++++... ......++.+|+........+..+++.... ...+...+ T Consensus 1 MA~------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) T protein:vir:79 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCc-cccccceE Confidence 221 125799999999999999888888764421 111112455555444444446667665443 34566677 Q ss_pred Eechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------c-cchhhHHHHHHHHHHHh Q lcl|Aclame:pro 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------K-QAIKSLDDIKDVLNVKL 254 (392) Q Consensus 184 ~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------~-~~~~~~d~~~~~~~~~~ 254 (392) ++...+. +.-+.|++.-...+..++.++ .+....+++.++|..++......+ + .....++.+.++. ..+ T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KEL 151 (273) T ss_pred EEEEeeecccceeeccHHHHhhcccHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHH-HHh Confidence 7776553 344466664444556678875 466788899999986654332211 1 1123355555544 455 Q ss_pred hhcccC--CceEEEcHHHHHHHHHhhcc-CCceeec--ccccCCcccceecccceEEecCcccccccccCCcceEEEEeh Q lcl|Aclame:pro 255 DPAISP--NAILLTNQDGFNYLDKLKDK-DGKYILQ--SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) Q Consensus 255 ~~~~~~--~a~~v~~~~~~~~L~~lkd~-~g~~l~~--~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~ 329 (392) +....| +.++|++|..+..|.+..+- ....... ..+..|....++|.. |+. +..+|..+ +. ..+.+- T Consensus 152 d~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~-i~~--s~~lp~~~---~~-~~~a~~- 223 (273) T protein:vir:79 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR-IVE--SNNLRDTD---DE-QFVAFH- 223 (273) T ss_pred hhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceE-EEe--cccccccC---ce-EEEEEe- Confidence 555543 34789999999988654321 1111111 112244455677753 433 22334321 11 122222 Q ss_pred hhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) +.+..... ....++..+.. ..| ...+++.+++|+++++|++++.++.+.+ T Consensus 224 ~~A~~~a~-~~~~~e~~r~~-~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 224 PSAAAYVS-QIDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccceeeee-ehhhhhcccCc-ccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 22332222 22233333322 223 4578899999999999999999987777 No 127 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.82 E-value=1.6e-09 Score=68.81 Aligned_cols=257 Identities=12% Similarity=0.013 Sum_probs=139.1 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCCc-ceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |.. -.++|+.+...+++.+++.+.+.+++..-.- .... .++.+++........+..+++.... +..+-..+ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceE Confidence 211 1247999999999999998888887644211 1111 2344554443333445555554432 23455666 Q ss_pred Eechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----c----ccchhhHHHHHHHHHHHh Q lcl|Aclame:pro 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL----T----KQAIKSLDDIKDVLNVKL 254 (392) Q Consensus 184 ~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~----~----~~~~~~~d~~~~~~~~~~ 254 (392) ++...+. +.-+.|++.-...+..++.+ +.+....+++.+.|..++...... . ......++.+.++. ..+ T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKAL-KEL 151 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHH-HHh Confidence 6655443 33345665444445567877 456678899999998765532221 1 11223456666654 345 Q ss_pred hhcccC--CceEEEcHHHHHHHHHhhccC-Cceee-c-ccccCCcccceecccceEEecCcccccccccCCcceEEEEeh Q lcl|Aclame:pro 255 DPAISP--NAILLTNQDGFNYLDKLKDKD-GKYIL-Q-SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) Q Consensus 255 ~~~~~~--~a~~v~~~~~~~~L~~lkd~~-g~~l~-~-~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~ 329 (392) +....| +.+++++|..+..|.+...-- ..... . ..+..|....+.|.. |+.+ + -+|... + ..++.+.- T Consensus 152 d~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~-v~~s-~-~lp~~~---~-~~~~~~~~ 224 (273) T protein:vir:10 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR-IVES-N-NLRDTD---D-EQFVAFHP 224 (273) T ss_pred hhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceE-EEEe-c-ccccCC---c-cEEEEEec Confidence 555543 457899999999986543211 01111 1 112234455677753 4332 2 233321 1 12333332 Q ss_pred hhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) .+..... ....++..... ..| ...+++.+++|+++++|++++.++.+.+ T Consensus 225 -~A~~~a~-q~~~~e~~r~~-~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 225 -SAAAYVS-QIDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -cceeeee-eeehhhcccCC-Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 2332222 22233333222 233 3567888999999999999999987777 No 128 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.82 E-value=1.6e-09 Score=68.81 Aligned_cols=257 Identities=12% Similarity=0.013 Sum_probs=139.1 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-cCCc-ceeEEEeecCCccccccccccccccccccceeeE Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-RTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v 183 (392) |.. -.++|+.+...+++.+++.+.+.+++..-.- .... .++.+++........+..+++.... +..+-..+ T Consensus 1 MA~------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) T protein:vir:10 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) T ss_pred Ccc------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceE Confidence 211 1247999999999999998888887644211 1111 2344554443333445555554432 23455666 Q ss_pred Eechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----c----ccchhhHHHHHHHHHHHh Q lcl|Aclame:pro 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL----T----KQAIKSLDDIKDVLNVKL 254 (392) Q Consensus 184 ~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~----~----~~~~~~~d~~~~~~~~~~ 254 (392) ++...+. +.-+.|++.-...+..++.+ +.+....+++.+.|..++...... . ......++.+.++. ..+ T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~-~~l 151 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKAL-KEL 151 (273) T ss_pred EEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHH-HHh Confidence 6655443 33345665444445567877 456678899999998765532221 1 11223456666654 345 Q ss_pred hhcccC--CceEEEcHHHHHHHHHhhccC-Cceee-c-ccccCCcccceecccceEEecCcccccccccCCcceEEEEeh Q lcl|Aclame:pro 255 DPAISP--NAILLTNQDGFNYLDKLKDKD-GKYIL-Q-SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) Q Consensus 255 ~~~~~~--~a~~v~~~~~~~~L~~lkd~~-g~~l~-~-~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~ 329 (392) +....| +.+++++|..+..|.+...-- ..... . ..+..|....+.|.. |+.+ + -+|... + ..++.+.- T Consensus 152 d~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~-v~~s-~-~lp~~~---~-~~~~~~~~ 224 (273) T protein:vir:10 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR-IVES-N-NLRDTD---D-EQFVAFHP 224 (273) T ss_pred hhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceE-EEEe-c-ccccCC---c-cEEEEEec Confidence 555543 457899999999986543211 01111 1 112234455677753 4332 2 233321 1 12333332 Q ss_pred hhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) .+..... ....++..... ..| ...+++.+++|+++++|++++.++.+.+ T Consensus 225 -~A~~~a~-q~~~~e~~r~~-~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 225 -SAAAYVS-QIDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -cceeeee-eeehhhcccCC-Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 2332222 22233333222 233 3567888999999999999999987777 No 129 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.70 E-value=2e-09 Score=68.27 Aligned_cols=280 Identities=14% Similarity=0.074 Sum_probs=149.3 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccc----eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGG----LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg----~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 159 (392) +-+. .....-+...+.+..+| ..| +.+..++.......+.++++..++.+.+++ .+.++ ..+ T Consensus 1 ma~~-----------~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~rti~~G~-sv~~~-~iG 66 (347) T protein:vir:94 1 MANM-----------NGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVRSIQSGK-SAQFP-VLG 66 (347) T ss_pred CCcc-----------ccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhheeccccc-eEEee-ecc Confidence 0000 00000001111111111 233 788999999999999999999998887533 33444 455 Q ss_pred Cccccccccccccccc-cccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc----- Q lcl|Aclame:pro 160 MIPFAEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI----- 232 (392) Q Consensus 160 ~~~~~~~~E~~~~~~~-~~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~----- 232 (392) ...+.....|.+...+ ..+..+++++...++ +....|.+-=-.++.+++.+.+.++...++++..|+.++..+ T Consensus 67 ~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~ 146 (347) T protein:vir:94 67 RTKAAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCN 146 (347) T ss_pred ceeEeeeecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 5667777777665322 234566666655443 222233322222345678889999999999999998775311 Q ss_pred ------------ccc---------c------ccchhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhcc-CC Q lcl|Aclame:pro 233 ------------EKL---------T------KQAIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNYLDKLKDK-DG 282 (392) Q Consensus 233 ------------~~~---------~------~~~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~-~g 282 (392) +.. + ..+...|+.+.++.. .|+...-+. -++|++|..|..|.+..+. .+ T Consensus 147 ~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~-~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~ 225 (347) T protein:vir:94 147 LPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARA-KLTGNYVPSSDRVFYTTPDNYSAILAALMPNAA 225 (347) T ss_pred cccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHH-HhhhcCCCCCCCEEEeChHHHHHHHHhhccccc Confidence 000 0 011233566666543 444444443 3557789999888654332 33 Q ss_pred ceeecccccCCcccceecccceEEecCcccccccc--cCCc---------------ceEEEEehhh---------ceeee Q lcl|Aclame:pro 283 KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT--TAKK---------------APLIIGDLKE---------AIVLF 336 (392) Q Consensus 283 ~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~--~~~~---------------~~~~~Gd~~~---------~~~~~ 336 (392) .+-...+...+.-..+.|.+ |+.+ +.++....+ ..+. ..-+=+||++ ++..+ T Consensus 226 ~~~~~~~~~~G~V~~v~G~~-V~~S-n~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv 303 (347) T protein:vir:94 226 NYQALIDPSTGSIRNVMGFE-VIEV-PHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTV 303 (347) T ss_pred ccccccccccceeEEeeceE-EEEc-CccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhh Confidence 33333334445556677754 4433 333221100 0110 0012233333 23333 Q ss_pred eccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 337 KREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 337 ~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) .-.+++++..+.. .++. ..+.+.+-+|..+.||++.+.++++.+ T Consensus 304 ~~~~~~~e~~~~~--~~~~--~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 304 KLKDMALERARRA--NFQA--DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhcccceeeeech--hhhh--hhhhhhhhhcCcccccceeEEEEecCC Confidence 4455556665432 2222 356677778999999999999998877 No 130 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.70 E-value=2.5e-09 Score=67.73 Aligned_cols=282 Identities=11% Similarity=0.003 Sum_probs=149.4 Q ss_pred HHHHhhhhhhhhccccccccce---ecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccc Q lcl|Aclame:pro 95 EFLEDDLEQRAMSGLTGEDGGL---VIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGE 171 (392) Q Consensus 95 ~~~~~~~~~~a~~~~~~~~gg~---~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~ 171 (392) +.-.....+.+...+.+..++- +--+.+..++.......+.++++.++.++.+++ .+.++ ..+...+.....+.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~-sv~~~-~iG~~~~~~~~~g~~ 78 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFP-VMGRTKGYYLAPGEN 78 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcc-eEEEe-eecceeeeeeccccC Confidence 0000000111112222222221 122788889999999999999999988876533 33444 344455566666555 Q ss_pred cccc-cccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------------- Q lcl|Aclame:pro 172 IPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI----------------- 232 (392) Q Consensus 172 ~~~~-~~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~----------------- 232 (392) .... ..+..+++++...++ +.-..|.+-=.-++.+++.+.+.++..+++++..|..++..+ T Consensus 79 l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:88 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) T ss_pred CCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCc Confidence 3321 124556666655554 223334333333345678888999999999999998765321 Q ss_pred ccccc--------------cchhhHHHHHHHHHHHhhhcccC--CceEEEcHHHHHHHHHhhcc-CCceeecccccCCcc Q lcl|Aclame:pro 233 EKLTK--------------QAIKSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDK-DGKYILQSDPTQKNK 295 (392) Q Consensus 233 ~~~~~--------------~~~~~~d~~~~~~~~~~~~~~~~--~a~~v~~~~~~~~L~~lkd~-~g~~l~~~~~~~~~~ 295 (392) ++... .+...++.+.++.. .++...-| +-++|++|..|..|.+.... ...+.-......+.. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~-~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) T protein:vir:88 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARA-RLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHH-HHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhccee Confidence 11000 01123566666543 45544444 34779999999988653322 223332223444555 Q ss_pred cceecccceEEecCccccccccc-----------------CCcceEEEEehhhceeee---------eccceEEEEeccc Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTT-----------------AKKAPLIIGDLKEAIVLF---------KREDMELASTDVG 349 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~-----------------~~~~~~~~Gd~~~~~~~~---------~~~~~~~~~~~~~ 349 (392) ..++|.. |+++.+ .+-..... .....-+-+||++...++ .-.+++++..+.. T Consensus 238 g~i~G~~-V~~s~n-lp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~ 315 (347) T protein:vir:88 238 RNVMGFE-VIEVPH-LTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) T ss_pred eeeccce-EEEeec-ccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech Confidence 5677764 444332 21100000 001111334454432222 2333445554432 Q ss_pred hhhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 350 GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 350 ~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) .+|. ..+++.+.+|.+++||++.+.++++.+| T Consensus 316 -~~~~---d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 316 -EFQA---DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred -hhHH---HHhhhhhhhcCceeccceEEEEEeCCCC Confidence 2333 3677888899999999999999988777 No 131 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.66 E-value=6.5e-09 Score=65.47 Aligned_cols=279 Identities=11% Similarity=0.030 Sum_probs=149.4 Q ss_pred HHHHhhhhhhhhcccccccc-c---eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccc Q lcl|Aclame:pro 95 EFLEDDLEQRAMSGLTGEDG-G---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMG 170 (392) Q Consensus 95 ~~~~~~~~~~a~~~~~~~~g-g---~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~ 170 (392) +.......+.+........| | .+--+.+..++.......+.++++.+++.+.+++ .+.+++ .+...+.....|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gk-s~~~~~-iG~~~~~~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGK-SAQFPV-LGRTQAAYLAPGE 78 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccc-eEEEee-ecceEEEeeecCC Confidence 00000001111111111111 1 2223788899999999999999999999988643 444443 4566777777776 Q ss_pred cccccc-ccceee--EEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------------- Q lcl|Aclame:pro 171 EIPETD-NPKFSN--VQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------------- 233 (392) Q Consensus 171 ~~~~~~-~~~~~~--v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~-------------- 233 (392) +...+. .+...+ ++++-.+++.+ .|.+-=--++++++.+.+.+++.+++++..|+.++..+. T Consensus 79 ~l~~~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~ 157 (345) T protein:vir:22 79 NLDDKRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIE 157 (345) T ss_pred CCCCCCCCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 643221 245566 44444333332 222211222456889999999999999999987653110 Q ss_pred ---cc--------c-------ccchhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhccC-CceeecccccC Q lcl|Aclame:pro 234 ---KL--------T-------KQAIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNYLDKLKDKD-GKYILQSDPTQ 292 (392) Q Consensus 234 ---~~--------~-------~~~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~-g~~l~~~~~~~ 292 (392) .. + ..+...++.+.++. ..++...-+. -++|++|..|..|.+-+.-+ ..+.-...... T Consensus 158 ~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~-~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~ 236 (345) T protein:vir:22 158 GLGTATVIETTQNKAALTDQVALGKEIIAALTKAR-AALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEK 236 (345) T ss_pred ccccccccccccccccccccccCHHHHHHHHHHHH-HHhhhcCCCccCCEEEeChHHHHHHhcccccccccccccccccc Confidence 00 0 01123356666553 3455555443 36799999999886543322 22222212223 Q ss_pred CcccceecccceEEecCccccccc------------------------ccCCcceEEEEehhhceeeeeccceEEEEecc Q lcl|Aclame:pro 293 KNKKLFAGTNPVVVVSNRFLKSKG------------------------TTAKKAPLIIGDLKEAIVLFKREDMELASTDV 348 (392) Q Consensus 293 ~~~~~~~g~~pv~~~~~~~~~~~~------------------------~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~ 348 (392) |.-..+.|.+ |+.+ +.++.... ......+.+|... .++..+...+++++..+. T Consensus 237 G~V~~i~G~~-V~~s-n~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~-~A~~~v~~~~~~~e~~r~ 313 (345) T protein:vir:22 237 GSIRNVMGFE-VVEV-PHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHR-SAVGTVKLRDLALERARR 313 (345) T ss_pred ceEEEEeceE-EEec-ccccccccCccccCcccccccccccccceeeeeccCceEEEEEeh-hheeeeeeecceeeeeec Confidence 3345567754 3332 22221000 0001122333333 345455555666777664 Q ss_pred chhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 349 ~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) . .+|. ..+++.+-+|.++.||++.+.+++|.- T Consensus 314 ~-~~~~---d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 314 A-NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred h-hHHH---HHHHHHHhcCCcccccceeEEEEEeeC Confidence 3 3443 366777778999999999998887765 No 132 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.65 E-value=6.1e-09 Score=65.64 Aligned_cols=280 Identities=10% Similarity=0.030 Sum_probs=146.9 Q ss_pred Hhhhhhhhhcc-ccccccc-eecc-hhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccc Q lcl|Aclame:pro 98 EDDLEQRAMSG-LTGEDGG-LVIP-QDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPE 174 (392) Q Consensus 98 ~~~~~~~a~~~-~~~~~gg-~~iP-~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~ 174 (392) .+-.-...++. .....++ +.++ +.+..++.......+.++++..++++.++. ++.+++ .+...+....-+++... T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~-s~~~~~-iG~~~~~~~~~g~~l~~ 78 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTN-QLRVDR-VGASTIAGRKAGEELVV 78 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccc-eEEEee-ecceeeeeecCCCCCCC Confidence 00000000110 0111122 2344 888999999999999999999999987643 444554 45566666766665543 Q ss_pred ccccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc----ccc--------------- Q lcl|Aclame:pro 175 TDNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV----IEK--------------- 234 (392) Q Consensus 175 ~~~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~----~~~--------------- 234 (392) +.++-+++++....+ +....|.+-=--++.+++.+.+.+++.+++++..|+.++.. ... T Consensus 79 -~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~ 157 (334) T protein:vir:80 79 -QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGIL 157 (334) T ss_pred -CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcc Confidence 334556666665552 33333333222334578999999999999999999876422 000 Q ss_pred ------cc-ccchhhHHHHHHHH---HHHhhhcccC-----CceEEEcHHHHHHHHHhhccCCc-eeecc---cccCCcc Q lcl|Aclame:pro 235 ------LT-KQAIKSLDDIKDVL---NVKLDPAISP-----NAILLTNQDGFNYLDKLKDKDGK-YILQS---DPTQKNK 295 (392) Q Consensus 235 ------~~-~~~~~~~d~~~~~~---~~~~~~~~~~-----~a~~v~~~~~~~~L~~lkd~~g~-~l~~~---~~~~~~~ 295 (392) .+ .......+.+.+++ ...+....-+ +-+.|++|..|..|..-+.--.+ |.-.+ .+..+.- T Consensus 158 ~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i 237 (334) T protein:vir:80 158 LPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRI 237 (334) T ss_pred eeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeE Confidence 00 01112233333322 2233333333 35789999999998653221111 11011 1222333 Q ss_pred cceecccceEEecCccccccccc---CCcceEEEEehhhceeee---------eccceEEEEeccchhhhhcCceeEEEE Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTT---AKKAPLIIGDLKEAIVLF---------KREDMELASTDVGGKAFTRNTLDLRAI 363 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~---~~~~~~~~Gd~~~~~~~~---------~~~~~~~~~~~~~~~~f~~~~~~~~~~ 363 (392) ..+.|.+ | +.++.++....+. ......+=|||+.....+ .-.+++.+..+.. ..|. ..+.+. T Consensus 238 ~~v~G~~-V-~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~-~~~~---d~i~~~ 311 (334) T protein:vir:80 238 AMLNGVR-V-VETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEK-KDFG---HYLDTF 311 (334) T ss_pred EEEeceE-E-EeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeech-hhHH---HHHHHH Confidence 4566654 3 3334443332111 122234455555432222 2222333333322 2232 233445 Q ss_pred EeeCcEEecccceEEEEecccCC Q lcl|Aclame:pro 364 QRDDVQMWDNEAAVYGEIDLSAP 386 (392) Q Consensus 364 ~r~~~~v~~~~af~~l~~~~~a~ 386 (392) +-+|.++.||+|.+.++++.+-| T Consensus 312 ~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 312 QSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred HHcCCceeccceEEEEEEeeecC Confidence 56799999999999999999988 No 133 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.51 E-value=1.6e-08 Score=63.28 Aligned_cols=284 Identities=10% Similarity=0.011 Sum_probs=139.5 Q ss_pred HhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcc-eeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 98 EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSG-SRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 98 ~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) .+....-.-...+......+||+.+..++++.+.+...+.++++-.......| ++.+++. +...+....+++..+- . T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~-g~~~~~d~~~~~~i~~-~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRI-SELGVEDKATDVPVGV-Q 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEecc-CcceeeeecCCCcccc-c Confidence 00000000000111222346899999999999998888888765443322223 3445543 3455666666665543 3 Q ss_pred ccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------------cc Q lcl|Aclame:pro 177 NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-----------------KQ 238 (392) Q Consensus 177 ~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-----------------~~ 238 (392) ..+-.++++...+. +.-+.|++.-..++..++.+.+.++...+++++.|..++......+ .. T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 44556666666443 4556677766666778899999999999999999987665321110 01 Q ss_pred chhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhccCCc-eeecccccCCcccceecccceEEecCcccccc Q lcl|Aclame:pro 239 AIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNYLDKLKDKDGK-YILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK 315 (392) Q Consensus 239 ~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~g~-~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~ 315 (392) ....++.++.+. ..++...-+. -.+|++|..+..|.+...-... +.-...+..|....++|.. |+++ +.++... T Consensus 159 ~~~~~~~i~~a~-~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~-V~~S-n~lp~~~ 235 (341) T protein:vir:94 159 QAFSFAVFLAAR-RLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVR-VIRT-SLIGNNS 235 (341) T ss_pred hhhhHHHHHHHH-HHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceE-EEEe-ccccccc Confidence 122455555543 4555554443 4678899999999653211111 1111123344455677754 4332 3332211 Q ss_pred cccCCcce---------------EEE----Eehhhce-eeeeccce-EEEEec------------cchhhhh--cCceeE Q lcl|Aclame:pro 316 GTTAKKAP---------------LII----GDLKEAI-VLFKREDM-ELASTD------------VGGKAFT--RNTLDL 360 (392) Q Consensus 316 ~~~~~~~~---------------~~~----Gd~~~~~-~~~~~~~~-~~~~~~------------~~~~~f~--~~~~~~ 360 (392) +....... ..+ |++.... .++.+..+ +++.-+ .....|. +-...+ T Consensus 236 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 315 (341) T protein:vir:94 236 ATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLM 315 (341) T ss_pred cccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhh Confidence 11100000 000 0111100 01111111 000000 0000111 111344 Q ss_pred EEEEeeCcEEecccceEEEEecccCCCC Q lcl|Aclame:pro 361 RAIQRDDVQMWDNEAAVYGEIDLSAPVE 388 (392) Q Consensus 361 ~~~~r~~~~v~~~~af~~l~~~~~a~~~ 388 (392) ++-+-+|.++.||++.+-|+ .++++. T Consensus 316 ~~~~~~G~~~lrp~~~v~~~--~~~~~~ 341 (341) T protein:vir:94 316 VGRQAYGARLYRPLHAVNIH--TTGDTV 341 (341) T ss_pred hhhhhhcccccCcceeEEEe--cCcCCC Confidence 56666899999999976444 444433 No 134 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.48 E-value=3.5e-08 Score=61.49 Aligned_cols=285 Identities=12% Similarity=0.045 Sum_probs=141.7 Q ss_pred HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC Q lcl|Aclame:pro 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 159 (392) +.. ...+.. ....++....+....-...| +.+..++.......+.++++.+++.+.+++ .+.+++ .+ T Consensus 1 ma~-~~~~~~---------~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~-s~~~~~-iG 67 (344) T protein:vir:10 1 MAN-MTGGQQ---------LGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGK-SAQFPV-LG 67 (344) T ss_pred Ccc-cccccc---------CCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccc-eEEEEe-ec Confidence 000 000000 00000000000000001223 788899999999999999999999988643 444444 35 Q ss_pred Ccccccccccccccccc-ccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc---- Q lcl|Aclame:pro 160 MIPFAEITEMGEIPETD-NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE---- 233 (392) Q Consensus 160 ~~~~~~~~E~~~~~~~~-~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~---- 233 (392) ...+.....|.+...+. .+.-.++++...++ +....|.+-=-.++.+++.+.+.++...++++..|+.++..+. T Consensus 68 ~~~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~ 147 (344) T protein:vir:10 68 RTQAAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCN 147 (344) T ss_pred eeEEEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 56667777776654321 23445555544332 1222222222223456789999999999999999987643210 Q ss_pred -------------cc-----cc----------cchhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhccC-C Q lcl|Aclame:pro 234 -------------KL-----TK----------QAIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNYLDKLKDKD-G 282 (392) Q Consensus 234 -------------~~-----~~----------~~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~-g 282 (392) .. +. .+...++.+.++. ..++...-|. -+.|++|..|..|.+-+.-+ + T Consensus 148 ~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~ 226 (344) T protein:vir:10 148 VESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKAR-AALTKNYVPSSDRVFYCDPDSYSAILAALMPNAA 226 (344) T ss_pred cccccccccccccccceeecccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCccCCEEEeChHHHHHHhhccccccc Confidence 00 00 0112344455543 3444444443 35678999999885433222 1 Q ss_pred ceeecccccCCcccceecccceEEecCcccccc-cc----cCC--------cceEEEEehhh---------ceeeeeccc Q lcl|Aclame:pro 283 KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK-GT----TAK--------KAPLIIGDLKE---------AIVLFKRED 340 (392) Q Consensus 283 ~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~-~~----~~~--------~~~~~~Gd~~~---------~~~~~~~~~ 340 (392) .+.-......|.-..+.|.+ |+. ++.++... +. .++ ....+..+|++ ++..+...+ T Consensus 227 ~~~~~~~~~~G~V~~v~G~~-V~~-Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~ 304 (344) T protein:vir:10 227 NYAALIDPEKGSIRNVMGFE-VVE-VPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRD 304 (344) T ss_pred ccccccceeeeEEEEEeceE-EEe-ccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhcc Confidence 22111122233344567654 433 33332110 00 000 00011223322 233344445 Q ss_pred eEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc Q lcl|Aclame:pro 341 MELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) Q Consensus 341 ~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~ 384 (392) ++++..+.. .+|- ..+++.+-+|.++.||++...++++.- T Consensus 305 ~~~e~~r~~-~~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 305 LALERARRA-NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred ceeecccch-hHHH---HHHHHHhhcccceecccceEEEEeecC Confidence 566665542 3443 356677778999999999988877755 No 135 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.46 E-value=6.3e-08 Score=60.04 Aligned_cols=285 Identities=9% Similarity=-0.001 Sum_probs=150.3 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) +. .-....|....+...+-...| +.+..++.......+.++++..+.++.+++ .+.+++. +...+ T Consensus 1 ms------------~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~-s~~~~~i-G~~~~ 65 (335) T protein:vir:63 1 MS------------FLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSN-VVRLDRL-GNVEA 65 (335) T ss_pred CC------------Ccccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccce-eEEEeee-eeeee Confidence 00 000011111122222223444 889999999999999999999999997643 4555543 55667 Q ss_pred cccccccccccccccceeeEEechhhee-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc----ccc---- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRA-GILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV----IEK---- 234 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~-~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~----~~~---- 234 (392) ....-|.+.-. +.+.-++..+...++- ....|-+-=--++.+++.+.+.+++.+++++..|+.++.. ... T Consensus 66 ~~~~pG~~l~~-~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~ 144 (335) T protein:vir:63 66 KGRRAGEELER-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPV 144 (335) T ss_pred ecccCCcCcCC-CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcc Confidence 76766665443 3345566666655542 2222222222234578899999999999999999876421 000 Q ss_pred --------c--------cccchhhHHHHHHHH---HHHhhhcccC-----CceEEEcHHHHHHHHHhhccCCc-eeec-- Q lcl|Aclame:pro 235 --------L--------TKQAIKSLDDIKDVL---NVKLDPAISP-----NAILLTNQDGFNYLDKLKDKDGK-YILQ-- 287 (392) Q Consensus 235 --------~--------~~~~~~~~d~~~~~~---~~~~~~~~~~-----~a~~v~~~~~~~~L~~lkd~~g~-~l~~-- 287 (392) . +......++.+..++ ...+....-+ +.+.+++|..|..|..-+.--++ |.-. T Consensus 145 ~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~ 224 (335) T protein:vir:63 145 DLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGA 224 (335) T ss_pred ccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccc Confidence 0 011112344443332 2344444433 35789999999999764322222 1110 Q ss_pred -ccccCCcccceecccceEEecCcccccccccC----CcceEEEEehhhce---------eeeeccceEEEEeccchhhh Q lcl|Aclame:pro 288 -SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA----KKAPLIIGDLKEAI---------VLFKREDMELASTDVGGKAF 353 (392) Q Consensus 288 -~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~----~~~~~~~Gd~~~~~---------~~~~~~~~~~~~~~~~~~~f 353 (392) .+...+....+.|.+ |+. ++ -+|..+..+ +....+=|||++.. ....-..++.++.++. ..| T Consensus 225 ~~~~~~g~v~~v~Gv~-V~~-sn-~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~-~~~ 300 (335) T protein:vir:63 225 TNDYVKSRVAILNGVK-VLE-TP-RFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDN-EKF 300 (335) T ss_pred cccccCceeEEeeceE-EEe-ec-cCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeecc-chh Confidence 112233344566654 332 33 334322111 12223344554322 2222223333333322 233 Q ss_pred hcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCC Q lcl|Aclame:pro 354 TRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQ 391 (392) Q Consensus 354 ~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~ 391 (392) . ..+.+.+-+|.++.||++.+.++++......-.+ T Consensus 301 ~---~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 301 S---WVLDTFQMYNIGARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred h---HHhHHHHHcCCcccccceEEEEEEcCCCceeecC Confidence 2 3445555589999999999999988766655555 No 136 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.46 E-value=6.1e-08 Score=60.15 Aligned_cols=269 Identities=8% Similarity=-0.048 Sum_probs=126.3 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhh---------hcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ---------YVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~---------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) |. ++.-+-.++|+.+...+.+...+.+.+++ +.....-++...++|+....+ ..+.-+.|+.+++.. T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~-Gd~~~~~~~~~i~~~- 76 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLT-GDPDNWTDSDDIDVN- 76 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCC-CcccccCCCcccchh- Confidence 22 22335667888887667666655555533 111111234444455443332 245566777665543 Q ss_pred ccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------------ccccc Q lcl|Aclame:pro 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE------------------KLTKQ 238 (392) Q Consensus 177 ~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~------------------~~~~~ 238 (392) ..+-.+-....+..+.-..++++...-+.-+....+.++++....+..++.++..+. ..... T Consensus 77 kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~ 156 (351) T protein:vir:15 77 NLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSE 156 (351) T ss_pred eecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccccccc Confidence 344333333444455555566654443334556667777777777766665544321 11122 Q ss_pred chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccccccc Q lcl|Aclame:pro 239 AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT 318 (392) Q Consensus 239 ~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~ 318 (392) ...+++.+.+++...-+.....-++|+||+.++..|++..--+- +++.-....-.+++| .+|++. + .+|..+.. T Consensus 157 ~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~---~~~s~~~~~i~t~~G-~~Vivd-D-~~p~~~~~ 230 (351) T protein:vir:15 157 PMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIET---IQPQNGATPFEAYNG-LRIVLD-D-DIEIDLTD 230 (351) T ss_pred cccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhh---ccccccCcccceecc-eEEEEc-C-CCccccCC Confidence 33456788888765445444345789999999999986541110 111111122345666 455443 3 34443322 Q ss_pred CCc---ceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 319 AKK---APLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 319 ~~~---~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ... ..++||. .++...++ ...++..+.... ..++-.+....| -+++|.++..-+-+.+....+|.- T Consensus 231 ~~~~~ytsyl~~~--GAi~~~~~-~~~ve~~rd~~~--~~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt~ 299 (351) T protein:vir:15 231 KTKPVSTSYIFAP--GAVRYSTN-MRSTETKYDPLI--NGGQDVIVQKRV---GTIHVAGTSIKASFSPSKASFPTI 299 (351) T ss_pred CCCceeEEEEEec--ceeeeecC-CcCcceeecccC--CCCceEEEEeee---eeeeeeeeeecccccccCcCCcCh Confidence 222 2455543 12222222 222333332211 123333333333 335666665432111111222222 No 137 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.44 E-value=7.1e-08 Score=59.76 Aligned_cols=285 Identities=8% Similarity=-0.013 Sum_probs=148.2 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) +. .....++-...++..+-...| +.+..++.......+.++++..+.++.+++ .+.+++ .+...+ T Consensus 1 ms------------~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~-s~~~~~-iG~~~~ 65 (335) T protein:vir:78 1 MS------------FLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSN-VVRLDR-LGNVEA 65 (335) T ss_pred CC------------ccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccce-eEEEee-eeeeee Confidence 00 000001111112222223344 888999999999999999999999997643 455554 355566 Q ss_pred cccccccccccccccceeeEEechhhee-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc----ccc---- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRA-GILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV----IEK---- 234 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~-~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~----~~~---- 234 (392) .+..-|.+.- .+.+.-++..+....+- ....|-+-=--++++++.+.+.+++.+++++..|+.++.. ... T Consensus 66 ~~~~pG~~l~-~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~ 144 (335) T protein:vir:78 66 KGRRAGEELE-RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPV 144 (335) T ss_pred cccccCcccC-CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 6666665543 33345566666555542 2222222222234578899999999999999999876421 000 Q ss_pred --------c--------cccchhhHHHHHHHHH---HHhhhcccC-----CceEEEcHHHHHHHHHhhccCCceee-c-- Q lcl|Aclame:pro 235 --------L--------TKQAIKSLDDIKDVLN---VKLDPAISP-----NAILLTNQDGFNYLDKLKDKDGKYIL-Q-- 287 (392) Q Consensus 235 --------~--------~~~~~~~~d~~~~~~~---~~~~~~~~~-----~a~~v~~~~~~~~L~~lkd~~g~~l~-~-- 287 (392) + +......++.+.+++. ..+.....+ +-+.+++|..|..|..-+.--.+.+- . T Consensus 145 ~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~ 224 (335) T protein:vir:78 145 DLEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGA 224 (335) T ss_pred ccCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccc Confidence 0 0111123344444332 223333333 35789999999999764322222111 0 Q ss_pred -ccccCCcccceecccceEEecCcccccccccC----CcceEEEEehhh---------ceeeeeccceEEEEeccchhhh Q lcl|Aclame:pro 288 -SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA----KKAPLIIGDLKE---------AIVLFKREDMELASTDVGGKAF 353 (392) Q Consensus 288 -~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~----~~~~~~~Gd~~~---------~~~~~~~~~~~~~~~~~~~~~f 353 (392) .+...+....++|.+ |+. ++ -+|..+..+ +....+=+||+. ++....-.+++.++.++. ..| T Consensus 225 ~~~~~~g~v~~v~Gv~-V~~-Sn-~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~-~~~ 300 (335) T protein:vir:78 225 TNDYVKSRVAILNGVK-VLE-TP-RFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDH-DQF 300 (335) T ss_pred ccccccceeEEeeceE-EEe-ec-cCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeecc-chh Confidence 112233344566654 322 33 334322110 111112223332 333333333444443332 233 Q ss_pred hcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCC Q lcl|Aclame:pro 354 TRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQ 391 (392) Q Consensus 354 ~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~ 391 (392) . ..+.+.+-+|.++.||++.+.++++......-.+ T Consensus 301 ~---~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 301 S---WVLDTFQMYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred h---HhhhHHHHcCCcccCcceEEEEEecCCCcccccC Confidence 2 3445555689999999999999988876665555 No 138 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.43 E-value=3.5e-08 Score=61.46 Aligned_cols=281 Identities=10% Similarity=0.022 Sum_probs=141.0 Q ss_pred HHHhhhhhhhhccccccccc---eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccc Q lcl|Aclame:pro 96 FLEDDLEQRAMSGLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEI 172 (392) Q Consensus 96 ~~~~~~~~~a~~~~~~~~gg---~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 172 (392) -.......-+...+.+...| .+--+.+.++++......+.++++.++.++.+++ .+.+++ .+...+.....|+.. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~-sv~i~~-iG~~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGK-SAQFPV-MGRTSGVYLAPGERL 78 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccc-eEEEec-ccceeeeeecCCCCc Confidence 00000000001111111111 1122688889999888889999999999887532 444444 455666667766654 Q ss_pred ccc-cccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------- Q lcl|Aclame:pro 173 PET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE----------------- 233 (392) Q Consensus 173 ~~~-~~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~----------------- 233 (392) ..+ ...+-.++++...++ +....|-+-=-.++.+++.+-+.++...++++..|..++.-.. T Consensus 79 ~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~ 158 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLG 158 (347) T ss_pred CCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 221 112334534443332 1111222111112345788889999999999999987653110 Q ss_pred ccc-----cc---------chhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhccCC-ceeecccccCCccc Q lcl|Aclame:pro 234 KLT-----KQ---------AIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNYLDKLKDKDG-KYILQSDPTQKNKK 296 (392) Q Consensus 234 ~~~-----~~---------~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~g-~~l~~~~~~~~~~~ 296 (392) ... .. ....++.+.++. ..++...-+. -+.|++|..|..|..-++-+. .+.-......|.-. T Consensus 159 ~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 237 (347) T protein:vir:94 159 TASVLEVGKKADLDTPAKLGEAIIGQLTIAR-AKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIR 237 (347) T ss_pred ccceeeccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceE Confidence 000 00 011234444433 3455555443 467999999988754332222 22222233345456 Q ss_pred ceecccceEEecCcccccccc----------cCCcceE--------EEEehhhc---------eeeeeccceEEEEeccc Q lcl|Aclame:pro 297 LFAGTNPVVVVSNRFLKSKGT----------TAKKAPL--------IIGDLKEA---------IVLFKREDMELASTDVG 349 (392) Q Consensus 297 ~~~g~~pv~~~~~~~~~~~~~----------~~~~~~~--------~~Gd~~~~---------~~~~~~~~~~~~~~~~~ 349 (392) .++|.+ |+. ++.++....+ .++.... +-|||++. +..+...+++++..+.. T Consensus 238 ~i~G~~-V~~-Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~ 315 (347) T protein:vir:94 238 NVMGFV-VVE-VPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDV 315 (347) T ss_pred EEeceE-EEe-cCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhch Confidence 678864 333 4444321111 1111111 22233322 22233334455554432 Q ss_pred hhhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 350 GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 350 ~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) .+|. ..+++.+.+|.++.||++.+.++++.+- T Consensus 316 -~~~~---d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 316 -DAQG---DLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred -hhHH---HHhhhhhhhcCcccccceeEEEEecCCC Confidence 3333 3678888899999999999999877444 No 139 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.42 E-value=1.1e-07 Score=58.73 Aligned_cols=281 Identities=12% Similarity=0.041 Sum_probs=141.7 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccccccc----eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecC Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGG----LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg----~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 159 (392) +-+. ....+.+...+.+...| ..| +.+..++.......+.++++.+...+.++. .+.++ ..+ T Consensus 1 ~~~~-----------~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~-sv~i~-~iG 66 (347) T protein:vir:33 1 MANI-----------QGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGK-SAQFP-VIG 66 (347) T ss_pred CCCC-----------ccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccc-eeEee-ecc Confidence 0000 00000000111111111 234 788889999999999999999988776532 33343 344 Q ss_pred Cccccccccccccccc-cccceeeEEechh--heeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc--- Q lcl|Aclame:pro 160 MIPFAEITEMGEIPET-DNPKFSNVQYAVK--DRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE--- 233 (392) Q Consensus 160 ~~~~~~~~E~~~~~~~-~~~~~~~v~~~~~--~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~--- 233 (392) ...+.....+...... ..+...+.++... ++.. ..|.+-=-.++..++.+.+.++...++++..|..++..+. T Consensus 67 ~~t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~-~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:33 67 RTKAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred ceeeeeecCCCCCCCCCCCCccceEEEEechhhhhh-HHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 5566666666554311 1234455445433 3222 1222221222456788889999999999999987752110 Q ss_pred ------------------cc----ccc--------chhhHHHHHHHHHHHhhhcccC--CceEEEcHHHHHHHHHhhcc- Q lcl|Aclame:pro 234 ------------------KL----TKQ--------AIKSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDK- 280 (392) Q Consensus 234 ------------------~~----~~~--------~~~~~d~~~~~~~~~~~~~~~~--~a~~v~~~~~~~~L~~lkd~- 280 (392) .. ++. +...|+.+.++. ..++...-| +-+.|++|..|..|.+...- T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~ 224 (347) T protein:vir:33 146 NLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIAR-ASLTKNYVPAADRTFYTTPDNYSAILAALMPN 224 (347) T ss_pred hhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCccCcEEEeCHHHHHHHhcccccc Confidence 00 000 112245555543 344444444 34679999999998654322 Q ss_pred CCceeecccccCCcccceecccceEEecCcccccccc------cCCcceE--------EEEehh---------hceeeee Q lcl|Aclame:pro 281 DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT------TAKKAPL--------IIGDLK---------EAIVLFK 337 (392) Q Consensus 281 ~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~------~~~~~~~--------~~Gd~~---------~~~~~~~ 337 (392) +..|.-......|.-..++|.+ |+. ++.++..... .++.... +-++|+ .++..+. T Consensus 225 ~~d~~~~~~~~~G~V~~i~G~~-V~~-Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~ 302 (347) T protein:vir:33 225 AANYQALLDPERGTIRNVMGFE-VVE-VPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVK 302 (347) T ss_pred ccccccccccccceeEEEecee-EEE-ecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeee Confidence 2223222223344445677754 444 3333221110 0111101 112221 1222233 Q ss_pred ccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCC Q lcl|Aclame:pro 338 REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) Q Consensus 338 ~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~ 386 (392) ..+++++..+.. .+|- -.+++.+.+|.+++||++.+.++++.... T Consensus 303 ~~~~~~e~~r~~-~~~~---d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 303 LKDLALERARRA-NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeceeeeeccch-hhhh---HhhhhhhhcCCceecccceEEEecCCCCC Confidence 334455555543 2332 35677777899999999999998876555 No 140 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.40 E-value=2e-07 Score=57.36 Aligned_cols=282 Identities=11% Similarity=0.028 Sum_probs=138.3 Q ss_pred HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccce---ecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEe Q lcl|Aclame:pro 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGL---VIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK 156 (392) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~---~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 156 (392) +....- + .......+.+...+. +--+.+..++....+..+.++++.++..+.++. .+.+++ T Consensus 1 ma~~~~-~--------------~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~-sv~i~~ 64 (347) T protein:vir:15 1 MANIQG-G--------------QQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGK-SAQFPV 64 (347) T ss_pred CCcccc-C--------------CccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccc-eeEeee Confidence 000000 0 000000000001110 112567888889999999999999988777533 344443 Q ss_pred ecCCccccccccccccccc-cccceeeEEechh--heeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 157 NSDMIPFAEITEMGEIPET-DNPKFSNVQYAVK--DRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE 233 (392) Q Consensus 157 ~~~~~~~~~~~E~~~~~~~-~~~~~~~v~~~~~--~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~ 233 (392) .+...+.....+...... ...+..++++... ++.. ..|.+-=-.++..++.+.+.++...++++..|+.++..+. T Consensus 65 -ig~~t~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~-~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~ 142 (347) T protein:vir:15 65 -IGRTKAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELA 142 (347) T ss_pred -ccceeeeeeccCCCCCCCCCCCccceEEEEechhhhhh-HHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444566666666544311 1234455444433 3322 1222222223456788889999999999999987763211 Q ss_pred -----------------cc------cccc------hhhHHHHHHHH---HHHhhhcccC--CceEEEcHHHHHHHHHhhc Q lcl|Aclame:pro 234 -----------------KL------TKQA------IKSLDDIKDVL---NVKLDPAISP--NAILLTNQDGFNYLDKLKD 279 (392) Q Consensus 234 -----------------~~------~~~~------~~~~d~~~~~~---~~~~~~~~~~--~a~~v~~~~~~~~L~~lkd 279 (392) .. ...+ ...++.+.+++ ...++...-+ +-+.|++|..|..|.+-.+ T Consensus 143 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~ 222 (347) T protein:vir:15 143 GLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALM 222 (347) T ss_pred HHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccc Confidence 00 0000 11123333332 2334444443 2356789999999865433 Q ss_pred cCC-ceeecccccCCcccceecccceEEecCcccccccc------cCCcc------------------eEEEEehhhcee Q lcl|Aclame:pro 280 KDG-KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT------TAKKA------------------PLIIGDLKEAIV 334 (392) Q Consensus 280 ~~g-~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~------~~~~~------------------~~~~Gd~~~~~~ 334 (392) -.. .|.-......|.-..++|.. |+. ++.++....+ .++.. +.++.. +.++. T Consensus 223 ~~~~d~~~~~~~~~G~Vg~i~G~~-V~~-Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h-~~A~g 299 (347) T protein:vir:15 223 PNAANYQALIDHERGTIRNVMGFE-VVE-VPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQH-RSAVG 299 (347) T ss_pred cccccccccccccceEEEEEeceE-EEe-cccccccccccccccccccccccccccccceeeeccccceeeeec-cceee Confidence 222 12111122334445677754 433 3434321110 01111 111111 12232 Q ss_pred eeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCC Q lcl|Aclame:pro 335 LFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) Q Consensus 335 ~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~ 386 (392) .+.-.+++++..+.. .+|- -.+++.+.+|.+++||++.+.++++.... T Consensus 300 ~v~~~~~~~e~~~~~-~~~~---d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 300 TVKLKDLALERARRA-NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeEeeceeeeecccc-hhhh---hhhehhhhcCCceeccccEEEEecCCCCC Confidence 333344556665543 2222 46677778899999999999998876555 No 141 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.38 E-value=2.6e-07 Score=56.72 Aligned_cols=284 Identities=13% Similarity=-0.009 Sum_probs=142.9 Q ss_pred HHHhhhhh-hhhccccccc-cc-----eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccc Q lcl|Aclame:pro 96 FLEDDLEQ-RAMSGLTGED-GG-----LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITE 168 (392) Q Consensus 96 ~~~~~~~~-~a~~~~~~~~-gg-----~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E 168 (392) .......+ .+.+.++... || .+--+.+..++.......+.++++.+++++.+++ .+.+++ .+...+....- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gk-sv~f~~-iG~~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGK-SLQFIY-TGRMTSSFHTP 78 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCc-eEEEEe-eeeeEEeeecC Confidence 00000000 0111111111 11 1223678889999999999999999999888543 444444 34555665555 Q ss_pred cccccc--ccccceee--EEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------- Q lcl|Aclame:pro 169 MGEIPE--TDNPKFSN--VQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE----------- 233 (392) Q Consensus 169 ~~~~~~--~~~~~~~~--v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~----------- 233 (392) |.+.-. ...+...+ +++.-.++.. ..|.+-=-.++.+++.+.+.++...++++..|+.++..+- T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~y~~-~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~ 157 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLLISS-AFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSA 157 (375) T ss_pred CcCcCCccccCCCCCceEEEecchhhhh-hhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 544321 11122233 4444333332 2222222223456889999999999999999987653210 Q ss_pred --------------ccc-----ccchhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhccCCceeecccc-- Q lcl|Aclame:pro 234 --------------KLT-----KQAIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNYLDKLKDKDGKYILQSDP-- 290 (392) Q Consensus 234 --------------~~~-----~~~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~g~~l~~~~~-- 290 (392) +.+ ..+...|+.+.++. ..++...-+. -++|++|..|..|.+-+|.+ .+...+. T Consensus 158 ~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~-~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~--~~~n~d~~~ 234 (375) T protein:vir:10 158 TNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAA-AAMDEKGVSSQGRCAVLNPRQYYALIQDIGSN--GLVNRDVQG 234 (375) T ss_pred ccccccCcceeeeccccccccccCHHHHHHHHHHHH-HHHhhcCCCCCCCEEEeChHHHHHHHhcCCcc--ceeeecccc Confidence 000 11233456666654 4555555443 46799999999986655433 1221111 Q ss_pred ----cCCcccceecccceEEecCccccccc-----------------------------ccCCcceEEEEeh-------- Q lcl|Aclame:pro 291 ----TQKNKKLFAGTNPVVVVSNRFLKSKG-----------------------------TTAKKAPLIIGDL-------- 329 (392) Q Consensus 291 ----~~~~~~~~~g~~pv~~~~~~~~~~~~-----------------------------~~~~~~~~~~Gd~-------- 329 (392) ..+.-..+.|.+ |+. ++.++...+ ..++...-+-+|| T Consensus 235 ~~~~~~g~v~~i~Gv~-V~~-Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~ 312 (375) T protein:vir:10 235 SALQSGNGVIEIAGIH-IYK-SMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCG 312 (375) T ss_pred cceeccceEEEEeceE-EEE-eccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEE Confidence 111123455643 332 333332111 0111111223333 Q ss_pred ----hhceeeeeccceEEEEecc-chhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCC Q lcl|Aclame:pro 330 ----KEAIVLFKREDMELASTDV-GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 330 ----~~~~~~~~~~~~~~~~~~~-~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~ 389 (392) +.++..+.-.+++++++.. ....++ ...+.+.+-+|..+.||++.+.|+..+++|++= T Consensus 313 ~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q--~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~~~ 375 (375) T protein:vir:10 313 LIFQKEAAGVVEAIGPQVQVTNGDVSVIYQ--GDVILGRMAMGADYLNPAAAVELYIGATAPSAF 375 (375) T ss_pred EEEchhheeeeeeeccccccccchhhheee--eeeeeeeeeeccCccCceeEEEEecCcCccccC Confidence 1233344445556665421 111222 345567777899999999998887665555444 No 142 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.33 E-value=9.7e-08 Score=59.04 Aligned_cols=215 Identities=13% Similarity=0.031 Sum_probs=133.1 Q ss_pred hhhhhhccccccc-cceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccc Q lcl|Aclame:pro 101 LEQRAMSGLTGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) Q Consensus 101 ~~~~a~~~~~~~~-gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 179 (392) -..-++...|-.. +..+-|......|++.+.+.++|+..+.......+.+ +.+.+..+-+.++|..=+...+++ ..+ T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~-~~~~v~~~LP~~~fR~lN~g~~~s-~~t 78 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTG-HRTTIRSGLPSATWRLLNYGVQPS-KST 78 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCc-ceeeEeeccCCceeeecCCccCcc-cce Confidence 0001111111112 2334566677789999999999999999887754433 445566777888999888888865 689 Q ss_pred eeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------------------- Q lcl|Aclame:pro 180 FSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLT--------------------- 236 (392) Q Consensus 180 ~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~--------------------- 236 (392) +.+++...+-+.+.+.|.+.+.+... -++...-.+...+++++.....+++|..+.. T Consensus 79 t~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~q 158 (328) T protein:vir:95 79 TVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQN 158 (328) T ss_pred eEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccc Confidence 99999999999999999998887642 2334444555777888888887776521100 Q ss_pred ------cc------------------------------------------------------------------------ Q lcl|Aclame:pro 237 ------KQ------------------------------------------------------------------------ 238 (392) Q Consensus 237 ------~~------------------------------------------------------------------------ 238 (392) ++ T Consensus 159 iidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~N 238 (328) T protein:vir:95 159 IIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIAN 238 (328) T ss_pred eeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 00 Q ss_pred -------chhhHHHHHHHHHH---HhhhcccCCceEEEcHHHHHHHHHhh-ccCCceeecccccCCcccceecccceEEe Q lcl|Aclame:pro 239 -------AIKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDKLK-DKDGKYILQSDPTQKNKKLFAGTNPVVVV 307 (392) Q Consensus 239 -------~~~~~d~~~~~~~~---~~~~~~~~~a~~v~~~~~~~~L~~lk-d~~g~~l~~~~~~~~~~~~~~g~~pv~~~ 307 (392) ......+++++|.. .++.....+.+|+||.+....|++.. +..+-.+ .+.-..+...+.|++.||..+ T Consensus 239 Id~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~-~~~~~~g~~~t~~~gipir~~ 317 (328) T protein:vir:95 239 IDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAI-SVKETEGEWWTSFRGVPIRET 317 (328) T ss_pred CcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceee-eeeccCCcceeEECCeEEEEE Confidence 00012234444432 34566677789999999999998754 4444343 333344445566777788765 Q ss_pred cCcccccccccCCcceEE Q lcl|Aclame:pro 308 SNRFLKSKGTTAKKAPLI 325 (392) Q Consensus 308 ~~~~~~~~~~~~~~~~~~ 325 (392) +. +.. .+..++ T Consensus 318 da-i~~------tE~~vv 328 (328) T protein:vir:95 318 DA-LLE------TEARVV 328 (328) T ss_pred ee-eec------CccccC Confidence 42 111 111111 No 143 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.31 E-value=5.7e-07 Score=54.81 Aligned_cols=266 Identities=8% Similarity=0.010 Sum_probs=133.7 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhh---------hcceee--ccCCcceeEEEeecCCcccccccccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ---------YVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPE 174 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~---------l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~ 174 (392) |. ++.-+-.++|+.+...+.....+.+.|++ +..... .++...++|+....+ ..+.-+.|+++.+. T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~-Gd~~~v~~~~~i~~ 77 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLD-GDSQVLNDTDDLVP 77 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCC-CcccccCCCcccch Confidence 22 23335667888887777766666665533 111111 233333444433222 34556677776654 Q ss_pred ccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------ccccc Q lcl|Aclame:pro 175 TDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK---------------LTKQA 239 (392) Q Consensus 175 ~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~---------------~~~~~ 239 (392) +..+-.+.....+..+.-..++++...-+.-+....+.++++....+..+..++..+.. ..... T Consensus 78 -~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~ 156 (324) T protein:vir:59 78 -QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADG 156 (324) T ss_pred -hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccc Confidence 34554544444445555556666554434445666788888887777777655443211 11122 Q ss_pred hhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccC Q lcl|Aclame:pro 240 IKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA 319 (392) Q Consensus 240 ~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~ 319 (392) ..+++.+.+++.+ +-.....-++|+||+.++..|++..-.+ ++ .+.-....-.+++| .+|++. +..+....... T Consensus 157 ~~s~~~l~~A~~~-~GD~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~~~~G-~~Vivd-D~~p~~~~~~~ 230 (324) T protein:vir:59 157 IYSAETFVDASYK-LGDHESLLTAIGMHSATMASAVKQDLIE--FV-KDSQSGIRFPTYMN-KRVIVD-DSMPVETLEDG 230 (324) T ss_pred eecHHHHHHHHHH-hCCcccCcEEEEEchHHHHHHHHhhhhh--hc-cccccCceeeeecc-cEEEEe-CCCCccccCCC Confidence 2456788888765 3334456678999999999998763221 11 11111222345666 455543 32222222111 Q ss_pred C--cceEEEEehhhceeeee-ccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 320 K--KAPLIIGDLKEAIVLFK-REDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 320 ~--~~~~~~Gd~~~~~~~~~-~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) + -..++||. .++.... +..+.++.++.. ..++..+....++. ++|.++..-.-+. ...+|-- T Consensus 231 ~~~y~s~l~~~--GAi~~~~~~~~v~vE~dRd~----~~g~~~l~~r~~~~---~~p~G~s~~~~~~--~~~sPt~ 295 (324) T protein:vir:59 231 TKVFTSYLFGA--GALGYAEGQPEVPTETARNA----LGSQDILINRKHFV---LHPRGVKFTENAM--AGTTPTD 295 (324) T ss_pred CceEEEEEEec--CeEEEeecCCCcceecccCc----cccceEEEEeeEEE---eEeeeEEeccccc--CCCCCCh Confidence 1 13455653 2333333 334555555432 34566666766654 4555554332111 1112211 No 144 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.21 E-value=2.8e-07 Score=56.53 Aligned_cols=280 Identities=13% Similarity=0.008 Sum_probs=135.4 Q ss_pred HHHHhcchhhHHHHHHHHhhhhhhhhcccccccc----ceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEe Q lcl|Aclame:pro 81 MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG----GLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEK 156 (392) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g----g~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 156 (392) ...+.+.. .....|. .....++ ..+| +.+..+++......+.++++.+...+.++. ++.+++ T Consensus 1 ~~~~~~~~----------~~~~~~~--~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~-tv~i~~ 66 (332) T protein:vir:78 1 MTTLSNFS----------LPNQANG--GARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGGK-SKQFMF 66 (332) T ss_pred Cccccccc----------CCccccC--Cccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccccccccc-eEEEEe Confidence 00000000 0000000 0111112 1333 788899999999999999999988887533 444444 Q ss_pred ecCCccccccccccccccccccceeeEEechh--heeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 157 NSDMIPFAEITEMGEIPETDNPKFSNVQYAVK--DRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK 234 (392) Q Consensus 157 ~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~--~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~ 234 (392) . +...+.....|........+.-.++++... ++..+ .|.+-=-.++..++.+.+.++...++++..|..++..... T Consensus 67 i-g~~~~~~~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~-~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~ 144 (332) T protein:vir:78 67 T-GKLSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQ-FVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAK 144 (332) T ss_pred c-cceeEeeecCCCCCCCCCCCCCceEEEEEehhhhhHH-HHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 455556555555432211233344444433 33322 2222111224467889999999999999999876542110 Q ss_pred ----------------------ccccchhhHHHHHHHHHHHhhhcccCCc--eEEEcHHHHHHHHHhhccC--Cceeecc Q lcl|Aclame:pro 235 ----------------------LTKQAIKSLDDIKDVLNVKLDPAISPNA--ILLTNQDGFNYLDKLKDKD--GKYILQS 288 (392) Q Consensus 235 ----------------------~~~~~~~~~d~~~~~~~~~~~~~~~~~a--~~v~~~~~~~~L~~lkd~~--g~~l~~~ 288 (392) ........|+.++++.. .++...-|.. .+|++|..|..|.+.+|.. .++.... T Consensus 145 aa~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~-~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~ 223 (332) T protein:vir:78 145 ASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNS 223 (332) T ss_pred hhcccCcccccccccccccCCccccCHHHHHHHHHHHHH-HHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeecccc Confidence 01112234566666543 4555554433 4577999999986643321 1111010 Q ss_pred --cccCCc-ccceecccceEEecCcccccccc------cCCcceEEEEehhhc---------eeeeeccceEEEEec--c Q lcl|Aclame:pro 289 --DPTQKN-KKLFAGTNPVVVVSNRFLKSKGT------TAKKAPLIIGDLKEA---------IVLFKREDMELASTD--V 348 (392) Q Consensus 289 --~~~~~~-~~~~~g~~pv~~~~~~~~~~~~~------~~~~~~~~~Gd~~~~---------~~~~~~~~~~~~~~~--~ 348 (392) ...++. ...+.|.+ | +.++.++...+. .++....+-|+|++. +......+++++... . T Consensus 224 ~~~~~~g~~i~~i~G~~-V-~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~ 301 (332) T protein:vir:78 224 QGDMNSGKGLYSIAGIR-I-LKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF 301 (332) T ss_pred ccceecceeeeEEeeeE-E-EecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhccc Confidence 111121 23456654 3 333434322111 111111234444442 222222223333211 1 Q ss_pred chhhhhcCceeEEEEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 349 GGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 349 ~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~ 382 (392) ..++|- -.+++.+.+|.+++||++.+.|+-. T Consensus 302 ~~~~~~---d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 302 NVQYQG---DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred chhhhH---hhhhhhhhhcCceecccceEEEeeC Confidence 122332 3567777899999999999877733 No 145 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.21 E-value=5e-07 Score=55.15 Aligned_cols=282 Identities=10% Similarity=-0.000 Sum_probs=139.5 Q ss_pred HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcc-eeEEEeec Q lcl|Aclame:pro 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSG-SRVLEKNS 158 (392) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~-~~~~~~~~ 158 (392) +. .+.+. .-...... ..+....++|+.+..++++.+.+.+.+.++++.....+..+ ++.+++ . T Consensus 1 ~~-~~~~~-------------~~~~~~~~-~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~-~ 64 (381) T protein:vir:80 1 MA-TIQGT-------------GGYKGSAV-DLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPN-I 64 (381) T ss_pred Cc-eeccc-------------ccccCccc-chhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeec-c Confidence 00 00000 00000000 11112356899999999999998888888776644433223 344444 3 Q ss_pred CCccccccccccccccccccceeeEEechhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-- Q lcl|Aclame:pro 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL-- 235 (392) Q Consensus 159 ~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~-- 235 (392) +.+.+....+++...- ...+..++++...+. +.-+.|++.-...+..++.+.+.+.+..++++..|+.++...... T Consensus 65 g~~~a~d~~~g~~i~~-~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~ 143 (381) T protein:vir:80 65 SRAAVYDKQPQTPVNL-QARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINA 143 (381) T ss_pred CcceeeeecCCCcccc-cccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 3556777887766543 345566666666443 344677776666677889999999999999999998876432100 Q ss_pred -----------------------cccchhhHHHHHHHHHHHhhhcccC--CceEEEcHHHHHHHHHhhcc-CCceeeccc Q lcl|Aclame:pro 236 -----------------------TKQAIKSLDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKDK-DGKYILQSD 289 (392) Q Consensus 236 -----------------------~~~~~~~~d~~~~~~~~~~~~~~~~--~a~~v~~~~~~~~L~~lkd~-~g~~l~~~~ 289 (392) ......+++.++++. ..++...-| +-++|++|..+..|.+...- +-.+.-... T Consensus 144 ~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~-~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~ 222 (381) T protein:vir:80 144 FPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAK-QKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKP 222 (381) T ss_pred ccccccccccccccccccccccccchhhHHHHHHHHHH-HHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchh Confidence 011223566677655 345555544 34789999999998654211 112222233 Q ss_pred ccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcE Q lcl|Aclame:pro 290 PTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) Q Consensus 290 ~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~ 369 (392) +..|....++|.. |+. ++ .+|..+. ....+.+|-.... ...+ .-..+ ...|..+...++.-..+|.. T Consensus 223 l~~G~Ig~i~G~~-Vv~-Sn-~lp~~~~--t~~~~~agap~~~-----~~~~--~~~~~-~g~~s~~a~av~~~k~yd~~ 289 (381) T protein:vir:80 223 VTSGVVGTILGME-VIV-TT-QIGINSL--TGYVNGQGAPTQP-----TPGV--LGSPY-LPDQAGTANVVNTGSASDLA 289 (381) T ss_pred hhceeeeEEcceE-EEe-ec-ccccccc--cceeeeccccccc-----cccc--ccccc-ccccccceeeeeeeeeecee Confidence 4455566777754 433 23 3343211 1112222221110 0000 11111 12233344455555556665 Q ss_pred Eec-ccceEEEEeccc--CCCCCCCC Q lcl|Aclame:pro 370 MWD-NEAAVYGEIDLS--APVEQPQG 392 (392) Q Consensus 370 v~~-~~af~~l~~~~~--a~~~~~~~ 392 (392) +.. ...+.......+ +.-.+..| T Consensus 290 ~~~~~~~~~~~~g~~~~~~~~~~~~~ 315 (381) T protein:vir:80 290 VSLSYFGLPVFSGAGATAADGGQTLG 315 (381) T ss_pred eeeeeccceeeecceeeecCCCceee Confidence 532 222221111100 11111111 No 146 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.18 E-value=5.3e-07 Score=54.98 Aligned_cols=241 Identities=12% Similarity=0.050 Sum_probs=118.6 Q ss_pred hcceeeccCCcceeEEEeecCCccccccccccccccc-cccceee--EEechhheeeehhhHHHHHhhhHHHHHHHHHHH Q lcl|Aclame:pro 139 YVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPET-DNPKFSN--VQYAVKDRAGILPLSRSLLQDSDQNILKYVTKW 215 (392) Q Consensus 139 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~-~~~~~~~--v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~ 215 (392) |++. +.++ ..+.+++. +...+....-|.+.... ..+.-.+ +++.-.++..+. |.+-=-.++++++.+...++ T Consensus 1 ~vr~--i~~g-~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~-VdDiD~~qa~~Dlr~e~s~~ 75 (324) T protein:vir:99 1 MTRT--ITSG-KSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVL-IYDIEDAMNHYDVRSEYSTQ 75 (324) T ss_pred Ceee--eecC-ceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhh-hhhHHHHhcCccchhHHHHH Confidence 3333 4432 23444443 45556666655543210 1123334 444444443322 22111122457899999999 Q ss_pred HHHHHHHHHHHHHhhcc------------------cc-------ccc-----cchhhHHHHHHHHHHHhhhcccCC--ce Q lcl|Aclame:pro 216 LGKKSKVTRNVLILGVI------------------EK-------LTK-----QAIKSLDDIKDVLNVKLDPAISPN--AI 263 (392) Q Consensus 216 l~~~~~~~~d~~~~~~~------------------~~-------~~~-----~~~~~~d~~~~~~~~~~~~~~~~~--a~ 263 (392) ...++++..|+.++.-. +. .++ .....++.+.++. ..++...-+. -+ T Consensus 76 ~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~-~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 76 MGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYAR-AAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHH-HHHhhcCCCCCCCE Confidence 99999999997764221 00 000 0112345555543 3445444443 36 Q ss_pred EEEcHHHHHHHHHhhcc-CCceeecccccCCcccceecccceEEecCccccccccc------------------------ Q lcl|Aclame:pro 264 LLTNQDGFNYLDKLKDK-DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT------------------------ 318 (392) Q Consensus 264 ~v~~~~~~~~L~~lkd~-~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~------------------------ 318 (392) .|++|..+..|..-+.. ++.+.-...+..+.-..++|.+ |+ .++.++...+.. T Consensus 155 ~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~-V~-~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky 232 (324) T protein:vir:99 155 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFE-VV-ETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKM 232 (324) T ss_pred EEeChHHHHHHhhcccccccccccccceecceEEEEeceE-EE-ecCCcccccccccccccccccccccccccccccccc Confidence 79999999877543222 2333333334455556677764 33 334343211110 Q ss_pred ---CCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEeccc-CCCCCCCC Q lcl|Aclame:pro 319 ---AKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS-APVEQPQG 392 (392) Q Consensus 319 ---~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~-a~~~~~~~ 392 (392) ......++.-. +++..+.-..++++..+.. .+|- ..+++.+-+|.++.||++.+.+++++. +|..+|.- T Consensus 233 ~~d~~~~~gl~~~~-~a~~tv~~~~~~~e~~~~~-~~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~ 305 (324) T protein:vir:99 233 TVGADNVVGLFVHR-SAVATLKLKDMALERARRP-EYQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDV 305 (324) T ss_pred ccccCceeEEEEeh-hheEEEeeecceecceech-hhHH---HhhhhhhhhcCcccccceEEEEEEccCccccccchh Confidence 01111122222 2233333344455555543 2333 456677778999999999988887764 34333322 No 147 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.15 E-value=9.4e-07 Score=53.62 Aligned_cols=283 Identities=8% Similarity=-0.018 Sum_probs=140.0 Q ss_pred chhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecc---hhhhhHHHHhHHhhhhhhhhcceee-cc Q lcl|Aclame:pro 71 DGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIP---QDIQTQINELARSFDALEQYVTVEP-VR 146 (392) Q Consensus 71 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP---~~~~~~ii~~~~~~~~l~~l~~~~~-~~ 146 (392) .+++.+..+... .................+.|++.. +.+.+.+++........+.++.+.. ++ T Consensus 1 ~~~~~~~~~~~~-------------~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~ 67 (319) T protein:vir:10 1 MTTKKFDEADKS-------------NVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELS 67 (319) T ss_pred CCCcchhHHhhH-------------HHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCC Confidence 222222221111 111111111111222223354444 3455567777776666666666542 22 Q ss_pred CCcceeEEEeecCCccccccccccc-cccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 147 TRSGSRVLEKNSDMIPFAEITEMGE-IPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYVTKWLGKKSKV 222 (392) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~~l~~~~~~ 222 (392) -..-.+.+......+.+.|++.++. .|-. +..+.........++..+.++..-|+.+. .++..--....+.++.+ T Consensus 68 ~~~~~~~~~~~~~~G~a~~~~d~~~dip~v-~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~ 146 (319) T protein:vir:10 68 PTDKTFEYMTFDKVGTAQIIADYTDDLPLV-DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQ 146 (319) T ss_pred CceEEEEeeeeccccceeeecCccccccce-eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHH Confidence 2222233333344456778877655 3433 45677777888888888888876666543 35777777888888889 Q ss_pred HHHHHHhhcccccc--------------cc---------chhhHHHHHHHHHHHh--hhcccCCceEEEcHHHHHHHHHh Q lcl|Aclame:pro 223 TRNVLILGVIEKLT--------------KQ---------AIKSLDDIKDVLNVKL--DPAISPNAILLTNQDGFNYLDKL 277 (392) Q Consensus 223 ~~d~~~~~~~~~~~--------------~~---------~~~~~d~~~~~~~~~~--~~~~~~~a~~v~~~~~~~~L~~l 277 (392) .+|+.++.|....+ .+ ....++++..++.... ......+..++++|+.|..|... T Consensus 147 ~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~ 226 (319) T protein:vir:10 147 LVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIR 226 (319) T ss_pred hhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcc Confidence 99888777643211 00 1122344444443322 12334556799999999999765 Q ss_pred hccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehh-hceeeeeccceEEEEeccchhhhhcC Q lcl|Aclame:pro 278 KDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLK-EAIVLFKREDMELASTDVGGKAFTRN 356 (392) Q Consensus 278 kd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~-~~~~~~~~~~~~~~~~~~~~~~f~~~ 356 (392) ....|.-++.- +.... +++.+...+.+...+ ..+...+++..-+ +.+.+.....++. .+.... .= T Consensus 227 ~~~~~~t~l~~-lk~~~-------~~l~I~~~pel~~ag-~~g~~~~v~y~~~~~~~~~~v~~~~~~--~~~e~~---~l 292 (319) T protein:vir:10 227 MPETTMSYLDY-FKSQN-------SGIEIDSIAELEDID-GAGTKGVLVYEKNPMNMSIEIPEAFNM--LPAQPK---DL 292 (319) T ss_pred cCCCCeeHHHH-HHHhc-------CCceEEEeeeecccC-CCcceEEEEEecCCceEEEecCcceee--eeeeec---Cc Confidence 55555433321 11111 122222222222222 2233344433322 2232322233332 221111 11 Q ss_pred ceeEEEEEeeCc-EEecccceEEEE-e Q lcl|Aclame:pro 357 TLDLRAIQRDDV-QMWDNEAAVYGE-I 381 (392) Q Consensus 357 ~~~~~~~~r~~~-~v~~~~af~~l~-~ 381 (392) .+.+..+.|+++ .+.+|.||++++ + T Consensus 293 ~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 293 HFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eEEEeeeeeeEEEEEEccceeEeeecC Confidence 244456777764 677899999887 5 No 148 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.13 E-value=4e-06 Score=50.19 Aligned_cols=280 Identities=11% Similarity=0.025 Sum_probs=136.3 Q ss_pred hhhhhhhccccc-ccc-ceec-chhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 100 DLEQRAMSGLTG-EDG-GLVI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 100 ~~~~~a~~~~~~-~~g-g~~i-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) +.--...+.... ..| -..+ -+.+..++.......+.++++..++++.+++ .+.+++. +...++...-|... +.. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gk-S~q~~~i-G~~~~~~~~~G~~l-d~~ 77 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTN-SVSNKYI-GETELQVLSPGKSP-DAS 77 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccc-eEEeeee-eeeEEeeeccCccc-CCC Confidence 000001111111 111 1223 3788889999999999999999999988654 4555543 44455555544432 233 Q ss_pred ccceeeEEechhhee-eehhhHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------------ccc---- Q lcl|Aclame:pro 177 NPKFSNVQYAVKDRA-GILPLSRSLLQDSDQN-ILKYVTKWLGKKSKVTRNVLILGVIE--------------KLT---- 236 (392) Q Consensus 177 ~~~~~~v~~~~~~i~-~~~~iS~e~l~ds~~~-l~~~v~~~l~~~~~~~~d~~~~~~~~--------------~~~---- 236 (392) .+.-++.++...++- ....|-+----+++++ +.+.+.+++.+++++..|+.++...- ... T Consensus 78 ~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~ 157 (364) T protein:vir:10 78 PTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGF 157 (364) T ss_pred CcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcc Confidence 455566666555432 1122221111123455 67888888899999888887643110 000 Q ss_pred --------ccchhhHHHHHHHH---HHHhhhcccCC--ceEEEcHHHHHHHHHhhccCCc-eee--cccccCCcccceec Q lcl|Aclame:pro 237 --------KQAIKSLDDIKDVL---NVKLDPAISPN--AILLTNQDGFNYLDKLKDKDGK-YIL--QSDPTQKNKKLFAG 300 (392) Q Consensus 237 --------~~~~~~~d~~~~~~---~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~g~-~l~--~~~~~~~~~~~~~g 300 (392) .........+.+++ ...++...-+. -+.+++|..|..|.+-..--.+ |.. ..+...+....+.| T Consensus 158 ~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~G 237 (364) T protein:vir:10 158 SIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSWN 237 (364) T ss_pred eeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEec Confidence 00111122333322 12344444433 4779999999988653221111 100 01112233345666 Q ss_pred ccceEEecCcccccccccC----------------CcceEEEEehh---------hceeeeeccceEEEEeccchhhhhc Q lcl|Aclame:pro 301 TNPVVVVSNRFLKSKGTTA----------------KKAPLIIGDLK---------EAIVLFKREDMELASTDVGGKAFTR 355 (392) Q Consensus 301 ~~pv~~~~~~~~~~~~~~~----------------~~~~~~~Gd~~---------~~~~~~~~~~~~~~~~~~~~~~f~~ 355 (392) .+ | +.++.+ |..+..+ +..--+.|||. +++....-.+++.+..+.. ..| T Consensus 238 v~-V-v~Sn~l-P~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~-~~~-- 311 (364) T protein:vir:10 238 TP-I-VPSNRF-PKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEK-KEK-- 311 (364) T ss_pred eE-E-Eecccc-ccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeecc-cee-- Confidence 54 3 333444 3211100 00000123332 2333444456666665432 222 Q ss_pred CceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 356 NTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 356 ~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ...+.+.+-+|.++.||++++.++...++- |+- T Consensus 312 -~~~ida~~a~G~g~lRPeaa~~i~~~~~~~---~~~ 344 (364) T protein:vir:10 312 -TWYIDTFLAEGAIPDRWEAVAVVTAADTAE---LAT 344 (364) T ss_pred -eeeeeeehcccCcccCccceEEEEecCCCC---Ccc Confidence 234455666899999999999886443322 333 No 149 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.12 E-value=1.2e-06 Score=53.10 Aligned_cols=267 Identities=7% Similarity=-0.018 Sum_probs=125.2 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhh---------hcceeeccCCcceeEEEeecCCcccccccccc-ccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ---------YVTVEPVRTRSGSRVLEKNSDMIPFAEITEMG-EIPET 175 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~---------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~-~~~~~ 175 (392) |...++.-.-.++|+.+...+...+.+.+.|++ +......++...++|+....+ ..+.-+.|+. ..+. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~-G~~~~~~dg~~~i~~- 78 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLT-GDSEVLGNGDKALET- 78 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCC-CcccccCCCccccch- Confidence 333334445677888887777777666555432 111111244444555543232 2344455654 3432 Q ss_pred cccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------------- Q lcl|Aclame:pro 176 DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK--------------------- 234 (392) Q Consensus 176 ~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~--------------------- 234 (392) +..+-.+-....++.+.-..++++...-+.-+....+.+++++...+..+..++..+.. T Consensus 79 ~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:10 79 GKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQ 158 (330) T ss_pred hhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecc Confidence 33443444444444555555555554334445566677777766555555443332110 Q ss_pred ccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCccccc Q lcl|Aclame:pro 235 LTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS 314 (392) Q Consensus 235 ~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~ 314 (392) .+..+...++.+.++..+. ......-.+|+||+.++..|++..--+ + +++......-.+++| .+|++. +..+.. T Consensus 159 ~~~~a~~s~~~l~~A~~~~-GD~~~~~~~ivmhS~v~~~L~~~~li~--~-~~~s~~~~~i~~~~G-~~Vivd-D~~p~~ 232 (330) T protein:vir:10 159 SKASTGIDAGMVLDAKQLL-GDSADQVTAIAMHSAVYTKLQKDNLIQ--Y-IQPTTATINIPTYLG-YRVIID-DGIAPT 232 (330) T ss_pred cccccccCHHHHHHHHHHh-ccccccceEEEEcHHHHHHHHHhhhhh--h-hcccccCcccccccc-eEEEEe-CCCCCC Confidence 1122234567777776543 333345678999999999998643111 1 111111223345666 445443 333222 Q ss_pred ccccCCcceEEEEehhhceeeeec---cceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCC Q lcl|Aclame:pro 315 KGTTAKKAPLIIGDLKEAIVLFKR---EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQ 391 (392) Q Consensus 315 ~~~~~~~~~~~~Gd~~~~~~~~~~---~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~ 391 (392) . ..-..++||. .++.+.+. ..+.++.++. ...++..+....++ +++|.+|..-....+....+|- T Consensus 233 ~---~~yt~yl~~~--GAi~~~~~~~~~~v~~EtdRd----~~~g~~~l~~r~~~---~~hp~G~s~~~~~~~~~~~sPt 300 (330) T protein:vir:10 233 G---DIYTSYLFRT--GSIGLNTGNPSGLTTFETSRE----AAKGNDMIYTRRAL---VMHPYGVKWTGAEVDAGNITPS 300 (330) T ss_pred C---CceeEEEEec--CceeeecccCCccccccccCC----ccccceEEEEeeEE---EeeeeeeeecccccccCcCCcC Confidence 1 1223455553 22323221 1233444433 23455555555554 4556666543321111222333 Q ss_pred C Q lcl|Aclame:pro 392 G 392 (392) Q Consensus 392 ~ 392 (392) . T Consensus 301 ~ 301 (330) T protein:vir:10 301 N 301 (330) T ss_pred h Confidence 3 No 150 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.07 E-value=6e-07 Score=54.68 Aligned_cols=265 Identities=9% Similarity=-0.008 Sum_probs=141.3 Q ss_pred hccccccccceecch---hhhhHHHHhHHhhhhhhhhcceee-ccCCcceeEEEeecCCcccccccccccccccccccee Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQ---DIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFS 181 (392) Q Consensus 106 ~~~~~~~~gg~~iP~---~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~ 181 (392) ++......+|.++-. .+.+.+++.....-..+.++.+.. ++...-.+.+......+.+.|.+.++...+..+..+. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 333322344444442 455667776666555665555443 2222223333333444566777776542223345677 Q ss_pred eEEechhheeeehhhHHHHHhhhH---HHHHHHHHHHHHHHHHHHHHHHHhhcccccc---------------cc----c Q lcl|Aclame:pro 182 NVQYAVKDRAGILPLSRSLLQDSD---QNILKYVTKWLGKKSKVTRNVLILGVIEKLT---------------KQ----A 239 (392) Q Consensus 182 ~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~---------------~~----~ 239 (392) +.....+.++..+.++..-|+.+. .++..--....+.++.+.+|+.++.|....+ .+ . T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~~ 160 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQP 160 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccCH Confidence 888888888888888877666543 3577777788888889999988777643211 01 1 Q ss_pred hhhHHHHHHHHHHHh--hhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccc Q lcl|Aclame:pro 240 IKSLDDIKDVLNVKL--DPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) Q Consensus 240 ~~~~d~~~~~~~~~~--~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~ 317 (392) ...++++..++.... ......+..++++|..+..|.+.-...|.-++.- +.... ++..++..+.+...+ T Consensus 161 t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~-ik~~~-------~~l~i~~~~~l~~a~- 231 (296) T protein:vir:10 161 TTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEF-FRQNN-------SGVTVEFVQYLNDYN- 231 (296) T ss_pred HHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHH-HHHhc-------CCceEEEeeeeccCC- Confidence 123556666554322 2244556689999999999876554444333221 11111 122222222222211 Q ss_pred cCCcceEEEEehhh-ceeeeeccceEEEEeccchhhhhcCceeEEEEEeeC-cEEecccceEEEE-eccc Q lcl|Aclame:pro 318 TAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVYGE-IDLS 384 (392) Q Consensus 318 ~~~~~~~~~Gd~~~-~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~-~~v~~~~af~~l~-~~~~ 384 (392) ..+...+++-+-+. .+.+.....++. .+... ..-...++...|++ ..+.+|.||++++ ++-+ T Consensus 232 ~~g~~~~v~~~~~~~~~~~~v~~~~~~--~~~e~---~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 232 GTGTSAAIAYEKDPNNMAIEIPEATNA--LPAQP---KDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred CCcceEEEEEEcCCceEEEEcCcceee--ecccc---cCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 22333444433222 233333333332 22111 12235667788886 6888999999986 5555 No 151 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.02 E-value=1.6e-06 Score=52.42 Aligned_cols=271 Identities=14% Similarity=0.074 Sum_probs=138.9 Q ss_pred hccc-cccccceec-chhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccccccee-- Q lcl|Aclame:pro 106 MSGL-TGEDGGLVI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFS-- 181 (392) Q Consensus 106 ~~~~-~~~~gg~~i-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~-- 181 (392) +..+ .++....+| |+.++..|...+++......+.+....+ .|.-......+.+...-..+++... .+..+-. T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g--~GDtV~InsIg~~tV~dY~~~~~i~-~d~ltt~~~ 77 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFP--DGDKLTIPSVGTPVVRSRPEQGDFT-FDNLDTGEI 77 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccC--CCCeEEeccccccccccccCCCCcc-cccCCCceE Confidence 2222 223344444 9899899998888877766665543332 2333333334455555555555432 1222223 Q ss_pred eEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc----------cccc-------------ccc Q lcl|Aclame:pro 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV----------IEKL-------------TKQ 238 (392) Q Consensus 182 ~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~----------~~~~-------------~~~ 238 (392) .+.++-.|+.++. |+++.. +...+|.+...++.+++++...|..+..- .++. +.. T Consensus 78 ~l~IDq~KYfaf~-VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 78 SIILRDEVYAGNA-ISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEehhhhhccc-cchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 4556666666554 777554 56788999999999999999888765431 1111 111 Q ss_pred chhhHHHHHHHHHHHhhhcccCC-c-eEEEcHHHHHHHHHh-----hccCCcee--ecccccCCc--ccceecccceEEe Q lcl|Aclame:pro 239 AIKSLDDIKDVLNVKLDPAISPN-A-ILLTNQDGFNYLDKL-----KDKDGKYI--LQSDPTQKN--KKLFAGTNPVVVV 307 (392) Q Consensus 239 ~~~~~d~~~~~~~~~~~~~~~~~-a-~~v~~~~~~~~L~~l-----kd~~g~~l--~~~~~~~~~--~~~~~g~~pv~~~ 307 (392) ....|+.++++. ..++...-|. . ..|++|..+..|..+ --.++|+. ..-+...+. ...++|.. |++ T Consensus 156 ~~~ay~~lv~l~-~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~-V~~- 232 (322) T protein:vir:31 156 QTMDVTDFSRVN-YVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGID-LFV- 232 (322) T ss_pred chhhHHHHHHHH-HHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhcee-eee- Confidence 234677888854 4566555553 3 457789998877443 12233432 121111111 24566764 433 Q ss_pred cCcccccc-c---------ccCCcce--EEEEehhh-ceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 308 SNRFLKSK-G---------TTAKKAP--LIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 308 ~~~~~~~~-~---------~~~~~~~--~~~Gd~~~-~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) +|.+.... . +.++... +-+.|+.- ..+...|+-.+-+..... ..| --.+|+.+|+|.++.+|+ T Consensus 233 SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~-~~~---~d~~~~~~~~g~g~~r~e 308 (322) T protein:vir:31 233 SNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDD-YND---DLNTATTARWGNGLVRDE 308 (322) T ss_pred eccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCc-ccc---ccceeeeeeecceeeccc Confidence 34321110 0 0011000 01111110 011111111122222221 122 246789999999999999 Q ss_pred ceEEEEecccCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQ 389 (392) Q Consensus 375 af~~l~~~~~a~~~~ 389 (392) ..+.|. +.++|++= T Consensus 309 ~l~~~~-a~~~~~~~ 322 (322) T protein:vir:31 309 NLVCVL-ANADKVTF 322 (322) T ss_pred ceEEEE-eccccccC Confidence 998776 33445444 No 152 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.99 E-value=3.1e-07 Score=56.26 Aligned_cols=263 Identities=10% Similarity=-0.006 Sum_probs=135.6 Q ss_pred hhhhhh---hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 100 DLEQRA---MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 100 ~~~~~a---~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) +.-.|. -+.....+-+..+--++.+.+-..+..-..+++..+..|+..++---.++.......+.-|+||.++|- + T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Ipl-s 79 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPL-S 79 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccch-h Confidence 000010 011111222233334566666666666566666668888876553323444455567788999998884 4 Q ss_pred ccceee---EEechhheeeehhhHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHH- Q lcl|Aclame:pro 177 NPKFSN---VQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKSLDDIKDVLN- 251 (392) Q Consensus 177 ~~~~~~---v~~~~~~i~~~~~iS~e~l~ds~~-~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~~~~d~~~~~~~- 251 (392) ..+-.. .+++.+|.+.-+ |.|.++.+.+ +-...-.++|...+..+++..++..+.+.+.+...+.+.+..++. T Consensus 80 kvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~t~~~lQ~Ala~ 157 (296) T protein:vir:98 80 KVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALAS 157 (296) T ss_pred hheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeeechhhHHHHHHH Confidence 566543 667778877774 9999864433 345677788899999999999988887666554445566666553 Q ss_pred ------HHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccc---cccCCcc Q lcl|Aclame:pro 252 ------VKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSK---GTTAKKA 322 (392) Q Consensus 252 ------~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~---~~~~~~~ 322 (392) ...+.....+.++++||.+...+++-..-.-+-.|...+. ..++|. -| +.+..+ |.. .+..++- T Consensus 158 ~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~fG~tyl----~nfLG~-~I-I~S~kV-~~G~~~~T~~~Ni 230 (296) T protein:vir:98 158 AWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYL----VDFTGT-VI-ISTNDV-TKGEIWATVPENI 230 (296) T ss_pred HhhhhhhhccccCCCceEEEEehHHHHHHhcCCccchhheechhhh----hhcccc-EE-EEcCcC-CCceEEEeeecce Confidence 1223222345689999999887642211111112221111 125663 22 333322 222 1222222 Q ss_pred eEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEe----------------eCcEEecccceEEEEecccC Q lcl|Aclame:pro 323 PLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQR----------------DDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 323 ~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r----------------~~~~v~~~~af~~l~~~~~a 385 (392) .+.+.|++. +.+.-.+. |..|.+++.+..+ .-+-+-+++++++.+++++- T Consensus 231 ~~ay~~~~~-------~~l~~~f~------~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 231 IFAYINPNN-------SELAKEFN------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred EEEeecccc-------cchhhhhc------cccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 222222210 11111111 1112222222211 11234567888999986544 No 153 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.95 E-value=4.9e-06 Score=49.70 Aligned_cols=280 Identities=11% Similarity=0.015 Sum_probs=139.1 Q ss_pred HHHHhhhhhhhhccccccccceecchhhhhHHHHhHHh-hhhhhhhcceeeccCCcceeEEEeecCCccc------cccc Q lcl|Aclame:pro 95 EFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS-FDALEQYVTVEPVRTRSGSRVLEKNSDMIPF------AEIT 167 (392) Q Consensus 95 ~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~-~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 167 (392) +..+... .-...+.+.. ...+| +++..++.....+ .+.|++-++...-.+.+.++..+........ .... T Consensus 1 ~~~~~~~-~~~~~Ms~~i-~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (322) T protein:vir:10 1 MKLNAIM-SMLPLIAGDI-DQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSA 77 (322) T ss_pred Cccccee-eeeeeeechh-hhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeeccccccccccccccccccc Confidence 0000000 0000011111 11222 5566666555554 4666665554433333323222221111010 0011 Q ss_pred ccc-ccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccc-----c---- Q lcl|Aclame:pro 168 EMG-EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVI-EKL-----T---- 236 (392) Q Consensus 168 E~~-~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~-~~~-----~---- 236 (392) .+. ..|.. ...++...+.....+....|.+.-.-+...+..+...+..+.+++++.|..++.+. +.+ + T Consensus 78 d~~~dtp~~-~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~ 156 (322) T protein:vir:10 78 DGTYPTPVN-NKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVE 156 (322) T ss_pred CcccCCCcc-ccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccc Confidence 111 11211 11233333333333444567766666677788999999999999999999877631 111 0 Q ss_pred ---------ccchhhHHHHHHHHHHHhhhcccCC---ceEEEcHHHHHHHHHhhccCC-ceeecccc-cCCcccceeccc Q lcl|Aclame:pro 237 ---------KQAIKSLDDIKDVLNVKLDPAISPN---AILLTNQDGFNYLDKLKDKDG-KYILQSDP-TQKNKKLFAGTN 302 (392) Q Consensus 237 ---------~~~~~~~d~~~~~~~~~~~~~~~~~---a~~v~~~~~~~~L~~lkd~~g-~~l~~~~~-~~~~~~~~~g~~ 302 (392) ...+.+++.++.+.. .++.+.-++ -.+|.+|..|..|.....-.. .|.-.... .+|...+++|.. T Consensus 157 ~~ss~~i~~g~~g~t~~kl~~a~~-~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~ 235 (322) T protein:vir:10 157 FLATQEIGDGTKPISFDYVTEITE-RFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYT 235 (322) T ss_pred cCCCcccccCccchhHHHHHHHHH-HHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEE Confidence 012344666666543 344444443 257889999999865432222 22222222 234456678764 Q ss_pred ceEEecCcccccccc----------cCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEec Q lcl|Aclame:pro 303 PVVVVSNRFLKSKGT----------TAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD 372 (392) Q Consensus 303 pv~~~~~~~~~~~~~----------~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~ 372 (392) + +.++.++.+.++ .......+++. ++++....+..++.++++..+.. +...+++.+-+|..+++ T Consensus 236 -~-i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~-k~Av~~a~~~dv~~~i~~~~~~~---~a~~I~~~~~~Ga~ri~ 309 (322) T protein:vir:10 236 -W-IVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMT-DMALGYHSCKDIWTKVAEDPSAS---FAWRIYSAFTADCVRVE 309 (322) T ss_pred -E-EEeccCCccccccccccccCCCCccceeEEEEe-cCceeEEEeeeeeEEeeccCCcc---hhhhhhhhhhhCceEec Confidence 3 333333322111 11112223332 45677777777888776654432 23456677889999999 Q ss_pred ccceEEEEecccC Q lcl|Aclame:pro 373 NEAAVYGEIDLSA 385 (392) Q Consensus 373 ~~af~~l~~~~~a 385 (392) |++++.+..+-+- T Consensus 310 ~~gVv~i~~~e~~ 322 (322) T protein:vir:10 310 DEHIFKLRLKNSL 322 (322) T ss_pred cCcEEEEEEeccC Confidence 9999999987777 No 154 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.90 E-value=6.9e-07 Score=54.35 Aligned_cols=266 Identities=10% Similarity=0.022 Sum_probs=139.7 Q ss_pred hhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcc--eeEEEeecCCccccccccccccccccc Q lcl|Aclame:pro 100 DLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSG--SRVLEKNSDMIPFAEITEMGEIPETDN 177 (392) Q Consensus 100 ~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~E~~~~~~~~~ 177 (392) +.... +..+..+-+..+--++.+.+-..+..-..+++..+..|+..++- .|.++.......++-|+||..+| .+. T Consensus 1 M~~e~--nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Ip-lsk 77 (303) T protein:vir:10 1 MSAEN--NLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIP-LTK 77 (303) T ss_pred CCCCc--CCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccc-hhh Confidence 11111 11111222222333566666555555555666667777765442 23344444556677899999888 455 Q ss_pred ccee---eEEechhheeeehhhHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccc------ccchhhHHHHH Q lcl|Aclame:pro 178 PKFS---NVQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLT------KQAIKSLDDIK 247 (392) Q Consensus 178 ~~~~---~v~~~~~~i~~~~~iS~e~l~ds~~-~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~------~~~~~~~d~~~ 247 (392) .+-. ..+++.+|++..+ |.|.++.+.+ +-...-.++|...+...++..++..+.+.+ ......++.+. T Consensus 78 vt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s~~glq 155 (303) T protein:vir:10 78 VTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKLSAENLQ 155 (303) T ss_pred heeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceeecHHHHH Confidence 6543 4677788888755 9999864433 345566778888888888888877665543 33445677777 Q ss_pred HHHHHHh------hhcccCCceEEEcHHHHHHHHHhhccCCc-eeecccccCCcccceecccceEEecCccccc--cccc Q lcl|Aclame:pro 248 DVLNVKL------DPAISPNAILLTNQDGFNYLDKLKDKDGK-YILQSDPTQKNKKLFAGTNPVVVVSNRFLKS--KGTT 318 (392) Q Consensus 248 ~~~~~~~------~~~~~~~a~~v~~~~~~~~L~~lkd~~g~-~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~--~~~~ 318 (392) .++.... ... ..+.++++||.+...+++-..-+.+ --|..++.. .++|.. | +.+..++.- ..+. T Consensus 156 ~Al~~~~~kl~~~~ed-~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~----nfLG~~-I-I~S~kv~~G~~~~T~ 228 (303) T protein:vir:10 156 GALSKGRANLSVLLDD-EITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLT----PYVGVK-I-VEFADVPQGEVWMTV 228 (303) T ss_pred HHHHhhhhhccccccc-cccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhh----hhhcce-E-EEeccCCCceEEEee Confidence 7664331 222 2445899999999988642211111 112222211 267764 3 333322211 1223 Q ss_pred CCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEe----------------eCcEEecccceEEEEec Q lcl|Aclame:pro 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQR----------------DDVQMWDNEAAVYGEID 382 (392) Q Consensus 319 ~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r----------------~~~~v~~~~af~~l~~~ 382 (392) .++-.+.+.+.+ +.+.-.+ .|..|.+++.+..+ .-+-+-+++++++.+++ T Consensus 229 ~~Ni~~ay~~~~--------g~l~~~f------~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~ 294 (303) T protein:vir:10 229 AENLNVAYANPR--------GELSRAF------AFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIK 294 (303) T ss_pred ccceEEEEecCc--------hhhhhhh------hhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEe Confidence 333333333332 1111111 11122222222221 11234577899999998 Q ss_pred ccCCCCCCC Q lcl|Aclame:pro 383 LSAPVEQPQ 391 (392) Q Consensus 383 ~~a~~~~~~ 391 (392) ..-....|. T Consensus 295 ~~e~~~~~~ 303 (303) T protein:vir:10 295 KDEAGELPS 303 (303) T ss_pred ccccCCCCC Confidence 877767777 No 155 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=97.83 E-value=2.1e-06 Score=51.69 Aligned_cols=215 Identities=9% Similarity=0.011 Sum_probs=131.1 Q ss_pred hhhhhhccccccc-cceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccc Q lcl|Aclame:pro 101 LEQRAMSGLTGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) Q Consensus 101 ~~~~a~~~~~~~~-gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 179 (392) -...+....|-.. +..+-|......|+|.+.+.++|+..++........+... ...++-+.++|..=+...+++ ..+ T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t-~vrt~LP~~~fR~lN~g~~~s-~~t 78 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRT-SVRTGLPTPTWRKLYGGVLPN-KSS 78 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccce-eEEeecCCchhhhcCCccccc-cce Confidence 0000111111112 2234455666789999999999998888865554444432 233556778898888777765 689 Q ss_pred eeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------- Q lcl|Aclame:pro 180 FSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-------------------- 237 (392) Q Consensus 180 ~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~-------------------- 237 (392) +.+++...+-+.+.+.|-+.+.+... -++...-.+...+++.+++...+++|..+..+ T Consensus 79 t~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~q 158 (330) T protein:vir:10 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) T ss_pred EEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhh Confidence 99999999999999999998876532 23444555667788888888777776221000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 238 -------------------------------------------------------------------------------- 237 (392) Q Consensus 238 -------------------------------------------------------------------------------- 237 (392) T Consensus 159 vIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI 238 (330) T protein:vir:10 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) T ss_pred eeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEE Confidence Q ss_pred --------cchhhHHHHHHHHHH---HhhhcccCCceEEEcHHHHHHHHHh-hccCCceeecccccCCcccceecccceE Q lcl|Aclame:pro 238 --------QAIKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) Q Consensus 238 --------~~~~~~d~~~~~~~~---~~~~~~~~~a~~v~~~~~~~~L~~l-kd~~g~~l~~~~~~~~~~~~~~g~~pv~ 305 (392) ......++++++|.. .++...+.+.+|+||++...+|++. .+.++..+-. .-..+...+.|++.||. T Consensus 239 ~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~-~~~~g~~~t~~~gipir 317 (330) T protein:vir:10 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTW-ETVSGERVMTFDGIPVQ 317 (330) T ss_pred eecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeee-eecCCeeeEEECCeEEE Confidence 000123456665543 3355566778999999999999975 4554433322 22344455677777887 Q ss_pred EecCcccccccccCCcceEE Q lcl|Aclame:pro 306 VVSNRFLKSKGTTAKKAPLI 325 (392) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~ 325 (392) .++. +.. .+..++ T Consensus 318 ~~Da-il~------tE~~vv 330 (330) T protein:vir:10 318 RTDA-LLN------TESRVV 330 (330) T ss_pred EEee-eec------CccccC Confidence 6542 111 111111 No 156 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=97.81 E-value=3.6e-06 Score=50.42 Aligned_cols=284 Identities=11% Similarity=0.023 Sum_probs=133.1 Q ss_pred hhhhhhhccccc-cccc-eec-chhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 100 DLEQRAMSGLTG-EDGG-LVI-PQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 100 ~~~~~a~~~~~~-~~gg-~~i-P~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) +.--..++.... ..|. ..+ -+.+.+++.......+.++++..++++.+++ .+.+++. +...+.+..-|... ... T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~Gk-S~qf~~i-G~~~a~y~~~G~~l-dg~ 77 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSP-NAT 77 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccc-eEEEEEE-eeeEEeeecccccc-CCC Confidence 000001111111 1111 222 3788889999999999999999999988654 4455543 44455555544432 233 Q ss_pred ccceeeEEechhhee-eehhhHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------c-------------- Q lcl|Aclame:pro 177 NPKFSNVQYAVKDRA-GILPLSRSLLQDSDQN-ILKYVTKWLGKKSKVTRNVLILGVIE------K-------------- 234 (392) Q Consensus 177 ~~~~~~v~~~~~~i~-~~~~iS~e~l~ds~~~-l~~~v~~~l~~~~~~~~d~~~~~~~~------~-------------- 234 (392) .+.-++..+....+- ....|.+----+++++ +.+.+.+++.+++++..|+.++.... + T Consensus 78 ~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~ 157 (402) T protein:vir:97 78 PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) T ss_pred CcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccccc Confidence 455556555555432 1122221111123455 67888889999999999987653210 0 Q ss_pred ccccc------hhhHHHHHHHH---HHHhhhcccCC--ceEEEcHHHHHHHHHhhccCCc-eeec--ccccCCcccceec Q lcl|Aclame:pro 235 LTKQA------IKSLDDIKDVL---NVKLDPAISPN--AILLTNQDGFNYLDKLKDKDGK-YILQ--SDPTQKNKKLFAG 300 (392) Q Consensus 235 ~~~~~------~~~~d~~~~~~---~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd~~g~-~l~~--~~~~~~~~~~~~g 300 (392) ....+ ....+.+.+++ ...++...-+. -+.+++|..|..|.+-++--.+ |... ..+..+....+.| T Consensus 158 s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~G 237 (402) T protein:vir:97 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) T ss_pred ccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEec Confidence 00000 12223333322 12334333333 4789999999998753221111 1100 0122333345666 Q ss_pred ccceEEecCccccccc--------ccC--CcceEEEEehhhce-eeeeccce-EEEEeccc------hhhhhcCceeEEE Q lcl|Aclame:pro 301 TNPVVVVSNRFLKSKG--------TTA--KKAPLIIGDLKEAI-VLFKREDM-ELASTDVG------GKAFTRNTLDLRA 362 (392) Q Consensus 301 ~~pv~~~~~~~~~~~~--------~~~--~~~~~~~Gd~~~~~-~~~~~~~~-~~~~~~~~------~~~f~~~~~~~~~ 362 (392) .+ | +.++.++ +.+ +.+ +...-+-|||+... .+|.++-+ +++.-+-+ ...|. ..+.+ T Consensus 238 v~-V-v~SnnlP-~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~---~~id~ 311 (402) T protein:vir:97 238 CP-V-IPSNRFP-TFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT---YYIDT 311 (402) T ss_pred eE-E-EecCccc-cccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHH---HHHHH Confidence 54 3 2334443 211 011 11111235555422 22222211 22222111 12222 23345 Q ss_pred EEeeCcEEecccceEEEEecc-cCCCCCCCC Q lcl|Aclame:pro 363 IQRDDVQMWDNEAAVYGEIDL-SAPVEQPQG 392 (392) Q Consensus 363 ~~r~~~~v~~~~af~~l~~~~-~a~~~~~~~ 392 (392) .+-+|..+.||++..+++.+- ..+..++-- T Consensus 312 ~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~ 342 (402) T protein:vir:97 312 FMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) T ss_pred HHHhCCcccCccceEEEEEecccccccCCcc Confidence 555799999999999887655 122111111 No 157 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.80 E-value=6.4e-06 Score=49.04 Aligned_cols=259 Identities=12% Similarity=-0.001 Sum_probs=136.7 Q ss_pred cccccccceecc--hhhhhHHHHhHHhhhhhhhhcceee-ccCCcceeEEEeecCCccccccccccccccccccceeeEE Q lcl|Aclame:pro 108 GLTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQ 184 (392) Q Consensus 108 ~~~~~~gg~~iP--~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~ 184 (392) ..+...|.++.- +.+.+.+++.+......+.++.+.. ++-..-.+.+........+.|.+.++..-+..+..+.... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 444445554332 2456678888888777777765542 2222222333333444566777776653223345677778 Q ss_pred echhheeeehhhHHHHHhhhH---HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------------ Q lcl|Aclame:pro 185 YAVKDRAGILPLSRSLLQDSD---QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------------------------ 237 (392) Q Consensus 185 ~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~------------------------ 237 (392) .....++.-+.++..-|+.+. .++..--....+.++...+|+.++.|....+. T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~ 160 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSK 160 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccccc Confidence 888888888888877666553 35777777888889999999887777432110 Q ss_pred ----cchhhHHHHHHHHHHHhh--hcccCCceEEEcHHHHHHHHHhh--ccCCceeecccccCCcccceecccceEEecC Q lcl|Aclame:pro 238 ----QAIKSLDDIKDVLNVKLD--PAISPNAILLTNQDGFNYLDKLK--DKDGKYILQSDPTQKNKKLFAGTNPVVVVSN 309 (392) Q Consensus 238 ----~~~~~~d~~~~~~~~~~~--~~~~~~a~~v~~~~~~~~L~~lk--d~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~ 309 (392) +....++++..++..... .....+-.++++|+.+..|.... +..|.-++.- +....+. ..++.. T Consensus 161 w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~-l~~~~~~-------~~I~~~ 232 (301) T protein:vir:80 161 WEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKV-LQDNAWF-------SAIVRV 232 (301) T ss_pred cccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHH-HHHHcCc-------ceEEEc Confidence 011124555555543322 23334557999999999997543 4444433321 1111111 112222 Q ss_pred cccccccccCCcceEEEEeh-hhceeeeeccceEEEEeccchhhhhcC-ceeEEEEEeeC-cEEecccceEEEE-e Q lcl|Aclame:pro 310 RFLKSKGTTAKKAPLIIGDL-KEAIVLFKREDMELASTDVGGKAFTRN-TLDLRAIQRDD-VQMWDNEAAVYGE-I 381 (392) Q Consensus 310 ~~~~~~~~~~~~~~~~~Gd~-~~~~~~~~~~~~~~~~~~~~~~~f~~~-~~~~~~~~r~~-~~v~~~~af~~l~-~ 381 (392) +.+...+. .+...+++-.- .+.+.+.....++. .+.. .++ ......+.|++ ..+.+|.||++++ + T Consensus 233 p~L~~~g~-~g~~~~v~~~~~~d~~~~~v~~~~~~--~~~e----~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 233 PDLAGMGT-AGSDSFAVIHDSNETAELIIPMDITR--HPEE----YSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred ceeccCCC-CcccEEEEEecCCcEEEEEecCceee--ecce----ecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 22222221 23333332211 12222222233322 2211 122 13344567775 5778999999887 5 No 158 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.77 E-value=1.3e-05 Score=47.33 Aligned_cols=216 Identities=10% Similarity=0.011 Sum_probs=129.2 Q ss_pred hhhhhhcccccccc-ceecchh-hhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccccc Q lcl|Aclame:pro 101 LEQRAMSGLTGEDG-GLVIPQD-IQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) Q Consensus 101 ~~~~a~~~~~~~~g-g~~iP~~-~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~ 178 (392) -........|-... ..+-|.. +...|++.+.+.++|+..+.........+.. ..+.++-+.++|..=+...+++ .. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~-~~vrt~LP~~~fR~lN~g~~~s-~~ 78 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHK-TTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccce-eeEEeccCCchhhccCCccCcc-cc Confidence 00000011111111 1122433 4457999999999999998887655554433 3455667888999888888765 68 Q ss_pred ceeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------- Q lcl|Aclame:pro 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------------------- 236 (392) Q Consensus 179 ~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------------------- 236 (392) ++.+++...+-+.+.+.|.+.+.+... -++...-.+...+++.+.+...+++|..+.. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999877632 2234445556777888888877776531100 Q ss_pred -------cc----------------------------------------------------------------------- Q lcl|Aclame:pro 237 -------KQ----------------------------------------------------------------------- 238 (392) Q Consensus 237 -------~~----------------------------------------------------------------------- 238 (392) ++ T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00 Q ss_pred ---------chhhHHHHHHHHHH---HhhhcccCCceEEEcHHHHHHHHHh-hccCCceeecccccCCcccceecccceE Q lcl|Aclame:pro 239 ---------AIKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) Q Consensus 239 ---------~~~~~d~~~~~~~~---~~~~~~~~~a~~v~~~~~~~~L~~l-kd~~g~~l~~~~~~~~~~~~~~g~~pv~ 305 (392) ......+++++|.. .+++....+.+|+||.+....|++. .+......+......+...+.|++.||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00011234444432 2345556778999999999999875 4443333344444455566777778887 Q ss_pred EecCcccccccccCCcceEE Q lcl|Aclame:pro 306 VVSNRFLKSKGTTAKKAPLI 325 (392) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~ 325 (392) .++.-. . .+..++ T Consensus 319 ~~dai~-~------tE~~Vv 331 (331) T protein:vir:10 319 RTDALL-L------TEARVV 331 (331) T ss_pred Eeeeee-c------CccccC Confidence 654321 1 111111 No 159 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.77 E-value=1.3e-05 Score=47.33 Aligned_cols=216 Identities=10% Similarity=0.011 Sum_probs=129.2 Q ss_pred hhhhhhcccccccc-ceecchh-hhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccccc Q lcl|Aclame:pro 101 LEQRAMSGLTGEDG-GLVIPQD-IQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) Q Consensus 101 ~~~~a~~~~~~~~g-g~~iP~~-~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~ 178 (392) -........|-... ..+-|.. +...|++.+.+.++|+..+.........+.. ..+.++-+.++|..=+...+++ .. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~-~~vrt~LP~~~fR~lN~g~~~s-~~ 78 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHK-TTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccce-eeEEeccCCchhhccCCccCcc-cc Confidence 00000011111111 1122433 4457999999999999998887655554433 3455667888999888888765 68 Q ss_pred ceeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------- Q lcl|Aclame:pro 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------------------- 236 (392) Q Consensus 179 ~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------------------- 236 (392) ++.+++...+-+.+.+.|.+.+.+... -++...-.+...+++.+.+...+++|..+.. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999877632 2234445556777888888877776531100 Q ss_pred -------cc----------------------------------------------------------------------- Q lcl|Aclame:pro 237 -------KQ----------------------------------------------------------------------- 238 (392) Q Consensus 237 -------~~----------------------------------------------------------------------- 238 (392) ++ T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00 Q ss_pred ---------chhhHHHHHHHHHH---HhhhcccCCceEEEcHHHHHHHHHh-hccCCceeecccccCCcccceecccceE Q lcl|Aclame:pro 239 ---------AIKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) Q Consensus 239 ---------~~~~~d~~~~~~~~---~~~~~~~~~a~~v~~~~~~~~L~~l-kd~~g~~l~~~~~~~~~~~~~~g~~pv~ 305 (392) ......+++++|.. .+++....+.+|+||.+....|++. .+......+......+...+.|++.||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00011234444432 2345556778999999999999875 4443333344444455566777778887 Q ss_pred EecCcccccccccCCcceEE Q lcl|Aclame:pro 306 VVSNRFLKSKGTTAKKAPLI 325 (392) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~ 325 (392) .++.-. . .+..++ T Consensus 319 ~~dai~-~------tE~~Vv 331 (331) T protein:vir:10 319 RTDALL-L------TEARVV 331 (331) T ss_pred Eeeeee-c------CccccC Confidence 654321 1 111111 No 160 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.77 E-value=1.3e-05 Score=47.33 Aligned_cols=216 Identities=10% Similarity=0.011 Sum_probs=129.2 Q ss_pred hhhhhhcccccccc-ceecchh-hhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccccc Q lcl|Aclame:pro 101 LEQRAMSGLTGEDG-GLVIPQD-IQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) Q Consensus 101 ~~~~a~~~~~~~~g-g~~iP~~-~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~ 178 (392) -........|-... ..+-|.. +...|++.+.+.++|+..+.........+.. ..+.++-+.++|..=+...+++ .. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~-~~vrt~LP~~~fR~lN~g~~~s-~~ 78 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHK-TTVRSGLPTGTWRKLNYGVQPE-KS 78 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccce-eeEEeccCCchhhccCCccCcc-cc Confidence 00000011111111 1122433 4457999999999999998887655554433 3455667888999888888765 68 Q ss_pred ceeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------- Q lcl|Aclame:pro 179 KFSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------------------- 236 (392) Q Consensus 179 ~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------------------- 236 (392) ++.+++...+-+.+.+.|.+.+.+... -++...-.+...+++.+.+...+++|..+.. T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:98 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999877632 2234445556777888888877776531100 Q ss_pred -------cc----------------------------------------------------------------------- Q lcl|Aclame:pro 237 -------KQ----------------------------------------------------------------------- 238 (392) Q Consensus 237 -------~~----------------------------------------------------------------------- 238 (392) ++ T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00 Q ss_pred ---------chhhHHHHHHHHHH---HhhhcccCCceEEEcHHHHHHHHHh-hccCCceeecccccCCcccceecccceE Q lcl|Aclame:pro 239 ---------AIKSLDDIKDVLNV---KLDPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) Q Consensus 239 ---------~~~~~d~~~~~~~~---~~~~~~~~~a~~v~~~~~~~~L~~l-kd~~g~~l~~~~~~~~~~~~~~g~~pv~ 305 (392) ......+++++|.. .+++....+.+|+||.+....|++. .+......+......+...+.|++.||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00011234444432 2345556778999999999999875 4443333344444455566777778887 Q ss_pred EecCcccccccccCCcceEE Q lcl|Aclame:pro 306 VVSNRFLKSKGTTAKKAPLI 325 (392) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~ 325 (392) .++.-. . .+..++ T Consensus 319 ~~dai~-~------tE~~Vv 331 (331) T protein:vir:98 319 RTDALL-L------TEARVV 331 (331) T ss_pred Eeeeee-c------CccccC Confidence 654321 1 111111 No 161 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=97.76 E-value=4.7e-06 Score=49.81 Aligned_cols=287 Identities=13% Similarity=0.067 Sum_probs=135.0 Q ss_pred hhhhhhhccccccccc---eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 100 DLEQRAMSGLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 100 ~~~~~a~~~~~~~~gg---~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) +.--...+....+++| .+--+.+.+++.......+.++++..++++.+++ .+.+++. +...+.+..-|.+. ..+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gk-S~qf~~l-G~s~a~y~~pG~~l-dg~ 77 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSP-AAT 77 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccc-eEEEEEe-eeeEEeeecCCCCc-CCC Confidence 0000001111111111 2234678888999999999999999999998754 4555543 55667777766653 333 Q ss_pred ccceeeEEechhhe-eeehhhHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhcc---------------ccc---- Q lcl|Aclame:pro 177 NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQN-ILKYVTKWLGKKSKVTRNVLILGVI---------------EKL---- 235 (392) Q Consensus 177 ~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~-l~~~v~~~l~~~~~~~~d~~~~~~~---------------~~~---- 235 (392) .+.-++..+....+ .....|..----+++++ +.+.+.+++.+++++..|+.++..+ +.. T Consensus 78 ~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~ 157 (400) T protein:vir:10 78 STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGF 157 (400) T ss_pred CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccccc Confidence 45555655555444 22233322111223466 6888888889999998888664311 000 Q ss_pred -------cccchhhHHHHHHHHH---HHhhhcccCC--ceEEEcHHHHHHHHHhh-ccCCceeec--ccccCCcccceec Q lcl|Aclame:pro 236 -------TKQAIKSLDDIKDVLN---VKLDPAISPN--AILLTNQDGFNYLDKLK-DKDGKYILQ--SDPTQKNKKLFAG 300 (392) Q Consensus 236 -------~~~~~~~~d~~~~~~~---~~~~~~~~~~--a~~v~~~~~~~~L~~lk-d~~g~~l~~--~~~~~~~~~~~~g 300 (392) ........+.+..++. ..+...+-+. -++++.|..|..|..-+ --|-.|... .++..+....+.| T Consensus 158 s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~G 237 (400) T protein:vir:10 158 SVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSYN 237 (400) T ss_pred ceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEec Confidence 0001112222322211 1233222222 36677777787774321 001111111 1112222334566 Q ss_pred ccceEEecCccccccc--------ccC--CcceEEEEehhhceee-eecc-ceEEEEeccchhhh---hcCceeEEEEEe Q lcl|Aclame:pro 301 TNPVVVVSNRFLKSKG--------TTA--KKAPLIIGDLKEAIVL-FKRE-DMELASTDVGGKAF---TRNTLDLRAIQR 365 (392) Q Consensus 301 ~~pv~~~~~~~~~~~~--------~~~--~~~~~~~Gd~~~~~~~-~~~~-~~~~~~~~~~~~~f---~~~~~~~~~~~r 365 (392) .+ |+. ++.+ |+.. +.+ +...-+-|||+....+ |.++ -.+++.-+-+...| ..-...+.+++- T Consensus 238 v~-Iv~-Sn~l-P~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a 314 (400) T protein:vir:10 238 CP-VIP-SNRF-PKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMS 314 (400) T ss_pred eE-EEe-eCcC-CcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHH Confidence 43 322 3333 3211 011 1111133666654222 2222 12233322222111 111233455566 Q ss_pred eCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 366 DDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 366 ~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +|..+.||+|...++.+-++-.+..-| T Consensus 315 ~G~g~~RPeaa~vv~~~~~~~~~~~~~ 341 (400) T protein:vir:10 315 EGAIPDRWEAVSVVTTKRQSTGAVDSG 341 (400) T ss_pred hCCcccchhheEEEEecCCcccccccC Confidence 899999999999998765433332222 No 162 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=97.71 E-value=4.4e-06 Score=49.94 Aligned_cols=216 Identities=8% Similarity=-0.073 Sum_probs=126.5 Q ss_pred hhhhhhccccccc-cceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccc Q lcl|Aclame:pro 101 LEQRAMSGLTGED-GGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPK 179 (392) Q Consensus 101 ~~~~a~~~~~~~~-gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 179 (392) -........|-.. +..+-|......|+|.+.+.++|+..+.........+... ...++-+.++|..=+...+++ ..+ T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~-~vrt~LP~~~fR~lN~g~~~s-~~t 78 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKT-TIRAGIPEPVWRRYNQGVQPT-KTQ 78 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccce-eEEEecCCchhhhcCCccccc-cce Confidence 0000001111111 1223344555679999999999998888865554444432 233556778898888777765 689 Q ss_pred eeeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------- Q lcl|Aclame:pro 180 FSNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-------------------- 237 (392) Q Consensus 180 ~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~-------------------- 237 (392) +.+++...+-+.+.+.|-+.+.+... -++...-.+...+++.+++...+++|..+..+ T Consensus 79 t~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~ 158 (335) T protein:vir:73 79 TVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAAS 158 (335) T ss_pred EEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCc Confidence 99999999999999999997766432 23455555567788888888877776211000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 238 -------------------------------------------------------------------------------- 237 (392) Q Consensus 238 -------------------------------------------------------------------------------- 237 (392) T Consensus 159 a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 159 AENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred ccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence Q ss_pred ----------cchhhHHHHHHHHHHHh-----hhcccCCceEEEcHHHHHHHHHh-hccCCceeecccccCCcccceecc Q lcl|Aclame:pro 238 ----------QAIKSLDDIKDVLNVKL-----DPAISPNAILLTNQDGFNYLDKL-KDKDGKYILQSDPTQKNKKLFAGT 301 (392) Q Consensus 238 ----------~~~~~~d~~~~~~~~~~-----~~~~~~~a~~v~~~~~~~~L~~l-kd~~g~~l~~~~~~~~~~~~~~g~ 301 (392) .......+|+++|...+ +.......+|+||++...+|++. ++..+.. +...-..+...+.|++ T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~-l~~~~~~g~~~t~~~g 317 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVN-LTIEEYGGKKIVSFLG 317 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCcee-eeeeccCCceeEEECC Confidence 00011245555554333 33233446899999999999874 4444433 3333345555566777 Q ss_pred cceEEecCcccccccccCCcceEEEE Q lcl|Aclame:pro 302 NPVVVVSNRFLKSKGTTAKKAPLIIG 327 (392) Q Consensus 302 ~pv~~~~~~~~~~~~~~~~~~~~~~G 327 (392) .||..++.-. .+ +. .++. T Consensus 318 ipir~~Dail-~t------E~-~v~~ 335 (335) T protein:vir:73 318 IPIRRVDAIL-NT------ES-AVTA 335 (335) T ss_pred eEEEEEeeee-cC------cc-cccC Confidence 7887654211 11 11 1111 No 163 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=97.71 E-value=6.9e-06 Score=48.88 Aligned_cols=283 Identities=9% Similarity=0.010 Sum_probs=134.5 Q ss_pred cccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecc--hhhhhHHHHhHHhhhhhhhhcceee- Q lcl|Aclame:pro 68 RNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEP- 144 (392) Q Consensus 68 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP--~~~~~~ii~~~~~~~~l~~l~~~~~- 144 (392) .....+.+ .........+. ........|.+++. +.+.+.|++.....-.-+.++.+.. T Consensus 1 ~~~~~~~~------------------~~~~~~~~~~~-~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~ 61 (314) T protein:vir:10 1 MAIKFDAE------------------QAKITTHLEQM-GVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNE 61 (314) T ss_pred CccchHHH------------------HHHHHHHHHhh-cccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccC Confidence 00000000 00011111110 11111222334433 2455567776555555555544432 Q ss_pred ccCCcceeEEEeecCCccccccccccc-cccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHHHHHHHHHH Q lcl|Aclame:pro 145 VRTRSGSRVLEKNSDMIPFAEITEMGE-IPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYVTKWLGKKS 220 (392) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~~l~~~~ 220 (392) ++...-.+.+......+.+.|++..+. .|- .+..+.+.....+.++..+.+|..-++.+. .+|.+--....+.++ T Consensus 62 ~~~~~et~~~~~~e~~G~a~~~~d~~~dip~-vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~ 140 (314) T protein:vir:10 62 IPGHAKYFEYPEFDGVGIAQIIADYSDDLPL-VDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAH 140 (314) T ss_pred CCCceeEEEeeeeccccceeeeCCcccccce-eecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHH Confidence 111111233333444456678887655 443 345677888888888888888876666542 357777777888888 Q ss_pred HHHHHHHHhhccccc---------------cccchhh----HHHHHHHHHHHhh--hcccCCceEEEcHHHHHHHHHhhc Q lcl|Aclame:pro 221 KVTRNVLILGVIEKL---------------TKQAIKS----LDDIKDVLNVKLD--PAISPNAILLTNQDGFNYLDKLKD 279 (392) Q Consensus 221 ~~~~d~~~~~~~~~~---------------~~~~~~~----~d~~~~~~~~~~~--~~~~~~a~~v~~~~~~~~L~~lkd 279 (392) ...+|+.++.|.... ..+.+.+ ++++..++..... .....+..++++|+.+..|...-+ T Consensus 141 ~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~ 220 (314) T protein:vir:10 141 DNLLDKLVWSGSAPHGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVP 220 (314) T ss_pred HHhhceEEEeecccccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhccccc Confidence 888888777764321 1112233 4445444433322 133344578999999988865444 Q ss_pred cCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEeh-hhceeeeeccceEEEEeccchhhhhcCce Q lcl|Aclame:pro 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL-KEAIVLFKREDMELASTDVGGKAFTRNTL 358 (392) Q Consensus 280 ~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~-~~~~~~~~~~~~~~~~~~~~~~~f~~~~~ 358 (392) ..|.-++.- +... ++++.+..-+.+.+.+. .+...+++-+- .+.+.+.....++ ..+.... .-.+ T Consensus 221 ~~~~tvl~~-l~~n-------~~~l~I~~~~el~~ag~-~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~e~~---~~~~ 286 (314) T protein:vir:10 221 QTNLSYGEL-FTRN-------NPGLTIRFLQFLDNYDG-AGGKAALAFEKSPLNMSIEIPEVTN--VLPAQPK---DLHF 286 (314) T ss_pred CCCccHHHH-HHHh-------CCCcEEEEcccccccCC-CcceEEEEEecCCcEEEEecCccce--eecceec---CceE Confidence 444332211 1111 13333333333333222 22333332221 1222222222322 2221111 1123 Q ss_pred eEEEEEeeC-cEEecccceEEEE-eccc Q lcl|Aclame:pro 359 DLRAIQRDD-VQMWDNEAAVYGE-IDLS 384 (392) Q Consensus 359 ~~~~~~r~~-~~v~~~~af~~l~-~~~~ 384 (392) .+....|++ ..+.+|.||++++ ++-+ T Consensus 287 ~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 287 RYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEcceeeeEEEEEECcceeEeeeeeecC Confidence 444567775 5677899999886 5555 No 164 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=97.70 E-value=2.8e-06 Score=51.06 Aligned_cols=262 Identities=10% Similarity=-0.012 Sum_probs=128.9 Q ss_pred hccccccccceec-ch--hhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccccccee- Q lcl|Aclame:pro 106 MSGLTGEDGGLVI-PQ--DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFS- 181 (392) Q Consensus 106 ~~~~~~~~gg~~i-P~--~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~- 181 (392) |..........+. |. ++.+.+-..+..-..+++..+..|+..++ ++.+|+......+.-|+||.++|- +..+-. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~-tIt~pK~~~tgda~dVaEGe~Ipl-skvt~~~ 78 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDL-KIQTYKWEVTLDQTDPGEGETIPL-SKVTRTK 78 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCC-eEEeeeeeeecccccccCCcccch-hhheeee Confidence 1111112222233 22 23333433344444455555777776543 556666666667788999998884 456654 Q ss_pred --eEEechhheeeehhhHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccchh-----hHHHHHHHHHHH Q lcl|Aclame:pro 182 --NVQYAVKDRAGILPLSRSLLQDSDQ-NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIK-----SLDDIKDVLNVK 253 (392) Q Consensus 182 --~v~~~~~~i~~~~~iS~e~l~ds~~-~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~~~~-----~~d~~~~~~~~~ 253 (392) ..+++.+|.+..+ |.|.++.+.+ +-...-.++|...+...++..++..+.+++..... .+..+...+.. T Consensus 79 ~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg~~lq~a~a~~~~al~~- 155 (295) T protein:vir:99 79 DKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKGVGLQKALSASWAKLAT- 155 (295) T ss_pred eeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeehhhHHHHHHHhhhhhhh- Confidence 3666777777754 9999864433 34566778888899999999988887665443221 12222222221 Q ss_pred hhhcccCCceEEEcHHHHHHHHHhhccC--CceeecccccCCcccceecccceEEecCccccc--ccccCCcceEEEEeh Q lcl|Aclame:pro 254 LDPAISPNAILLTNQDGFNYLDKLKDKD--GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS--KGTTAKKAPLIIGDL 329 (392) Q Consensus 254 ~~~~~~~~a~~v~~~~~~~~L~~lkd~~--g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~--~~~~~~~~~~~~Gd~ 329 (392) ....+..+.+.++||.+...+++-..-+ ..-.|..++.. .++|...|++ +..++.- ..+..++-.+.+-+. T Consensus 156 f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~----nfLG~q~II~-S~kv~~G~~~aT~~~Ni~~ay~~~ 230 (295) T protein:vir:99 156 FNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLK----NFLGMQNVIV-MPSVPEGKIYSTAVENLVFASLNV 230 (295) T ss_pred cccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhh----hhhccceEEE-cccCCCceEEEeeccceEEEEecC Confidence 1223345668999999999886533221 11112222111 2566543333 2222111 112222222222222 Q ss_pred hhceeeeeccceEEEEeccchhhhhcCceeEEEEEe----------------eCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQR----------------DDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r----------------~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +. +++.=. -+|..|++.+.+..+ .-+-+-+++++++.++++..+ |-.| T Consensus 231 ~~-------g~l~~~------f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~--~~~~ 294 (295) T protein:vir:99 231 KG-------GDLGGL------FADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAV--PGIG 294 (295) T ss_pred Cc-------hhhhhh------hhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcC--CCCC Confidence 10 111100 011122222222221 112345678888888864333 2333 No 165 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.42 E-value=4.3e-05 Score=44.54 Aligned_cols=265 Identities=9% Similarity=-0.038 Sum_probs=135.3 Q ss_pred hhhccccc-cccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceee Q lcl|Aclame:pro 104 RAMSGLTG-EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSN 182 (392) Q Consensus 104 ~a~~~~~~-~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 182 (392) .+....+- +.-.+..=.++.+.|...-....|+.+++.....++....+.. .....+...-..||.+.+... ..-.. T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~-d~l~~~~~~~~~EG~da~~~~-~~~r~ 78 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQT-DELRQPGKNTRVEGEDATIKA-GSFTT 78 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEe-eecCCccccccccCccccccc-ccCCE Confidence 11111111 1111222346777888887888999999887666544333221 223333345566777655432 12122 Q ss_pred EEec-hhheeeehhhHHHHHhhhHHHHHHHHHHHH---HHHHHHHHHHHHhhcccc-----ccc---------------- Q lcl|Aclame:pro 183 VQYA-VKDRAGILPLSRSLLQDSDQNILKYVTKWL---GKKSKVTRNVLILGVIEK-----LTK---------------- 237 (392) Q Consensus 183 v~~~-~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l---~~~~~~~~d~~~~~~~~~-----~~~---------------- 237 (392) ..-+ ..-+...+.||..+..-...+....+..++ ...+.+.++..+++|... .+. T Consensus 79 ~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~ 158 (317) T protein:vir:88 79 MLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGS 158 (317) T ss_pred EeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCce Confidence 2222 223345556665554433333223333333 334566777777776421 000 Q ss_pred --------------------cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCC---- Q lcl|Aclame:pro 238 --------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQK---- 293 (392) Q Consensus 238 --------------------~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~---- 293 (392) ....+-+++.+++...-+... ....++++|.....|.++-..++.++..+..... T Consensus 159 ~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg-~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~ 237 (317) T protein:vir:88 159 LGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGG-QANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQT 237 (317) T ss_pred eccCccccccCCCccccccccccccHHHHHHHHHHHHhcCC-CCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEE Confidence 001234555666655555554 3345789999999999885444545533211110 Q ss_pred --cccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEe Q lcl|Aclame:pro 294 --NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) Q Consensus 294 --~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~ 371 (392) .-.+-+| .|.++-++.+|. ..+++.|+... .+..-.++..+....++ +.....++..+++.+. T Consensus 238 v~~~~tdfG--~v~ii~~r~lp~-------~~~~~~D~~~~-~l~~Lr~~~~e~laKtG-----d~~k~~i~~E~tLe~~ 302 (317) T protein:vir:88 238 VDVYESDFG--KYTIRANRWFHE-------NTLFVFDPKMH-SLCYLRPFFQHELAKTG-----DSEKRQLLVEYTFRVN 302 (317) T ss_pred EEEEEeCCe--EEEEEeCCCCCC-------CeEEEEccccc-ceeecccceeeccCCCc-----ccceeEEEEEEEEEEc Confidence 0011233 244555666553 35788898753 23222444444443333 3345667788999999 Q ss_pred cccceEEEEecccCC Q lcl|Aclame:pro 372 DNEAAVYGEIDLSAP 386 (392) Q Consensus 372 ~~~af~~l~~~~~a~ 386 (392) +|+|..++.--++.- T Consensus 303 N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 303 NEKSGALIRDVVAQL 317 (317) T ss_pred CccceeEEEEecccC Confidence 999999888433333 No 166 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.42 E-value=2.2e-05 Score=46.11 Aligned_cols=286 Identities=13% Similarity=0.079 Sum_probs=131.0 Q ss_pred hhhhhhhccccccccc---eecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccccccccccccc Q lcl|Aclame:pro 100 DLEQRAMSGLTGEDGG---LVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETD 176 (392) Q Consensus 100 ~~~~~a~~~~~~~~gg---~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 176 (392) +.--...+....+++| .+--+.+.+++.......+.++++..++++.+++ .+.+++. +...+.+..-|+.. ... T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gk-S~qf~~~-G~s~~~~~~pG~~l-d~~ 77 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSP-AAT 77 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccc-eEEEEEe-eeeEeeeecCCCCc-CCC Confidence 0000001111111111 2234678888889998999999999999998754 4555543 45566666665543 233 Q ss_pred ccceeeEEechhhe-eeehhhHHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------c----ccc------- Q lcl|Aclame:pro 177 NPKFSNVQYAVKDR-AGILPLSRSLLQDSDQN-ILKYVTKWLGKKSKVTRNVLILGVIE------K----LTK------- 237 (392) Q Consensus 177 ~~~~~~v~~~~~~i-~~~~~iS~e~l~ds~~~-l~~~v~~~l~~~~~~~~d~~~~~~~~------~----~~~------- 237 (392) .+.-++..+...++ .....|.+-=--+++++ +.+-+.+++.+++++.+|+.++..+- + ..+ T Consensus 78 ~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~ 157 (401) T protein:vir:70 78 STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGF 157 (401) T ss_pred CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCce Confidence 45555655554443 12222221111123455 67788888888888888876533220 0 000 Q ss_pred ---------cchh----hHHHHHHHHHHHhhhcccCCc--eEEEcHHHHHHHHHhhc-cCCceeec--ccccCCccccee Q lcl|Aclame:pro 238 ---------QAIK----SLDDIKDVLNVKLDPAISPNA--ILLTNQDGFNYLDKLKD-KDGKYILQ--SDPTQKNKKLFA 299 (392) Q Consensus 238 ---------~~~~----~~d~~~~~~~~~~~~~~~~~a--~~v~~~~~~~~L~~lkd-~~g~~l~~--~~~~~~~~~~~~ 299 (392) .... ..+.+.++.. .++..+-+.+ ++++.|..|..|..-.. -|-.|-.. ..+..+....+. T Consensus 158 ~i~v~~~~~~~~~~~~~l~~ai~dA~~-~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~va 236 (401) T protein:vir:70 158 SINVEVAEGEALVNPQYVMAAVEFALE-QQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSY 236 (401) T ss_pred EEeccccccccccCHHHHHHHHHHHHH-HHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEEe Confidence 0011 1223333332 3443333333 56667777766643210 01011000 111223334466 Q ss_pred cccceEEecCccccccc--------ccC--CcceEEEEehhhcee-eeeccc-eEEEEeccchhhhh---cCceeEEEEE Q lcl|Aclame:pro 300 GTNPVVVVSNRFLKSKG--------TTA--KKAPLIIGDLKEAIV-LFKRED-MELASTDVGGKAFT---RNTLDLRAIQ 364 (392) Q Consensus 300 g~~pv~~~~~~~~~~~~--------~~~--~~~~~~~Gd~~~~~~-~~~~~~-~~~~~~~~~~~~f~---~~~~~~~~~~ 364 (392) |.+ |+. ++.+ |..+ +.+ +...-+-|||+.... +|.++- .+++.-+-+...|. .-...+-+.+ T Consensus 237 Gv~-Vv~-Snnl-P~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~ 313 (401) T protein:vir:70 237 NCP-VIP-SNRF-PKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFM 313 (401) T ss_pred ceE-EEe-eccc-cccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHH Confidence 653 332 2333 2211 011 111112366665422 222221 22333222211110 1112334566 Q ss_pred eeCcEEecccceEEEEeccc----CCCCCCCC Q lcl|Aclame:pro 365 RDDVQMWDNEAAVYGEIDLS----APVEQPQG 392 (392) Q Consensus 365 r~~~~v~~~~af~~l~~~~~----a~~~~~~~ 392 (392) -+|..+.||+|...++.+-+ ++..++-. T Consensus 314 a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~~ 345 (401) T protein:vir:70 314 AEGAIPDRWEAVSVVTTKRNTTTGAVEGTDGA 345 (401) T ss_pred HhCCcccchhheEEEeecCcccccccccCCcc Confidence 68999999999988865544 22222211 No 167 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=97.39 E-value=5.1e-05 Score=44.11 Aligned_cols=287 Identities=9% Similarity=0.007 Sum_probs=138.5 Q ss_pred Hhcchhh----HHHHHH-HHhhhhhhhhccccccccceecc---hhhhhHHHHhHHhhhhhhhhcceee-ccCCcceeEE Q lcl|Aclame:pro 84 LRNKPLN----AEEREF-LEDDLEQRAMSGLTGEDGGLVIP---QDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVL 154 (392) Q Consensus 84 ~~~~~~~----~~~~~~-~~~~~~~~a~~~~~~~~gg~~iP---~~~~~~ii~~~~~~~~l~~l~~~~~-~~~~~~~~~~ 154 (392) +|+.... ..+... ......+......+...++.++- +.+.+.|++........+.++.+.. ++-..-.+.+ T Consensus 1 ~~~~~~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~ 80 (329) T protein:vir:79 1 MRGNIMSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEY 80 (329) T ss_pred CccchhhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEe Confidence 2221111 000000 00111111111111122233333 3456678887777777777766543 2222223334 Q ss_pred EeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 155 EKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYVTKWLGKKSKVTRNVLILGV 231 (392) Q Consensus 155 ~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~~l~~~~~~~~d~~~~~~ 231 (392) ......+.+.|.+..+..-+..+..+.+-....+.++..+.++..-++.+. .+|..--....+.++...+|+.+++| T Consensus 81 ~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 160 (329) T protein:vir:79 81 QTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKG 160 (329) T ss_pred eeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEee Confidence 444445667788775432222335566666777778888888766665543 35777777788888888888887777 Q ss_pred ccccc--------------cc-----------chhhHHHHHHHHHHHhhh--cccCCceEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 232 IEKLT--------------KQ-----------AIKSLDDIKDVLNVKLDP--AISPNAILLTNQDGFNYLDKLKDKDGKY 284 (392) Q Consensus 232 ~~~~~--------------~~-----------~~~~~d~~~~~~~~~~~~--~~~~~a~~v~~~~~~~~L~~lkd~~g~~ 284 (392) ....+ .+ ....++++..++...... ....+..++++|+.+..|.......|.- T Consensus 161 ~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~t 240 (329) T protein:vir:79 161 SKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMS 240 (329) T ss_pred cccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCcc Confidence 43211 00 111244555554433221 2234457999999999996555555543 Q ss_pred eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhh-ceeeeeccceEEEEeccchhhhhcCceeEEEE Q lcl|Aclame:pro 285 ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE-AIVLFKREDMELASTDVGGKAFTRNTLDLRAI 363 (392) Q Consensus 285 l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~-~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~ 363 (392) ++.- +... ++++.+...+.+... ...+...+++.+-+. .+.+.....++ +.+.... .-.+.+..+ T Consensus 241 vl~~-lk~~-------~~~l~I~~~~el~~a-g~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~q~~---~~~~~v~~~ 306 (329) T protein:vir:79 241 YLDY-FKQQ-------NGGITIESISELEDI-DGAGTKAALVYEKDPMNMSIEIPEAFN--MLTAQPK---DLHFKVPCT 306 (329) T ss_pred HHHH-HHHh-------CCCcEEEEccccccc-CCCCceEEEEEecCCceEEEecCccee--eeeceec---CceEEEcee Confidence 3321 1111 123333332222221 223344444433332 22222223333 2221111 112344566 Q ss_pred EeeCc-EEecccceEEEE-eccc Q lcl|Aclame:pro 364 QRDDV-QMWDNEAAVYGE-IDLS 384 (392) Q Consensus 364 ~r~~~-~v~~~~af~~l~-~~~~ 384 (392) .|+++ .+.+|.||++++ +-.. T Consensus 307 ~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 307 SKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred eeEEEEEEECcceeeeeeeeeeC Confidence 77764 677899999876 2222 No 168 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=97.22 E-value=0.00013 Score=41.95 Aligned_cols=352 Identities=13% Similarity=0.126 Sum_probs=151.5 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHHhhh----------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MSK-ELRELLAKLEGKKEEVRSLMGE----------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) Q Consensus 1 M~k-el~el~~~~~~~~~e~~~~~~~----------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) |+| .|.|+.+.+...+++.-++-.+ +++-..+.+.+-+..+.-+|...+. +.. ......+. ... T Consensus 8 ~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en--~LN-a~~E~~KG--K~k 82 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIEN--ELN-AQEEKPKG--KDK 82 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhh--hhh-hhhhhhhh--hHH Confidence 776 4666666666666555443221 1122233333333333333322111 110 00000000 000 Q ss_pred cchhhHHHHH---HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 70 VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 70 ~~~~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ....-+-.++ |...+.....+.+.+......+..++. +.++-...+|..+...|-..+..+.++++...+..++ T Consensus 83 Mt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GV---tiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~ 159 (400) T protein:vir:93 83 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGV---TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) T ss_pred HHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCc---ceeccchhccHHHHHHHHHhhhccCcceeeeeeccch Confidence 0011112223 333333333333444444444433333 3344456689888888999999999998765554443 Q ss_pred CCcceeEEEee-cCCccccccccccccccccccceeeEEechhheeeehhhHH--HHHhhhHHHHHHHHHHHHHHHHH-H Q lcl|Aclame:pro 147 TRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSR--SLLQDSDQNILKYVTKWLGKKSK-V 222 (392) Q Consensus 147 ~~~~~~~~~~~-~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~--e~l~ds~~~l~~~v~~~l~~~~~-~ 222 (392) - +...+. .+...+.-.-.|..+++. ..+|.--++.+.-++....+-. .-+..+-..|..|+..+|+.++. + T Consensus 160 ~----~~V~~s~~s~~~Aq~HkdGqTK~eq-a~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk 234 (400) T protein:vir:93 160 A----LLVSRSFDSANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 234 (400) T ss_pred h----hhHHhhhhhhhhhhhhccCCccccc-eeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 2 211111 112233333334444443 3555555555544443333311 12333444678999999999998 7 Q ss_pred HHHHHHhhccccccccchhhHH----------------------HHHHHHHHHhhhcccCCceEEEcHHHH-HHHHHhhc Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQAIKSLD----------------------DIKDVLNVKLDPAISPNAILLTNQDGF-NYLDKLKD 279 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~~~~~~d----------------------~~~~~~~~~~~~~~~~~a~~v~~~~~~-~~L~~lkd 279 (392) ..|.++.-|.|+++-....... .+..+.....+.+.+ ..++....+. +-|..|+. T Consensus 235 ~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagr--rylivktedrkalldelrq 312 (400) T protein:vir:93 235 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKTEDRKALLDELRQ 312 (400) T ss_pred HHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc--eEEEEeccchHHHHHHHHh Confidence 8899999998887743332222 222222211122211 1234444443 33444544 Q ss_pred cCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCcee Q lcl|Aclame:pro 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) Q Consensus 280 ~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) +..+.-............-.|...+++. .+..+-...+ +.|-+ |.+-+ ..++ .. ..-.|.+|... T Consensus 313 atanahvriknddaeiasevgvdeiivy-------tgskalkptv-lvdqk--yhidm-qdlt--kv--dafewktnsnm 377 (400) T protein:vir:93 313 ATANAHVRIKNDDAEIASEVGVDEIIVY-------TGSKALKPTV-LVDQK--YHIDM-QDLT--KV--DAFEWKTNSNM 377 (400) T ss_pred hccccceEeecchhhhhhhcCcceeeee-------ecccccccee-eeccc--cccch-hhhh--hh--hhheeccCCce Confidence 3322111100001111112232222221 1112222222 22332 22221 1221 11 11234566666 Q ss_pred EEEEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 360 LRAIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 360 ~~~~~r~~~~v~~~~af~~l~~~ 382 (392) +.++.-..+.|---+|-+++++. T Consensus 378 ilvetltsghvetynagavitvs 400 (400) T protein:vir:93 378 ILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred EEEeecccCcceeeccceeEeeC Confidence 66666666655555555555554 No 169 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=97.01 E-value=0.00016 Score=41.42 Aligned_cols=349 Identities=14% Similarity=0.140 Sum_probs=151.1 Q ss_pred CCH-HHHHHHHHHHHHHHHH---HHHhhh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MSK-ELRELLAKLEGKKEEV---RSLMGE-------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) Q Consensus 1 M~k-el~el~~~~~~~~~e~---~~~~~~-------~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) |+| .|-|...++.++++.- ++.+.. +++-..+.+++-+..+.-+|...+ .+.... .. .+..... T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~e--n~LN~~-eE--~~KGK~k 75 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIE--NELNAQ-EE--KPKGKDK 75 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhh--hhhhhh-hh--cchhhHH Confidence 987 4667777766665532 222211 122233444444444433333221 111111 00 0000000 Q ss_pred cchhhHHHHH---HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 70 VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 70 ~~~~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ....-+-.++ |...+.....+.+.+......+..++. +.++-...+|..+.-.|-..+..+.++++...+..++ T Consensus 76 Mt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GV---tiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~ 152 (393) T protein:vir:16 76 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGV---TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 152 (393) T ss_pred HHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCc---ceeccchhccHHHHHHHHHhhhccCcceeeeeeccch Confidence 1011122223 333333333333444444444443333 3344456689888888999999999998765554443 Q ss_pred CCcceeEEEee-cCCccccccccccccccccccceeeEEechhheeeehhhHH--HHHhhhHHHHHHHHHHHHHHHHH-H Q lcl|Aclame:pro 147 TRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSR--SLLQDSDQNILKYVTKWLGKKSK-V 222 (392) Q Consensus 147 ~~~~~~~~~~~-~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~--e~l~ds~~~l~~~v~~~l~~~~~-~ 222 (392) - +...+. .+...+.-.-.|..+++. ..+|.--++.+.-++....+-. .-+..+-..|..|+..+|+.++. + T Consensus 153 ~----~~V~~s~~s~~eAq~HkdGqTK~eq-a~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk 227 (393) T protein:vir:16 153 A----LLVSRSFDSANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 227 (393) T ss_pred h----hhHHhhhhhhhhhhhhccCCccccc-eeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 2 211111 112233333334444443 3455555555544443333311 12333444678999999999998 8 Q ss_pred HHHHHHhhccccccccchhhH----------------------HHHHHHHHHHhhhcccCCceEEEcHHHHH-HHHHhhc Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQAIKSL----------------------DDIKDVLNVKLDPAISPNAILLTNQDGFN-YLDKLKD 279 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~~~~~~----------------------d~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~L~~lkd 279 (392) ..+.++.-|.|+++-...... |.+..+.....+.+.+ ..++....+.. -|..|+. T Consensus 228 ~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagr--rylivktedrkalldelrq 305 (393) T protein:vir:16 228 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKTEDRKALLDELRQ 305 (393) T ss_pred HHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc--eEEEEeccchHHHHHHHHh Confidence 889999999888774333222 2222222211122211 12344443332 3344443 Q ss_pred cCC--c-eeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcC Q lcl|Aclame:pro 280 KDG--K-YILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRN 356 (392) Q Consensus 280 ~~g--~-~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~ 356 (392) +.. + .|-.. .+. ...-.|...+++. .+..+-...+ +.|-+ |.+-+ ..++ .. ..-.|.+| T Consensus 306 atananvriknd-dte--iasevgvdeiivy-------tgskalkptv-lvdqk--yhidm-qdlt--kv--dafewktn 367 (393) T protein:vir:16 306 ATANANVRIKND-DTE--IASEVGVDEIIVY-------TGSKALKPTV-LVDQK--YHIDM-QDLT--KV--DAFEWKTN 367 (393) T ss_pred hhccCceeeecc-chh--hhhhcCcceeeee-------ecccccccee-eeccc--cccch-hhhh--hh--hhheeccC Confidence 322 1 12111 111 1112232222221 1122222222 22332 22221 1221 11 11234566 Q ss_pred ceeEEEEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 357 TLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 357 ~~~~~~~~r~~~~v~~~~af~~l~~~ 382 (392) ...+.++.-..+.|---+|-+++++. T Consensus 368 snmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 368 SNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred CceEEEeecccCcceeeccceeEeeC Confidence 66666666666655555555555554 No 170 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=96.88 E-value=0.00028 Score=40.06 Aligned_cols=289 Identities=12% Similarity=0.112 Sum_probs=138.1 Q ss_pred hhHHHHHHHHhhhhhhh-hcccc--ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRA-MSGLT--GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a-~~~~~--~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (392) .....+.........-+ ..... ..+..+.|-+.+...+.+.+.+.+-+++++++++|....|...-. ..+++-++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~l-gv~g~iagr 79 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGV-GVTGTIAST 79 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEee-ccCcceeec Confidence 11111111111111111 11111 122345566677788999999999999999999999888886544 333333333 Q ss_pred cccc--cccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Q lcl|Aclame:pro 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----- 236 (392) Q Consensus 166 ~~E~--~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~----- 236 (392) +... .+....+...++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+. T Consensus 80 tdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~ 159 (355) T protein:vir:18 80 TDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRV 159 (355) T ss_pred cccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChh Confidence 3211 11111222234444555555555555666666653 25788888888888776644333333322110 Q ss_pred --------------------------------------------ccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHH Q lcl|Aclame:pro 237 --------------------------------------------KQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQD 269 (392) Q Consensus 237 --------------------------------------------~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~ 269 (392) .+...+.|.++. +++..+++.++.. -+.|+.+. T Consensus 160 ~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~d 239 (355) T protein:vir:18 160 KNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRK 239 (355) T ss_pred hCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 111233455554 3444567777654 58888887 Q ss_pred HHH--HHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEec Q lcl|Aclame:pro 270 GFN--YLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) Q Consensus 270 ~~~--~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~ 347 (392) .++ ++..+. ..+.|--.-....-.....+|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+ T Consensus 240 Lla~k~~~l~n-~~~~ptE~~Aa~~i~s~k~iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d 309 (355) T protein:vir:18 240 LLADKYFPLVN-KQQENTESLAADIIISQKRIGNLPAVRV--PYFPAN-------AVFVTTLENLSIYFMDESHRRSIDE 309 (355) T ss_pred hhHHHHhHHhh-ccCChHHHHHHHHHHHHHhhCCceeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEe Confidence 654 233332 2233211000000000123455565543 455543 3677777775444455555444433 Q ss_pred cchhhhhcCceeEEEEEeeCcEEecccceEEE---EecccCCCCCCCC Q lcl|Aclame:pro 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG---EIDLSAPVEQPQG 392 (392) Q Consensus 348 ~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l---~~~~~a~~~~~~~ 392 (392) ... ++.+--.=..--|..|-++.+++.+ ++..++++..|+| T Consensus 310 ~p~----r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~ 353 (355) T protein:vir:18 310 NPK----KDRVENYESMNIDYVVEAYAAGCLLENITLGDFTAPAAPEG 353 (355) T ss_pred ccc----cccccchhhhcceeeeeccccEEEEeeeeecCCCCcccccC Confidence 321 2222222222234455566655554 3333333333333 No 171 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=96.26 E-value=0.00081 Score=37.53 Aligned_cols=287 Identities=12% Similarity=0.139 Sum_probs=137.3 Q ss_pred hhHHHHHHHHhhhhhhh-hcccc--ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRA-MSGLT--GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a-~~~~~--~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (392) .....+.........-+ ..... .....+.|-+.+...+.+.+.+.+-+++++++++|....|...-. ..+++-++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~l-gv~g~iagr 79 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV-GVTGTIAST 79 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeee-ccCcccccc Confidence 11111111111111111 11111 122345566667778899999999999999999999888886544 333333333 Q ss_pred cccc--cccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------ Q lcl|Aclame:pro 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKL------ 235 (392) Q Consensus 166 ~~E~--~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~------ 235 (392) +..+ .+....+...++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 tdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~ 159 (355) T protein:vir:98 80 TDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRT 159 (355) T ss_pred ccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChh Confidence 3221 11111222234444455555544555666666643 2578888888888877664433333332211 Q ss_pred -------------------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHH Q lcl|Aclame:pro 236 -------------------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQD 269 (392) Q Consensus 236 -------------------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~ 269 (392) ..+...+.|.++. +++..+++.++.. -+.|+.+. T Consensus 160 ~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~d 239 (355) T protein:vir:98 160 KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) T ss_pred hCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 0111234455554 3444567777654 58888888 Q ss_pred HHH--HHHHhhccCCceee--cccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEE Q lcl|Aclame:pro 270 GFN--YLDKLKDKDGKYIL--QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELAS 345 (392) Q Consensus 270 ~~~--~L~~lkd~~g~~l~--~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~ 345 (392) .++ ++..+. ....|-- -.... .... .+|+.|.+.+ +++|.. .+++--|++....+.++..+=.+ T Consensus 240 Lla~k~~~l~n-~~~~ptE~~Aa~~i-~s~k-~iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~ 307 (355) T protein:vir:98 240 LLADKYFPLVN-KQQENSESLAADII-ISQK-RIGNLPAVRV--PYFPAN-------AVLVTTLENLSIYFMDESHRRSI 307 (355) T ss_pred hhHHHhhhHhh-ccCCcHHHHHHHHH-HHhh-hhCCceeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEE Confidence 654 233333 2222200 00000 0012 3455555543 455543 36777777754444555554444 Q ss_pred eccchhhhhcCceeEEEEEeeCcEEecccceEEEE---ec-ccCCCCCCCC Q lcl|Aclame:pro 346 TDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE---ID-LSAPVEQPQG 392 (392) Q Consensus 346 ~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~---~~-~~a~~~~~~~ 392 (392) -+... ++.+--.=..--|..|-++.+++.++ +. +.+|+++.+| T Consensus 308 ~d~p~----r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~ 354 (355) T protein:vir:98 308 DENPK----KDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESG 354 (355) T ss_pred Eeccc----cccccchhhhcceeeeeccccEEEeeceeeeCCCCCcccccC Confidence 33321 22222222222344555555555443 22 2344444444 No 172 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=96.16 E-value=0.00093 Score=37.21 Aligned_cols=268 Identities=10% Similarity=0.026 Sum_probs=94.8 Q ss_pred chhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcc-------eeeccCCcceeEEEeecC Q lcl|Aclame:pro 87 KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVT-------VEPVRTRSGSRVLEKNSD 159 (392) Q Consensus 87 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~-------~~~~~~~~~~~~~~~~~~ 159 (392) ..+ ... -+ -.+.+....++.+.+....++... ..++.+.-..+++..... T Consensus 1 m~l-----------------sD~-----~v-fN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~ 57 (325) T protein:vir:95 1 MAL-----------------SDL-----AV-YSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVT 57 (325) T ss_pred Cch-----------------hhh-----hh-hhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeecccccccc Confidence 000 000 00 011111222222332222222211 111112112222222111 Q ss_pred Cc--cccccccccccccccccceeeEEechhheeeehhhHHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-- Q lcl|Aclame:pro 160 MI--PFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLL---QDSDQNILKYVTKWLGKKSKVTRNVLILGVI-- 232 (392) Q Consensus 160 ~~--~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l---~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~-- 232 (392) +. ...-+.+.+.......-+..++.+....=.++.....+.+ .+..-.+...|.+.+++...+.+-+.++.++ T Consensus 58 g~~~~~~~~~~~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~ 137 (325) T protein:vir:95 58 GGLVRRRNAYGSGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYS 137 (325) T ss_pred ccccccccCCCCceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1111222222222211233444443333233222222221 1111222233333333332222211111111 Q ss_pred --ccc--------c----ccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccce Q lcl|Aclame:pro 233 --EKL--------T----KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLF 298 (392) Q Consensus 233 --~~~--------~----~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~ 298 (392) +.. + .....++..+.++..+. -.....-+.|+||+.++..|.+.+-.+...++..+.... -.+. T Consensus 138 a~~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~kl-GD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~-i~t~ 215 (325) T protein:vir:95 138 ALSQVSDVVYDATANTDAADKLPTWNNLNNGQAKF-GDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNV-VRDP 215 (325) T ss_pred hhcccccceeeeecccCcccccccHHHHHHHHHHh-cccccceeEEEEchHHHHHHHHhhccccccccccCCccc-cccc Confidence 110 0 11123456777776653 333344468999999999998876665544444332222 2356 Q ss_pred ecccceEEecCcccccccccC-Cc-ceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccce Q lcl|Aclame:pro 299 AGTNPVVVVSNRFLKSKGTTA-KK-APLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAA 376 (392) Q Consensus 299 ~g~~pv~~~~~~~~~~~~~~~-~~-~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af 376 (392) +| ++|+|.++ +|..+..+ +. ..++||. .++.+.+..+......+.... .+-...++.+.. -++||.++ T Consensus 216 ~G-~~VIVdD~--~p~~~~g~~~~ytty~lg~--GAi~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~t---f~lhp~G~ 285 (325) T protein:vir:95 216 FG-KLLVMTDS--PNLFAAGTPNVYHILGLVP--GGVLIGQNNDFDANEETKNGD--ENIIRTYQAEWS---YNIGVKGF 285 (325) T ss_pred CC-cEEEEeCC--CCCCCccCceeEEEEEEec--CeEEecCCCCccccccccCcc--cceeeeeeeeee---EEeeccee Confidence 66 45666543 44332222 12 2344443 233333333333322222111 122233332221 46688888 Q ss_pred EEEEecccCCCCCCCC Q lcl|Aclame:pro 377 VYGEIDLSAPVEQPQG 392 (392) Q Consensus 377 ~~l~~~~~a~~~~~~~ 392 (392) ..-+ +....+|-- T Consensus 286 sw~~---s~~g~sPt~ 298 (325) T protein:vir:95 286 AWDK---ANGGKSPTD 298 (325) T ss_pred eeec---ccccCCcCh Confidence 7622 221122332 No 173 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=289 Identities=11% Similarity=0.068 Sum_probs=138.4 Q ss_pred hhHHHHHHHHhhhhhhhhcccc---ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRAMSGLT---GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a~~~~~---~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (392) .....+.........-+-..+. ...-.+.|-+.+...+...+.+.+-+++++++++|....|..... ..+++-++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~l-g~~g~iagr 79 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGI-GVTGSIAST 79 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEec-ccCcccccc Confidence 1111111111111111111111 112356666677788899999999999999999999888886543 333333333 Q ss_pred cccc--cccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------ Q lcl|Aclame:pro 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKL------ 235 (392) Q Consensus 166 ~~E~--~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~------ 235 (392) +..+ .+....+-..++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 tdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~ 159 (357) T protein:vir:56 80 TDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRS 159 (357) T ss_pred ccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChh Confidence 2211 12211221234444444444444445666666543 2467777877777777654332222321110 Q ss_pred -------------------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHH Q lcl|Aclame:pro 236 -------------------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQD 269 (392) Q Consensus 236 -------------------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~ 269 (392) ..+...+.|.++. +++..+++.++.. -+.|+.+. T Consensus 160 ~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~d 239 (357) T protein:vir:56 160 SNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQ 239 (357) T ss_pred hCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 0112334555554 3445567877754 57888888 Q ss_pred HHHH--HHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEec Q lcl|Aclame:pro 270 GFNY--LDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) Q Consensus 270 ~~~~--L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~ 347 (392) .++. +..+. ..+.|--......-.....+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+-+ T Consensus 240 Lla~k~~~l~n-~~~~pTE~~Aa~~i~s~k~iGGl~a~~~--PfFP~~-------~llVT~L~NLsIY~Q~gs~RR~~~d 309 (357) T protein:vir:56 240 LLADKYFPIVN-KEQDNSEMLAADVIISQKRIGNLPAVRV--PYFPAD-------AMLITKLENLSIYYMDDSHRRVIEE 309 (357) T ss_pred hhhhhhhhHhh-ccCChHHHHHHHHHHHhhhhCCceeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEe Confidence 7652 32222 2222210000000000123566665543 455543 3677777765444555555544433 Q ss_pred cchhhhhcCceeEEEEEeeCcEEecccceEEEE-ecccCCCCCCCC Q lcl|Aclame:pro 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQPQG 392 (392) Q Consensus 348 ~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~~~~~~~ 392 (392) ... ++.+--.=..--|..|-++.+++.++ ++-+.+.+++++ T Consensus 310 ~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~ 351 (357) T protein:vir:56 310 NPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ccc----cccccchhhhcceeeeeccccEEEeeeeeeccCCCCccc Confidence 321 22222222223355666666666654 333333333333 No 174 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=96.03 E-value=0.0011 Score=36.79 Aligned_cols=285 Identities=13% Similarity=0.102 Sum_probs=142.8 Q ss_pred hcchhhHHHHHHHHhhhhhhhhcccc---ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCc Q lcl|Aclame:pro 85 RNKPLNAEEREFLEDDLEQRAMSGLT---GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMI 161 (392) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~a~~~~~---~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~ 161 (392) .+.......+.........-+-..+. ..+-.+.|.+.+...+.+.+.+.+-+++++++++|....|..... ..+++ T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~l-g~~g~ 79 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQV-GVGQL 79 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEee-cCCcc Confidence 11111111122111111111111111 223467787788888999999999999999999999888886443 33344 Q ss_pred cccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH-----HHHHHHHHHHHHHHHHHHHHHHHhhcccc-- Q lcl|Aclame:pro 162 PFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD-----QNILKYVTKWLGKKSKVTRNVLILGVIEK-- 234 (392) Q Consensus 162 ~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~-----~~l~~~v~~~l~~~~~~~~d~~~~~~~~~-- 234 (392) -++....+ .. .+...++.-.+...+.---..|+.+.|+..+ .+|..-+++.+.++++.-.-.-.++|+.. T Consensus 80 iagrt~tr-~~--~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 156 (358) T protein:vir:78 80 YTGRKKGG-RF--KGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAAD 156 (358) T ss_pred cceecCCC-cc--ccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeecc Confidence 44433322 11 2223445555555555555566777776543 26788888888887765443222232111 Q ss_pred -------------------------------------------ccccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcH Q lcl|Aclame:pro 235 -------------------------------------------LTKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQ 268 (392) Q Consensus 235 -------------------------------------------~~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~ 268 (392) .+.+...+.|.++. ++...+++.++.. -+.|+.+ T Consensus 157 ~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~ 236 (358) T protein:vir:78 157 DTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGT 236 (358) T ss_pred CCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 11122345566654 3456778887764 5888888 Q ss_pred HHHHH--HHHhhccCCce---eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEE Q lcl|Aclame:pro 269 DGFNY--LDKLKDKDGKY---ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL 343 (392) Q Consensus 269 ~~~~~--L~~lkd~~g~~---l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~ 343 (392) ..++. +..+. ..+.| +-...+ .. .+|+.|.+.+ +++|.. .+++--|++.-..+.++..+= T Consensus 237 dLla~k~~~l~n-~~~~pTE~~Aa~~i----~k-~iGGlpa~~~--PfFP~~-------~ilVT~L~NLsIY~Q~gs~RR 301 (358) T protein:vir:78 237 DLVAAAQAKLYS-EATKPSEQIAAQQL----AK-SIAGRKAYIP--PFFPGK-------RMVVTTLDNLHCYTQRGTRKR 301 (358) T ss_pred hhhhHHhhhHhh-cCCCcHHHHHHHHH----HH-HhCCCeEEEc--cccCCC-------ceEEeeccccEEEEecCcEEE Confidence 88652 33333 22232 111000 12 2566666553 455543 367777777544455555554 Q ss_pred EEeccchhhhhcCceeEEEEEeeCcEEecccceEEEE---ecccCCCCCCCC Q lcl|Aclame:pro 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE---IDLSAPVEQPQG 392 (392) Q Consensus 344 ~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~---~~~~a~~~~~~~ 392 (392) .+-+... ++.+--.=..--|..|-++.+++.++ ++-.+..++|++ T Consensus 302 ~~~d~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~ 349 (358) T protein:vir:78 302 KADDNQD----SKSFDNQYWRMEGYALGEHKAYGGFEEADIEIGADPAVLAV 349 (358) T ss_pred EEEeccc----cccccchhhhcceeeeeccccEEEEeeeeeeeCCCCCcccc Confidence 4433322 22222222222355566666665543 332222222222 No 175 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=286 Identities=11% Similarity=0.011 Sum_probs=120.3 Q ss_pred HHHHHhc--chhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhh--hcceeeccCCcceeEEE Q lcl|Aclame:pro 80 FMKALRN--KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ--YVTVEPVRTRSGSRVLE 155 (392) Q Consensus 80 ~~~~~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~--l~~~~~~~~~~~~~~~~ 155 (392) +.+.+++ +.++-.-+ .....+..+.....-+.+. .+++.+.....+.. +++......+..++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~----------~~~~~~~~~nt~~l~~k~~-~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp 69 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQ----------HFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVM 69 (319) T ss_pred CCcccccccceeEeehh----------hhhccCCCcchHHHHHHHH-HHHHHHHHHhhhhhhcccCcceEeccCcEEEEe Confidence 1111110 00000000 0111222223333333333 33444433333221 12321112233455566 Q ss_pred eecCCccccccccccccccc-cccceeeEEechhheeeehhhHHHHHhhhHHH--HHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 156 KNSDMIPFAEITEMGEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN--ILKYVTKWLGKKSKVTRNVLILGVI 232 (392) Q Consensus 156 ~~~~~~~~~~~~E~~~~~~~-~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~--l~~~v~~~l~~~~~~~~d~~~~~~~ 232 (392) +.... +..-..-++..... -+.++...+++-.+.-.+.-=.-. .+++... +...+.+.....+.-.+|...+..+ T Consensus 70 ~i~~~-gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skl 147 (319) T protein:vir:97 70 KGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) T ss_pred eeccc-ccccccCCCCcccCCcccceeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 55542 22222222111111 112334444544443332210000 1111111 2223344444444445554322211 Q ss_pred -------ccccccchhhHHHHHHHHHHHhhhcc-cCCceEEEcHHHHHHHHHhhccCCce-eecccccCCcccceecccc Q lcl|Aclame:pro 233 -------EKLTKQAIKSLDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-ILQSDPTQKNKKLFAGTNP 303 (392) Q Consensus 233 -------~~~~~~~~~~~d~~~~~~~~~~~~~~-~~~a~~v~~~~~~~~L~~lkd~~g~~-l~~~~~~~~~~~~~~g~~p 303 (392) .+.+.+....|+.+.+++. .++... ..+.+++++|..+..|.+-..-.... +.+.....+....+.|.+ T Consensus 148 a~~a~~~~~~~~t~~n~y~~i~~a~~-~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~- 225 (319) T protein:vir:97 148 ARNKAKHLTVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV- 225 (319) T ss_pred HhhcccccccccCHHHHHHHHHHHHH-HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeE- Confidence 1122234456777777664 444433 33456789999999885432111100 112222234445566654 Q ss_pred eEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 304 v~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) |+.+.+. ......+++|... +..... .--.++........| ...++...++|..|.+|++..+..... T Consensus 226 Vi~vps~-------~~k~in~i~~h~~-A~~~~~-k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:97 226 IVKVPTK-------LLQGLQAIAVVGE-VLASPI-QADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred EEEeccc-------ccccceEEEEcCC-eeeeee-eeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccceEEEeec Confidence 4443221 1233446777654 332222 222233222111222 367888888999999999887777788 Q ss_pred cCCCCCCCC Q lcl|Aclame:pro 384 SAPVEQPQG 392 (392) Q Consensus 384 ~a~~~~~~~ 392 (392) ++|++.+.| T Consensus 294 ~~~~~~~~~ 302 (319) T protein:vir:97 294 TEVATKRDG 302 (319) T ss_pred CCcccCCCc Confidence 888888888 No 176 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=286 Identities=11% Similarity=0.011 Sum_probs=120.3 Q ss_pred HHHHHhc--chhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhh--hcceeeccCCcceeEEE Q lcl|Aclame:pro 80 FMKALRN--KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ--YVTVEPVRTRSGSRVLE 155 (392) Q Consensus 80 ~~~~~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~--l~~~~~~~~~~~~~~~~ 155 (392) +.+.+++ +.++-.-+ .....+..+.....-+.+. .+++.+.....+.. +++......+..++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~----------~~~~~~~~~nt~~l~~k~~-~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp 69 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQ----------HFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVM 69 (319) T ss_pred CCcccccccceeEeehh----------hhhccCCCcchHHHHHHHH-HHHHHHHHHhhhhhhcccCcceEeccCcEEEEe Confidence 1111110 00000000 0111222223333333333 33444433333221 12321112233455566 Q ss_pred eecCCccccccccccccccc-cccceeeEEechhheeeehhhHHHHHhhhHHH--HHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 156 KNSDMIPFAEITEMGEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN--ILKYVTKWLGKKSKVTRNVLILGVI 232 (392) Q Consensus 156 ~~~~~~~~~~~~E~~~~~~~-~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~--l~~~v~~~l~~~~~~~~d~~~~~~~ 232 (392) +.... +..-..-++..... -+.++...+++-.+.-.+.-=.-. .+++... +...+.+.....+.-.+|...+..+ T Consensus 70 ~i~~~-gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skl 147 (319) T protein:vir:94 70 KGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) T ss_pred eeccc-ccccccCCCCcccCCcccceeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 55542 22222222111111 112334444544443332210000 1111111 2223344444444445554322211 Q ss_pred -------ccccccchhhHHHHHHHHHHHhhhcc-cCCceEEEcHHHHHHHHHhhccCCce-eecccccCCcccceecccc Q lcl|Aclame:pro 233 -------EKLTKQAIKSLDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-ILQSDPTQKNKKLFAGTNP 303 (392) Q Consensus 233 -------~~~~~~~~~~~d~~~~~~~~~~~~~~-~~~a~~v~~~~~~~~L~~lkd~~g~~-l~~~~~~~~~~~~~~g~~p 303 (392) .+.+.+....|+.+.+++. .++... ..+.+++++|..+..|.+-..-.... +.+.....+....+.|.+ T Consensus 148 a~~a~~~~~~~~t~~n~y~~i~~a~~-~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~- 225 (319) T protein:vir:94 148 ARNKAKHLTVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV- 225 (319) T ss_pred HhhcccccccccCHHHHHHHHHHHHH-HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeE- Confidence 1122234456777777664 444433 33456789999999885432111100 112222234445566654 Q ss_pred eEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 304 v~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) |+.+.+. ......+++|... +..... .--.++........| ...++...++|..|.+|++..+..... T Consensus 226 Vi~vps~-------~~k~in~i~~h~~-A~~~~~-k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:94 226 IVKVPTK-------LLQGLQAIAVVGE-VLASPI-QADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred EEEeccc-------ccccceEEEEcCC-eeeeee-eeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccceEEEeec Confidence 4443221 1233446777654 332222 222233222111222 367888888999999999887777788 Q ss_pred cCCCCCCCC Q lcl|Aclame:pro 384 SAPVEQPQG 392 (392) Q Consensus 384 ~a~~~~~~~ 392 (392) ++|++.+.| T Consensus 294 ~~~~~~~~~ 302 (319) T protein:vir:94 294 TEVATKRDG 302 (319) T ss_pred CCcccCCCc Confidence 888888888 No 177 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=95.71 E-value=0.0016 Score=35.93 Aligned_cols=274 Identities=7% Similarity=-0.038 Sum_probs=107.4 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhccee---eccCCcc-eeEEEeecCCcccccc-----cccccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PVRTRSG-SRVLEKNSDMIPFAEI-----TEMGEIPETD 176 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~---~~~~~~~-~~~~~~~~~~~~~~~~-----~E~~~~~~~~ 176 (392) |. -..++|+.+...+++.+++..++.+++..- .+.+..| ++.++... .....+. +++.... .+ T Consensus 1 Ma------~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~-~~~~~~~~~~~~~~~~~~~-~~ 72 (392) T protein:vir:99 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLT-VS 72 (392) T ss_pred Cc------cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecc-cccceeeeccccccCCccc-cc Confidence 21 123789999999999999998888877542 2222222 23333222 2222222 2222222 22 Q ss_pred ccceeeEEech-hheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------ccccchhhHHHH Q lcl|Aclame:pro 177 NPKFSNVQYAV-KDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK---------LTKQAIKSLDDI 246 (392) Q Consensus 177 ~~~~~~v~~~~-~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~---------~~~~~~~~~d~~ 246 (392) +.+-..+.+.. +..+.-+.|+++-......++..-+.+...++++.++|..++..... ........|+.+ T Consensus 73 ~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i 152 (392) T protein:vir:99 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGV 152 (392) T ss_pred ccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHH Confidence 23334445544 22234445666655555667777777888888999888876543221 111223457777 Q ss_pred HHHHHHHhhhcccCC-ceEEEcHHHHHHHHHhhc-cCCceeec---ccccCCcccceecccceEEecCcccccccccCCc Q lcl|Aclame:pro 247 KDVLNVKLDPAISPN-AILLTNQDGFNYLDKLKD-KDGKYILQ---SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) Q Consensus 247 ~~~~~~~~~~~~~~~-a~~v~~~~~~~~L~~lkd-~~g~~l~~---~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~ 321 (392) +++. ..|+....|. ..++++|..+..|.+... ....+.-. .....+.-..++|. .|+...+ .+...+..... T Consensus 153 ~~a~-~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~-~v~~s~~-~~~~t~~a~~~ 229 (392) T protein:vir:99 153 NGAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGY-EIVESTL-IPHGDAYLYHP 229 (392) T ss_pred HHHH-HHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeee-EEEeecc-cccccceeeec Confidence 7754 3455544443 467889999888753310 00001100 01112333455664 3433222 21111100000 Q ss_pred ceEEEE--------ehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEec---ccceEE---EEecccCCC Q lcl|Aclame:pro 322 APLIIG--------DLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD---NEAAVY---GEIDLSAPV 387 (392) Q Consensus 322 ~~~~~G--------d~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~---~~af~~---l~~~~~a~~ 387 (392) ..+.++ +....+...--..+...+.......+..+...+... .+...+. ..+|.. ++....... T Consensus 230 ~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~--~g~~~v~~~~~~~~~~~~~~~~~~~~v~ 307 (392) T protein:vir:99 230 TAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTY--FGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) T ss_pred cccccccccccccccccceeEEecccceecceeecccceeecccccccee--EEEEEEeeccccceeeeeeeeeecceee Confidence 000000 000000000001111111111111111111111110 0111111 111110 000000000 Q ss_pred CCCC-----------C Q lcl|Aclame:pro 388 EQPQ-----------G 392 (392) Q Consensus 388 ~~~~-----------~ 392 (392) .+|. | T Consensus 308 v~~v~~~~~~~~~~~~ 323 (392) T protein:vir:99 308 VAPEAGANATITAAAG 323 (392) T ss_pred eeeeecccceeEeeec Confidence 0000 0 No 178 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=95.67 E-value=0.0016 Score=35.84 Aligned_cols=289 Identities=11% Similarity=0.068 Sum_probs=138.3 Q ss_pred hhHHHHHHHHhhhhhhhhcccc---ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRAMSGLT---GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a~~~~~---~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (392) .....+.........-+-..+. ...-.+.|-+.+...+...+.+.+-+++++++++|....|..... ..+++-++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~l-g~~g~iagr 79 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGI-GVTGSIAST 79 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEec-ccCcccccc Confidence 1111111111111111111111 112356666677788899999999999999999999888886543 333333333 Q ss_pred cccc--cccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------ Q lcl|Aclame:pro 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKL------ 235 (392) Q Consensus 166 ~~E~--~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~------ 235 (392) +..+ .+....+-..++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 tdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~ 159 (357) T protein:vir:20 80 TDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRS 159 (357) T ss_pred ccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChh Confidence 2211 12211221234444444444444445666666543 2467777877777777654332222321110 Q ss_pred -------------------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHH Q lcl|Aclame:pro 236 -------------------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQD 269 (392) Q Consensus 236 -------------------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~ 269 (392) ..+...+.|.++. +++..+++.++.. -+.|+.+. T Consensus 160 ~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~d 239 (357) T protein:vir:20 160 SNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQ 239 (357) T ss_pred hCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 0112334555554 4445567877754 57888888 Q ss_pred HHHH--HHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEec Q lcl|Aclame:pro 270 GFNY--LDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) Q Consensus 270 ~~~~--L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~ 347 (392) .++. +..+. ..+.|--......-.....+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+-+ T Consensus 240 Lla~k~~~l~n-~~~~ptE~~Aa~~i~s~k~iGGl~a~~~--PfFP~~-------~ilVT~L~NLsIY~Q~gs~RR~~~d 309 (357) T protein:vir:20 240 LLADKYFPIVN-KEQDNSEMLAADVIISQKRIGNLPAVRV--PYFPAD-------AMLITKLENLSIYYMDDSHRRVIEE 309 (357) T ss_pred hhhhhhhhHhh-ccCChHHHHHHHHHHHhhhhCCceeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEe Confidence 7652 32222 2222210000000000123566665543 455543 3677777765444555555544433 Q ss_pred cchhhhhcCceeEEEEEeeCcEEecccceEEEE-ecccCCCCCCCC Q lcl|Aclame:pro 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQPQG 392 (392) Q Consensus 348 ~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~~~~~~~ 392 (392) ... ++.+--.=..--|..|-++.+++.++ ++-+.+.+++.+ T Consensus 310 ~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~ 351 (357) T protein:vir:20 310 NPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ccc----cccccchhhhcceeeeeccccEEEeeeeeeccccCCccC Confidence 321 22222222223455666777776664 333333322322 No 179 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=95.57 E-value=0.0018 Score=35.59 Aligned_cols=357 Identities=13% Similarity=0.128 Sum_probs=152.8 Q ss_pred CCHHH---------------------------------------HHHHHHHHHHHHHHHHHhhh--hhHHHHHHHHHH-H Q lcl|Aclame:pro 1 MSKEL---------------------------------------RELLAKLEGKKEEVRSLMGE--DKVAEAEQMMEE-V 38 (392) Q Consensus 1 M~kel---------------------------------------~el~~~~~~~~~e~~~~~~~--~~~~~~~~~~~e-i 38 (392) |-+.| .+..++..++...++++... ++. +.+..+ + T Consensus 188 ~p~~~~~~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq~~aeeraRi~~I~~l~a~Fggr~---~~l~~~~l 264 (652) T protein:vir:79 188 MPDSIRNMITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRY---QTLQAQCL 264 (652) T ss_pred hHHHHHHHhcccccccccccccccccccccccccCCcCchhHHHHHHHHHHHHHHHHHHHHHHhhcccc---chHHHHHh Confidence 11111 22233333444444444321 111 111111 0 Q ss_pred HHHHHHHHHHH-HHHHHHHHHhhc-cc--cccccccchhhHHHHHHHHHH--hcch-------------hhHHHHHHH-- Q lcl|Aclame:pro 39 RSLQKKIDLQR-SLDEAETEERNN-GR--EVETRNVDGEMEYRDVFMKAL--RNKP-------------LNAEEREFL-- 97 (392) Q Consensus 39 ~~l~~~i~~~~-~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~a~~~~~--~~~~-------------~~~~~~~~~-- 97 (392) .+..-.++..+ .+.+...+.... .. +..... ......++++...+ |.+. +....+.-. T Consensus 265 ~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~-~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~ 343 (652) T protein:vir:79 265 ADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYA-GNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTE 343 (652) T ss_pred hccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEee-ccchhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHh Confidence 00000111111 111112111110 00 000000 01112223332222 1110 000000000 Q ss_pred ---------HhhhhhhhhccccccccceecchhhhhHHHHhHHh-----hhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 98 ---------EDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS-----FDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 98 ---------~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~-----~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) ......++++.+|+. -|.-+.+.+...+++ ......+++..+++--... ......+.+.. T Consensus 344 ~G~~~~~~~~~~~v~~A~~hsTsD-----Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~-~~~~lg~~~~L 417 (652) T protein:vir:79 344 RGIGVSSYNPMQMVGAAFTHSTSD-----FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIA-HRVGMGGFSAL 417 (652) T ss_pred hccCCCCCCHHHHHHHHhhcCcch-----HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCcccccc-ceeecCCCCCc Confidence 001112222211111 232222222222221 2345566666555432221 22334566778 Q ss_pred cccccccccccccccceeeEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------ Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------ 237 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~------ 237 (392) ..|.|+++.+-.+ ..=..-++.+.+++..+.||++++-+.+.++.+-|...+..+.+++++..++..+..+.. T Consensus 418 ~~V~E~gEyk~~t-~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk 496 (652) T protein:vir:79 418 RQVREGAEYKYVT-TGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNV 496 (652) T ss_pred cccCCCCccceee-ecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCc Confidence 8899999987543 343567899999999999999998777788888899999999999998765543322211 Q ss_pred -------------cchhh---HHHHHHHHHHHhhhc---ccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccce Q lcl|Aclame:pro 238 -------------QAIKS---LDDIKDVLNVKLDPA---ISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLF 298 (392) Q Consensus 238 -------------~~~~~---~d~~~~~~~~~~~~~---~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~ 298 (392) .+... .+....+|..+-+.. ...+..||..|.......++--+.. +-..+...+....+ T Consensus 497 ~LF~hA~H~Nl~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~--v~~a~~~~~~~Np~ 574 (652) T protein:vir:79 497 SLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSS--VKGADINAGIINPV 574 (652) T ss_pred eeecccccccccccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCC--Cccccccccccccc Confidence 01111 222222333332211 1355678888887766555532221 10011111112222 Q ss_pred ecccceEEecCcccccccccCCcceEEEEehhh------ceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEec Q lcl|Aclame:pro 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKE------AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWD 372 (392) Q Consensus 299 ~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~------~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~ 372 (392) .+...| +++..+ ...+ ....++.+-.. +|..+ .++..++.- ..|..+.+.|++...+|.++++ T Consensus 575 ~~~~~~-i~eprL-~~~s----~~~wylaa~~~~dtiev~yL~G-~~~P~ie~~----~gf~~dG~~~kvrlD~G~~~iD 643 (652) T protein:vir:79 575 KDFATV-IAEPRL-DDNS----QTTFYLAASKGSDTIEVAYLNG-VDTPYIDQM----EGFSVDGVTTKVRIDAGVAPVD 643 (652) T ss_pred cccccc-cccccc-CCCC----cccEEEecCCCCCeEEEEEecC-CCCCeeeec----CCCCcceEEEEEEEeccCceee Confidence 222212 222222 1110 11122222111 12222 234444332 2488999999999999999999 Q ss_pred ccceEEEEe Q lcl|Aclame:pro 373 NEAAVYGEI 381 (392) Q Consensus 373 ~~af~~l~~ 381 (392) -.++++.+- T Consensus 644 ~RG~~k~t~ 652 (652) T protein:vir:79 644 HRGLVKCTA 652 (652) T ss_pred ccceeeecC Confidence 998886653 No 180 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=95.55 E-value=0.0016 Score=35.98 Aligned_cols=263 Identities=12% Similarity=0.016 Sum_probs=121.9 Q ss_pred ccccceecch--hhhhHHHHhHHhhhhhhhhcceee---ccCCcceeEEEeecCCccccccccccccccccccceeeEEe Q lcl|Aclame:pro 111 GEDGGLVIPQ--DIQTQINELARSFDALEQYVTVEP---VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) Q Consensus 111 ~~~gg~~iP~--~~~~~ii~~~~~~~~l~~l~~~~~---~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~ 185 (392) .+..++++.+ -+...|.+.....-..+.++.+.+ ..-.+..+......+....+|.+.++..-+.-+..+++-.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 2223343321 223345443333333333433322 11112222222222333333877765433333467778888 Q ss_pred chhheeeehhhHHHHHhhhHH---HHHHHHHHHHHHHHHHHHHHHHhhccccc-c-----------------ccc----- Q lcl|Aclame:pro 186 AVKDRAGILPLSRSLLQDSDQ---NILKYVTKWLGKKSKVTRNVLILGVIEKL-T-----------------KQA----- 239 (392) Q Consensus 186 ~~~~i~~~~~iS~e~l~ds~~---~l~~~v~~~l~~~~~~~~d~~~~~~~~~~-~-----------------~~~----- 239 (392) +.+.++.-..+|.+-|..+.. ++.+--.....+++...+|+..+.|.... + ..+ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~ 160 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQ 160 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccc Confidence 888888888888766665432 34554455555666666666655552210 0 000 Q ss_pred hhhHHHHHHHHHHHhh------hcccCCceEEEcHHHHHHHHHhhccC-CceeecccccCCcccceecccceEEecCccc Q lcl|Aclame:pro 240 IKSLDDIKDVLNVKLD------PAISPNAILLTNQDGFNYLDKLKDKD-GKYILQSDPTQKNKKLFAGTNPVVVVSNRFL 312 (392) Q Consensus 240 ~~~~d~~~~~~~~~~~------~~~~~~a~~v~~~~~~~~L~~lkd~~-g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~ 312 (392) ..+.+.+++.+...+. .....+..++|.|+.+..|....-++ |.-++. -+...++. .-++|+.+..-... T Consensus 161 ~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~-~l~~n~~~--~~g~~l~I~~v~~~ 237 (304) T protein:vir:52 161 AMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALE-FLTKHLSA--AAGRQVAIKALPSN 237 (304) T ss_pred cCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHH-HHHHhccc--ccCCcceEEEeccc Confidence 1134444444332221 11133457999999999986543222 222221 11111111 11233332221111 Q ss_pred ccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCc--eeEEEEEeeCc-EEecccceEEEEe Q lcl|Aclame:pro 313 KSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNT--LDLRAIQRDDV-QMWDNEAAVYGEI 381 (392) Q Consensus 313 ~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~--~~~~~~~r~~~-~v~~~~af~~l~~ 381 (392) -..+...+...+++.+-+.-+..+ .-.|-+++.+.. .+|. +.+=++.|+++ .+.+|.+|+++.. T Consensus 238 ~~~~g~~g~~r~vvY~~d~~~~~~-~vP~p~~~l~~q----~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 238 YGTRVTDGKTRAMVYVNSKEHVIF-DVPMSPTVLDAQ----PKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ccccCCCCceEEEEEecChhheEE-ecCccccccchh----hcCCceEEecceeeeeeEEEEccceeeeecC Confidence 112233455556665554433222 123444444421 2343 33446777777 6668999999988 No 181 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=289 Identities=12% Similarity=0.092 Sum_probs=136.6 Q ss_pred hhHHHHHHHHhhhhhhhhcccc---ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRAMSGLT---GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a~~~~~---~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (392) .....+.........-+-..+. ...-.+.|-+.+...+...+.+.+-+++++++++|....|..... ..+++-++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~l-g~~g~iagr 79 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGI-GVTGSIAST 79 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEec-ccCcccccc Confidence 1111111111111111111111 112356666677788899999999999999999999888886543 333333333 Q ss_pred cccc--cccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc------ Q lcl|Aclame:pro 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKL------ 235 (392) Q Consensus 166 ~~E~--~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~------ 235 (392) +... .+....+-..++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 tdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~ 159 (357) T protein:vir:60 80 TDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRS 159 (357) T ss_pred cccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChh Confidence 2211 12111221234444444444444455666666543 2467777877777777654332222321110 Q ss_pred -------------------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHH Q lcl|Aclame:pro 236 -------------------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQD 269 (392) Q Consensus 236 -------------------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~ 269 (392) ..+...+.|.++. +++..+++.++.. -+.|+.+. T Consensus 160 ~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~d 239 (357) T protein:vir:60 160 SNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQ 239 (357) T ss_pred hCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 1112334555554 4445567877754 58888888 Q ss_pred HHH--HHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEec Q lcl|Aclame:pro 270 GFN--YLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) Q Consensus 270 ~~~--~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~ 347 (392) .++ ++..+. ..+.|--......-.....+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+-+ T Consensus 240 Lla~k~~~l~n-~~~~pTE~~Aa~~i~s~k~iGGl~a~~~--PfFP~~-------~llVT~L~NLsIY~Q~gs~RR~~~d 309 (357) T protein:vir:60 240 LLADKYFPIVN-REQDNSEMLAADVIISQKRIGNLPAVRV--PYFPAD-------AMLITKLENLSIYYMDDSHRRVIEE 309 (357) T ss_pred hhhHHhhhHhh-cCCChHHHHHHHHHHHhhhhcCcceEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEe Confidence 765 233333 2223210000000000113555665543 455543 3677777775444555555544433 Q ss_pred cchhhhhcCceeEEEEEeeCcEEecccceEEEE-eccc-----CCCCCCCC Q lcl|Aclame:pro 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLS-----APVEQPQG 392 (392) Q Consensus 348 ~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~-----a~~~~~~~ 392 (392) ... ++.+--.=..--|..|-++.+++.++ ++-+ ++...+++ T Consensus 310 ~p~----r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~~~~~~ 356 (357) T protein:vir:60 310 NPK----LDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPG 356 (357) T ss_pred ccc----cccccchhhhcceeeeeccccEEEeeeeeeccCcccccCCCCCC Confidence 321 22222222222355556666666554 2222 22222222 No 182 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=94.82 E-value=0.0034 Score=34.11 Aligned_cols=282 Identities=14% Similarity=0.141 Sum_probs=138.7 Q ss_pred hhHHHHHHHHhhhhh-hhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQ-RAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) Q Consensus 89 ~~~~~~~~~~~~~~~-~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (392) .....+......... .-+......+..+.|.+.+...+.+.+.+.+-+++++++++|....|...-. ..+++-++.+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~l-g~~g~iagrtd 79 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGI-GVSGTIASRTD 79 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeee-ccCcccccccc Confidence 111111111111111 1122223334567777788888999999999999999999999888876544 33333333332 Q ss_pred c--ccccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Q lcl|Aclame:pro 168 E--MGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEK--------- 234 (392) Q Consensus 168 E--~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~--------- 234 (392) . ++.....+-..++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+.. T Consensus 80 T~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~n 159 (338) T protein:vir:11 80 TTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAAN 159 (338) T ss_pred CCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhC Confidence 1 111111211134444455555544455666666643 357888888888887765443333333221 Q ss_pred ------------------------------------ccccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHHHHHH-- Q lcl|Aclame:pro 235 ------------------------------------LTKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY-- 273 (392) Q Consensus 235 ------------------------------------~~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~~~~~-- 273 (392) ++.+...+.|.++. +++..+++.++.. -+.|+.+..++. T Consensus 160 PllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~ 239 (338) T protein:vir:11 160 PLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKY 239 (338) T ss_pred cCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHH Confidence 11122344555554 3445667887754 488888876552 Q ss_pred HHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhh Q lcl|Aclame:pro 274 LDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) Q Consensus 274 L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f 353 (392) +..+. ....|--.-....-.....+|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+... T Consensus 240 ~~l~n-~~~~ptE~~Aa~~~~s~k~iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~--- 306 (338) T protein:vir:11 240 FPMVN-KDQPATEKIATDLILSQKRMGGLPPVEV--PYVPEK-------GLMVTTLKNLSLYWQIGGRRRYLKEVPE--- 306 (338) T ss_pred hHHHh-cCCChHHHHHHHHHHHhhhhCCceeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEeccc--- Confidence 22232 2222210000000000113455565543 455543 3677777775444455555444433321 Q ss_pred hcCceeEEEEEeeCcEEecccceEEEE-ecccC Q lcl|Aclame:pro 354 TRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSA 385 (392) Q Consensus 354 ~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a 385 (392) ++.+--.=..--|..|-++.+++.++ ++... T Consensus 307 -r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 307 -KNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred -cccccchhhhccceeeeccccEEEeecceecC Confidence 22222222223355666666666654 22222 No 183 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=94.64 E-value=0.0039 Score=33.80 Aligned_cols=283 Identities=12% Similarity=0.132 Sum_probs=138.3 Q ss_pred hhHHHHHHHHhhhhhhh-hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (392) .....+.........-+ ........-.+.|-+.+...+.+.+.+.+-+++++++++|....|...-.. .+++-++... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg-~~g~iagrt~ 79 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLS-VSGPIASRTD 79 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeec-cCcceeeeec Confidence 11111111111111111 111112233455655777788899999999999999999998888765443 3333333332 Q ss_pred ccc-ccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|Aclame:pro 168 EMG-EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------- 235 (392) Q Consensus 168 E~~-~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~--------- 235 (392) .+. ...+.+...++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nP 159 (337) T protein:vir:79 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) T ss_pred CCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCc Confidence 221 1211222234444555555554556666666653 3578888888888887664443333332211 Q ss_pred -----------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHHHHHH--HH Q lcl|Aclame:pro 236 -----------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY--LD 275 (392) Q Consensus 236 -----------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~~~~~--L~ 275 (392) +.+...+.|.++. +++..+++.++.. -+.|+.+..++. +. T Consensus 160 llqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~ 239 (337) T protein:vir:79 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFP 239 (337) T ss_pred CccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhH Confidence 1112234455443 3444567777754 578888877652 22 Q ss_pred HhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhc Q lcl|Aclame:pro 276 KLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTR 355 (392) Q Consensus 276 ~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~ 355 (392) .+. ..+.|--.-....-.....+|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+... + T Consensus 240 l~n-~~~~ptE~~Aa~~i~s~k~iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r 305 (337) T protein:vir:79 240 IVN-ATQAPTERLAADLIVSQKRIGNLPAVRV--PFFPKR-------ALMVTKLSNLSIYYQEGARRRTLKEVPE----R 305 (337) T ss_pred Hhc-cCCCcHHHHHHHHHHHhhhhCCceeEEc--cccCCC-------ceEEeechhcEEEEecCcEEEEEEEccc----c Confidence 233 2222200000000000113555565543 455543 3677777775444555555544433322 2 Q ss_pred CceeEEEEEeeCcEEecccceEEEE-ecccCC Q lcl|Aclame:pro 356 NTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAP 386 (392) Q Consensus 356 ~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~ 386 (392) +.+--.=..--|+.|-++.+++.++ ++-... T Consensus 306 ~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 306 DRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccccchhhccceeeeeccccEEEEeceeecCC Confidence 2322222223356667777777664 332222 No 184 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=284 Identities=11% Similarity=0.162 Sum_probs=137.4 Q ss_pred hhHHHHHHHHhhhhh-hhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQ-RAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) Q Consensus 89 ~~~~~~~~~~~~~~~-~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (392) .....+......... .-.......+..+.|-+.+...+...+.+.+-+++++++++|....|..... ..+++-++... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~l-g~~g~iagrtd 79 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGL-GVSGPVASTTD 79 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEee-ccCcceeeccc Confidence 111111111111111 1112222334456676777788999999999999999999999888876543 23333333321 Q ss_pred cc-cccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Q lcl|Aclame:pro 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEK---------- 234 (392) Q Consensus 168 E~-~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~---------- 234 (392) .. .+....+-..++.-.+...+.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+.. T Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nP 159 (339) T protein:vir:79 80 TTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANP 159 (339) T ss_pred CCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCc Confidence 11 11221221234444444444444445666666543 346788888888877765332222222111 Q ss_pred -----------------------------------ccccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHHHHH--HH Q lcl|Aclame:pro 235 -----------------------------------LTKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFN--YL 274 (392) Q Consensus 235 -----------------------------------~~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~~~~--~L 274 (392) ++.+...+.|.++. +++..+++.++.. -+.|+.+...+ ++ T Consensus 160 llqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~ 239 (339) T protein:vir:79 160 MLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYF 239 (339) T ss_pred CccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhh Confidence 00112334555553 3445667888754 57888888765 23 Q ss_pred HHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhh Q lcl|Aclame:pro 275 DKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) Q Consensus 275 ~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~ 354 (392) ..+. ....|--.-....-.....+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+-+... T Consensus 240 ~l~n-~~~~ptE~~Aa~~i~s~k~iGGl~a~~~--PfFP~~-------~llVT~L~NLsIY~Q~gs~RR~~~d~p~---- 305 (339) T protein:vir:79 240 PLVN-RDRDPVQQIAADLIISQKRIGNLPAIRV--PYFPAN-------GLLVTRLDNLSIYYQEGGRRRTILDNAK---- 305 (339) T ss_pred hHhh-cCCChHHHHHHHHHHHhhhhCCceeEEc--cccCCC-------ceEEeechhcEEEEecCcEEEEEEeccc---- Confidence 3333 2222210000000001123555665543 455543 3677777775444555555544443322 Q ss_pred cCceeEEEEEeeCcEEecccceEEEE-ecccCCC Q lcl|Aclame:pro 355 RNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPV 387 (392) Q Consensus 355 ~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~~ 387 (392) ++.+--.=..--|+.|-++.+++.++ ++-+..+ T Consensus 306 r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 306 RDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred cccccchhhccceeeeeccccEEEeeeeecccCC Confidence 22222222223356666777777664 3222222 No 185 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=283 Identities=12% Similarity=0.133 Sum_probs=137.2 Q ss_pred hhHHHHHHHHhhhhhhh-hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (392) .....+.........-+ .......+..+.|-+.+...+...+.+.+-+++++++++|....|...... .+++-++... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg-~~g~iagrtd 79 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLS-VSGPIASRTD 79 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecc-cCcceeeeec Confidence 11111111111111111 111122234566766777889999999999999999999998888765432 3333333222 Q ss_pred ccc-ccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|Aclame:pro 168 EMG-EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------- 235 (392) Q Consensus 168 E~~-~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~--------- 235 (392) .+. ...+.+...++.-.+...+.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nP 159 (337) T protein:vir:78 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) T ss_pred CCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCc Confidence 221 1211222234444444444444445666666543 3467888888888777654432223332111 Q ss_pred -----------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHHHHHH--HH Q lcl|Aclame:pro 236 -----------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY--LD 275 (392) Q Consensus 236 -----------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~~~~~--L~ 275 (392) +.+...+.|.++. +++..+++.++.. -+.|+.+...+. +. T Consensus 160 llqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~ 239 (337) T protein:vir:78 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) T ss_pred CccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHH Confidence 1112234555553 3444567877654 588888888652 32 Q ss_pred HhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhc Q lcl|Aclame:pro 276 KLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTR 355 (392) Q Consensus 276 ~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~ 355 (392) .+. ..+.|--.-....-.....+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+-+... + T Consensus 240 l~n-~~~~ptE~~Aa~~i~s~k~iGGl~a~~~--PfFP~~-------~ilVT~L~NLsIY~Q~gs~RR~~~d~p~----r 305 (337) T protein:vir:78 240 IVN-ATQAPTERLAADLIVSQKRIGNLPAVRV--PFFPKR-------ALMVTKLSNLSIYYQEGARRRTLKEVPE----R 305 (337) T ss_pred HHh-cCCCcHHHHHHHHHHHhhhhcCcceEEc--cccCCC-------ceEEeechhcEEEEecCcEEEEEEeccc----c Confidence 333 2222210000000001113555665543 455543 3677777775444555555544443322 2 Q ss_pred CceeEEEEEeeCcEEecccceEEEE-ecccCC Q lcl|Aclame:pro 356 NTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAP 386 (392) Q Consensus 356 ~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~ 386 (392) +.+--.=..--|+.|-++.+++.++ ++-... T Consensus 306 ~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 306 DRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccccchhhccceeeeeccccEEEEeceeecCC Confidence 2322222223356677777777665 332222 No 186 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=283 Identities=12% Similarity=0.133 Sum_probs=138.5 Q ss_pred hhHHHHHHHHhhhhhhh-hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccccccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (392) .....+.........-+ .......+..+.|-+.+...+.+.+.+.+-+++++++++|....|...-.. .+++-++... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg-~~g~iagrt~ 79 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLS-VSGPIASRTD 79 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeec-cCcceeeeec Confidence 11111111111111111 111122233455655777788899999999999999999998888765443 3333333332 Q ss_pred ccc-ccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Q lcl|Aclame:pro 168 EMG-EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------- 236 (392) Q Consensus 168 E~~-~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------- 236 (392) .+. ...+.+...++.-.+..++.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+..+. T Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nP 159 (337) T protein:vir:10 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) T ss_pred CCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCc Confidence 221 1211222234444555555554556666666653 35788888888888876644433333322111 Q ss_pred ------------------------------------ccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHHHHHH--HH Q lcl|Aclame:pro 237 ------------------------------------KQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY--LD 275 (392) Q Consensus 237 ------------------------------------~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~~~~~--L~ 275 (392) .+...+.|.++. +++..+++.++.. -+.|+.+..++. +. T Consensus 160 llqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~ 239 (337) T protein:vir:10 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) T ss_pred CccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhH Confidence 112234455443 3444567777754 578888877652 22 Q ss_pred HhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhc Q lcl|Aclame:pro 276 KLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTR 355 (392) Q Consensus 276 ~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~ 355 (392) .+. ..+.|--.-....-.....+|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+... + T Consensus 240 l~n-~~~~ptE~~Aa~~i~s~k~iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r 305 (337) T protein:vir:10 240 IVN-ATQAPTERLAADLIVSQKRIGNLPAVRV--PFFPKR-------ALMVTKLSNLSIYYQEGARRRTLKEVPE----R 305 (337) T ss_pred Hhc-cCCCcHHHHHHHHHHHhhhhCCceeEEc--cccCCC-------ceEEeechhcEEEEecCcEEEEEEEccc----c Confidence 233 2222200000000000113555565543 455543 3677777775444555555544433322 2 Q ss_pred CceeEEEEEeeCcEEecccceEEEE-ecccCC Q lcl|Aclame:pro 356 NTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAP 386 (392) Q Consensus 356 ~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~ 386 (392) +.+--.=..--|+.|-++.+++.++ ++-... T Consensus 306 ~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 306 DRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccccchhhccceeeeeccccEEEEeceeecCC Confidence 2322222223356667777777665 322222 No 187 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=93.86 E-value=0.0061 Score=32.71 Aligned_cols=360 Identities=12% Similarity=0.080 Sum_probs=146.1 Q ss_pred CCHHH-HHHHHHHHHHHHHHHHHhhhh--hHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH-hhccc-c--ccccccch Q lcl|Aclame:pro 1 MSKEL-RELLAKLEGKKEEVRSLMGED--KVAEAEQMMEEVRSLQKKIDLQ-RSLDEAETEE-RNNGR-E--VETRNVDG 72 (392) Q Consensus 1 M~kel-~el~~~~~~~~~e~~~~~~~~--~~~~~~~~~~ei~~l~~~i~~~-~~~~~~~~~~-~~~~~-~--~~~~~~~~ 72 (392) ..-.+ .+..+...++.+.++.+...- ...++ +...+.+..-.++.. .++.+.+... ..... . ...... . T Consensus 260 ~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~l--~a~~l~d~~~s~d~ar~~lL~~l~~~~~p~~~~~~~~~~~~~-~ 336 (693) T protein:vir:95 260 TEADIRARILAEESGRRSAITAAFGAFSTGHAEL--LATCLNDMNITVDQAREKLLAAIGADTQPAAALSAGAHIHAG-N 336 (693) T ss_pred CcchhhHHHHHHHHHHHHHHHHHHHhccCChHHH--HHHHHhhcCCCHHHHHHHHHHHHhhccCCCCCcCcCccccCC-c Confidence 11111 112222233333333332210 11111 111111111111111 1111111111 10000 0 000000 0 Q ss_pred hhHHHHHHHHHH--hcc-------------hhhHHHHHHH-----------HhhhhhhhhccccccccceecchhhhhHH Q lcl|Aclame:pro 73 EMEYRDVFMKAL--RNK-------------PLNAEEREFL-----------EDDLEQRAMSGLTGEDGGLVIPQDIQTQI 126 (392) Q Consensus 73 ~~~~~~a~~~~~--~~~-------------~~~~~~~~~~-----------~~~~~~~a~~~~~~~~gg~~iP~~~~~~i 126 (392) ....++.+...+ |.+ .+..-.+... ......+++..+|+. -|.-+.+.+ T Consensus 337 g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htTSD-----Fp~IL~~~~ 411 (693) T protein:vir:95 337 GNLVGDSVRASVLARIGRGERQADNAYNGMTLRELARASLVDRGIGVASLNAPQMVGLAFTHTSSD-----FGLILLDVA 411 (693) T ss_pred hhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCcch-----hHHHHHHHH Confidence 001111111111 111 0000000000 001112222221111 122122212 Q ss_pred HHhHHh-----hhhhhhhcceeeccCCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHH Q lcl|Aclame:pro 127 NELARS-----FDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLL 201 (392) Q Consensus 127 i~~~~~-----~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l 201 (392) ...+++ .......+...+++--... ......+.+....|.|+++.+-. ...=..-++...+++..+.||++++ T Consensus 412 nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~-~~~~lg~~~~L~~V~E~gEyk~~-t~~e~~e~~~l~tyG~~~~iTRqai 489 (693) T protein:vir:95 412 NKSVLAGWEEAEETFPLWTKSGILTDFKPA-RRVGLGEFSSLRQVREGAEYKYV-TLGERGEQIILATYGELFSITRQAI 489 (693) T ss_pred HHHHHHHHHhhhhHHHHHhccCCCCccccc-ceeecCCCCChhhcCCCCceeee-ecCCccceeehhhcCCeeeecHHhh Confidence 222221 2334444554444332211 12233455666788999887632 2222345788999999999999998 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------ccchhhHHHHHH---HHHHHhh---- Q lcl|Aclame:pro 202 QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------------------KQAIKSLDDIKD---VLNVKLD---- 255 (392) Q Consensus 202 ~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-------------------~~~~~~~d~~~~---~~~~~~~---- 255 (392) -+.+.++.+-|...+..+.+++++..++..+..+. +.+..+.+.+-. +|..+-. T Consensus 490 INDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~ 569 (693) T protein:vir:95 490 INDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEK 569 (693) T ss_pred hccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhc Confidence 88788888889999999999999887665433211 111222333322 2332210 Q ss_pred ----hcccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehh- Q lcl|Aclame:pro 256 ----PAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLK- 330 (392) Q Consensus 256 ----~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~- 330 (392) .-...+..||..+......+++-.+.-.+ ..+...+....+.|...| +.++-+.+.+ ...-+++.|.. T Consensus 570 ~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~--~a~~~~~~~NP~~~~~~v--i~~prL~~~s---~~~Wyl~a~~~~ 642 (693) T protein:vir:95 570 GKGRTLNIRPGFVLTPVALEDKANQIINSESVP--GADVNSGIVNPIRAFAQV--IGEPRLDDAS---ATAWYMAAKKGS 642 (693) T ss_pred cCCceeecccceEEecchHHHHHHHHhcccccc--ccccccccccchhccccc--cccceecCCC---CCceEEecCCCC Confidence 11235567888888877666655332111 011111112223332222 1222221110 01112222211 Q ss_pred ----hceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 331 ----EAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 331 ----~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) -+|.-+ .++..++.. ..|..+.+.|++...+|.+++|-.++++ .+.| T Consensus 643 dtie~~yL~G-~~~P~ie~~----~gf~~dG~~~kvr~D~G~~~iD~Rg~~k---n~GA 693 (693) T protein:vir:95 643 DTIEVAYLDG-VDTPYLEQQ----EGFTVDGVASKVRIDAGVAPLDFRGLQK---SNGA 693 (693) T ss_pred CeEEEEEecC-CCCCeEeec----CCCCcceEEEEEEEeccCceeecccccc---CCCC Confidence 122222 234444442 3588999999999999999998776652 3223 No 188 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=90.97 E-value=0.018 Score=30.13 Aligned_cols=281 Identities=12% Similarity=0.128 Sum_probs=135.1 Q ss_pred hcchhhHHHHHHHHhhhhhhh-hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 85 RNKPLNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~a-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) .+.......+.........-+ ..........+.|-+.+...+.+.+.+.+-+++++++++|....|...-. ..+++-+ T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~l-g~~g~ia 79 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDV-GVSGLYT 79 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeec-cccccee Confidence 111111111111111111111 11122233455566677788999999999999999999999888876543 2333333 Q ss_pred cccccccccccccccceeeEEechhheeeehhhHHHHHhhhH-----HHHHHHHHHHHHHHHHHHHHHHHhhcccccc-- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD-----QNILKYVTKWLGKKSKVTRNVLILGVIEKLT-- 236 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~-----~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~-- 236 (392) +.... +..+ . .+.++...+...+.---..|+.+.|+..+ ++|..-+++.+.++++.-.-.-.++|+..+. T Consensus 80 grtdt-~R~~-r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~T 156 (341) T protein:vir:27 80 GRKAG-GRFT-K-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADT 156 (341) T ss_pred eccCC-Ccee-c-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCC Confidence 33332 2222 1 13445445555555444556666665433 6788888888888887655544445433211 Q ss_pred -------------------------------------ccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcHHHHHH--H Q lcl|Aclame:pro 237 -------------------------------------KQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY--L 274 (392) Q Consensus 237 -------------------------------------~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~~~~~~--L 274 (392) .+...+.|.++. ++...+++.++.. -+.|+.+...+. + T Consensus 157 d~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~ 236 (341) T protein:vir:27 157 DPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQA 236 (341) T ss_pred ChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhh Confidence 111233455443 3445567777754 478888777652 2 Q ss_pred HHhhccCCce---eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccch- Q lcl|Aclame:pro 275 DKLKDKDGKY---ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG- 350 (392) Q Consensus 275 ~~lkd~~g~~---l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~- 350 (392) ..+. ....| +-..-+ ..+ +|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+... T Consensus 237 ~l~n-~~~~ptE~~Aa~~i----~k~-iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r 301 (341) T protein:vir:27 237 KLYD-KADKPSEQIAAQKL----DKT-IAGRPAYVP--PFLPDN-------AMVVTIPENLQVLTQHGTAQRKAKHESDR 301 (341) T ss_pred hhhc-cCCCCHHHHHHHHH----HHh-hCCCeEEEc--cccCCC-------ceEEeeccceEEEEecCcEEEEEEecccc Confidence 2222 21111 110000 123 455555543 455543 3677777775444444555444433322 Q ss_pred hhhhcCceeEEEEEeeCcEEecccceE-----EEEecccCCCCCCCC Q lcl|Aclame:pro 351 KAFTRNTLDLRAIQRDDVQMWDNEAAV-----YGEIDLSAPVEQPQG 392 (392) Q Consensus 351 ~~f~~~~~~~~~~~r~~~~v~~~~af~-----~l~~~~~a~~~~~~~ 392 (392) +.++..+=. +.|-+-.+|. .+++.+++---.-.- T Consensus 302 ~rie~yes~--------YvVEdyg~~~~~~~~~vkl~~~~~~~~~~~ 340 (341) T protein:vir:27 302 KRSKTHTGA--------WKVTQWVCWKRSPLTTQKKSTSALNHRSER 340 (341) T ss_pred ccccchhhh--------heeehhhhhhhccccccccCcccccccccc Confidence 112211112 3333333343 333333332222222 No 189 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=90.93 E-value=0.018 Score=30.11 Aligned_cols=284 Identities=13% Similarity=0.089 Sum_probs=139.2 Q ss_pred HhcchhhHHHHHHHHhhhhhhhhccccc----ccc-ceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeec Q lcl|Aclame:pro 84 LRNKPLNAEEREFLEDDLEQRAMSGLTG----EDG-GLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~----~~g-g~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~ 158 (392) ++ ...+.........-+-..+.. ..+ .+.|-+.+...+...+.+.+-+++++++++|....|..... .. T Consensus 1 M~-----~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~l-g~ 74 (342) T protein:vir:10 1 MK-----DLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGL-DS 74 (342) T ss_pred CC-----hHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEec-cc Confidence 11 111111111111111111111 112 36676677788999999999999999999999888886543 33 Q ss_pred CCcccccccccc--ccccccccceeeEEechhheeeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 159 DMIPFAEITEMG--EIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEK 234 (392) Q Consensus 159 ~~~~~~~~~E~~--~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~ 234 (392) +++-++.+...+ +....+...++.-.+...+.---..|+.+.|+.. .++|..-+++.+.++++.-.-.-.++|+.. T Consensus 75 ~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~ 154 (342) T protein:vir:10 75 AHTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSR 154 (342) T ss_pred CcccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceee Confidence 333444332111 1221222244444455555544455666666643 346788888888877765443333333221 Q ss_pred c-------------------------------------------cccchhhHHHHHH-HHHHHhhhcccCC--ceEEEcH Q lcl|Aclame:pro 235 L-------------------------------------------TKQAIKSLDDIKD-VLNVKLDPAISPN--AILLTNQ 268 (392) Q Consensus 235 ~-------------------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~~--a~~v~~~ 268 (392) + +.+...+.|.++. +++..+++.++.. -+.|+.+ T Consensus 155 A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~ 234 (342) T protein:vir:10 155 AATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGR 234 (342) T ss_pred ccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1 1112234555554 4445567877654 5888888 Q ss_pred HHHHH--HHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEe Q lcl|Aclame:pro 269 DGFNY--LDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELAST 346 (392) Q Consensus 269 ~~~~~--L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~ 346 (392) ...+. +..+... +.|--.-....-.....+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+- T Consensus 235 dLladk~~~l~n~~-~~ptE~~Aa~~i~s~k~iGGl~a~~~--PfFP~~-------~ilVT~L~NLsIY~Q~gs~RR~~~ 304 (342) T protein:vir:10 235 KLLADKYFPIVNQQ-NAPTEELAADIVISQKRIGGLKAVRV--PFFPAN-------AILITKLENLAIYVQEGTTRKHIE 304 (342) T ss_pred hhhHHHHHHHHhcC-CChHHHHHHHHHHhhhhhcCceeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEE Confidence 87652 3333322 22210000000001113555665543 455543 367777777544455555554443 Q ss_pred ccchhhhhcCceeEEEEEeeCcEEecccceEEEE-ecccCCCCCCC Q lcl|Aclame:pro 347 DVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE-IDLSAPVEQPQ 391 (392) Q Consensus 347 ~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~-~~~~a~~~~~~ 391 (392) +... ++.+--.=..--|..|-++.+++.++ ++...| . T Consensus 305 d~p~----r~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~----~ 342 (342) T protein:vir:10 305 NVPK----KDRIETYESENIDYVVEDYGCAALIENITLKDK----E 342 (342) T ss_pred eccc----cccccchhhhccceeeeccccEEEeecceecCC----C Confidence 3322 22332222233466777777777775 332222 2 No 190 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=90.62 E-value=0.02 Score=29.91 Aligned_cols=297 Identities=10% Similarity=0.001 Sum_probs=111.2 Q ss_pred cccccccccchhhHHHHHHHHHHhc--chhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhh- Q lcl|Aclame:pro 62 GREVETRNVDGEMEYRDVFMKALRN--KPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ- 138 (392) Q Consensus 62 ~~~~~~~~~~~~~~~~~a~~~~~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~- 138 (392) -..+.... -+-+.+.+++ +.++-.-+ .....+..++....-+.+...+.+.....+.-.. T Consensus 1 ~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~ 63 (329) T protein:vir:10 1 MDGIFITG-------VKTMNKEIKNATGKLKLNLQ----------HFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPA 63 (329) T ss_pred CCceEEec-------hhhhhhhhhcccceeEEehh----------hhcCCccCCchhHHHHHHHHHHHHHHHhhceeeee Confidence 00000000 0000011110 00000000 0111122222333333343333333332221111 Q ss_pred hcceeeccCCcceeEEEeecCCccccccc--cccccccccccceeeEEechhheeeehhhHHHHHhhhHH--HHHHHHHH Q lcl|Aclame:pro 139 YVTVEPVRTRSGSRVLEKNSDMIPFAEIT--EMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQ--NILKYVTK 214 (392) Q Consensus 139 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~~--~l~~~v~~ 214 (392) +++...-..+..++.+++.... ...-.. .+-..... ..++...+++-.+.-.+.-=.-. .+++.. .+...+.+ T Consensus 64 ~~N~~~e~~~g~tVkIp~i~~~-gl~DY~R~~g~~~g~v-t~~~~t~tidqdR~~~F~VD~~D-~dEtn~~l~a~~i~~~ 140 (329) T protein:vir:10 64 VISNDAIFMQGRSFTVIKGDVT-ELKDYKRNATNEFDHP-QIQETTYFLDQEKYWGRFVDALD-RRDTEGNIDINYVVAK 140 (329) T ss_pred ecccceeeccCcEEEEeeeccc-ccccccCCCCcccccc-ccceeEEEeecccceeeecchhh-HhhhhhhhhHHHHHHH Confidence 1221111122334555555432 222222 22211111 22344455554444333210000 111111 12233344 Q ss_pred HHHHHHHHHHHHHHhhcc----c---cccccchhhHHHHHHHHHHHhhhcc-cCCceEEEcHHHHHHHHHhhccCCce-e Q lcl|Aclame:pro 215 WLGKKSKVTRNVLILGVI----E---KLTKQAIKSLDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-I 285 (392) Q Consensus 215 ~l~~~~~~~~d~~~~~~~----~---~~~~~~~~~~d~~~~~~~~~~~~~~-~~~a~~v~~~~~~~~L~~lkd~~g~~-l 285 (392) .....+.-.+|...+..+ + +.+.+....|+.+.+++. .++... ..+-+++++|..+..|.+...-.... . T Consensus 141 ~~~~~v~pEiDay~~skla~~a~~~~~~~~t~~nay~~i~~a~~-~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~ 219 (329) T protein:vir:10 141 QASEVVAPYLDNLRFATLARNKAKHLTVGSGADAQYDAVLDVSV-ELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDN 219 (329) T ss_pred HHHHHhhhHHHHHHHHHHHhhcccccccccCHHHHHHHHHHHHH-HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccc Confidence 444455555554432221 1 112233345677777654 344433 33446789999999886421110000 0 Q ss_pred ecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEe Q lcl|Aclame:pro 286 LQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQR 365 (392) Q Consensus 286 ~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r 365 (392) .......+.-..+.|. +|+.+.+.. .....+++|..+ +.....+ --.++........ +...++...+ T Consensus 220 ~~~~~~~g~Vg~idG~-~Ii~vps~~-------~k~in~ii~~~~-A~~~~~K-~~~~~~~~p~~~~---~a~~v~gr~y 286 (329) T protein:vir:10 220 RQQVLGKGVQGELDGF-TIVKVPSKM-------LQGVEAMAVIGE-VMASPIQ-ANEAKLNSNVPGM---FGTLAEQMLY 286 (329) T ss_pred cccceeeeeeeeecCe-EEEEecCCc-------ccceeEEEEcCC-ceeeeee-eeeeeeeCCCCcc---chheeeeeee Confidence 1111223334456665 455443322 223345666654 3322222 1223322211111 2357888889 Q ss_pred eCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 366 DDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 366 ~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +|..|.+|++..+.....++++++..+ T Consensus 287 yd~~V~~~k~~~I~~~~~~a~~~~~~~ 313 (329) T protein:vir:10 287 TGAFVPEHLQKYIFTIGGKEVETNRDG 313 (329) T ss_pred eeeEEEccccCEEEEecccCcccCCCC Confidence 999999999776554333333333333 No 191 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=90.40 E-value=0.021 Score=29.78 Aligned_cols=285 Identities=9% Similarity=0.015 Sum_probs=130.6 Q ss_pred hhHHHHHHHHhhhhhhhhcccc-----ccccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCccc Q lcl|Aclame:pro 89 LNAEEREFLEDDLEQRAMSGLT-----GEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) Q Consensus 89 ~~~~~~~~~~~~~~~~a~~~~~-----~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (392) .....+.........-+-..+. ..+.-+.|.+.+...+.+.+.+.+-+++++++++|.-..+... ....++..+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~-~~~~sg~~t 79 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAID-LRSNRKRHY 79 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEE-EeecCcccc Confidence 1111111111111111111111 1223477777787889999999999999999999976555443 333333322 Q ss_pred cccccccccccccccceeeEEechhheeeehhhHHHHHhhh--HHH-HHHHHHHHHHHHHHHHHHHHHhhccccc----- Q lcl|Aclame:pro 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQN-ILKYVTKWLGKKSKVTRNVLILGVIEKL----- 235 (392) Q Consensus 164 ~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds--~~~-l~~~v~~~l~~~~~~~~d~~~~~~~~~~----- 235 (392) +-....+...+.. ..+.-.+...+.---..|+.+.|+.- .++ |..-+++.+.++++.-.-.-.++|+..+ T Consensus 80 ~r~~t~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~n 157 (343) T protein:vir:98 80 GAHDRRTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSD 157 (343) T ss_pred CccccCCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCC Confidence 2222211111110 01111223333323334555555543 244 7777777777766553322223332111 Q ss_pred -----------------------------------cc-cchhhHHHHHHHHHHHhhhcccCC--ceEEEcHHHHHH--HH Q lcl|Aclame:pro 236 -----------------------------------TK-QAIKSLDDIKDVLNVKLDPAISPN--AILLTNQDGFNY--LD 275 (392) Q Consensus 236 -----------------------------------~~-~~~~~~d~~~~~~~~~~~~~~~~~--a~~v~~~~~~~~--L~ 275 (392) +. +...+.|.++-.+...+++.++.. -+.|+.+...+. +. T Consensus 158 PllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~ 237 (343) T protein:vir:98 158 PNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASL 237 (343) T ss_pred cchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhh Confidence 11 123345666544444667777654 578888887543 22 Q ss_pred HhhccCCceeeccccc--CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhh Q lcl|Aclame:pro 276 KLKDKDGKYILQSDPT--QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) Q Consensus 276 ~lkd~~g~~l~~~~~~--~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f 353 (392) .+. ..+++--..-.. ..... .+|+.|.+.+ +++|.. .+++--|++.-..+.++..+=.+-+... T Consensus 238 l~n-~~~~~ptEk~Aa~~~~~~k-~iGGl~a~~~--PfFP~~-------~llVT~L~NLsIY~Q~gs~RR~~~d~p~--- 303 (343) T protein:vir:98 238 VYK-GNGLIATEKAALNTHDLMK-SFGGMPAMIV--PNMPPR-------AAIVTSLSNLSIYTQEGSMRRGMKDDDD--- 303 (343) T ss_pred hhh-hcCCChHHHHHHHHHHHHH-hhCCCeeEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEeccc--- Confidence 222 222210000000 00112 3466666543 455543 3677777775444555555544443322 Q ss_pred hcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 354 TRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 354 ~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ++.+--.=..--|..|-++.+++.++--.-+.. ..+| T Consensus 304 -r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~-~~~g 340 (343) T protein:vir:98 304 -KKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLS-SGKG 340 (343) T ss_pred -cccccchhhhcceeeeeccccEEEeeeeeeeec-CCCC Confidence 223322222334667777777777652222221 1334 No 192 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=90.21 E-value=0.022 Score=29.67 Aligned_cols=274 Identities=12% Similarity=0.010 Sum_probs=117.6 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHh--hhhhhhhcceeeccC Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS--FDALEQYVTVEPVRT 147 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~--~~~l~~l~~~~~~~~ 147 (392) ...+.+...+-.+.+.... .....+...-...+..+-.+||++-=+.+..+|..+... .-.+++-+...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-----e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:95 1 MTIEKNLSDVQQKYADQFQ-----EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHHHHhhhh-----HHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 1111111111111111110 000111100001111233445666555665555444332 223455556666666 Q ss_pred CcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRN 225 (392) Q Consensus 148 ~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~~l~~~~~~~~d 225 (392) ....|..-..-++ ....++.|++... .+++.+.+.....+-++...-+|.-+ +.++..+....+.++-.-.++.+++ T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE 154 (463) T protein:vir:95 76 TVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) T ss_pred hhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHH Confidence 5555544333333 5667899987655 67899999999999998888888654 4555667888888888899999999 Q ss_pred HHHhhccccccccc---hhhHHHHHHHHHH---------------------HhhhcccCCceEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 226 VLILGVIEKLTKQA---IKSLDDIKDVLNV---------------------KLDPAISPNAILLTNQDGFNYLDKLKDKD 281 (392) Q Consensus 226 ~~~~~~~~~~~~~~---~~~~d~~~~~~~~---------------------~~~~~~~~~a~~v~~~~~~~~L~~lkd~~ 281 (392) .+.+.|...-.+.+ +..+|.+.++|.. ...-+|....-++|+.-+.+.|..---.. T Consensus 155 ~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~ 234 (463) T protein:vir:95 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) T ss_pred HHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCc Confidence 99999877655532 2345555544311 11112222223445555544443211111 Q ss_pred CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEE Q lcl|Aclame:pro 282 GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLR 361 (392) Q Consensus 282 g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~ 361 (392) -|.+..++.... ..|. +| ..+ ...++.+.+.-+..- .+- T Consensus 235 qrv~~~~N~~~~----~~G~-~v-------------------------~~f--~s~~G~I~L~~s~~m-----~~~---- 273 (463) T protein:vir:95 235 QMQLMQDNSGNV----NTGY-SV-------------------------NGF--YSSRGFIKLHGSTVM-----ENE---- 273 (463) T ss_pred eEEEEcCCCCce----eeee-ec-------------------------cce--eeeeeeeeeCCceec-----CCc---- Confidence 111211111100 1110 00 000 111112211110000 000 Q ss_pred EEEeeCc-EEecccceEEEEecccCCCC-----------------------------------CCC----C Q lcl|Aclame:pro 362 AIQRDDV-QMWDNEAAVYGEIDLSAPVE-----------------------------------QPQ----G 392 (392) Q Consensus 362 ~~~r~~~-~v~~~~af~~l~~~~~a~~~-----------------------------------~~~----~ 392 (392) .++|. ....|+||+...++++.-++ |++ | T Consensus 274 --~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~g 342 (463) T protein:vir:95 274 --LILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDG 342 (463) T ss_pred --ccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeeeccce Confidence 00010 01223333322222211110 010 1 No 193 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=90.21 E-value=0.022 Score=29.67 Aligned_cols=274 Identities=12% Similarity=0.010 Sum_probs=117.6 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHh--hhhhhhhcceeeccC Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARS--FDALEQYVTVEPVRT 147 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~--~~~l~~l~~~~~~~~ 147 (392) ...+.+...+-.+.+.... .....+...-...+..+-.+||++-=+.+..+|..+... .-.+++-+...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-----e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:99 1 MTIEKNLSDVQQKYADQFQ-----EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHHHHhhhh-----HHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 1111111111111111110 000111100001111233445666555665555444332 223455556666666 Q ss_pred CcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 RSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRN 225 (392) Q Consensus 148 ~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~~l~~~~~~~~d 225 (392) ....|..-..-++ ....++.|++... .+++.+.+.....+-++...-+|.-+ +.++..+....+.++-.-.++.+++ T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE 154 (463) T protein:vir:99 76 TVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) T ss_pred hhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHH Confidence 5555544333333 5667899987655 67899999999999998888888654 4555667888888888899999999 Q ss_pred HHHhhccccccccc---hhhHHHHHHHHHH---------------------HhhhcccCCceEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 226 VLILGVIEKLTKQA---IKSLDDIKDVLNV---------------------KLDPAISPNAILLTNQDGFNYLDKLKDKD 281 (392) Q Consensus 226 ~~~~~~~~~~~~~~---~~~~d~~~~~~~~---------------------~~~~~~~~~a~~v~~~~~~~~L~~lkd~~ 281 (392) .+.+.|...-.+.+ +..+|.+.++|.. ...-+|....-++|+.-+.+.|..---.. T Consensus 155 ~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~ 234 (463) T protein:vir:99 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) T ss_pred HHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCc Confidence 99999877655532 2345555544311 11112222223445555544443211111 Q ss_pred CceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEE Q lcl|Aclame:pro 282 GKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLR 361 (392) Q Consensus 282 g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~ 361 (392) -|.+..++.... ..|. +| ..+ ...++.+.+.-+..- .+- T Consensus 235 qrv~~~~N~~~~----~~G~-~v-------------------------~~f--~s~~G~I~L~~s~~m-----~~~---- 273 (463) T protein:vir:99 235 QMQLMQDNSGNV----NTGY-SV-------------------------NGF--YSSRGFIKLHGSTVM-----ENE---- 273 (463) T ss_pred eEEEEcCCCCce----eeee-ec-------------------------cce--eeeeeeeeeCCceec-----CCc---- Confidence 111211111100 1110 00 000 111112211110000 000 Q ss_pred EEEeeCc-EEecccceEEEEecccCCCC-----------------------------------CCC----C Q lcl|Aclame:pro 362 AIQRDDV-QMWDNEAAVYGEIDLSAPVE-----------------------------------QPQ----G 392 (392) Q Consensus 362 ~~~r~~~-~v~~~~af~~l~~~~~a~~~-----------------------------------~~~----~ 392 (392) .++|. ....|+||+...++++.-++ |++ | T Consensus 274 --~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~g 342 (463) T protein:vir:99 274 --LILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDG 342 (463) T ss_pred --ccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeeeccce Confidence 00010 01223333322222211110 010 1 No 194 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=90.01 E-value=0.023 Score=29.55 Aligned_cols=266 Identities=10% Similarity=0.003 Sum_probs=114.0 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc--CCcc-eeEEEeecCCccccccccccccccccccceee Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR--TRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSN 182 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 182 (392) |. .-....+.|+-+..++++.+++..++.+++..-.-. ...| ++.+++.. .. -+.++..... +..+-.. T Consensus 1 m~---~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~-~~---~v~dg~~~~~-~~~te~~ 72 (418) T protein:vir:10 1 MA---VQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPY-RV---KSASGRTLVK-QPMVDQT 72 (418) T ss_pred CC---ccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCC-ce---eecccCCccc-cccccce Confidence 11 123445669889999999999999988887652211 1122 33333321 11 1222222221 2233344 Q ss_pred EEechhh-eeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------cccchhhHHHHHHHHHHHhh Q lcl|Aclame:pro 183 VQYAVKD-RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL------TKQAIKSLDDIKDVLNVKLD 255 (392) Q Consensus 183 v~~~~~~-i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~------~~~~~~~~d~~~~~~~~~~~ 255 (392) +.+...+ .+.-+.|+.+-...+..++..-+.+....+++..+|..+....... .......|++++++ ...++ T Consensus 73 v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a-~~~Ld 151 (418) T protein:vir:10 73 IPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRPGAFIDFANA-GAKQT 151 (418) T ss_pred EEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCcchHHHHHHH-HHHHH Confidence 4444322 2344456655555556678777778888888998888765432111 11223458888885 45666 Q ss_pred hcccCC-c--eEEEcHHHHHHHHHhhccCCceeeccc-----ccCCcccceecccceEEecCcccccccccC-CcceEEE Q lcl|Aclame:pro 256 PAISPN-A--ILLTNQDGFNYLDKLKDKDGKYILQSD-----PTQKNKKLFAGTNPVVVVSNRFLKSKGTTA-KKAPLII 326 (392) Q Consensus 256 ~~~~~~-a--~~v~~~~~~~~L~~lkd~~g~~l~~~~-----~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~-~~~~~~~ 326 (392) ....|. + ..|++|..+..|.+ +.. .++... +..+.-..+.|.. |+.+ + .+|...... .....+. T Consensus 152 ~~~VP~~G~R~lVv~P~~~~~L~~--~~~--~~~~~~~~~~~lr~G~IG~i~GF~-V~~S-~-nip~~tag~~~~t~~v~ 224 (418) T protein:vir:10 152 TYAVPQDGMRHAVLDPFTCASLSD--EVT--KLFKESMVEQAYKMGYRGNVAAYE-VYES-Q-NLPKHTVGDHGGTPLVN 224 (418) T ss_pred hcCCCCCCceEEEeCHHHHHHHhh--hcc--ccccccccchhhheeeeeeeeceE-EEEe-c-CCCcccccccccceeee Confidence 666663 3 45899999877743 222 222221 2234445677764 4443 2 333222111 1222333 Q ss_pred EehhhceeeeeccceEEEEeccchhhhhcCc-eeEEEEE---eeCcEE-ecccceEE-------------EEeccc---- Q lcl|Aclame:pro 327 GDLKEAIVLFKREDMELASTDVGGKAFTRNT-LDLRAIQ---RDDVQM-WDNEAAVY-------------GEIDLS---- 384 (392) Q Consensus 327 Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~-~~~~~~~---r~~~~v-~~~~af~~-------------l~~~~~---- 384 (392) |-...+..+....+ ....++ ....+. +.|-+.. ++...+ .++.-|++ +++.++ T Consensus 225 ga~~~~~~~~~~~~----t~s~~g-~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~ 299 (418) T protein:vir:10 225 GTVVNGDTVGFDGG----TASTTG-FLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDG 299 (418) T ss_pred cccccceeEEEeec----ceeecc-ceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccc Confidence 33322221111000 000000 000011 1111100 000000 01111111 111100 Q ss_pred ------------------CCCCCCCC Q lcl|Aclame:pro 385 ------------------APVEQPQG 392 (392) Q Consensus 385 ------------------a~~~~~~~ 392 (392) ...++|+. T Consensus 300 ~~~~~~~~~~~~~~~~~~~v~a~~a~ 325 (418) T protein:vir:10 300 TATINNENGDPVSLTAYQNVTALPAD 325 (418) T ss_pred cccccccccccccccCCCcccccccC Confidence 00011111 No 195 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=89.90 E-value=0.024 Score=29.49 Aligned_cols=267 Identities=13% Similarity=0.067 Sum_probs=102.3 Q ss_pred hhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcce---------eeccCCcceeEEEeecCCcccccccccc Q lcl|Aclame:pro 100 DLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV---------EPVRTRSGSRVLEKNSDMIPFAEITEMG 170 (392) Q Consensus 100 ~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~E~~ 170 (392) +..-.+.+. -.-.++|+.+...+.+...+.+.|++=.-+ ...++...++|+....++....+.+... T Consensus 1 M~~~~~~T~----l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~ 76 (367) T protein:vir:80 1 MPDFNNQVR----LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) T ss_pred Ccchhhhhh----hhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCC Confidence 000000000 011234444433333333333332211000 0123444455555444332222211111 Q ss_pred --ccccccccceee--EEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHH--------HHHHHHHHhhcc------ Q lcl|Aclame:pro 171 --EIPETDNPKFSN--VQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKS--------KVTRNVLILGVI------ 232 (392) Q Consensus 171 --~~~~~~~~~~~~--v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~--------~~~~d~~~~~~~------ 232 (392) +.+....-+..+ +.+...+--....++..+-- + +....|.++++.-- ...+...|-... T Consensus 77 ~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG-~--dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~ 153 (367) T protein:vir:80 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG-S--NPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFAT 153 (367) T ss_pred cccccccccccchheeeeehhcccchhhhHHHHhhC-c--hHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhh Confidence 111111112222 22233333333345554432 2 33444555544333 333322221100 Q ss_pred -----------------------cc-ccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhh------ccCC Q lcl|Aclame:pro 233 -----------------------EK-LTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLK------DKDG 282 (392) Q Consensus 233 -----------------------~~-~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lk------d~~g 282 (392) +. ..+.....++.++++.. .+-.....-+.++||+..+..|++++ +++| T Consensus 154 ~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~-~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~ 232 (367) T protein:vir:80 154 IKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAF-TMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG 232 (367) T ss_pred hhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHH-HhccccccccEEEEchHHHHHHHhccccccccCCCC Confidence 00 01123345677777754 33334445678999999999998764 3333 Q ss_pred ceeecccccCCcccceecccceEEecCcccccccccCC-c-ceEEEEehhhceeeeeccc-eEEEEeccchhhhh-cCce Q lcl|Aclame:pro 283 KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAK-K-APLIIGDLKEAIVLFKRED-MELASTDVGGKAFT-RNTL 358 (392) Q Consensus 283 ~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~-~-~~~~~Gd~~~~~~~~~~~~-~~~~~~~~~~~~f~-~~~~ 358 (392) ...-.++.| ++|++ ++ -+|..+..+. . .+++||.= ++...+..+ +-+++.+.. .-. .+.. T Consensus 233 ---------~~~i~ty~G-~~VIv-DD-~~Pv~~~~a~~~yttYlfg~G--Ai~~~~~~~~~~~E~~Rd~--~~~~~gG~ 296 (367) T protein:vir:80 233 ---------QLTIPTYMG-KVVIV-DD-GMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRE--LRGNGSGL 296 (367) T ss_pred ---------ccccceecc-eeEEE-eC-CCcccccCCCceEEEEEEecc--eeeecccCCccceecccch--hhhcCCce Confidence 122345555 45544 33 3444332222 2 34667642 222222111 112333321 111 1223 Q ss_pred eEEEEEeeCcEEecccceEEEEecccCCCCC--CCC Q lcl|Aclame:pro 359 DLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQ--PQG 392 (392) Q Consensus 359 ~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~--~~~ 392 (392) -+....|. .+.||.+|....-.-++|+-+ +.| T Consensus 297 d~L~~Rr~--~~~hP~G~s~~~~~v~~~~~~~~~~~ 330 (367) T protein:vir:80 297 EYILERKE--WIVHPGGFNWLDADVTIPDNTGSPSG 330 (367) T ss_pred EEEEeeee--EEeecceeeecccccccccccccccc Confidence 33333333 688998887665443333311 111 No 196 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=88.15 E-value=0.034 Score=28.63 Aligned_cols=288 Identities=14% Similarity=0.133 Sum_probs=129.3 Q ss_pred cchhhHHHHH---HHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 70 VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 70 ~~~~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ....-+-.++ |...+.....+.+.+......+..++. +.++-...+|..+...|-..+..+.++++...+..++ T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GV---tiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~ 77 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGV---TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 77 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCc---eeeccchhccHHHHHHHHHhhhccCcceeeeeeccch Confidence 1122222222 333333333333455544444444333 2334455689888888999999999998865554443 Q ss_pred CCcceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHH--HHHhhhHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 147 TRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSR--SLLQDSDQNILKYVTKWLGKKSK-VT 223 (392) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~--e~l~ds~~~l~~~v~~~l~~~~~-~~ 223 (392) -.... . ...+...+.-.-.|..+++. ..+|.--++.+.-++....+-. .-+..+-..|..++..+|+.++. +. T Consensus 78 ~~~V~--~-s~~s~AeAq~HkdGqTK~eq-a~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~ 153 (318) T protein:vir:86 78 ALLVS--R-SFDSSAEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 153 (318) T ss_pred hhhhh--h-hhhhhhhhhhhccCCccccc-eeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Confidence 22111 1 11222333333344445543 3555555555544443333311 12333444678999999999998 78 Q ss_pred HHHHHhhccccccccchhhHHHHHHH------------------HHHHhhhcccCCc---eEEEcHHHH-HHHHHhhccC Q lcl|Aclame:pro 224 RNVLILGVIEKLTKQAIKSLDDIKDV------------------LNVKLDPAISPNA---ILLTNQDGF-NYLDKLKDKD 281 (392) Q Consensus 224 ~d~~~~~~~~~~~~~~~~~~d~~~~~------------------~~~~~~~~~~~~a---~~v~~~~~~-~~L~~lkd~~ 281 (392) .|.++.-|.|.++-.......++... +...++- .++.+ .++....+. +-|..|+.+. T Consensus 154 Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdf-vrptagrrylivkaedrkalldelrqat 232 (318) T protein:vir:86 154 VDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDF-VRPTAGRRYLIVKAEDRKALLDELRQAT 232 (318) T ss_pred HHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhh-hccCCCceEEEEeecchHHHHHHHHhhc Confidence 89999889888774443333333221 1111110 01111 233333333 2334444332 Q ss_pred Cce--eecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCcee Q lcl|Aclame:pro 282 GKY--ILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) Q Consensus 282 g~~--l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) .+. -+..+.+ ....-.|...+++. .+..+-...+ +-|-+ |.+-+ ..++ .. ..-.|.+|... T Consensus 233 anahvriknddt--eiasevgvdeiivy-------tgskalkptv-lvdqk--yhidm-qdlt--kv--dafewktnsnm 295 (318) T protein:vir:86 233 ANAHVRIKNDDT--EIASEVGVDEIIVY-------TGSKALKPTV-LVDQK--YHIDM-QDLT--KV--DAFEWKTNSNM 295 (318) T ss_pred ccceeEEeccch--hhhhhcCcceeeee-------ecccccccee-eeccc--eecch-hhhh--hh--hcceeccCCce Confidence 221 1111111 11111232222221 1112222222 22332 22221 1221 01 11234566666 Q ss_pred EEEEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 360 LRAIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 360 ~~~~~r~~~~v~~~~af~~l~~~ 382 (392) +.++.-..+.|---+|-.++++. T Consensus 296 ilvetltsghvetynagavitvs 318 (318) T protein:vir:86 296 ILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EEEeecccCcceeecCceeEEeC Confidence 66666666655555555555554 No 197 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=87.06 E-value=0.041 Score=28.17 Aligned_cols=270 Identities=12% Similarity=0.020 Sum_probs=102.9 Q ss_pred hccccccccceecch--hhhhHHHHhHHhhhhhhhhcceee----------ccCCcceeEEEeecC-Cccccccccc--c Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQ--DIQTQINELARSFDALEQYVTVEP----------VRTRSGSRVLEKNSD-MIPFAEITEM--G 170 (392) Q Consensus 106 ~~~~~~~~gg~~iP~--~~~~~ii~~~~~~~~l~~l~~~~~----------~~~~~~~~~~~~~~~-~~~~~~~~E~--~ 170 (392) |. ++.-.-..+|. .+...+.+...+.+.|++ ..++. .++...++|+....+ ..+..+-..+ + T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~q-SGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~ 77 (349) T protein:vir:78 1 MA--ITTIGDIVTGNIPVLASYMTEDPVEKTAFFD-SGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CC--ceEEeeeeccCHHHHHHHHHHhhHHhhhhhh-ccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 11 11112234554 354445555444444433 11111 233444555543322 2222211111 1 Q ss_pred ccccccccceeeEEechhhee--eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------c------ Q lcl|Aclame:pro 171 EIPETDNPKFSNVQYAVKDRA--GILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE--------K------ 234 (392) Q Consensus 171 ~~~~~~~~~~~~v~~~~~~i~--~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~--------~------ 234 (392) ..+....-+..++-...+.-. ..-.++.++-- + +....|.+++++-..+...+.++..+. . T Consensus 78 ~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG-~--dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~ 154 (349) T protein:vir:78 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTS-Q--NPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHE 154 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhC-c--hHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhh Confidence 111111112333333222222 22334544432 2 334555555554443333332222111 0 Q ss_pred -------ccccchhhHHHHHHHHHHHhhhc----ccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccc Q lcl|Aclame:pro 235 -------LTKQAIKSLDDIKDVLNVKLDPA----ISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNP 303 (392) Q Consensus 235 -------~~~~~~~~~d~~~~~~~~~~~~~----~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~p 303 (392) ....+..++..++++..+.-+.. ...-+.++||+.++..|++.+--. | +++.-....-.+++|. + T Consensus 155 ~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty~G~-~ 230 (349) T protein:vir:78 155 QNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATYQGY-R 230 (349) T ss_pred cccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh--h-ccCcccCcccceecCe-E Confidence 11222345566666654332322 223358999999999998653211 0 1111111223455664 4 Q ss_pred eEEecCcccccccccC-Cc-ceEEEEehhhceeeeeccc-eEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEE Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTTA-KK-APLIIGDLKEAIVLFKRED-MELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE 380 (392) Q Consensus 304 v~~~~~~~~~~~~~~~-~~-~~~~~Gd~~~~~~~~~~~~-~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~ 380 (392) |++. + .+|..+..+ +. .+++||. .++...+... ..++..+.....-..++-.+....|+ +.||.+|..-. T Consensus 231 VivD-D-~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s~~~ 303 (349) T protein:vir:78 231 VIVD-D-SMTVVGQGAQRKFISIIFGQ--GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYRFTS 303 (349) T ss_pred EEEe-C-CCccccCCCCceEEEEEeec--ceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---Eeeeeeeeecc Confidence 5443 3 334332222 22 3467774 2332322221 22333332211111234455555444 56676665444 Q ss_pred ecccCCC-----CCCCC Q lcl|Aclame:pro 381 IDLSAPV-----EQPQG 392 (392) Q Consensus 381 ~~~~a~~-----~~~~~ 392 (392) -..+.+. .+|-= T Consensus 304 a~v~~~~~~~~~~sPt~ 320 (349) T protein:vir:78 304 AVITGNGTETIARSASW 320 (349) T ss_pred ccccCCccccccCCCCh Confidence 2222111 11111 No 198 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=83.74 E-value=0.066 Score=27.06 Aligned_cols=260 Identities=12% Similarity=0.052 Sum_probs=86.5 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc--C--CcceeEE---EeecCCcccccccccccccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR--T--RSGSRVL---EKNSDMIPFAEITEMGEIPETDNP 178 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~--~--~~~~~~~---~~~~~~~~~~~~~E~~~~~~~~~~ 178 (392) +..+.-++-- +--+.+....++.+.+...+++-+..-.+. . -.|.+.. ....+....--+...+......-- T Consensus 1 ~~~t~~sdl~-vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSDLV-IYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeeccee-eehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 2211111111 122334445556666555555433221111 0 0111111 000000000011111111111101 Q ss_pred ceeeEEechhheeeehhh--HHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhc----ccc------ccccchhhH Q lcl|Aclame:pro 179 KFSNVQYAVKDRAGILPL--SRSLLQ---DSDQNILKYVTKWLGKKSKVTRNVLILGV----IEK------LTKQAIKSL 243 (392) Q Consensus 179 ~~~~v~~~~~~i~~~~~i--S~e~l~---ds~~~l~~~v~~~l~~~~~~~~d~~~~~~----~~~------~~~~~~~~~ 243 (392) +...+-++. ..+.-++ +...+. +..-....-|...+..+..+.+-...+.+ .+. ....+.... T Consensus 80 ~~~dvaVk~--~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~~ 157 (315) T protein:vir:96 80 ADEMVSVKV--PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEGK 157 (315) T ss_pred cccceeEEE--eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccCH Confidence 122222221 1133333 333333 11111112222222222222111111111 111 112334456 Q ss_pred HHHHHHHHHHhhhcccCCceEEEcHHHHHHHHHhhccCCceeeccccc--CCcccceecccceEEecCcccccccccCCc Q lcl|Aclame:pro 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT--QKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) Q Consensus 244 d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~--~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~ 321 (392) ..+.++..+ +-.....-+.|+||..++..|.+ +.=. ..++..... -+.+.+.+| ++|+|.+ -+|.. T Consensus 158 ~~l~dA~~k-lGD~~~~l~~~vMHS~v~~~L~~-q~L~-~~~~~~~~~~~~~~~~~~lG-krViVdD--~~P~~------ 225 (315) T protein:vir:96 158 KVLTKGLRT-MGDKASSIAIWVMDSTSYFDIVD-EAID-NKLYEEAGVVVYGGTPGTLG-KPVLVTD--QCPAT------ 225 (315) T ss_pred HHHHHHHHH-hcccccCeeEEEEchHHHHHHHH-hhhh-hhcccccceeEecCcCcccc-cEEEEEC--CCCcc------ Confidence 677777665 33333444689999999999976 2111 122211100 011234566 4566643 23331 Q ss_pred ceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCc-EEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 322 APLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 322 ~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~-~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) ..+.||. .++.+.+...+.....+. .++-.+....|..+ -+++|.+|..-+ +.. .+|-- T Consensus 226 ~~~gl~~--GAi~~~~~~~~~~~~~~~------~g~e~l~~~~r~e~tf~l~p~G~sw~~---~~~-~sPt~ 285 (315) T protein:vir:96 226 KIFGLVA--GAVMITESQAPGMRSYQI------DDQENLAIGFRAEGTANVEVLGYKWKT---KTN-VNPAS 285 (315) T ss_pred eeeeeec--ceeeecCCCccccccccC------CCcceeEEEEeeeeEeeeeeeeEEeec---CCC-cCCCh Confidence 1122222 122222222221111111 12233334444444 467777776532 222 22322 No 199 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=83.58 E-value=0.067 Score=27.01 Aligned_cols=285 Identities=14% Similarity=0.127 Sum_probs=126.3 Q ss_pred cchhhHHHHHH---HHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc Q lcl|Aclame:pro 70 VDGEMEYRDVF---MKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR 146 (392) Q Consensus 70 ~~~~~~~~~a~---~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 146 (392) ....-+..++. ...++....+.+-........ +-+..+..+..+-+|..+...|-..+....|++....+..++ T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnakl---aengvtitdttfqlprklvesintallntnpvfkvfhvtnvg 77 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKL---AENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 77 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhh---hhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhh Confidence 11222222222 222222222222222222222 233344445566688888888888888888887766554443 Q ss_pred CCcceeEEEee-cCCccccccccccccccccccceeeEEechhheeeehhhHHH--HHhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 147 TRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRS--LLQDSDQNILKYVTKWLGKKSKVT 223 (392) Q Consensus 147 ~~~~~~~~~~~-~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e--~l~ds~~~l~~~v~~~l~~~~~~~ 223 (392) .+...+. .+...+.....|..+.+ ...++.-=++.|--++.+..+... -|++|-..|...+..+|..++..+ T Consensus 78 ----allvsrsfdssneaqvhkdgqtkte-qaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnk 152 (318) T protein:vir:94 78 ----ALLVSRSFDSSNEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 152 (318) T ss_pred ----heeeeccccccchhhhhcccccccc-cceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhh Confidence 2222222 33334444444544544 335555556667666666655543 356666677888888888887765 Q ss_pred H-HHHHhhccccccccchh----------------------hHHHHHHHHHHHhhhcccCCceEEEcHHHHH-HHHHhhc Q lcl|Aclame:pro 224 R-NVLILGVIEKLTKQAIK----------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFN-YLDKLKD 279 (392) Q Consensus 224 ~-d~~~~~~~~~~~~~~~~----------------------~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~L~~lkd 279 (392) + |-++..|.|+++-.... ..|.+..+.....+.+.+ ..++....+.. -|..|+. T Consensus 153 ivdlalvegdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagr--rylivktedrkalldelrq 230 (318) T protein:vir:94 153 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKTEDRKALLDELRQ 230 (318) T ss_pred hhheeeeecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCc--eEEEEeccchHHHHHHHHh Confidence 5 45555666655432221 123333332222222221 12344444433 3444443 Q ss_pred cCC--c-eeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcC Q lcl|Aclame:pro 280 KDG--K-YILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRN 356 (392) Q Consensus 280 ~~g--~-~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~ 356 (392) +.. + .|-.+ .+. ...-.|...+++. .+..+-... ++-|-+ |.+-+ .+++ ..+ .-.|.+| T Consensus 231 atananvriknd-dte--iasevgvdeiivy-------tgskavkpt-vlvdqk--yhidm-qdlt--kvd--afewktn 292 (318) T protein:vir:94 231 ATANANVRIKND-DTE--IASEVGVDEIIVY-------TGSKAVKPT-VLVDQK--YHIDM-QDLT--KVD--AFEWKTN 292 (318) T ss_pred hhcccceEEecc-chh--hhhhcCcceeEEe-------eccccccce-eEeccc--eecch-hhhh--hhh--ceeeccC Confidence 322 1 12111 111 1112233222221 111121222 233432 22222 2221 111 1234566 Q ss_pred ceeEEEEEeeCcEEecccceEEEEec Q lcl|Aclame:pro 357 TLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) Q Consensus 357 ~~~~~~~~r~~~~v~~~~af~~l~~~ 382 (392) ...+.++.-..+.+---+|-+++++. T Consensus 293 snmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 293 SNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred CceEEEEecccCcceeecCceeEEeC Confidence 66666666666655555555555554 No 200 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=83.38 E-value=0.069 Score=26.95 Aligned_cols=286 Identities=7% Similarity=-0.062 Sum_probs=119.9 Q ss_pred cchhhHHHHHHHHHHhcchhhHHH---------HHHHHhhhhhhhhccccccccceecchhhh----hHHHHhHHhhhhh Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEE---------REFLEDDLEQRAMSGLTGEDGGLVIPQDIQ----TQINELARSFDAL 136 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~---------~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~----~~ii~~~~~~~~l 136 (392) ......++. .-+-+-.-... ...+.+..... ...+++.+| ||+.+. +.+++.+...... T Consensus 1 ~~~~~~~~~----l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~--~~~~~~~~~--~~~~l~~~i~p~~~~~~~~~~~~ 72 (336) T protein:vir:36 1 MRDAQRIQN----LARAGVILPRSVQNVSTPLTEYAMDAADLSP--HLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKA 72 (336) T ss_pred CchHHHHHH----HhhcCeeecchhhhhhhHHHHhhhhhhhccC--ccccCCCcc--hHHHHHHhhccceEeeecchhhh Confidence 111111100 00000000000 00011100000 111122222 555433 3567777777777 Q ss_pred hhhcceeeccCCc-ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHH Q lcl|Aclame:pro 137 EQYVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYV 212 (392) Q Consensus 137 ~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v 212 (392) ..|+.+.+++.-. ....+......+.+.+.+-+++.|-. +.....-+-+.+.++..+.++..-+..+. .++.+-- T Consensus 73 ~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~-d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~K 151 (336) T protein:vir:36 73 AELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASEL 151 (336) T ss_pred hhhccccccCCccceeEEEeeeeceeeEEEeeccCCCcee-ecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHH Confidence 7777776554321 12233333444556677777777643 34555566677778888888844444332 2455555 Q ss_pred HHHHHHHHHHHHHHHHhhccccc------------------c-----ccchhhHHHHHHHHHHHhh-hcc----cCCceE Q lcl|Aclame:pro 213 TKWLGKKSKVTRNVLILGVIEKL------------------T-----KQAIKSLDDIKDVLNVKLD-PAI----SPNAIL 264 (392) Q Consensus 213 ~~~l~~~~~~~~d~~~~~~~~~~------------------~-----~~~~~~~d~~~~~~~~~~~-~~~----~~~a~~ 264 (392) +...++++...+|+-.+.|.... + ++....++|+..++..... ... .....+ T Consensus 152 a~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL 231 (336) T protein:vir:36 152 NYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRM 231 (336) T ss_pred HHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEE Confidence 55566666666665444432210 0 0112234555544432221 111 234578 Q ss_pred EEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEE Q lcl|Aclame:pro 265 LTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) Q Consensus 265 v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~ 344 (392) +|.++.+..|.+- +..|.-++. -.... +|-+.++..+-+ .++.+....++-+--. ......+. T Consensus 232 ~LP~~~~~~Ls~~-n~~g~Tvl~-~lk~n-------~Pnl~i~t~pEl---~~a~g~~~~l~~~~~~-----~~~t~~~~ 294 (336) T protein:vir:36 232 GLPPTAMSDLSKT-NQYGLAAAA-KLKDI-------FPKLEFVTIPEY---DTASGRLVQLWAPRVE-----GKDTATCG 294 (336) T ss_pred EechHHHHhccCC-CccCccHHH-HHHHh-------cCccEEEEcccc---ccCCCceEEEEEEecC-----CCcceeee Confidence 9999988888532 333322221 01111 111333332222 2222333333321100 00111111 Q ss_pred Eec---cchhhhhcCceeEEEEEeeCc-EEecccceEEEE-e Q lcl|Aclame:pro 345 STD---VGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGE-I 381 (392) Q Consensus 345 ~~~---~~~~~f~~~~~~~~~~~r~~~-~v~~~~af~~l~-~ 381 (392) +.. ....-...-.+..-+..|.+| .+.+|.||++++ + T Consensus 295 ~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 100 000000112245566777666 555799999887 4 No 201 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=79.11 E-value=0.11 Score=25.89 Aligned_cols=268 Identities=14% Similarity=0.100 Sum_probs=123.5 Q ss_pred ccccccccceecchhhhhHHHHhHHhhhhhhhhcc-eeeccCCcceeEEEeecCCccccccccccccccccccceeeEEe Q lcl|Aclame:pro 107 SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVT-VEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) Q Consensus 107 ~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~ 185 (392) ...++..-.+.+..+++..|...+++...=..+.+ +.-.+++ -++.++ ..+.+...-..|.+... .....-++|++ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G-~~L~I~-tiGs~~~~~~~E~~~~~-~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSG-ETLHIK-TIGSVTLQEAEEDTPLI-YNPIETGEITF 77 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCC-CEEEec-ccCceeeeccccCCCee-ecccccceEEE Confidence 23344344444556666777777776544334444 3333332 244443 33444444555555544 33345578888 Q ss_pred chhheeeeh-hhHHHHHhhhHH--HHHHHHHHHHHHHHHHHHHHHHhh----------------ccccc----cccchhh Q lcl|Aclame:pro 186 AVKDRAGIL-PLSRSLLQDSDQ--NILKYVTKWLGKKSKVTRNVLILG----------------VIEKL----TKQAIKS 242 (392) Q Consensus 186 ~~~~i~~~~-~iS~e~l~ds~~--~l~~~v~~~l~~~~~~~~d~~~~~----------------~~~~~----~~~~~~~ 242 (392) ....+++-. .||+.+-+|+-. .+.+.+.-+-++++-...+..++. |..-. .+.+... T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~ 157 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFA 157 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceeh Confidence 888887665 799999998632 233333333344444444433322 11100 0111112 Q ss_pred HHHHHHHHHHHhhhccc--CCceEEEcHHHHHHHHHhh------ccCCceeecccccCCcc--cceecccceEEecCccc Q lcl|Aclame:pro 243 LDDIKDVLNVKLDPAIS--PNAILLTNQDGFNYLDKLK------DKDGKYILQSDPTQKNK--KLFAGTNPVVVVSNRFL 312 (392) Q Consensus 243 ~d~~~~~~~~~~~~~~~--~~a~~v~~~~~~~~L~~lk------d~~g~~l~~~~~~~~~~--~~~~g~~pv~~~~~~~~ 312 (392) ..++.. +....+.+.. ..-++++.|.....|..+. ..+|+.|+..++.-+.. ..++|-. +.+++..- T Consensus 158 ~~~~~~-~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~D--i~~SN~L~ 234 (313) T protein:vir:95 158 LKHLIA-MRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWD--ILTSNRLH 234 (313) T ss_pred hhHHHH-hhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhh--hhhhhhhh Confidence 233333 3333332222 3347899999988887663 23467776554433221 1123321 22333222 Q ss_pred cc---ccccCCcceEEEEehhh--------ceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEe Q lcl|Aclame:pro 313 KS---KGTTAKKAPLIIGDLKE--------AIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) Q Consensus 313 ~~---~~~~~~~~~~~~Gd~~~--------~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~ 381 (392) .. ++..++ ..++|++=. -++...|+..+-+-.. .++-..+... ..+|.|+++.+.+-.+.+-- T Consensus 235 ~AN~~D~~tT~--~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~--~~~~~~~~~~--~~~R~G~Gi~R~~~L~~~~~ 308 (313) T protein:vir:95 235 VANYNDGTTTG--NGYVGNLFMCILDDQTKPIMGAWRRMPKSEGER--NKDRARDEHV--VRCRYGFGIQRLDTLGLLAT 308 (313) T ss_pred hcccccccccc--Cceeeeeeeeeecccccceeeeecccccccccc--ccccccccce--eeeeecccceeecceeEEEe Confidence 11 122222 223443311 1111111111111111 1111122233 45788999998888876664 Q ss_pred cccCC Q lcl|Aclame:pro 382 DLSAP 386 (392) Q Consensus 382 ~~~a~ 386 (392) .+++- T Consensus 309 ~A~~~ 313 (313) T protein:vir:95 309 SATAY 313 (313) T ss_pred ccccC Confidence 54444 No 202 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=78.83 E-value=0.11 Score=25.83 Aligned_cols=286 Identities=7% Similarity=-0.070 Sum_probs=121.7 Q ss_pred cchhhHHHHHHHHHHhcchh-hH--------HHHHHHHhhhhhhhhccccccccceecchhh----hhHHHHhHHhhhhh Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPL-NA--------EEREFLEDDLEQRAMSGLTGEDGGLVIPQDI----QTQINELARSFDAL 136 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~-~~--------~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~----~~~ii~~~~~~~~l 136 (392) ......++. .-+-+-. .. .....+.+..... ...+. +...||.-+ .+.+++.+...... T Consensus 1 ~~~~~~~~~----l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~--~~~~~--~~~~i~~~l~~~i~p~~~~~~~~p~~a 72 (336) T protein:vir:10 1 MRDAQRIQN----LARAGVILPRSVQNVSTPLTEYAMDAADLSP--HLSST--GSSGIPNYLTTYVDPAVIDILVAPMKA 72 (336) T ss_pred CchHHHHHH----HhhcCeeecchhhhhhhhHHHhhhhhhhccC--ccccC--CCchhHHHHHhhcccceeeehhhhhhh Confidence 111111100 0000000 00 0000111111110 11112 222355432 25667777777777 Q ss_pred hhhcceeeccCCc-ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHH Q lcl|Aclame:pro 137 EQYVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYV 212 (392) Q Consensus 137 ~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v 212 (392) ..|+.+.+++.-. ....+......+.+.+.+-+++.|-. +.....-+-+.+.++..+.++..-+..+. .++.+-- T Consensus 73 ~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~-d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~K 151 (336) T protein:vir:10 73 AELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASEL 151 (336) T ss_pred hhhccccccCCccceeEEEeeeeceeeEEEeeccCCCcee-ecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHH Confidence 7777776554321 12233333444556677777777644 34555666677788888888855444332 3456656 Q ss_pred HHHHHHHHHHHHHHHHhhccccc------------------c-----ccchhhHHHHHHHHHHHhh-hcc----cCCceE Q lcl|Aclame:pro 213 TKWLGKKSKVTRNVLILGVIEKL------------------T-----KQAIKSLDDIKDVLNVKLD-PAI----SPNAIL 264 (392) Q Consensus 213 ~~~l~~~~~~~~d~~~~~~~~~~------------------~-----~~~~~~~d~~~~~~~~~~~-~~~----~~~a~~ 264 (392) +...++++...+|+-.+.|.... + ++....++|+..++..... ... .....+ T Consensus 152 a~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL 231 (336) T protein:vir:10 152 NYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRM 231 (336) T ss_pred HHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceE Confidence 66666666666666444432211 0 0112234555544432221 111 234578 Q ss_pred EEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEE Q lcl|Aclame:pro 265 LTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) Q Consensus 265 v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~ 344 (392) +|.++.+..|.+- +..|.-++. -.... +|-+.++..+-. .++.+....++-+--. ......+. T Consensus 232 ~LP~~~~~~Ls~~-n~~g~Tvl~-~lk~n-------~Pnl~i~t~pEl---~~a~G~~~~l~~~~~~-----~~~t~~~~ 294 (336) T protein:vir:10 232 GLPPTAMSDLSKT-NQYGLAAAA-KLKDI-------FPKLEFVTIPEY---DTASGRLVQLWAPRVE-----GKDTATCG 294 (336) T ss_pred EecHHHHHhccCC-CccCccHHH-HHHHh-------cCccEEEEcccc---ccCCCceEEEEEEecC-----CCcceeee Confidence 9999988888532 333322221 01111 112333332222 2222333333321100 00111111 Q ss_pred Eec---cchhhhhcCceeEEEEEeeCc-EEecccceEEEE-e Q lcl|Aclame:pro 345 STD---VGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGE-I 381 (392) Q Consensus 345 ~~~---~~~~~f~~~~~~~~~~~r~~~-~v~~~~af~~l~-~ 381 (392) +.. ....-...-.+..-+..|.+| .+.+|.||++++ + T Consensus 295 ~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 100 000000112245566777666 555799999887 4 No 203 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=76.04 E-value=0.14 Score=25.27 Aligned_cols=270 Identities=11% Similarity=0.019 Sum_probs=103.9 Q ss_pred hccccccccceecch--hhhhHHHHhHHhhhhhhhhcceee----------ccCCcceeEEEeec-CCccccccccc--c Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQ--DIQTQINELARSFDALEQYVTVEP----------VRTRSGSRVLEKNS-DMIPFAEITEM--G 170 (392) Q Consensus 106 ~~~~~~~~gg~~iP~--~~~~~ii~~~~~~~~l~~l~~~~~----------~~~~~~~~~~~~~~-~~~~~~~~~E~--~ 170 (392) |. ++.-.-..+|. .+...+.+...+.+.|++ ..++. .++...++|+.... ++.+..+-+.. . T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~q-SGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~ 77 (349) T protein:vir:94 1 MA--ITTIGNIVTGNIPVLASYMTEDPVEKTAFFN-SGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CC--ceEEeeeeccChHHHHHHHHHhHHHhhhhhh-ccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 11 11112234554 354445555444444444 12211 22333444444332 22222111111 0 Q ss_pred ccccccccceeeEEechhh--eeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------- Q lcl|Aclame:pro 171 EIPETDNPKFSNVQYAVKD--RAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK-------------- 234 (392) Q Consensus 171 ~~~~~~~~~~~~v~~~~~~--i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~-------------- 234 (392) ..+....-+..++-...+. --..-.++.++--+ +..+.|.+++++-..+...+.++..+.. T Consensus 78 ~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~---dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~ 154 (349) T protein:vir:94 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQ---NPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHE 154 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhCc---hHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccc Confidence 1111111122332222222 22333445554332 3345555555554444443333221110 Q ss_pred -------ccccchhhHHHHHHHHHHHhhhcc----cCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccc Q lcl|Aclame:pro 235 -------LTKQAIKSLDDIKDVLNVKLDPAI----SPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNP 303 (392) Q Consensus 235 -------~~~~~~~~~d~~~~~~~~~~~~~~----~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~p 303 (392) ....+......++++..+.-+... ..-+.++||+.++..|++.+--. .+++.-....-.+++| ++ T Consensus 155 ~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~---~i~~s~~~~~i~ty~G-~~ 230 (349) T protein:vir:94 155 QNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID---FIRDAENNTMFATYQG-YR 230 (349) T ss_pred cCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh---hccCcccCcccceecC-cE Confidence 012233455666666654333322 23357999999999998654211 0111111112345566 44 Q ss_pred eEEecCccccccccc-C-CcceEEEEehhhceeeeecc-ceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEE Q lcl|Aclame:pro 304 VVVVSNRFLKSKGTT-A-KKAPLIIGDLKEAIVLFKRE-DMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGE 380 (392) Q Consensus 304 v~~~~~~~~~~~~~~-~-~~~~~~~Gd~~~~~~~~~~~-~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~ 380 (392) |++. | .+|..+.. . .-..++||. .++...+.. .+.++..+.....-..++-.+....|+ +.||.+|..-+ T Consensus 231 VivD-D-~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~ 303 (349) T protein:vir:94 231 VIVD-D-SMTVVGQDTSRKFISIIFGQ--GAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTS 303 (349) T ss_pred EEEe-C-CCccccCCCCceEEEEEeec--ceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---Eeeeeeeeecc Confidence 5543 3 33432221 2 223457764 223333322 122344333211101234444444443 56777776544 Q ss_pred ecccCC-----CCCCCC Q lcl|Aclame:pro 381 IDLSAP-----VEQPQG 392 (392) Q Consensus 381 ~~~~a~-----~~~~~~ 392 (392) -..+.+ ..+|-= T Consensus 304 a~v~~~~~~~~~~sPt~ 320 (349) T protein:vir:94 304 AVITGNGTETIARSASW 320 (349) T ss_pred cccCCCccccccCCCCh Confidence 221211 111221 No 204 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=70.04 E-value=0.21 Score=24.25 Aligned_cols=288 Identities=8% Similarity=-0.028 Sum_probs=122.1 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhh-------ccccccccceecchhh----hhHHHHhHHhhhhhhh Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAM-------SGLTGEDGGLVIPQDI----QTQINELARSFDALEQ 138 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~-------~~~~~~~gg~~iP~~~----~~~ii~~~~~~~~l~~ 138 (392) ..... .+...-+-+-.-...+..........++ ...+.+++| ||..+ .+.+++.+........ T Consensus 1 ~~~~~----~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~~~~~ 74 (336) T protein:vir:78 1 MRDAQ----RIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAE 74 (336) T ss_pred CchHH----HHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcc--hHHHHHHhcccceeeehhhhhhhhh Confidence 00000 0111001010000000000000111111 111222222 44432 2466777777777777 Q ss_pred hcceeeccCCc-ceeEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHHHH Q lcl|Aclame:pro 139 YVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYVTK 214 (392) Q Consensus 139 l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~ 214 (392) ++.+..++.-. ..+.+......+.+.+.+-+++.|-. +...++..-+.+.++..+.++..-+..+. .++.+--+. T Consensus 75 l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~v-d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~ 153 (336) T protein:vir:78 75 LVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDS-GTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNY 153 (336) T ss_pred hcccccCCCccccEEEEeeeecceeeEEeecccCCCee-ecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHH Confidence 77776654321 13344444445566677777777654 45667777788888888888866555443 345555555 Q ss_pred HHHHHHHHHHHHHHhhccccc---------------c--c------cchhhHHHHHHHHHHHh-hhc--cc--CCceEEE Q lcl|Aclame:pro 215 WLGKKSKVTRNVLILGVIEKL---------------T--K------QAIKSLDDIKDVLNVKL-DPA--IS--PNAILLT 266 (392) Q Consensus 215 ~l~~~~~~~~d~~~~~~~~~~---------------~--~------~~~~~~d~~~~~~~~~~-~~~--~~--~~a~~v~ 266 (392) ..++++.+.+|+..+.|.... + . +....++|+..++.... ... .. .+..++| T Consensus 154 aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~L 233 (336) T protein:vir:78 154 SSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGL 233 (336) T ss_pred HHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEe Confidence 556666666665444442210 0 0 01112334443332211 111 11 2336899 Q ss_pred cHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEe-hh--hceeeeeccceEE Q lcl|Aclame:pro 267 NQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD-LK--EAIVLFKREDMEL 343 (392) Q Consensus 267 ~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd-~~--~~~~~~~~~~~~~ 343 (392) .+..+..|.+- +..|.-++. -.... +|-+.++.-+-+ .++.+....+|-. +. .-..+...+. + T Consensus 234 p~~~~~~L~~~-n~~g~tv~~-~lk~n-------~Pnl~i~t~pel---~~Agg~~~~~~~~~~~~~~t~~~~~p~~--f 299 (336) T protein:vir:78 234 PPTAMSDLSKT-NQYGLSAAA-KLKEI-------FPKLEFVTIPEY---DTASGRLVQLWAPRVEGKDTATCGFTEK--M 299 (336) T ss_pred chHHHHhccCC-CccCccHHH-HHHHh-------cCccEEEEcccc---cccCcceEEEEEeeccCCcceeeecchh--h Confidence 99999988542 333322221 01111 111333322222 2222222222211 10 0000111111 1 Q ss_pred EEeccchhhhhcCceeEEEEEeeCc-EEecccceEEEE-e Q lcl|Aclame:pro 344 ASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGE-I 381 (392) Q Consensus 344 ~~~~~~~~~f~~~~~~~~~~~r~~~-~v~~~~af~~l~-~ 381 (392) ...+.. .....+..-+..|.+| .+.+|-||++++ + T Consensus 300 ~~lpvq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 300 RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hcccee---ecCceeEeccccceeeeeeeccchheeeccC Confidence 111110 0112345566677666 555799999877 4 No 205 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=67.26 E-value=0.25 Score=23.84 Aligned_cols=175 Identities=17% Similarity=0.110 Sum_probs=77.7 Q ss_pred eeeehhhHHHHHhh-----hHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------c--cc----ccchhh Q lcl|Aclame:pro 190 RAGILPLSRSLLQD-----SDQNILKYVTKWLGKKSKVTRNVLILGVIE----------------K--LT----KQAIKS 242 (392) Q Consensus 190 i~~~~~iS~e~l~d-----s~~~l~~~v~~~l~~~~~~~~d~~~~~~~~----------------~--~~----~~~~~~ 242 (392) |-+ .-+|.-+++| +++++.+...+++.++++...|+.+..... . .. ..+... T Consensus 1 iD~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l 79 (221) T protein:vir:17 1 MDD-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAI 79 (221) T ss_pred CCc-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHH Confidence 111 1234444443 567889999999999999999987644211 0 00 111234 Q ss_pred HHHHHHHHHHHhhhcccCC-c-eEEEcHHHHHHHHHhhcc-CCceeeccc---ccCC-cccceecccceEEecCcccccc Q lcl|Aclame:pro 243 LDDIKDVLNVKLDPAISPN-A-ILLTNQDGFNYLDKLKDK-DGKYILQSD---PTQK-NKKLFAGTNPVVVVSNRFLKSK 315 (392) Q Consensus 243 ~d~~~~~~~~~~~~~~~~~-a-~~v~~~~~~~~L~~lkd~-~g~~l~~~~---~~~~-~~~~~~g~~pv~~~~~~~~~~~ 315 (392) ++.+.++. ..++...-+. . .+|++|..+..|.+-.|. -.+.-+..+ ...+ .-..+.|.+ |+. ++ -+|+. T Consensus 80 ~dai~~a~-~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~-V~~-Sn-nlP~~ 155 (221) T protein:vir:17 80 VDGFFEAA-AVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIR-IYK-SN-VLASL 155 (221) T ss_pred HHHHHHHH-HHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcE-EEE-ec-cCCcc Confidence 45565543 4566666554 3 456799888877543221 111112111 1111 122355543 332 33 33432 Q ss_pred cccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 316 GTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 316 ~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~~a~~~~~~~ 392 (392) +. . +....-|+|... ....+.++. .|.+ .-+.+.+|+|...+++= .+|..||-- T Consensus 156 ~g-t-~~~~~ag~~~~~--~~~~~~yr~--------~fs~----------~~glv~~~~Avgtvkl~-~~~~~~~~~ 209 (221) T protein:vir:17 156 YG-T-NLVTDPGDATTS--GENNGSYRP--------AITD----------RAGLVFHKEAADTVEVL-LPPSRPPLV 209 (221) T ss_pred cc-c-ccccCCcccccc--ccccccccc--------cccc----------eEEEEEcchheeeeeee-cCCCCCcee Confidence 11 1 111112222110 111111111 1211 12677889888655432 234344433 No 206 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=66.56 E-value=0.27 Score=23.74 Aligned_cols=297 Identities=13% Similarity=0.060 Sum_probs=126.0 Q ss_pred hhccccccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhh--hhh Q lcl|Aclame:pro 59 RNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DAL 136 (392) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~--~~l 136 (392) ...............+...+++.|++..+ ...+..+..++|++--+.+..+|..+.... -.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg----------------~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~ 64 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKFQEEVMKSYQTG----------------YGITPDTQVDAGALRREILDDQITMLTWTQDDLIF 64 (462) T ss_pred CccccccchhhhhhhchhhHHHHHHHhcC----------------CCcCCccccccchhhhhhhhhhhheeeecccchhh Confidence 00000000000000000001111111110 011112334456665556655555443322 234 Q ss_pred hhhcceeeccCCcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHH Q lcl|Aclame:pro 137 EQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTK 214 (392) Q Consensus 137 ~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~ 214 (392) ++-+...+..+....|..-..-++ ....++.|++..+ .+++.+.+.....+-++.-..+|... |..+..+..+...+ T Consensus 65 ~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~ 143 (462) T protein:vir:96 65 YREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAP-VSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTE 143 (462) T ss_pred hhhcCCchhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHH Confidence 444566666555555544333333 5567899988655 67899999999999998877777654 34456677788888 Q ss_pred HHHHHHHHHHHHHHhhccccccccc---hhhHHHHHHHHH---------------------HHhhhcccCCceEEEcHHH Q lcl|Aclame:pro 215 WLGKKSKVTRNVLILGVIEKLTKQA---IKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQDG 270 (392) Q Consensus 215 ~l~~~~~~~~d~~~~~~~~~~~~~~---~~~~d~~~~~~~---------------------~~~~~~~~~~a~~v~~~~~ 270 (392) +-...++.+++.+.+.|...-.+.. +..+|.|.++|. .....+|....-+.|+.-+ T Consensus 144 dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v 223 (462) T protein:vir:96 144 DAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGV 223 (462) T ss_pred HHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHH Confidence 8888999999999999876655422 234444433321 1122334444456788888 Q ss_pred HHHHHHhhccCCceeecccccCC------------------cccceecccceEEecCcccccccccCCcceEEEEehhhc Q lcl|Aclame:pro 271 FNYLDKLKDKDGKYILQSDPTQK------------------NKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEA 332 (392) Q Consensus 271 ~~~L~~lkd~~g~~l~~~~~~~~------------------~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~ 332 (392) .+.|..---..-|.+.+++.... .+.++++.+-+.--+....|+. T Consensus 224 ~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~a----------------- 286 (462) T protein:vir:96 224 HADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNA----------------- 286 (462) T ss_pred HHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCC----------------- Confidence 77775322111122222221110 0111111111111111111110 Q ss_pred eeeeeccceEEEEeccchhhh----hcCceeEEEEEeeCcEEecccceE-----------EEEecccC-CCCCCCC Q lcl|Aclame:pro 333 IVLFKREDMELASTDVGGKAF----TRNTLDLRAIQRDDVQMWDNEAAV-----------YGEIDLSA-PVEQPQG 392 (392) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~f----~~~~~~~~~~~r~~~~v~~~~af~-----------~l~~~~~a-~~~~~~~ 392 (392) ...-.++.++-.-....| ...+..+++...-+..--.|...+ +|++++.+ ...+|+. T Consensus 287 ---p~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~~~~~~ 359 (462) T protein:vir:96 287 ---PQPATVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQQQPQF 359 (462) T ss_pred ---CCCCceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCccccceeeEeeeecccccceEEEEEcCCccccceE Confidence 000011111000000011 012233333333332222333332 23333221 1111111 No 207 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=62.59 E-value=0.33 Score=23.21 Aligned_cols=285 Identities=10% Similarity=0.059 Sum_probs=124.4 Q ss_pred hhhhhhhccccccc-cceecc---hhh-hhHHHHhHHhhhhhhhhcceeeccCCcceeEEEe-ecCCcc-ccccccc--- Q lcl|Aclame:pro 100 DLEQRAMSGLTGED-GGLVIP---QDI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEK-NSDMIP-FAEITEM--- 169 (392) Q Consensus 100 ~~~~~a~~~~~~~~-gg~~iP---~~~-~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~E~--- 169 (392) .....+...+..++ -|.+-| +-+ ....+...++...+.+++...++|-..|.-...+ ...-+. ..-..|| T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 11111211111111 122223 222 2334444455688889999999987665422211 111000 0011111 Q ss_pred ------------------------------cccccccccceeeEEechhheeeehhhHHHHHh-hhHHHHHHHH-HHHHH Q lcl|Aclame:pro 170 ------------------------------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ-DSDQNILKYV-TKWLG 217 (392) Q Consensus 170 ------------------------------~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~-ds~~~l~~~v-~~~l~ 217 (392) +..+.-..++-..+..+.++++.+..+|+++++ +++.+|...+ .+.|. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 111111122223566789999999999998765 3444555544 33333 Q ss_pred HHHHHHHH---HHHhhcc----------------ccccccchhhHHHHHHHHHHHhh------------------hcccC Q lcl|Aclame:pro 218 KKSKVTRN---VLILGVI----------------EKLTKQAIKSLDDIKDVLNVKLD------------------PAISP 260 (392) Q Consensus 218 ~~~~~~~d---~~~~~~~----------------~~~~~~~~~~~d~~~~~~~~~~~------------------~~~~~ 260 (392) -+..+.+| ..+++.- +.....+..+++++..+.. .|. ....+ T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~-~L~~nRapk~t~~i~~s~~~dTk~i~ 239 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQ-ILTENRTPTQTTIITGSRMIDTKVIG 239 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHH-HHHhcccccchhhhhhhhccCccccc Confidence 33333333 2333211 1122334445666655422 222 11122 Q ss_pred Cc-eEEEcHHHHHHHHHhhccCCceeecccccCCcccceecc-----cceEEecCcc-cccccc---------------- Q lcl|Aclame:pro 261 NA-ILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT-----NPVVVVSNRF-LKSKGT---------------- 317 (392) Q Consensus 261 ~a-~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~-----~pv~~~~~~~-~~~~~~---------------- 317 (392) .+ +-+||+..-..|+.++|-.|.|=|.+-..-+.+.+++.+ .-|.++.++. .+-.+. T Consensus 240 ~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~ 319 (401) T protein:vir:95 240 ATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMV 319 (401) T ss_pred cceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccc Confidence 33 347899999999999998887777664444444443321 0122222222 211000 Q ss_pred ----cCCcc-eEEEEehhhceeeeeccceE----EE--Eeccc--hhhhhcCceeEEEEE-eeCcEEecccceEEEEecc Q lcl|Aclame:pro 318 ----TAKKA-PLIIGDLKEAIVLFKREDME----LA--STDVG--GKAFTRNTLDLRAIQ-RDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 318 ----~~~~~-~~~~Gd~~~~~~~~~~~~~~----~~--~~~~~--~~~f~~~~~~~~~~~-r~~~~v~~~~af~~l~~~~ 383 (392) ..+.. .+++|+-.-+..-+...+.. +- ..-+. ++.=..||..+.++. ..++.+.+++-++.++ + T Consensus 320 ~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ie--s 397 (401) T protein:vir:95 320 SGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIK--T 397 (401) T ss_pred cCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEE--e Confidence 00111 24566532222112222211 11 11110 000023444444433 3567777888776555 5 Q ss_pred cCCC Q lcl|Aclame:pro 384 SAPV 387 (392) Q Consensus 384 ~a~~ 387 (392) .+|. T Consensus 398 ~a~~ 401 (401) T protein:vir:95 398 VAPL 401 (401) T ss_pred ecCC Confidence 5665 No 208 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=61.14 E-value=0.36 Score=23.02 Aligned_cols=287 Identities=11% Similarity=0.016 Sum_probs=112.7 Q ss_pred ccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhh--hhhhhhcceee Q lcl|Aclame:pro 67 TRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DALEQYVTVEP 144 (392) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~--~~l~~l~~~~~ 144 (392) ........+. +.... ++. ..+...-......+-.+||++-=+.+..+|..+.... -.+++-+...+ T Consensus 1 ~~~~~n~~~~-------~~~~~---e~~--~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~ 68 (464) T protein:vir:80 1 MTEKKNTERQ-------LTSVQ---EEV--IKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRP 68 (464) T ss_pred CCcchhhHhh-------cCccc---HHH--HHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCch Confidence 0000000000 00000 000 0000000011112233456665556655554443322 23444556666 Q ss_pred ccCCcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 145 VRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKKSKV 222 (392) Q Consensus 145 ~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~~l~~~~~~ 222 (392) ..+....|..-...++ ....++.|++..+ .+++.+.+.....+-+..---+|..+ |.++..+-.....++-...++. T Consensus 69 a~STV~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~ 147 (464) T protein:vir:80 69 ATSTVAKYDVYLAHGRVGHTRFTREIGVAP-ISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAK 147 (464) T ss_pred hhhhhhhhheeeccCccccccccccccccc-cCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHH Confidence 6655555544333333 5567899987655 66799998888877555433334332 2334445566666777777889 Q ss_pred HHHHHHhhcccccccc----chhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHH-hhccCCce--eecccccCCcc Q lcl|Aclame:pro 223 TRNVLILGVIEKLTKQ----AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK-LKDKDGKY--ILQSDPTQKNK 295 (392) Q Consensus 223 ~~d~~~~~~~~~~~~~----~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~-lkd~~g~~--l~~~~~~~~~~ 295 (392) +++.+.+.|...-.+. -+.-+|.|.++|... +--..... -++++.+++... +.-.-|.+ +|.|......- T Consensus 148 tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~-NViDarG~--~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f 224 (464) T protein:vir:80 148 TIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKH-NVLDAKGA--SLTEALLNQASVLVGKGYGTPTDAYMPIGVQADF 224 (464) T ss_pred HHHHHHhhhccccCCCCCCccccchhhhHhhcCCC-ceeecCCC--CcCHHHHhhhhhhhhcccCChhhcccchhHHHHH Confidence 9999999987765542 334567777655211 11111111 133444333321 11122322 34433333221 Q ss_pred -cceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc Q lcl|Aclame:pro 296 -KLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE 374 (392) Q Consensus 296 -~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~ 374 (392) ...++..-+.+..+.-... +--|...+ ...++.+.+.-+. |..+...+.. . ....|. T Consensus 225 ~n~~l~~q~~~~~~n~~~~~----------~G~~v~~f--~sa~G~i~L~~s~-----~m~~~~~ld~-~----~~~~~~ 282 (464) T protein:vir:80 225 VNQQLDRQVQVISDNGQNAT----------MGFNVKGF--NSARGFIRLHGST-----VMELEQILDE-N----RMQLPN 282 (464) T ss_pred HhhhcCceeEEEcCCCCcce----------eeeecccc--cccccceeccCcc-----ccCccccccc-c----cccCCC Confidence 2223322122111100000 00011111 1112333332111 1111111100 0 012233 Q ss_pred ceEEEEecccCCCCCCCC Q lcl|Aclame:pro 375 AAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 375 af~~l~~~~~a~~~~~~~ 392 (392) |+..-+++++..+. ..| T Consensus 283 apaapsvt~tv~~~-~~g 299 (464) T protein:vir:80 283 APQKATVKATLEAG-TKG 299 (464) T ss_pred CcCCceeEEEecCC-ccc Confidence 43333333322111 112 No 209 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=53.44 E-value=0.53 Score=22.10 Aligned_cols=281 Identities=9% Similarity=0.047 Sum_probs=127.7 Q ss_pred hhhHHHHHHHHhhhhhhhhccccc-cccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccc Q lcl|Aclame:pro 88 PLNAEEREFLEDDLEQRAMSGLTG-EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEI 166 (392) Q Consensus 88 ~~~~~~~~~~~~~~~~~a~~~~~~-~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (392) ........+.....+.......+. .+..+.|.+.+...+.+.+.+.+-+++++++++|....|..... ..+++-++.. T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~l-g~~g~iagrt 79 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFG-ATEKGVTGRK 79 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEee-ccCccccccc Confidence 000011111111111111111111 12357787788888999999999999999999999888876543 3333333333 Q ss_pred ccccccccccccceeeEEechhheeeehhhHHHHHhhhH--HH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------- Q lcl|Aclame:pro 167 TEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD--QN-ILKYVTKWLGKKSKVTRNVLILGVIEK--------- 234 (392) Q Consensus 167 ~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~-l~~~v~~~l~~~~~~~~d~~~~~~~~~--------- 234 (392) ..+.... ...++.-.+..++.---..|+.+.|+..+ ++ +...+..-+.++++.-.-.-.++|+.. T Consensus 80 dt~r~r~---~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPll 156 (336) T protein:vir:37 80 QTGRNLA---TLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDL 156 (336) T ss_pred CCCCCcc---ccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccc Confidence 3222211 12233344444444444556666666542 23 222222223333322121111222110 Q ss_pred ---------------------------------ccccchhhHHHHHH-HHHHHhhhcccC--CceEEEcHHHHHH-HHHh Q lcl|Aclame:pro 235 ---------------------------------LTKQAIKSLDDIKD-VLNVKLDPAISP--NAILLTNQDGFNY-LDKL 277 (392) Q Consensus 235 ---------------------------------~~~~~~~~~d~~~~-~~~~~~~~~~~~--~a~~v~~~~~~~~-L~~l 277 (392) ++.+...+.|.++. +++ .+++.++. .-+.|+.+...+. -..| T Consensus 157 qDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLla~~~~~l 235 (336) T protein:vir:37 157 SDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADLVSKETKLI 235 (336) T ss_pred cccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-ccchHHhcCCCeEEEEchhhhhhhhhhh Confidence 00111334565543 344 56777765 4578888877542 1122 Q ss_pred hccCC-ceeecccc-cCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhc Q lcl|Aclame:pro 278 KDKDG-KYILQSDP-TQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTR 355 (392) Q Consensus 278 kd~~g-~~l~~~~~-~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~ 355 (392) -..++ +|--.-.. ......+ +|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+... + T Consensus 236 ~~~~~~~PtE~~Aa~~~~~~k~-iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~----r 301 (336) T protein:vir:37 236 QQKHGLTPTEKAALGSHNLMGS-FGGMNAITP--PNFPAR-------AAAVTTLKNLSVYTEAESVRRSLRNDED----K 301 (336) T ss_pred hhhcCCCHHHHHHHHHHHHHHh-hCCceEEEc--cccCCC-------ceEEeeccccEEEEecCcEEEEEEEccc----c Confidence 22222 22000000 0001223 455665543 455543 3677777775444455555444433321 2 Q ss_pred CceeEEEEEeeCcEEecccceEEEE---ecccCCC Q lcl|Aclame:pro 356 NTLDLRAIQRDDVQMWDNEAAVYGE---IDLSAPV 387 (392) Q Consensus 356 ~~~~~~~~~r~~~~v~~~~af~~l~---~~~~a~~ 387 (392) +.+--.=..--|..|-++.+++.++ ++-.+.+ T Consensus 302 ~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 302 KGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred ccccchhhhcceeeeeccccEEEeeeeeeeccccC Confidence 2322222233455666777777664 3333343 No 210 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=46.88 E-value=0.72 Score=21.37 Aligned_cols=280 Identities=13% Similarity=0.031 Sum_probs=108.7 Q ss_pred cccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhh--hhhhhhcceeec Q lcl|Aclame:pro 68 RNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSF--DALEQYVTVEPV 145 (392) Q Consensus 68 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~--~~l~~l~~~~~~ 145 (392) .+.+.-.....+..+. ++. ....|+.+--+.+.+++..+.... -.+++-....+. T Consensus 1 ~~~~~~~~~~~a~~~a----------------------l~~-a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a 57 (470) T protein:vir:10 1 MPYEHLKHLDEATLKA----------------------LNA-AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKA 57 (470) T ss_pred CChhHhhhhhHHHHHH----------------------HHH-hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchh Confidence 1111111111111111 111 111222232222322222222111 122233344444 Q ss_pred cCCcceeEEEee-cCCccccccccccccccccccceeeEEechhheeeehhhHHHH---HhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 146 RTRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL---LQDSDQNILKYVTKWLGKKSK 221 (392) Q Consensus 146 ~~~~~~~~~~~~-~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~---l~ds~~~l~~~v~~~l~~~~~ 221 (392) .+....|..... .+....+.+.|++- ++.+++.+.+..+..+-++.-..||.-. ++....++...+.++---.++ T Consensus 58 ~STV~ey~~~~~rhG~~g~s~~~E~~l-~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia 136 (470) T protein:vir:10 58 KAYEHEYNVVTARHDKIGYAAFREGGL-PRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVA 136 (470) T ss_pred hhHhhhhhhhccccccccceeeccccc-CccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHH Confidence 443333322111 23344445678764 4567899999999999999988999764 344455788888888888889 Q ss_pred HHHHHHHhhcccccc-----ccchhhHHHHHHHHHHH-hhhcccCCceEEEcHHHHHHHH-Hhhcc--CCc--eeecccc Q lcl|Aclame:pro 222 VTRNVLILGVIEKLT-----KQAIKSLDDIKDVLNVK-LDPAISPNAILLTNQDGFNYLD-KLKDK--DGK--YILQSDP 290 (392) Q Consensus 222 ~~~d~~~~~~~~~~~-----~~~~~~~d~~~~~~~~~-~~~~~~~~a~~v~~~~~~~~L~-~lkd~--~g~--~l~~~~~ 290 (392) ++++.+++.|...-+ ...+.-+|.+.++|... -.+-+...+..+ +.+.+.+.. .++.+ -|. .+|.|.. T Consensus 137 ~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~L-s~~~L~~aa~~I~~~~~fGt~TD~~lp~~ 215 (470) T protein:vir:10 137 NEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPL-SIDLLWEAESRVVSTQAFANPTAVFISYV 215 (470) T ss_pred HHHHhhhhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCc-cHHHHHHHHhhhcccccccChhhhccchh Confidence 999999999866433 23344567766654311 011111112222 555554443 24322 222 2343332 Q ss_pred cCCc-ccceecccceEEecCcccccccccCCcceEEEE-ehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCc Q lcl|Aclame:pro 291 TQKN-KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIG-DLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV 368 (392) Q Consensus 291 ~~~~-~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~G-d~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~ 368 (392) .... ...+++.+.|.+-.|.- .+..| |...+ ...++.+.+.-+..-..+-..+ ..+++. T Consensus 216 vka~f~~~~~~~qRv~~~~N~~-----------~~~~G~~v~~f--~sa~G~I~L~~s~~m~~~~k~~------p~~l~~ 276 (470) T protein:vir:10 216 DKLNLQASFYQISRVMTTADRR-----------AGLLGADAQSY--IGVRGEHSLYPSQFLGDFHKFN------PARFGA 276 (470) T ss_pred HHHHHHHhhcCceEEEEecCCC-----------ceeeeeeccce--eeeeeeeeecccccccchhhcC------cccCCc Confidence 2221 12233333333211110 00011 11111 1112233221111100000000 011222 Q ss_pred EE---ecccceEEEEecccCCCCCCCC Q lcl|Aclame:pro 369 QM---WDNEAAVYGEIDLSAPVEQPQG 392 (392) Q Consensus 369 ~v---~~~~af~~l~~~~~a~~~~~~~ 392 (392) .+ .-|..++-+.-+. ..++.|.+ T Consensus 277 ~v~~~aAP~~~~tv~~t~-~~~a~~~~ 302 (470) T protein:vir:10 277 EVGDFAAPSNSWTVSTTD-NFVTLPYN 302 (470) T ss_pred ccCCcccCceeEEeecCC-Cceeeccc Confidence 21 1222111111000 00011111 No 211 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=46.53 E-value=0.73 Score=21.33 Aligned_cols=268 Identities=10% Similarity=0.066 Sum_probs=100.8 Q ss_pred ccccccccceecchhhhhHHHHhHHhh---h-hhhhhcceeeccCCcceeEEEeecC--Cccccccccccccccccccce Q lcl|Aclame:pro 107 SGLTGEDGGLVIPQDIQTQINELARSF---D-ALEQYVTVEPVRTRSGSRVLEKNSD--MIPFAEITEMGEIPETDNPKF 180 (392) Q Consensus 107 ~~~~~~~gg~~iP~~~~~~ii~~~~~~---~-~l~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~ 180 (392) +..+. .--++-|.++...|-..+.+. + .+-.+...+.+. ...+.+..... ...+.+++.+.+.+-...-.+ T Consensus 1 M~~~~-~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~--~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~ 77 (348) T protein:vir:98 1 MSWTL-DTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVD--DITFEFLRGGGGLAETASYRSWDTESKIGRREGL 77 (348) T ss_pred Ccchh-hhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCcccc--ceEEEEEeccCCceeeeeeecCCCccceeecccc Confidence 11111 111233444433332222111 1 122333333222 22222222221 122445555444332222345 Q ss_pred eeEEechhheeeehhhHHHHHhh----hHHHHHHHHHHH---HHHHHHHHHHHH----Hhhc--------------c--- Q lcl|Aclame:pro 181 SNVQYAVKDRAGILPLSRSLLQD----SDQNILKYVTKW---LGKKSKVTRNVL----ILGV--------------I--- 232 (392) Q Consensus 181 ~~v~~~~~~i~~~~~iS~e~l~d----s~~~l~~~v~~~---l~~~~~~~~d~~----~~~~--------------~--- 232 (392) ...++.+-.++-...++.+-+.. ....+..++.+. +.+.+...++.. +..| . T Consensus 78 ~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~ 157 (348) T protein:vir:98 78 AKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGS 157 (348) T ss_pred eeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcc Confidence 66666666666555555432221 111233333332 233333333322 2211 0 Q ss_pred ----cc--c-cccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHH---Hhhcc-C------CceeecccccCCcc Q lcl|Aclame:pro 233 ----EK--L-TKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD---KLKDK-D------GKYILQSDPTQKNK 295 (392) Q Consensus 233 ----~~--~-~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~---~lkd~-~------g~~l~~~~~~~~~~ 295 (392) ++ . ...+...+.+|.+.+.+..+..+.....++|++..|..|+ ++++. . ..++..+.... .. T Consensus 158 ~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~ 236 (348) T protein:vir:98 158 HSVVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLN-TV 236 (348) T ss_pred cccccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHH-HH Confidence 00 0 0122234456666665555555667778999999999985 23322 1 12233221111 11 Q ss_pred cceecccceEEecCccccccccc---CCcceEEEEeh-hh---------ceeee----------------eccceEEEEe Q lcl|Aclame:pro 296 KLFAGTNPVVVVSNRFLKSKGTT---AKKAPLIIGDL-KE---------AIVLF----------------KREDMELAST 346 (392) Q Consensus 296 ~~~~g~~pv~~~~~~~~~~~~~~---~~~~~~~~Gd~-~~---------~~~~~----------------~~~~~~~~~~ 346 (392) ...+|.+.+.+.+ ......+.. -.+..+++..- +. ++..+ ...++-+..- T Consensus 237 ~~~~g~~~i~~~d-~~~~~~g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~~~ 315 (348) T protein:vir:98 237 LSSMGLPPIEVYD-AKVAVDGVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVAATW 315 (348) T ss_pred HHhhCCeEEEEee-eEEEcCCceeceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCceeeeee Confidence 2235655554432 222222110 01111222110 00 00000 0000111000 Q ss_pred ccchhhhhcCceeEEEEEeeCcEEecccceEEEEecc Q lcl|Aclame:pro 347 DVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) Q Consensus 347 ~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~~l~~~~ 383 (392) .+. + --...+++..+.=-.+.+|++++++++=+ T Consensus 316 ~~~-d---P~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 316 KTK-D---PVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eec-C---CcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 000 0 01234444555444556788888887665 No 212 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=45.83 E-value=0.76 Score=21.25 Aligned_cols=279 Identities=9% Similarity=0.036 Sum_probs=128.4 Q ss_pred hhhHHHHHHHHhhhhhhhhcccccc-ccceecchhhhhHHHHhHHhhhhhhhhcceeeccCCcceeEEEeecCCcccccc Q lcl|Aclame:pro 88 PLNAEEREFLEDDLEQRAMSGLTGE-DGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEI 166 (392) Q Consensus 88 ~~~~~~~~~~~~~~~~~a~~~~~~~-~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (392) ........+.....+.......+.. +..+.|.+.+...+.+.+.+.+-+++++++++|....|...... .+++-++.. T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg-~~g~iagrt 79 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGA-TEKGVTGRK 79 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeec-cCccccccc Confidence 0000111111111111111111111 23477777888889999999999999999999998888765433 333333322 Q ss_pred ccccccccccccceeeEEechhheeeehhhHHHHHhhhH--HHHH-HHHHHHHHHHHHHHHHHHHhhccccc-------- Q lcl|Aclame:pro 167 TEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD--QNIL-KYVTKWLGKKSKVTRNVLILGVIEKL-------- 235 (392) Q Consensus 167 ~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~-~~v~~~l~~~~~~~~d~~~~~~~~~~-------- 235 (392) ..+ -.+ . ...++.-.+..++.---..|+.+.|+..+ +++. ..+..-+.++++.-.-.-.++|+..+ T Consensus 80 dt~-R~~-~-~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPll 156 (336) T protein:vir:37 80 QTG-RNL-A-NLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADL 156 (336) T ss_pred CCC-ccc-c-ccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCcc Confidence 221 111 1 13444444555555445566667766542 2322 22222233333322211122221110 Q ss_pred ----------------------------------cccchhhHHHHHH-HHHHHhhhcccC--CceEEEcHHHHHH-HHHh Q lcl|Aclame:pro 236 ----------------------------------TKQAIKSLDDIKD-VLNVKLDPAISP--NAILLTNQDGFNY-LDKL 277 (392) Q Consensus 236 ----------------------------------~~~~~~~~d~~~~-~~~~~~~~~~~~--~a~~v~~~~~~~~-L~~l 277 (392) +.+...+.|.++. +++ .+++.++. .-+.|+.+...+. -..| T Consensus 157 qDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLla~~~~~l 235 (336) T protein:vir:37 157 SDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADLVSKETKLI 235 (336) T ss_pred cccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-cCchHHhcCCCeEEEEchhhhhhhhhhh Confidence 0111334555443 344 56777765 4578888877542 1223 Q ss_pred hccCC-ceeeccccc---CCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhh Q lcl|Aclame:pro 278 KDKDG-KYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) Q Consensus 278 kd~~g-~~l~~~~~~---~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f 353 (392) -..++ +|- .... .....+ +|+.|.+.+ +++|.. .+++--|++....+.++..+=.+-+... T Consensus 236 ~~~~~~~Pt--E~~Aa~~~~~~k~-iGGlpa~~~--PffP~~-------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~--- 300 (336) T protein:vir:37 236 QQKHGLTPT--EKAALGSHNLMGS-FGGMNAITP--PNFPAR-------AAAVTTLKNLSVYTEAESVRRSLRNDED--- 300 (336) T ss_pred hhhcCCCHH--HHHHHHHHHHHHh-hCCceeEEc--cccCCC-------ceEEeechhcEEEEecCcEEEEEEEccc--- Confidence 22222 221 0000 001123 455665543 455543 3677777775444555555444433321 Q ss_pred hcCceeEEEEEeeCcEEecccceEEEE---ecccCCC Q lcl|Aclame:pro 354 TRNTLDLRAIQRDDVQMWDNEAAVYGE---IDLSAPV 387 (392) Q Consensus 354 ~~~~~~~~~~~r~~~~v~~~~af~~l~---~~~~a~~ 387 (392) ++.+--.=..--|..|-++.+++.++ ++-.+.+ T Consensus 301 -r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 301 -KKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred -cccccchhhhcceeeeeccccEEEeeeeeeeecCcC Confidence 22222222223355666777776654 3333443 No 213 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=45.79 E-value=0.76 Score=21.24 Aligned_cols=297 Identities=13% Similarity=0.075 Sum_probs=116.3 Q ss_pred cccchhhHH--------HHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhh--hhh Q lcl|Aclame:pro 68 RNVDGEMEY--------RDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFD--ALE 137 (392) Q Consensus 68 ~~~~~~~~~--------~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~--~l~ 137 (392) .+.....+. -+.+.|.+..+ ...+..+-.+|+++--+.+...|..+..... .++ T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~~ag----------------y~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~ 64 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSFTTG----------------YGITPDTQTDAGALRREFLDDQISMLTWTENDLTFY 64 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHHHcc----------------cccCCccccCcchhhhhhhhhhhheeeccccchhhh Confidence 111111111 01122222111 1112223345666655666666555443322 234 Q ss_pred hhcceeeccCCcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHHH Q lcl|Aclame:pro 138 QYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKW 215 (392) Q Consensus 138 ~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~~ 215 (392) .-+...+..+....|..-..-++ ....++.|++... .+++.+.+.....+-++.-..+|.-+ +..+..+..+...++ T Consensus 65 ~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~-~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ 143 (467) T protein:vir:80 65 KDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDD 143 (467) T ss_pred hhcccchhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHH Confidence 44455555554444443333333 5567899987655 67899999999999998866666544 233345667788888 Q ss_pred HHHHHHHHHHHHHhhccccccc----cchhhHHHHHHHHH---------------------HHhhhcccCCceEEEcHHH Q lcl|Aclame:pro 216 LGKKSKVTRNVLILGVIEKLTK----QAIKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQDG 270 (392) Q Consensus 216 l~~~~~~~~d~~~~~~~~~~~~----~~~~~~d~~~~~~~---------------------~~~~~~~~~~a~~v~~~~~ 270 (392) -...++.+++.+.+.|...-.. .-+.-.|.|..++. ...--+|....-++|+.-+ T Consensus 144 ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v 223 (467) T protein:vir:80 144 AIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGV 223 (467) T ss_pred HHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhH Confidence 8888899999998887655421 11122233322211 1111123333335555555 Q ss_pred HHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccce-------EE Q lcl|Aclame:pro 271 FNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM-------EL 343 (392) Q Consensus 271 ~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~-------~~ 343 (392) .+.|..--- ..++.+..+-. .....|. +|. .+++.-+.-.-....++|+... .-.+..+. .+ T Consensus 224 ~a~~~~~~L-~~q~~v~~~n~---~~~~~G~-~v~----g~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~~Apsp~~v 292 (467) T protein:vir:80 224 QADFVNQQL-SKQTQLVRDNG---NNVSVGF-NIQ----GFHSARGFIKLHGSTVMENEQI--LDERILALPTAPQPAKV 292 (467) T ss_pred Hhhhhhhhc-CceEEEEcCCC---Cceeeee-ccc----ceecceeeeeecCceeeccccC--CCcccccccccccCCcc Confidence 554411000 01111111000 0011111 110 0000000000001112222211 00000000 00 Q ss_pred EEecc--chhhhhc---CceeEEEEEeeCcEEecccce-----------EEEEecc-cCCCCCCCC Q lcl|Aclame:pro 344 ASTDV--GGKAFTR---NTLDLRAIQRDDVQMWDNEAA-----------VYGEIDL-SAPVEQPQG 392 (392) Q Consensus 344 ~~~~~--~~~~f~~---~~~~~~~~~r~~~~v~~~~af-----------~~l~~~~-~a~~~~~~~ 392 (392) ..+.. ....|.. ..+.+++...-+.+--.|... +.|+++. +.+..+|.. T Consensus 293 saT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~y 358 (467) T protein:vir:80 293 TATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQF 358 (467) T ss_pred ceeeecccCCcccCCCcceEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceE Confidence 00000 0001110 113333333322222222222 2334432 223333433 No 214 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=42.98 E-value=0.86 Score=20.94 Aligned_cols=274 Identities=11% Similarity=-0.019 Sum_probs=101.4 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeec-------cCCcceeEEEeecCCccccccc-cccccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPV-------RTRSGSRVLEKNSDMIPFAEIT-EMGEIPETDN 177 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~-E~~~~~~~~~ 177 (392) |. .+. --.+|+-+..+.++.+++..++.+++..-.- .+++.+++.|. ......+.. .+... ..++ T Consensus 1 Ma-N~l---lT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~--~~~~~d~~~~~~~~~-~~~d 73 (423) T protein:vir:10 1 MP-NNL---DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH--QFSSLRTPTGDISGQ-NKNN 73 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCC--ceeeeccCCcccccc-ccCc Confidence 11 111 1136888888999999999988888765221 12222333322 111111111 11111 1112 Q ss_pred ccee--eEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----cc-c-cchhhHHHHHH Q lcl|Aclame:pro 178 PKFS--NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK-----LT-K-QAIKSLDDIKD 248 (392) Q Consensus 178 ~~~~--~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~-----~~-~-~~~~~~d~~~~ 248 (392) ..-. .++++-+|...+--=..|+. ...-+++.++... .++++..+|..+...... .+ + .....|+++++ T Consensus 74 l~e~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~ 151 (423) T protein:vir:10 74 LISGKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQ 151 (423) T ss_pred cccceeEEEeeceeeeeeeechHHHh-cChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHH Confidence 2223 35555555555443344544 3345677766555 577888888776542111 11 1 12234677766 Q ss_pred HHHHHhhhcccCC--ceEEEcHHHHHHHHHhhc--cCCceeecccccCCcc-cceecccceEEecCccccc--ccccCCc Q lcl|Aclame:pro 249 VLNVKLDPAISPN--AILLTNQDGFNYLDKLKD--KDGKYILQSDPTQKNK-KLFAGTNPVVVVSNRFLKS--KGTTAKK 321 (392) Q Consensus 249 ~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd--~~g~~l~~~~~~~~~~-~~~~g~~pv~~~~~~~~~~--~~~~~~~ 321 (392) + ...|+....+. -..|++|..+..|.+-.. ..+...-...+..+.- ..+.|+. |+. ++ -+|. .+...+. T Consensus 152 a-~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFd-v~~-Sn-nip~~T~gt~~~t 227 (423) T protein:vir:10 152 T-ASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIR-ALM-SN-GLASRTQGAFGGT 227 (423) T ss_pred H-HHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceE-EEE-eC-CCccccccccccc Confidence 4 34565555553 467999999887753110 0111111112222222 3566653 333 22 2332 2222211 Q ss_pred ceEEEEehhhceeeeeccceEEEEe--c--cchhhhhcCceeEEEE---EeeCcEEe------cccce------------ Q lcl|Aclame:pro 322 APLIIGDLKEAIVLFKREDMELAST--D--VGGKAFTRNTLDLRAI---QRDDVQMW------DNEAA------------ 376 (392) Q Consensus 322 ~~~~~Gd~~~~~~~~~~~~~~~~~~--~--~~~~~f~~~~~~~~~~---~r~~~~v~------~~~af------------ 376 (392) .....|-.-.+.........++... . .++..-.-|.+.|-+. .+....+. .+.-| T Consensus 228 ~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g 307 (423) T protein:vir:10 228 LTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGG 307 (423) T ss_pred eeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCC Confidence 1111111000000000001111000 0 0000000111111110 00000000 00111 Q ss_pred -EEEEeccc--CCCC---------CCCC Q lcl|Aclame:pro 377 -VYGEIDLS--APVE---------QPQG 392 (392) Q Consensus 377 -~~l~~~~~--a~~~---------~~~~ 392 (392) ..+++..+ ++++ +|+. T Consensus 308 ~~tv~i~p~~i~~~~~~~~~~v~a~~a~ 335 (423) T protein:vir:10 308 DVTVTLSGVPIYDTTNPQYNSVSRQVEA 335 (423) T ss_pred ceeeeccCccccccCCcccccccccccC Confidence 11222211 1111 1111 No 215 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=42.21 E-value=0.89 Score=20.85 Aligned_cols=293 Identities=9% Similarity=-0.009 Sum_probs=120.7 Q ss_pred HHHHHHHHHHHHHHhhccccccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhc-------cccccccce- Q lcl|Aclame:pro 45 IDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMS-------GLTGEDGGL- 116 (392) Q Consensus 45 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~-------~~~~~~gg~- 116 (392) ++. ........++++... .+.+.. ...........++. ..+..+.|. T Consensus 1 ~~~-------------------~~~~~~~~~l~~~g~-~~~~~~-----~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~ 55 (339) T protein:vir:94 1 MSI-------------------NNDRTDIKQLEKVGI-IFDGYS-----PKSISSEVSAYAMDAVNLTPTLQTTANAGIP 55 (339) T ss_pred Cce-------------------echHHHHHHHHhhce-eeccch-----hhhcchhhHhhhccccccccccccccccchh Confidence 000 000000000000000 000000 00000001111111 112222232 Q ss_pred -ecchhhhhHHHHhHHhhhhhhhhcceeeccCC-cceeEEEeecCCcccccccccccccccc-ccceeeEEechhheeee Q lcl|Aclame:pro 117 -VIPQDIQTQINELARSFDALEQYVTVEPVRTR-SGSRVLEKNSDMIPFAEITEMGEIPETD-NPKFSNVQYAVKDRAGI 193 (392) Q Consensus 117 -~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~~~~-~~~~~~v~~~~~~i~~~ 193 (392) ..++-+.+.|++........+.++.+.+.+.- ...+.+......+.+.+.+.+++.|-.+ ...+.+.++....++- T Consensus 56 a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~- 134 (339) T protein:vir:94 56 AWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWT- 134 (339) T ss_pred hhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEE- Confidence 12334456777888888888888888776542 2345555556666777888877775332 2345555544444433 Q ss_pred hhhHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------cc--c--chhh----HHHHHH Q lcl|Aclame:pro 194 LPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKL--------------TK--Q--AIKS----LDDIKD 248 (392) Q Consensus 194 ~~iS~e~l~ds---~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~--------------~~--~--~~~~----~d~~~~ 248 (392) .++..-+..+ ..++.+--.....+++...+|+..+.|.... .. + ...+ ++|+.. T Consensus 135 -~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~ 213 (339) T protein:vir:94 135 -EYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVA 213 (339) T ss_pred -eecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHH Confidence 3443322222 2346666666666677777776655553211 01 1 1122 344444 Q ss_pred HHHHHhh-hc--ccCC--ceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcce Q lcl|Aclame:pro 249 VLNVKLD-PA--ISPN--AILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) Q Consensus 249 ~~~~~~~-~~--~~~~--a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~ 323 (392) ++..... .. ...+ -.++|.|+.+..|..- +..|.-++. -..... |.+.++.-+.+. ++.+... T Consensus 214 ~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~-~lk~n~-------pnl~i~~~~el~---~a~g~~~ 281 (339) T protein:vir:94 214 MVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGA-KIAQTY-------PNIQFVAVPEFD---TASGRLV 281 (339) T ss_pred HHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHH-HHHHhc-------CCcEEEEccccc---cCCCceE Confidence 3332211 11 1122 3689999999988643 333432321 011111 113333322222 2222222 Q ss_pred EEEEehh---hceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCc-EEecccceEEEE-e Q lcl|Aclame:pro 324 LIIGDLK---EAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGE-I 381 (392) Q Consensus 324 ~~~Gd~~---~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~-~v~~~~af~~l~-~ 381 (392) .++-+.. +-..+. -.+.+...+.. ...-.+..-+..|.+| .+++|.||++++ + T Consensus 282 ~~~~~~~~~~~~~~~~--~p~~~~~lpvq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 282 QLWVPEVNGQPTGEVA--FAEKLRSHSIE---RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred EEEEEeccCCcceEEE--cchhhhccccE---EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 2221110 000011 11111121111 0112345567777555 666899999887 4 No 216 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=38.60 E-value=1.1 Score=20.45 Aligned_cols=288 Identities=8% Similarity=-0.033 Sum_probs=113.8 Q ss_pred cchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhh-------ccccccccceecchhhh----hHHHHhHHhhhhhhh Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAM-------SGLTGEDGGLVIPQDIQ----TQINELARSFDALEQ 138 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~-------~~~~~~~gg~~iP~~~~----~~ii~~~~~~~~l~~ 138 (392) ..... .+...-+-+-.-...+..........++ ...+.+++| ||..+. +.+++.+.....+.. T Consensus 1 ~~~~~----~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~~~~~ 74 (336) T protein:vir:10 1 MRDAQ----RIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAE 74 (336) T ss_pred CchHH----HHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcc--hHHHHHhhcCcceeeeeechhchhh Confidence 00000 0111001010000010000000111111 111222222 454332 455666666666666 Q ss_pred hcceeeccCCcce-eEEEeecCCccccccccccccccccccceeeEEechhheeeehhhHHHHHhhhH---HHHHHHHHH Q lcl|Aclame:pro 139 YVTVEPVRTRSGS-RVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---QNILKYVTK 214 (392) Q Consensus 139 l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~l~ds~---~~l~~~v~~ 214 (392) ++.+.+.+.-.-. ..+......+.+.+.+-..+.|-. +...+.-.-+.+.++..+.++..-+.... .++.+--+. T Consensus 75 l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~-d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~ 153 (336) T protein:vir:10 75 LVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNY 153 (336) T ss_pred hcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcce-eeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHH Confidence 6666553321111 112222233344555655566643 34555556667777888888866555433 245555555 Q ss_pred HHHHHHHHHHHHHHhhccccc-----------------cc------cchhhHHHHHHHHHHHh-hhc--cc--CCceEEE Q lcl|Aclame:pro 215 WLGKKSKVTRNVLILGVIEKL-----------------TK------QAIKSLDDIKDVLNVKL-DPA--IS--PNAILLT 266 (392) Q Consensus 215 ~l~~~~~~~~d~~~~~~~~~~-----------------~~------~~~~~~d~~~~~~~~~~-~~~--~~--~~a~~v~ 266 (392) ..++++.+.+|+..+.|.... .. +....++|+..++.... ... .. .+..++| T Consensus 154 aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~L 233 (336) T protein:vir:10 154 SSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGL 233 (336) T ss_pred HHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEe Confidence 555666666665444432210 00 11122344444332211 111 11 1336899 Q ss_pred cHHHHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehh---hceeeeeccceEE Q lcl|Aclame:pro 267 NQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLK---EAIVLFKREDMEL 343 (392) Q Consensus 267 ~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~---~~~~~~~~~~~~~ 343 (392) .+..+..|.+ ++..|.-++. -.... +|-+.+++-+-+ .++.+....+|-.-. .-..+...+. + T Consensus 234 p~~~~~~L~~-~n~~g~tv~~-~lk~n-------~Pnl~i~t~pel---~~Agg~~~~~~~~~~~~~~t~~~~~P~~--f 299 (336) T protein:vir:10 234 PPTAMSDLSK-TNQYGLSAAA-KLKEI-------FPKLEFVTIPEY---DTASGRLVQLWAPRVEGKDTATCGFTEK--M 299 (336) T ss_pred chHHHHhccC-CCccCccHHH-HHHHh-------CCccEEEEcccc---cccCCceEEEEEecccCCcceeeecChh--h Confidence 9999998854 2333322221 01111 111333332222 222222222221110 0000111111 1 Q ss_pred EEeccchhhhhcCceeEEEEEeeCc-EEecccceEEEE-e Q lcl|Aclame:pro 344 ASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGE-I 381 (392) Q Consensus 344 ~~~~~~~~~f~~~~~~~~~~~r~~~-~v~~~~af~~l~-~ 381 (392) ...+.. ...-.+..-+..|.+| .+.+|-||++++ + T Consensus 300 ~~lpvq---~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 300 RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred hcccee---ecCceeEeccccceeeeeeeccchheeeccC Confidence 111110 0112345566677666 455799998876 4 No 217 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=36.17 E-value=1.2 Score=20.17 Aligned_cols=257 Identities=12% Similarity=0.048 Sum_probs=114.3 Q ss_pred ccccccccceecchhhhhHHHHhHHhh-hhhhhhcceeeccCCcceeEEEeecCCccc-cccccccccccccccceeeEE Q lcl|Aclame:pro 107 SGLTGEDGGLVIPQDIQTQINELARSF-DALEQYVTVEPVRTRSGSRVLEKNSDMIPF-AEITEMGEIPETDNPKFSNVQ 184 (392) Q Consensus 107 ~~~~~~~gg~~iP~~~~~~ii~~~~~~-~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~E~~~~~~~~~~~~~~v~ 184 (392) +..+...-. .+-+.+...+....... .....+++..+-..-..++.+ ...-+.. .|.+| .+ ...+.-..-+ T Consensus 1 m~it~~~l~-~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~--lg~~p~l~e~~Ge---~~-~~~l~~~~~~ 73 (302) T protein:vir:10 1 MLINKQSLN-AAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKW--LSTFPKMRRWIGA---KV-VKNLKAYKYV 73 (302) T ss_pred CcccHHHHH-HHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeecee--cCCCCCccccccc---ee-ecccccccee Confidence 111110000 00011111121221111 223444544432222222322 2222332 44444 33 2234445678 Q ss_pred echhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------------- Q lcl|Aclame:pro 185 YAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ-------------------------- 238 (392) Q Consensus 185 ~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~~~~~-------------------------- 238 (392) +..+++...+.||++.+.|...++.+-+.+.+..+.++..+..++.-+..+..+ T Consensus 74 i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~ 153 (302) T protein:vir:10 74 VENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTA 153 (302) T ss_pred EEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccch Confidence 999999999999999999988888999999999999999988766643321100 Q ss_pred ---------chhhHHHHHHHHHHHhhhc----ccCCceEEEcHHHHHHHHHhhccCCceeecccccCCcccceecccceE Q lcl|Aclame:pro 239 ---------AIKSLDDIKDVLNVKLDPA----ISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVV 305 (392) Q Consensus 239 ---------~~~~~d~~~~~~~~~~~~~----~~~~a~~v~~~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~ 305 (392) ...+++....+|....... ...+..+|..|.....-+++-.+ ++.. .+....+.|. +. T Consensus 154 ~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~~------~g~~Np~~g~--~~ 224 (302) T protein:vir:10 154 PLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKLA------DNTPNPYVGT--AE 224 (302) T ss_pred hhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cccC------CCCcceeccc--eE Confidence 0111222223332222111 12334567776666555544211 2210 1111122332 22 Q ss_pred EecCcccccccccCCcceEEEEehhhc--eeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEecccceE--EEEe Q lcl|Aclame:pro 306 VVSNRFLKSKGTTAKKAPLIIGDLKEA--IVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAV--YGEI 381 (392) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~Gd~~~~--~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~af~--~l~~ 381 (392) ++.++.+.+ +..=+++.|.+.. +.+-.+++..++..+ .|..+.+.++.+..+|+.-+-.-+|. .+-+ T Consensus 225 ~vv~p~L~s-----~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~----~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~ 295 (302) T protein:vir:10 225 LVVDGRIES-----DTAWFLLDTTKPVKPFIFQPRKQPEFVSQV----NLDSDDVFNLRKLKFGAEARAAAGYGFWQLAY 295 (302) T ss_pred EEEeeccCC-----CCceEEEecCCccceEEEcCccccEEEecc----CCCCCceEEEEEEEEeeeeeeecchhhhhhhh Confidence 333333322 1223455554431 222334455554422 46677788888777775333322221 1111 Q ss_pred cccCCCC Q lcl|Aclame:pro 382 DLSAPVE 388 (392) Q Consensus 382 ~~~a~~~ 388 (392) +....++ T Consensus 296 ~s~g~~~ 302 (302) T protein:vir:10 296 GSTGTGA 302 (302) T ss_pred ccCccCC Confidence 2111111 No 218 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=33.30 E-value=1.4 Score=19.84 Aligned_cols=272 Identities=10% Similarity=0.006 Sum_probs=99.3 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhh--hhcceeeccCCcceeEEEeecC--Ccccccccccccccccccccee Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALE--QYVTVEPVRTRSGSRVLEKNSD--MIPFAEITEMGEIPETDNPKFS 181 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~--~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~ 181 (392) |.. --.++-|.++...|..+.....+++ .+.....+.+. ++....... ...+.+++.+.+.+-...-.+. T Consensus 1 M~~----i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~--~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~ 74 (348) T protein:vir:27 1 MGL----IYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGT--KLSYIKGASGQSVALKAAAFDTNVTIRDRVSAE 74 (348) T ss_pred Ccc----hhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccce--eEEEEeeccCceeEeeeecCCCCcceeccccee Confidence 100 0112233344333333333333222 23343333322 222222221 1224455554443322233456 Q ss_pred eEEechhheeeehhhHHHHHhhh-------HHH----HHHHHH---HHHHHHHHHHHHHHHhhccc-------------- Q lcl|Aclame:pro 182 NVQYAVKDRAGILPLSRSLLQDS-------DQN----ILKYVT---KWLGKKSKVTRNVLILGVIE-------------- 233 (392) Q Consensus 182 ~v~~~~~~i~~~~~iS~e~l~ds-------~~~----l~~~v~---~~l~~~~~~~~d~~~~~~~~-------------- 233 (392) ..++.+-.++....++..-+... ..+ +...+. ..+.+.+...++..+...+. T Consensus 75 ~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~ 154 (348) T protein:vir:27 75 MHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEE Confidence 66666666665555554332221 011 111111 22233344444432222110 Q ss_pred ---------------cccccchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHHH---hhccC----Cc-eeecccc Q lcl|Aclame:pro 234 ---------------KLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK---LKDKD----GK-YILQSDP 290 (392) Q Consensus 234 ---------------~~~~~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~~---lkd~~----g~-~l~~~~~ 290 (392) ..+..+...+.++.+... .+...+.....++|++..|..|++ +++.- +. ....+.. T Consensus 155 vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~-~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~ 233 (348) T protein:vir:27 155 IDYGVKPDHKKQVSKSWAEPGATPLADLEDAIE-TARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAE 233 (348) T ss_pred EeecCCcccceeeeeccCCCCCCHHHHHHHHHH-HHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHH Confidence 001112233456665543 334445566788999999999863 33221 11 0111111 Q ss_pred cCCcccceecccceEEecCccccccccc---CCcceEEEEehhh-ceeee--e----------ccceEEEEecc--chhh Q lcl|Aclame:pro 291 TQKNKKLFAGTNPVVVVSNRFLKSKGTT---AKKAPLIIGDLKE-AIVLF--K----------REDMELASTDV--GGKA 352 (392) Q Consensus 291 ~~~~~~~~~g~~pv~~~~~~~~~~~~~~---~~~~~~~~Gd~~~-~~~~~--~----------~~~~~~~~~~~--~~~~ 352 (392) ... -...+|++.+.+.+..+....+.. -....++++.... +...+ . .....+...+. .-.. T Consensus 234 ~~~-~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (348) T protein:vir:27 234 LEN-YIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTT 312 (348) T ss_pred HHH-HHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEe Confidence 110 011234455555444443222211 1122233222111 11111 0 00000000000 0000 Q ss_pred h-hcC--ceeEEEEEeeCcEEecccceEEEEecccC Q lcl|Aclame:pro 353 F-TRN--TLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) Q Consensus 353 f-~~~--~~~~~~~~r~~~~v~~~~af~~l~~~~~a 385 (392) | +.+ ...+.+..+.=-.+.+|+++.++++-++- T Consensus 313 ~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 313 TKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred eecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 0 011 23334444444456678899888866544 No 219 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=32.86 E-value=1.4 Score=19.79 Aligned_cols=306 Identities=13% Similarity=0.062 Sum_probs=116.3 Q ss_pred hhccccccccccchhhHHHHHHHHHHhcchhhHHHHHHHHhhhhhhhhccccccccceecchhhhhHHHHhHHhhh--hh Q lcl|Aclame:pro 59 RNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFD--AL 136 (392) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~iP~~~~~~ii~~~~~~~--~l 136 (392) .......+.-........-+++.|.+..+ ...+..+-.+|+++-=+.+..+|..+..... .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~ag----------------y~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~ 64 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTG----------------YGITPDTQTDAGALRREFLDDQISMLTWTENDLTF 64 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcC----------------cccCCccccCcchhhhhhhhhhhheeeecccchhh Confidence 00000000000101111112222222111 1112223344566655556555544433222 23 Q ss_pred hhhcceeeccCCcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHH Q lcl|Aclame:pro 137 EQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTK 214 (392) Q Consensus 137 ~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~ 214 (392) +.-+...+..+....|..-..-++ ....++.|++... .+++.+.+.....+-++.-..+|.-+ +..+..+..+...+ T Consensus 65 ~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~-~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~ 143 (468) T protein:vir:63 65 YKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTD 143 (468) T ss_pred hhhcccchhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHH Confidence 444455555554444443333333 5567899987655 67899999999999998866666544 23334566778888 Q ss_pred HHHHHHHHHHHHHHhhccccccc----cchhhHHHHHHHHH---------------------HHhhhcccCCceEEEcHH Q lcl|Aclame:pro 215 WLGKKSKVTRNVLILGVIEKLTK----QAIKSLDDIKDVLN---------------------VKLDPAISPNAILLTNQD 269 (392) Q Consensus 215 ~l~~~~~~~~d~~~~~~~~~~~~----~~~~~~d~~~~~~~---------------------~~~~~~~~~~a~~v~~~~ 269 (392) +-...++.+++.+.+.|...-.. .-+.-.|.|..++. ...--+|....-++|+.- T Consensus 144 ~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~ 223 (468) T protein:vir:63 144 DAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVG 223 (468) T ss_pred HHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchh Confidence 88888899999998887655421 11122233322211 111112333333555555 Q ss_pred HHHHHHHhhccCCceeecccccCCcccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccce-------E Q lcl|Aclame:pro 270 GFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM-------E 342 (392) Q Consensus 270 ~~~~L~~lkd~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~-------~ 342 (392) +.+.|..--- ..++.+..+-. .....|. +|. .+++.-+.-.-....++|+... .-.+..+. . T Consensus 224 v~a~~~~~~L-~~q~~v~~~n~---~~~~~G~-~v~----g~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~~Apsp~~ 292 (468) T protein:vir:63 224 VQADFVNQQL-SKQTQLVRDNG---NNVSVGF-NIQ----GFHSARGFIKLHGSTVMENEQI--LDERILALPTAPQPAK 292 (468) T ss_pred HHhhhhhhhc-CceEEEEcCCC---Cceeeee-ccc----ceecceeeeeecCceeeccccC--CCcccccccccccCCc Confidence 5554411000 01111111000 0011111 110 0000000000001112222211 00000000 0 Q ss_pred EEEecc--chhhhhc---CceeEEEEEeeCcEEecccce-----------EEEEecc-cCCCCCCCC Q lcl|Aclame:pro 343 LASTDV--GGKAFTR---NTLDLRAIQRDDVQMWDNEAA-----------VYGEIDL-SAPVEQPQG 392 (392) Q Consensus 343 ~~~~~~--~~~~f~~---~~~~~~~~~r~~~~v~~~~af-----------~~l~~~~-~a~~~~~~~ 392 (392) +..+.. ....|.. ..+.+++...-+.+--.|... +.|+++. +.+..+|.. T Consensus 293 vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~y 359 (468) T protein:vir:63 293 VTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQF 359 (468) T ss_pred cceeeecccCCcccCCCcceEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceE Confidence 000000 0001110 113333333322222222222 2334432 223333433 No 220 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=32.00 E-value=1.5 Score=19.69 Aligned_cols=273 Identities=10% Similarity=-0.027 Sum_probs=99.6 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc-------CCcceeEEEeecCCcccccc-cc-cccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR-------TRSGSRVLEKNSDMIPFAEI-TE-MGEIPETD 176 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~-------~~~~~~~~~~~~~~~~~~~~-~E-~~~~~~~~ 176 (392) |. .+. .-.||+-+..+.++.+++..++-++++.-.-. +++.+++.| . ....... .. +.... .+ T Consensus 1 MA-N~l---lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p--~-~~~v~d~~~~~~~~~~-~~ 72 (423) T protein:vir:35 1 MA-NNL---ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRP--H-QFKSERTETGDITGKD-KN 72 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeC--C-cceeecccCcCCCCcc-cc Confidence 11 111 11379999999999999999988887653211 122223322 2 1111111 11 11111 12 Q ss_pred ccceee--EEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----cc-cc-cchhhHHHHH Q lcl|Aclame:pro 177 NPKFSN--VQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-----KL-TK-QAIKSLDDIK 247 (392) Q Consensus 177 ~~~~~~--v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~-----~~-~~-~~~~~~d~~~ 247 (392) +..-.+ +++.-+|... +.++.+-...+..+|+.++.... .+++..+|..+....- .. ++ .....|++++ T Consensus 73 ~~~e~~v~l~id~~k~~a-~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~ 150 (423) T protein:vir:35 73 GLFSAKATGKVGKYITVA-VEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVA 150 (423) T ss_pred ccccceeeEEeccceecc-ceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccccCCcchHHHHH Confidence 222233 4444444433 34554444434556777666554 5566667666544211 11 11 1124577877 Q ss_pred HHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhc--cCCceeecccccCCc-ccceecccceEEecCcccccccccCCcc Q lcl|Aclame:pro 248 DVLNVKLDPAISPN--AILLTNQDGFNYLDKLKD--KDGKYILQSDPTQKN-KKLFAGTNPVVVVSNRFLKSKGTTAKKA 322 (392) Q Consensus 248 ~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd--~~g~~l~~~~~~~~~-~~~~~g~~pv~~~~~~~~~~~~~~~~~~ 322 (392) ++- ..|+....+. -..|++|..+..|.+-.. ....-.-...+..+. ...+.|+. |+. ++. +|.....+... T Consensus 151 ~a~-~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFd-v~~-Snn-vp~~T~gt~~~ 226 (423) T protein:vir:35 151 QTA-SFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIR-ALM-SNG-LASRKQGDFDG 226 (423) T ss_pred HHH-HHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceE-EEE-cCC-Ccccccccccc Confidence 754 4566655554 355999999888742110 000000111122222 24566653 333 333 33221111111 Q ss_pred eEEE-Eehh-hceeeeecc----ceEEEEeccchhhhhcCceeEEEEEee---CcEE------ecccceEEE-------- Q lcl|Aclame:pro 323 PLII-GDLK-EAIVLFKRE----DMELASTDVGGKAFTRNTLDLRAIQRD---DVQM------WDNEAAVYG-------- 379 (392) Q Consensus 323 ~~~~-Gd~~-~~~~~~~~~----~~~~~~~~~~~~~f~~~~~~~~~~~r~---~~~v------~~~~af~~l-------- 379 (392) ...+ +-.. ......... ++...+...++.....|.+.|-+..-+ ...+ .++.=|+++ T Consensus 227 ~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:35 227 AITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTAS 306 (423) T ss_pred ceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEecccccccc Confidence 1111 0000 000000000 001011000010001112222111000 0000 000001110 Q ss_pred ---EecccCCCCCCCC Q lcl|Aclame:pro 380 ---EIDLSAPVEQPQG 392 (392) Q Consensus 380 ---~~~~~a~~~~~~~ 392 (392) +++-.++...|+. T Consensus 307 g~~~v~i~p~~~~~~~ 322 (423) T protein:vir:35 307 GDVTVKLSGVPIYDEK 322 (423) T ss_pred CceeEEccccccccCC Confidence 1111111111111 No 221 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=27.58 E-value=1.8 Score=19.14 Aligned_cols=271 Identities=11% Similarity=0.012 Sum_probs=104.9 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhh-hccee---------ecc---CCcceeEEEeecCCcccccccccc-- Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQ-YVTVE---------PVR---TRSGSRVLEKNSDMIPFAEITEMG-- 170 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~-l~~~~---------~~~---~~~~~~~~~~~~~~~~~~~~~E~~-- 170 (392) +..+....+.......++..+.....+.++..+ |...- ... +++.++......+ ...+.++. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~---g~gv~Gd~~l 77 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLR---GKPTYGDARV 77 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecc---cCCcccCcee Confidence 222222222222233445555555555554443 32210 000 1111111111111 11222211 Q ss_pred ccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|Aclame:pro 171 EIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK--------------- 234 (392) Q Consensus 171 ~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~--------------- 234 (392) +-.+. ..+|.+-++.+..+..-+.....+ -+-+.++|...-++.|..-+.+..|..++.-+.. T Consensus 78 eGnee-~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~ 156 (364) T protein:vir:93 78 EGKEE-SLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFT 156 (364) T ss_pred ecccc-ceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcc Confidence 11222 345555555544444444332222 2234567777777777777776666654422110 Q ss_pred ------cccc--------------------chhhHHHHHHHHHHHhhhcccC--------------Cc--eEEEcHHHHH Q lcl|Aclame:pro 235 ------LTKQ--------------------AIKSLDDIKDVLNVKLDPAISP--------------NA--ILLTNQDGFN 272 (392) Q Consensus 235 ------~~~~--------------------~~~~~d~~~~~~~~~~~~~~~~--------------~a--~~v~~~~~~~ 272 (392) ..++ ...+++.+..+. ..++..... .. +++|||.-+. T Consensus 157 ~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~-~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~ 235 (364) T protein:vir:93 157 GYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAV-EKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQAT 235 (364) T ss_pred cccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHH-HHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhh Confidence 0000 011222222222 111111100 01 6789999998 Q ss_pred HHHHhhc--------------cCCceeecccccCCcccceecccceEEecCcccccccccCCcc----eEEEEehhhcee Q lcl|Aclame:pro 273 YLDKLKD--------------KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKA----PLIIGDLKEAIV 334 (392) Q Consensus 273 ~L~~lkd--------------~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~~~~----~~~~Gd~~~~~~ 334 (392) .|+.-+| ...+|||.. ...+|++-+|+.-...+-.+.....+.. .+++|-=.-++. T Consensus 236 ~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G------~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a 309 (364) T protein:vir:93 236 DMRTAAGGTWIDFQKAAAAAEGRNNPIFKG------GLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIA 309 (364) T ss_pred hhhhcCCHHHHHHHHHhhhcccccCCceec------CeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEE Confidence 8874332 122455542 2234544333221122111222222211 244553211111 Q ss_pred eeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEe--cccceEEEEecccCCCCC Q lcl|Aclame:pro 335 LFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMW--DNEAAVYGEIDLSAPVEQ 389 (392) Q Consensus 335 ~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~--~~~af~~l~~~~~a~~~~ 389 (392) .+...++...+.++.-+ + .|...+-+..-+|++-. +.+=|-++.+.++++.-+ T Consensus 310 ~g~~~g~~~~w~Ee~~D-~-gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 310 YGTANGLRFDWEETVKD-Y-GNEPAIAAGFIAGMKKARFNNKDFGVISIDTAAKKHS 364 (364) T ss_pred eecCCCCCceeeecccC-C-CCchhhhhhhHhhhhhcccCCccceEEEecccccccC Confidence 11224444444444321 1 23344444444444333 234566677777777666 No 222 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=26.26 E-value=2 Score=18.97 Aligned_cols=273 Identities=11% Similarity=-0.019 Sum_probs=100.5 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhcceeecc-------CCcceeEEEeecCCccc-cccc-ccccccccc Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVR-------TRSGSRVLEKNSDMIPF-AEIT-EMGEIPETD 176 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~~~~-------~~~~~~~~~~~~~~~~~-~~~~-E~~~~~~~~ 176 (392) |. .+. --.+|+-+..+.++.+++..++.++++.-.-. +++.+++.|. ...+ .+-. .+.... .+ T Consensus 1 Ma-N~l---lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~---~~~~~~~~~~~~~~~~-~~ 72 (423) T protein:vir:17 1 MP-NNL---DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH---QFSSLRTPTGDISGQN-KN 72 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCC---cceeecccCcccCCcc-cC Confidence 11 111 11378888889999999998888876653211 2222223221 1111 1111 111111 11 Q ss_pred cccee--eEEechhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----ccc--cchhhHHHHH Q lcl|Aclame:pro 177 NPKFS--NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEK-----LTK--QAIKSLDDIK 247 (392) Q Consensus 177 ~~~~~--~v~~~~~~i~~~~~iS~e~l~ds~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~-----~~~--~~~~~~d~~~ 247 (392) +..=. .++++-+|...+--=..|.. ....+++.++... .++++..+|..+...... .+. +....|++++ T Consensus 73 ~l~e~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~ 150 (423) T protein:vir:17 73 NLISGKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVA 150 (423) T ss_pred ccccceeEEEeeceeeeeeeecHHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHH Confidence 22212 35555555555443344444 3445677666555 577888888766443111 111 1123577777 Q ss_pred HHHHHHhhhcccCC--ceEEEcHHHHHHHHHhhc--cCCceeecccccCCc-ccceecccceEEecCcccccc--cccCC Q lcl|Aclame:pro 248 DVLNVKLDPAISPN--AILLTNQDGFNYLDKLKD--KDGKYILQSDPTQKN-KKLFAGTNPVVVVSNRFLKSK--GTTAK 320 (392) Q Consensus 248 ~~~~~~~~~~~~~~--a~~v~~~~~~~~L~~lkd--~~g~~l~~~~~~~~~-~~~~~g~~pv~~~~~~~~~~~--~~~~~ 320 (392) ++ ...|+....+. -..|++|..+..|.+-.. ......-...+-.+. ...+.|+. |+. ++. +|.. +...+ T Consensus 151 ~a-~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFd-vy~-Snn-ip~~T~gt~~~ 226 (423) T protein:vir:17 151 QT-ASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIR-ALM-SNG-LASRTQGAFGG 226 (423) T ss_pred HH-HHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceE-EEE-eCC-Cccccccceec Confidence 74 34566555553 467999999888753110 000001111122222 23566653 333 222 3322 11111 Q ss_pred cceEEEEehhhceeeee--ccceEE--EEeccchhhhhcCceeEEEE---EeeCcEEe------cccceE---------- Q lcl|Aclame:pro 321 KAPLIIGDLKEAIVLFK--REDMEL--ASTDVGGKAFTRNTLDLRAI---QRDDVQMW------DNEAAV---------- 377 (392) Q Consensus 321 ~~~~~~Gd~~~~~~~~~--~~~~~~--~~~~~~~~~f~~~~~~~~~~---~r~~~~v~------~~~af~---------- 377 (392) ....-.+..-.+..... ...+.+ .+...++..-..|.+.|-+. .+....+. .+.-|+ T Consensus 227 t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~ 306 (423) T protein:vir:17 227 TLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSS 306 (423) T ss_pred eeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEeccccccc Confidence 10000110000000000 000111 11000000000111111110 00011000 111111 Q ss_pred ---EEEeccc--CCCCC---------CCC Q lcl|Aclame:pro 378 ---YGEIDLS--APVEQ---------PQG 392 (392) Q Consensus 378 ---~l~~~~~--a~~~~---------~~~ 392 (392) .+++..+ ++++. |+. T Consensus 307 ~~~tv~i~p~~i~~~~~~~~~~v~a~~a~ 335 (423) T protein:vir:17 307 GDVTVTLSGVPIYDTTNPQYNSVSRQVAA 335 (423) T ss_pred CceEEEecCccccccCCcccccceecccC Confidence 1222211 11111 111 No 223 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=25.54 E-value=2 Score=18.88 Aligned_cols=322 Identities=11% Similarity=0.044 Sum_probs=106.2 Q ss_pred HHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhHHHHHHHHHHhcchhhHH Q lcl|Aclame:pro 13 EGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAE 92 (392) Q Consensus 13 ~~~~~e~~~~~~~~~~~~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 92 (392) ...++...+-+..-....++. ..-..+...++.+ +.-.+.+...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--------------------------------~~~~~~~~~~l~~-~gi~~~~~~~~~~ 47 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDL--------------------------------KNVTHEAVAALGR-IGLVFDHAVVQDQ 47 (382) T ss_pred CCCcceeeeecCCccccchhh--------------------------------hcccHHHHHHHhc-cccccCcccchhH Confidence 000000000000000000000 0000000000000 0000000000000 Q ss_pred HHHHHHhhh--hhhhhcc---ccccccceecchhh----hhHHHHhHHhhhhhhhhcceeeccCCc-ceeEEEeecCCcc Q lcl|Aclame:pro 93 EREFLEDDL--EQRAMSG---LTGEDGGLVIPQDI----QTQINELARSFDALEQYVTVEPVRTRS-GSRVLEKNSDMIP 162 (392) Q Consensus 93 ~~~~~~~~~--~~~a~~~---~~~~~gg~~iP~~~----~~~ii~~~~~~~~l~~l~~~~~~~~~~-~~~~~~~~~~~~~ 162 (392) -+....... ...++-. +-.+.++.-||.++ ...+++.+........++.+..++.-. ..+.+......+. T Consensus 48 ~~~~~~~~~~~~~~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~ 127 (382) T protein:vir:96 48 IKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGT 127 (382) T ss_pred hhhhhhhhhhhhhcccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccc Confidence 000000000 0011110 01111222256544 445677777777777777776543221 1233444444456 Q ss_pred cccccccccccccc-ccceeeEEechhheeeehhhH-HHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---- Q lcl|Aclame:pro 163 FAEITEMGEIPETD-NPKFSNVQYAVKDRAGILPLS-RSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEK---- 234 (392) Q Consensus 163 ~~~~~E~~~~~~~~-~~~~~~v~~~~~~i~~~~~iS-~e~l~ds--~~~l~~~v~~~l~~~~~~~~d~~~~~~~~~---- 234 (392) +.+.+-+++.|-.+ ...+.+.++ ..+.....++ .|+..-+ ..++.+--+....+++.+.+|+..+.|... T Consensus 128 A~~ygd~~D~Pl~d~~~~~~~r~v--~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~ 205 (382) T protein:vir:96 128 AVEYGDHTNIPLTSWNANFERRTI--VRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGN 205 (382) T ss_pred eEEeecccCCCccccccceeEEEE--EEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCc Confidence 66777777665432 123444443 4444445554 4444432 234455455555566666666655555210 Q ss_pred --------------ccc--c--chhh----HHHHHHHHHHHhh-h--cccCC---ceEEEcHHHHHHHHHhhccCCceee Q lcl|Aclame:pro 235 --------------LTK--Q--AIKS----LDDIKDVLNVKLD-P--AISPN---AILLTNQDGFNYLDKLKDKDGKYIL 286 (392) Q Consensus 235 --------------~~~--~--~~~~----~d~~~~~~~~~~~-~--~~~~~---a~~v~~~~~~~~L~~lkd~~g~~l~ 286 (392) .++ + ...+ ++|+..++..... . ....+ -.+++.|+.+..|.. .+..|.-++ T Consensus 206 ~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~-~n~~g~Tvl 284 (382) T protein:vir:96 206 RTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSV-TTPYGISVS 284 (382) T ss_pred ceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccc-cCccCccHH Confidence 000 0 1112 3444443332211 1 11122 257889988887743 223332222 Q ss_pred cccccCCcccceecccceEEecCcccccccccC--CcceE-EEEe-hh------h-ceeee-eccceEEEEeccchhhhh Q lcl|Aclame:pro 287 QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTA--KKAPL-IIGD-LK------E-AIVLF-KREDMELASTDVGGKAFT 354 (392) Q Consensus 287 ~~~~~~~~~~~~~g~~pv~~~~~~~~~~~~~~~--~~~~~-~~Gd-~~------~-~~~~~-~~~~~~~~~~~~~~~~f~ 354 (392) . -.... +|.+.++.-+-+...+... +...+ ++.+ +. . ....+ .+-.+....... ... T Consensus 285 ~-~lk~n-------~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~v---e~~ 353 (382) T protein:vir:96 285 D-WIEQT-------YPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGV---EKR 353 (382) T ss_pred H-HHHHh-------cCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccc---eee Confidence 1 00011 1112222211111111111 11111 1111 00 0 00000 000000000000 000 Q ss_pred cCceeEEEEEe-eCcEEecccceEEEE-e Q lcl|Aclame:pro 355 RNTLDLRAIQR-DDVQMWDNEAAVYGE-I 381 (392) Q Consensus 355 ~~~~~~~~~~r-~~~~v~~~~af~~l~-~ 381 (392) .-.+......| .|..+++|.||++++ + T Consensus 354 ~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 354 AKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred cceeEeccccceeeeEEEcchhhhhccCC Confidence 00111222233 455677899998776 4 No 224 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=22.95 E-value=2.4 Score=18.52 Aligned_cols=268 Identities=10% Similarity=0.047 Sum_probs=102.8 Q ss_pred hccccccccceecchhhhhHHHHhHHhhhhhhhhccee---ecc-CCcceeEEEeecCCcccccc-ccccccccccccce Q lcl|Aclame:pro 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PVR-TRSGSRVLEKNSDMIPFAEI-TEMGEIPETDNPKF 180 (392) Q Consensus 106 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~l~~~~---~~~-~~~~~~~~~~~~~~~~~~~~-~E~~~~~~~~~~~~ 180 (392) |. +. -.++.+...+.+.....+....|+... .+. .+..++.+++....+-..+- ...+-.+..-+.++ T Consensus 1 MA--~~-----n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~ 73 (299) T protein:vir:79 1 MA--AL-----NYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAW 73 (299) T ss_pred Cc--cc-----hhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcce Confidence 11 11 123556666666666665544433221 111 11224555554432222221 21122221112355 Q ss_pred eeEEechhheeeehhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhh-------ccccc----cccchhhHHHHH Q lcl|Aclame:pro 181 SNVQYAVKDRAGILPLSRSLLQDSD--QNILKYVTKWLGKKSKVTRNVLILG-------VIEKL----TKQAIKSLDDIK 247 (392) Q Consensus 181 ~~v~~~~~~i~~~~~iS~e~l~ds~--~~l~~~v~~~l~~~~~~~~d~~~~~-------~~~~~----~~~~~~~~d~~~ 247 (392) ...+++-.+.-.+.-=... .+.+. ..+...+.+.....+.-.+|+..+. ..+.. ..+....|+.+. T Consensus 74 ~t~~ldqdr~~~f~vD~~D-vdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~ 152 (299) T protein:vir:79 74 EPKVLTNQRKWSTLVHPAD-INQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFD 152 (299) T ss_pred eEEEeeccccceeccchhh-HHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHH Confidence 5566665554433211000 01111 1122223333333333333432221 11111 122344567777 Q ss_pred HHHHHHhhhcccC--CceEEEcHHHHHHHHHhhc--cCCceeecccccCCcccceecccceEEecCccccc----cc--- Q lcl|Aclame:pro 248 DVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD--KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKS----KG--- 316 (392) Q Consensus 248 ~~~~~~~~~~~~~--~a~~v~~~~~~~~L~~lkd--~~g~~l~~~~~~~~~~~~~~g~~pv~~~~~~~~~~----~~--- 316 (392) +++. .++....+ +.+++++|..+..|.+.+. ............++....+.|.+.+.|-++.+... .+ T Consensus 153 ~~~~-~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~ 231 (299) T protein:vir:79 153 KLME-KMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKV 231 (299) T ss_pred HHHH-HHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCccc Confidence 7664 44444333 3567899999998864321 01111111112233445577764433434333211 11 Q ss_pred -ccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEEeeCcEEeccc-ceEEEEecccCC Q lcl|Aclame:pro 317 -TTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE-AAVYGEIDLSAP 386 (392) Q Consensus 317 -~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~r~~~~v~~~~-af~~l~~~~~a~ 386 (392) ..+..-.++++..+..+.+.-...+.+ +.|.. .+.+-..+.-..+.|.-|.+.+ .-+++.++++-. T Consensus 232 ~~~ak~in~ii~~~~a~~~~~K~~~~~~-~~P~~---~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 232 GAGAKQIFMSLVHPSAIITPVSYQFSKL-DEPTA---VTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred cCcccccceEEEcCCeeeeeEeeeeEEe-ecCCC---CCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 112223356655432222222223333 23322 1222233333344555555432 223445554433 No 225 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=21.97 E-value=2.5 Score=18.39 Aligned_cols=296 Identities=11% Similarity=0.014 Sum_probs=119.2 Q ss_pred cchhhHHHHHHHHHHhcchhh-------HHHHHHHHhhhhhhh-hcc------ccccccceecchhhhhHHHHhHHhh-- Q lcl|Aclame:pro 70 VDGEMEYRDVFMKALRNKPLN-------AEEREFLEDDLEQRA-MSG------LTGEDGGLVIPQDIQTQINELARSF-- 133 (392) Q Consensus 70 ~~~~~~~~~a~~~~~~~~~~~-------~~~~~~~~~~~~~~a-~~~------~~~~~gg~~iP~~~~~~ii~~~~~~-- 133 (392) .....+-++.+.+.+-++... .+...........++ ++. .+-.+|+++--+.+..++..+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 222222233333333333211 111111111111111 221 2223445554444544443333322 Q ss_pred hhhhhhcceeeccCCcceeEEEeecCC-ccccccccccccccccccceeeEEechhheeeehhhHHHH-HhhhHHHHHHH Q lcl|Aclame:pro 134 DALEQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKY 211 (392) Q Consensus 134 ~~l~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~~~~~~~~~v~~~~~~i~~~~~iS~e~-l~ds~~~l~~~ 211 (392) -.+++-+...++.+....|..-...++ ....++.|++ .++.+++.+....+..+-++.-..+|..+ +.++..+.... T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~g-i~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~ 159 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIG-IGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKV 159 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhcccCcccccccccccc-cCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHH Confidence 223344455555544444433222232 3567889987 55577899999888888887666665443 34577788888 Q ss_pred HHHHHHHHHHHHHHHHHhhccccccc---cchhhHHHHHHHHHHHhhhcccCCceEEEcHHHHHHHH-HhhccCCc--ee Q lcl|Aclame:pro 212 VTKWLGKKSKVTRNVLILGVIEKLTK---QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD-KLKDKDGK--YI 285 (392) Q Consensus 212 v~~~l~~~~~~~~d~~~~~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~a~~v~~~~~~~~L~-~lkd~~g~--~l 285 (392) ..+.-...++.+++.+.+.|.....+ ..+...|.|+..|.. ++-....+. -+++..++... .+.-+-|. .+ T Consensus 160 ~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~--~NvIDarG~-~Ls~~~ln~aA~~i~~gfGt~TD~ 236 (514) T protein:vir:10 160 QEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAP--ENHIDLRGG-RLSPAALNMAARKIGEGFGTPTDA 236 (514) T ss_pred HHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcC--CCeEecCCC-CccHHHHhhhhhhhhcccCChhhe Confidence 88888889999999999988665444 344677887776621 111111111 33455544433 12222232 24 Q ss_pred ecccccCCc-ccceecccceEEecCcccccccccCCcceEEEEehhhceeeeeccceEEEEeccchhhhhcCceeEEEEE Q lcl|Aclame:pro 286 LQSDPTQKN-KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQ 364 (392) Q Consensus 286 ~~~~~~~~~-~~~~~g~~pv~~~~~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~ 364 (392) |.|...... ...+++.+.|.+-.+.- .+..|-.-+.+ +..++.+.+.-+ +.+.-+. T Consensus 237 ylp~~vka~f~~~~~~~qRV~~~~n~~-----------~~~~G~~v~~f-~s~~G~I~L~gs-----------~im~~~n 293 (514) T protein:vir:10 237 YMPIGIKADFVNQHLNGQRVMLPGQTG-----------GMTTGLDIDKF-LSAHGSIRIQGS-----------TIMDSDN 293 (514) T ss_pred eCchHHHHHHhhcccCcceEEeecCcc-----------ceeeeeeccce-eEeccceeecCC-----------eeecccc Confidence 443322221 12233333332211100 00111110000 111222221110 1111111 Q ss_pred eeCcEEe-cccce----EEEEeccc-------CCCCCCCC Q lcl|Aclame:pro 365 RDDVQMW-DNEAA----VYGEIDLS-------APVEQPQG 392 (392) Q Consensus 365 r~~~~v~-~~~af----~~l~~~~~-------a~~~~~~~ 392 (392) +++.... .|.|= +.++++.- ++.+...| T Consensus 294 ~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g 333 (514) T protein:vir:10 294 KLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKG 333 (514) T ss_pred cCccCCccCCcCCCCCcceEEEecCcccccCccccccccc Confidence 1111111 01000 00000000 00000000 Done!