Query         lcl|Aclame:protein:vir:9643|NCBI_annot:major coat protein|genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724
Match_columns 377
No_of_seqs    135 out of 729
Neff          9.2
Searched_HMMs 1612
Date          Sat Nov 30 11:12:40 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_33 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_33_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:98635 Length: 377  100.0 4.2E-92 2.6E-95  521.5  40.6  377    1-377     1-377 (377)
  2 protein:vir:9643 Length: 377 # 100.0 1.2E-91 7.5E-95  519.0  40.2  377    1-377     1-377 (377)
  3 protein:vir:100632 Length: 381 100.0 1.7E-84 1.1E-87  479.8  37.5  368    1-377     1-368 (381)
  4 protein:vir:9509 Length: 381 # 100.0   7E-83 4.3E-86  471.0  37.6  368    1-377     1-370 (381)
  5 protein:vir:101291 Length: 381 100.0   7E-83 4.3E-86  471.0  37.6  368    1-377     1-370 (381)
  6 protein:vir:78350 Length: 383  100.0 4.1E-82 2.5E-85  466.8  37.8  370    1-377     1-375 (383)
  7 protein:vir:95963 Length: 395  100.0 9.6E-79 5.9E-82  448.3  37.6  366    1-377     4-376 (395)
  8 protein:vir:4092 Length: 390 # 100.0 8.4E-72 5.2E-75  410.3  37.2  360    1-377     1-368 (390)
  9 protein:vir:95376 Length: 425  100.0   1E-63 6.2E-67  366.0  29.8  351    1-377    19-421 (425)
 10 protein:vir:80128 Length: 466  100.0 2.9E-63 1.8E-66  363.5  32.3  369    1-377    36-448 (466)
 11 protein:vir:4456 Length: 401 # 100.0   4E-60 2.5E-63  346.2  33.9  357    1-377     1-401 (401)
 12 protein:vir:100247 Length: 425 100.0 6.1E-59 3.8E-62  339.7  33.0  357    1-377    21-424 (425)
 13 protein:vir:485 Length: 407 #  100.0 5.3E-58 3.3E-61  334.6  32.9  356    1-377     1-400 (407)
 14 protein:vir:1328 Length: 392 # 100.0 6.2E-58 3.8E-61  334.2  29.3  348    1-377     1-391 (392)
 15 protein:vir:6242 Length: 390 # 100.0 1.8E-57 1.1E-60  331.7  28.3  346    1-377     1-389 (390)
 16 protein:vir:7855 Length: 497 # 100.0   4E-56 2.5E-59  324.3  31.2  369    1-377     1-493 (497)
 17 protein:vir:101650 Length: 497 100.0   4E-56 2.5E-59  324.3  31.2  369    1-377     1-493 (497)
18 protein:vir:4511 Length: 409 # 100.0 8.2E-56 5.1E-59 322.6 28.9 345 1-377 1-406 (409) 19 protein:vir:6212 Length: 434 # 100.0 5.8E-55 3.6E-58 317.9 30.8 344 1-377 1-433 (434) 20 protein:vir:105038 Length: 428 100.0 2.6E-54 1.6E-57 314.3 33.4 353 1-377 1-428 (428) 21 protein:vir:1433 Length: 435 # 100.0 2.6E-54 1.6E-57 314.4 32.9 349 3-377 1-433 (435) 22 protein:vir:80376 Length: 435 100.0 4.9E-54 3E-57 312.9 31.9 349 3-377 1-433 (435) 23 protein:vir:8102 Length: 543 # 100.0 2.9E-54 1.8E-57 314.1 29.1 347 1-377 140-542 (543) 24 protein:vir:10364 Length: 390 100.0 2.2E-53 1.4E-56 309.2 33.7 342 1-375 1-390 (390) 25 protein:vir:81070 Length: 390 100.0 1.8E-52 1.1E-55 304.3 33.3 342 1-375 1-390 (390) 26 protein:vir:100135 Length: 418 100.0 4.2E-53 2.6E-56 307.7 29.2 344 1-377 17-415 (418) 27 protein:vir:97053 Length: 390 100.0 2.4E-52 1.5E-55 303.6 33.2 343 1-375 1-390 (390) 28 protein:vir:81160 Length: 371 100.0 3.2E-52 2E-55 302.9 33.5 331 1-377 1-371 (371) 29 protein:vir:4339 Length: 395 # 100.0 3E-52 1.9E-55 303.0 33.1 347 1-377 1-395 (395) 30 protein:vir:2685 Length: 387 # 100.0 8.1E-53 5E-56 306.2 29.4 334 1-377 1-381 (387) 31 protein:vir:96978 Length: 387 100.0 8.1E-53 5E-56 306.2 29.4 334 1-377 1-381 (387) 32 protein:vir:94424 Length: 387 100.0 8.1E-53 5E-56 306.2 29.4 334 1-377 1-381 (387) 33 protein:vir:78640 Length: 352 100.0 9.1E-54 5.6E-57 311.4 23.9 329 5-377 1-346 (352) 34 protein:vir:93881 Length: 387 100.0 1.8E-52 1.1E-55 304.2 30.7 334 1-377 1-381 (387) 35 protein:vir:81227 Length: 413 100.0 1.4E-52 8.7E-56 304.9 28.7 346 1-377 1-410 (413) 36 protein:vir:4953 Length: 397 # 100.0 9E-52 5.6E-55 300.4 32.7 329 1-377 1-385 (397) 37 protein:vir:1268 Length: 397 # 100.0 3.1E-52 1.9E-55 303.0 30.1 331 1-377 1-397 (397) 38 protein:vir:9361 Length: 402 # 100.0 2E-52 1.2E-55 304.0 27.7 334 1-377 16-396 (402) 39 protein:vir:102119 Length: 404 100.0 7.7E-52 4.8E-55 300.8 29.2 344 1-377 1-400 (404) 40 protein:vir:104256 Length: 458 100.0 4.1E-51 2.6E-54 296.8 
33.2 350 1-377 24-458 (458) 41 protein:vir:96762 Length: 632 100.0 5.7E-52 3.5E-55 301.5 28.0 343 1-376 245-632 (632) 42 protein:vir:102873 Length: 392 100.0 1.8E-51 1.1E-54 298.7 29.2 331 1-377 1-384 (392) 43 protein:vir:102082 Length: 392 100.0 1.8E-51 1.1E-54 298.7 29.2 331 1-377 1-384 (392) 44 protein:vir:107593 Length: 392 100.0 1.8E-51 1.1E-54 298.7 29.2 331 1-377 1-384 (392) 45 protein:vir:105004 Length: 392 100.0 1.8E-51 1.1E-54 298.7 29.2 331 1-377 1-384 (392) 46 protein:vir:1025 Length: 408 # 100.0 3.1E-51 2E-54 297.5 29.4 332 1-377 3-393 (408) 47 protein:vir:1886 Length: 385 # 100.0 1.5E-50 9.6E-54 293.7 32.0 342 1-377 1-384 (385) 48 protein:vir:191 Length: 385 # 100.0 1.5E-50 9.6E-54 293.7 32.0 342 1-377 1-384 (385) 49 protein:vir:2430 Length: 318 # 100.0 1.5E-51 9.4E-55 299.2 25.9 282 61-377 1-313 (318) 50 protein:vir:4997 Length: 397 # 100.0 7.6E-51 4.7E-54 295.4 29.7 331 1-377 1-385 (397) 51 protein:vir:4830 Length: 397 # 100.0 4.1E-50 2.5E-53 291.4 33.2 331 1-377 1-385 (397) 52 protein:vir:81100 Length: 415 100.0 3.5E-50 2.2E-53 291.7 32.2 343 1-377 1-404 (415) 53 protein:vir:79987 Length: 415 100.0 3.5E-50 2.2E-53 291.7 32.2 343 1-377 1-404 (415) 54 protein:vir:98339 Length: 415 100.0 3.5E-50 2.2E-53 291.7 32.2 343 1-377 1-404 (415) 55 protein:vir:4600 Length: 415 # 100.0 8.7E-50 5.4E-53 289.5 34.1 343 1-377 1-404 (415) 56 protein:vir:4700 Length: 415 # 100.0 8.7E-50 5.4E-53 289.5 34.1 343 1-377 1-404 (415) 57 protein:vir:41 Length: 299 # N 100.0 1.3E-51 7.8E-55 299.6 23.4 271 62-377 1-298 (299) 58 protein:vir:4226 Length: 326 # 100.0 1.5E-51 9.1E-55 299.3 23.2 292 56-377 1-323 (326) 59 protein:vir:7771 Length: 330 # 100.0 3.9E-51 2.4E-54 297.0 25.1 283 67-377 1-323 (330) 60 protein:vir:9410 Length: 415 # 100.0 9.3E-50 5.7E-53 289.4 32.2 343 1-377 1-404 (415) 61 protein:vir:5739 Length: 366 # 100.0 9.4E-52 5.8E-55 300.3 21.2 338 1-377 1-366 (366) 62 protein:vir:7409 Length: 408 # 100.0 2.8E-50 1.7E-53 292.3 28.6 330 1-377 1-393 (408) 63 
protein:vir:3991 Length: 404 # 100.0 6.3E-50 3.9E-53 290.3 29.7 330 1-377 1-393 (404) 64 protein:vir:93616 Length: 645 100.0 2.3E-49 1.4E-52 287.3 30.3 342 1-377 193-645 (645) 65 protein:vir:9704 Length: 394 # 100.0 3.8E-49 2.4E-52 286.0 31.2 324 1-377 2-390 (394) 66 protein:vir:97148 Length: 324 100.0 7.8E-50 4.8E-53 289.8 26.6 292 35-377 1-315 (324) 67 protein:vir:3870 Length: 400 # 100.0 2.4E-49 1.5E-52 287.2 28.3 327 1-377 14-399 (400) 68 protein:vir:80684 Length: 315 100.0 9.7E-50 6E-53 289.3 25.1 271 79-377 1-306 (315) 69 protein:vir:95763 Length: 297 100.0 8.9E-50 5.5E-53 289.5 24.3 274 67-377 1-296 (297) 70 protein:vir:94142 Length: 304 100.0 1.5E-49 9.1E-53 288.3 25.4 278 67-376 1-304 (304) 71 protein:vir:105905 Length: 304 100.0 1.5E-49 9.1E-53 288.3 25.4 278 67-376 1-304 (304) 72 protein:vir:8420 Length: 477 # 100.0 4.1E-49 2.5E-52 285.9 27.6 353 1-377 1-471 (477) 73 protein:vir:100172 Length: 394 100.0 1.1E-48 6.8E-52 283.5 29.9 328 1-377 1-384 (394) 74 protein:vir:104085 Length: 320 100.0 8E-50 5E-53 289.7 22.3 286 61-377 1-318 (320) 75 protein:vir:8187 Length: 311 # 100.0 2.3E-49 1.4E-52 287.2 24.7 268 81-377 1-310 (311) 76 protein:vir:3845 Length: 395 # 100.0 6.6E-49 4.1E-52 284.7 26.9 330 1-377 1-383 (395) 77 protein:vir:94673 Length: 419 100.0 1.1E-48 7.1E-52 283.4 28.1 350 1-377 1-417 (419) 78 protein:vir:78223 Length: 333 100.0 3.3E-49 2E-52 286.4 25.1 284 72-377 1-332 (333) 79 protein:vir:100884 Length: 389 100.0 1.9E-48 1.2E-51 282.2 28.9 327 1-377 1-382 (389) 80 protein:vir:96392 Length: 324 100.0 1E-48 6.3E-52 283.7 26.8 290 42-377 1-315 (324) 81 protein:vir:78830 Length: 324 100.0 1E-48 6.3E-52 283.7 26.8 290 42-377 1-315 (324) 82 protein:vir:9309 Length: 324 # 100.0 9.6E-49 6E-52 283.8 26.2 290 42-377 1-315 (324) 83 protein:vir:78523 Length: 338 100.0 1.2E-48 7.3E-52 283.4 25.0 289 66-377 1-335 (338) 84 protein:vir:2344 Length: 397 # 100.0 8.1E-49 5E-52 284.2 23.9 275 65-377 1-306 (397) 85 protein:vir:2504 Length: 305 # 100.0 9.3E-49 
5.8E-52 283.9 24.1 277 79-377 1-298 (305) 86 protein:vir:99749 Length: 324 100.0 3.6E-48 2.2E-51 280.7 26.5 292 35-377 1-315 (324) 87 protein:vir:103955 Length: 324 100.0 3.9E-48 2.4E-51 280.5 26.2 292 35-377 1-315 (324) 88 protein:vir:101607 Length: 379 100.0 1.6E-47 1E-50 277.1 29.5 333 1-377 1-379 (379) 89 protein:vir:96223 Length: 324 100.0 4.7E-48 2.9E-51 280.1 26.5 282 35-377 1-315 (324) 90 protein:vir:1383 Length: 421 # 100.0 3.2E-48 2E-51 280.9 23.9 331 1-377 3-383 (421) 91 protein:vir:962 Length: 397 # 100.0 1.3E-46 7.8E-50 272.2 26.5 326 1-377 15-397 (397) 92 protein:vir:9574 Length: 300 # 100.0 1.5E-47 9.5E-51 277.2 19.6 267 79-377 1-300 (300) 93 protein:vir:1084 Length: 437 # 100.0 2.6E-46 1.6E-49 270.5 24.4 329 1-377 10-427 (437) 94 protein:vir:9759 Length: 303 # 100.0 3.8E-46 2.4E-49 269.6 25.3 265 81-377 1-303 (303) 95 protein:vir:4856 Length: 293 # 100.0 5.2E-47 3.2E-50 274.3 20.1 258 75-377 1-281 (293) 96 protein:vir:1638 Length: 298 # 100.0 2.3E-46 1.4E-49 270.8 20.7 268 83-376 1-298 (298) 97 protein:vir:94771 Length: 298 100.0 1.6E-44 1E-47 260.6 25.1 265 83-376 1-298 (298) 98 protein:vir:99920 Length: 311 100.0 7.3E-45 4.6E-48 262.5 19.9 271 79-376 1-311 (311) 99 protein:vir:4197 Length: 314 # 100.0 1.4E-38 8.9E-42 228.1 21.4 281 68-377 1-314 (314) 100 protein:vir:4159 Length: 315 # 100.0 4E-37 2.5E-40 220.1 20.9 279 66-374 1-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 4.2E-36 2.6E-39 214.5 22.3 290 62-377 1-312 (321) 102 protein:vir:97397 Length: 517 100.0 2.3E-31 1.4E-34 188.6 24.4 340 1-377 131-517 (517) 103 protein:vir:4074 Length: 480 # 99.9 3.4E-28 2.1E-31 171.2 11.1 321 1-377 118-477 (480) 104 protein:vir:9820 Length: 272 # 99.9 3.3E-26 2E-29 160.3 19.9 253 79-377 1-269 (272) 105 protein:vir:3033 Length: 272 # 99.9 3.3E-26 2E-29 160.3 19.9 253 79-377 1-269 (272) 106 protein:vir:93742 Length: 274 99.7 4.9E-19 3.1E-22 121.0 18.0 254 79-377 1-270 (274) 107 protein:vir:80930 Length: 278 99.6 1.1E-17 6.8E-21 113.6 15.9 260 79-377 1-277 
(278) 108 protein:vir:3613 Length: 272 # 99.6 3.8E-17 2.4E-20 110.6 16.6 253 79-377 1-272 (272) 109 protein:vir:96123 Length: 274 99.6 1.7E-16 1E-19 107.1 17.8 254 79-377 1-270 (274) 110 protein:vir:94494 Length: 274 99.5 7.8E-16 4.8E-19 103.4 18.6 254 79-377 1-270 (274) 111 protein:vir:97433 Length: 274 99.5 7.8E-16 4.8E-19 103.4 18.6 254 79-377 1-270 (274) 112 protein:vir:105334 Length: 276 99.5 4.1E-16 2.5E-19 105.0 16.7 254 79-377 1-270 (276) 113 protein:vir:94933 Length: 330 99.5 4.7E-16 2.9E-19 104.6 16.9 292 54-377 1-330 (330) 114 protein:vir:96833 Length: 275 99.5 1.4E-15 8.6E-19 102.0 16.3 254 79-377 1-271 (275) 115 protein:vir:96262 Length: 274 99.4 2E-14 1.2E-17 95.7 17.5 254 79-377 1-270 (274) 116 protein:vir:95898 Length: 274 99.4 2E-14 1.2E-17 95.7 17.5 254 79-377 1-270 (274) 117 protein:vir:1239 Length: 274 # 99.4 3E-14 1.9E-17 94.7 17.0 254 79-377 1-270 (274) 118 protein:vir:79928 Length: 393 99.3 2.8E-12 1.8E-15 83.9 20.0 336 1-377 1-381 (393) 119 protein:vir:99424 Length: 360 99.3 1.3E-12 8.1E-16 85.8 17.8 315 42-377 1-357 (360) 120 protein:vir:95107 Length: 270 99.1 1.4E-11 8.6E-15 80.1 15.3 251 79-377 1-265 (270) 121 protein:vir:97255 Length: 310 99.1 3.8E-11 2.3E-14 77.7 17.7 270 64-376 1-310 (310) 122 protein:vir:93858 Length: 400 99.0 1E-10 6.3E-14 75.4 18.5 343 1-375 1-400 (400) 123 protein:vir:739 Length: 231 # 98.9 7.1E-11 4.4E-14 76.2 13.4 217 113-377 1-231 (231) 124 protein:vir:105822 Length: 273 98.9 1.2E-10 7.6E-14 74.9 14.1 254 85-377 1-273 (273) 125 protein:vir:102605 Length: 273 98.9 1.2E-10 7.6E-14 74.9 14.1 254 85-377 1-273 (273) 126 protein:vir:7990 Length: 273 # 98.9 1.5E-10 9.2E-14 74.5 13.4 254 79-377 1-273 (273) 127 protein:vir:8885 Length: 347 # 98.6 1.8E-09 1.1E-12 68.6 13.0 288 58-377 1-346 (347) 128 protein:vir:80213 Length: 334 98.6 2.1E-09 1.3E-12 68.2 13.2 284 59-377 1-332 (334) 129 protein:vir:8324 Length: 410 # 98.6 6.1E-09 3.8E-12 65.6 15.2 331 1-375 22-410 (410) 130 protein:vir:3364 Length: 347 # 98.6 5.8E-09 
3.6E-12 65.7 15.0 290 58-377 1-345 (347) 131 protein:vir:1541 Length: 347 # 98.6 1.5E-08 9.2E-12 63.5 17.2 291 58-377 1-345 (347) 132 protein:vir:94576 Length: 347 98.6 5.9E-09 3.7E-12 65.7 14.9 286 58-377 1-347 (347) 133 protein:vir:6324 Length: 335 # 98.6 4.2E-09 2.6E-12 66.5 13.7 287 64-377 1-328 (335) 134 protein:vir:78935 Length: 335 98.6 5E-09 3.1E-12 66.1 13.8 281 64-377 1-328 (335) 135 protein:vir:10450 Length: 344 98.6 7E-09 4.3E-12 65.3 14.4 292 58-377 1-344 (344) 136 protein:vir:2201 Length: 345 # 98.6 4.7E-09 2.9E-12 66.3 13.3 288 58-377 1-345 (345) 137 protein:vir:78739 Length: 332 98.5 4.2E-09 2.6E-12 66.5 12.4 282 61-375 1-332 (332) 138 protein:vir:103323 Length: 364 98.5 4.5E-08 2.8E-11 60.9 17.0 288 67-377 1-339 (364) 139 protein:vir:94711 Length: 347 98.3 1.9E-08 1.2E-11 62.9 11.6 283 60-377 1-346 (347) 140 protein:vir:94622 Length: 341 98.3 1.3E-08 8.4E-12 63.7 10.6 277 72-377 1-339 (341) 141 protein:vir:100057 Length: 375 98.3 9.7E-08 6E-11 59.0 13.7 294 67-377 1-370 (375) 142 protein:vir:3136 Length: 322 # 98.3 2.8E-08 1.7E-11 62.0 10.7 274 78-377 1-318 (322) 143 protein:vir:80180 Length: 381 98.2 2E-07 1.3E-10 57.3 14.0 286 57-377 1-310 (381) 144 protein:vir:95318 Length: 328 98.1 1.9E-07 1.2E-10 57.4 12.5 242 57-352 1-328 (328) 145 protein:vir:99675 Length: 324 98.1 2.6E-07 1.6E-10 56.7 13.0 245 112-377 1-296 (324) 146 protein:vir:105645 Length: 400 98.1 4E-07 2.5E-10 55.6 13.3 280 67-377 1-333 (400) 147 protein:vir:103285 Length: 296 98.0 7.7E-06 4.8E-09 48.6 18.7 271 67-375 1-296 (296) 148 protein:vir:106647 Length: 303 97.9 3.5E-07 2.2E-10 55.9 11.2 264 67-377 1-296 (303) 149 protein:vir:9927 Length: 295 # 97.9 1.3E-06 7.9E-10 52.9 13.0 253 79-377 1-288 (295) 150 protein:vir:7019 Length: 401 # 97.8 5.5E-07 3.4E-10 54.9 10.6 280 67-377 1-333 (401) 151 protein:vir:97031 Length: 402 97.8 1.1E-06 6.6E-10 53.3 11.6 280 67-377 1-333 (402) 152 protein:vir:107687 Length: 319 97.8 1.9E-05 1.2E-08 46.5 18.9 290 62-377 1-318 (319) 153 
protein:vir:9875 Length: 296 # 97.8 2.9E-06 1.8E-09 50.9 13.6 264 62-377 1-295 (296) 154 protein:vir:108211 Length: 318 97.7 1.6E-06 9.8E-10 52.4 11.7 283 67-377 1-317 (318) 155 protein:vir:80068 Length: 301 97.7 2.4E-05 1.5E-08 45.9 20.9 276 81-377 1-300 (301) 156 protein:vir:98525 Length: 331 97.7 1.7E-06 1E-09 52.2 11.3 246 60-352 1-331 (331) 157 protein:vir:107826 Length: 331 97.7 1.7E-06 1E-09 52.2 11.3 246 60-352 1-331 (331) 158 protein:vir:107388 Length: 331 97.7 1.7E-06 1E-09 52.2 11.3 246 60-352 1-331 (331) 159 protein:vir:103759 Length: 330 97.6 4E-06 2.5E-09 50.2 12.2 241 60-352 1-330 (330) 160 protein:vir:5974 Length: 324 # 97.5 4E-05 2.5E-08 44.7 16.1 262 79-377 1-290 (324) 161 protein:vir:1663 Length: 393 # 97.2 2.8E-05 1.7E-08 45.6 12.6 340 1-375 1-393 (393) 162 protein:vir:93966 Length: 400 97.1 5.5E-05 3.4E-08 44.0 12.8 342 1-375 1-400 (400) 163 protein:vir:7324 Length: 335 # 96.9 4.7E-05 2.9E-08 44.3 10.9 244 57-323 1-335 (335) 164 protein:vir:102944 Length: 330 96.7 0.00039 2.4E-07 39.3 15.3 269 79-377 1-296 (330) 165 protein:vir:79642 Length: 329 96.7 0.00042 2.6E-07 39.1 20.2 299 42-377 1-329 (329) 166 protein:vir:861 Length: 318 # 96.6 6.7E-05 4.2E-08 43.5 10.2 299 38-375 1-318 (318) 167 protein:vir:99075 Length: 392 96.5 0.00051 3.2E-07 38.6 14.2 256 85-377 1-290 (392) 168 protein:vir:102655 Length: 322 96.4 0.00051 3.2E-07 38.6 13.8 277 73-377 1-321 (322) 169 protein:vir:8843 Length: 317 # 96.3 0.00024 1.5E-07 40.4 11.6 287 76-377 1-316 (317) 170 protein:vir:1583 Length: 351 # 96.0 0.0011 6.9E-07 36.8 14.6 260 79-377 1-294 (351) 171 protein:vir:104342 Length: 314 95.2 0.0025 1.6E-06 34.8 19.7 286 63-375 1-314 (314) 172 protein:vir:348 Length: 321 # 94.9 0.0016 1E-06 35.9 10.8 288 60-377 1-321 (321) 173 protein:vir:270 Length: 341 # 94.8 0.0033 2.1E-06 34.2 14.8 296 63-377 1-332 (341) 174 protein:vir:5255 Length: 304 # 93.5 0.0074 4.6E-06 32.3 16.0 274 84-374 1-304 (304) 175 protein:vir:94870 Length: 318 91.5 0.016 9.7E-06 30.5 11.6 301 
22-375 1-318 (318) 176 protein:vir:108303 Length: 418 90.8 0.019 1.2E-05 30.0 16.0 254 82-377 1-286 (418) 177 protein:vir:100603 Length: 529 89.8 0.024 1.5E-05 29.4 14.4 346 1-377 1-516 (529) 178 protein:vir:103463 Length: 521 88.6 0.031 1.9E-05 28.8 14.6 344 1-377 1-499 (521) 179 protein:vir:1781 Length: 221 # 84.1 0.063 3.9E-05 27.2 9.2 179 160-377 1-202 (221) 180 protein:vir:98856 Length: 343 82.5 0.076 4.7E-05 26.7 17.0 295 67-377 1-340 (343) 181 protein:vir:105522 Length: 423 77.1 0.13 7.9E-05 25.5 12.7 251 79-377 1-290 (423) 182 protein:vir:106286 Length: 534 71.8 0.19 0.00012 24.5 15.9 349 3-377 1-534 (534) 183 protein:vir:174 Length: 423 # 71.1 0.2 0.00012 24.4 14.0 259 79-377 1-307 (423) 184 protein:vir:1153 Length: 338 # 70.3 0.21 0.00013 24.3 16.8 295 67-373 1-338 (338) 185 protein:vir:79548 Length: 652 68.1 0.24 0.00015 24.0 19.3 347 1-374 222-652 (652) 186 protein:vir:78558 Length: 336 67.4 0.25 0.00016 23.9 14.4 301 43-377 1-336 (336) 187 protein:vir:78777 Length: 358 67.3 0.25 0.00016 23.8 16.2 302 63-377 1-346 (358) 188 protein:vir:3525 Length: 423 # 66.3 0.27 0.00017 23.7 14.0 257 79-377 1-307 (423) 189 protein:vir:105374 Length: 423 65.7 0.28 0.00017 23.6 14.1 259 79-377 1-307 (423) 190 protein:vir:101039 Length: 529 64.9 0.29 0.00018 23.5 14.5 347 1-377 1-529 (529) 191 protein:vir:98566 Length: 355 57.1 0.44 0.00027 22.5 17.5 302 67-377 1-348 (355) 192 protein:vir:98143 Length: 524 56.5 0.46 0.00028 22.5 13.9 348 1-377 1-524 (524) 193 protein:vir:79157 Length: 339 55.6 0.48 0.0003 22.3 17.6 297 67-377 1-338 (339) 194 protein:vir:101811 Length: 529 54.4 0.51 0.00031 22.2 14.8 340 1-377 1-529 (529) 195 protein:vir:106734 Length: 336 53.2 0.54 0.00033 22.1 13.7 300 43-377 1-336 (336) 196 protein:vir:100331 Length: 342 52.2 0.56 0.00035 22.0 16.0 290 67-367 1-342 (342) 197 protein:vir:7214 Length: 521 # 52.1 0.56 0.00035 22.0 15.8 346 1-377 1-499 (521) 198 protein:vir:1829 Length: 355 # 51.7 0.57 0.00036 21.9 17.1 302 67-377 1-354 (355) 199 
protein:vir:6061 Length: 357 # 50.8 0.6 0.00037 21.8 15.4 304 67-377 1-350 (357) 200 protein:vir:94070 Length: 339 48.6 0.66 0.00041 21.6 14.3 312 1-377 1-338 (339) 201 protein:vir:3643 Length: 336 # 47.8 0.69 0.00043 21.5 17.2 308 43-377 1-336 (336) 202 protein:vir:80835 Length: 464 44.7 0.8 0.0005 21.1 9.8 294 27-377 1-336 (464) 203 protein:vir:107947 Length: 519 41.6 0.92 0.00057 20.8 16.1 348 3-377 1-497 (519) 204 protein:vir:2016 Length: 357 # 41.4 0.93 0.00058 20.8 15.9 304 67-377 1-348 (357) 205 protein:vir:5694 Length: 357 # 41.1 0.94 0.00058 20.7 15.2 304 67-377 1-348 (357) 206 protein:vir:80986 Length: 528 40.6 0.96 0.0006 20.7 13.7 346 1-377 1-528 (528) 207 protein:vir:104011 Length: 337 40.5 0.97 0.0006 20.7 18.6 295 67-377 1-337 (337) 208 protein:vir:78186 Length: 337 38.0 1.1 0.00067 20.4 17.5 295 67-377 1-337 (337) 209 protein:vir:101557 Length: 336 36.1 1.2 0.00074 20.2 17.7 310 43-377 1-336 (336) 210 protein:vir:79171 Length: 337 34.7 1.3 0.00079 20.0 18.5 295 67-377 1-337 (337) 211 protein:vir:94800 Length: 319 24.3 2.2 0.0014 18.7 17.1 275 50-377 1-294 (319) 212 protein:vir:97331 Length: 319 24.3 2.2 0.0014 18.7 17.1 275 50-377 1-294 (319) 213 protein:vir:6901 Length: 522 # 23.4 2.3 0.0014 18.6 14.1 345 1-377 1-509 (522) No 1 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=4.2e-92 Score=521.53 Aligned_cols=377 Identities=97% Similarity=1.345 Sum_probs=358.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHh Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~ 80 (377) |+|+++++++.+++++++.+++++....+++.+.+++....+.+++..+.+.+.++.+......+.++++|+++|++++. 
T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGATSEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 99999999999999999999998888778888888888888888888888889999999998999999999999999999 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeE Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~ 160 (377) .+++++||++||+++.++|++.+++.||||++|++.+++|+.++|+.++.+.+.|++|.++..++++++|++++|.+|++ T Consensus 81 ~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl 160 (377) T protein:vir:98 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) T ss_pred ccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcceEEEEecCCcceeEeecccccCcccCccceeEeecceeE Confidence 99999999999999999999999999999999999999999999999999999999988888777899999999999999 Q ss_pred EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhh Q lcl|Aclame:pro 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) Q Consensus 161 ~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) +++++||++||+||.+|+++||+++++++|+++++.+|++|+|++||+||++.++.............+++.+.+.+..+ T Consensus 161 ~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) T protein:vir:98 161 TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) T ss_pred EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhh Confidence 99999999999999999999999999999999999999999999999999999888887777777777778888888889 Q ss_pred 
hccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceE Q lcl|Aclame:pro 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~i 320 (377) ..+.+..+..+..++|+..+....+++.+.+|+++|+|||++++.++|.++..+++|+|+++||+|++++++++||++++ T Consensus 241 ~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i 320 (377) T protein:vir:98 241 SDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) T ss_pred hhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 321 i~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +||||++|.+++|++++|++|+|.+|.+|+++||+++|+||+|++++||++|+|++| T Consensus 321 ~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.2e-91 Score=519.04 Aligned_cols=377 Identities=100% Similarity=1.381 Sum_probs=358.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHh Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) Q Consensus 1 
m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~ 80 (377) |+|++|++++..++++++.+++++...++++.+++++....+++++..+.+.++++.+..+...+.++++|+++|++++. T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 99999999999999999999999888888888888888888888888888888988888888899999999999999999 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeE Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~ 160 (377) .+++++||++||+++.++|++.+++.||||++|++.|+++..++|+.++.+.+.|++|.++.+++++++|++++|.+|++ T Consensus 81 ~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl 160 (377) T protein:vir:96 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) T ss_pred cCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceeEeecccccccccCccceeEeeeeeeE Confidence 99999999999999999999999999999999999999999999999999999999988888777889999999999999 Q ss_pred EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhh Q lcl|Aclame:pro 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) Q Consensus 161 ~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) +++++||++||+||.+|+++||+++|+++|+++++.+|++|+|++||+||++.++.......+.....+++.+....+.+ T Consensus 161 ~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (377) T protein:vir:96 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) T ss_pred 
EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeecccccccc Confidence 99999999999999999999999999999999999999999999999999999988888888888888888888888888 Q ss_pred hccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceE Q lcl|Aclame:pro 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~i 320 (377) +..++..+++.+..+++..+.++.+++....++.+|+|||.|++.+++.+.+++++|+|+++||+|++++++++||++++ T Consensus 241 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~p~~v~~s~~~p~~~i 320 (377) T protein:vir:96 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) T ss_pred ccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccCCCCCceeccCCCceEEecCCCCcccE Confidence 99999999999999999998888888889999999999999999999989999999999999999999999999999999 Q ss_pred EEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 321 i~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|||||+|.+++|++++|++|+|.+|.+|+++||+++|+||+|++++||++|+++.| T Consensus 321 ~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.7e-84 Score=479.79 Aligned_cols=368 Identities=40% Similarity=0.655 Sum_probs=318.4 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHh Q lcl|Aclame:pro 1 
MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~ 80 (377) |+||+++ +..+.+.++.+.+++....+++.+.++.....+.++.....+.++++.+....+.+.++.+|+++|+++.. T Consensus 1 m~~kl~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~ 78 (381) T protein:vir:10 1 MTINLSE--TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINK 78 (381) T ss_pred CchhHHH--HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhh Confidence 9998753 34444555556666555555666666666777777777777888888888888899999999999987664 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeE Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~ 160 (377) +++++|||+||+++.++|++.+++.||||++|++++++++.++|+.++.+.+.|+++.++.+++++|+|+++++.+||+ T Consensus 79 -~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl 157 (381) T protein:vir:10 79 -SVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) T ss_pred -cCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCcceEEEeecCCcceEEeecccccccccCccceeEeecceeE Confidence 5667899999999999999999999999999999999999999999999999999888888778889999999999999 Q ss_pred EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhh Q lcl|Aclame:pro 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) Q Consensus 161 ~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) +++++||++||+||.+|+++||+++|+++|+++++.+|++|||++||+||++.+........ 
....+..+...+ T Consensus 158 ~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~------g~~~~~~~~~~~ 231 (381) T protein:vir:10 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTD------GAYPEKEEQGTL 231 (381) T ss_pred EeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCccccccc------cccccccccccc Confidence 99999999999999999999999999999999999999999999999999986543322111 111222334456 Q ss_pred hccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceE Q lcl|Aclame:pro 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~i 320 (377) +..+....+..+..++..+.+....+...+.++.+|+|||.|++.+++..+.++++|+|++.+|+|++|+++++||+++| T Consensus 232 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp~g~~vv~~~~~p~~~i 311 (381) T protein:vir:10 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) T ss_pred cccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCCCCceeEEcCCCCcCcE Confidence 67777777888888888888888888888899999999999999999888888999999999999999999999999999 Q ss_pred EEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 321 i~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|||||+|.+++|++++|++|+|.+|.+|+++||+++|+||+|++++||++|+++.- T Consensus 312 ~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred EEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeec Confidence 999999999999999999999999999999999999999999999999999998855 No 4 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: 
genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=7e-83 Score=470.99 Aligned_cols=368 Identities=39% Similarity=0.647 Sum_probs=317.8 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHh Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~ 80 (377) |+|++++ +..+++.++.++++.....+.+.+...+..+.+.++...+.+.++++.+......+.++.+|+++|+++.. T Consensus 1 m~ik~~~--~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~ 78 (381) T protein:vir:95 1 MTINLSE--TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINK 78 (381) T ss_pred CchhhHH--HHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhc Confidence 8887654 44445556666666655556666667777777777777777888888888888889999999999987654 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeE Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~ 160 (377) +++++|||+||++++++|++.+++.||||++|++.+++++.++|+.++.+.+.|++|.++.+++++++|++++|.+|++ T Consensus 79 -~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl 157 (381) T protein:vir:95 79 -NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) T ss_pred -ccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeE Confidence 6677899999999999999999999999999999999999999999999999999988888777889999999999999 Q ss_pred EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhh Q lcl|Aclame:pro 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) Q Consensus 161 
~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) +++++||++||+||.+|+++||+++|+++|+++++.+|++|+|++||+||++.+........ ....+..+.+.+ T Consensus 158 ~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~------g~~~~~~~~~t~ 231 (381) T protein:vir:95 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTE------GAYPEKEEQGTL 231 (381) T ss_pred EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCccccccc------cccccccccccc Confidence 99999999999999999999999999999999999999999999999999987654322111 112223344556 Q ss_pred hccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceE Q lcl|Aclame:pro 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~i 320 (377) +..++...++.+..+...+.....++...+.++.+|+|||.|++.+++....++++|+|++.+|+|++|+++++||++++ T Consensus 232 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~~g~~vv~s~~~p~~~i 311 (381) T protein:vir:95 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) T ss_pred ccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCCCCceEEecCCCCcCcE Confidence 66777777888888888888877777778889999999999999988887788899999999999999999999999999 Q ss_pred EEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeec--C Q lcl|Aclame:pro 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAG--G 377 (377) Q Consensus 321 i~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a--~ 377 (377) +|||||+|.+++|++++|++|+|.+|.+|+++||+++|+||+|++++||++|+++. 
+ T Consensus 312 ifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:95 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred EEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999977776 3 No 5 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=7e-83 Score=470.99 Aligned_cols=368 Identities=39% Similarity=0.647 Sum_probs=317.8 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHh Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~ 80 (377) |+|++++ +..+++.++.++++.....+.+.+...+..+.+.++...+.+.++++.+......+.++.+|+++|+++.. 
T Consensus 1 m~ik~~~--~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~ 78 (381) T protein:vir:10 1 MTINLSE--TFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINK 78 (381) T ss_pred CchhhHH--HHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhc Confidence 8887654 44445556666666655556666667777777777777777888888888888889999999999987654 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeE Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~ 160 (377) +++++|||+||++++++|++.+++.||||++|++.+++++.++|+.++.+.+.|++|.++.+++++++|++++|.+|++ T Consensus 79 -~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl 157 (381) T protein:vir:10 79 -NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) T ss_pred -ccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeE Confidence 6677899999999999999999999999999999999999999999999999999988888777889999999999999 Q ss_pred EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhh Q lcl|Aclame:pro 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) Q Consensus 161 ~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) +++++||++||+||.+|+++||+++|+++|+++++.+|++|+|++||+||++.+........ 
....+..+.+.+ T Consensus 158 ~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~------g~~~~~~~~~t~ 231 (381) T protein:vir:10 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTE------GAYPEKEEQGTL 231 (381) T ss_pred EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCccccccc------cccccccccccc Confidence 99999999999999999999999999999999999999999999999999987654322111 112223344556 Q ss_pred hccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceE Q lcl|Aclame:pro 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~i 320 (377) +..++...++.+..+...+.....++...+.++.+|+|||.|++.+++....++++|+|++.+|+|++|+++++||++++ T Consensus 232 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~~g~~vv~s~~~p~~~i 311 (381) T protein:vir:10 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) T ss_pred ccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCCCCceEEecCCCCcCcE Confidence 66777777888888888888877777778889999999999999988887788899999999999999999999999999 Q ss_pred EEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeec--C Q lcl|Aclame:pro 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAG--G 377 (377) Q Consensus 321 i~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a--~ 377 (377) +|||||+|.+++|++++|++|+|.+|.+|+++||+++|+||+|++++||++|+++. 
+ T Consensus 312 ifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:10 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred EEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999977776 3 No 6 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=4.1e-82 Score=466.79 Aligned_cols=370 Identities=43% Similarity=0.694 Sum_probs=312.0 Q ss_pred CCccHHH-HHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccHHHHHHH Q lcl|Aclame:pro 1 MAINLKE-LPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAK----NEEEMERMFDLRDKNRELTAEEIKFF 75 (377) Q Consensus 1 m~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~lt~~e~~~~ 75 (377) |+|++++ +++..++++++.+.++.+..++++.+.+.+..+.+.+++..+ .+.+.+.......+.+.++.+|++++ T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~ 80 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQEIQNKAYVEMVDAMAADIMEQAKKEARQEADAYISASRTDKNITNEEIKFF 80 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHH Confidence 9999854 456677788888877777667777777776666655544333 33344444455666778999999999 Q ss_pred HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEee Q lcl|Aclame:pro 76 NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDF 155 (377) Q Consensus 76 ~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l 155 (377) +.+. 
.+++++|||+||++++++|++.+++.||||++|++.|++|+.++|+.++.+.+.|+++.++.+++++++|++++| T Consensus 81 ~~~~-~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l 159 (383) T protein:vir:78 81 NDIN-KEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLRTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEES 159 (383) T ss_pred HHHh-ccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCceEEEEEcCCcceEEeecccccccccCcceeeEee Confidence 7665 467789999999999999999999999999999999999999999999999999999888887778999999999 Q ss_pred cceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhh Q lcl|Aclame:pro 156 SQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKE 235 (377) Q Consensus 156 ~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~ 235 (377) .+|+++++++||+|||+||.+|+++||+++++++|+++++.+|++|+|++||+||++.+....... .....+.. T Consensus 160 ~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~------~~~~~~~~ 233 (383) T protein:vir:78 160 IQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVV------DGVYAEKA 233 (383) T ss_pred cceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCccccc------cccccccc Confidence 999999999999999999999999999999999999999999999999999999998654332211 11112223 Q ss_pred hhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCC Q lcl|Aclame:pro 236 AIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAV 315 (377) Q Consensus 236 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~ 315 (377) +.+.++..+...+...+..+.+......++......++..|+|||.+++.+++.++.++++|+|+++||+|++++++++| T Consensus 234 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~ 313 (383) T protein:vir:78 234 ATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFV 313 (383) T ss_pred ccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCceeeecCCCceEEecCCC Confidence 
33445556666667777766666666677777778889999999999999999988899999999999999999999999 Q ss_pred CcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 316 ETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 316 ~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |+++++||||++|.+++|++++|++|+|.+|.+|+++||+++|+||+|++++||++|+++-- T Consensus 314 p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 314 PEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred CcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999887744 No 7 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=9.6e-79 Score=448.31 Aligned_cols=366 Identities=41% Similarity=0.692 Sum_probs=299.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHH-HHhccccccccHHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEE----MERM-FDLRDKNRELTAEEIKFF 75 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~lt~~e~~~~ 75 (377) |+++.++++++.+.++++.+.+++....+++.+++.+..+.++.++......+ .... 
.......+.++.+|++++ T Consensus 4 ~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~ 83 (395) T protein:vir:95 4 MKQNNVKLKNYHEHKKQFANLVQNGASDEEQSKAFGAMFDALSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEERKFF 83 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHHHHH Confidence 44556777777777777777777777777777777776666554443333322 2222 222335567999999988 Q ss_pred HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEee Q lcl|Aclame:pro 76 NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDF 155 (377) Q Consensus 76 ~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l 155 (377) +.+. .+++++||++||++++++|++.+++.+|||++|+++|++++.++|+.++.+.+.|++++++..++++++|++|+| T Consensus 84 ~~~~-~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l 162 (395) T protein:vir:95 84 NDIN-YDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENF 162 (395) T ss_pred HHHh-hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccCccccccceeeee Confidence 7655 467778999999999999999999999999999999999999999999999999998888887788999999999 Q ss_pred cceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC--cceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 156 SQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL--QPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 156 ~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~--~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) .+|+++++++||+|||+|+.+|+++||+++|+++|++++|++|++|+|++ ||.||++.+...+.......... 
T Consensus 163 ~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~----- 237 (395) T protein:vir:95 163 TQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSG----- 237 (395) T ss_pred ceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccc----- Confidence 99999999999999999999999999999999999999999999999986 69999987665443332222111 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecC Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESL 313 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~ 313 (377) .++..+....+..+..+...+....+.......++..|+|||++++++.+.+.+++.+|+|+++||+|+||++++ T Consensus 238 -----~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~~~G~~~~~lg~g~~v~~~~ 312 (395) T protein:vir:95 238 -----TLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLTANGGFVTVLPYNVTIITSE 312 (395) T ss_pred -----hhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceeccCCCcceeccCCcceEEEcC Confidence 111222223334444444555555555566677889999999999999888888889999999999999999999 Q ss_pred CCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 314 AVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 314 ~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +||+++++|||||+|++++|++++|++++|.+|.+|+++||+++|+||+|++++||++|+|+.. 
T Consensus 313 ~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 313 FVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred CCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 9999999999999999999999999999999999999999999999999999999999999965 No 8 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=8.4e-72 Score=410.25 Aligned_cols=360 Identities=31% Similarity=0.462 Sum_probs=283.9 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhccccccccHHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEME-----RMFDLRDKNRELTAEEIKFF 75 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~lt~~e~~~~ 75 (377) |+...+.+.+..+..+++.+++++....+++.+.++.....++.+...+.+.+.+ .........+.++.++|+++ T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~ 80 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYY 80 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHH Confidence 6555555566666677777777766666666666666555554443333222211 12223345567899999999 Q ss_pred HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeecccccccccccccceeEe Q lcl|Aclame:pro 76 NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQD 154 (377) Q Consensus 76 ~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~ 154 (377) +.+++.+++++||++||++++++|++.+++.++|+++|+++|+++ ...+|+.++.+.+.|++|+++.++.++++|++++ T Consensus 81 ~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~ 160 (390) T protein:vir:40 81 
NEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQ 160 (390) T ss_pred HHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeE Confidence 999999999999999999999999999999999999999999975 4679999999999999988888777889999999 Q ss_pred ecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhh Q lcl|Aclame:pro 155 FSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDK 234 (377) Q Consensus 155 l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~ 234 (377) |.+|+++++++||+|||+||.+++++||+++|++++++++|++|++|+|+++|.||++.....+........... T Consensus 161 l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~----- 235 (390) T protein:vir:40 161 TGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATP----- 235 (390) T ss_pred eeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccc----- Confidence 999999999999999999999999999999999999999999999999999999999877655444333322221 Q ss_pred hhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhc-ccccccCCCCcccc-ccCCCceEEec Q lcl|Aclame:pro 235 EAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE-AKFTSRNQFGEYVT-VLPHGITILES 312 (377) Q Consensus 235 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~-~~~~~~~~~G~~~~-~l~~~~~v~~s 312 (377) ++..+...+... +...+ ........++.+|+|||++++..+ .....++.+|.|+. 
.+++|+||+++ T Consensus 236 -----~t~~~~~~~~~~---l~~~~----~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~~~g~pvv~~ 303 (390) T protein:vir:40 236 -----LTDLTPATLATK---VMLPL----TDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGILPVPLEIVQS 303 (390) T ss_pred -----cchhhHHHHHHH---HHHHh----hcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccccccCCCceeEEEc Confidence 122222222222 21111 111233456789999999987544 33345778888874 45689999999 Q ss_pred CCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 313 LAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 313 ~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++||+++++||||++|++++|++++|++++|.+|.+|++.||+++|+||++++++||++|++++= T Consensus 304 ~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 304 VAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred CCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeecc Confidence 99999999999999999999999999999999999999999999999999999999999999887 No 9 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1e-63 Score=365.97 Aligned_cols=351 Identities=19% Similarity=0.226 Sum_probs=240.9 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHH---------- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPE-------------EQEKLFEAAFTTMGDEILAKNEEEMERM---------- 57 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 57 (377) |....+.+++..+..+++.+++......+ .+.+.+++....++.++.. ...+.+.. 
T Consensus 19 l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~-~~~~l~~~~~~~~~~~~~ 97 (425) T protein:vir:95 19 LDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQ-LEDELEQINSKQPSNQSR 97 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhhhccchhhh Confidence 43333322222222222222211110000 0011111111111111000 00000000 Q ss_pred -------------------HHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|Aclame:pro 58 -------------------FDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) Q Consensus 58 -------------------~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~ 118 (377) ............+.+.+.+...+.+++++||++||+++.+.|++.+++.++|+++|+++|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~ 177 (425) T protein:vir:95 98 QKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRV 177 (425) T ss_pred hhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeec Confidence 0000011112233344445555667778899999999999999999999999999999999 Q ss_pred CCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcce Q lcl|Aclame:pro 119 SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAI 198 (377) Q Consensus 119 ~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~ 198 (377) +|+.++|+..+.+.+.|++|+++.++...++|++|++.+++++++++||+|||+||.+++++||+++|++++++++|.+| T Consensus 178 ~g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~i 257 (425) T protein:vir:95 178 KGTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAI 257 (425) T ss_pred CceeEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999998888765555899999999999999999999999999999999999999999999999999 Q ss_pred 
eeccCCC--cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 199 VKGNGLL--QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 199 l~G~G~~--~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) |+|+|++ +|.||++.++....... .+...++ ..+..+.... .......++.+| T Consensus 258 l~G~G~~~~~p~Gil~~~~~~~~~~~-~~~~~~~-------------------~~~~~~~~~~-----~~~~~~~~~~~~ 312 (425) T protein:vir:95 258 VKGTGAANKQPLGIIPSLPPENQVTV-EADNNLL-------------------KNLVKQIGLI-----DTGDDSVGEIVA 312 (425) T ss_pred hccCCCCccccceeeccccccccccc-ccccchH-------------------HHHHHHHHhh-----hhhccccCceEE Confidence 9999964 89999987554332111 1111111 1111111100 011223467889 Q ss_pred Eeccchhhhhcccc-cccCCCCccccc-------cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhc Q lcl|Aclame:pro 277 LLNPEDRWTLEAKF-TSRNQFGEYVTV-------LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAME 348 (377) Q Consensus 277 ~~n~~~~~~~~~~~-~~~~~~G~~~~~-------l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~ 348 (377) +||+.+++..+..+ ..++.+|.|+.. 
..+|.||+++++||+++++||||++|++++|++++|.+|+|.+|.+ T Consensus 313 v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~~f~~ 392 (425) T protein:vir:95 313 VMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGEFEQYTLVERENITIDSSTHVKFTE 392 (425) T ss_pred EEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcCCCccEEEEecccEEEEeecceEEEeeccccccc Confidence 99999987644332 345677777632 1258899999999999999999999999999999999999999999 Q ss_pred CcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 349 DLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 349 ~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |+++||+++|+||++++|+||++|++++- T Consensus 393 ~~~~~~~~~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 393 DQTAFRGKGRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred CceEEEEEEeeCcEeecccceEEEEecCc Confidence 99999999999999999999999999984 No 10 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2.9e-63 Score=363.47 Aligned_cols=369 Identities=16% Similarity=0.188 Sum_probs=242.2 Q ss_pred CCccHHH-------------HHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHH-------HH---HHHHH----- Q lcl|Aclame:pro 1 MAINLKE-------------LPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDE-------IL---AKNEE----- 52 (377) Q Consensus 1 m~~~~~~-------------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~---~~~~~----- 52 (377) +..++++ +++..+.+.++.+++.... ++-+.++....++... .. ..... 
T Consensus 36 l~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~---~ei~~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 112 (466) T protein:vir:80 36 LEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLE---GEIKELENELEQLNNKEPKNNSEPAQVSGARTQQFVGG 112 (466) T ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhH Confidence 1111111 1111111111111111000 0000011100000000 00 00000 Q ss_pred -HHHHHHH-----hccccccccHHHHHHHHHH----HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCce Q lcl|Aclame:pro 53 -EMERMFD-----LRDKNRELTAEEIKFFNDI----DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRL 122 (377) Q Consensus 53 -~~~~~~~-----~~~~~~~lt~~e~~~~~~~----~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~ 122 (377) ...+... .......+..+.+.++.+. ....+.++++++||+++++.|++.++++++|+++|++.+++|.. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~~ 192 (466) T protein:vir:80 113 ETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGTA 192 (466) T ss_pred HHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCcee Confidence 0000000 0000111222233333222 22344566789999999999999999999999999999999999 Q ss_pred EEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeecc Q lcl|Aclame:pro 123 KALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGN 202 (377) Q Consensus 123 ~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~ 202 (377) ++|+....+.+.|++|++..+ +++++|++|++.+|+++++++||+|||+||.+++++||+++|+++++++++.+||+|+ T Consensus 193 ~~~~~~~~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~ 271 (466) T protein:vir:80 193 RQNIAGAIPEGVWTEAVANLN-ELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGT 271 (466) T ss_pred EeeeecCCcceeecccccccc-cccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeecc Confidence 999998889999998777664 6789999999999999999999999999999999999999999999999999999999 Q ss_pred 
CCCcceeeeeccccccccccccccccccchhhhh-hhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccc Q lcl|Aclame:pro 203 GLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEA-IADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPE 281 (377) Q Consensus 203 G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 281 (377) |+++|+||++.....+.................. ...+.. ........+..++. .....+....+++++|+||+. T Consensus 272 G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~w~~~~~ 347 (466) T protein:vir:80 272 GTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDP-TGKSAEEFFSELVL---KLSKARANYSNGMKFWAMSSN 347 (466) T ss_pred CCCCcceeeecccccccccccccccccccccchhhhhhhhh-hccchhhHHHHHHH---HHHhhhccccCCceeEEecch Confidence 9999999999876555444433332222211111 111100 11111111111111 112234556778889999999 Q ss_pred hhhhhcccccccCCCCccccc-----cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEE Q lcl|Aclame:pro 282 DRWTLEAKFTSRNQFGEYVTV-----LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTK 356 (377) Q Consensus 282 ~~~~~~~~~~~~~~~G~~~~~-----l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~ 356 (377) ++..+.......+.+|.|+.. 
..+|.||+++++||+++++||||++|++++|++++|.++++.+|.+|+++||++ T Consensus 348 ~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~ 427 (466) T protein:vir:80 348 THAVLMSKAITFNSAGALVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGT 427 (466) T ss_pred hHHHhhcccccccCCccccccCCCcccccccceeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEE Confidence 877665544444566666522 136889999999999999999999999999999999999999999999999999 Q ss_pred EEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 357 NYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 357 ~r~dg~~~~~~af~~l~~~a~ 377 (377) +|+||+|++++|||+|+++.= T Consensus 428 ~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 428 ARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred EEEccEEeccCceEEEEecCC Confidence 999999999999999998876 No 11 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=4e-60 Score=346.21 Aligned_cols=357 Identities=14% Similarity=0.137 Sum_probs=243.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCH-----HHHHHHHHHHHHHHHHHHHHH--HHHHHHHH-HHh----cccccccc Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATP-----EEQEKLFEAAFTTMGDEILAK--NEEEMERM-FDL----RDKNRELT 68 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~----~~~~~~lt 68 (377) |+|+++++++..+++.+..+.+++..++ +.+........+.+..++... ...+.++. ... ........ 
T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKVA 80 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Confidence 9999999998877777766655443211 111111111112222111110 00000000 000 11112233 Q ss_pred HHHHHHHHHH----------------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCc Q lcl|Aclame:pro 69 AEEIKFFNDI----------------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSG 131 (377) Q Consensus 69 ~~e~~~~~~~----------------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~ 131 (377) .++++.|... ...+++++||++||+++.++|++.+++.++|+++|+++|+++ ..++|+..+++ T Consensus 81 ~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 160 (401) T protein:vir:44 81 AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGT 160 (401) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCc Confidence 4455444332 234566789999999999999999999999999999999865 57899999889 Q ss_pred ceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeee Q lcl|Aclame:pro 132 TAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLL 211 (377) Q Consensus 132 ~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil 211 (377) .+.|+.|+++.+....++|+++++.+|+++++++||+|||+||.+++++||.++|+++++++++.+|++|||+++|.||+ T Consensus 161 ~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil 240 (401) T protein:vir:44 161 ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFL 240 (401) T ss_pred cceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceee Confidence 99999887777656668999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
eccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc Q lcl|Aclame:pro 212 KDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT 291 (377) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 291 (377) +.................... .+...... ++.+..++..+ ......+.+|+|||+++..+.. T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~----t~~~~~~~----~d~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~L~~--- 302 (401) T protein:vir:44 241 AYESTEESDKARAFGKLQHIV----SGEATAVT----ADAIIKLIYTL-------RKAHRTGAKFMMNNNSLFAIRL--- 302 (401) T ss_pred ccccccccccccccccccccc----cccccccC----HHHHHHHHHhc-------chhhhcCCEEEEcHHHHHHHHH--- Confidence 876544433221111100000 00000011 11222222111 1123456789999998766542 Q ss_pred ccCCCCccccc---------cCCCceEEecCCCCcc-----eEEEEeccc-EEEEecceeeEEeechhhhhcCcEEEEEE Q lcl|Aclame:pro 292 SRNQFGEYVTV---------LPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTK 356 (377) Q Consensus 292 ~~~~~G~~~~~---------l~~~~~v~~s~~~~~~-----~ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~~~~~~~ 356 (377) .++.+|.|+-. ..+|+||+++++||.. 
.++||||++ |.+.+|.++++.++ .+|.+|+++||++ T Consensus 303 lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~--~~~~~~~v~~~a~ 380 (401) T protein:vir:44 303 LKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD--PYTNKPFVGFYTT 380 (401) T ss_pred hhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeee--ccccCCcEEEEEE Confidence 23445544310 1257788889888742 278999997 88999999988654 4578999999999 Q ss_pred EEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 357 NYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 357 ~r~dg~~~~~~af~~l~~~a~ 377 (377) +|+||++++++||++|+++|- T Consensus 381 ~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 381 KRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEeccEEecccceEEEEeecC Confidence 999999999999999999999 No 12 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=6.1e-59 Score=339.73 Aligned_cols=357 Identities=13% Similarity=0.131 Sum_probs=231.0 Q ss_pred CCccHHH-HHHHHHHHHHHHHHHHhcc--CHHHHHHHHHHHH------------HHHHHHHHHH--HHHHHHHHHH---- Q lcl|Aclame:pro 1 MAINLKE-LPKYREAVAELSAKISAGA--TPEEQEKLFEAAF------------TTMGDEILAK--NEEEMERMFD---- 59 (377) Q Consensus 1 m~~~~~~-l~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~------------~~~~~~~~~~--~~~~~~~~~~---- 59 (377) |+..+.+ +.+..++++++.+++.... -.+++.+.++... +.+..++... ...+...... 
T Consensus 21 ~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~ 100 (425) T protein:vir:10 21 VPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAAQM 100 (425) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 3332211 1122223333333221110 0011111111100 0111111100 0000000000 Q ss_pred -hccccccccHHHHHHHHH---------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEc Q lcl|Aclame:pro 60 -LRDKNRELTAEEIKFFND---------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAE 128 (377) Q Consensus 60 -~~~~~~~lt~~e~~~~~~---------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~ 128 (377) ........+.++++.|.. ....+++++||++||++++++|++.+++.++|+++|+++|++ +..++|+.+ T Consensus 101 ~~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~ 180 (425) T protein:vir:10 101 GANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNM 180 (425) T ss_pred ccccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEc Confidence 011111223344555532 234567788999999999999999999999999999999987 568999999 Q ss_pred CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcce Q lcl|Aclame:pro 129 TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPV 208 (377) Q Consensus 129 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~ 208 (377) +.+.+.|++|.+..++...++|+++++.+++++++++||+|||+||.+++++||.++|++++++++|.+|++|||+++|. 
T Consensus 181 ~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~ 260 (425) T protein:vir:10 181 GGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPN 260 (425) T ss_pred CCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcc Confidence 99999999887776655558999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcc Q lcl|Aclame:pro 209 GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEA 288 (377) Q Consensus 209 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~ 288 (377) ||++.++..+............... .... ... ++.+.+++..+ ...+.++.+|+|||+++..+.. T Consensus 261 Gil~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~----~d~l~~l~~~l-------~~~~~~~a~~vmn~~~~~~L~~ 325 (425) T protein:vir:10 261 GLLTYIAGGANAAKHPFGAIEVVNS-GAAA---DIT----SDGIIDLVYDL-------PSAFTGNARFAMNRNTQRQVRK 325 (425) T ss_pred eeeeccccccccccccccccccccc-cccc---ccc----HHHHHHHHhhh-------hhhhccCCEEEEchHHHHHHHH Confidence 9999876554433322111100000 0000 011 11122222111 1223467789999999766542 Q ss_pred cccccCCCCcccc---------ccCCCceEEecCCCCc-----ceEEEEeccc-EEEEecceeeEEeechhhhhcCcEEE Q lcl|Aclame:pro 289 KFTSRNQFGEYVT---------VLPHGITILESLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLY 353 (377) Q Consensus 289 ~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~~~~ 353 (377) .++.+|.|+- ...+|+||+++++||. ..|+||||++ |.+++|.++++. 
.+.+|.+|++.| T Consensus 326 ---lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~--~d~~~~~~~~~~ 400 (425) T protein:vir:10 326 ---LKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVL--RDPYTAKPYVLF 400 (425) T ss_pred ---hhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEE--ecccccCCcEEE Confidence 2344554431 1225678888998884 3389999998 788999988764 555788999999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |+..|+||++++|+||++|+++|- T Consensus 401 ~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 401 YTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred EEEEEeccEeecccceEEEEeecc Confidence 999999999999999999999999 No 13 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=5.3e-58 Score=334.58 Aligned_cols=356 Identities=13% Similarity=0.116 Sum_probs=230.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhcc----CH-HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH---Hhcc--cccccc Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGA----TP-EEQEKLFEAAFTTMGDEILAK--NEEEMERMF---DLRD--KNRELT 68 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~---~~~~--~~~~lt 68 (377) |+- ++++.+..+++....+.+++.. ++ +++........+.+..++... .....+... .... ...... 
T Consensus 1 l~~-~k~l~~~i~e~~~~~~~~k~~~~~~~~~~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (407) T protein:vir:48 1 MAD-VKDVEQVAQELQRKFDDFKEKNDKRIDAIEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNKVA 79 (407) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Confidence 443 3444444333333333322211 10 000011111111111111000 000000000 0000 011122 Q ss_pred HHHHHHHHH----------------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCc Q lcl|Aclame:pro 69 AEEIKFFND----------------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSG 131 (377) Q Consensus 69 ~~e~~~~~~----------------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~ 131 (377) .+++++|.. ....+++++||++||++++++|++.++++++|+++|+++|+++ ..++|+..+++ T Consensus 80 ~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (407) T protein:vir:48 80 SEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGT 159 (407) T ss_pred hHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCc Confidence 233333322 2234566789999999999999999999999999999999864 68999999999 Q ss_pred ceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeee Q lcl|Aclame:pro 132 TAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLL 211 (377) Q Consensus 132 ~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil 211 (377) .+.|++|.+..++...++|+++++.+++++++++||+|||+||.+++++||.++|++++++++|.+|++|||+++|.||+ T Consensus 160 ~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil 239 (407) T protein:vir:48 160 TSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFL 239 (407) T ss_pred ceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceee Confidence 99999887776655568999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
eccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc Q lcl|Aclame:pro 212 KDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT 291 (377) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 291 (377) +.................... ...... ...+.+++.+..+ ...+.++..|+|||.++..+.. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~d~i~~l~~~l-----------~~~~~~~a~~v~n~~~~~~L~~--- 301 (407) T protein:vir:48 240 AYESTDEDDKTRAFGKLQHIA-SGAASG---VTADAIIKLIYTL-----------RKAHRSGAKFMMNNSSLFAIRL--- 301 (407) T ss_pred ecccccccccccccccccccc-cccccc---cChHHHHHHHHhh-----------chhhhcCCEEEEcHHHHHHHHH--- Confidence 876544333221111110000 000000 1111122222211 1123456789999998765432 Q ss_pred ccCCCCcccc---------ccCCCceEEecCCCCc-----ceEEEEeccc-EEEEecceeeEEeechhhhhcCcEEEEEE Q lcl|Aclame:pro 292 SRNQFGEYVT---------VLPHGITILESLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTK 356 (377) Q Consensus 292 ~~~~~G~~~~---------~l~~~~~v~~s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~~~~~~~ 356 (377) .++.+|.|+- ...+|+||+++++||+ ..++||||++ |.+++|.++++.++ .+|.+|+++||++ T Consensus 302 lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d--~~~~~~~~~~~~~ 379 (407) T protein:vir:48 302 LKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD--PYTNKPFVGFYTT 379 (407) T ss_pred hhccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEee--ccccCCcEEEEEE Confidence 2345555431 1225778899999885 2378999997 88999999988764 4578999999999 Q ss_pred EEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 357 NYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 357 ~r~dg~~~~~~af~~l~~~a~ 377 (377) +|+||++++|+||++|+++|- T Consensus 380 ~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 380 KRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred EEeccEEecccceEEEEeecc Confidence 999999999999999999999 No 14 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: 
genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=6.2e-58 Score=334.22 Aligned_cols=348 Identities=15% Similarity=0.102 Sum_probs=230.1 Q ss_pred CCc-cHHHHHHHHHHHHHHHHHHHh----ccCHHHHHHHHH---HHHHHHHHHHHHHHHH--HHHHHH---Hhcc-c--- Q lcl|Aclame:pro 1 MAI-NLKELPKYREAVAELSAKISA----GATPEEQEKLFE---AAFTTMGDEILAKNEE--EMERMF---DLRD-K--- 63 (377) Q Consensus 1 m~~-~~~~l~~~~~~~~~~~~~~~~----~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~---~~~~-~--- 63 (377) |.. ++++|.++++++.+....+.+ ....+++.+.++ ...+.+.+++...... +..... .... . T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSG 80 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc Confidence 543 345554444443333332222 211122222222 2223333222211110 000000 0000 0 Q ss_pred -cccccHHH-----------HHHHHHH--HhccCCCCCceeccHHHHHHHHHH-HHhhhhhhhhceeEecCC--ceEEEE Q lcl|Aclame:pro 64 -NRELTAEE-----------IKFFNDI--DKNVGGKDKFKLLPEETMVQVFDD-LVAEHPLLKVINFKNTSL--RLKALT 126 (377) Q Consensus 64 -~~~lt~~e-----------~~~~~~~--~~~~~~s~gg~lvP~~~~~~Ii~~-~~~~s~l~~~~~v~~~~~--~~~~p~ 126 (377) ......++ ++.+... ...++.+++|.++|+++.+.+|.. 
+...++++++++++++++ .+.+|+ T Consensus 81 ~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:13 81 AQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTV 160 (392) T ss_pred hhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE Confidence 00011111 1111111 112334455666777777777655 455567888999998753 479999 Q ss_pred EcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCc Q lcl|Aclame:pro 127 AETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ 206 (377) Q Consensus 127 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~ 206 (377) .++.+.+.|++|.++. ++++++|+++++.+++++++++||+|||+||.+++++||.++|++++++++|.+||+|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~-~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~ 239 (392) T protein:vir:13 161 ITGRATAGIVGETAEI-PESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQ 239 (392) T ss_pred EcCCcceeeecccccc-cccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc Confidence 9999999999877666 567899999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh Q lcl|Aclame:pro 207 PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) Q Consensus 207 P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~ 286 (377) |.||++..+.............. ....+.+.... 
+ .....++..|+|||+++..+ T Consensus 240 p~Gil~~~~~~~~~~~~~~~~~~--------------~~d~l~~~~~~----l-------~~~~~~~a~~v~n~~~~~~l 294 (392) T protein:vir:13 240 PRGILTDATGANAAFGEADADSK--------------VSDALIDLFHE----V-------PSAYRKNAKFVVNDLRAAQM 294 (392) T ss_pred ccccccccccccccccccccccc--------------cHHHHHHHHHh----h-------hhhhhcCCEEEEcHHHHHHH Confidence 99999876544333222111110 00111111111 1 11234567899999987765 Q ss_pred cccccccCCCCcccc---------ccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEE Q lcl|Aclame:pro 287 EAKFTSRNQFGEYVT---------VLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKN 357 (377) Q Consensus 287 ~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~ 357 (377) .. .++.+|.|+- ...+|+||++++++|+++|+||||++|.++++++++++++.+.+|.+|+++||++. T Consensus 295 ~~---lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~ 371 (392) T protein:vir:13 295 RK---LKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQ 371 (392) T ss_pred HH---hhccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEE Confidence 42 3456666541 12267889999999999999999999999999999999999999999999999999 Q ss_pred EEcCEEecccceEEEEeecC Q lcl|Aclame:pro 358 YFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 358 r~dg~~~~~~af~~l~~~a~ 377 (377) |+||++++|+||++|+++++ T Consensus 372 r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 372 RADGLLVDARGAKVLTVTPA 391 (392) T ss_pred EeccEEecccceEEEEeecc Confidence 99999999999999999999 No 15 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.8e-57 Score=331.69 Aligned_cols=346 Identities=17% Similarity=0.115 Sum_probs=225.4 Q ss_pred CCc-cHHHHHHHHHHH----HHHHHHHHhccCHHHHHHHHHH---HHHHHHHHHHHHHHHH-----HHHHHHhcc----- Q lcl|Aclame:pro 1 
MAI-NLKELPKYREAV----AELSAKISAGATPEEQEKLFEA---AFTTMGDEILAKNEEE-----MERMFDLRD----- 62 (377) Q Consensus 1 m~~-~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~-----~~~~~~~~~----- 62 (377) |.. +++++.++++++ +.+.++..+....+++.+.++. ..+.+.+++....... ......... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSG 80 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 533 345554444333 3333322222222222222332 2222333332211111 000000000 Q ss_pred ccccccHHHH-----------HHHHHH--HhccCCC-CCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEE Q lcl|Aclame:pro 63 KNRELTAEEI-----------KFFNDI--DKNVGGK-DKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALT 126 (377) Q Consensus 63 ~~~~lt~~e~-----------~~~~~~--~~~~~~s-~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~ 126 (377) .......++. +.+... ...++.+ +|++++|+.+...|++.++..++|+++|+++++++ .+++|+ T Consensus 81 ~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~ 160 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTV 160 (390) T ss_pred chhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE Confidence 0011111111 111111 1122333 44554444444555567778888999999999864 479999 Q ss_pred EcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCc Q lcl|Aclame:pro 127 AETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ 206 (377) Q Consensus 127 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~ 206 (377) .++.+.+.|++|.++.+ +++++|+++++.+++++++++||+|||+||.+++++||+++|+++++.++|.+|++|+| + T Consensus 161 ~~~~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--~ 237 (390) T protein:vir:62 161 
ITGRSSASIVGETAEIP-ESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG--Q 237 (390) T ss_pred EcCCcceeeeccccccc-ccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC--c Confidence 99999999998777664 67899999999999999999999999999999999999999999999999999999987 7 Q ss_pred ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh Q lcl|Aclame:pro 207 PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) Q Consensus 207 P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~ 286 (377) |.||++...........+.... . ....+++....+ ...+..+..|+|||+++..+ T Consensus 238 p~Gi~~~~~~~~~~~~~~~~~~-~-------------~~~~l~~~~~~l-----------~~~~~~~a~~vmn~~~~~~L 292 (390) T protein:vir:62 238 PRGILTDASPATATFLATDTDS-K-------------VSDALIDLFHEV-----------PSAYRANAKYVVNDLRAAQM 292 (390) T ss_pred cccccccccccccceecccccc-c-------------chHHHHHHHHhh-----------hhhhhcCCEEEEchHHHHHH Confidence 9999987654433222211110 0 011111111111 11123467899999987655 Q ss_pred cccccccCCCCccccc---------cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEE Q lcl|Aclame:pro 287 EAKFTSRNQFGEYVTV---------LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKN 357 (377) Q Consensus 287 ~~~~~~~~~~G~~~~~---------l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~ 357 (377) .. .++.+|.|+-. 
..+|.||++++++|++.|+||||++|+++++++++++++.+.+|.+|++.||+++ T Consensus 293 ~~---lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~ 369 (390) T protein:vir:62 293 RK---LKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQ 369 (390) T ss_pred HH---hhccCCCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEE Confidence 32 24556655411 2356788999999999999999999999999999999999999999999999999 Q ss_pred EEcCEEecccceEEEEeecC Q lcl|Aclame:pro 358 YFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 358 r~dg~~~~~~af~~l~~~a~ 377 (377) |+||++++|+||++|+++++ T Consensus 370 r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 370 RADGLLVDARGAKVLTVTPG 389 (390) T ss_pred EeCcEeechhheEEEEeecC Confidence 99999999999999999999 No 16 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=4e-56 Score=324.29 Aligned_cols=369 Identities=16% Similarity=0.112 Sum_probs=222.0 Q ss_pred CCcc-------------HHHHHHHHHH----HHHHHHHHHhc-------cCHHHH----HHH---HHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MAIN-------------LKELPKYREA----VAELSAKISAG-------ATPEEQ----EKL---FEAAFTTMGDEILAK 49 (377) Q Consensus 1 m~~~-------------~~~l~~~~~~----~~~~~~~~~~~-------~~~~~~----~~~---~~~~~~~~~~~~~~~ 49 (377) |+-+ ++++.+...+ .+++.+++... .....+ .+. ..+..+.+..++... 
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3321 1111111111 11111111000 000000 000 000000000000000 Q ss_pred -H------H----------HHHHHHHHhccccc-------c-cc-------HHHHHHHH---------HHHhccCCCCCc Q lcl|Aclame:pro 50 -N------E----------EEMERMFDLRDKNR-------E-LT-------AEEIKFFN---------DIDKNVGGKDKF 88 (377) Q Consensus 50 -~------~----------~~~~~~~~~~~~~~-------~-lt-------~~e~~~~~---------~~~~~~~~s~gg 88 (377) . . .............+ . .. .+.+..+. .....+++++|| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 0 0 00000000000000 0 00 00111111 111235667899 Q ss_pred eeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcC-CcceeeecccccccccccccceeEeecceeEEEeehh Q lcl|Aclame:pro 89 KLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVI 166 (377) Q Consensus 89 ~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~i 166 (377) ++||+++...|++.+++.++|+++++++++++ .++||+.++ .+.+.|++|+++. 
++++++|++|++.+|+++++++| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~-~~s~~~f~~i~~~~~k~a~~~~i 239 (497) T protein:vir:78 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTY-PFSSEEFARVYEQVGKVANALTI 239 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCccc-ccccccceeeEeeeeeeEeecHh Confidence 99999999999999999999999999999875 589999865 4689999877665 56889999999999999999999 Q ss_pred hHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhh----hhhc Q lcl|Aclame:pro 167 PKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIA----DLSD 242 (377) Q Consensus 167 S~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~----~l~~ 242 (377) |+|||+|+. ++++||.++|+++|++++|.+||+|+|+++|.||++..+..+.................... .... T Consensus 240 S~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) T protein:vir:78 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) T ss_pred HHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccch Confidence 999999985 69999999999999999999999999999999999887655444332221111100000000 0000 Q ss_pred cChHHH---------------------------HHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCC Q lcl|Aclame:pro 243 LDPDTA---------------------------VELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ 295 (377) Q Consensus 243 ~~~~~~---------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~ 295 (377) ...... ...+..+...... ...........|+|||.++..+. ..++. 
T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~vmn~~~~~~l~---~lkd~ 392 (497) T protein:vir:78 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD---IQLTLFQTPNAVVMNPRDWELLR---LTKDA 392 (497) T ss_pred hhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhh---hhhhcccCCCeEEEchHHHHHHH---HhhcC Confidence 000000 0000000000000 00011112236999999876653 23566 Q ss_pred CCccccc---------------cCCCceEEecCCCCcceEEEEeccc--EEEEecceeeEEeech--hhhhcCcEEEEEE Q lcl|Aclame:pro 296 FGEYVTV---------------LPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQ--TFAMEDLQLYLTK 356 (377) Q Consensus 296 ~G~~~~~---------------l~~~~~v~~s~~~~~~~ii~gd~s~--y~~~~~~~~~i~~~~~--~~f~~~~~~~~~~ 356 (377) +|.|+.. -.+|+||+++++||+++++||||++ |.+.+|.+++|+++++ ..|.+|+++||+. T Consensus 393 ~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~ 472 (497) T protein:vir:78 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) T ss_pred CCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEE Confidence 6665421 1257899999999999999999997 5678999999999986 4599999999999 Q ss_pred EEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 357 NYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 357 ~r~dg~~~~~~af~~l~~~a~ 377 (377) .|+|+.+.+|+||++|+++++ T Consensus 473 ~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 473 ERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred EeecceeeccccEEEEEecCC Confidence 999999999999999999999 No 17 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=4e-56 Score=324.29 Aligned_cols=369 Identities=16% Similarity=0.112 Sum_probs=222.0 Q ss_pred CCcc-------------HHHHHHHHHH----HHHHHHHHHhc-------cCHHHH----HHH---HHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MAIN-------------LKELPKYREA----VAELSAKISAG-------ATPEEQ----EKL---FEAAFTTMGDEILAK 49 (377) Q Consensus 1 
m~~~-------------~~~l~~~~~~----~~~~~~~~~~~-------~~~~~~----~~~---~~~~~~~~~~~~~~~ 49 (377) |+-+ ++++.+...+ .+++.+++... .....+ .+. ..+..+.+..++... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3321 1111111111 11111111000 000000 000 000000000000000 Q ss_pred -H------H----------HHHHHHHHhccccc-------c-cc-------HHHHHHHH---------HHHhccCCCCCc Q lcl|Aclame:pro 50 -N------E----------EEMERMFDLRDKNR-------E-LT-------AEEIKFFN---------DIDKNVGGKDKF 88 (377) Q Consensus 50 -~------~----------~~~~~~~~~~~~~~-------~-lt-------~~e~~~~~---------~~~~~~~~s~gg 88 (377) . . .............+ . .. .+.+..+. .....+++++|| T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 0 0 00000000000000 0 00 00111111 111235667899 Q ss_pred eeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcC-CcceeeecccccccccccccceeEeecceeEEEeehh Q lcl|Aclame:pro 89 KLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVI 166 (377) Q Consensus 89 ~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~i 166 (377) ++||+++...|++.+++.++|+++++++++++ .++||+.++ .+.+.|++|+++. 
++++++|++|++.+|+++++++| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~-~~s~~~f~~i~~~~~k~a~~~~i 239 (497) T protein:vir:10 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTY-PFSSEEFARVYEQVGKVANALTI 239 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCccc-ccccccceeeEeeeeeeEeecHh Confidence 99999999999999999999999999999875 589999865 4689999877665 56889999999999999999999 Q ss_pred hHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhh----hhhc Q lcl|Aclame:pro 167 PKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIA----DLSD 242 (377) Q Consensus 167 S~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~----~l~~ 242 (377) |+|||+|+. ++++||.++|+++|++++|.+||+|+|+++|.||++..+..+.................... .... T Consensus 240 S~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (497) T protein:vir:10 240 TDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGA 318 (497) T ss_pred HHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccch Confidence 999999985 69999999999999999999999999999999999887655444332221111100000000 0000 Q ss_pred cChHHH---------------------------HHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCC Q lcl|Aclame:pro 243 LDPDTA---------------------------VELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ 295 (377) Q Consensus 243 ~~~~~~---------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~ 295 (377) ...... ...+..+...... ...........|+|||.++..+. ..++. 
T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~vmn~~~~~~l~---~lkd~ 392 (497) T protein:vir:10 319 FVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVD---IQLTLFQTPNAVVMNPRDWELLR---LTKDA 392 (497) T ss_pred hhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhh---hhhhcccCCCeEEEchHHHHHHH---HhhcC Confidence 000000 0000000000000 00011112236999999876653 23566 Q ss_pred CCccccc---------------cCCCceEEecCCCCcceEEEEeccc--EEEEecceeeEEeech--hhhhcCcEEEEEE Q lcl|Aclame:pro 296 FGEYVTV---------------LPHGITILESLAVETGKAIAFVANR--YDAFMATASTIEEYDQ--TFAMEDLQLYLTK 356 (377) Q Consensus 296 ~G~~~~~---------------l~~~~~v~~s~~~~~~~ii~gd~s~--y~~~~~~~~~i~~~~~--~~f~~~~~~~~~~ 356 (377) +|.|+.. -.+|+||+++++||+++++||||++ |.+.+|.+++|+++++ ..|.+|+++||+. T Consensus 393 ~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~ 472 (497) T protein:vir:10 393 NGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAE 472 (497) T ss_pred CCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEE Confidence 6665421 1257899999999999999999997 5678999999999986 4599999999999 Q ss_pred EEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 357 NYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 357 ~r~dg~~~~~~af~~l~~~a~ 377 (377) .|+|+.+.+|+||++|+++++ T Consensus 473 ~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 473 ERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred EeecceeeccccEEEEEecCC Confidence 999999999999999999999 No 18 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=8.2e-56 Score=322.58 Aligned_cols=345 Identities=17% Similarity=0.216 Sum_probs=232.2 Q ss_pred CCccHHHHHHHHHH----HHHHHHHHHhccCHHHHHHHHH---HHHHHHHHHHHHHH------H---------------- Q lcl|Aclame:pro 1 MAINLKELPKYREA----VAELSAKISAGATPEEQEKLFE---AAFTTMGDEILAKN------E---------------- 51 (377) Q Consensus 1 
m~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~------~---------------- 51 (377) |+ +++|.++.++ .+++.++.++....+++...++ ...+.+.+++.... . T Consensus 1 M~--l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 78 (409) T protein:vir:45 1 MK--LHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDP 78 (409) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCC Confidence 55 5555544443 3344443333222222222222 22222222211100 0 Q ss_pred -------HHHHHHHH--hccccccccHHHHHHHHHHHh--ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|Aclame:pro 52 -------EEMERMFD--LRDKNRELTAEEIKFFNDIDK--NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) Q Consensus 52 -------~~~~~~~~--~~~~~~~lt~~e~~~~~~~~~--~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~ 120 (377) .+..+.+. .+.....++.++++.+.+... .+++++||++||+++.++|++.+++.++|+++|+++|+++ T Consensus 79 ~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 158 (409) T protein:vir:45 79 ENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSD 158 (409) T ss_pred CCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCC Confidence 00111111 112234566777777655443 3456689999999999999999999999999999999875 Q ss_pred c--eEEEEEcCC-cceeeecccccccccccccceeEeecceeEE-EeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 121 R--LKALTAETS-GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLT-AFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL 196 (377) Q Consensus 121 ~--~~~p~~~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~-~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~ 196 (377) . +.+|...+. ..+.|++|.+.. ++++++|+++++.++|++ ++++||+|||+||.+++++||.++|+++++++++. 
T Consensus 159 ~~~~~~~~~~~~~~~~~~v~E~~~~-~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~ 237 (409) T protein:vir:45 159 GRTMEWATADGTSEVGVLLGENEEA-GEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEAR 237 (409) T ss_pred CceEEEEeeccCccccccccccccc-cccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH Confidence 4 455655543 456798766554 578899999999999986 67899999999999999999999999999999999 Q ss_pred ceeeccCCC---cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCc Q lcl|Aclame:pro 197 AIVKGNGLL---QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQ 273 (377) Q Consensus 197 a~l~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (377) +||+|+|++ +|.||++.++.......++ ..++ ..+.+.+..+ .. ....... T Consensus 238 a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~--~~~~---------------d~i~~l~~~l----~~-----~~~~~a~ 291 (409) T protein:vir:45 238 YLIQGTGAGTPKQPKGLAASVTGTTQTAAAN--AVKW---------------QEILALKHSI----DP-----AYRRGPK 291 (409) T ss_pred HhhccCCCCCccccceeeecccccccccccc--ccch---------------HHHHHHHHhh----hh-----hhccCCe Confidence 999999976 7999998766433322211 1111 1111111111 00 1111223 Q ss_pred eEEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCc-----ceEEEEecccEEEEecceeeEE Q lcl|Aclame:pro 274 VKLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVET-----GKAIAFVANRYDAFMATASTIE 339 (377) Q Consensus 274 ~~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~-----~~ii~gd~s~y~~~~~~~~~i~ 339 (377) .+|+||+.++..+.. .++.+|.|+. 
...+|+||+++++||+ ..++||||++|++++++++.++ T Consensus 292 ~~~~~n~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~ 368 (409) T protein:vir:45 292 FRLAFNDNTLKLISE---MEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILK 368 (409) T ss_pred EEEEECHHHHHHHHH---hhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEE Confidence 456789888765432 2455555541 1236788999999884 3488999999999999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+.+.+|.+|++.||+..|+||++++|+||++|++++. T Consensus 369 ~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 369 RLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred EeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 99999999999999999999999999999999999777 No 19 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=5.8e-55 Score=317.92 Aligned_cols=344 Identities=12% Similarity=0.066 Sum_probs=221.5 Q ss_pred CCccH--HHHHHHHHHHHHHHH-HHHhccCHHHHHHHHH-------HHHHHHHHHHHHH-----HH-------------- Q lcl|Aclame:pro 1 MAINL--KELPKYREAVAELSA-KISAGATPEEQEKLFE-------AAFTTMGDEILAK-----NE-------------- 51 (377) Q Consensus 1 m~~~~--~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~-----~~-------------- 51 (377) |+++. +++.+..+++....+ +.......+++.+... +....+.+++... .. 
T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~ 80 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhc Confidence 55532 333333333322222 2222111111111111 1111111111000 00 Q ss_pred ---------------HHHHHHHHh--------ccccccccHHHHHHHHHHHh-----------ccCCCCCceeccHHHHH Q lcl|Aclame:pro 52 ---------------EEMERMFDL--------RDKNRELTAEEIKFFNDIDK-----------NVGGKDKFKLLPEETMV 97 (377) Q Consensus 52 ---------------~~~~~~~~~--------~~~~~~lt~~e~~~~~~~~~-----------~~~~s~gg~lvP~~~~~ 97 (377) .+.+..+.. .........+++++|..... ..++++||++||+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~ 160 (434) T protein:vir:62 81 DPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSK 160 (434) T ss_pred chhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHH Confidence 000000000 00001122344544433211 23456799999999999 Q ss_pred HHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeec--ccccccccccccceeEeecceeEEEeehhhHHHHhcCH Q lcl|Aclame:pro 98 QVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGD--IFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP 175 (377) Q Consensus 98 ~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~--e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~ 175 (377) .|++.++++++|+++|+++++++++++|+....+.+.|.. +++...+.++++|++|++.+|+++++++||+|||+||. 
T Consensus 161 ~Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~ 240 (434) T protein:vir:62 161 EIITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTG 240 (434) T ss_pred HHHHhhhhhhhhhhhcceeccCCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcch Confidence 9999999999999999999999999999988777777753 33445567889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcceeeccCCCcc-eeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHH Q lcl|Aclame:pro 176 KWLKQFITEQLKEAIAVALELAIVKGNGLLQP-VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVP 254 (377) Q Consensus 176 ~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 254 (377) +||++||.++|++++++++|.+||+|+|+++| .|+++....... .....+ ++.+.. T Consensus 241 ~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~----~~~~~~-------------------~d~l~~ 297 (434) T protein:vir:62 241 LPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFK----TDEKNL-------------------YDALVK 297 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccccc----ccccch-------------------hhHHHH Confidence 99999999999999999999999999999875 566643221110 000001 111111 Q ss_pred HHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-----------ccCCCceEEecCCCCcce---- Q lcl|Aclame:pro 255 VMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-----------VLPHGITILESLAVETGK---- 319 (377) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-----------~l~~~~~v~~s~~~~~~~---- 319 (377) +...+ ...+..+.+|+|||.++..+.. .++.+|.|+- ...+|+||++++++|.+. 
T Consensus 298 l~~~l-------~~~~~~~a~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~ 367 (434) T protein:vir:62 298 MKNTP-------VKEVRKKARWVLNTAALTKIET---MKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDT 367 (434) T ss_pred HHhhc-------chhhhcCCEEEEcHHHHHHHHH---hhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCc Confidence 21111 1123456789999998765532 2455565541 023577888898887543 Q ss_pred --EEEEecccEEEEecc-eeeEEeechhhhhcCcEEEEEEEEEcCEEec-ccceEEEEee----cC Q lcl|Aclame:pro 320 --AIAFVANRYDAFMAT-ASTIEEYDQTFAMEDLQLYLTKNYFYGKAKD-NHTAALLTLA----GG 377 (377) Q Consensus 320 --ii~gd~s~y~~~~~~-~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~-~~af~~l~~~----a~ 377 (377) |+|||||+|++++|. .++++++.+.+|.+|+++||++.|+|||+++ |++.+++++. +| T Consensus 368 ~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:62 368 PVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTG 433 (434) T ss_pred eEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCC Confidence 789999999888875 6889999999999999999999999999997 8999988655 33 No 20 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2.6e-54 Score=314.34 Aligned_cols=353 Identities=13% Similarity=0.055 Sum_probs=233.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhc------cCHHHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHH----H-hccc--- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAG------ATPEEQE--KLFEAAFTTMGDEILAKNEEE-MERMF----D-LRDK--- 63 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~----~-~~~~--- 63 (377) |++ +++|++...++.+..+.+.+. .++++.. ..++...+.+..++......+ ..... . .... 
T Consensus 1 M~k-l~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~ 79 (428) T protein:vir:10 1 MPQ-IEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAV 79 (428) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhcccc Confidence 877 566655554443333332221 1222211 222333333333332111100 00000 0 0000 Q ss_pred -----ccccc-HHHHHH----------H--------------HHHH-hccCCCCCceeccHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 64 -----NRELT-AEEIKF----------F--------------NDID-KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKV 112 (377) Q Consensus 64 -----~~~lt-~~e~~~----------~--------------~~~~-~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~ 112 (377) ..... ....+. + ...+ ..++++.||++||+++.++|++.+++.++|+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~ 159 (428) T protein:vir:10 80 IVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKL 159 (428) T ss_pred ccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhh Confidence 00000 000000 0 0001 123345789999999999999999999999998 Q ss_pred -ceeEec-CCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 113 -INFKNT-SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAI 190 (377) Q Consensus 113 -~~v~~~-~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~ 190 (377) ++++|+ +|++++|+.++.+.+.|++|+++. 
++++++|++|++.+++++++++||+|||+||.+++++||.++|++++ T Consensus 160 ~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai 238 (428) T protein:vir:10 160 GARSIPLPNGNMSLPRLAGGATASYTGENQDA-KVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAI 238 (428) T ss_pred cceeeecCCcceEEEEEeCCcceeeeccCccc-cccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHH Confidence 777776 478999999999999999876665 56889999999999999999999999999999999999999999999 Q ss_pred HHHhhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 191 AVALELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) Q Consensus 191 a~~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (377) ++++|++|++|+|++ +|.||++.................... ....+.+.+.... ..... T Consensus 239 ~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~--------~~~~~ 299 (428) T protein:vir:10 239 SVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLD-----------TIDTYLDSIILMS--------MDGNS 299 (428) T ss_pred HHHHHHHHhccCCCCccccccccccccccccccccccccccHH-----------HHHHHHHHHHHhh--------hcccc Confidence 999999999999985 899999875543332222211111000 0111111111100 01112 Q ss_pred ccCceEEEeccchhhhhcccccccCCCCccccc-----cCCCceEEecCCCCcc--------eEEEEecccEEEEeccee Q lcl|Aclame:pro 270 IAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV-----LPHGITILESLAVETG--------KAIAFVANRYDAFMATAS 336 (377) Q Consensus 270 ~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~-----l~~~~~v~~s~~~~~~--------~ii~gd~s~y~~~~~~~~ 336 (377) ...+..|+|||.+++.+.. .++.+|.|+.. 
..+|+||+.+++||++ .++|||||+|++++++++ T Consensus 300 ~~~~~~~v~n~~~~~~L~~---lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i 376 (428) T protein:vir:10 300 NMISSGWGMSNRTYMKLFG---LRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNM 376 (428) T ss_pred ccccCEEEEcHHHHHHHHH---hhccCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEEecce Confidence 2345789999999876543 24567776521 1257889999998753 489999999999999999 Q ss_pred eEEeechh-----------hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 337 TIEEYDQT-----------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 337 ~i~~~~~~-----------~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +++++++. .|.+|++.||+.+|+|+++.+|+||++|+--.= T Consensus 377 ~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 377 KVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred EEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 99999874 588999999999999999999999999986666 No 21 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.6e-54 Score=314.38 Aligned_cols=349 Identities=16% Similarity=0.085 Sum_probs=230.7 Q ss_pred ccHHHHHHHHHHHHHHHHHH----Hhc--cCHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------hc-- Q lcl|Aclame:pro 3 INLKELPKYREAVAELSAKI----SAG--ATPEEQ--EKLFEAAFTTMGDEILAKNEEEMERMFD-----------LR-- 61 (377) Q Consensus 3 ~~~~~l~~~~~~~~~~~~~~----~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~-- 61 (377) |++++|++..+++.+..+.+ .+. .+++++ .+.++...+.+..++......+...... .. 
T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhcc Confidence 55566655554444433332 211 122222 2233333333333332211111000000 00 Q ss_pred ---cccccccHHHH-----------------------HHH--------HHHHhccCCCCCceeccHHHHHHHHHHHHhhh Q lcl|Aclame:pro 62 ---DKNRELTAEEI-----------------------KFF--------NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEH 107 (377) Q Consensus 62 ---~~~~~lt~~e~-----------------------~~~--------~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s 107 (377) ...+....+++ ... ....+.+++.+||++||+++.++|++.+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~ 160 (435) T protein:vir:14 81 AAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS 160 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhc Confidence 00000001100 000 01122345567999999999999999999999 Q ss_pred hhhhh-ceeEec-CCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCH--HHHHHHHH Q lcl|Aclame:pro 108 PLLKV-INFKNT-SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFIT 183 (377) Q Consensus 108 ~l~~~-~~v~~~-~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~~~~~l~ 183 (377) +++++ ++++|+ ++++++|+.++.+.+.|++|.+.. ++++++|+++++.+++++++++||+|||+||. +++++||. 
T Consensus 161 ~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~-~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~ 239 (435) T protein:vir:14 161 VVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDI-PTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVV 239 (435) T ss_pred hhhhhcceeeecCCCceEEEEEeCCcceeeeccCccc-cccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHH Confidence 99997 788876 467999999999999999877666 47889999999999999999999999999995 46999999 Q ss_pred HHHHHHHHHHhhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 184 EQLKEAIAVALELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVN 262 (377) Q Consensus 184 ~~la~~~a~~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 262 (377) ++|++++++++|++|++|+|++ +|.||++..........+..... .. ....+..++..... T Consensus 240 ~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~--~~---------------~~~~~~~l~~~~~~- 301 (435) T protein:vir:14 240 GDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTL--QK---------------IETDLGKVILALEN- 301 (435) T ss_pred HHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccch--hh---------------HHHHHHHHHHHhhh- Confidence 9999999999999999999985 79999875433322222111110 00 00111111111100 Q ss_pred hhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-----ccCCCceEEecCCCCcc--------eEEEEecccEE Q lcl|Aclame:pro 263 DKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-----VLPHGITILESLAVETG--------KAIAFVANRYD 329 (377) Q Consensus 263 ~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-----~l~~~~~v~~s~~~~~~--------~ii~gd~s~y~ 329 (377) ......+..|+|||.++..+.. .++.+|.|+. 
...+|+||+.++.||++ .++||||++|+ T Consensus 302 ----~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~ 374 (435) T protein:vir:14 302 ----ADANLTQPGWIMAPRTFRFLEG---LRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVF 374 (435) T ss_pred ----ccccccCCEEEEcHHHHHHHHH---hhccCCceeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEE Confidence 0111235679999999866532 2456666642 12268899999998763 59999999999 Q ss_pred EEecceeeEEeechh-----------hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 330 AFMATASTIEEYDQT-----------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 330 ~~~~~~~~i~~~~~~-----------~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +++|+++++.++++. +|.+|+++||+.+|+|+++++|+||++|+=.++ T Consensus 375 i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:14 375 IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAW 433 (435) T ss_pred EEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCC Confidence 999999999999874 488999999999999999999999999998887 No 22 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=4.9e-54 Score=312.86 Aligned_cols=349 Identities=15% Similarity=0.086 Sum_probs=231.1 Q ss_pred ccHHHHHHHHHHHHHHHHHH----Hhc--cCHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH-----------Hhccc Q lcl|Aclame:pro 3 INLKELPKYREAVAELSAKI----SAG--ATPEEQE--KLFEAAFTTMGDEILAKNEEEMERMF-----------DLRDK 63 (377) Q Consensus 3 ~~~~~l~~~~~~~~~~~~~~----~~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~ 63 (377) |++++|.+.+.++.+..+.+ .++ .+++++. +.++...+.+..++......+..... ..... 
T Consensus 1 M~l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhcccc Confidence 55556655444433333322 221 1222221 22333333333333221110000000 00000 Q ss_pred ccc-----ccHHHH-----------------------HHH--------HHHHhccCCCCCceeccHHHHHHHHHHHHhhh Q lcl|Aclame:pro 64 NRE-----LTAEEI-----------------------KFF--------NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEH 107 (377) Q Consensus 64 ~~~-----lt~~e~-----------------------~~~--------~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s 107 (377) ... -..+.+ .+. ....+.+++..||++||+++.++|++.+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~ 160 (435) T protein:vir:80 81 AAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS 160 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhc Confidence 000 000000 000 01122345567999999999999999999999 Q ss_pred hhhhh-ceeEec-CCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCH--HHHHHHHH Q lcl|Aclame:pro 108 PLLKV-INFKNT-SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFIT 183 (377) Q Consensus 108 ~l~~~-~~v~~~-~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~~~~~l~ 183 (377) +++++ ++++|+ ++++++|+.++.+.+.|++|.+.. ++++++|++|++.+++++++++||+|+|+||. +++++||. 
T Consensus 161 ~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~-~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~ 239 (435) T protein:vir:80 161 VVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDI-PTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVV 239 (435) T ss_pred hhhhccceeeecCCCceEEEEEeCCcceeeeccCccc-cccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHH Confidence 99998 788876 468999999999999999876665 56889999999999999999999999999995 47999999 Q ss_pred HHHHHHHHHHhhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 184 EQLKEAIAVALELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVN 262 (377) Q Consensus 184 ~~la~~~a~~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 262 (377) ++|+++++++++.+|++|+|++ +|.||++................ . .....+..++..+. T Consensus 240 ~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~--~---------------~~~~d~~~~~~~~~-- 300 (435) T protein:vir:80 240 GDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTL--Q---------------KIETDLGKAILALE-- 300 (435) T ss_pred HHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccch--h---------------hHHHHHHHHHHHhh-- Confidence 9999999999999999999975 79999987644333222111110 0 00001111111100 Q ss_pred hhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-----ccCCCceEEecCCCCcc--------eEEEEecccEE Q lcl|Aclame:pro 263 DKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-----VLPHGITILESLAVETG--------KAIAFVANRYD 329 (377) Q Consensus 263 ~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-----~l~~~~~v~~s~~~~~~--------~ii~gd~s~y~ 329 (377) .......+..|+|||.++..+.. .++.+|.|+. 
...+|+||+.+++||++ .++||||++|+ T Consensus 301 ---~~~~~~~~~~~vmn~~~~~~L~~---lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~ 374 (435) T protein:vir:80 301 ---NADANLTQPGWIMAPRTFRFLEG---LRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVF 374 (435) T ss_pred ---ccccccccCEEEEcHHHHHHHHh---hhccCCceeccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEE Confidence 00112245689999999866532 3456676641 12367889999999853 58999999999 Q ss_pred EEecceeeEEeechhh-----------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 330 AFMATASTIEEYDQTF-----------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 330 ~~~~~~~~i~~~~~~~-----------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +++|++++|+++++.. |.+|+++||+.+|+|+++.+|+||++|+=.+. T Consensus 375 i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:80 375 IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAW 433 (435) T ss_pred EEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCC Confidence 9999999999999864 88999999999999999999999999998887 No 23 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=2.9e-54 Score=314.07 Aligned_cols=347 Identities=13% Similarity=0.058 Sum_probs=225.5 Q ss_pred CC-ccHHHHHHHHH----HHHHHHHHHHhcc--CHHHHHHHHHHH---HHHHHHHHHHH--HHHHH-------------- Q lcl|Aclame:pro 1 MA-INLKELPKYRE----AVAELSAKISAGA--TPEEQEKLFEAA---FTTMGDEILAK--NEEEM-------------- 54 (377) Q Consensus 1 m~-~~~~~l~~~~~----~~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~~~~~~~~--~~~~~-------------- 54 (377) .. +.++++..... +.+.+.+...... ..++..+.++.. ...+....... ...+. 
T Consensus 140 ~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~ 219 (543) T protein:vir:81 140 LEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSP 219 (543) T ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh Confidence 11 12222222111 1111111111100 000001111111 11111000000 00000 Q ss_pred --HHHH---HhccccccccHHHHHHHHHHHh-ccCCCCCceeccHHHHHHHH-HHHHhhhhhhhhceeEecCCceEEEEE Q lcl|Aclame:pro 55 --ERMF---DLRDKNRELTAEEIKFFNDIDK-NVGGKDKFKLLPEETMVQVF-DDLVAEHPLLKVINFKNTSLRLKALTA 127 (377) Q Consensus 55 --~~~~---~~~~~~~~lt~~e~~~~~~~~~-~~~~s~gg~lvP~~~~~~Ii-~~~~~~s~l~~~~~v~~~~~~~~~p~~ 127 (377) .... ........++.++++.+..... ..++++||++||+++++.|| +.++..++|++++++.+++|...+|+. T Consensus 220 ~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~~~~~ 299 (543) T protein:vir:81 220 AYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVWHGVS 299 (543) T ss_pred hhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceEEEEe Confidence 0000 0011112344445555544432 34567899999999999877 667888999999999999999999999 Q ss_pred cCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC-c Q lcl|Aclame:pro 128 ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL-Q 206 (377) Q Consensus 128 ~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~-~ 206 (377) ++.+.+.|++|++.. 
++++++|+++++.+++++++++||++||+|+ +++++||.++|+++++++++.+||+|+|++ + T Consensus 300 ~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~ 377 (543) T protein:vir:81 300 SAAVQWSWDAEFEEV-SDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQ 377 (543) T ss_pred cCCcceeecccCccc-cccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Confidence 999999999877665 5688999999999999999999999999998 699999999999999999999999999985 8 Q ss_pred ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh Q lcl|Aclame:pro 207 PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) Q Consensus 207 P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~ 286 (377) |.||++..........+.......+ ..+..++..+ +.....+..|+|||.++..+ T Consensus 378 p~Gi~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~l-------~~~~~~~~~~v~n~~~~~~l 432 (543) T protein:vir:81 378 PTGIVTALAGTAAEIAPVTAETFAL------------------ADVYAVYEQL-------AARHRRQGAWLANNLIYNKI 432 (543) T ss_pred cccchhhcccccccccccccccccH------------------HHHHHHHHhh-------hccccCCcEEEEcHHHHHHH Confidence 9999986554333222221111111 1111111111 11233556899999997766 Q ss_pred cccccccCCCCcccc--------ccCCCceEEecCCCCcce----------EEEEecccEEEEecceeeEEeechhh--- Q lcl|Aclame:pro 287 EAKFTSRNQFGEYVT--------VLPHGITILESLAVETGK----------AIAFVANRYDAFMATASTIEEYDQTF--- 345 (377) Q Consensus 287 ~~~~~~~~~~G~~~~--------~l~~~~~v~~s~~~~~~~----------ii~gd~s~y~~~~~~~~~i~~~~~~~--- 345 (377) .. .++.+|.|+. ...+|+||+.+++||.+. 
++||||++|+++++++++|.++++.+ T Consensus 433 ~~---lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~ 509 (543) T protein:vir:81 433 RQ---FDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTN 509 (543) T ss_pred HH---hhcCCCceeccCcCCCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccc Confidence 42 2344554431 123678888898887542 89999999999999999999988764 Q ss_pred -hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 -AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 -f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|+++|++++|+|+++.+++||++|++++. T Consensus 510 ~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 510 RRPNGSRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred hhhcCceEEEEEEeeccEeecccceEEEEeccc Confidence 55789999999999999999999999999999 No 24 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=2.2e-53 Score=309.24 Aligned_cols=342 Identities=11% Similarity=0.054 Sum_probs=228.9 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhc------cCHHHHHHHHHHH---HHHHHHHHHHH-HHHHHHHHHHhccccc----- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAG------ATPEEQEKLFEAA---FTTMGDEILAK-NEEEMERMFDLRDKNR----- 65 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~~----- 65 (377) |+..+++|++..+++.+..+.+.+. .+.+++ ..+++. .+.+..++... .+.+............ 
T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~-~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASAR-SKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchh Confidence 8888887776655555444433221 112222 222221 22222222110 0000000000000000 Q ss_pred -cc-cHHH-HH-----------------HHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEE Q lcl|Aclame:pro 66 -EL-TAEE-IK-----------------FFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKA 124 (377) Q Consensus 66 -~l-t~~e-~~-----------------~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~ 124 (377) .. ..++ +. .........++..+|.++|+++.+.|++.+++.++|+++|+++|+++ .+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:10 80 DLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 00 0011 11 11111122344456678888899999999999999999999999875 5899 Q ss_pred EEEcCC-cceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccC Q lcl|Aclame:pro 125 LTAETS-GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNG 203 (377) Q Consensus 125 p~~~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G 203 (377) |+.++. +.+.|++|+++. ++++++|+++++.+++++++++||++||+|+. 
++++||.++|++++++++|.+||+|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G 237 (390) T protein:vir:10 160 VQETGFVNNAAIVAEGALK-PESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCccc-cccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 998764 688999876665 57889999999999999999999999999985 899999999999999999999999999 Q ss_pred CCc-ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 204 LLQ-PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 204 ~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) +++ |.||++.............. ..+ +.+..++..+ .+ ....+..|+|||++ T Consensus 238 ~~~~p~Gi~~~~~~~~~~~~~~~~-~~~-------------------~~~~~~~~~l------~~-~~~~~~~~v~n~~~ 290 (390) T protein:vir:10 238 ANDGLLGLIPQATTYAAPTTIAGA-TRV-------------------DQLRLAMLQA------SL-AEYPASGIVINPID 290 (390) T ss_pred CCcccccccccccccccccccccc-chH-------------------HHHHHHHHhh------cc-ccCCCCEEEEcHHH Confidence 875 99999765433222111110 000 1111111111 01 12244579999999 Q ss_pred hhhhcccccccCCCCccccc--------cCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeech-hhhhcCcEE Q lcl|Aclame:pro 283 RWTLEAKFTSRNQFGEYVTV--------LPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQ-TFAMEDLQL 352 (377) Q Consensus 283 ~~~~~~~~~~~~~~G~~~~~--------l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~-~~f~~~~~~ 352 (377) +..+.. .++.+|.|+.. ..+|+||+++++||+++++||||++ |.+.+|++++++.+++ .+|.+|++. 
T Consensus 291 ~~~L~~---lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 367 (390) T protein:vir:10 291 WAAIEL---AKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVT 367 (390) T ss_pred HHHHHH---hhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEE Confidence 766542 34566665421 2268899999999999999999997 6789999999999875 689999999 Q ss_pred EEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 353 YLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 353 ~~~~~r~dg~~~~~~af~~l~~~ 375 (377) ||+..|+||++++|+||++++++ T Consensus 368 ~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 368 VLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEEeeccEEeccccEEEEEeC Confidence 99999999999999999999999 No 25 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.8e-52 Score=304.25 Aligned_cols=342 Identities=11% Similarity=0.065 Sum_probs=227.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHH----Hhc-cCHHHHHHHHHHH---HHHHHHHHHHHH--HHHHHHHHHhcc--ccccc- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKI----SAG-ATPEEQEKLFEAA---FTTMGDEILAKN--EEEMERMFDLRD--KNREL- 67 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~----~~~-~~~~~~~~~~~~~---~~~~~~~~~~~~--~~~~~~~~~~~~--~~~~l- 67 (377) |...+++|++..+++.+..+.+ ... ...++..+.+.+. ...+..++.... ..+... ..... ..+.. 
T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~-~~~~~~~~~~~~~ 79 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEG-NGAGGDVQHVSVG 79 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cccccccccccch Confidence 8877777766444443333322 111 1111112222222 222222221100 000000 00000 00000 Q ss_pred ----cHHHHHH------------------HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEE Q lcl|Aclame:pro 68 ----TAEEIKF------------------FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKA 124 (377) Q Consensus 68 ----t~~e~~~------------------~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~ 124 (377) ..+..+. .......+++.++|+++|+++...|++.+++.++|+++|+++|+++ .+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:81 80 DMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE Confidence 0001110 1111122455678889999999999999999999999999999875 5789 Q ss_pred EEEcCC-cceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccC Q lcl|Aclame:pro 125 LTAETS-GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNG 203 (377) Q Consensus 125 p~~~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G 203 (377) |+.++. +.+.|++|.++. 
++++++|+++++.+++++++++||+|+|+|+ .++++||.++|++++++++|++|++|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g 237 (390) T protein:vir:81 160 VQETGFVNNAAIVAEGALK-PESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCccc-ccccceeeEEEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 998764 688999876655 5788999999999999999999999999998 5899999999999999999999999999 Q ss_pred CCc-ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 204 LLQ-PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 204 ~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) +++ |.||++.............. ..+ +.+..++... .+. ......|+|||++ T Consensus 238 ~~~~~~Gi~~~~~~~~~~~~~~~~-~~~-------------------~~~~~~~~~~------~~~-~~~~~~~v~~~~~ 290 (390) T protein:vir:81 238 ANDGLLGLIPQATTYAAPTTIAGA-TRV-------------------DQLRLAMLQA------SLA-EYNPSGIVINPID 290 (390) T ss_pred CCCcccceeecccccccccccccc-hhH-------------------HHHHHHHHhh------ccc-cCCCCEEEEcHHH Confidence 976 99999765433222111111 000 1111111111 011 1233479999999 Q ss_pred hhhhcccccccCCCCccccc--------cCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeech-hhhhcCcEE Q lcl|Aclame:pro 283 RWTLEAKFTSRNQFGEYVTV--------LPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQ-TFAMEDLQL 352 (377) Q Consensus 283 ~~~~~~~~~~~~~~G~~~~~--------l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~-~~f~~~~~~ 352 (377) +..+.. .++.+|.|+.. ..+|+||+.+++||+++++||||++ |.+.+|+++.++.+++ .+|.+|++. 
T Consensus 291 ~~~l~~---lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~ 367 (390) T protein:vir:81 291 WAAIEL---AKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMIT 367 (390) T ss_pred HHHHHH---hhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEE Confidence 765542 24555655411 2268899999999999999999998 7889999999999875 689999999 Q ss_pred EEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 353 YLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 353 ~~~~~r~dg~~~~~~af~~l~~~ 375 (377) ||+.+|+||++++|+|||+++++ T Consensus 368 ~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 368 VLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEEeeccEEecccceEEEEeC Confidence 99999999999999999999999 No 26 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=4.2e-53 Score=307.73 Aligned_cols=344 Identities=11% Similarity=0.094 Sum_probs=223.4 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCH------------HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhc--cc Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATP------------EEQEKLFEAAFT---TMGDEILAKNEEEMERMFDLR--DK 63 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~--~~ 63 (377) |....+++++.++++++..+.+++.... ++..+.++.... .+..++...... ..+..... .. 
T Consensus 17 ~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~-~~~~~~~~~~~~ 95 (418) T protein:vir:10 17 DSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQK-LARGGGSAELET 95 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhcccccccch Confidence 3333333344333333333333221110 011111111111 111111000000 00000000 00 Q ss_pred ccc-----ccHHHHHHHH-----------------H--HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|Aclame:pro 64 NRE-----LTAEEIKFFN-----------------D--IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS 119 (377) Q Consensus 64 ~~~-----lt~~e~~~~~-----------------~--~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~ 119 (377) .+. -..++.+.+. . .....+++++|++||+++++.|++.+++.++|+++|+++|++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~ 175 (418) T protein:vir:10 96 PKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS 175 (418) T ss_pred hhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeecc Confidence 000 0011111111 1 112345667899999999999999999999999999999987 Q ss_pred C-ceEEEEEcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 120 L-RLKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELA 197 (377) Q Consensus 120 ~-~~~~p~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a 197 (377) + .+++|+.++ .+.+.|++|.++. 
++++++|+++++.+++++++++||++||+|+ .++++||+++|++++++++|.+ T Consensus 176 ~~~~~~~~~~~~~~~a~~v~E~~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a 253 (418) T protein:vir:10 176 SSSIEYTVETGFTNNAAAVAEGAQK-PTSDLKFNLKNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQ 253 (418) T ss_pred CCceeEEEEecCCCceeeeccCccc-cccccceeeEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHH Confidence 5 489999776 5788999876665 5778999999999999999999999999998 4899999999999999999999 Q ss_pred eeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 198 IVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 198 ~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) |++|+|++ +|.||++............ ....+ ..+.+.+..+ ......+..| T Consensus 254 ~l~G~g~~~~p~Gi~~~~~~~~~~~~~~-~~~~~---------------~~i~~~~~~~-----------~~~~~~~~~~ 306 (418) T protein:vir:10 254 ILKGDGTGANILGILPQASAFMPSITLA-NATPI---------------DKIRLALLQA-----------VLAEFPATGI 306 (418) T ss_pred HhccCCCCcccccccccccccccccccc-ccccH---------------HHHHHHHHhh-----------ccccCCCCEE Confidence 99999987 4999998754433222111 11111 1111111111 0111234469 Q ss_pred EeccchhhhhcccccccCCCCcccc--------ccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeechh--h Q lcl|Aclame:pro 277 LLNPEDRWTLEAKFTSRNQFGEYVT--------VLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQT--F 345 (377) Q Consensus 277 ~~n~~~~~~~~~~~~~~~~~G~~~~--------~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~~--~ 345 (377) +|||.++..+.. .++.+|.|+. ...+|+||+.+++||+++++||||++ |.++++++++|.++++. . 
T Consensus 307 v~n~~~~~~L~~---lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~ 383 (418) T protein:vir:10 307 VLNPIDWASIEL---TKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDD 383 (418) T ss_pred EEcHHHHHHHHH---hhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchh Confidence 999999765532 2345555441 12368899999999999999999998 78899999999998876 4 Q ss_pred hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+||++++|+||+++++++- T Consensus 384 f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 384 FEKNMVSIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEeccC Confidence 99999999999999999999999999999876 No 27 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=2.4e-52 Score=303.57 Aligned_cols=343 Identities=10% Similarity=0.058 Sum_probs=228.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHH----Hhc-cCHHHHHHHHHHH---HHHHHHHHHHHHH-HHHHHHHHhcc--cccc--- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKI----SAG-ATPEEQEKLFEAA---FTTMGDEILAKNE-EEMERMFDLRD--KNRE--- 66 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~----~~~-~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~~~--~~~~--- 66 (377) |....++|.+..+++.+..+.+ ... ...++..+.+++. ...+..++..... .+......... ..+. 
T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchh Confidence 6665566655444333333322 111 1112222222222 2222222221110 00000000000 0000 Q ss_pred --ccHHHHHHH------------------HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEE Q lcl|Aclame:pro 67 --LTAEEIKFF------------------NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKAL 125 (377) Q Consensus 67 --lt~~e~~~~------------------~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p 125 (377) -..++.+.+ .......++.++|++||+++++.|++.+++.++|+++++++|+++ ..++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:97 81 MFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEE Confidence 011111111 111223456678999999999999999999999999999999865 57999 Q ss_pred EEcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC Q lcl|Aclame:pro 126 TAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGL 204 (377) Q Consensus 126 ~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~ 204 (377) +.++ .+.+.|++|+++. 
++++++|+++++.+++++++++||+||++|+ .++++||.++|++++++++|.+|++|+|+ T Consensus 161 ~~~~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~ 238 (390) T protein:vir:97 161 QETGFVNNAAIVAEGALK-PESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGA 238 (390) T ss_pred EEecCCcceeeecCCccc-cccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 9876 4689999876665 5788999999999999999999999999998 58999999999999999999999999998 Q ss_pred Cc-ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchh Q lcl|Aclame:pro 205 LQ-PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDR 283 (377) Q Consensus 205 ~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~ 283 (377) ++ |.||++.....+....... ...+ ..+.+.+..+ ......+..|+|||+++ T Consensus 239 ~~~p~Gi~~~~~~~~~~~~~~~-~~~~---------------d~~~~~~~~~-----------~~~~~~~~~~v~n~~~~ 291 (390) T protein:vir:97 239 NDGLLGLIPQATTYAAPTTIAG-ATRV---------------DQLRLAMLQA-----------SLAEYPASGIVINPIDW 291 (390) T ss_pred Cccccceeeccccccccccccc-cchH---------------HHHHHHHHhh-----------ccccCCCCEEEEcHHHH Confidence 75 9999986543332211111 1000 0011111110 11112345799999997 Q ss_pred hhhcccccccCCCCcccc--------ccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeech-hhhhcCcEEE Q lcl|Aclame:pro 284 WTLEAKFTSRNQFGEYVT--------VLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQ-TFAMEDLQLY 353 (377) Q Consensus 284 ~~~~~~~~~~~~~G~~~~--------~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~-~~f~~~~~~~ 353 (377) ..+.. .++.+|.|+. 
...+|+||+++++||+++++||||++ |.++++.+++++.+++ .+|.+|++.| T Consensus 292 ~~L~~---lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~ 368 (390) T protein:vir:97 292 AAIEL---AKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTV 368 (390) T ss_pred HHHHH---hhcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEE Confidence 66542 2455665541 12268899999999999999999997 7889999999999875 6899999999 Q ss_pred EEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~ 375 (377) |+.+|+|+++.+|+||++++++ T Consensus 369 r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 369 LAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EEEEeeccEEeccccEEEEEeC Confidence 9999999999999999999999 No 28 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=3.2e-52 Score=302.91 Aligned_cols=331 Identities=11% Similarity=0.072 Sum_probs=230.4 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhcc---ccccccHHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEE--EMERMFDLRD---KNRELTAEEIKFF 75 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~---~~~~lt~~e~~~~ 75 (377) |+++++++.+..+.+.+....+.+....+ +.+......+.+.+++...... +.++...... 
.......++++.| T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~~~e-~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAENKIE-EAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAF 79 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHH Confidence 88777666555544444444433322211 1222222233333333222111 1111111000 0111222344444 Q ss_pred HH--------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEEcCCcceeeecccccccc Q lcl|Aclame:pro 76 ND--------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTAETSGTAVWGDIFGEIKG 144 (377) Q Consensus 76 ~~--------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~~~~~~~a~w~~e~~~~~~ 144 (377) .. ....+++++||++||++++.+|++.+++.++|+++++++|+++ ...+|+..+.+.+.|+.|+++.++ T Consensus 80 ~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 159 (371) T protein:vir:81 80 VNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGE 159 (371) T ss_pred HHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeecccccccc Confidence 32 3445667789999999999999999999999999999999863 356777777788999988777666 Q ss_pred cccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccc Q lcl|Aclame:pro 145 QLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTG 224 (377) Q Consensus 145 ~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~ 224 (377) .++++|+++++.+++++++++||+|+|+||.+++++||.+.|++++++++|.+|++|+|++.|.|+.+. 
T Consensus 160 ~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~----------- 228 (371) T protein:vir:81 160 KATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADL----------- 228 (371) T ss_pred ccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccH----------- Confidence 677999999999999999999999999999999999999999999999999999999999998887521 Q ss_pred ccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccc-- Q lcl|Aclame:pro 225 RDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV-- 302 (377) Q Consensus 225 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~-- 302 (377) ... ...+... .......+..|+|||.++..+.. .++.+|.|+-. T Consensus 229 ------~~i---------------~~~~~~~----------l~~~~~~~a~~vmn~~~~~~L~~---lkd~~g~~l~~~~ 274 (371) T protein:vir:81 229 ------DGL---------------KQIINVQ----------LDPVFRSTSSVIVNQDAFNWLDT---LKDQNGQYLLQPS 274 (371) T ss_pred ------HHH---------------HHHHHhh----------cchhhhcCCEEEEcHHHHHHHHH---hhccCCCeeeecc Confidence 000 0000000 01112345689999999766542 24455554411 Q ss_pred -------cCCCceEEecCCCCc------------ceEEEEeccc-EEEEecceeeEEeechh--hhhcCcEEEEEEEEEc Q lcl|Aclame:pro 303 -------LPHGITILESLAVET------------GKAIAFVANR-YDAFMATASTIEEYDQT--FAMEDLQLYLTKNYFY 360 (377) Q Consensus 303 -------l~~~~~v~~s~~~~~------------~~ii~gd~s~-y~~~~~~~~~i~~~~~~--~f~~~~~~~~~~~r~d 360 (377) ..+|+||+.++++|. ..++||||++ |.+++|.+++|.++++. 
.|.+|++.||+.+|+| T Consensus 275 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d 354 (371) T protein:vir:81 275 ISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMD 354 (371) T ss_pred cCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 225777888877762 3589999998 67889999999999876 5889999999999999 Q ss_pred CEEecccceEEEEeecC Q lcl|Aclame:pro 361 GKAKDNHTAALLTLAGG 377 (377) Q Consensus 361 g~~~~~~af~~l~~~a~ 377 (377) +++++|+||+++++++- T Consensus 355 ~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 355 VKMRDDEAFVFGEVQLA 371 (371) T ss_pred cEEecccceEEEEEecC Confidence 99999999999999999 No 29 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=3e-52 Score=303.04 Aligned_cols=347 Identities=10% Similarity=0.042 Sum_probs=232.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHH-H-------HHHHHHHHHHHHHHHH---HHHHHHHHHHHHh--ccccc-- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEE-Q-------EKLFEAAFTTMGDEIL---AKNEEEMERMFDL--RDKNR-- 65 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~-------~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~--~~~~~-- 65 (377) |....+++++.++++++..+.+++...... . .+...+..+.+..+.. +............ ..... 
T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEA 80 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccch Confidence 777677777777666666555543221100 0 0011111111111110 0000000000000 00000 Q ss_pred ccc-------HHHHHHH-HHH-----------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEE Q lcl|Aclame:pro 66 ELT-------AEEIKFF-NDI-----------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKAL 125 (377) Q Consensus 66 ~lt-------~~e~~~~-~~~-----------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p 125 (377) ... ....+.+ ... ....++..+|++||++++++|++.+++.++|+++|+++|+++ .+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~ 160 (395) T protein:vir:43 81 PKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYV 160 (395) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEE Confidence 000 0111111 111 112345678899999999999999999999999999999876 48999 Q ss_pred EEcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC Q lcl|Aclame:pro 126 TAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGL 204 (377) Q Consensus 126 ~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~ 204 (377) +.++ .+.+.|++|+++. ++++++|+++++.+++++++++||++||+|+. 
++++||.++|++++++++|.+|++|+|+ T Consensus 161 ~~~~~~~~a~~v~E~~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g~ 238 (395) T protein:vir:43 161 RETGFVNNAAPVSEGTQK-PYSDLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNGT 238 (395) T ss_pred EEecCCCceeeecCCccc-cccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 9866 4689999876655 57889999999999999999999999999975 7999999999999999999999999999 Q ss_pred Ccc-eeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchh Q lcl|Aclame:pro 205 LQP-VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDR 283 (377) Q Consensus 205 ~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~ 283 (377) ++| .||++................. .++.+..++... ......+..|+|||.++ T Consensus 239 ~~~~~Gi~~~~~~~~~~~~~~~~~~~------------------~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~ 293 (395) T protein:vir:43 239 GANLHGIIPQAQAYAPPSGVVVTAEQ------------------RIDRIRLAILQA-------QLAEFPASGIVLNPIDW 293 (395) T ss_pred CCccccccccccccccccccccccch------------------hHHHHHHHHHhh-------ccccCCCcEEEEcHHHH Confidence 765 8999876544433322211110 011111111111 11122345799999997 Q ss_pred hhhcccccccCCCCcccc--------ccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeechh--hhhcCcEE Q lcl|Aclame:pro 284 WTLEAKFTSRNQFGEYVT--------VLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQT--FAMEDLQL 352 (377) Q Consensus 284 ~~~~~~~~~~~~~G~~~~--------~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~~--~f~~~~~~ 352 (377) ..+.. .++.+|.|+. ...+|+||+.+++||+++++||||++ |.+++|.+++|+.+++. .|.+|++. 
T Consensus 294 ~~l~~---lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 370 (395) T protein:vir:43 294 ALIEL---NKDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVT 370 (395) T ss_pred HHHHH---hhccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEE Confidence 65532 2455565542 12268899999999999999999998 77899999999988765 58999999 Q ss_pred EEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 353 ~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ||+.+|+|+++++|+||++|+++|- T Consensus 371 ~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 371 IRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEEEEeeccEEecccceEEEEeccC Confidence 9999999999999999999999999 No 30 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=8.1e-53 Score=306.18 Aligned_cols=334 Identities=12% Similarity=0.084 Sum_probs=213.8 Q ss_pred CCccH---HHHHHHHHHHHHHHHHHHh----c-cCHHHHHHHHHHHHHHHHHHHHH---H---HHHHHHHHHH-hccccc Q lcl|Aclame:pro 1 MAINL---KELPKYREAVAELSAKISA----G-ATPEEQEKLFEAAFTTMGDEILA---K---NEEEMERMFD-LRDKNR 65 (377) Q Consensus 1 m~~~~---~~l~~~~~~~~~~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~---~---~~~~~~~~~~-~~~~~~ 65 (377) |+... +++.+..++++++.+.+.+ . ...++.. ..+...+.+.+++.. + .+.+.+.... ...... 
T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~-~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIK-QLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 76521 2333333444444333322 1 1112211 111212222211110 0 0001111100 000101 Q ss_pred cccHHHH---------HH----------------HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|Aclame:pro 66 ELTAEEI---------KF----------------FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) Q Consensus 66 ~lt~~e~---------~~----------------~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~ 120 (377) ....+++ +. .......+++++||++||++++++|++.++++++||++|+++++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 159 (387) T protein:vir:26 80 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC Confidence 1111111 00 0112335677889999999999999999999999999999998875 Q ss_pred ceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc-ce Q lcl|Aclame:pro 121 RLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL-AI 198 (377) Q Consensus 121 ~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~-a~ 198 (377) ..+|+.. ..+++.|++|+++. ++++++|+++++.+++++++++||+|||+||.+++++||.++|+++++++++. 
+| T Consensus 160 -~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~ 237 (387) T protein:vir:26 160 -LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDAL 237 (387) T ss_pred -ceeeeeeccCCccccccccccc-cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 5677644 55789999766554 56789999999999999999999999999999999999999999999999765 67 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .+|+|+++|.|+++..+...+... .+ ++.+..++..+ ...+..+..|+| T Consensus 238 ~~g~g~g~~~g~~~~~~~~~~~~~-----~~-------------------~d~i~~~~~~l-------~~~y~~na~~im 286 (387) T protein:vir:26 238 AVSPKSGLEHMSFYNGSVKEVEGA-----DM-------------------YDAIINALADL-------HEDYRDNATIYM 286 (387) T ss_pred hcCCCccccceeeecccccccccc-----ch-------------------HHHHHHHHhcc-------ChhhhcCCEEEE Confidence 789999999999865433221110 00 11111111111 112345678999 Q ss_pred ccchhhhhcccccccCCCC-----ccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFG-----EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLY 353 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G-----~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~ 353 (377) |+.+++.+..... +.+| .+.++ +|+||++++.++ +++||||++|++. +.++.+.++.+. 
.+|+++| T Consensus 287 n~~t~~~~~~~~~--~~~~~~~~~~~~~l--lG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~ 357 (387) T protein:vir:26 287 RYADYVKIISVLS--NGTTNFFDTPAEKV--FGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLF 357 (387) T ss_pred echHHHHHHHHHh--cCCCcccccCCccc--cccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEE Confidence 9999877654432 2222 23344 467788887765 5899999997654 456777777664 4799999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++.+|+||++++|+||++|+++|. T Consensus 358 ~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 358 VLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEEEeCcEeechhheEEEEeecC Confidence 999999999999999999999888 No 31 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=8.1e-53 Score=306.18 Aligned_cols=334 Identities=12% Similarity=0.084 Sum_probs=213.8 Q ss_pred CCccH---HHHHHHHHHHHHHHHHHHh----c-cCHHHHHHHHHHHHHHHHHHHHH---H---HHHHHHHHHH-hccccc Q lcl|Aclame:pro 1 MAINL---KELPKYREAVAELSAKISA----G-ATPEEQEKLFEAAFTTMGDEILA---K---NEEEMERMFD-LRDKNR 65 (377) Q Consensus 1 m~~~~---~~l~~~~~~~~~~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~---~---~~~~~~~~~~-~~~~~~ 65 (377) |+... +++.+..++++++.+.+.+ . ...++.. ..+...+.+.+++.. + .+.+.+.... ...... 
T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~-~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIK-QLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 76521 2333333444444333322 1 1112211 111212222211110 0 0001111100 000101 Q ss_pred cccHHHH---------HH----------------HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|Aclame:pro 66 ELTAEEI---------KF----------------FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) Q Consensus 66 ~lt~~e~---------~~----------------~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~ 120 (377) ....+++ +. .......+++++||++||++++++|++.++++++||++|+++++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 159 (387) T protein:vir:96 80 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC Confidence 1111111 00 0112335677889999999999999999999999999999998875 Q ss_pred ceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc-ce Q lcl|Aclame:pro 121 RLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL-AI 198 (377) Q Consensus 121 ~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~-a~ 198 (377) ..+|+.. ..+++.|++|+++. ++++++|+++++.+++++++++||+|||+||.+++++||.++|+++++++++. 
+| T Consensus 160 -~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~ 237 (387) T protein:vir:96 160 -LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDAL 237 (387) T ss_pred -ceeeeeeccCCccccccccccc-cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 5677644 55789999766554 56789999999999999999999999999999999999999999999999765 67 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .+|+|+++|.|+++..+...+... .+ ++.+..++..+ ...+..+..|+| T Consensus 238 ~~g~g~g~~~g~~~~~~~~~~~~~-----~~-------------------~d~i~~~~~~l-------~~~y~~na~~im 286 (387) T protein:vir:96 238 AVSPKSGLEHMSFYNGSVKEVEGA-----DM-------------------YDAIINALADL-------HEDYRDNATIYM 286 (387) T ss_pred hcCCCccccceeeecccccccccc-----ch-------------------HHHHHHHHhcc-------ChhhhcCCEEEE Confidence 789999999999865433221110 00 11111111111 112345678999 Q ss_pred ccchhhhhcccccccCCCC-----ccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFG-----EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLY 353 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G-----~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~ 353 (377) |+.+++.+..... +.+| .+.++ +|+||++++.++ +++||||++|++. +.++.+.++.+. 
.+|+++| T Consensus 287 n~~t~~~~~~~~~--~~~~~~~~~~~~~l--lG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~ 357 (387) T protein:vir:96 287 RYADYVKIISVLS--NGTTNFFDTPAEKV--FGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLF 357 (387) T ss_pred echHHHHHHHHHh--cCCCcccccCCccc--cccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEE Confidence 9999877654432 2222 23344 467788887765 5899999997654 456777777664 4799999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++.+|+||++++|+||++|+++|. T Consensus 358 ~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 358 VLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEEEeCcEeechhheEEEEeecC Confidence 999999999999999999999888 No 32 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=8.1e-53 Score=306.18 Aligned_cols=334 Identities=12% Similarity=0.084 Sum_probs=213.8 Q ss_pred CCccH---HHHHHHHHHHHHHHHHHHh----c-cCHHHHHHHHHHHHHHHHHHHHH---H---HHHHHHHHHH-hccccc Q lcl|Aclame:pro 1 MAINL---KELPKYREAVAELSAKISA----G-ATPEEQEKLFEAAFTTMGDEILA---K---NEEEMERMFD-LRDKNR 65 (377) Q Consensus 1 m~~~~---~~l~~~~~~~~~~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~---~---~~~~~~~~~~-~~~~~~ 65 (377) |+... +++.+..++++++.+.+.+ . ...++.. ..+...+.+.+++.. + .+.+.+.... ...... 
T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~-~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIK-QLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 76521 2333333444444333322 1 1112211 111212222211110 0 0001111100 000101 Q ss_pred cccHHHH---------HH----------------HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|Aclame:pro 66 ELTAEEI---------KF----------------FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) Q Consensus 66 ~lt~~e~---------~~----------------~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~ 120 (377) ....+++ +. .......+++++||++||++++++|++.++++++||++|+++++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 159 (387) T protein:vir:94 80 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC Confidence 1111111 00 0112335677889999999999999999999999999999998875 Q ss_pred ceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc-ce Q lcl|Aclame:pro 121 RLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL-AI 198 (377) Q Consensus 121 ~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~-a~ 198 (377) ..+|+.. ..+++.|++|+++. ++++++|+++++.+++++++++||+|||+||.+++++||.++|+++++++++. 
+| T Consensus 160 -~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~ 237 (387) T protein:vir:94 160 -LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDAL 237 (387) T ss_pred -ceeeeeeccCCccccccccccc-cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 5677644 55789999766554 56789999999999999999999999999999999999999999999999765 67 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .+|+|+++|.|+++..+...+... .+ ++.+..++..+ ...+..+..|+| T Consensus 238 ~~g~g~g~~~g~~~~~~~~~~~~~-----~~-------------------~d~i~~~~~~l-------~~~y~~na~~im 286 (387) T protein:vir:94 238 AVSPKSGLEHMSFYNGSVKEVEGA-----DM-------------------YDAIINALADL-------HEDYRDNATIYM 286 (387) T ss_pred hcCCCccccceeeecccccccccc-----ch-------------------HHHHHHHHhcc-------ChhhhcCCEEEE Confidence 789999999999865433221110 00 11111111111 112345678999 Q ss_pred ccchhhhhcccccccCCCC-----ccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFG-----EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLY 353 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G-----~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~ 353 (377) |+.+++.+..... +.+| .+.++ +|+||++++.++ +++||||++|++. +.++.+.++.+. 
.+|+++| T Consensus 287 n~~t~~~~~~~~~--~~~~~~~~~~~~~l--lG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~ 357 (387) T protein:vir:94 287 RYADYVKIISVLS--NGTTNFFDTPAEKV--FGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLF 357 (387) T ss_pred echHHHHHHHHHh--cCCCcccccCCccc--cccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEE Confidence 9999877654432 2222 23344 467788887765 5899999997654 456777777664 4799999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++.+|+||++++|+||++|+++|. T Consensus 358 ~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 358 VLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEEEeCcEeechhheEEEEeecC Confidence 999999999999999999999888 No 33 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=9.1e-54 Score=311.38 Aligned_cols=329 Identities=12% Similarity=0.082 Sum_probs=213.2 Q ss_pred HHHHHHHHHHHHHHHHHHHh---ccCHHHHH--HHHHHHHHHH----HHHHHHHHHHH-HHHHHHhccccccccHHHHHH Q lcl|Aclame:pro 5 LKELPKYREAVAELSAKISA---GATPEEQE--KLFEAAFTTM----GDEILAKNEEE-MERMFDLRDKNRELTAEEIKF 74 (377) Q Consensus 5 ~~~l~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~-~~~~~~~~~~~~~lt~~e~~~ 74 (377) ++++++..++++++.++... ..+..+.. .......... ..+.......+ .+.........+.+ ..... 
T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~--~~~~~ 78 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPNEFEKPS--MEAQR 78 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhhHHHHHH--hhHHH Confidence 55555555555444443322 11111100 0000000000 00000000001 11111000000000 00111 Q ss_pred HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEc-CCcceeeecccccccccccccceeE Q lcl|Aclame:pro 75 FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 75 ~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) .......+++++||++||+++.++|++.++++++||++|+++++++ ..+|+.. +.+++.|++|++.. ++++++|++| T Consensus 79 ~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~-~~~p~~~~~~~~a~~v~E~~~~-~~~~~~f~~v 156 (352) T protein:vir:78 79 LLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG-LEIPRVSYTLDDDDFITDVETA-KELKLKGDTV 156 (352) T ss_pred HHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC-ceEEEEecCCCccccccccccc-ccccccceee Confidence 1233456678899999999999999999999999999999998876 4566644 45789999866555 5678999999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc-ceeeccCCCcceeeeeccccccccccccccccccch Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL-AIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKT 232 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~-a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 232 (377) ++.+|+++++++||+|||+||.+|+++||.++|+++++++++. +|.+|+|+++|.|+++.......... .. 
T Consensus 157 ~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~-----~~--- 228 (352) T protein:vir:78 157 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGA-----NM--- 228 (352) T ss_pred eecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceecccccccccc-----ch--- Confidence 9999999999999999999999999999999999999998655 77889999999999876443322111 00 Q ss_pred hhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCC-----ccccccCCCc Q lcl|Aclame:pro 233 DKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFG-----EYVTVLPHGI 307 (377) Q Consensus 233 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G-----~~~~~l~~~~ 307 (377) ++.+..++..+ ...+.++..|+||+.+++.+..... ++++ .+.++ +|+ T Consensus 229 ----------------~d~i~~~~~~l-------~~~~~~~a~~~mn~~t~~~l~~~~~--~~~~~~~~~~~~~l--lG~ 281 (352) T protein:vir:78 229 ----------------YDAIINALADL-------HEDYRDNATIYMRYADYVKIISVLS--NGTTNFFDTPAEKV--FGK 281 (352) T ss_pred ----------------HHHHHHHHhcc-------ChhhhcCCEEEEehHHHHHHHHHHh--ccCCcccccCCccc--ccc Confidence 11111111111 1223456789999999877654322 2222 23344 467 Q ss_pred eEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 308 TILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 308 ~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ||++++.++ +++||||++|++. +.++.++.+.+. .+++++|++.+|+||++++|+||++|+++|. 
T Consensus 282 PV~~~~~~~--~~~~Gdf~~~~~~-~~~~~~~~~~~~--~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 282 PVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred ceEEecCCC--ceeEeehhhhhhh-hhhheeeeeccc--cCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 788887765 6899999997664 456777777664 4899999999999999999999999999999 No 34 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.8e-52 Score=304.20 Aligned_cols=334 Identities=12% Similarity=0.082 Sum_probs=214.1 Q ss_pred CCccH---HHHHHHHHHHHHHHHHHHh----c-cCHHHHHHHHHHHHHHHHHHHH---HH---HHHHHHHHHHh-ccccc Q lcl|Aclame:pro 1 MAINL---KELPKYREAVAELSAKISA----G-ATPEEQEKLFEAAFTTMGDEIL---AK---NEEEMERMFDL-RDKNR 65 (377) Q Consensus 1 m~~~~---~~l~~~~~~~~~~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~~~-~~~~~ 65 (377) |+.-. +++.++.++++++.+.+.. . ...++.. ..+...+.+.+++. .+ ...+.+..... ..... T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~-~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIK-QLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCC Confidence 76521 2233334444444333321 1 1112111 11111222211111 00 00011111000 00000 Q ss_pred cccHHH----------HHH---------------HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|Aclame:pro 66 ELTAEE----------IKF---------------FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) Q Consensus 66 ~lt~~e----------~~~---------------~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~ 120 (377) ....++ |.. 
.......+++++||++||++++++|++.++++++|+++|+++++++ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~ 159 (387) T protein:vir:93 80 SLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 159 (387) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC Confidence 001111 100 1122345677889999999999999999999999999999998875 Q ss_pred ceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc-ce Q lcl|Aclame:pro 121 RLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL-AI 198 (377) Q Consensus 121 ~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~-a~ 198 (377) .++|+.. +.+.+.|++|++.. ++++++|+++++.+++++++++||+|||+||.+|+++||.++|+++++++++. +| T Consensus 160 -~~~p~~~~~~~~a~~v~E~~~~-~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~ 237 (387) T protein:vir:93 160 -LEIPRVSYTLDDDDFITDVETA-KELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDAL 237 (387) T ss_pred -ceEEEEeecCCccccccCcccc-cccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 5677654 55779999876655 56789999999999999999999999999999999999999999999999766 67 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .+|+|+++|.|++.......+... 
.+ ++.+..++..+ ...+..+..|+| T Consensus 238 ~~g~g~g~p~g~l~~~~~~~v~~~-----~~-------------------~d~i~~~~~~l-------~~~~~~~a~~~m 286 (387) T protein:vir:93 238 AVSPKSGLDHMSFYNGSVKEVEGA-----DM-------------------YDAIINALADL-------HEDYRDNATIYM 286 (387) T ss_pred hcCCCccccceeeecccccccccc-----ch-------------------HHHHHHHHhcc-------ChhhhcCCEEEE Confidence 889999999999865332221110 00 11111111111 112335678999 Q ss_pred ccchhhhhcccccccCCCCcc-----ccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFGEY-----VTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLY 353 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G~~-----~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~ 353 (377) |+.+++.++... .+++|.| .++ +|+||++++.++ +++||||++|++. +.++.+.++.+ +.+++++| T Consensus 287 n~~t~~~~~~~~--~d~~~~~~~~~~~~l--lG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~--~~~~~~~~ 357 (387) T protein:vir:93 287 RYADYVKIISVL--SNGTTNFFDTPAEKV--FGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKD--VKKGEYLF 357 (387) T ss_pred echHHHHHHHHH--hcCCCcccccCCccc--cccceEEecCCC--ceeeeehhhhhee-hhhheeeeccc--ccCCceeE Confidence 999987765443 2334333 344 467788887765 5899999997664 55677777665 45899999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++..|+||++++|+||++|++++. 
T Consensus 358 ~~~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 358 VLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred EEEeeeCceeechhheEEEEeecC Confidence 999999999999999999999887 No 35 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=1.4e-52 Score=304.86 Aligned_cols=346 Identities=16% Similarity=0.116 Sum_probs=216.7 Q ss_pred CCccHHH-HHHHHHH-HHHHHH---HHHhc-cCHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH-HHHh------ Q lcl|Aclame:pro 1 MAINLKE-LPKYREA-VAELSA---KISAG-ATPEEQEKLFEAAFTTMGDEIL-------AKNEEEMER-MFDL------ 60 (377) Q Consensus 1 m~~~~~~-l~~~~~~-~~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~-~~~~------ 60 (377) |-+...+ ..+..++ ..++.+ ++... ...++..+..+.....+.+... .+....... .... T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEF 80 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhh Confidence 3332211 1111111 111111 11110 0000000101110000000000 000000000 0000 Q ss_pred ------------------ccccccccHHHHHHHH-HHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC- Q lcl|Aclame:pro 61 ------------------RDKNRELTAEEIKFFN-DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL- 120 (377) Q Consensus 61 ------------------~~~~~~lt~~e~~~~~-~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~- 120 (377) ..........+.+.+. 
.....++++++|++||++++++|++.+++.++|+++++++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 160 (413) T protein:vir:81 81 FAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNT 160 (413) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCC Confidence 0000001111111111 12234556789999999999999999999999999999999876 Q ss_pred ceEEEEEcCC----cceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 121 RLKALTAETS----GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL 196 (377) Q Consensus 121 ~~~~p~~~~~----~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~ 196 (377) ..++|+.... ..+.|+.|+++.++...++|+++++.+++++++++||+|||+|+. ++++||+++|++++++++|+ T Consensus 161 ~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~ 239 (413) T protein:vir:81 161 TIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEER 239 (413) T ss_pred ceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 4788887643 457899877776544447899999999999999999999999986 59999999999999999999 Q ss_pred ceeeccCCCcc-eeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceE Q lcl|Aclame:pro 197 AIVKGNGLLQP-VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVK 275 (377) Q Consensus 197 a~l~G~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (377) +|++|+|+++| .||++.....+....+... .+..+...+..... ......+ . 
T Consensus 240 ~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~---------------------~~~~i~~~~~~~~~-----~~~~~~~-~ 292 (413) T protein:vir:81 240 QLLLGDGTGNNLTGLLKRDGIQTLAVSNKDE---------------------LADSIYKAMTNISL-----ATPFQAD-A 292 (413) T ss_pred HHhccCCCCCcccccccccccccccccccch---------------------hHHHHHHHHHHhhh-----hccCCCc-E Confidence 99999999875 7998765443332221110 01111111111000 1111222 4 Q ss_pred EEeccchhhhhcccccccCCCCcccc----------------ccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeE Q lcl|Aclame:pro 276 LLLNPEDRWTLEAKFTSRNQFGEYVT----------------VLPHGITILESLAVETGKAIAFVANR-YDAFMATASTI 338 (377) Q Consensus 276 ~~~n~~~~~~~~~~~~~~~~~G~~~~----------------~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i 338 (377) |+|||+++..+.. .++.+|.|+. ...+|+||+.+++||+++++||||++ |.+++|+++++ T Consensus 293 ~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v 369 (413) T protein:vir:81 293 LVINPLDYQELRL---AKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRI 369 (413) T ss_pred EEEcHHHHHHHHH---hhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecceEE Confidence 9999999776532 2344554431 12358899999999999999999997 78899999999 Q ss_pred Eeechh--hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 339 EEYDQT--FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 339 ~~~~~~--~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.+++. 
+|.+|++.||+.+|+|+++.+|+||++|++++- T Consensus 370 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 370 DSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred EEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 998875 599999999999999999999999999999988 No 36 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=9e-52 Score=300.43 Aligned_cols=329 Identities=14% Similarity=0.110 Sum_probs=218.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhcc--------CHHHHHHHHHHHHHHHHHHHHHHH--HHHHH-HHHH-hcc-c---- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGA--------TPEEQEKLFEAAFTTMGDEILAKN--EEEME-RMFD-LRD-K---- 63 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~-~~~-~---- 63 (377) |+. +++|.+..+++.+..+.+.+.. ...++.+......+.+.++..... ..+.+ .... ... . T Consensus 1 Mk~-~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:49 1 MKT-SNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPL 79 (397) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 774 5555444443333333322211 011111111111222221111110 00000 0000 000 0 Q ss_pred ---cccccHHHHHHHHHH-----------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEEE Q lcl|Aclame:pro 64 ---NRELTAEEIKFFNDI-----------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS---LRLKALT 126 (377) Q Consensus 64 ---~~~lt~~e~~~~~~~-----------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~---~~~~~p~ 126 (377) ......++++.|... +..+++++||++||+++++.|++.+++.++|+++|+++|++ +...+|. 
T Consensus 80 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 159 (397) T protein:vir:49 80 TKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEK 159 (397) T ss_pred ccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEe Confidence 111223444444332 23456678999999999999999999999999999999875 3456776 Q ss_pred EcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC Q lcl|Aclame:pro 127 AET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL 205 (377) Q Consensus 127 ~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~ 205 (377) ..+ .+.+.|++|+++.++.++++|+++++.+++++++++||+|||+||.+++++||.++|++++++++|.+|++|+|++ T Consensus 160 ~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~ 239 (397) T protein:vir:49 160 WTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAAL 239 (397) T ss_pred eccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 554 4679999887777666789999999999999999999999999999999999999999999999999999999998 Q ss_pred cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhh Q lcl|Aclame:pro 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT 285 (377) Q Consensus 206 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 285 (377) +|.+... ++ ....+.+..+ ......+..|+|||+++.. 
T Consensus 240 ~~~~~~~----------------~~---------------d~i~~~~~~l-----------~~~~~~~a~~vmn~~~~~~ 277 (397) T protein:vir:49 240 PTKPTLT----------------KW---------------DDIIDLEAKV-----------DPAIKQTSFFLTNTSGFTA 277 (397) T ss_pred ccccccc----------------cH---------------HHHHHHHHhh-----------hhhhcCCCEEEEcHHHHHH Confidence 7654321 01 1111111111 1112345789999999766 Q ss_pred hcccccccCCCCccc-----------cccCCCceEEe--cCCCCc-----ceEEEEeccc-EEEEecceeeEEeechh-- Q lcl|Aclame:pro 286 LEAKFTSRNQFGEYV-----------TVLPHGITILE--SLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQT-- 344 (377) Q Consensus 286 ~~~~~~~~~~~G~~~-----------~~l~~~~~v~~--s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~~~~~~-- 344 (377) +.. .++.+|.|+ +++ |+||++ +..+|. ..++||||++ |.+++|++++++++++. T Consensus 278 l~~---lkd~~G~~l~~~~~~~~~~~~l~--G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:49 278 LKK---VKNALGDYLMERDVKSPTGYSID--GFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGG 352 (397) T ss_pred HHH---hhcCCCceeeccCcCCCCCceec--ceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccc Confidence 542 234455543 344 555544 333443 3489999997 67899999999998865 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|.+|++.||+..|+||++.+|+||+++++++. 
T Consensus 353 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 353 AFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred hhhcCceeEEEEeeeCcEEecccceEEEEeecc Confidence 699999999999999999999999999999998 No 37 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=3.1e-52 Score=302.96 Aligned_cols=331 Identities=13% Similarity=0.085 Sum_probs=224.6 Q ss_pred CCccH-HHHHHHHHHHHHHHHHHHhcc---CHHHHHHHHHHHHHHHHHHHHH-------HH------------------- Q lcl|Aclame:pro 1 MAINL-KELPKYREAVAELSAKISAGA---TPEEQEKLFEAAFTTMGDEILA-------KN------------------- 50 (377) Q Consensus 1 m~~~~-~~l~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~-------~~------------------- 50 (377) |.|++ ++++++.++++++.+.++... ..++.... ....+.+.+++.. .. T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~-~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKALQEGNTDEARAL-LDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEG 79 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc Confidence 66654 345555555555544443322 11211111 1111111111100 00 Q ss_pred -----------HHHHHHHHHhccccccccHHHHHHHHH----HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhcee Q lcl|Aclame:pro 51 -----------EEEMERMFDLRDKNRELTAEEIKFFND----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINF 115 (377) Q Consensus 51 -----------~~~~~~~~~~~~~~~~lt~~e~~~~~~----~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v 115 (377) ..++.+.+.....+..+..+++..+.. 
....+++++||++||+++.+.|++.+++.++|+++|++ T Consensus 80 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~ 159 (397) T protein:vir:12 80 QRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTV 159 (397) T ss_pred cccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcce Confidence 001111111112223334444433221 22345667899999999999999999999999999999 Q ss_pred EecC---CceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 116 KNTS---LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAV 192 (377) Q Consensus 116 ~~~~---~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~ 192 (377) +|++ +.+.+|+.++.+.+.|++|+++.++.+.++|++|++.+++++++++||+|+++||.+++++||.++|++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~ 239 (397) T protein:vir:12 160 EPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVV 239 (397) T ss_pred eeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHH Confidence 9875 4577888888899999988877765667999999999999999999999999999999999999999999999 Q ss_pred HhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccC Q lcl|Aclame:pro 193 ALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG 272 (377) Q Consensus 193 ~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (377) ++|.+|++|+|+++|.|+++. .... ..+.. . ....... 
T Consensus 240 ~~d~~il~G~g~~~~~g~~~~-----------------~~i~---------------~~~~~---~-------l~~~~~~ 277 (397) T protein:vir:12 240 TRNNLILAAIASLKKVDIDGL-----------------DGIK---------------KALNV---T-------LDPMVAP 277 (397) T ss_pred HHHHHHHhccccccccccccH-----------------HHHH---------------HHHhh---c-------cchhhhC Confidence 999999999999999988531 0000 00000 0 0111234 Q ss_pred ceEEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCC-CC-----cceEEEEeccc-EEEEeccee Q lcl|Aclame:pro 273 QVKLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLA-VE-----TGKAIAFVANR-YDAFMATAS 336 (377) Q Consensus 273 ~~~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~-~~-----~~~ii~gd~s~-y~~~~~~~~ 336 (377) +..|+|||+++..+.. .++.+|.|+. ...+|+||+.+++ ++ +..++||||++ |.+++|+++ T Consensus 278 ~a~~~~n~~~~~~L~~---lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 354 (397) T protein:vir:12 278 GSIVLTNQDGYDWLDT---LKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQ 354 (397) T ss_pred CCEEEEcHHHHHHHHH---hhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecce Confidence 5789999998765532 2455565541 1125667765443 33 22389999998 568889999 Q ss_pred eEEeechh--hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 337 TIEEYDQT--FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 337 ~i~~~~~~--~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|+.+++. 
.|.+|++.||+.+|+||++.+|+||+++++++= T Consensus 355 ~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 355 SIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 99887654 589999999999999999999999999999999 No 38 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=2e-52 Score=304.03 Aligned_cols=334 Identities=13% Similarity=0.093 Sum_probs=210.7 Q ss_pred CCccH---HHHHHHHHHHHHHHHHHHh----c-cCHHHHHHHHHHHHHHHHHHHH---HH---HHHHHHHHH-Hhccccc Q lcl|Aclame:pro 1 MAINL---KELPKYREAVAELSAKISA----G-ATPEEQEKLFEAAFTTMGDEIL---AK---NEEEMERMF-DLRDKNR 65 (377) Q Consensus 1 m~~~~---~~l~~~~~~~~~~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~-~~~~~~~ 65 (377) |+... +++.+..++++++.+.+.. . ...++. +..+...+.+.+++. .+ .+.+.+... ....... T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~-~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 94 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDI-KQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 94 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 65311 2223333333333333221 1 111211 111222222211111 00 000000000 0000000 Q ss_pred cccHHHH---------HH----------------HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|Aclame:pro 66 ELTAEEI---------KF----------------FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) Q Consensus 66 ~lt~~e~---------~~----------------~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~ 120 (377) ....+++ +. 
.......+++++||++||++++++|++.++++++||++|+++++++ T Consensus 95 ~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~ 174 (402) T protein:vir:93 95 SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 174 (402) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC Confidence 0111111 00 0112335667889999999999999999999999999999998865 Q ss_pred ceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc-ce Q lcl|Aclame:pro 121 RLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL-AI 198 (377) Q Consensus 121 ~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~-a~ 198 (377) ..+|+.. +.+++.|++|+++. ++++++|+++++.+++++++++||+|||+||.+|+++||.++|+++++++++. +| T Consensus 175 -~~~p~~~~~~~~a~~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~ 252 (402) T protein:vir:93 175 -LEIPRVSYTLDDDDFITDVETA-KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDAL 252 (402) T ss_pred -ceeeeeeccCCccccccccccc-cccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHh Confidence 5677654 55778999876655 56789999999999999999999999999999999999999999999999765 67 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .+|+|+++|.|+++..+...+... .. 
++.+..++..+ ...+..+..|+| T Consensus 253 ~~g~g~g~p~g~~~~~~~~~~~~~-----~~-------------------~d~l~~~~~~l-------~~~y~~na~~im 301 (402) T protein:vir:93 253 AVSPKSGLEHMSFYNGSVKEVEGA-----DM-------------------YDAIINALADL-------HEDYRDNATIYM 301 (402) T ss_pred hcCCCccccceeeecccccccccc-----ch-------------------HHHHHHHHhcc-------ChhhhcCCEEEE Confidence 889999999999865433221110 00 11111111111 112345678999 Q ss_pred ccchhhhhcccccccCCCC-----ccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFG-----EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLY 353 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G-----~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~ 353 (377) |+.+++.++.... +.+| .+.++ +|+||++++.++ +++||||++|++.. .++.++.+.+. ..|+++| T Consensus 302 n~~t~~~~~~~~~--d~~~~~~~~~~~~l--lG~PV~~t~~~~--~i~~GDf~~~~~~~-~~~~~~~~~~~--~~~~~~~ 372 (402) T protein:vir:93 302 RYADYVKIISVLS--NGTTNFFDTPAEKV--FGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDV--KKGEYLF 372 (402) T ss_pred echHHHHHHHHHh--cCCCcccccCCccc--cccceEEecCCC--ceeeechhhhhhhh-hhhhhhhhhcc--cCCceEE Confidence 9999877655432 2333 23344 466788887765 68999999865543 45666666664 3699999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++.+|+||++++|+||++|++++- T Consensus 373 ~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 373 VLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred EEEEEeCcEEechhheEEEEeecC Confidence 999999999999999999999877 No 39 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=7.7e-52 Score=300.79 Aligned_cols=344 Identities=13% Similarity=0.112 Sum_probs=231.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHH-HhccCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhc----------------- Q lcl|Aclame:pro 1 
MAINLKELPKYREAVAELSAKI-SAGATPEEQEKLFEAAFTTMGDEILAKNEEE-MERMFDLR----------------- 61 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~----------------- 61 (377) |.+++++|.+..+++.+..+.+ .+.....++.+...+..+.+.+++......+ .++..... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNG 80 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHH Confidence 9988888877665555544444 3222222222333333334433332211110 11000000 Q ss_pred ------cccccccHHHHHHH------HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEEE Q lcl|Aclame:pro 62 ------DKNRELTAEEIKFF------NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS---LRLKALT 126 (377) Q Consensus 62 ------~~~~~lt~~e~~~~------~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~---~~~~~p~ 126 (377) .....+...+++.+ ......+++++||++||+++.++|++.+++.++|++++++.|++ +++.+|+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~ 160 (404) T protein:vir:10 81 ALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEK 160 (404) T ss_pred HHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEE Confidence 00000111111111 11223456688999999999999999999999999999999875 4678999 Q ss_pred EcCCcceeeeccccccccc-ccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC Q lcl|Aclame:pro 127 AETSGTAVWGDIFGEIKGQ-LKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL 205 (377) Q Consensus 127 ~~~~~~a~w~~e~~~~~~~-~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~ 205 (377) ..+.+.+.|+.|+++.+.. 
.+++|+++++.+++++++++||+|||+|+.+++++||.++|++++++++|.+|++|+|++ T Consensus 161 ~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~ 240 (404) T protein:vir:10 161 RSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGD 240 (404) T ss_pred ecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 8888999999887776543 468999999999999999999999999999999999999999999999999999999987 Q ss_pred c-ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhh Q lcl|Aclame:pro 206 Q-PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRW 284 (377) Q Consensus 206 ~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 284 (377) + |.||++..+..+....... +++ .+..++.. ........+.+|+|||.++. T Consensus 241 ~~~~gi~~~~~~~~~~~~~~~---~~~-------------------~~~~~~~~------~l~~~~~~~~~~v~n~~~~~ 292 (404) T protein:vir:10 241 EHATGIMTANKFKKITLPKSP---ALK-------------------DFKKCKNV------ELLNVFKATSSWIVNQDGFN 292 (404) T ss_pred Ccccceeeccccceeeccccc---cHH-------------------HHHHHHHh------hhhccccCCCEEEEcHHHHH Confidence 5 6788765444332222111 111 11111110 01122345678999999876 Q ss_pred hhcccccccCCCCccc-----------cccCCCceEEe-cCCCCc-----ceEEEEeccc-EEEEecceeeEEeechh-- Q lcl|Aclame:pro 285 TLEAKFTSRNQFGEYV-----------TVLPHGITILE-SLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQT-- 344 (377) Q Consensus 285 ~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~-s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~~~~~~-- 344 (377) .+.. .++.+|.|+ +++ |+||+. ++.++. ..++||||++ |.+++|++++|.++++. 
T Consensus 293 ~L~~---lkd~~G~~l~~~~~~~~~~~~l~--G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~ 367 (404) T protein:vir:10 293 YLDS---LEDKTGRPYLQPDPKDPTQYRFL--GLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAG 367 (404) T ss_pred HHHH---hhccCCceeeccCcCCCCCcccc--ceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccc Confidence 5542 234444443 344 555553 333432 3489999997 77899999999998764 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|.+|++.||+.+|+|+++.+++||+++++++. T Consensus 368 ~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 368 AFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred hhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 489999999999999999999999999999999 No 40 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=4.1e-51 Score=296.80 Aligned_cols=350 Identities=17% Similarity=0.117 Sum_probs=217.5 Q ss_pred CCcc--HHHHHHHH-HHHHHHHHHHHhcc---CHHHHH---------HHHHHHHH-----------HHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MAIN--LKELPKYR-EAVAELSAKISAGA---TPEEQE---------KLFEAAFT-----------TMGDEILAKNEEEM 54 (377) Q Consensus 1 m~~~--~~~l~~~~-~~~~~~~~~~~~~~---~~~~~~---------~~~~~~~~-----------~~~~~~~~~~~~~~ 54 (377) |... .++++... ++..+....+.+.. .++.+. +....... ...+.+.. ...+. 
T Consensus 24 ~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~~~a~~~e~~~~~~~~-~~~~~ 102 (458) T protein:vir:10 24 LTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNELFAQTVEKQQETIVG-LQDEI 102 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 1111 11111110 00000000000000 000000 00000000 00000000 00000 Q ss_pred HHHHH--------hcccccccc--------HHHHHHH-------------------HHHHhccCCCCCceeccHHHHHHH Q lcl|Aclame:pro 55 ERMFD--------LRDKNRELT--------AEEIKFF-------------------NDIDKNVGGKDKFKLLPEETMVQV 99 (377) Q Consensus 55 ~~~~~--------~~~~~~~lt--------~~e~~~~-------------------~~~~~~~~~s~gg~lvP~~~~~~I 99 (377) ..... .....+.+. ..+++.+ ......+++++||++||+++++.| T Consensus 103 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~i 182 (458) T protein:vir:10 103 KSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRI 182 (458) T ss_pred HHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHH Confidence 00000 000000000 0011111 111123455679999999999999 Q ss_pred HHHHHhhhhhhhhceeEecCCc-eEEEEEcCCcceeeeccccccccc-----ccccceeEeecceeEEEeehhhHHHHhc Q lcl|Aclame:pro 100 FDDLVAEHPLLKVINFKNTSLR-LKALTAETSGTAVWGDIFGEIKGQ-----LKQAFKEQDFSQFKLTAFVVIPKDALKF 173 (377) Q Consensus 100 i~~~~~~s~l~~~~~v~~~~~~-~~~p~~~~~~~a~w~~e~~~~~~~-----~~~~f~~i~l~~~k~~~~~~iS~ell~d 173 (377) ++.+++.++|+++|+++|++++ ..+|+.++.+.+.|++|.+..++. 
++++|+++++.+++++++++||++||+| T Consensus 183 i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~d 262 (458) T protein:vir:10 183 IRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEED 262 (458) T ss_pred HHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhc Confidence 9999999999999999999765 678999989999999877665432 4578999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHH Q lcl|Aclame:pro 174 GPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLV 253 (377) Q Consensus 174 s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 253 (377) |.++|++||.++|+++|++++|.+||+|+|+++|.||++.................. +......+++.+. T Consensus 263 s~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~----------~~~~~~~i~~~~~ 332 (458) T protein:vir:10 263 AIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGS----------VLVTAKTISKLRR 332 (458) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccccc----------ccccHHHHHHHHH Confidence 999999999999999999999999999999999999998766544332221111100 0011122222222 Q ss_pred HHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-------------ccCCCceEEecCCCCcc-- Q lcl|Aclame:pro 254 PVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-------------VLPHGITILESLAVETG-- 318 (377) Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-------------~l~~~~~v~~s~~~~~~-- 318 (377) .+ .....++..|+|||.++..+.. .++.+|.|+. 
...+|+||+++++||++ T Consensus 333 ~l-----------~~~~~~~~~~v~~~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~ 398 (458) T protein:vir:10 333 KL-----------GRHGLKLSKLVLIVSMDAYYDL---LEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKAN 398 (458) T ss_pred hh-----------hhhhcCCCEEEEcHHHHHHHHh---hcccCCceeeccccccccccCcCceecceeeEEccccccccC Confidence 11 1122356789999998765532 2344444321 01257889999999864 Q ss_pred --eEEEEeccc-EEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 319 --KAIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 319 --~ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +++||||+. |.++++.++++.++ .++.++++.||+..|+|+.+.+|+|||+.+++|- T Consensus 399 ~~~~~~~~f~~~~~~~~~~~~~v~~d--~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 399 SAEFAVIVYKDNFVMPRQRAVTVERE--RQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CcceEEEEecccEEEEEeeceEEEee--cccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 589999975 88999999998764 4578999999999999999999999999999999 No 41 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=5.7e-52 Score=301.53 Aligned_cols=343 Identities=16% Similarity=0.132 Sum_probs=223.4 Q ss_pred CCc--cH----HHHHHHHHHHHHHHHHHHhccCH--HH--------HHHHHHH----------HHHHHHHHHHHH----- Q lcl|Aclame:pro 1 MAI--NL----KELPKYREAVAELSAKISAGATP--EE--------QEKLFEA----------AFTTMGDEILAK----- 49 (377) Q Consensus 1 m~~--~~----~~l~~~~~~~~~~~~~~~~~~~~--~~--------~~~~~~~----------~~~~~~~~~~~~----- 49 (377) ..+ ++ +.++..+ ++..+++...... +. +...... ....+...+... 
T Consensus 245 ~~~~~~ai~~g~sld~~r---a~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~ 321 (632) T protein:vir:96 245 RSLAQEAIQKGHTVDQFR---ALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDW 321 (632) T ss_pred hhhHHHHHhccccHHHHH---HHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccch Confidence 111 00 1111111 1111111110000 00 0000000 000000000000 Q ss_pred ----HHHHHHHHHHhcccc--cc--ccHHHHHHHHHHHhccCCCCCceeccHHH-HHHHHHHHHhhhhhhhh-ceeEec- Q lcl|Aclame:pro 50 ----NEEEMERMFDLRDKN--RE--LTAEEIKFFNDIDKNVGGKDKFKLLPEET-MVQVFDDLVAEHPLLKV-INFKNT- 118 (377) Q Consensus 50 ----~~~~~~~~~~~~~~~--~~--lt~~e~~~~~~~~~~~~~s~gg~lvP~~~-~~~Ii~~~~~~s~l~~~-~~v~~~- 118 (377) ...+.........+. +. +..+ .........+++++||++||+++ .+.||+.++..++++++ ++++|. T Consensus 322 ~~a~~~~e~a~~~a~~~G~~arg~~~~~~--~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~ 399 (632) T protein:vir:96 322 SKAGFEREVSLAIADASGKEARGFYMPHE--VLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL 399 (632) T ss_pred hhhhhhhHHHHHHHHhhhhhhhhhhhhHH--HHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecC Confidence 000000000000000 00 0000 01122334566778999999886 67999999999999998 777775 Q ss_pred CCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcce Q lcl|Aclame:pro 119 SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAI 198 (377) Q Consensus 119 ~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~ 198 (377) +|+++||+.++++.+.|++|+++.+ +++++|++++|.+++++++++||+|||+||.++++++|+++|+++++.++|.+| T Consensus 400 ~g~~~ip~~~~~~~a~wv~E~~~~~-~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~ 478 (632) T protein:vir:96 400 VGDVDIPKKTSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAM 478 (632) T ss_pred CcceEEEEEeCCceeEeecCCcccc-ccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHh Confidence 5789999999999999998877765 678999999999999999999999999999999999999999999999999999 Q ss_pred 
eeccCC-CcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEE Q lcl|Aclame:pro 199 VKGNGL-LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLL 277 (377) Q Consensus 199 l~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (377) |+|+|+ ++|.||++.....+..... ...++ ....+....+.. .....++..|+ T Consensus 479 l~G~G~~~~p~Gi~~~~~~~~~~~~~--~~~~~---------------~~i~~~~~~i~~---------~~~~~~~~~~~ 532 (632) T protein:vir:96 479 LTGTGLANDPVGLLNMTGVPALTYPA--GGVDW---------------ASVVDMETKIST---------FNADAGRLAYL 532 (632) T ss_pred hcccCCCCccceeeecccccceeccc--ccCCH---------------HHHHHHHHHHhh---------cccccCccEEE Confidence 999996 6899999865543322111 11111 011111111110 01113467899 Q ss_pred eccchhhhhcccccccCCCCcccc--ccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEE Q lcl|Aclame:pro 278 LNPEDRWTLEAKFTSRNQFGEYVT--VLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLT 355 (377) Q Consensus 278 ~n~~~~~~~~~~~~~~~~~G~~~~--~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~ 355 (377) |||.+...+... ...+.+|.|+. 
....|+||+.++++|+++++||||++|+++++++++|.++++.+|.+|++.||+ T Consensus 533 ~~~~~~~~l~~~-~l~d~~G~~i~~~~~l~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~ 611 (632) T protein:vir:96 533 TSVTQRGAAKKA-QVFDNTGERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRV 611 (632) T ss_pred EchhHHHHHHHH-hccCCCCceeecCCeecccceEeccccccCcEEEeecceEEEEEecceEEEEccccccccCceEEEE Confidence 999876444321 13456677653 233688899999999999999999999999999999999999999999999999 Q ss_pred EEEEcCEEecccceEEEEeec Q lcl|Aclame:pro 356 KNYFYGKAKDNHTAALLTLAG 376 (377) Q Consensus 356 ~~r~dg~~~~~~af~~l~~~a 376 (377) ++|+|+++++++||++++.+| T Consensus 612 ~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 612 FQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EeecCceeechhhhhheeecC Confidence 999999999999999999999 No 42 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1.8e-51 Score=298.73 Aligned_cols=331 Identities=15% Similarity=0.085 Sum_probs=224.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKN-----------------------EEEMERM 57 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 57 (377) |.++|+++.+..+++.+....+.+..+.++. +...+..+.++.++.... ..++++. 
T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~-~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEA-EQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 9988887766655554444443322222211 112222222222221111 1112222 Q ss_pred HHhccccccccHHHHHHHHH-----HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEEcC Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFND-----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTAET 129 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~-----~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~~~~ 129 (377) +.....+..++.+++.+... ....+++++||++||+++.+.|++.+++.++|+++|+++++++ +..+|+..+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 22222233344444433321 1233456789999999999999999999999999999999863 456787788 Q ss_pred CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCccee Q lcl|Aclame:pro 130 SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVG 209 (377) Q Consensus 130 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~G 209 (377) .+.+.|++|.++.++...++|+++++.+++++++++||+|||+||.+++++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88999998887776555689999999999999999999999999999999999999999999999999999999887655 Q ss_pred 
eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) ..+ +.. +...+... .......+..|+|||+++..+.. T Consensus 240 ~~~-----------------~d~---------------i~~~~~~~----------l~~~~~~~a~~vm~~~~~~~L~~- 276 (392) T protein:vir:10 240 IKS-----------------LDD---------------IKDVLNVK----------LDPAISPNAILLTNQDGFNYLDK- 276 (392) T ss_pred ccC-----------------HHH---------------HHHHHHHh----------hhhhhccCCEEEEcHHHHHHHHH- Confidence 421 010 01111000 01123356789999999776642 Q ss_pred ccccCCCCccc-----------cccCCCceEEecCCCC--------cceEEEEeccc-EEEEecceeeEEeech--hhhh Q lcl|Aclame:pro 290 FTSRNQFGEYV-----------TVLPHGITILESLAVE--------TGKAIAFVANR-YDAFMATASTIEEYDQ--TFAM 347 (377) Q Consensus 290 ~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~--------~~~ii~gd~s~-y~~~~~~~~~i~~~~~--~~f~ 347 (377) .++.+|.|+ +++|+|+-++.++..+ +..++||||++ |.+++|.+++++++++ ..|. T Consensus 277 --lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 277 --LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred --hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 234445443 4444332222222211 22379999998 6789999999999875 4699 Q ss_pred cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 348 EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 348 ~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|++.||+.+|+||++++++||++|++++. 
T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccc Confidence 999999999999999999999999999877 No 43 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1.8e-51 Score=298.73 Aligned_cols=331 Identities=15% Similarity=0.085 Sum_probs=224.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKN-----------------------EEEMERM 57 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 57 (377) |.++|+++.+..+++.+....+.+..+.++. +...+..+.++.++.... ..++++. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~-~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEA-EQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 9988887766655554444443322222211 112222222222221111 1112222 Q ss_pred HHhccccccccHHHHHHHHH-----HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEEcC Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFND-----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTAET 129 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~-----~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~~~~ 129 (377) +.....+..++.+++.+... 
....+++++||++||+++.+.|++.+++.++|+++|+++++++ +..+|+..+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 22222233344444433321 1233456789999999999999999999999999999999863 456787788 Q ss_pred CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCccee Q lcl|Aclame:pro 130 SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVG 209 (377) Q Consensus 130 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~G 209 (377) .+.+.|++|.++.++...++|+++++.+++++++++||+|||+||.+++++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88999998887776555689999999999999999999999999999999999999999999999999999999887655 Q ss_pred eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) ..+ +.. +...+... .......+..|+|||+++..+.. 
T Consensus 240 ~~~-----------------~d~---------------i~~~~~~~----------l~~~~~~~a~~vm~~~~~~~L~~- 276 (392) T protein:vir:10 240 IKS-----------------LDD---------------IKDVLNVK----------LDPAISPNAILLTNQDGFNYLDK- 276 (392) T ss_pred ccC-----------------HHH---------------HHHHHHHh----------hhhhhccCCEEEEcHHHHHHHHH- Confidence 421 010 01111000 01123356789999999776642 Q ss_pred ccccCCCCccc-----------cccCCCceEEecCCCC--------cceEEEEeccc-EEEEecceeeEEeech--hhhh Q lcl|Aclame:pro 290 FTSRNQFGEYV-----------TVLPHGITILESLAVE--------TGKAIAFVANR-YDAFMATASTIEEYDQ--TFAM 347 (377) Q Consensus 290 ~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~--------~~~ii~gd~s~-y~~~~~~~~~i~~~~~--~~f~ 347 (377) .++.+|.|+ +++|+|+-++.++..+ +..++||||++ |.+++|.+++++++++ ..|. T Consensus 277 --lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 277 --LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred --hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 234445443 4444332222222211 22379999998 6789999999999875 4699 Q ss_pred cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 348 EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 348 ~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|++.||+.+|+||++++++||++|++++. 
T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccc Confidence 999999999999999999999999999877 No 44 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1.8e-51 Score=298.73 Aligned_cols=331 Identities=15% Similarity=0.085 Sum_probs=224.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKN-----------------------EEEMERM 57 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 57 (377) |.++|+++.+..+++.+....+.+..+.++. +...+..+.++.++.... ..++++. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~-~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEA-EQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 9988887766655554444443322222211 112222222222221111 1112222 Q ss_pred HHhccccccccHHHHHHHHH-----HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEEcC Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFND-----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTAET 129 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~-----~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~~~~ 129 (377) +.....+..++.+++.+... 
....+++++||++||+++.+.|++.+++.++|+++|+++++++ +..+|+..+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 22222233344444433321 1233456789999999999999999999999999999999863 456787788 Q ss_pred CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCccee Q lcl|Aclame:pro 130 SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVG 209 (377) Q Consensus 130 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~G 209 (377) .+.+.|++|.++.++...++|+++++.+++++++++||+|||+||.+++++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88999998887776555689999999999999999999999999999999999999999999999999999999887655 Q ss_pred eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) ..+ +.. +...+... .......+..|+|||+++..+.. 
T Consensus 240 ~~~-----------------~d~---------------i~~~~~~~----------l~~~~~~~a~~vm~~~~~~~L~~- 276 (392) T protein:vir:10 240 IKS-----------------LDD---------------IKDVLNVK----------LDPAISPNAILLTNQDGFNYLDK- 276 (392) T ss_pred ccC-----------------HHH---------------HHHHHHHh----------hhhhhccCCEEEEcHHHHHHHHH- Confidence 421 010 01111000 01123356789999999776642 Q ss_pred ccccCCCCccc-----------cccCCCceEEecCCCC--------cceEEEEeccc-EEEEecceeeEEeech--hhhh Q lcl|Aclame:pro 290 FTSRNQFGEYV-----------TVLPHGITILESLAVE--------TGKAIAFVANR-YDAFMATASTIEEYDQ--TFAM 347 (377) Q Consensus 290 ~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~--------~~~ii~gd~s~-y~~~~~~~~~i~~~~~--~~f~ 347 (377) .++.+|.|+ +++|+|+-++.++..+ +..++||||++ |.+++|.+++++++++ ..|. T Consensus 277 --lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 277 --LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred --hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 234445443 4444332222222211 22379999998 6789999999999875 4699 Q ss_pred cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 348 EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 348 ~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|++.||+.+|+||++++++||++|++++. 
T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccc Confidence 999999999999999999999999999877 No 45 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1.8e-51 Score=298.73 Aligned_cols=331 Identities=15% Similarity=0.085 Sum_probs=224.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKN-----------------------EEEMERM 57 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 57 (377) |.++|+++.+..+++.+....+.+..+.++. +...+..+.++.++.... ..++++. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~-~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEA-EQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 9988887766655554444443322222211 112222222222221111 1112222 Q ss_pred HHhccccccccHHHHHHHHH-----HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEEcC Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFND-----IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALTAET 129 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~-----~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~~~~ 129 (377) +.....+..++.+++.+... 
....+++++||++||+++.+.|++.+++.++|+++|+++++++ +..+|+..+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSD 159 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 22222233344444433321 1233456789999999999999999999999999999999863 456787788 Q ss_pred CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCccee Q lcl|Aclame:pro 130 SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVG 209 (377) Q Consensus 130 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~G 209 (377) .+.+.|++|.++.++...++|+++++.+++++++++||+|||+||.+++++||.+.|+++++++++.+|++|+|+++|.| T Consensus 160 ~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~ 239 (392) T protein:vir:10 160 MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA 239 (392) T ss_pred CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 88999998887776555689999999999999999999999999999999999999999999999999999999887655 Q ss_pred eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) ..+ +.. +...+... .......+..|+|||+++..+.. 
T Consensus 240 ~~~-----------------~d~---------------i~~~~~~~----------l~~~~~~~a~~vm~~~~~~~L~~- 276 (392) T protein:vir:10 240 IKS-----------------LDD---------------IKDVLNVK----------LDPAISPNAILLTNQDGFNYLDK- 276 (392) T ss_pred ccC-----------------HHH---------------HHHHHHHh----------hhhhhccCCEEEEcHHHHHHHHH- Confidence 421 010 01111000 01123356789999999776642 Q ss_pred ccccCCCCccc-----------cccCCCceEEecCCCC--------cceEEEEeccc-EEEEecceeeEEeech--hhhh Q lcl|Aclame:pro 290 FTSRNQFGEYV-----------TVLPHGITILESLAVE--------TGKAIAFVANR-YDAFMATASTIEEYDQ--TFAM 347 (377) Q Consensus 290 ~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~--------~~~ii~gd~s~-y~~~~~~~~~i~~~~~--~~f~ 347 (377) .++.+|.|+ +++|+|+-++.++..+ +..++||||++ |.+++|.+++++++++ ..|. T Consensus 277 --lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 277 --LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred --hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 234445443 4444332222222211 22379999998 6789999999999875 4699 Q ss_pred cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 348 EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 348 ~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|++.||+.+|+||++++++||++|++++. 
T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccc Confidence 999999999999999999999999999877 No 46 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=3.1e-51 Score=297.46 Aligned_cols=332 Identities=10% Similarity=0.065 Sum_probs=214.4 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhc---------cCHHHHHHHHHHHHHHHHHHHHH---HH-HHHHHHHHHhcc-cccc Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAG---------ATPEEQEKLFEAAFTTMGDEILA---KN-EEEMERMFDLRD-KNRE 66 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~~~~~~~~~-~~~~ 66 (377) |.|++++|.+...++.+..+.+.++ .+.++..+ .....+.+.++... +. ..+.+....... .... T Consensus 3 ~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (408) T protein:vir:10 3 VKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGP 81 (408) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 4445666655444333333332221 11111111 11111111111110 00 000000000000 0001 Q ss_pred -------ccHHHHHHHHH---------------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---c Q lcl|Aclame:pro 67 -------LTAEEIKFFND---------------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---R 121 (377) Q Consensus 67 -------lt~~e~~~~~~---------------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~ 121 (377) ......+.|.. ....+++++||++||++++++|++.+++.++|+++|+++|+++ . 
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (408) T protein:vir:10 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (408) T ss_pred cccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcce Confidence 11122222221 1234567789999999999999999999999999999999853 3 Q ss_pred eEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee Q lcl|Aclame:pro 122 LKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK 200 (377) Q Consensus 122 ~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~ 200 (377) +.+|... ..+.+.|++|+++.++...++|++|++.+++++++++||+|||+||.+++++||.++|+++++++++.+|++ T Consensus 162 ~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~ 241 (408) T protein:vir:10 162 RVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE 241 (408) T ss_pred EEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 4555443 346789998877776556699999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEecc Q lcl|Aclame:pro 201 GNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNP 280 (377) Q Consensus 201 G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 280 (377) |+|+++|.+-.. ++.. ..+.+.. . 
....+..+.+|+||| T Consensus 242 g~g~~~~~~~~~----------------~~~~---------------l~~~~~~---~-------~~~~~~~~a~~v~n~ 280 (408) T protein:vir:10 242 VMKAAPKKPTIA----------------KFDD---------------VITMINT---A-------VDPAIIATSSLLTNQ 280 (408) T ss_pred cccccccccccc----------------cHHH---------------HHHHHHH---h-------hhhhhccCCEEEEcH Confidence 999887643110 0110 1111100 0 011233567899999 Q ss_pred chhhhhcccccccCCCCccc-----------cccCCCceEEecCCCCcc-----eEEEEeccc-EEEEecceeeEEeech Q lcl|Aclame:pro 281 EDRWTLEAKFTSRNQFGEYV-----------TVLPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIEEYDQ 343 (377) Q Consensus 281 ~~~~~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~~~-----~ii~gd~s~-y~~~~~~~~~i~~~~~ 343 (377) .++..+.. .++.+|.|+ +++|+|+.+..+..+|+. .++||||++ |.+++|+++++..+++ T Consensus 281 ~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~ 357 (408) T protein:vir:10 281 SGLNKLAL---VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNI 357 (408) T ss_pred HHHHHHHH---hhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEccc Confidence 98765532 234444443 455555444434455542 289999998 6799999999999987 Q ss_pred hh--hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 344 TF--AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 344 ~~--f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+ |.+|++.||+.+|+||++++|+||++|++++. 
T Consensus 358 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 358 GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred ccchhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 54 89999999999999999999999999999996 No 47 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=1.5e-50 Score=293.66 Aligned_cols=342 Identities=13% Similarity=0.112 Sum_probs=225.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHH-------HHHHH--HHHHHHHHHHhcccc--c---- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGD-------EILAK--NEEEMERMFDLRDKN--R---- 65 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~--~~~~~~~~~~~~~~~--~---- 65 (377) |.. +++|++..+++.+..+.+.+....+ .+......+.+.+ +.... ...+.+......... . T Consensus 1 M~~-l~el~~~~~~~~~e~~~l~~~~~~e--~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSE-LALIQKAIEESQQKMTQLFDAQKAE--IESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 775 5555554444433333332221111 1111111111111 11110 000111100000000 0 Q ss_pred --cccHHHHHHHHHH-----------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcC-C Q lcl|Aclame:pro 66 --ELTAEEIKFFNDI-----------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET-S 130 (377) Q Consensus 66 --~lt~~e~~~~~~~-----------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~-~ 130 (377) ....+.++.+... .-..++..+|.+||++++..|++.+++.++|+++|+++|+++ .+++|+.++ . 
T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 157 (385) T protein:vir:18 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFT 157 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCC Confidence 0111111211110 012334456778899999999999999999999999999875 589999865 5 Q ss_pred cceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCc-cee Q lcl|Aclame:pro 131 GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ-PVG 209 (377) Q Consensus 131 ~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~-P~G 209 (377) +.+.|++|.++. ++++++|+++++.+++++++++||+|||+|+ .++++||.++|+++++.++|.+|++|+|+++ |.| T Consensus 158 ~~a~~v~E~~~~-~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~G 235 (385) T protein:vir:18 158 NNADVVAEKALK-PESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEG 235 (385) T ss_pred cceeeeccCccc-cccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Confidence 688999876554 5788999999999999999999999999987 5799999999999999999999999999986 579 Q ss_pred eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) |++............. .. .++.+..++..+ ......+..|+|||.++..+.. 
T Consensus 236 i~~~~~~~~~~~~~~~-~~-------------------~~d~i~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~- 287 (385) T protein:vir:18 236 LNKVATAYDTSLNATG-DT-------------------RADIIAHAIYQV-------TESEFSASGIVLNPRDWHNIAL- 287 (385) T ss_pred cccccccccccccccc-cc-------------------hHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH- Confidence 9876543332211111 01 111121111111 1112344589999999776542 Q ss_pred ccccCCCCcccc--------ccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeechhh--hhcCcEEEEEEEE Q lcl|Aclame:pro 290 FTSRNQFGEYVT--------VLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQTF--AMEDLQLYLTKNY 358 (377) Q Consensus 290 ~~~~~~~G~~~~--------~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~~~--f~~~~~~~~~~~r 358 (377) .++.+|.|+. ...+|+||+.++++|+++++||||++ |.+.++++++|+.+++.. |.+|++.||+.+| T Consensus 288 --lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r 365 (385) T protein:vir:18 288 --LKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEER 365 (385) T ss_pred --hhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEe Confidence 2445555541 12268899999999999999999997 889999999999887654 9999999999999 Q ss_pred EcCEEecccceEEEEeecC Q lcl|Aclame:pro 359 FYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 359 ~dg~~~~~~af~~l~~~a~ 377 (377) +||++.+|+||++|+++++ T Consensus 366 ~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 366 LALAHYRPTAIIKGTFSSG 384 (385) T ss_pred eccEEecccceEEEEeccC Confidence 9999999999999999999 No 48 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=1.5e-50 Score=293.66 Aligned_cols=342 Identities=13% Similarity=0.112 Sum_probs=225.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHH-------HHHHH--HHHHHHHHHHhcccc--c---- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGD-------EILAK--NEEEMERMFDLRDKN--R---- 65 (377) Q Consensus 1 
m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~--~~~~~~~~~~~~~~~--~---- 65 (377) |.. +++|++..+++.+..+.+.+....+ .+......+.+.+ +.... ...+.+......... . T Consensus 1 M~~-l~el~~~~~~~~~e~~~l~~~~~~e--~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSE-LALIQKAIEESQQKMTQLFDAQKAE--IESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 775 5555554444433333332221111 1111111111111 11110 000111100000000 0 Q ss_pred --cccHHHHHHHHHH-----------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcC-C Q lcl|Aclame:pro 66 --ELTAEEIKFFNDI-----------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET-S 130 (377) Q Consensus 66 --~lt~~e~~~~~~~-----------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~-~ 130 (377) ....+.++.+... .-..++..+|.+||++++..|++.+++.++|+++|+++|+++ .+++|+.++ . T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 157 (385) T protein:vir:19 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFT 157 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCC Confidence 0111111211110 012334456778899999999999999999999999999875 589999865 5 Q ss_pred cceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCc-cee Q lcl|Aclame:pro 131 GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ-PVG 209 (377) Q Consensus 131 ~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~-P~G 209 (377) +.+.|++|.++. 
++++++|+++++.+++++++++||+|||+|+ .++++||.++|+++++.++|.+|++|+|+++ |.| T Consensus 158 ~~a~~v~E~~~~-~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~G 235 (385) T protein:vir:19 158 NNADVVAEKALK-PESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEG 235 (385) T ss_pred cceeeeccCccc-cccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Confidence 688999876554 5788999999999999999999999999987 5799999999999999999999999999986 579 Q ss_pred eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) |++............. .. .++.+..++..+ ......+..|+|||.++..+.. T Consensus 236 i~~~~~~~~~~~~~~~-~~-------------------~~d~i~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~- 287 (385) T protein:vir:19 236 LNKVATAYDTSLNATG-DT-------------------RADIIAHAIYQV-------TESEFSASGIVLNPRDWHNIAL- 287 (385) T ss_pred cccccccccccccccc-cc-------------------hHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH- Confidence 9876543332211111 01 111121111111 1112344589999999776542 Q ss_pred ccccCCCCcccc--------ccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeechhh--hhcCcEEEEEEEE Q lcl|Aclame:pro 290 FTSRNQFGEYVT--------VLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQTF--AMEDLQLYLTKNY 358 (377) Q Consensus 290 ~~~~~~~G~~~~--------~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~~~--f~~~~~~~~~~~r 358 (377) .++.+|.|+. ...+|+||+.++++|+++++||||++ |.+.++++++|+.+++.. 
|.+|++.||+.+| T Consensus 288 --lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r 365 (385) T protein:vir:19 288 --LKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEER 365 (385) T ss_pred --hhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEe Confidence 2445555541 12268899999999999999999997 889999999999887654 9999999999999 Q ss_pred EcCEEecccceEEEEeecC Q lcl|Aclame:pro 359 FYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 359 ~dg~~~~~~af~~l~~~a~ 377 (377) +||++.+|+||++|+++++ T Consensus 366 ~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 366 LALAHYRPTAIIKGTFSSG 384 (385) T ss_pred eccEEecccceEEEEeccC Confidence 9999999999999999999 No 49 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.5e-51 Score=299.18 Aligned_cols=282 Identities=13% Similarity=0.030 Sum_probs=220.8 Q ss_pred ccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccc Q lcl|Aclame:pro 61 RDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIF 139 (377) Q Consensus 61 ~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~ 139 (377) ...+..+.++++.. ...+++++|.+||+++.++|++.+++.++|+++|+++|+++ ..++|+.++.+.+.|++|. 
T Consensus 1 ~~~~~~~~~e~~~~-----~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (318) T protein:vir:24 1 MAAGTAFAVDHAQI-----AQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEG 75 (318) T ss_pred CCCCCCCCHHHHHh-----hcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 33445666777654 34566678889999999999999999999999999999875 5899999999999999876 Q ss_pred ccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccc Q lcl|Aclame:pro 140 GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTV 219 (377) Q Consensus 140 ~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~ 219 (377) ++. ++++++|+++++.++|+++++++|+|+|+||.++++++|.++|++++++++|.+|++|+|+++|.|+++....... T Consensus 76 ~~~-~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~ 154 (318) T protein:vir:24 76 DMK-PITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISI 154 (318) T ss_pred ccc-cccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccc Confidence 665 5678999999999999999999999999999999999999999999999999999999999999999876443322 Q ss_pred cccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcc Q lcl|Aclame:pro 220 DQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEY 299 (377) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~ 299 (377) ....+.... .. ......+.. . ......+..|+|||+++..+.. 
.++.+|.| T Consensus 155 ~~~~~~~~~--~~-------------~~~~~~~~~----~-------~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~ 205 (318) T protein:vir:24 155 ADTTGATTV--YD-------------QVAVNGLSL----L-------VNDGKKWTHTLLDDITEPILNG---AKDQNGRP 205 (318) T ss_pred cccccccch--HH-------------HHHHHHHHh----h-------ccccCCCCEEEEcHHHHHHHHH---hhccCCce Confidence 222211110 00 000111110 0 1122345689999999766542 34555554 Q ss_pred cc--------------ccCCCceEEecCCCCcce--EEEEecccEEEEecceeeEEeechhh--------------hhcC Q lcl|Aclame:pro 300 VT--------------VLPHGITILESLAVETGK--AIAFVANRYDAFMATASTIEEYDQTF--------------AMED 349 (377) Q Consensus 300 ~~--------------~l~~~~~v~~s~~~~~~~--ii~gd~s~y~~~~~~~~~i~~~~~~~--------------f~~~ 349 (377) +. .-.+|+|++.++++++++ ++||||++|+++++++++|+.+++.. |.+| T Consensus 206 l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~ 285 (318) T protein:vir:24 206 LFIESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHN 285 (318) T ss_pred eecCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcC Confidence 31 123577899999999876 58999999999999999999998865 8899 Q ss_pred cEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 350 LQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 350 ~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++.||+.+|+|+++.+|+||++|+.++- T Consensus 286 ~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 286 LVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred cEEEEEEEEEccEEecccceEEEEeecc Confidence 9999999999999999999999998443 No 50 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=7.6e-51 Score=295.36 Aligned_cols=331 Identities=13% Similarity=0.089 Sum_probs=218.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHH----Hhc----cCHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH--h-c------ Q lcl|Aclame:pro 1 
MAINLKELPKYREAVAELSAKI----SAG----ATPEEQEKLFEAAFTTMGDEILAKNE--EEMERMFD--L-R------ 61 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~----~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~--~-~------ 61 (377) |+. +++|.+..+++.+..+.+ ... ....++.+......+.+.+++..... .+.+.... . . T Consensus 1 Mk~-~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:49 1 MKT-SNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPL 79 (397) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 665 455444333333333322 211 11111222222222222222111110 01111000 0 0 Q ss_pred -cccccccHHHHHHHHHH-----------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEE Q lcl|Aclame:pro 62 -DKNRELTAEEIKFFNDI-----------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALT 126 (377) Q Consensus 62 -~~~~~lt~~e~~~~~~~-----------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~ 126 (377) ........++++.|... ...+++++||++||+++++.|++.+++.++|+++|+++|+++ .+.+|+ T Consensus 80 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 159 (397) T protein:vir:49 80 TKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEK 159 (397) T ss_pred cchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEe Confidence 00111233444444332 224457789999999999999999999999999999998863 456666 Q ss_pred EcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC Q lcl|Aclame:pro 127 AET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL 205 (377) Q Consensus 127 ~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~ 205 (377) ... 
.+.+.|++|.+..++...++|++|++.+++++++++||+|||+|+.+++++||.++|++++++++|.+|++|+|++ T Consensus 160 ~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~ 239 (397) T protein:vir:49 160 WADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTL 239 (397) T ss_pred eccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 544 4678999887777655568999999999999999999999999999999999999999999999999999999998 Q ss_pred cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhh Q lcl|Aclame:pro 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT 285 (377) Q Consensus 206 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 285 (377) +|.+... +++ .+.+.+..+ ......+..|+|||+++.. T Consensus 240 ~~~~~~~----------------~~d---------------~i~~~~~~l-----------~~~~~~~a~~v~n~~~~~~ 277 (397) T protein:vir:49 240 PNKPTLA----------------KWD---------------DIIDLQAKV-----------DPAIKQTSLFLTNTSGFTA 277 (397) T ss_pred ccccccc----------------CHH---------------HHHHHHHhh-----------hhhhcCCCEEEEcHHHHHH Confidence 7753221 111 111111111 1112345689999998765 Q ss_pred hcccccccCCCCccc-----------cccCCCceEEecCCCCc-----ceEEEEeccc-EEEEecceeeEEeechh--hh Q lcl|Aclame:pro 286 LEAKFTSRNQFGEYV-----------TVLPHGITILESLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQT--FA 346 (377) Q Consensus 286 ~~~~~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~~~~~~--~f 346 (377) +.. .++.+|.|+ +++|+|+.+..+..+|. ..++||||++ |.++++++++++++++. 
.| T Consensus 278 l~~---lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 354 (397) T protein:vir:49 278 LKK---VKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAF 354 (397) T ss_pred HHH---hhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchh Confidence 542 234444443 45544444344445553 3589999997 77899999999998865 59 Q ss_pred hcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 347 MEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 347 ~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+|++.||+..|+||++++++||+++++++. T Consensus 355 ~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 355 ETDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred hcCeeeEEEEEeeccEEecccceEEEEeccc Confidence 9999999999999999999999999999999 No 51 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=4.1e-50 Score=291.37 Aligned_cols=331 Identities=14% Similarity=0.108 Sum_probs=216.4 Q ss_pred CCccHHHHH----HHHHHHHHHHHHHHhcc----CHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH----H-h----- Q lcl|Aclame:pro 1 MAINLKELP----KYREAVAELSAKISAGA----TPEEQEKLFEAAFTTMGDEILAKNE--EEMERMF----D-L----- 60 (377) Q Consensus 1 m~~~~~~l~----~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~----~-~----- 60 (377) |+. +++|+ +..++++++.+++.... ...++.+.+....+.+.+++..... .+.+... . . 
T Consensus 1 Mk~-~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:48 1 MKT-SNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPL 79 (397) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccccc Confidence 776 34443 33344444444433211 1111112222222222222111110 0000000 0 0 Q ss_pred ccccccccHHHHHHHHH-----------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceE--EEE Q lcl|Aclame:pro 61 RDKNRELTAEEIKFFND-----------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLK--ALT 126 (377) Q Consensus 61 ~~~~~~lt~~e~~~~~~-----------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~--~p~ 126 (377) .........++++.+.. .+..+++++||++||++++++|++.+++.++|+++|+++|+++ ..+ ++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 159 (397) T protein:vir:48 80 TKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEK 159 (397) T ss_pred cchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEe Confidence 00111122334444332 2334566789999999999999999999999999999999864 233 333 Q ss_pred E-cCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC Q lcl|Aclame:pro 127 A-ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL 205 (377) Q Consensus 127 ~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~ 205 (377) . 
+..+.+.|++|++..++..+++|++|++.+++++++++||+|||+||.+++++||.++|++++++++|.+|++|+|++ T Consensus 160 ~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~ 239 (397) T protein:vir:48 160 WADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATL 239 (397) T ss_pred ecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 3 445678999887777666679999999999999999999999999999999999999999999999999999999998 Q ss_pred cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhh Q lcl|Aclame:pro 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT 285 (377) Q Consensus 206 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 285 (377) +|.+.+. +++ .+.+.+..+ ......+..|+|||.++.. T Consensus 240 ~~~~~~~----------------~~d---------------~i~~~~~~l-----------~~~~~~~a~~v~n~~~~~~ 277 (397) T protein:vir:48 240 PTKPTLT----------------KWD---------------DIIDLQAKV-----------DPAIKQTSFFLTNTSGFTA 277 (397) T ss_pred ccccccc----------------cHH---------------HHHHHHHHh-----------hhhhcCCCEEEECHHHHHH Confidence 7654321 011 111111111 1112345789999998765 Q ss_pred hcccccccCCCCccc-----------cccCCCceEEecCCCC-----cceEEEEeccc-EEEEecceeeEEeechh--hh Q lcl|Aclame:pro 286 LEAKFTSRNQFGEYV-----------TVLPHGITILESLAVE-----TGKAIAFVANR-YDAFMATASTIEEYDQT--FA 346 (377) Q Consensus 286 ~~~~~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~-----~~~ii~gd~s~-y~~~~~~~~~i~~~~~~--~f 346 (377) +.. .++.+|.|+ +++|+|+.++.+..++ +..++||||++ |.++++++++++.+++. 
+| T Consensus 278 L~~---lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 354 (397) T protein:vir:48 278 LKK---VKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAF 354 (397) T ss_pred HHH---hhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhh Confidence 532 234444443 4455544444333343 44589999997 56899999999998865 69 Q ss_pred hcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 347 MEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 347 ~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+|++.||+.+|+|+++++|+||+.+++++. T Consensus 355 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 355 ETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred hcCceeEEEEeeeccEEecccceEEEEeccc Confidence 9999999999999999999999999999998 No 52 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=3.5e-50 Score=291.70 Aligned_cols=343 Identities=13% Similarity=0.083 Sum_probs=219.0 Q ss_pred CCccHHHHHHHHHH----HHHHHHHHHhccCHHH--HHHHHHHHHHHHHHHHHHHHH--HHHH----------------- Q lcl|Aclame:pro 1 MAINLKELPKYREA----VAELSAKISAGATPEE--QEKLFEAAFTTMGDEILAKNE--EEME----------------- 55 (377) Q Consensus 1 m~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~----------------- 55 (377) |+- ++++++..++ +.+..+......++++ +.+........+.+++..... 
.+.+ T Consensus 1 mk~-~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:81 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccch Confidence 544 3333332222 2111221111111111 111222222222222211100 0000 Q ss_pred ----------HHHHhccccccccHHHHHHHHHHHh--------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Q lcl|Aclame:pro 56 ----------RMFDLRDKNRELTAEEIKFFNDIDK--------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN 117 (377) Q Consensus 56 ----------~~~~~~~~~~~lt~~e~~~~~~~~~--------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~ 117 (377) ...........+..++++.|..... ..++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:81 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee Confidence 0000111112234445555433321 2345678999999999999999999999999999999 Q ss_pred cCC---ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 TSL---RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 118 ~~~---~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~ 194 (377) +++ ++.+|..++.+.+.|++|.++.++.+.++|+++++.+++++++++||+|||+||.+++++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:81 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 853 34566667778899998888877666789999999999999999999999999999999999999999999999 Q ss_pred 
hcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCce Q lcl|Aclame:pro 195 ELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) Q Consensus 195 ~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (377) +.+|++|+|+++|.+............. .....++ ..+.+.+..+ ......+. T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~---------------~~i~~~~~~~-----------~~~~~~~~ 292 (415) T protein:vir:81 240 NKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSL---------------DDIKDAINLN-----------VKPNYEHN 292 (415) T ss_pred HHHHhhccccCccccccccccccccccc-cccccch---------------hHHHHHHHhh-----------hhhccCCC Confidence 9999999999988766543222111111 1111111 1111111111 01122356 Q ss_pred EEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCcce-----EEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVETGK-----AIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~~-----ii~gd~s~-y~~~~~~~~~i~ 339 (377) .|+|||+++..+.. .++.+|.|+. ...+|+||+.++++|.+. ++||||++ |.+.+++++++. 
T Consensus 293 ~~v~n~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~ 369 (415) T protein:vir:81 293 VAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS 369 (415) T ss_pred EEEEcHHHHHHHHH---hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEE Confidence 79999998766542 2455565541 122567788887777432 89999998 678999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+++ ..+++.||+.+|+||++.+|+||+++++++- T Consensus 370 ~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 370 WTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred Eecc---ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 8875 4566789999999999999999999999887 No 53 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=3.5e-50 Score=291.70 Aligned_cols=343 Identities=13% Similarity=0.083 Sum_probs=219.0 Q ss_pred CCccHHHHHHHHHH----HHHHHHHHHhccCHHH--HHHHHHHHHHHHHHHHHHHHH--HHHH----------------- Q lcl|Aclame:pro 1 MAINLKELPKYREA----VAELSAKISAGATPEE--QEKLFEAAFTTMGDEILAKNE--EEME----------------- 55 (377) Q Consensus 1 m~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~----------------- 55 (377) |+- ++++++..++ +.+..+......++++ +.+........+.+++..... 
.+.+ T Consensus 1 mk~-~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:79 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccch Confidence 544 3333332222 2111221111111111 111222222222222211100 0000 Q ss_pred ----------HHHHhccccccccHHHHHHHHHHHh--------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Q lcl|Aclame:pro 56 ----------RMFDLRDKNRELTAEEIKFFNDIDK--------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN 117 (377) Q Consensus 56 ----------~~~~~~~~~~~lt~~e~~~~~~~~~--------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~ 117 (377) ...........+..++++.|..... ..++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:79 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee Confidence 0000111112234445555433321 2345678999999999999999999999999999999 Q ss_pred cCC---ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 TSL---RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 118 ~~~---~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~ 194 (377) +++ ++.+|..++.+.+.|++|.++.++.+.++|+++++.+++++++++||+|||+||.+++++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:79 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 853 34566667778899998888877666789999999999999999999999999999999999999999999999 Q ss_pred 
hcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCce Q lcl|Aclame:pro 195 ELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) Q Consensus 195 ~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (377) +.+|++|+|+++|.+............. .....++ ..+.+.+..+ ......+. T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~---------------~~i~~~~~~~-----------~~~~~~~~ 292 (415) T protein:vir:79 240 NKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSL---------------DDIKDAINLN-----------VKPNYEHN 292 (415) T ss_pred HHHHhhccccCccccccccccccccccc-cccccch---------------hHHHHHHHhh-----------hhhccCCC Confidence 9999999999988766543222111111 1111111 1111111111 01122356 Q ss_pred EEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCcce-----EEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVETGK-----AIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~~-----ii~gd~s~-y~~~~~~~~~i~ 339 (377) .|+|||+++..+.. .++.+|.|+. ...+|+||+.++++|.+. ++||||++ |.+.+++++++. 
T Consensus 293 ~~v~n~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~ 369 (415) T protein:vir:79 293 VAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS 369 (415) T ss_pred EEEEcHHHHHHHHH---hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEE Confidence 79999998766542 2455565541 122567788887777432 89999998 678999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+++ ..+++.||+.+|+||++.+|+||+++++++- T Consensus 370 ~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 370 WTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred Eecc---ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 8875 4566789999999999999999999999887 No 54 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=3.5e-50 Score=291.70 Aligned_cols=343 Identities=13% Similarity=0.083 Sum_probs=219.0 Q ss_pred CCccHHHHHHHHHH----HHHHHHHHHhccCHHH--HHHHHHHHHHHHHHHHHHHHH--HHHH----------------- Q lcl|Aclame:pro 1 MAINLKELPKYREA----VAELSAKISAGATPEE--QEKLFEAAFTTMGDEILAKNE--EEME----------------- 55 (377) Q Consensus 1 m~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~----------------- 55 (377) |+- ++++++..++ +.+..+......++++ +.+........+.+++..... 
.+.+ T Consensus 1 mk~-~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:98 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccch Confidence 544 3333332222 2111221111111111 111222222222222211100 0000 Q ss_pred ----------HHHHhccccccccHHHHHHHHHHHh--------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Q lcl|Aclame:pro 56 ----------RMFDLRDKNRELTAEEIKFFNDIDK--------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN 117 (377) Q Consensus 56 ----------~~~~~~~~~~~lt~~e~~~~~~~~~--------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~ 117 (377) ...........+..++++.|..... ..++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:98 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee Confidence 0000111112234445555433321 2345678999999999999999999999999999999 Q ss_pred cCC---ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 TSL---RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 118 ~~~---~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~ 194 (377) +++ ++.+|..++.+.+.|++|.++.++.+.++|+++++.+++++++++||+|||+||.+++++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:98 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 853 34566667778899998888877666789999999999999999999999999999999999999999999999 Q ss_pred 
hcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCce Q lcl|Aclame:pro 195 ELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) Q Consensus 195 ~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (377) +.+|++|+|+++|.+............. .....++ ..+.+.+..+ ......+. T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~---------------~~i~~~~~~~-----------~~~~~~~~ 292 (415) T protein:vir:98 240 NKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSL---------------DDIKDAINLN-----------VKPNYEHN 292 (415) T ss_pred HHHHhhccccCccccccccccccccccc-cccccch---------------hHHHHHHHhh-----------hhhccCCC Confidence 9999999999988766543222111111 1111111 1111111111 01122356 Q ss_pred EEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCcce-----EEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVETGK-----AIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~~-----ii~gd~s~-y~~~~~~~~~i~ 339 (377) .|+|||+++..+.. .++.+|.|+. ...+|+||+.++++|.+. ++||||++ |.+.+++++++. 
T Consensus 293 ~~v~n~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~ 369 (415) T protein:vir:98 293 VAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS 369 (415) T ss_pred EEEEcHHHHHHHHH---hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEE Confidence 79999998766542 2455565541 122567788887777432 89999998 678999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+++ ..+++.||+.+|+||++.+|+||+++++++- T Consensus 370 ~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 370 WTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred Eecc---ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 8875 4566789999999999999999999999887 No 55 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=8.7e-50 Score=289.54 Aligned_cols=343 Identities=13% Similarity=0.082 Sum_probs=219.1 Q ss_pred CCccHHHHHH----HHHHHHHHHHHHHhccCHH--HHHHHHHHHHHHHHHHHHHHHH--HHHH----H------------ Q lcl|Aclame:pro 1 MAINLKELPK----YREAVAELSAKISAGATPE--EQEKLFEAAFTTMGDEILAKNE--EEME----R------------ 56 (377) Q Consensus 1 m~~~~~~l~~----~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~--~~~~----~------------ 56 (377) |+.+ +++.+ +.+++.+..+.+.+..+++ ++.+..++....+..++..... .+.+ . 
T Consensus 1 mk~~-~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:46 1 MKTK-EELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNE 79 (415) T ss_pred CchH-HHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccch Confidence 5542 22222 1222222222222211111 1112222222222222211100 0000 0 Q ss_pred -----------HHHhccccccccHHHHHHHHHHH--------hccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Q lcl|Aclame:pro 57 -----------MFDLRDKNRELTAEEIKFFNDID--------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN 117 (377) Q Consensus 57 -----------~~~~~~~~~~lt~~e~~~~~~~~--------~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~ 117 (377) .............++++.|.... ...++++||++||+++.+.|++.+++.++|+++|+++| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:46 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceee Confidence 00001111223344555554332 12345678999999999999999999999999999999 Q ss_pred cCC-ceEEE--EEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 TSL-RLKAL--TAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 118 ~~~-~~~~p--~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~ 194 (377) +++ ..++| ..++.+.+.|++|.++.++.+.++|++|++.+++++++++||+|||+||.+++++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~ 239 (415) T protein:vir:46 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 863 34555 456677889998877777666789999999999999999999999999999999999999999999999 Q ss_pred 
hcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCce Q lcl|Aclame:pro 195 ELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) Q Consensus 195 ~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (377) |.+|++|+|+++|.++............ .....++ ..+.+.+..+. .....+. T Consensus 240 d~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~---------------~~i~~~~~~~~-----------~~~~~~~ 292 (415) T protein:vir:46 240 NKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSL---------------DDIKDAINLNV-----------KPNYEHN 292 (415) T ss_pred HHHHhhccccCCccccccccccccceec-cccccch---------------HHHHHHHHhhh-----------hhccCCC Confidence 9999999999988776543222111111 1111111 11112221111 1122356 Q ss_pred EEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCcc-----eEEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~-----~ii~gd~s~-y~~~~~~~~~i~ 339 (377) .|+|||+++..+.. .++.+|.|+. 
...+|+||+.++++|.+ .++||||++ |.+++|++++++ T Consensus 293 ~~v~n~~~~~~L~~---lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~ 369 (415) T protein:vir:46 293 VAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS 369 (415) T ss_pred EEEEcHHHHHHHHH---hhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEE Confidence 79999998765532 3456666552 12357788888877743 389999998 678999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .++ |..+++.||+++|+||++++|+||+++++++- T Consensus 370 ~~~---~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 370 WTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred eec---cccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 887 45677899999999999999999999998876 No 56 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=8.7e-50 Score=289.54 Aligned_cols=343 Identities=13% Similarity=0.082 Sum_probs=219.1 Q ss_pred CCccHHHHHH----HHHHHHHHHHHHHhccCHH--HHHHHHHHHHHHHHHHHHHHHH--HHHH----H------------ Q lcl|Aclame:pro 1 MAINLKELPK----YREAVAELSAKISAGATPE--EQEKLFEAAFTTMGDEILAKNE--EEME----R------------ 56 (377) Q Consensus 1 m~~~~~~l~~----~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~--~~~~----~------------ 56 (377) |+.+ +++.+ +.+++.+..+.+.+..+++ ++.+..++....+..++..... .+.+ . 
T Consensus 1 mk~~-~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:47 1 MKTK-EELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNE 79 (415) T ss_pred CchH-HHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccch Confidence 5542 22222 1222222222222211111 1112222222222222211100 0000 0 Q ss_pred -----------HHHhccccccccHHHHHHHHHHH--------hccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Q lcl|Aclame:pro 57 -----------MFDLRDKNRELTAEEIKFFNDID--------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN 117 (377) Q Consensus 57 -----------~~~~~~~~~~lt~~e~~~~~~~~--------~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~ 117 (377) .............++++.|.... ...++++||++||+++.+.|++.+++.++|+++|+++| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:47 80 ARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceee Confidence 00001111223344555554332 12345678999999999999999999999999999999 Q ss_pred cCC-ceEEE--EEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 TSL-RLKAL--TAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 118 ~~~-~~~~p--~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~ 194 (377) +++ ..++| ..++.+.+.|++|.++.++.+.++|++|++.+++++++++||+|||+||.+++++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~ 239 (415) T protein:vir:47 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 863 34555 456677889998877777666789999999999999999999999999999999999999999999999 Q ss_pred 
hcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCce Q lcl|Aclame:pro 195 ELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) Q Consensus 195 ~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (377) |.+|++|+|+++|.++............ .....++ ..+.+.+..+. .....+. T Consensus 240 d~~il~g~g~g~~~~~~~~~~~~~~~~~-~~~~~~~---------------~~i~~~~~~~~-----------~~~~~~~ 292 (415) T protein:vir:47 240 NKAIIDVITKGSTGSTSSGFEKEGKKLE-VKKAKSL---------------DDIKDAINLNV-----------KPNYEHN 292 (415) T ss_pred HHHHhhccccCCccccccccccccceec-cccccch---------------HHHHHHHHhhh-----------hhccCCC Confidence 9999999999988776543222111111 1111111 11112221111 1122356 Q ss_pred EEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCcc-----eEEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~-----~ii~gd~s~-y~~~~~~~~~i~ 339 (377) .|+|||+++..+.. .++.+|.|+. 
...+|+||+.++++|.+ .++||||++ |.+++|++++++ T Consensus 293 ~~v~n~~~~~~L~~---lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~ 369 (415) T protein:vir:47 293 VAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS 369 (415) T ss_pred EEEEcHHHHHHHHH---hhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEE Confidence 79999998765532 3456666552 12357788888877743 389999998 678999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .++ |..+++.||+++|+||++++|+||+++++++- T Consensus 370 ~~~---~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 370 WTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred eec---cccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 887 45677899999999999999999999998876 No 57 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.3e-51 Score=299.64 Aligned_cols=271 Identities=12% Similarity=0.049 Sum_probs=212.8 Q ss_pred cccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEEcCCcceeeecccc Q lcl|Aclame:pro 62 DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAETSGTAVWGDIFG 140 (377) Q Consensus 62 ~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~-~~~p~~~~~~~a~w~~e~~ 140 (377) .+ .......+++++|.+||++++++|++.+++.++|+++|+++|++++ .++|+.+ .+.+.|++|.+ T Consensus 1 ~g------------~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~ 67 (299) T protein:vir:41 1 MG------------FNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAE 67 (299) T ss_pred CC------------cCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCc Confidence 00 0111234556788999999999999999999999999999998764 6788765 47799998766 
Q ss_pred cccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccc Q lcl|Aclame:pro 141 EIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVD 220 (377) Q Consensus 141 ~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~ 220 (377) +. ++++++|+++++.+++++++++||+|+++||.+++++||.++|++++++++|++|++|+|+++|.||++........ T Consensus 68 ~~-~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~ 146 (299) T protein:vir:41 68 RI-QTSKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNL 146 (299) T ss_pred cc-cccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCccccccccccccccee Confidence 55 56789999999999999999999999999999999999999999999999999999999999999999765443322 Q ss_pred ccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccc Q lcl|Aclame:pro 221 QSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV 300 (377) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~ 300 (377) ...+. .++ ..+.+.+..+ . .....+..|+|||.++..+.. .++.+|.|+ T Consensus 147 ~~~~~--~~~---------------~~l~~~~~~l----~-------~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l 195 (299) T protein:vir:41 147 VEETA--NKY---------------DDLNEAIGLI----E-------AEDLEPNGIATIRKQRVKYRS---TKDGNGMPI 195 (299) T ss_pred ecccc--ccH---------------HHHHHHHHhh----h-------cccCCcCEEEEcHHHHHHHHH---hhccCCcee Confidence 21111 111 1111111111 0 111234579999999876653 245556543 Q ss_pred c--------ccCCCceEEecCCCCcce----EEEEecccEEEEecceeeEEeechhh--------------hhcCcEEEE Q lcl|Aclame:pro 301 T--------VLPHGITILESLAVETGK----AIAFVANRYDAFMATASTIEEYDQTF--------------AMEDLQLYL 354 (377) Q Consensus 301 ~--------~l~~~~~v~~s~~~~~~~----ii~gd~s~y~~~~~~~~~i~~~~~~~--------------f~~~~~~~~ 354 (377) . 
...+|+||+.++++|.++ ++||||++|+++++++++++++++.+ |.+|++.|| T Consensus 196 ~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 275 (299) T protein:vir:41 196 FNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIK 275 (299) T ss_pred ecCCcCCCCceecceeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 2 123688999999999876 89999999999999999999998865 789999999 Q ss_pred EEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 355 TKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 355 ~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.+|+|+++.+|+||++|+.+|+ T Consensus 276 ~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 276 ATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred EEEEeccEEecccceEEEEeccC Confidence 99999999999999999999999 No 58 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.5e-51 Score=299.27 Aligned_cols=292 Identities=13% Similarity=0.014 Sum_probs=214.0 Q ss_pred HHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCccee Q lcl|Aclame:pro 56 RMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAV 134 (377) Q Consensus 56 ~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~ 134 (377) =.+...+....+..+|++++ ..+ ++.+|.+||++++++|++.+++.++|+++++++|+++ ..++|+.++.+.+. 
T Consensus 1 ~~~~~~r~~~~~~~~e~~a~----~~~-~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~ 75 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVA----QTG-DSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSAS 75 (326) T ss_pred CCCCccchhhhcCcchhhhe----ecc-ccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceE Confidence 00000011112333344332 222 3334557999999999999999999999999999875 58999999999999 Q ss_pred eecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecc Q lcl|Aclame:pro 135 WGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDL 214 (377) Q Consensus 135 w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~ 214 (377) |++|.++. ++++++|+++++.+++++++++||+|||+||.+++++||.++|++++++++|+++++|+|+++|.||++.. T Consensus 76 ~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~ 154 (326) T protein:vir:42 76 WIGEGDMK-PITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTT 154 (326) T ss_pred EecCCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc Confidence 99876555 56789999999999999999999999999999999999999999999999999999999999999998765 Q ss_pred ccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccC Q lcl|Aclame:pro 215 SQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN 294 (377) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~ 294 (377) .................. ..+. .+...... .......+..|+|||.++..+.. 
.++ T Consensus 155 ~~~~~~~~~~~~~~~~~~---------~~~~-----~~~~~~~~-------~~~~~~~~a~~v~n~~~~~~L~~---lkd 210 (326) T protein:vir:42 155 KEVSLVDPDGTGSNADLT---------VYDA-----VAVNALSL-------LVNAGKKWTHTLLDDITEPILNG---AKD 210 (326) T ss_pred cccceeecccccccccch---------hHHH-----HHHHHHhh-------hhhhccCccEEEEeHHHHHHHHH---hhc Confidence 443332222211110000 0000 00000000 01112235679999998766542 234 Q ss_pred CCCcccc--------------ccCCCceEEecCCCCcce--EEEEecccEEEEecceeeEEeechhh------------- Q lcl|Aclame:pro 295 QFGEYVT--------------VLPHGITILESLAVETGK--AIAFVANRYDAFMATASTIEEYDQTF------------- 345 (377) Q Consensus 295 ~~G~~~~--------------~l~~~~~v~~s~~~~~~~--ii~gd~s~y~~~~~~~~~i~~~~~~~------------- 345 (377) .+|.|+. ...+|+||+.++++|+++ ++||||++|+++++++++++++++.. T Consensus 211 ~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~ 290 (326) T protein:vir:42 211 KSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVS 290 (326) T ss_pred cCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchh Confidence 4454431 123688999999999987 46899999999999999999998865 Q ss_pred -hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 -AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 -f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++++++||++|+.++- T Consensus 291 ~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 291 LWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 88899999999999999999999999988776 No 59 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=3.9e-51 Score=296.95 Aligned_cols=283 Identities=13% Similarity=0.037 Sum_probs=214.6 Q ss_pred ccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccccccccc Q lcl|Aclame:pro 67 
LTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQ 145 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~ 145 (377) |..++.+.. ...++.++|.+||++++++|++.+++.++|+++++++++++ ..++|+.++.+.+.|+.|.++. ++ T Consensus 1 m~~~~~~a~----~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-~~ 75 (330) T protein:vir:77 1 MAGSTVPST----QVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERK-PI 75 (330) T ss_pred Ccccccchh----hccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCcc-cc Confidence 555555432 23444556678888899999999999999999999999865 5899999999999999876655 57 Q ss_pred ccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCc-ceeeeecccccccccccc Q lcl|Aclame:pro 146 LKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ-PVGLLKDLSQPTVDQSTG 224 (377) Q Consensus 146 ~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~-P~Gil~~~~~~~~~~~~~ 224 (377) ++++|+++++.++|++++++||+|||+|+.+++++||.++|++++++++|++||+|+|+++ |.||++............ T Consensus 76 ~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~ 155 (330) T protein:vir:77 76 TKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTN 155 (330) T ss_pred ccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeeccc Confidence 7899999999999999999999999999999999999999999999999999999999875 579988765433322221 Q ss_pred ccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc--- Q lcl|Aclame:pro 225 RDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT--- 301 (377) Q Consensus 225 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~--- 301 (377) ....... ....+..+..++..+. .....+..|+|||+++..+.. .++.+|.|+. 
T Consensus 156 ~~~~~~~-------------~~~~~~~l~~~~~~~~-------~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~ 212 (330) T protein:vir:77 156 LTTASGP-------------QGNAYLAVNNALSLLV-------NSGKKWTGTLLDNVTEPILNT---AVDGNGRPLFVES 212 (330) T ss_pred ccccccc-------------cchhHHHHHHHHHhhh-------hcCCCccEEEEcHHHHHHHHH---HhccCCceeecCc Confidence 1111000 0001111211111110 111234479999999766542 2344554431 Q ss_pred -----------ccCCCceEEecCCCCcce------EEEEecccEEEEecceeeEEeechhh------------------h Q lcl|Aclame:pro 302 -----------VLPHGITILESLAVETGK------AIAFVANRYDAFMATASTIEEYDQTF------------------A 346 (377) Q Consensus 302 -----------~l~~~~~v~~s~~~~~~~------ii~gd~s~y~~~~~~~~~i~~~~~~~------------------f 346 (377) ...+|+||+.+++||++. ++||||++|+++++++++|++++|.+ | T Consensus 213 ~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f 292 (330) T protein:vir:77 213 TYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLW 292 (330) T ss_pred cccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchh Confidence 112578899999998754 79999999999999999999998865 7 Q ss_pred hcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 347 MEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 347 ~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+|++.||+.+|+|+++++|+||++|+.++. 
T Consensus 293 ~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~ 323 (330) T protein:vir:77 293 QHNMVAVRCEAEFAFMVNDKDAFVKLTDQVA 323 (330) T ss_pred hcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 8999999999999999999999999988777 No 60 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=9.3e-50 Score=289.40 Aligned_cols=343 Identities=13% Similarity=0.096 Sum_probs=221.7 Q ss_pred CCccHHHHHHHH-HHHHHHHHH---HHhccCHHH--HHHHHHHHHHHHHHHHHHHHH--HHHH----------------- Q lcl|Aclame:pro 1 MAINLKELPKYR-EAVAELSAK---ISAGATPEE--QEKLFEAAFTTMGDEILAKNE--EEME----------------- 55 (377) Q Consensus 1 m~~~~~~l~~~~-~~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~----------------- 55 (377) |+- ++++++.- +.++++.++ .....++++ +.+........+..++..... .+.+ T Consensus 1 mk~-~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (415) T protein:vir:94 1 MKT-KEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNE 79 (415) T ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 433 33333221 212222221 111111111 111122122222222111000 0000 Q ss_pred ----------HHHHhccccccccHHHHHHHHHHH--------hccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Q lcl|Aclame:pro 56 ----------RMFDLRDKNRELTAEEIKFFNDID--------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN 117 (377) Q Consensus 56 ----------~~~~~~~~~~~lt~~e~~~~~~~~--------~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~ 117 (377) .........+.+..+|++.|.... 
...++++||++||+++.+.|++.+++.++|+++|++++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~ 159 (415) T protein:vir:94 80 ASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR 159 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceee Confidence 001111122234455666554322 12345678999999999999999999999999999999 Q ss_pred cCC---ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 118 TSL---RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 118 ~~~---~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~ 194 (377) +++ ++.+|..++.+.+.|++|.++.++.+.++|+++++.+++++++++||+|||+||.+++++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~ 239 (415) T protein:vir:94 160 VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATR 239 (415) T ss_pred ccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 853 34566667778899998887777666789999999999999999999999999999999999999999999999 Q ss_pred hcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCce Q lcl|Aclame:pro 195 ELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) Q Consensus 195 ~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (377) +.+|++|+|+++|.++.............. ...++ ..+.+.+..+. .....+. 
T Consensus 240 ~~~il~g~g~g~~~~~~~~~~~~~~~~~~~-~~~~~---------------~~i~~~~~~~~-----------~~~~~~~ 292 (415) T protein:vir:94 240 NKAIIDVITKGSTGSTSSGFEKEGKKLEVK-KAKSL---------------DDIKDAINLNV-----------KPNYEHN 292 (415) T ss_pred HHHHhhccccCccccccccccccccccccc-cccch---------------HHHHHHHHhhh-----------hhccCCC Confidence 999999999998877654432222111111 11111 11112111110 0112356 Q ss_pred EEEeccchhhhhcccccccCCCCcccc---------ccCCCceEEecCCCCcce-----EEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAVETGK-----AIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~~~~-----ii~gd~s~-y~~~~~~~~~i~ 339 (377) .|+|||+++..+.. .++.+|.|+. ...+|+||+.++++|.+. ++||||++ |.+++|++++++ T Consensus 293 ~~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~ 369 (415) T protein:vir:94 293 VAIVSQTMFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS 369 (415) T ss_pred EEEEcHHHHHHHHH---hhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEE Confidence 79999998766542 3556666542 122577888888887543 89999998 678999999999 Q ss_pred eechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+++ ..+++.||+.+|+||++.+|+||+++++++- T Consensus 370 ~~~~---~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 370 WTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred Eecc---ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 8874 5677899999999999999999999999887 No 61 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=9.4e-52 Score=300.33 Aligned_cols=338 Identities=12% Similarity=-0.015 Sum_probs=216.0 Q ss_pred 
CCccHHHHHHHHHHHHHHHHHH-HhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKI-SAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDID 79 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~ 79 (377) |+-. .....++.. -.... .....+..+...+......+........+ .... ....+..+.. .... T Consensus 1 ~a~~--~a~~~~~~~--~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~----a~~~---a~~~~~~~~~---~~a~ 66 (366) T protein:vir:57 1 MAAA--VAVPVKAHS--VAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLAD----AAKF---AATELGDTGL---SMAI 66 (366) T ss_pred Cccc--ccccccccc--cccccccccccccccchhHHHHHHHHHhcccchhH----HHHH---HHHhhcchhh---hhhc Confidence 1110 000000000 00000 00000000000000000000000000000 0000 0000000111 0111 Q ss_pred hccCCCCCceeccHHHHHHHHHHHHhhhhhhhh-ceeEec-CCceEEEEEcCCcceeeecccccccccccccceeEeecc Q lcl|Aclame:pro 80 KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKV-INFKNT-SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) Q Consensus 80 ~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~-~~v~~~-~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 157 (377) ..++++||++||+++.++|++.+++.++++++ ++++|+ ++++++|+.++.+.+.|++|+++. ++++++|++|++.+ T Consensus 67 -~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~-~~s~~~f~~i~~~~ 144 (366) T protein:vir:57 67 -STAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDV-VATGATFDDVKLSA 144 (366) T ss_pred -cccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCccc-cccccceeEEEEee Confidence 23445799999999999999999999999998 888886 468999999999999999876665 46789999999999 Q ss_pred eeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC-cceeeeeccccccccccccccccccchhhhh Q lcl|Aclame:pro 158 FKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEA 236 (377) Q Consensus 158 ~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (377) +|++++++||+|||+||.+++++||+++|++++++++|++|++|+|++ +|.||++..+.............+.. 
T Consensus 145 ~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~----- 219 (366) T protein:vir:57 145 KTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT----- 219 (366) T ss_pred EEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh----- Confidence 999999999999999999999999999999999999999999999985 89999987654332222111111100 Q ss_pred hhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-----ccCCCceEEe Q lcl|Aclame:pro 237 IADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-----VLPHGITILE 311 (377) Q Consensus 237 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-----~l~~~~~v~~ 311 (377) ....+.+.+... . ........+..|+|||.++..+.. .++.+|.|+. ...+|+||+. T Consensus 220 -------~~~~~~~~~~~~---~-----~~~~~~~~~a~~vmn~~~~~~L~~---lkd~~G~~l~~~~~~g~l~G~Pvv~ 281 (366) T protein:vir:57 220 -------TIDEYLDSLILK---H-----MDSNSNMIRCGWGLSNRTYMTLFG---LRDGNGNKVYPEMSQGILKGYPIQR 281 (366) T ss_pred -------hHHHHHHHHHHh---h-----hccccccccCEEEecHHHHHHHHh---hhccCCceeccCCCCCeecceeeEE Confidence 011111111100 0 011122356789999999876543 2456666542 1225788999 Q ss_pred cCCCCcc--------eEEEEecccEEEEecceeeEEeechh-----------hhhcCcEEEEEEEEEcCEEecccceEEE Q lcl|Aclame:pro 312 SLAVETG--------KAIAFVANRYDAFMATASTIEEYDQT-----------FAMEDLQLYLTKNYFYGKAKDNHTAALL 372 (377) Q Consensus 312 s~~~~~~--------~ii~gd~s~y~~~~~~~~~i~~~~~~-----------~f~~~~~~~~~~~r~dg~~~~~~af~~l 372 (377) +++||++ .++||||++|+++++++++|+++++. 
.|.+|++.||+.+|+|+++.+|+||++| T Consensus 282 s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~l 361 (366) T protein:vir:57 282 TSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLG 361 (366) T ss_pred ccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEE Confidence 9999852 48999999999999999999998874 3778999999999999999999999999 Q ss_pred EeecC Q lcl|Aclame:pro 373 TLAGG 377 (377) Q Consensus 373 ~~~a~ 377 (377) +=..= T Consensus 362 t~~~~ 366 (366) T protein:vir:57 362 TGVIW 366 (366) T ss_pred ecccC Confidence 85444 No 62 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=2.8e-50 Score=292.26 Aligned_cols=330 Identities=12% Similarity=0.097 Sum_probs=212.6 Q ss_pred CCc--cHHHHHHHHHHH----HHHHHHHHhcc-----CHHHHHHHHHHHHHHHHHHHHH---HHHH-HHHHHHHhc-ccc Q lcl|Aclame:pro 1 MAI--NLKELPKYREAV----AELSAKISAGA-----TPEEQEKLFEAAFTTMGDEILA---KNEE-EMERMFDLR-DKN 64 (377) Q Consensus 1 m~~--~~~~l~~~~~~~----~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~~~~~-~~~ 64 (377) |++ .+++|.+...++ +++.+++.... ..++..+ .....+.+.++... +... +.+...... ... 
T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 555 344544333322 22222221111 1111111 11111111111111 0000 011110000 000 Q ss_pred c-------cccHHHHHHHHHH---------------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-- Q lcl|Aclame:pro 65 R-------ELTAEEIKFFNDI---------------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-- 120 (377) Q Consensus 65 ~-------~lt~~e~~~~~~~---------------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-- 120 (377) . .....+++.|... ...+++++||++||+++++.|++.+++.++|+++|+++|+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) T protein:vir:74 80 GPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSS 159 (408) T ss_pred ccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCc Confidence 1 1111222222211 123566789999999999999999999999999999999863 Q ss_pred -ceEEEEEcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcce Q lcl|Aclame:pro 121 -RLKALTAET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAI 198 (377) Q Consensus 121 -~~~~p~~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~ 198 (377) .+.+|...+ .+.+.|++|+++.++.++++|+++++.+++++++++||+|||+||.++|++||.++|++++++++|.+| T Consensus 160 ~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~i 239 (408) T protein:vir:74 160 GSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) T ss_pred ceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHH Confidence 355665554 456678887777776677999999999999999999999999999999999999999999999999999 Q ss_pred 
eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) ++|+|+++|.|.... +. .+...+... .......+.+|+| T Consensus 240 l~G~G~~~~~~~~~~----------------~~---------------~i~~~~~~~----------l~~~~~~~a~~v~ 278 (408) T protein:vir:74 240 IAAMGTVPKKPTIAN----------------FD---------------DVITMINTS----------VDPAIIATSSLLT 278 (408) T ss_pred hhccccccccccccc----------------HH---------------HHHHHHHHh----------hhhhhcCCCEEEE Confidence 999999887653210 00 011111100 0112234678999 Q ss_pred ccchhhhhcccccccCCCCccc-----------cccCCCceEEe--cCCCCc-----ceEEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFGEYV-----------TVLPHGITILE--SLAVET-----GKAIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~--s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~ 339 (377) ||.++..+.. .++.+|.|+ +++ |+||++ +..+|. ..++||||++ |.+++|++++++ T Consensus 279 n~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~ 353 (408) T protein:vir:74 279 NQSGLNKLAL---VKTAEGKYLLEPDPTKPNSYLIK--GKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLL 353 (408) T ss_pred cHHHHHHHHH---hhcCCCceEeccCcCCCCCceec--ceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEE Confidence 9998766542 234455543 344 555554 334553 3489999997 678999999999 Q ss_pred eechh--hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQT--FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~--~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++++. .|.+|++.||+.+|+||++++|+||+++++++. 
T Consensus 354 ~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 354 PTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAI 393 (408) T ss_pred EeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecc Confidence 98864 599999999999999999999999999999888 No 63 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=6.3e-50 Score=290.34 Aligned_cols=330 Identities=11% Similarity=0.079 Sum_probs=216.5 Q ss_pred CCcc--HHHHHHHHHHHHHHHHHHHhc---------cCHHHHHHHHHHHHHHHHHHHHHH---HH-HHHHHHHHhcc-cc Q lcl|Aclame:pro 1 MAIN--LKELPKYREAVAELSAKISAG---------ATPEEQEKLFEAAFTTMGDEILAK---NE-EEMERMFDLRD-KN 64 (377) Q Consensus 1 m~~~--~~~l~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~~~~~~~~-~~ 64 (377) |+++ +++|.+..+++.+..+.+.+. ...++..+ .....+.+..+..+. .. .+......... .. T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 6554 666665554443333333221 11111111 111111111111111 00 01111111000 00 Q ss_pred -------ccccHHHHHHHHHH---------------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-c Q lcl|Aclame:pro 65 -------RELTAEEIKFFNDI---------------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-R 121 (377) Q Consensus 65 -------~~lt~~e~~~~~~~---------------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~ 121 (377) ......+++.|... ...+++++||++||+++++.|++.+++.++|+++|+++|+++ . 
T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (404) T protein:vir:39 80 GPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (404) T ss_pred cccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCc Confidence 01122233333221 224566789999999999999999999999999999999864 3 Q ss_pred eE--EEEE-cCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcce Q lcl|Aclame:pro 122 LK--ALTA-ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAI 198 (377) Q Consensus 122 ~~--~p~~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~ 198 (377) .+ ++.. +..+.+.|+.|+++.++.++++|+++++.+++++++++||+||++||.+++++||.++|++++++++|++| T Consensus 160 ~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~i 239 (404) T protein:vir:39 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (404) T ss_pred ceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHH Confidence 44 4444 34567899988777765577999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) ++|+|+++|.|.... +.. +...+... 
.......+.+|+| T Consensus 240 l~g~g~~~~~~~~~~----------------~~~---------------i~~~~~~~----------~~~~~~~~a~~v~ 278 (404) T protein:vir:39 240 IAAMGTVPKKPTIAK----------------FDD---------------VITMINTS----------VDPAIIATSSLLT 278 (404) T ss_pred Hhccccccccccccc----------------HHH---------------HHHHHHHh----------hhhhhccCCEEEE Confidence 999999887654311 000 00111000 0112234568999 Q ss_pred ccchhhhhcccccccCCCCccc-----------cccCCCceEEe--cCCCCc-----ceEEEEeccc-EEEEecceeeEE Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFGEYV-----------TVLPHGITILE--SLAVET-----GKAIAFVANR-YDAFMATASTIE 339 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~--s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~ 339 (377) ||+++..+.. .++.+|.|+ +++ |+||++ +..+|. ..++||||++ |.++++++++++ T Consensus 279 n~~~~~~L~~---lkd~~G~~l~~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 353 (404) T protein:vir:39 279 NQSGLNKLAL---VKTAEGKYLLEPDPTKPNSYLIK--GKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLL 353 (404) T ss_pred cHHHHHHHHH---hhccCCceeeccCcCCCCcceec--ceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEE Confidence 9999766642 234555543 344 555554 334443 3489999997 678999999999 Q ss_pred eechh--hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYDQT--FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~~~--~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++++. .|.+|++.||+.+|+|+++.+|+||+++++++. 
T Consensus 354 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 354 PTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred EeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 98876 699999999999999999999999999998877 No 64 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2.3e-49 Score=287.26 Aligned_cols=342 Identities=15% Similarity=0.134 Sum_probs=215.1 Q ss_pred CCc--cHHHHHHHHHHHHHHHHHHHhcc------CHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhcc------- Q lcl|Aclame:pro 1 MAI--NLKELPKYREAVAELSAKISAGA------TPEEQEKLF---EAAFTTMGDEILAKNEEEMERMFDLRD------- 62 (377) Q Consensus 1 m~~--~~~~l~~~~~~~~~~~~~~~~~~------~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 62 (377) |++ .++.+++...++.+..+.+.+.+ ..+++.+.+ ......+..++......+......... T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~ 272 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNG 272 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 544 33444444333333322221111 111111222 222222222222111111110000000 Q ss_pred ---------------------------------ccccccHHH---H------HH---HHHHHh----ccCCCCCceeccH Q lcl|Aclame:pro 63 ---------------------------------KNRELTAEE---I------KF---FNDIDK----NVGGKDKFKLLPE 93 (377) Q Consensus 63 ---------------------------------~~~~lt~~e---~------~~---~~~~~~----~~~~s~gg~lvP~ 93 (377) .+......+ + +. +..... 
..+.+.||+++|+ T Consensus 273 ~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~ 352 (645) T protein:vir:93 273 NVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQ 352 (645) T ss_pred ccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCch Confidence 000000000 0 00 001111 1223458999999 Q ss_pred HHHHHHHHHHHhhhhhhhhceeE-e----cCCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhH Q lcl|Aclame:pro 94 ETMVQVFDDLVAEHPLLKVINFK-N----TSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPK 168 (377) Q Consensus 94 ~~~~~Ii~~~~~~s~l~~~~~v~-~----~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ 168 (377) ++.++||+.+++.+++++++... + +++++++|+.++++.+.|++|++.. ++++++|+++++.++|++++++||+ T Consensus 353 ~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~-~~s~~~f~~v~l~~~kla~~~~iS~ 431 (645) T protein:vir:93 353 EYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTK-PLTKFDFESITFSHAKVSAIAVLTE 431 (645) T ss_pred hhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccc-cccccceeEEEEeeEEEEEeehhHH Confidence 99999999999999999986542 2 2467899999999999999876555 5788999999999999999999999 Q ss_pred HHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC----cceeeeeccccccccccccccccccchhhhhhhhhhccC Q lcl|Aclame:pro 169 DALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL----QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLD 244 (377) Q Consensus 169 ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~----~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 244 (377) |||+||.+++++||+++|++++++++|.+||+|+|++ +|.|+++........ 
.....+ T Consensus 432 ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~-------~~~~~d----------- 493 (645) T protein:vir:93 432 ELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASS-------GNPDAD----------- 493 (645) T ss_pred HHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccc-------cchHHH----------- Confidence 9999999999999999999999999999999998764 588887643221110 000000 Q ss_pred hHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-------ccCCCceEEecCCCCc Q lcl|Aclame:pro 245 PDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-------VLPHGITILESLAVET 317 (377) Q Consensus 245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-------~l~~~~~v~~s~~~~~ 317 (377) ....+..+.. ......+.+|+|||.++..+.. .++.+|+|+. ...+|+||+.+++||+ T Consensus 494 ---~~~~~~~~~~---------a~~~~~~a~~vmn~~~~~~L~~---lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~ 558 (645) T protein:vir:93 494 ---AEAAFGQFVA---------ANLQPTGAVWLMSSTNALALSM---RKNALGQKEYPDMTLLGGSFQGLPVIVSQYVGD 558 (645) T ss_pred ---HHHHHHHHHh---------cCCCccccEEEEcHHHHHHHHh---ccccCCceeecCCCCCCceeeceeeEEeccCCc Confidence 0111111110 0011235689999998766532 3456665531 1126789999999986 Q ss_pred ceEEEEecccEEEEecceeeEEeechhh----------------------hhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 318 GKAIAFVANRYDAFMATASTIEEYDQTF----------------------AMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 318 ~~ii~gd~s~y~~~~~~~~~i~~~~~~~----------------------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) +++||||++|++++++++.|..+++.. |.+|+++||+.+|+|+++++|+||++|+=. 
T Consensus 559 -~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 559 -QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred -ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 578999999999999999998876642 889999999999999999999999999821 Q ss_pred ------cC Q lcl|Aclame:pro 376 ------GG 377 (377) Q Consensus 376 ------a~ 377 (377) +| T Consensus 638 ~~g~~~~~ 645 (645) T protein:vir:93 638 NYGSASGG 645 (645) T ss_pred cCCcccCC Confidence 11 No 65 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=3.8e-49 Score=286.04 Aligned_cols=324 Identities=13% Similarity=0.075 Sum_probs=209.8 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhcc-------CHH--HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH----hccccc Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGA-------TPE--EQEKLFEAAFTTMGDEILAKN--EEEMERMFD----LRDKNR 65 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~-------~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~----~~~~~~ 65 (377) |+.++++|.+ ++.++.+.+.+.. ..+ ++.+......+.+.+++.... ..+.+.... ...... T Consensus 2 ~~~~l~el~~---~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~ 78 (394) T protein:vir:97 2 FEEKIKEIKA---TIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGK 78 (394) T ss_pred cHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 3334455543 3333333222211 111 111112222222222221110 001110000 000000 Q ss_pred cccH---HHHHHH---------------------------------HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 66 ELTA---EEIKFF---------------------------------NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPL 109 (377) Q Consensus 66 ~lt~---~e~~~~---------------------------------~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l 109 (377) .... 
++++.+ .......+..+||++||+++++.|++.+++.++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l 158 (394) T protein:vir:97 79 EVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDL 158 (394) T ss_pred ccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhh Confidence 0000 000000 0011234566799999999999999999999999 Q ss_pred hhhceeEecC-CceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHH Q lcl|Aclame:pro 110 LKVINFKNTS-LRLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLK 187 (377) Q Consensus 110 ~~~~~v~~~~-~~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la 187 (377) +++|+++|++ +..++|+.. +++.+.|++|+++.++.++++|++|++.+++++++++||+|||+||.+++++||.++|+ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la 238 (394) T protein:vir:97 159 KPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) T ss_pred hhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHH Confidence 9999999986 457888865 45678999887777656779999999999999999999999999999999999999999 Q ss_pred HHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 188 EAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHP 267 (377) Q Consensus 188 ~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (377) +++++++|.+|++|.|++.|.|..+ +. .+...+ ... 
T Consensus 239 ~~~~~~~~~~i~~g~~~~~~~~~~~-----------------~~---------------~~~~~~----~~~-------- 274 (394) T protein:vir:97 239 QIKVNTTNDAIAKVLKSFTTKTVKN-----------------LD---------------EIKALL----NGG-------- 274 (394) T ss_pred HHHHHHHHHHHhhcccccccccccc-----------------HH---------------HHHHHH----Hhh-------- Confidence 9999999999999988766544321 00 011111 100 Q ss_pred hcccCceEEEeccchhhhhcccccccCCCCccc-----------cccCCCceEEecCCCCcceEEEEeccc-EEEEecce Q lcl|Aclame:pro 268 LKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV-----------TVLPHGITILESLAVETGKAIAFVANR-YDAFMATA 335 (377) Q Consensus 268 ~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~ 335 (377) .....+..|+|||+++..+.. .++.+|.|+ +++|+|+.+..+..+++++++||||++ |.+++|++ T Consensus 275 ~~~~~~a~~v~n~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 351 (394) T protein:vir:97 275 FDPAYNVSLIVSQSFYQTLDT---LKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKD 351 (394) T ss_pred hhhhhCCEEEEcHHHHHHHHH---hhccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecc Confidence 011124579999998765532 244555544 344444444446677888899999998 77899999 Q ss_pred eeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 336 STIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 336 ~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++++.+++.+|. 
.+||+++|+||++.+|+||+.|++++- T Consensus 352 ~~~~~~~~~~~~---~~~~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 352 LGLRWADNEIYG---QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred eEEEEecccccc---eeEEEEEEEccEEecccceEEEEeccc Confidence 999999887654 589999999999999999999999988 No 66 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=7.8e-50 Score=289.82 Aligned_cols=292 Identities=12% Similarity=0.065 Sum_probs=210.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhce Q lcl|Aclame:pro 35 FEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN 114 (377) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~ 114 (377) +++. ...+.+.+...... .+...+ ...+...++++|++||++++++|++.+++.++|+++|+ T Consensus 1 ~~~~---------~~~~~~~~~f~~~~--------~~~~~~-~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:97 1 MEQT---------QKLKLNLQHFASNN--------VKPQVF-NPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred Cccc---------hhHHHHHHHHHHhh--------hhhhhh-ccccccccCCCcceechhHHHHHHHHHHhhcchhhhcc Confidence 0000 00000010000000 000111 12223445678999999999999999999999999999 Q ss_pred eEecCC-ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 FKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVA 193 (377) Q Consensus 115 v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~ 193 (377) ++|+++ .+++|+.++.+.+.|++|+++. 
++++++|+++++.++|++++++||+|||+|+.++++++|.++|+++++++ T Consensus 63 ~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~ 141 (324) T protein:vir:97 63 YEPMEGTEKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred eeeccCCceEEEEEecCcceeEeccCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 999875 5899999999999999876655 57889999999999999999999999999999999999999999999999 Q ss_pred hhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccC Q lcl|Aclame:pro 194 LELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG 272 (377) Q Consensus 194 ~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (377) +|++||+|+|++ +|.||++.....+....+ ..++ ..+.+....+ .. .... T Consensus 142 ~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~---~~~~---------------~~i~~~~~~l----~~-------~~~~ 192 (324) T protein:vir:97 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKG---DFTQ---------------DNIIDLEALL----ED-------DELE 192 (324) T ss_pred HHHHhhccCCCCccCccccccccccceeccc---cCCH---------------HHHHHHHHhh----hh-------ccCC Confidence 999999999976 688988654332221111 1111 1111111111 10 1123 Q ss_pred ceEEEeccchhhhhcccccccCCCCccccc-----cCCCceEEecCC--CCcceEEEEecccEEEEecceeeEEeechhh Q lcl|Aclame:pro 273 QVKLLLNPEDRWTLEAKFTSRNQFGEYVTV-----LPHGITILESLA--VETGKAIAFVANRYDAFMATASTIEEYDQTF 345 (377) Q Consensus 273 ~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~-----l~~~~~v~~s~~--~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~ 345 (377) ...|+|||.++..+.. .++.+|.|+.. ..+|+||+.++. ++++.++||||++|+++++++++|+.++|.. 
T Consensus 193 ~~~~v~n~~~~~~L~~---lkd~~g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~ 269 (324) T protein:vir:97 193 ANAFISKTQNRSLLRK---IVDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQ 269 (324) T ss_pred CCEEEEcHHHHHHHHH---hhcCCCceeecCCCCccccceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccc Confidence 4579999999876542 23455554311 125667777665 4566799999999999999999999998854 Q ss_pred --------------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 --------------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 --------------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++.+++||++|+.+.. T Consensus 270 ~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 88999999999999999999999999999888 No 67 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=2.4e-49 Score=287.18 Aligned_cols=327 Identities=12% Similarity=0.016 Sum_probs=210.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCH---HH---HHHHHHHHHHHHHHHHHHHHHH------------------HHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATP---EE---QEKLFEAAFTTMGDEILAKNEE------------------EMER 56 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~~~~------------------~~~~ 56 (377) +....+++++..++..++.+........ ++ +.+........+.+++...... .... 
T Consensus 14 l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (400) T protein:vir:38 14 LDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRD 93 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHH Confidence 3333333333333333322222111100 00 0001111111111111000000 0000 Q ss_pred HHH----hc--------------cccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|Aclame:pro 57 MFD----LR--------------DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) Q Consensus 57 ~~~----~~--------------~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~ 118 (377) ... .. .........+...........++++||++||+++.+.|++.++++++|+++++++|+ T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 173 (400) T protein:vir:38 94 ALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQA 173 (400) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEec Confidence 000 00 000000000111111223334677899999999999999999999999999999998 Q ss_pred C-CceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 119 S-LRLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL 196 (377) Q Consensus 119 ~-~~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~ 196 (377) + ++.++|+.. +.+.+.|+.|+++.++.++++|++|++.+++++++++||+|||+||.+++++||.++|+++++.+++. 
T Consensus 174 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~ 253 (400) T protein:vir:38 174 STQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNG 253 (400) T ss_pred cCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 6 467888876 45678899888888777789999999999999999999999999999999999999999999999999 Q ss_pred ceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 197 AIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 197 a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) +|++|+|++.|.|+.+ +... ...+.. ..+...+.+| T Consensus 254 ~i~~~~~~~~~~~~~~-----------------~~~~---------------~~~~~~------------~~~~~~~a~~ 289 (400) T protein:vir:38 254 AVATLLKGFTAKTISS-----------------VDDL---------------KHINNV------------DLDPAYSRVI 289 (400) T ss_pred hhhhcccccccccccc-----------------HHHH---------------HHHHHh------------hhhhhhCcEE Confidence 9999999877665431 0000 000000 0111224679 Q ss_pred EeccchhhhhcccccccCCCCccccc---------cCCCceEEecCCCCcc-----eEEEEeccc-EEEEecceeeEEee Q lcl|Aclame:pro 277 LLNPEDRWTLEAKFTSRNQFGEYVTV---------LPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIEEY 341 (377) Q Consensus 277 ~~n~~~~~~~~~~~~~~~~~G~~~~~---------l~~~~~v~~s~~~~~~-----~ii~gd~s~-y~~~~~~~~~i~~~ 341 (377) +|||+++..+.. .++.+|.|+.. 
..+|+||+.++++|.+ .++|||||+ |.+++|+++++..+ T Consensus 290 v~~~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~ 366 (400) T protein:vir:38 290 IASQSFYNFLDT---VKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWV 366 (400) T ss_pred EEcHHHHHHHHH---hhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEe Confidence 999988765432 23455554411 2256778877777632 379999998 67888999999999 Q ss_pred chhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 342 DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 342 ~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++.+|.. +||+++|+||++.+++||+.|++++. T Consensus 367 ~~~~~~~---~~~~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 367 DDQIYGQ---FLQAGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred cccccce---eEEEEEEeccEEecccceEEEEeecC Confidence 9887654 89999999999999999999999999 No 68 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=9.7e-50 Score=289.29 Aligned_cols=271 Identities=13% Similarity=0.019 Sum_probs=203.7 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeecc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 157 (377) +..++++.||++||++++++|++.+++.|++|++++++|++ +.+++|+.++.+.+.|++|.++. 
++++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~-~~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVK-PSASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccc-cccccceeeeEeee Confidence 67788889999999999999999999999999999999987 46899999999999999876655 56889999999999 Q ss_pred eeEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHhhcceeeccCC--Cc-ceeeeecccccccccccccccccc Q lcl|Aclame:pro 158 FKLTAFVVIPKDALKFGPKW----LKQFITEQLKEAIAVALELAIVKGNGL--LQ-PVGLLKDLSQPTVDQSTGRDITTY 230 (377) Q Consensus 158 ~k~~~~~~iS~ell~ds~~~----~~~~l~~~la~~~a~~~~~a~l~G~G~--~~-P~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) +|++++++||+|||+|+..+ |+++|.++|++++++++|.+|++|+|. ++ |.|+.+...........+.. . T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~- 156 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDS--A- 156 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeecccc--c- Confidence 99999999999999988776 789999999999999999999999874 33 34544332211111110000 0 Q ss_pred chhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccc--cCCCCcccc------- Q lcl|Aclame:pro 231 KTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTS--RNQFGEYVT------- 301 (377) Q Consensus 231 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~--~~~~G~~~~------- 301 (377) + . .+..++.... ......+..|+|||.+...+...... ++.+|+|.. 
T Consensus 157 ~--------------~----d~~~~~~~~~------~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~ 212 (315) T protein:vir:80 157 T--------------A----DLVKAVGLIA------GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG 212 (315) T ss_pred h--------------H----HHHHHHHHHh------hccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCC Confidence 0 0 0111111110 01111234699999997776432111 223343321 Q ss_pred -ccCCCceEEecCCCCcc---------eEEEEecccEEEEecceeeEEeechh--------hhhcCcEEEEEEEEEcCEE Q lcl|Aclame:pro 302 -VLPHGITILESLAVETG---------KAIAFVANRYDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKA 363 (377) Q Consensus 302 -~l~~~~~v~~s~~~~~~---------~ii~gd~s~y~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~dg~~ 363 (377) ...+|+||+.+++||++ .++||||++|+++.+++++++++++. .|.+|++.||+.+|+|+++ T Consensus 213 ~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v 292 (315) T protein:vir:80 213 LDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAI 292 (315) T ss_pred CceecceeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEeccee Confidence 12357788899999853 37899999999999999999988763 4899999999999999999 Q ss_pred ecccceEEEEeecC Q lcl|Aclame:pro 364 KDNHTAALLTLAGG 377 (377) Q Consensus 364 ~~~~af~~l~~~a~ 377 (377) ++|+||++|+.++- T Consensus 293 ~~~~a~~~l~~~~a 306 (315) T protein:vir:80 293 ESLDSFAVVKEKAA 306 (315) T ss_pred ecccceEEEeeccC Confidence 99999999998775 No 69 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=8.9e-50 Score=289.49 Aligned_cols=274 Identities=14% Similarity=0.061 Sum_probs=212.4 Q ss_pred ccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeeecccccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDIFGEIKG 144 (377) Q Consensus 67 
lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e~~~~~~ 144 (377) ++.+..+.. +..+++++|.+||++++++|++.+++.++|+++|+++++++ ...+|+..+.+.+.|+.|.++. + T Consensus 1 m~~~~~~~~----~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~-~ 75 (297) T protein:vir:95 1 MTVQTFNPE----NVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKI-K 75 (297) T ss_pred CCccccccc----cccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccc-c Confidence 333333222 23455678889999999999999999999999999999864 4678889888999999876665 5 Q ss_pred cccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccc Q lcl|Aclame:pro 145 QLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTG 224 (377) Q Consensus 145 ~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~ 224 (377) +++++|+++++.+++++++++||+|+|+||.+++++||.+++++++++++|.+|++|+|+++|.||++.......... T Consensus 76 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~-- 153 (297) T protein:vir:95 76 TDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIG-- 153 (297) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecc-- Confidence 678999999999999999999999999999999999999999999999999999999999999999876443322111 Q ss_pred ccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccc-- Q lcl|Aclame:pro 225 RDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV-- 302 (377) Q Consensus 225 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~-- 302 (377) ...++ ..+.+.+..+ . ........|+|||.++..+.. .++.+|.|+.. 
T Consensus 154 -~~~t~---------------~~i~~~~~~l----~-------~~~~~~~~~v~~~~~~~~L~~---l~d~~G~~i~~~~ 203 (297) T protein:vir:95 154 -GPINY---------------DNILKLQDAL----Y-------DADVEPNAFVSKIQNRSALRE---ARDGNKVSIYDKA 203 (297) T ss_pred -cccCH---------------HHHHHHHHHh----h-------hccCCcCEEEEcHHHHHHHHH---hhccCCceeecCC Confidence 11111 1111211111 0 011234579999999776642 24455554321 Q ss_pred --cCCCceEEe--cCCCCcceEEEEecccEEEEecceeeEEeechhh--------------hhcCcEEEEEEEEEcCEEe Q lcl|Aclame:pro 303 --LPHGITILE--SLAVETGKAIAFVANRYDAFMATASTIEEYDQTF--------------AMEDLQLYLTKNYFYGKAK 364 (377) Q Consensus 303 --l~~~~~v~~--s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~--------------f~~~~~~~~~~~r~dg~~~ 364 (377) -.+|+||+. +..+++++++||||++|+++++++++++++++.. |.+|++.||+.+|+|+++. T Consensus 204 ~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 283 (297) T protein:vir:95 204 ANTIDGITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMIT 283 (297) T ss_pred CCcccceeeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEee Confidence 113556665 4456778899999999999999999999998864 8899999999999999999 Q ss_pred cccceEEEEeecC Q lcl|Aclame:pro 365 DNHTAALLTLAGG 377 (377) Q Consensus 365 ~~~af~~l~~~a~ 377 (377) +|+||++|+.++. 
T Consensus 284 ~~~a~~~l~~at~ 296 (297) T protein:vir:95 284 KTDAFAKLTPAER 296 (297) T ss_pred cccceEEEeecCC Confidence 9999999999999 No 70 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=1.5e-49 Score=288.32 Aligned_cols=278 Identities=12% Similarity=0.014 Sum_probs=212.7 Q ss_pred ccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccccccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQ 145 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~ 145 (377) +..++.. ..+..++++||++||++++++|++.+++.++|+++|+++|+++ ..++|+.++.+.+.|+.|.++. ++ T Consensus 1 ma~~~~~----~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~-~~ 75 (304) T protein:vir:94 1 MATPTYT----PGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERI-QT 75 (304) T ss_pred Ccccccc----cccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccc-cc Confidence 2111111 1234556788999999999999999999999999999999865 5899999999999999877665 46 Q ss_pred ccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccc-ccccc Q lcl|Aclame:pro 146 LKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTV-DQSTG 224 (377) Q Consensus 146 ~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~-~~~~~ 224 (377) ++++|+++++.++|++++++||+|+|+||.+++++||.++|++++++++|.+|++|+|+++|.|++........ ..... 
T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 155 (304) T protein:vir:94 76 SKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNV 155 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccc Confidence 78999999999999999999999999999999999999999999999999999999999998876532111111 11111 Q ss_pred ccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccc-- Q lcl|Aclame:pro 225 RDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV-- 302 (377) Q Consensus 225 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~-- 302 (377) ...+. ..++.+..++... ......+..|+|||+++..+.. .++.+|.|... T Consensus 156 ~~~~~-----------------~~~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~---lkd~~G~~l~~~~ 208 (304) T protein:vir:94 156 VTDTN-----------------NLYVDLSALMATI-------EDEELDPNGVLTTRSFRSKMRN---ALDANDRPLFDAN 208 (304) T ss_pred ccccc-----------------chHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHH---hhccCCcEeecCC Confidence 10000 0111122221111 1112234579999999877642 34566665421 Q ss_pred --cCCCceEEecCCCCc----ceEEEEecccEEEEecceeeEEeechhh----------------hhcCcEEEEEEEEEc Q lcl|Aclame:pro 303 --LPHGITILESLAVET----GKAIAFVANRYDAFMATASTIEEYDQTF----------------AMEDLQLYLTKNYFY 360 (377) Q Consensus 303 --l~~~~~v~~s~~~~~----~~ii~gd~s~y~~~~~~~~~i~~~~~~~----------------f~~~~~~~~~~~r~d 360 (377) ..+|+||+.++++|. +.++||||++|++++|+++++++++|.. 
|.+|++.||+.+|+| T Consensus 209 ~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~ 288 (304) T protein:vir:94 209 GNEIMGLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIA 288 (304) T ss_pred CccccceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEec Confidence 225788888999874 3589999999999999999999998853 899999999999999 Q ss_pred CEEecccceEEEEeec Q lcl|Aclame:pro 361 GKAKDNHTAALLTLAG 376 (377) Q Consensus 361 g~~~~~~af~~l~~~a 376 (377) +++.+|+||++||.+- T Consensus 289 ~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 289 YMNVKPEAFATLKPTE 304 (304) T ss_pred cEeecccceEEEEecC Confidence 9999999999999999 No 71 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=1.5e-49 Score=288.32 Aligned_cols=278 Identities=12% Similarity=0.014 Sum_probs=212.7 Q ss_pred ccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccccccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQ 145 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~ 145 (377) +..++.. ..+..++++||++||++++++|++.+++.++|+++|+++|+++ ..++|+.++.+.+.|+.|.++. 
++ T Consensus 1 ma~~~~~----~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~-~~ 75 (304) T protein:vir:10 1 MATPTYT----PGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERI-QT 75 (304) T ss_pred Ccccccc----cccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccc-cc Confidence 2111111 1234556788999999999999999999999999999999865 5899999999999999877665 46 Q ss_pred ccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccc-ccccc Q lcl|Aclame:pro 146 LKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTV-DQSTG 224 (377) Q Consensus 146 ~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~-~~~~~ 224 (377) ++++|+++++.++|++++++||+|+|+||.+++++||.++|++++++++|.+|++|+|+++|.|++........ ..... T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 155 (304) T protein:vir:10 76 SKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNV 155 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccc Confidence 78999999999999999999999999999999999999999999999999999999999998876532111111 11111 Q ss_pred ccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccc-- Q lcl|Aclame:pro 225 RDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV-- 302 (377) Q Consensus 225 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~-- 302 (377) ...+. ..++.+..++... ......+..|+|||+++..+.. .++.+|.|... 
T Consensus 156 ~~~~~-----------------~~~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~---lkd~~G~~l~~~~ 208 (304) T protein:vir:10 156 VTDTN-----------------NLYVDLSALMATI-------EDEELDPNGVLTTRSFRSKMRN---ALDANDRPLFDAN 208 (304) T ss_pred ccccc-----------------chHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHH---hhccCCcEeecCC Confidence 10000 0111122221111 1112234579999999877642 34566665421 Q ss_pred --cCCCceEEecCCCCc----ceEEEEecccEEEEecceeeEEeechhh----------------hhcCcEEEEEEEEEc Q lcl|Aclame:pro 303 --LPHGITILESLAVET----GKAIAFVANRYDAFMATASTIEEYDQTF----------------AMEDLQLYLTKNYFY 360 (377) Q Consensus 303 --l~~~~~v~~s~~~~~----~~ii~gd~s~y~~~~~~~~~i~~~~~~~----------------f~~~~~~~~~~~r~d 360 (377) ..+|+||+.++++|. +.++||||++|++++|+++++++++|.. |.+|++.||+.+|+| T Consensus 209 ~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~ 288 (304) T protein:vir:10 209 GNEIMGLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIA 288 (304) T ss_pred CccccceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEec Confidence 225788888999874 3589999999999999999999998853 899999999999999 Q ss_pred CEEecccceEEEEeec Q lcl|Aclame:pro 361 GKAKDNHTAALLTLAG 376 (377) Q Consensus 361 g~~~~~~af~~l~~~a 376 (377) +++.+|+||++||.+- T Consensus 289 ~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 289 YMNVKPEAFATLKPTE 304 (304) T ss_pred cEeecccceEEEEecC Confidence 9999999999999999 No 72 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=4.1e-49 Score=285.87 Aligned_cols=353 Identities=14% Similarity=0.099 Sum_probs=211.6 Q ss_pred CCccHHHHHH-------HHHHHHHHHHHHHhccCHH-------HHHHHHHHHHHHHHHHHH------HHHHH---HHHHH Q lcl|Aclame:pro 1 MAINLKELPK-------YREAVAELSAKISAGATPE-------EQEKLFEAAFTTMGDEIL------AKNEE---EMERM 57 (377) Q Consensus 1 
m~~~~~~l~~-------~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~------~~~~~---~~~~~ 57 (377) |.+++|++.. +..++.+..+.+.+.+..+ ++...+++..+.+..++. ++.+. ..... T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~ 80 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERS 80 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6665555533 3322222222221111111 111111111111111110 00000 00000 Q ss_pred H---H-h------ccc-c-cc-----------------------ccHHHH---------------HHHH---HHH-hccC Q lcl|Aclame:pro 58 F---D-L------RDK-N-RE-----------------------LTAEEI---------------KFFN---DID-KNVG 83 (377) Q Consensus 58 ~---~-~------~~~-~-~~-----------------------lt~~e~---------------~~~~---~~~-~~~~ 83 (377) . . . ... . .. ....++ .... ... -.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (477) T protein:vir:84 81 GKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRN 160 (477) T ss_pred hcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhcccccc Confidence 0 0 0 000 0 00 000000 0000 000 0234 Q ss_pred CCCCceeccHHH-HHHHHHHHHhhhhhhhhceeEecC---CceEEEEEcCCc-ceeeecccccc----cccccccceeEe Q lcl|Aclame:pro 84 GKDKFKLLPEET-MVQVFDDLVAEHPLLKVINFKNTS---LRLKALTAETSG-TAVWGDIFGEI----KGQLKQAFKEQD 154 (377) Q Consensus 84 ~s~gg~lvP~~~-~~~Ii~~~~~~s~l~~~~~v~~~~---~~~~~p~~~~~~-~a~w~~e~~~~----~~~~~~~f~~i~ 154 (377) ++.||++||+++ .++|++.+++.++|++++++++++ ++++||+..+++ .+.|++|+++. 
.++++++|++++ T Consensus 161 ~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~ 240 (477) T protein:vir:84 161 GGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQ 240 (477) T ss_pred CCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEE Confidence 556888888875 678999999999999999988865 358999876554 46678765443 346778999999 Q ss_pred ecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC-Ccceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 155 FSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGL-LQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 155 l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) +++++++++++||+|||+||.+++++||.++|+++++.++|.+||+|+|+ ++|.||++.........+.+. .++.. T Consensus 241 ~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~--~t~~~- 317 (477) T protein:vir:84 241 ANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAG--SALEK- 317 (477) T ss_pred EeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccc--cchhh- Confidence 99999999999999999999999999999999999999999999999997 589999987554333222111 11100 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc------------ Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT------------ 301 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~------------ 301 (377) ...+...+...... ..+....+...|+|||.++..+.. .++.+|.|+. 
T Consensus 318 -----------~~~~~~~i~~~~~~------~~~~~~~~~~~~v~~~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~~~ 377 (477) T protein:vir:84 318 -----------HQIIYQKIADAIQR------VHTSRFLEPEVIVMHPRRWASFHA---IFAGDDRPLIVPSGPGFNNLGV 377 (477) T ss_pred -----------HHHHHHHHHHHHhh------ccccccCCccEEEEcHHHHHHHHH---hhccCCCeeeecCccccccccc Confidence 00011111111110 001112233468888877654432 1234443321 Q ss_pred ----------ccCCCceEEecCCCCcc--------eEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEE Q lcl|Aclame:pro 302 ----------VLPHGITILESLAVETG--------KAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKA 363 (377) Q Consensus 302 ----------~l~~~~~v~~s~~~~~~--------~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~ 363 (377) ...+|+||+++++||++ .++||||++|+++. .++++.++++.++.++++.|+...++++++ T Consensus 378 ~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 456 (477) T protein:vir:84 378 LTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTA 456 (477) T ss_pred ccccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhhh Confidence 01257889999999964 48999999998876 589999999999999999999988888755 Q ss_pred e-cccceEEEEeecC Q lcl|Aclame:pro 364 K-DNHTAALLTLAGG 377 (377) Q Consensus 364 ~-~~~af~~l~~~a~ 377 (377) + +|+|||++|.+|- T Consensus 457 ~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 457 ARFPQSVVEIGGTAL 471 (477) T ss_pred hccccceEEeecccc Confidence 5 6999999999988 No 73 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.1e-48 Score=283.53 Aligned_cols=328 Identities=15% Similarity=0.087 Sum_probs=215.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhcc----CHHHHHHHHHHHHHHHHHHHHHH------HHHHHH------HHH-Hhccc Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGA----TPEEQEKLFEAAFTTMGDEILAK------NEEEME------RMF-DLRDK 63 (377) Q Consensus 1 
m~~~~~~l~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~------~~~-~~~~~ 63 (377) |....+.++++.+.+.++.+.+.... ...+..+......+.+..+.... .+...+ ... ..... T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 77744444555555444444443211 11111111111111111111110 000000 000 00000 Q ss_pred c----ccccHHHHHHHHHHH-----------hccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEE Q lcl|Aclame:pro 64 N----RELTAEEIKFFNDID-----------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTA 127 (377) Q Consensus 64 ~----~~lt~~e~~~~~~~~-----------~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~ 127 (377) . ......+++.|.... ...++++||++||++++.+|++.++++++|+++|+++|+++ ..++|+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPIL 160 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE Confidence 0 011233444454432 23567789999999999999999999999999999999875 5788876 Q ss_pred cC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCc Q lcl|Aclame:pro 128 ET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ 206 (377) Q Consensus 128 ~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~ 206 (377) .. 
.+.+.|+.|+++.++.++++|++|++.+++++++++||+|||+||.+++++||.++|++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~ 240 (394) T protein:vir:10 161 KRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT 240 (394) T ss_pred ecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 64 46788998888887667899999999999999999999999999999999999999999999999999999999988 Q ss_pred ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh Q lcl|Aclame:pro 207 PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) Q Consensus 207 P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~ 286 (377) |.++.+.. +++ .+..++.. ......+..|+|||+++..+ T Consensus 241 ~~~~~~~~--------------~~d-------------------~l~~~~~~--------~~~~~~~a~~vmn~~~~~~l 279 (394) T protein:vir:10 241 AKATTTDT--------------LVD-------------------SLKHILNV--------DLDPAYSRALVVTQSLFNTL 279 (394) T ss_pred cccccccc--------------cHH-------------------HHHHHHHh--------hhhhhccCEEEecHHHHHHH Confidence 77653211 000 01111000 01112245788888886554 Q ss_pred cccccccCCCCccc---------------cccCCCceEEecC--CCCc--c--eEEEEeccc-EEEEecceeeEEeechh Q lcl|Aclame:pro 287 EAKFTSRNQFGEYV---------------TVLPHGITILESL--AVET--G--KAIAFVANR-YDAFMATASTIEEYDQT 344 (377) Q Consensus 287 ~~~~~~~~~~G~~~---------------~~l~~~~~v~~s~--~~~~--~--~ii~gd~s~-y~~~~~~~~~i~~~~~~ 344 (377) .. .++.+|+|+ ++ +|+||++++ .++. + .++|||||+ |.+++++++++..+++. 
T Consensus 280 ~~---lkd~~G~~i~~~~~~~~~~~~~~~~L--~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~ 354 (394) T protein:vir:10 280 DT---LKDKNGRYLLHDASDSITDGTAKGTV--LGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSK 354 (394) T ss_pred HH---hhccCCCeeeeccccccccCCccccc--ccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEeccc Confidence 32 234455443 33 455665543 2332 2 389999998 67888999999999988 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|.+ +||+++|+||++++++||+++++++- T Consensus 355 ~~~~---~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 355 IYGR---YLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred ccce---eEEEEEEeccEEeccccEEEEEeecc Confidence 7765 68999999999999999999999877 No 74 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=8e-50 Score=289.74 Aligned_cols=286 Identities=13% Similarity=0.022 Sum_probs=216.6 Q ss_pred ccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccc Q lcl|Aclame:pro 61 RDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIF 139 (377) Q Consensus 61 ~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~ 139 (377) ...+..+..+.+. ...++++++|.+||++++++|++.+++.++|+++|+++++++ ++++|+.++.+.+.|+.|. 
T Consensus 1 ~~~~~~~~~~~~~-----~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQ-----IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEG 75 (320) T ss_pred CCCCccCCHHHHH-----hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 2222334344443 234556667778999999999999999999999999999864 6899999999999999876 Q ss_pred ccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccc Q lcl|Aclame:pro 140 GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTV 219 (377) Q Consensus 140 ~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~ 219 (377) ++. ++++++|+++++.++|++++++||+|+|+||.++++++|.+.|++++++++|++|++|+|+++|.|++........ T Consensus 76 ~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~ 154 (320) T protein:vir:10 76 DMK-PITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSL 154 (320) T ss_pred ccc-cccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccc Confidence 665 5788999999999999999999999999999999999999999999999999999999999999888755443333 Q ss_pred cccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcc Q lcl|Aclame:pro 220 DQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEY 299 (377) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~ 299 (377) ................ ...+.+.. ......++..|+|||.++..+.. 
.++.+|.| T Consensus 155 ~~~~~~~~~~~~~~~~-----------~~~~~~~~-----------~~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~ 209 (320) T protein:vir:10 155 ADPGGATASDLTAYDA-----------VAVNGLSL-----------LVNAKKKWTHTLLDDIVEPILNG---AKDKNGRP 209 (320) T ss_pred eecccccccccccHHH-----------HHHHHHhh-----------hhcccCCCcEEEEcHHHHHHHHH---hhccCCce Confidence 3222221111110000 00111100 01123456689999998766642 23444443 Q ss_pred cc--------------ccCCCceEEecCCCCcce--EEEEecccEEEEecceeeEEeechhh--------------hhcC Q lcl|Aclame:pro 300 VT--------------VLPHGITILESLAVETGK--AIAFVANRYDAFMATASTIEEYDQTF--------------AMED 349 (377) Q Consensus 300 ~~--------------~l~~~~~v~~s~~~~~~~--ii~gd~s~y~~~~~~~~~i~~~~~~~--------------f~~~ 349 (377) +. ...+|+||+.++++|+++ ++||||++|++++++++++++++|.. |.+| T Consensus 210 l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~ 289 (320) T protein:vir:10 210 LFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHN 289 (320) T ss_pred eeccccccCccccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcC Confidence 31 123688999999999987 57899999999999999999998865 8899 Q ss_pred cEEEEEEEEEcCEEecccceEEEE-eecC Q lcl|Aclame:pro 350 LQLYLTKNYFYGKAKDNHTAALLT-LAGG 377 (377) Q Consensus 350 ~~~~~~~~r~dg~~~~~~af~~l~-~~a~ 377 (377) ++.||+.+|+|+++++|+||++|+ ++|. 
T Consensus 290 ~~~~r~~~~~d~~v~~~~a~~~l~~~~ap 318 (320) T protein:vir:10 290 LVAVRVEAEYAFHNNDKDAFVKLTNVVTP 318 (320) T ss_pred cEEEEEEEeeccEEecccceEEEEeccCC Confidence 999999999999999999999999 4444 No 75 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.3e-49 Score=287.20 Aligned_cols=268 Identities=12% Similarity=-0.023 Sum_probs=206.2 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeeccee Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFK 159 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k 159 (377) -.+.++||++||++++++|++.+++.|+|+++|+++|++ ++.++|+.++.+.+.|++|+++. ++++++|+++++.++| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~-~~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQK-SESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCccc-ccccceeeEEEEeeEE Confidence 566778999999999999999999999999999999986 56999999999999999876655 5788999999999999 Q ss_pred EEEeehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHhhcceeeccCC--C-cceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 160 LTAFVVIPKDALK---FGPKWLKQFITEQLKEAIAVALELAIVKGNGL--L-QPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 160 ~~~~~~iS~ell~---ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~--~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++++++||+|||+ |+.++++++|.+++++++++++|.+|++|+|. + .|.||++.+.........+...... 
T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~--- 156 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT--- 156 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccch--- Confidence 9999999999995 66788999999999999999999999999853 3 4678876644333222221111100 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCc-eEEEeccchhhhhcccccccCCCCcccc---------cc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQ-VKLLLNPEDRWTLEAKFTSRNQFGEYVT---------VL 303 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l 303 (377) ....+..++.. ......+ ..|+|||.++..+.. .++.+|.|+- .. T Consensus 157 --------------~~~~i~~~~~~--------~~~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~t 211 (311) T protein:vir:81 157 --------------PDLAVEAAVGL--------VLGDNLSPDGVALDNTFSFMLAT---QRDSQGRKLYPELGFGTDVAS 211 (311) T ss_pred --------------HHHHHHHHHHH--------hhhcCCCceEEEEcHHHHHHHHh---hhccCCCeeecCccccCCCce Confidence 00111111111 1111112 249999999876642 2455665541 12 Q ss_pred CCCceEEecCCCCcc------------------eEEEEecccEEEEecceeeEEeechh-------hhhcCcEEEEEEEE Q lcl|Aclame:pro 304 PHGITILESLAVETG------------------KAIAFVANRYDAFMATASTIEEYDQT-------FAMEDLQLYLTKNY 358 (377) Q Consensus 304 ~~~~~v~~s~~~~~~------------------~ii~gd~s~y~~~~~~~~~i~~~~~~-------~f~~~~~~~~~~~r 358 (377) .+|+||++++.||.+ .++|||||+|+++.+++++++++++. 
.|.+|++.||+.+| T Consensus 212 l~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r 291 (311) T protein:vir:81 212 FAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVV 291 (311) T ss_pred ecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEE Confidence 257888888888742 36899999999999999999998763 49999999999999 Q ss_pred EcCEEecccceEEEEeecC Q lcl|Aclame:pro 359 FYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 359 ~dg~~~~~~af~~l~~~a~ 377 (377) +|++|++|+||++|+-+.- T Consensus 292 ~d~~v~~~~a~~~l~~a~~ 310 (311) T protein:vir:81 292 YGIGIMSTDAFAVVRDADE 310 (311) T ss_pred eccEeecccceEEEEeecc Confidence 9999999999999998877 No 76 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=6.6e-49 Score=284.71 Aligned_cols=330 Identities=12% Similarity=0.057 Sum_probs=210.2 Q ss_pred CCccHHHHHHHH----HHHHHHHHHHHhccCHH---HHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHhcccc Q lcl|Aclame:pro 1 MAINLKELPKYR----EAVAELSAKISAGATPE---EQEKLFEAAFTTMGDEIL---------AKNEEEMERMFDLRDKN 64 (377) Q Consensus 1 m~~~~~~l~~~~----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~ 64 (377) |++ ++|.+.. ++++++.+++.....++ ......+ ....+.+++. .+...+.+......... 
T Consensus 1 M~~--~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~e-e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:38 1 MNI--NQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVD-DINKLNASLKNAKMAQELAKSAYEDARANLNAEPVN 77 (395) T ss_pred CCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 555 4444333 33333333322211100 0000001 0111111110 00001111111111100 Q ss_pred cc------ccHHHH-------HHHHHHH--hccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEE Q lcl|Aclame:pro 65 RE------LTAEEI-------KFFNDID--KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL---RLKALT 126 (377) Q Consensus 65 ~~------lt~~e~-------~~~~~~~--~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~---~~~~p~ 126 (377) .. ...+.+ +.+.... ...++++||++||++++++|++.+++.++|+++|+++|+++ .+.+|. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (395) T protein:vir:38 78 KKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK 157 (395) T ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe Confidence 00 001111 1111111 12345579999999999999999999999999999998753 345555 Q ss_pred EcC-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC Q lcl|Aclame:pro 127 AET-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL 205 (377) Q Consensus 127 ~~~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~ 205 (377) ..+ .+.+.|++|+++.++..+++|++|++.+++++++++||+||++|+.++|++||.++|+++++++++.+|++|+|++ T Consensus 158 ~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~ 237 (395) T protein:vir:38 158 LADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKA 237 (395) T ss_pred eccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 443 4677899877777655669999999999999999999999999999999999999999999999999999999988 Q ss_pred 
cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhh Q lcl|Aclame:pro 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT 285 (377) Q Consensus 206 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 285 (377) .|.+... +++. +...+... -......+.+|+|||.++.. T Consensus 238 ~~~~~~~----------------~~~~---------------i~~~~~~~----------l~~~~~~~a~~v~n~~~~~~ 276 (395) T protein:vir:38 238 PKKPTIS----------------QFDN---------------IKDLENNT----------LDPAIESTSSFITNQSGYNI 276 (395) T ss_pred ccccccc----------------cHHH---------------HHHHHHHh----------hhhhhcCCCEEEEcHHHHHH Confidence 7643211 0000 00111000 01123356789999999766 Q ss_pred hcccccccCCCCcccc---------ccCCCceEEecCCCC------cceEEEEeccc-EEEEecceeeEEeech--hhhh Q lcl|Aclame:pro 286 LEAKFTSRNQFGEYVT---------VLPHGITILESLAVE------TGKAIAFVANR-YDAFMATASTIEEYDQ--TFAM 347 (377) Q Consensus 286 ~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~~------~~~ii~gd~s~-y~~~~~~~~~i~~~~~--~~f~ 347 (377) +.. .++.+|.|+- ...+|+||+.+++++ +..++||||++ |.++++++++|+.+++ .+|. T Consensus 277 L~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~ 353 (395) T protein:vir:38 277 LSK---VKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFE 353 (395) T ss_pred HHH---hhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhh Confidence 532 2445555441 112566777765543 23489999997 7889999999999875 4699 Q ss_pred cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 348 EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 348 ~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|++.||+..|+|+++.+|+||+++++++. 
T Consensus 354 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 354 HDTTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred cCceEEEEEEeeccEEecccceEEEEeecc Confidence 999999999999999999999999999987 No 77 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=1.1e-48 Score=283.41 Aligned_cols=350 Identities=13% Similarity=0.054 Sum_probs=217.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhcc---C--HHHHH---HHHHHHHHHHHHHHHHHHH--HHHHHHHHh-c--c----- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGA---T--PEEQE---KLFEAAFTTMGDEILAKNE--EEMERMFDL-R--D----- 62 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~---~--~~~~~---~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~--~----- 62 (377) |+. .+.|++.+.++++..+..+... . .++.. +.+.+..+.+..+...... .+.+..... . . T Consensus 1 m~~-~~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) T protein:vir:94 1 MPP-TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) T ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 776 3445544444444333332111 0 01101 1111111111111111000 000000000 0 0 Q ss_pred cccccc-----HHHHHHH-----------------HHHH-----hcc-CCCCCceeccHHHHHHHHHHHHhhhhhhhhce Q lcl|Aclame:pro 63 KNRELT-----AEEIKFF-----------------NDID-----KNV-GGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN 114 (377) Q Consensus 63 ~~~~lt-----~~e~~~~-----------------~~~~-----~~~-~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~ 114 (377) ..+... .+..+.+ .... 
..+ ....+++++|+.+...|+..++....++++|+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~ 159 (419) T protein:vir:94 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) T ss_pred cccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcce Confidence 000000 0000000 0000 011 22345567778877777888888889999999 Q ss_pred eEecCC-ceEEEEEcC--------CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHH Q lcl|Aclame:pro 115 FKNTSL-RLKALTAET--------SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQ 185 (377) Q Consensus 115 v~~~~~-~~~~p~~~~--------~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~ 185 (377) +.++++ .+++|+.++ .+.+.|++|++.. ++++++|+++++.+++++++++||+|||+|+ .++++||.++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~ 237 (419) T protein:vir:94 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAK-PQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGR 237 (419) T ss_pred eeeccCCceeeeeeccccccccccCcccceecCCccc-cccccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHH Confidence 999875 477776543 4457899876654 5788999999999999999999999999997 5799999999 Q ss_pred HHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 186 LKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKK 265 (377) Q Consensus 186 la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 265 (377) |++++++++|.+||+|+|+++|+||++................+. ...++.+..++... 
T Consensus 238 la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~---------------~~~~~~l~~~~~~~------ 296 (419) T protein:vir:94 238 LTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATD---------------EPPLVDIRRAKTVA------ 296 (419) T ss_pred HHHHHHHHHHHHHHhccCcccccceeccccccccccccccccccc---------------chhHHHHHHHHHhh------ Confidence 999999999999999999999999998655433322211111110 00111111111111 Q ss_pred hhhcccCceEEEeccchhhhhcccccccCCCCccc---------cccCCCceEEecCCCCcceEEEEeccc-EEEEecce Q lcl|Aclame:pro 266 HPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV---------TVLPHGITILESLAVETGKAIAFVANR-YDAFMATA 335 (377) Q Consensus 266 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~---------~~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~ 335 (377) ......+..|+|||.++..+..... +..|.|. ....+|+||+.++++|+++++||||++ |.++++++ T Consensus 297 -~~~~~~~~~~v~n~~~~~~l~~~k~--~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~ 373 (419) T protein:vir:94 297 -EIAGFPPDGVVVHPQDWESIELDQA--PGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQG 373 (419) T ss_pred -hhccCCCCEEEEcHHHHHHHHHHhh--cCCCceeecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEEecc Confidence 0111234579999999877643211 1112211 112367889999999999999999998 77899999 Q ss_pred eeEEeechhh--hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 336 STIEEYDQTF--AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 336 ~~i~~~~~~~--f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++++++++.. 
|.+|++.||+.+|+||++++|+||++|+++|- T Consensus 374 ~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 374 ITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred eEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 9999988764 99999999999999999999999999999999 No 78 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=3.3e-49 Score=286.38 Aligned_cols=284 Identities=17% Similarity=0.085 Sum_probs=209.5 Q ss_pred HHHHHHHHh--------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccc--- Q lcl|Aclame:pro 72 IKFFNDIDK--------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIF--- 139 (377) Q Consensus 72 ~~~~~~~~~--------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~--- 139 (377) ...+++.+. .+..+.++.+||+++.++|++.+++.++|+++++++|+++ ..++|+.++.+.+.|++|. 
T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~ 80 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccc Confidence 122222221 1223345569999999999999999999999999999874 6899999999999998654 Q ss_pred ----ccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcc---eeeee Q lcl|Aclame:pro 140 ----GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQP---VGLLK 212 (377) Q Consensus 140 ----~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P---~Gil~ 212 (377) ++..++++++|+++++.++|++++++||+|||+|+.+++++||+++|++++++++|.+|++|+|+++| .|+.+ T Consensus 81 ~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~ 160 (333) T protein:vir:78 81 EQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (333) T ss_pred cccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccc Confidence 23445778999999999999999999999999999999999999999999999999999999998655 45554 Q ss_pred ccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccc Q lcl|Aclame:pro 213 DLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTS 292 (377) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~ 292 (377) ................. ...++.+..+..... .........|+|||.++..+...... 
T Consensus 161 ~~~~~~~~~~~~~~~~~----------------~~~~~~i~~~~~~~~------~~~~~~~~~~vmn~~~~~~L~~~~~~ 218 (333) T protein:vir:78 161 DNVIANTTNVDYLQETG----------------DPLLDRLLDGYDLVS------ANTDVEFNGWAVDPRFRAHLLRAQAY 218 (333) T ss_pred ccccccccccccccccc----------------chhHHHHHHHHHhhc------cccccCceEEEEcchHHHHHHHHhhh Confidence 33222211111110000 001111111111110 11122234799999988766544445 Q ss_pred cCCCCccccc---------cCCCceEEecCCCCcc---------eEEEEecccEEEEecceeeEEeechh---------- Q lcl|Aclame:pro 293 RNQFGEYVTV---------LPHGITILESLAVETG---------KAIAFVANRYDAFMATASTIEEYDQT---------- 344 (377) Q Consensus 293 ~~~~G~~~~~---------l~~~~~v~~s~~~~~~---------~ii~gd~s~y~~~~~~~~~i~~~~~~---------- 344 (377) ++.+|.|+.. ..+|+||+.++++|++ .++||||++|+++++++++|+++++. T Consensus 219 ~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~ 298 (333) T protein:vir:78 219 RDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATV 298 (333) T ss_pred cCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEecccccccccccee Confidence 5666665421 2257889999999864 48999999999999999999999874 Q ss_pred -hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 -FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 -~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|.+|++.||+.+|+|+++.+++||++|+-++- T Consensus 299 ~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 299 SMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred ehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 488999999999999999999999999988777 No 79 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1.9e-48 Score=282.19 Aligned_cols=327 Identities=15% Similarity=0.086 Sum_probs=211.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhcc-----CHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHH----Hhcc--- Q lcl|Aclame:pro 1 
MAINLKELPKYREAVAELSAKISAGA-----TPEEQEKLFEAAFTTMGDEILAKN------EEEMERMF----DLRD--- 62 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~----~~~~--- 62 (377) |....+.+++..+.++++.+.+.... ..++.. ......+.+.++..+.. +.+..... .... T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~-~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQ-KIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKK 79 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 65544444444444444444433211 111111 11111111111111100 00000000 0000 Q ss_pred ---ccccccHHHHHHHHHH----------HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEc Q lcl|Aclame:pro 63 ---KNRELTAEEIKFFNDI----------DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAE 128 (377) Q Consensus 63 ---~~~~lt~~e~~~~~~~----------~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~ 128 (377) ........+++.+... +..+++++||++||+++...|++.++++++|+++|+++|+++ ..++|+.. T Consensus 80 ~~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 159 (389) T protein:vir:10 80 GTDLSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILK 159 (389) T ss_pred ccccchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEe Confidence 0001112334444332 234667789999999999999999999999999999999864 57888765 Q ss_pred C-CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcc Q lcl|Aclame:pro 129 T-SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQP 207 (377) Q Consensus 129 ~-~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P 207 (377) . 
...+.|+.|+++.++.++++|+++++.++++++++++|+|||+||.+++++||.++|+++++++++.+|++|+|++.| T Consensus 160 ~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~ 239 (389) T protein:vir:10 160 RATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTA 239 (389) T ss_pred cCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 3 456678888888877788999999999999999999999999999999999999999999999999999999998776 Q ss_pred eeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhc Q lcl|Aclame:pro 208 VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE 287 (377) Q Consensus 208 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~ 287 (377) .|.... .+++. +..++.. ..+...+..|+|||+++..+. T Consensus 240 ~~~~~~--------------~~~d~-------------------l~~~~~~--------~~~~~~~a~~~~n~~~~~~L~ 278 (389) T protein:vir:10 240 KKTTTD--------------TLVDS-------------------LKHILNV--------DLDPAYSRALVVTQSLFNTLD 278 (389) T ss_pred cccccc--------------ccHHH-------------------HHHHHHh--------hhhhhhCcEEEecHHHHHHHH Confidence 554211 01111 1111100 001112356888888765443 Q ss_pred ccccccCCCCccc---------------cccCCCceEEecC--CCCc--c--eEEEEeccc-EEEEecceeeEEeechhh Q lcl|Aclame:pro 288 AKFTSRNQFGEYV---------------TVLPHGITILESL--AVET--G--KAIAFVANR-YDAFMATASTIEEYDQTF 345 (377) Q Consensus 288 ~~~~~~~~~G~~~---------------~~l~~~~~v~~s~--~~~~--~--~ii~gd~s~-y~~~~~~~~~i~~~~~~~ 345 (377) . .++.+|.|+ ++ +|+||++.+ ..+. 
+ .++||||++ |.+++|++++|.++++.+ T Consensus 279 ~---lkd~~G~~i~~~~~~~~~~~~~~~~l--~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 353 (389) T protein:vir:10 279 T---LKDKNGRYLLHDASDSITDGTAKGTI--LGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI 353 (389) T ss_pred H---hhccCCCeeeecCccccccccccccc--ccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecccc Confidence 2 234445443 34 455554432 2232 2 289999998 789999999999999988 Q ss_pred hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+ .+|+++|+||++++|+||+.+++++- T Consensus 354 ~~~---~~~~~~r~d~~~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 354 YGK---YLGAAFRFGVQKADSKAGYFVTNTDV 382 (389) T ss_pred ccc---eEEEEEEeccEEecccceEEEEeecc Confidence 876 68999999999999999999998855 No 80 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1e-48 Score=283.72 Aligned_cols=290 Identities=12% Similarity=0.077 Sum_probs=208.3 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC- Q lcl|Aclame:pro 42 MGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL- 120 (377) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~- 120 (377) +++ .. +.+.+. ..+.... 
.+.+.++ ..+...++++|++||+++.++|++.+++.++|+++++++|+++ T Consensus 1 ~~~-~~-~~~~~~-~~~~~~~-------~~~~~~~-a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~ 69 (324) T protein:vir:96 1 MEQ-TQ-KLKLNL-QHFASNN-------VKPQVFN-PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) T ss_pred CCc-ch-hhhHHH-HHHHHHh-------hhhhhhc-cccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 000 00 000000 0010000 0011111 2234556778999999999999999999999999999999975 Q ss_pred ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee Q lcl|Aclame:pro 121 RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK 200 (377) Q Consensus 121 ~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~ 200 (377) ..++|+.++.+.+.|++|.+.. ++++++|+++++.++|++++++||+|||+||.+++++||.++|++++++++|.++|+ T Consensus 70 ~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 148 (324) T protein:vir:96 70 EKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred ceEEEEEecCcceeEecCCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 5899999999999999876555 578899999999999999999999999999999999999999999999999999999 Q ss_pred ccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 201 GNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 201 G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) |+|++ +|.||.+.......... ...++. .+.+....+ . 
........|+|| T Consensus 149 G~g~~~~~~gi~~~~~~~~~~~~---~~~t~~---------------~i~~~~~~l----~-------~~~~~~~~~vmn 199 (324) T protein:vir:96 149 NQGNNPFGKSIAQSIEKTNKVIK---GDFTQD---------------NIIDLEALL----E-------DDELEANAFISK 199 (324) T ss_pred cCCCCCcCccccccccccceecc---ccccHH---------------HHHHHHHhh----h-------hccCCCCEEEEc Confidence 99975 68888765433221111 001111 111111111 0 011233479999 Q ss_pred cchhhhhcccccccCCCCccc-------cccCCCceEEecCC--CCcceEEEEecccEEEEecceeeEEeechhh----- Q lcl|Aclame:pro 280 PEDRWTLEAKFTSRNQFGEYV-------TVLPHGITILESLA--VETGKAIAFVANRYDAFMATASTIEEYDQTF----- 345 (377) Q Consensus 280 ~~~~~~~~~~~~~~~~~G~~~-------~~l~~~~~v~~s~~--~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~----- 345 (377) |.++..+.. .++.+|.|. ++ +|+||+.++. ++++.++||||++++++++++++++.++|.. T Consensus 200 ~~~~~~L~~---l~d~~G~~~~~~~~~~~l--~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:96 200 TQNRSLLRK---IVDPETKERIYDRNSDSL--DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVK 274 (324) T ss_pred HHHHHHHHH---hhccCCCeeecCCCCCcc--cceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccc Confidence 999776542 234455433 33 4566666554 5566799999999999999999999998854 Q ss_pred ---------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 ---------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 ---------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++.+|+||++|+.+-. 
T Consensus 275 ~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred cccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 88999999999999999999999999997555 No 81 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1e-48 Score=283.72 Aligned_cols=290 Identities=12% Similarity=0.077 Sum_probs=208.3 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC- Q lcl|Aclame:pro 42 MGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL- 120 (377) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~- 120 (377) +++ .. +.+.+. ..+.... .+.+.++ ..+...++++|++||+++.++|++.+++.++|+++++++|+++ T Consensus 1 ~~~-~~-~~~~~~-~~~~~~~-------~~~~~~~-a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~ 69 (324) T protein:vir:78 1 MEQ-TQ-KLKLNL-QHFASNN-------VKPQVFN-PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) T ss_pred CCc-ch-hhhHHH-HHHHHHh-------hhhhhhc-cccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 000 00 000000 0010000 0011111 2234556778999999999999999999999999999999975 Q ss_pred ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee Q lcl|Aclame:pro 121 RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK 200 (377) Q Consensus 121 ~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~ 200 (377) ..++|+.++.+.+.|++|.+.. 
++++++|+++++.++|++++++||+|||+||.+++++||.++|++++++++|.++|+ T Consensus 70 ~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 148 (324) T protein:vir:78 70 EKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred ceEEEEEecCcceeEecCCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 5899999999999999876555 578899999999999999999999999999999999999999999999999999999 Q ss_pred ccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 201 GNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 201 G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) |+|++ +|.||.+.......... ...++. .+.+....+ . ........|+|| T Consensus 149 G~g~~~~~~gi~~~~~~~~~~~~---~~~t~~---------------~i~~~~~~l----~-------~~~~~~~~~vmn 199 (324) T protein:vir:78 149 NQGNNPFGKSIAQSIEKTNKVIK---GDFTQD---------------NIIDLEALL----E-------DDELEANAFISK 199 (324) T ss_pred cCCCCCcCccccccccccceecc---ccccHH---------------HHHHHHHhh----h-------hccCCCCEEEEc Confidence 99975 68888765433221111 001111 111111111 0 011233479999 Q ss_pred cchhhhhcccccccCCCCccc-------cccCCCceEEecCC--CCcceEEEEecccEEEEecceeeEEeechhh----- Q lcl|Aclame:pro 280 PEDRWTLEAKFTSRNQFGEYV-------TVLPHGITILESLA--VETGKAIAFVANRYDAFMATASTIEEYDQTF----- 345 (377) Q Consensus 280 ~~~~~~~~~~~~~~~~~G~~~-------~~l~~~~~v~~s~~--~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~----- 345 (377) |.++..+.. .++.+|.|. ++ +|+||+.++. ++++.++||||++++++++++++++.++|.. 
T Consensus 200 ~~~~~~L~~---l~d~~G~~~~~~~~~~~l--~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:78 200 TQNRSLLRK---IVDPETKERIYDRNSDSL--DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVK 274 (324) T ss_pred HHHHHHHHH---hhccCCCeeecCCCCCcc--cceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccc Confidence 999776542 234455433 33 4566666554 5566799999999999999999999998854 Q ss_pred ---------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 ---------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 ---------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++.+|+||++|+.+-. T Consensus 275 ~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred cccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 88999999999999999999999999997555 No 82 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=9.6e-49 Score=283.83 Aligned_cols=290 Identities=12% Similarity=0.077 Sum_probs=209.9 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC- Q lcl|Aclame:pro 42 MGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL- 120 (377) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~- 120 (377) +++ ..+.+.+++....... 
+.+.|+ ..+.....+++++||++++++|++.+++.++|+++|+++|+++ T Consensus 1 ~~~--~~~~~~~~~~f~~~~~--------~~~~~~-a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~ 69 (324) T protein:vir:93 1 MEQ--TQKLKLNLQHFASNNV--------KPQVFN-PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) T ss_pred Cch--hHHHHHHHHHHHHhhh--------hhhhcc-cccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 111 0111111111111110 111111 1223444567779999999999999999999999999999875 Q ss_pred ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee Q lcl|Aclame:pro 121 RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK 200 (377) Q Consensus 121 ~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~ 200 (377) .+++|+.++.+.+.|++|.+.. ++++++|+++++.++|++++++||+|||+||.+++++||.++|++++++++|+++|+ T Consensus 70 ~~~ip~~~~~~~a~~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~ 148 (324) T protein:vir:93 70 EKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred ceEEEEEecCcceeeecCCccc-cccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 5899999999999999876655 567899999999999999999999999999999999999999999999999999999 Q ss_pred ccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 201 GNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 201 G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) |+|++ +|.|+++........... ..++ ..+.+.+.. +. . 
.......|+|| T Consensus 149 G~g~~~~~~~~~~~~~~~~~~~~~---~~~~---------------~~i~~~~~~----l~------~-~~~~~~~~v~n 199 (324) T protein:vir:93 149 NQGNNPFGKSIAQSIEKTNKVIKG---DFTQ---------------DNIIDLEAL----LE------D-DELEANAFISK 199 (324) T ss_pred CCCCCCcCccccccccccceeccc---cccH---------------HHHHHHHHh----hh------h-ccCCCCEEEEc Confidence 99975 688988654433221111 0111 111111111 10 0 11223479999 Q ss_pred cchhhhhcccccccCCCCccc-------cccCCCceEEecCC--CCcceEEEEecccEEEEecceeeEEeechhh----- Q lcl|Aclame:pro 280 PEDRWTLEAKFTSRNQFGEYV-------TVLPHGITILESLA--VETGKAIAFVANRYDAFMATASTIEEYDQTF----- 345 (377) Q Consensus 280 ~~~~~~~~~~~~~~~~~G~~~-------~~l~~~~~v~~s~~--~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~----- 345 (377) |+++..+.. .++.+|.|. ++ +|+||+.++. .+++.+++|||++++++++++++|+.++|.. T Consensus 200 ~~~~~~L~~---l~d~~G~~~~~~~~~~~l--~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:93 200 TQNRSLLRK---IVDPETKERIYDRNSDSL--DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVK 274 (324) T ss_pred HHHHHHHHH---hhCCCCCeeecCCCCCcc--cceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccc Confidence 999776542 245555544 33 4666766554 5567799999999999999999999998854 Q ss_pred ---------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 ---------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 ---------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++.+|+||++|+.+.. 
T Consensus 275 ~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred cccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 88999999999999999999999999998777 No 83 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=1.2e-48 Score=283.36 Aligned_cols=289 Identities=15% Similarity=0.056 Sum_probs=205.3 Q ss_pred cccHHHHHHHHHH--HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceee------- Q lcl|Aclame:pro 66 ELTAEEIKFFNDI--DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVW------- 135 (377) Q Consensus 66 ~lt~~e~~~~~~~--~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w------- 135 (377) .-+-.|.+.-... ...+.++.++.+||++++++|++.+++.++|+++|+++|+++ .+++|+.+..+.+.| T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 0011111111100 112223456679999999999999999999999999999975 589999877655544 Q ss_pred -ecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCC---cceeee Q lcl|Aclame:pro 136 -GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL---QPVGLL 211 (377) Q Consensus 136 -~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~---~P~Gil 211 (377) ++|.++ .++++++|+++++.++|++++++||+|||+||.+++++||.++|++++++++|.+|++|+|++ +|.||+ T Consensus 81 ~~~Eg~~-~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~ 159 (338) T protein:vir:78 81 EQREGGT-KPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (338) T ss_pred 
ccccccc-ccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc Confidence 444444 457789999999999999999999999999999999999999999999999999999999975 466776 Q ss_pred eccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc Q lcl|Aclame:pro 212 KDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT 291 (377) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 291 (377) +............... .....++.+..+....... .......|+|||.++..+..... T Consensus 160 ~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~------~~~~~~~~~m~~~~~~~L~~~~~ 217 (338) T protein:vir:78 160 TNNVIVNTTNVDYLQT----------------GTTPLLDRFLDGYDLVSAN------TDVDFNGWAADPRYRARLLRSQA 217 (338) T ss_pred cccccccccccccccc----------------cchhhHHHHHHHHHHhhhh------ccccceEEEEchHHHHHHHHHhh Confidence 5433222111111000 0011112222221111110 11133469999998876654434 Q ss_pred ccCCCCcccc---------ccCCCceEEecCCCCc---------ceEEEEecccEEEEecceeeEEeechh--------- Q lcl|Aclame:pro 292 SRNQFGEYVT---------VLPHGITILESLAVET---------GKAIAFVANRYDAFMATASTIEEYDQT--------- 344 (377) Q Consensus 292 ~~~~~G~~~~---------~l~~~~~v~~s~~~~~---------~~ii~gd~s~y~~~~~~~~~i~~~~~~--------- 344 (377) .++.+|.|+. ...+|+||+.+++||+ ..++||||++|+++++++++|+++++. 
T Consensus 218 l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~ 297 (338) T protein:vir:78 218 YRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPT 297 (338) T ss_pred hccCCCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeeccccccccccc Confidence 4566666541 1226788999999874 238899999999999999999999874 Q ss_pred -----hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 -----FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 -----~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|.+|++.||+.+|+|+++++|+||++|+-++= T Consensus 298 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 335 (338) T protein:vir:78 298 PQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDED 335 (338) T ss_pred ccchhhhhcCcEEEEEEEEeccEeecccceEEEecccC Confidence 388999999999999999999999999887665 No 84 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=8.1e-49 Score=284.25 Aligned_cols=275 Identities=14% Similarity=0.013 Sum_probs=207.8 Q ss_pred ccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccccccc Q lcl|Aclame:pro 65 RELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIK 143 (377) Q Consensus 65 ~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~ 143 (377) -..+++.+.. ...+++++|.+||++++++|++.+++.++|+++++++++++ .+++|+.+..+.+.|++|.++. 
T Consensus 1 ~g~~~e~~~~-----~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~- 74 (397) T protein:vir:23 1 MGFSADHSQI-----AQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMK- 74 (397) T ss_pred CCcCHHHHHH-----hhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccc- Confidence 2233444432 22344444556777889999999999999999999999875 5899999999999999876555 Q ss_pred ccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccc Q lcl|Aclame:pro 144 GQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQST 223 (377) Q Consensus 144 ~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~ 223 (377) ++++++|+++++.+||++++++||+|||+|+.+++++||+++|++++++++|++||+|+|+++|.+.+............ T Consensus 75 ~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~ 154 (397) T protein:vir:23 75 PITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISP 154 (397) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecc Confidence 57889999999999999999999999999999999999999999999999999999999998765544332222111110 Q ss_pred cccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-- Q lcl|Aclame:pro 224 GRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-- 301 (377) Q Consensus 224 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-- 301 (377) ...... ..+.+..+. .....+..|+|||+++..+.. .++.+|.|+. 
T Consensus 155 ---~~~~~~---------------~~~~~~~l~-----------~~~~~~a~~vmn~~~~~~L~~---lkd~~G~~i~~~ 202 (397) T protein:vir:23 155 ---NAYQGL---------------GVSGLTKLV-----------TDGKKWTHTLLDDTVEPVLNG---SVDANGRPLFVE 202 (397) T ss_pred ---cchhHH---------------HHHHHHhhh-----------hcccCCCEEEEcHHHHHHHHH---hhccCCceeecc Confidence 000000 111111110 112234579999988776543 2444555431 Q ss_pred ------------ccCCCceEEecCCCCcceE--EEEecccEEEEecceeeEEeechhh--------------hhcCcEEE Q lcl|Aclame:pro 302 ------------VLPHGITILESLAVETGKA--IAFVANRYDAFMATASTIEEYDQTF--------------AMEDLQLY 353 (377) Q Consensus 302 ------------~l~~~~~v~~s~~~~~~~i--i~gd~s~y~~~~~~~~~i~~~~~~~--------------f~~~~~~~ 353 (377) ...+|+||+.++++|++++ +||||++|++++++++.+++++|.. |.+|++.| T Consensus 203 ~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ 282 (397) T protein:vir:23 203 STYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAV 282 (397) T ss_pred cccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeE Confidence 0126889999999998874 7999999999999999999998864 88999999 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |+.+|+|+++++++||+.++.+.. 
T Consensus 283 ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 283 RVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred EEEeeeccceecccceEEEeeccc Confidence 999999999999999999999877 No 85 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=9.3e-49 Score=283.91 Aligned_cols=277 Identities=12% Similarity=0.011 Sum_probs=207.5 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeeccccccc----ccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIK----GQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~----~~~~~~f~~i 153 (377) ++..++++||++||++++++|++.+++.++|+++++++++++ ..++|+.++.+.+.|++|.+... +.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 778888999999999999999999999999999999999875 58999999999999998765533 3457899999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++.+||++++++||+||++||.+++++||+++|++++++++|++|++|+|++++.+....................... 
T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 159 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN- 159 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchh- Confidence 9999999999999999999999999999999999999999999999999976554433221111111111011110000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccc--cCCCceEEe Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV--LPHGITILE 311 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~--l~~~~~v~~ 311 (377) ..+ ....+..++.... . .......|+|||.++..+.. .++.+|+|+.. ..+|+||+. T Consensus 160 --------~~~---~~~~~~~~~~~~~---~----~~~~~~~~v~~~~~~~~l~~---lkd~~G~~i~~~~~l~G~Pv~~ 218 (305) T protein:vir:25 160 --------ESD---IVGATNRAAKAVA---S----AGWAPDTLLSSLALRYEVAN---IRDANGNPVFRDDSFAGFRTFF 218 (305) T ss_pred --------hhH---HHHHHHHHHHhhh---h----cccccceeEecHHHHHHHHH---hhccCCceeecCCcccccceEE Confidence 000 1111111111110 0 01112249999998876642 35677877621 235677888 Q ss_pred cCCCCc----ceEEEEecccEEEEecceeeEEeechh----------hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 312 SLAVET----GKAIAFVANRYDAFMATASTIEEYDQT----------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 312 s~~~~~----~~ii~gd~s~y~~~~~~~~~i~~~~~~----------~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +++++. +.++||||++|+++++++++|+.+++. 
.|.+|++.+|+.+|+|+.+.+|+||+.++..-- T Consensus 219 ~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 219 NRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred cCccCCCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 888764 358999999999999999999998875 378899999999999999999999999998643 No 86 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=3.6e-48 Score=280.72 Aligned_cols=292 Identities=12% Similarity=0.070 Sum_probs=208.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhce Q lcl|Aclame:pro 35 FEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN 114 (377) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~ 114 (377) +++..+ . +.+.+........... +. ..+.....++|.+||++++++|++.+++.++|+++|+ T Consensus 1 ~~k~~~-----~----~~~~~~~~~~~~~~~~--------~~-a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:99 1 MEQTQK-----L----KLNLQHFASNNVKPQV--------FN-PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGK 62 (324) T ss_pred CCCchH-----h----hHHHHHHHHHhhhhhh--------cc-ccceeccCCCcceechhHHHHHHHHHHhhchhhhhcc Confidence 111000 0 0000000000000000 10 1122334556779999999999999999999999999 Q ss_pred eEecCC-ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 FKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVA 193 (377) Q Consensus 115 v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~ 193 (377) ++|+++ ..++|+.++.+.+.|++|.+.. 
++++++|+++++.++|++++++||+|||+|+.+++++||.++|+++++++ T Consensus 63 ~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~ 141 (324) T protein:vir:99 63 YEPMEGTEKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred eeeccCCceEEEEEecCcceeEeccCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 999864 5899999989999999876655 56789999999999999999999999999999999999999999999999 Q ss_pred hhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccC Q lcl|Aclame:pro 194 LELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG 272 (377) Q Consensus 194 ~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (377) +|+++|+|+|++ +|.|+++........... ..++ ..+.+.+ ..+. + .... T Consensus 142 ~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~---~~~~---------------~~i~~~~----~~l~------~-~~~~ 192 (324) T protein:vir:99 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKG---DFTQ---------------DNIIDLE----ALLE------D-DELE 192 (324) T ss_pred HHHHhhhcCCCCccCccccccccccceeccc---cCCH---------------HHHHHHH----Hhhh------h-ccCC Confidence 999999999976 688888654433221111 0111 1111111 1111 1 1123 Q ss_pred ceEEEeccchhhhhcccccccCCCCcccc-----ccCCCceEEecCCCC--cceEEEEecccEEEEecceeeEEeechhh Q lcl|Aclame:pro 273 QVKLLLNPEDRWTLEAKFTSRNQFGEYVT-----VLPHGITILESLAVE--TGKAIAFVANRYDAFMATASTIEEYDQTF 345 (377) Q Consensus 273 ~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-----~l~~~~~v~~s~~~~--~~~ii~gd~s~y~~~~~~~~~i~~~~~~~ 345 (377) ...|+|||+++..+.. + ++.+|.|+- ...+|+||+.++.++ ++.+++|||++|+++++++++|+.++|.. 
T Consensus 193 ~~~~v~n~~~~~~L~~-l--~d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:99 193 ANAFISKTQNRSLLRK-I--VDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQ 269 (324) T ss_pred CCEEEEcHHHHHHHHH-h--hcCCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccc Confidence 3479999999776542 2 334444321 112567777776655 45699999999999999999999998854 Q ss_pred --------------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 --------------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 --------------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++.+++||++|+.+.. T Consensus 270 ~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 88999999999999999999999999999877 No 87 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=3.9e-48 Score=280.51 Aligned_cols=292 Identities=12% Similarity=0.059 Sum_probs=208.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhce Q lcl|Aclame:pro 35 FEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN 114 (377) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~ 114 (377) +++. 
.+.+.+.+...........+ ...+.....++|.+||++++++|++.+++.++|+++|+ T Consensus 1 ~~~~---------~~~~~~~~~f~~~~~~~~~~---------~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:10 1 MEQT---------QKLKLNLQHFASNNVKPQVF---------NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCCc---------hHHHHHHHHHHHHhhcccee---------cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcc Confidence 0000 00000111100000001111 01123344566779999999999999999999999999 Q ss_pred eEecCC-ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 FKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVA 193 (377) Q Consensus 115 v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~ 193 (377) ++|+++ ..++|+.++.+.+.|++|.++. ++++++|+++++.++|++++++||+|||+|+.+++++||.++|+++++++ T Consensus 63 ~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~ 141 (324) T protein:vir:10 63 YEPMEGTEKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred eeeccCCceEEEEEeCCcceeEeccCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 999864 5899999988999999876665 56789999999999999999999999999999999999999999999999 Q ss_pred hhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccC Q lcl|Aclame:pro 194 LELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG 272 (377) Q Consensus 194 ~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (377) +|.++|+|+|++ .|.||++........... ..++ ..+.+.+.. +. . .... 
T Consensus 142 ~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~---~~t~---------------~~i~~~~~~----l~------~-~~~~ 192 (324) T protein:vir:10 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKG---DFTQ---------------DNIIDLEAL----LE------D-DELE 192 (324) T ss_pred HHHHhhhcCCCCccCccccccccccceeccc---cCCH---------------HHHHHHHHh----hh------h-ccCC Confidence 999999999986 689988754433222111 0010 111111111 11 0 1123 Q ss_pred ceEEEeccchhhhhcccccccCCCCcccc-----ccCCCceEEecCCC--CcceEEEEecccEEEEecceeeEEeechhh Q lcl|Aclame:pro 273 QVKLLLNPEDRWTLEAKFTSRNQFGEYVT-----VLPHGITILESLAV--ETGKAIAFVANRYDAFMATASTIEEYDQTF 345 (377) Q Consensus 273 ~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-----~l~~~~~v~~s~~~--~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~ 345 (377) ...|+|||.++..+.. .++.+|.|.- ...+|+||+.++.+ +++.+++|||++|+++++++++|++++|.. T Consensus 193 ~~~~v~n~~~~~~L~~---l~d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:10 193 ANAFISKTQNRSLLRK---IVDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQ 269 (324) T ss_pred CCEEEEcHHHHHHHHH---hhccCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccc Confidence 4479999999776542 2344554431 11256677776654 456699999999999999999999998853 Q ss_pred --------------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 --------------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 --------------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+|++.||+.+|+|+++.+++||++|+.+.- T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 88999999999999999999999999999877 No 88 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.6e-47 Score=277.13 Aligned_cols=333 Identities=11% Similarity=0.034 Sum_probs=210.6 Q 
ss_pred CCccHHHHHHH----HHHHHHHHHHHHhccCH--HHHHHHHH----HHHHHHHHHHHHH--HHHHHHHHHHhcccccccc Q lcl|Aclame:pro 1 MAINLKELPKY----REAVAELSAKISAGATP--EEQEKLFE----AAFTTMGDEILAK--NEEEMERMFDLRDKNRELT 68 (377) Q Consensus 1 m~~~~~~l~~~----~~~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~lt 68 (377) |. +.++++. .+++++..++..++... ++..+.+. ...+++..+.... ...+.+............. T Consensus 1 m~--~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 78 (379) T protein:vir:10 1 ME--ALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKS 78 (379) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 44 4444333 22222222211111000 00000000 0011111111000 0000000010000000000 Q ss_pred HHH-------HHHHHH-----------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcC Q lcl|Aclame:pro 69 AEE-------IKFFND-----------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAET 129 (377) Q Consensus 69 ~~e-------~~~~~~-----------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~ 129 (377) ... .+.... ....+++++++.+||+++...|++.++..++|+++|+++++++ .+++|+.++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 158 (379) T protein:vir:10 79 DSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENG 158 (379) T ss_pred hhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeec Confidence 000 000110 0112334566678999999999999999999999999999875 589998764 Q ss_pred --CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcc Q lcl|Aclame:pro 130 --SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQP 207 (377) Q Consensus 130 --~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P 207 (377) .+.+.|++|++.. ++++++|++|++.+++++++++||+|||+|+. 
++++||.++|+++++.+++.+|++|+|++.+ T Consensus 159 ~~~~~~~~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~ 236 (379) T protein:vir:10 159 AGEGAIGAQVEGATK-GQKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANAT 236 (379) T ss_pred CCCcccccccCCccc-cccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 3466788766555 56789999999999999999999999999985 6999999999999999999999999987654 Q ss_pred eeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhc Q lcl|Aclame:pro 208 VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE 287 (377) Q Consensus 208 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~ 287 (377) .+.... .....++ .+.+.+..+. +. ...+..|+|||+++..+. T Consensus 237 ~~~~~~-----------~~~~~~d---------------~i~~~~~~~~----------~~-~~~~~~~vmn~~~~~~l~ 279 (379) T protein:vir:10 237 ASTEII-----------TNKNKVE---------------MLINEIAKQE----------NL-DFPVTAIVLRPTDYYDIL 279 (379) T ss_pred cccccc-----------cCcccHH---------------HHHHHHHhhh----------hc-cCCCCEEEEcHHHHHHHH Confidence 443211 0001111 1111111110 11 112346999999876653 Q ss_pred ccccccCCCCccccc-----------cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechh--hhhcCcEEEE Q lcl|Aclame:pro 288 AKFTSRNQFGEYVTV-----------LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQT--FAMEDLQLYL 354 (377) Q Consensus 288 ~~~~~~~~~G~~~~~-----------l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~--~f~~~~~~~~ 354 (377) . .++.+|.|+.. ..+|+||+++++||+++++||||++|.+.+|.++.|+++.+. 
+|.+|++.|| T Consensus 280 ~---lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r 356 (379) T protein:vir:10 280 V---TQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITAR 356 (379) T ss_pred H---hhccCCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEEE Confidence 2 24555655411 235889999999999999999999999999999999888765 5999999999 Q ss_pred EEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 355 TKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 355 ~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.+|+|+++.+|+|||.++++|= T Consensus 357 ~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 357 IEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred EEEEeccEEecCccEEEEEecCC Confidence 99999999999999999999999 No 89 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=4.7e-48 Score=280.06 Aligned_cols=282 Identities=12% Similarity=0.082 Sum_probs=206.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHH--------HHhccCCCCCceeccHHHHHHHHHHHHhh Q lcl|Aclame:pro 35 FEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFND--------IDKNVGGKDKFKLLPEETMVQVFDDLVAE 106 (377) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~--------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~ 106 (377) +++ .+.+..+.+++.+. ..+.....++|.+||++++++|++.+++. 
T Consensus 1 ~~~--------------------------~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~ 54 (324) T protein:vir:96 1 MEQ--------------------------TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMEN 54 (324) T ss_pred CCc--------------------------chhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhh Confidence 000 01111111111111 11123345677799999999999999999 Q ss_pred hhhhhhceeEecCC-ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHH Q lcl|Aclame:pro 107 HPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQ 185 (377) Q Consensus 107 s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~ 185 (377) ++|+++++++|+++ .+++|+.++.+.+.|++|.+.. ++++++|+++++.+++++++++||+|||+||.+++++||.++ T Consensus 55 s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~ 133 (324) T protein:vir:96 55 SKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPM 133 (324) T ss_pred chhhhhcceeeccCCceEEEEEecCcceeeecCCccc-cccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHH Confidence 99999999999875 5899999988999999876555 578899999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcceeeccCCC-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 186 LKEAIAVALELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDK 264 (377) Q Consensus 186 la~~~a~~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 264 (377) +++++++++|+++|+|+|++ .|.|+++........... ..++ ..+.+.+.. +. 
T Consensus 134 l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~---~~~~---------------~~i~~~~~~----i~---- 187 (324) T protein:vir:96 134 IAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKG---DFTQ---------------DNIIDLEAL----LE---- 187 (324) T ss_pred HHHHHHHHHHHHhhhcCCCCCcCccccccccccceeccc---ccch---------------HHHHHHHHh----hh---- Confidence 99999999999999999975 678887643332211111 1111 111111111 10 Q ss_pred hhhhcccCceEEEeccchhhhhcccccccCCCCccc-------cccCCCceEEecCC--CCcceEEEEecccEEEEecce Q lcl|Aclame:pro 265 KHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV-------TVLPHGITILESLA--VETGKAIAFVANRYDAFMATA 335 (377) Q Consensus 265 ~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~-------~~l~~~~~v~~s~~--~~~~~ii~gd~s~y~~~~~~~ 335 (377) . .......|+|||+++..+.. .++.+|.|. ++ +|+||+.++. ++.+.++||||++++++++++ T Consensus 188 --~-~~~~~~~~i~n~~~~~~L~~---lkd~~G~~~~~~~~~~~l--~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~ 259 (324) T protein:vir:96 188 --D-DELEANAFISKTQNRSLLRK---IVDPETKERIYDRNSDSL--DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL 259 (324) T ss_pred --h-ccCCCCEEEEcHHHHHHHHH---hhCCCCCeeecCCCCCcc--cceeeEeecCCCCCcceEEEEecceEEEEEecC Confidence 0 11233469999999766542 244555543 33 4566666544 556679999999999999999 Q ss_pred eeEEeechhh--------------hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 336 STIEEYDQTF--------------AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 336 ~~i~~~~~~~--------------f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++|+.+++.. 
|.+|++.||+.+|+|+++++|+||++|+.+.+ T Consensus 260 ~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 260 IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred cEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 9999998753 88999999999999999999999999998877 No 90 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=3.2e-48 Score=280.94 Aligned_cols=331 Identities=11% Similarity=0.028 Sum_probs=219.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHH----hccCH--HHHHHHHHHHHHHHHHHHHHHHH--HH-------HHHHHHhccc-- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKIS----AGATP--EEQEKLFEAAFTTMGDEILAKNE--EE-------MERMFDLRDK-- 63 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--~~-------~~~~~~~~~~-- 63 (377) |..++++|.++..++.+..+.+. ....+ .++.+......+.+.+++..... .. ..+....... T Consensus 3 ~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (421) T protein:vir:13 3 LFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRV 82 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Confidence 88888777665554444433322 21111 11111122222222222221110 00 0010000000 Q ss_pred -cccccH-----HHHHHHH----------HHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEE Q lcl|Aclame:pro 64 -NRELTA-----EEIKFFN----------DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALT 126 (377) Q Consensus 64 -~~~lt~-----~e~~~~~----------~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~ 126 (377) ...... .+++.|. 
..+...++++||++||++++..|++.+++.++|+++|+++|++ +..++|+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~ 162 (421) T protein:vir:13 83 IINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPV 162 (421) T ss_pred ccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEE Confidence 001111 1122221 1233456678999999999999999999999999999999987 4578888 Q ss_pred EcCCcc--eeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC Q lcl|Aclame:pro 127 AETSGT--AVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGL 204 (377) Q Consensus 127 ~~~~~~--a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~ 204 (377) ...... +.|++|.++. ++++++|++|++.+++++++++||+|||+||.+++++||.++|++++++++|.+++ T Consensus 163 ~~~~~~~~~~~~~E~~~~-~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~----- 236 (421) T protein:vir:13 163 RAGASVDKLANLAKDTEL-VKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIV----- 236 (421) T ss_pred eecCCccceeeccccccc-cccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHh----- Confidence 766554 4567665554 56789999999999999999999999999999999999999999999998876655 Q ss_pred CcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhh Q lcl|Aclame:pro 205 LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRW 284 (377) Q Consensus 205 ~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 284 (377) ++|.||++..+. .++ ..+.+.+..+. .....+..|+|||.++. 
T Consensus 237 ~~~~g~~~~~~~-----------~~~---------------d~i~~~~~~l~-----------~~~~~~a~~v~n~~~~~ 279 (421) T protein:vir:13 237 KQAKAVLAEETI-----------NDY---------------AGLVKTINSLV-----------PNARKRAIIVTNSDGRA 279 (421) T ss_pred hhhhhccccccc-----------cch---------------HHHHHHHHHhh-----------hhhcCCCEEEEcHHHHH Confidence 678888743221 111 11122222111 11234568999999876 Q ss_pred hhcccccccCCCCccccc--------cCCCceEEecCCCCcc-----eEEEEeccc-EEEEecceeeEEeechhhhhcCc Q lcl|Aclame:pro 285 TLEAKFTSRNQFGEYVTV--------LPHGITILESLAVETG-----KAIAFVANR-YDAFMATASTIEEYDQTFAMEDL 350 (377) Q Consensus 285 ~~~~~~~~~~~~G~~~~~--------l~~~~~v~~s~~~~~~-----~ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~ 350 (377) .+.. .++.+|.|+.. ..+|+||+.+++++.+ .++||||++ |.+++|++++|+++++.+|.+|+ T Consensus 280 ~l~~---lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~ 356 (421) T protein:vir:13 280 YLDG---LMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNE 356 (421) T ss_pred HHHH---hhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCe Confidence 6542 35667766521 2357788888887743 389999998 77899999999999999999999 Q ss_pred EEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 351 QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 351 ~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.||+..|+||++++++||+.+.+.-- T Consensus 357 ~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (421) T protein:vir:13 357 TIARIIERFDVNSPLDKSSDAEKIRKF 383 (421) T ss_pred eEEEEEeeecceeecchhhheeeeccc Confidence 999999999999999999887766542 No 91 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=1.3e-46 Score=272.22 Aligned_cols=326 Identities=12% Similarity=0.064 Sum_probs=205.8 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccC---HHHHHHHHHHHHHHHHHHHHH------HHHHHH---HHHHHh-------- Q lcl|Aclame:pro 1 
MAINLKELPKYREAVAELSAKISAGAT---PEEQEKLFEAAFTTMGDEILA------KNEEEM---ERMFDL-------- 60 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~------~~~~~~---~~~~~~-------- 60 (377) |..++++|.+..+++.+..+.+..... .+++....++..+.+..++.. +...+. ...... T Consensus 15 l~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~~~~~~ 94 (397) T protein:vir:96 15 RSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAKAADPTDQK 94 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Confidence 222233333332222222222221111 111111111111111111110 000000 000000 Q ss_pred -----------ccccccccHHHHHHHHH--------HHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC- Q lcl|Aclame:pro 61 -----------RDKNRELTAEEIKFFND--------IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL- 120 (377) Q Consensus 61 -----------~~~~~~lt~~e~~~~~~--------~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~- 120 (377) .........+.+..+.. .....+..++|++||+++.+.|++ ++..+++++.|+++++++ T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~ 173 (397) T protein:vir:96 95 PKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSA 173 (397) T ss_pred hHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhcccccc Confidence 00000011112222221 123345678999999999999998 678889999999998764 Q ss_pred ceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhccee Q lcl|Aclame:pro 121 RLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIV 199 (377) Q Consensus 121 ~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l 199 (377) ...+|+.. 
++..+.|+.|+++.++.++++|++|++.++++++++++|++||+||.+++++||.++|+++++.+++.+|+ T Consensus 174 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~ 253 (397) T protein:vir:96 174 SGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIA 253 (397) T ss_pred ceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 56777654 34667888888887767789999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 200 KGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 200 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) +|+|+++|.|+.+. +. +.+.+... .....+..|+|| T Consensus 254 ~g~g~~~~~~~~~~-----------------d~---------------~~~~~~~~------------~~~~~~a~~v~n 289 (397) T protein:vir:96 254 AVLKTATAKSVVGV-----------------DG---------------LKDLINKE------------IKKVYDVKLFIS 289 (397) T ss_pred hcccccccccccch-----------------HH---------------HHHHHHHh------------hhhhcCcEEEEc Confidence 99999988776421 10 01111100 001124679999 Q ss_pred cchhhhhcccccccCCCCcccc---------ccCCCceEEecCCC-C-----cceEEEEeccc-EEEEecceeeEEeech Q lcl|Aclame:pro 280 PEDRWTLEAKFTSRNQFGEYVT---------VLPHGITILESLAV-E-----TGKAIAFVANR-YDAFMATASTIEEYDQ 343 (377) Q Consensus 280 ~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~~v~~s~~~-~-----~~~ii~gd~s~-y~~~~~~~~~i~~~~~ 343 (377) |+++..+.. .++.+|.|+. 
...+|+||++++.+ + +..++|||||+ |.+++|+++++..+++ T Consensus 290 ~~~~~~l~~---lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~ 366 (397) T protein:vir:96 290 ASMYSELDK---LKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDN 366 (397) T ss_pred HHHHHHHHH---hhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEecc Confidence 999766543 2455665541 12245666654332 2 23489999997 6789999999999998 Q ss_pred hhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 344 TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 344 ~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+| ++++|+++|+||+|++|+||++|+++++ T Consensus 367 ~~~---~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 367 NIY---GQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ccc---ceeEEEEEEEccEEecccceEEEEeecC Confidence 766 4589999999999999999999999999 No 92 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.5e-47 Score=277.23 Aligned_cols=267 Identities=13% Similarity=0.018 Sum_probs=198.0 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeecc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 157 (377) +..+ ++.+|++||++++.+|++.+++.|+++++|+++|++ ++.++|+.++.+.+.|++|+++. 
++++++|+++++.+ T Consensus 1 ma~~-t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~-~~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEA-QLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKK-THGGVSLDPVTIVP 78 (300) T ss_pred Cccc-ccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCccc-ccccccceeeEeee Confidence 4433 445677999999999999999999999999999986 56899999999999999876555 57889999999999 Q ss_pred eeEEEeehhhHHHH---hcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC--Ccce---eeeeccccccccccccccccc Q lcl|Aclame:pro 158 FKLTAFVVIPKDAL---KFGPKWLKQFITEQLKEAIAVALELAIVKGNGL--LQPV---GLLKDLSQPTVDQSTGRDITT 229 (377) Q Consensus 158 ~k~~~~~~iS~ell---~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~--~~P~---Gil~~~~~~~~~~~~~~~~~~ 229 (377) ||++++++||+||| .|+.++++++|+++|++++++++|++|++|++. +++. |............. ..... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~- 156 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVP-FKDTN- 156 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeec-ccccc- Confidence 99999999999999 477899999999999999999999999999653 3333 32211111110000 00000 Q ss_pred cchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-------- Q lcl|Aclame:pro 230 YKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-------- 301 (377) Q Consensus 230 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-------- 301 (377) .+..+..++..... . ......|+|||.++..+.. 
.++.+|+|+- T Consensus 157 ------------------~~~~i~~~~~~~~~------~-~~~~~~~vmn~~~~~~L~~---lkd~~G~~i~~~~~~~~~ 208 (300) T protein:vir:95 157 ------------------PDESMEDAVGMIDG------S-ERDITGAILDPIFTTALSK---MKNAEGGKLYPELAWGGV 208 (300) T ss_pred ------------------hHHHHHHHHHHhhh------c-CCCccEEEECHHHHHHHHH---hhccCCCeeccCccccCC Confidence 01111111111110 0 1122369999999876542 2455555431 Q ss_pred -ccCCCceEEecCCCCcce------EEEEecccE-EEEecceeeEEeechh--------hhhcCcEEEEEEEEEcCEEec Q lcl|Aclame:pro 302 -VLPHGITILESLAVETGK------AIAFVANRY-DAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKAKD 365 (377) Q Consensus 302 -~l~~~~~v~~s~~~~~~~------ii~gd~s~y-~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~dg~~~~ 365 (377) ...+|+||+.++++|.+. +++|||+++ .++.|++++++++++. .|.+|++.||+.+|+|+++.+ T Consensus 209 ~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~ 288 (300) T protein:vir:95 209 PDAINGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMD 288 (300) T ss_pred CceecceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeec Confidence 112577889999987543 688999985 5899999999988653 499999999999999999999 Q ss_pred ccceEEEEeecC Q lcl|Aclame:pro 366 NHTAALLTLAGG 377 (377) Q Consensus 366 ~~af~~l~~~a~ 377 (377) |+||++|+-+|| T Consensus 289 ~~a~~~l~~~~g 300 (300) T protein:vir:95 289 AASFARIVKTGG 300 (300) T ss_pred ccceEEEecCCC Confidence 999999999999 No 93 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=2.6e-46 Score=270.53 Aligned_cols=329 Identities=11% Similarity=-0.007 Sum_probs=194.6 Q ss_pred CCccHH---------------------HHHHHHHHHHHHHHHHHhc---cCHHH--HHHHHHH------------HHHHH Q lcl|Aclame:pro 1 MAINLK---------------------ELPKYREAVAELSAKISAG---ATPEE--QEKLFEA------------AFTTM 42 (377) Q Consensus 1 
m~~~~~---------------------~l~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~------------~~~~~ 42 (377) +..+.+ ++++..++..++.+.+.+. ..+.. .....+. ..... T Consensus 10 l~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~ 89 (437) T protein:vir:10 10 LATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPELEENSA 89 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 1111110101111111000 00000 0000000 00000 Q ss_pred -------H---HHHHHHHHHHHHHHHHh------------ccccccccHHHHHHHHH--------HHhccCCCCCceecc Q lcl|Aclame:pro 43 -------G---DEILAKNEEEMERMFDL------------RDKNRELTAEEIKFFND--------IDKNVGGKDKFKLLP 92 (377) Q Consensus 43 -------~---~~~~~~~~~~~~~~~~~------------~~~~~~lt~~e~~~~~~--------~~~~~~~s~gg~lvP 92 (377) . .+............... ..........+++.+.. .....++++||++|| T Consensus 90 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp 169 (437) T protein:vir:10 90 DNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIP 169 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccch Confidence 0 00000000000000000 00000011111122211 122346778999999 Q ss_pred HHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEc-CCcceeeecccccccccccccceeEeecceeEEEeehhhHHH Q lcl|Aclame:pro 93 EETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA 170 (377) Q Consensus 93 ~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~el 170 (377) +++.+.|.+ +++.++|+++|++++++ +..++|+.. 
..+.+.|+.|++..++.++++|++|++.+++++++++||+|| T Consensus 170 ~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~el 248 (437) T protein:vir:10 170 ETILTPEKE-VHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQEL 248 (437) T ss_pred HHHHHHHHH-hhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhheeeehhhhHHH Confidence 999887654 68889999999999876 567888875 446789998888887667799999999999999999999999 Q ss_pred HhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHH Q lcl|Aclame:pro 171 LKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVE 250 (377) Q Consensus 171 l~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 250 (377) |+||.++|++||.++|+++++.+++.+|++|+|+++|.+.... .... +.+ T Consensus 249 l~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~---------------~~~~---------------~~~ 298 (437) T protein:vir:10 249 ISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTY---------------LLGD---------------LKK 298 (437) T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc---------------chhh---------------HHH Confidence 9999999999999999999999999999999998876543110 0000 001 Q ss_pred HHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccc-----------cccCCCceEEecC--CCCc Q lcl|Aclame:pro 251 LLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV-----------TVLPHGITILESL--AVET 317 (377) Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~s~--~~~~ 317 (377) .+.. .....+..+.+|+|||+++..+.. .++.+|.|+ +++ |+||++++ .+|. 
T Consensus 299 ~~~~----------~l~~~~~~~~~~~~~~~~~~~l~~---lkd~~g~~~~~~~~~~~~~~~l~--G~pv~~~~~~~~~~ 363 (437) T protein:vir:10 299 VLNV----------TLKPQDSAAASIVMSQSAYNLFDM---ATDAMGRPLLQPNVTAATGYTLL--GKTVVIVDDKLFPS 363 (437) T ss_pred HHHh----------hhhhhhhcCCEEEEcHHHHHHHHH---hhccCCCeeeccCccCCCCcccc--cceeEEecccccCC Confidence 0000 001122345689999988655432 234455443 344 55555543 3343 Q ss_pred ---ce--EEEEeccc-EEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 318 ---GK--AIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 318 ---~~--ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ++ ++||||++ |.+++|.++++..+++ |..+.+.+++.+|+||++++|+|||+|+.+.. T Consensus 364 ~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~--~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 364 ASAGDVNIVVAPLKKAVINFKLTEITGQFQDT--YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred cCCCceEEEEeeccccEEEEeeeceEEEEecc--cccccceeeEEEEEccEEecccceEEEEeecc Confidence 22 89999997 6689999999987764 55667799999999999999999999997755 No 94 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=3.8e-46 Score=269.58 Aligned_cols=265 Identities=12% Similarity=-0.008 Sum_probs=198.3 Q ss_pred ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeeccee Q lcl|Aclame:pro 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFK 159 (377) Q Consensus 81 ~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k 159 (377) -++.+.||++||++++++|++.+++.|+|+++|+++|++ +..++|+.++.+.+.|++|.++. 
++++++|+++++.+|| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~-~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKK-THGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccc-cccccceeeEEeeeEE Confidence 456677899999999999999999999999999999997 56999999999999999876665 5788999999999999 Q ss_pred EEEeehhhHHHH---hcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcce-----ee--eeccccccccccccccccc Q lcl|Aclame:pro 160 LTAFVVIPKDAL---KFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPV-----GL--LKDLSQPTVDQSTGRDITT 229 (377) Q Consensus 160 ~~~~~~iS~ell---~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~-----Gi--l~~~~~~~~~~~~~~~~~~ 229 (377) +++++++|+||| .|+.+++++||.+++++++++++|.+|++|+|.+.+. |+ +..........+ .... T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~- 156 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFT--ESED- 156 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccc--cccc- Confidence 999999999999 4778999999999999999999999999997643322 21 111111000000 0000 Q ss_pred cchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc-------- Q lcl|Aclame:pro 230 YKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT-------- 301 (377) Q Consensus 230 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~-------- 301 (377) .+..+..++.... . .......|+|||.+++.+.. .++.+|.|.. 
T Consensus 157 ------------------~~~~i~~~~~~~~------~-~~~~~~~~vmn~~~~~~L~~---lkd~~g~~~~~~~~~~~~ 208 (303) T protein:vir:97 157 ------------------ADANIEAAVNLIQ------G-AEGVVTGLAMDTEFSTALAK---VTNGEMGPKMYPELAWGA 208 (303) T ss_pred ------------------hHHHHHHHHHHHh------h-cCCCccEEEEcHHHHHHHHH---hhccCCCeEEecCccCCC Confidence 0111222211110 0 01122359999999876642 2345554331 Q ss_pred --ccCCCceEEecCCCCcc--------eEEEEeccc-EEEEecceeeEEeechh--------hhhcCcEEEEEEEEEcCE Q lcl|Aclame:pro 302 --VLPHGITILESLAVETG--------KAIAFVANR-YDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGK 362 (377) Q Consensus 302 --~l~~~~~v~~s~~~~~~--------~ii~gd~s~-y~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~dg~ 362 (377) ...+|+||+.+++||.+ .++||||+. |.++.|++++++.+++. .|.+|++.||+.+|+|++ T Consensus 209 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~ 288 (303) T protein:vir:97 209 NPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWG 288 (303) T ss_pred CCceecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccE Confidence 12357899999998753 379999965 88999999999987653 499999999999999999 Q ss_pred EecccceEEEEeecC Q lcl|Aclame:pro 363 AKDNHTAALLTLAGG 377 (377) Q Consensus 363 ~~~~~af~~l~~~a~ 377 (377) +++|+||++|+-+-= T Consensus 289 v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 289 ILDAKSFARVTKGEV 303 (303) T ss_pred eecccceEEeeCCCC Confidence 999999999986655 No 95 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=5.2e-47 Score=274.34 Aligned_cols=258 Identities=12% Similarity=0.081 Sum_probs=197.8 Q ss_pred HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEEEEc-CCcceeeecccccccccccccc Q lcl|Aclame:pro 75 FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS---LRLKALTAE-TSGTAVWGDIFGEIKGQLKQAF 150 (377) Q Consensus 75 
~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~---~~~~~p~~~-~~~~a~w~~e~~~~~~~~~~~f 150 (377) .-+.+..+++++||++||++++++|++.++++++|+++|+++|++ +...+|... ..+.+.|++|.++.++.++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 223345677788999999999999999999999999999999875 346677654 4577999988888766567999 Q ss_pred eeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccccccccc Q lcl|Aclame:pro 151 KEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTY 230 (377) Q Consensus 151 ~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) +++++.++|++++++||+|+++|+.+++++||.+++++++++++|++|++|+|+..+. ....++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~----------------~~~~~~ 144 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK----------------PTLTKW 144 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc----------------ccccCH Confidence 9999999999999999999999999999999999999999999999999998864321 011111 Q ss_pred chhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccc---------- Q lcl|Aclame:pro 231 KTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV---------- 300 (377) Q Consensus 231 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~---------- 300 (377) + .+.+.+..+ ......+..|+|||+++..+.. 
.++.+|.|+ T Consensus 145 d---------------~i~~~~~~l-----------~~~~~~~a~~vmn~~~~~~L~~---lkd~~g~~l~~~~~~~~~~ 195 (293) T protein:vir:48 145 D---------------DIIDLEAKV-----------DPAIKQTSFFLTNTSGFTALKK---VKNALGDYLMERDVKSPTG 195 (293) T ss_pred H---------------HHHHHHHhh-----------hhhhcCCCEEEEcHHHHHHHHH---hhccCCceEeecCcCCCCC Confidence 1 111111111 0112345689999998766532 234444443 Q ss_pred -cccCCCceEEecCCCCc-----ceEEEEeccc-EEEEecceeeEEeech--hhhhcCcEEEEEEEEEcCEEecccceEE Q lcl|Aclame:pro 301 -TVLPHGITILESLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQ--TFAMEDLQLYLTKNYFYGKAKDNHTAAL 371 (377) Q Consensus 301 -~~l~~~~~v~~s~~~~~-----~~ii~gd~s~-y~~~~~~~~~i~~~~~--~~f~~~~~~~~~~~r~dg~~~~~~af~~ 371 (377) +++|+|+.++.+..+|. ..++||||++ |.+++|++++++.+++ .+|.+|++.||+.+|+|+++++++||++ T Consensus 196 ~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 275 (293) T protein:vir:48 196 YSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVP 275 (293) T ss_pred ceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEE Confidence 45555544444545543 2489999998 6789999999999875 4799999999999999999999999999 Q ss_pred EEeecC Q lcl|Aclame:pro 372 LTLAGG 377 (377) Q Consensus 372 l~~~a~ 377 (377) +++++. 
T Consensus 276 l~~~~~ 281 (293) T protein:vir:48 276 ASFKAI 281 (293) T ss_pred EEeecc Confidence 999997 No 96 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=2.3e-46 Score=270.83 Aligned_cols=268 Identities=11% Similarity=-0.010 Sum_probs=197.8 Q ss_pred CCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeecceeEE Q lcl|Aclame:pro 83 GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLT 161 (377) Q Consensus 83 ~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~ 161 (377) -..+||++||++++++|++.+++.++++++|+++|++ +..++|+.++.+.+.|++|.++. ++++++|+++++.++|++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~-~~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKK-THGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccc-cccccceeEEEEeeeeEE Confidence 4456789999999999999999999999999999986 56999999999999999876655 577899999999999999 Q ss_pred EeehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHhhcceeeccC--CCcceeeeeccccccccccccccccccchhhhh Q lcl|Aclame:pro 162 AFVVIPKDALK---FGPKWLKQFITEQLKEAIAVALELAIVKGNG--LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEA 236 (377) Q Consensus 162 ~~~~iS~ell~---ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G--~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (377) ++++||+|||. |+.+++++||.++|++++++++|.+|++|+| +++|.++......... .+......... 
T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-~~~~~~~~~~~----- 153 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK-VTQKVEAPRGI----- 153 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccc-ccccccccccc----- Confidence 99999999994 6678999999999999999999999999964 4555544321111100 00000000000 Q ss_pred hhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc---------ccCCCc Q lcl|Aclame:pro 237 IADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLPHGI 307 (377) Q Consensus 237 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~~~~ 307 (377) ...+..+..+...... .......|+|||.++..+.. .++.+|.|+. ...+|+ T Consensus 154 ---------~~~~~~i~~~~~~~~~-------~~~~~~~~vmn~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~ 214 (298) T protein:vir:16 154 ---------ADPNGAIENAVELLTG-------VDADVTGIAINPSFRSALAK---QKDLQDNALFPELKWGATPDTINGL 214 (298) T ss_pred ---------ccHHHHHHHHHHHhhh-------cCCCccEEEEcHHHHHHHHH---hhccCCCeeecCcccCCCCceecce Confidence 0001111111111100 01123359999998876532 2455565541 122578 Q ss_pred eEEecCCCCcc------eEEEEecccE-EEEecceeeEEeechh--------hhhcCcEEEEEEEEEcCEEecccceEEE Q lcl|Aclame:pro 308 TILESLAVETG------KAIAFVANRY-DAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKAKDNHTAALL 372 (377) Q Consensus 308 ~v~~s~~~~~~------~ii~gd~s~y-~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~dg~~~~~~af~~l 372 (377) ||+.++++|++ .++||||+++ .++.+++++++++++. .|.+|++.||+.+|+|+++++|+||++| T Consensus 215 PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l 294 (298) T protein:vir:16 215 PVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARV 294 (298) T ss_pred eeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEE Confidence 89999988853 4889999984 5889999999887653 4899999999999999999999999999 Q ss_pred Eeec Q lcl|Aclame:pro 373 TLAG 376 (377) Q Consensus 373 ~~~a 376 (377) +-+. 
T Consensus 295 ~~at 298 (298) T protein:vir:16 295 TEAN 298 (298) T ss_pred eecC Confidence 9988 No 97 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.6e-44 Score=260.62 Aligned_cols=265 Identities=14% Similarity=0.037 Sum_probs=195.5 Q ss_pred CCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeecceeEE Q lcl|Aclame:pro 83 GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLT 161 (377) Q Consensus 83 ~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~ 161 (377) -+.+||++||+++.++|++.+++.++|+++|++++++ ++.++|+.++.+.+.|++|+++. ++++++|+++++.++|++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~-~~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKK-THGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccc-cccccceeEEEEeeeEEE Confidence 4446799999999999999999999999999999987 56899999999999999876655 578999999999999999 Q ss_pred EeehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHhhcceeeccC--CCc---ceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 162 AFVVIPKDALK---FGPKWLKQFITEQLKEAIAVALELAIVKGNG--LLQ---PVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 162 ~~~~iS~ell~---ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G--~~~---P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++++||+|||+ |+..+++++|.++|++++++++|.+|++|++ +++ +.|+.......+.......... 
T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~----- 154 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIA----- 154 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccc----- Confidence 99999999995 5678999999999999999999999999953 332 2222111111111100000000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCcccc---------ccC Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVT---------VLP 304 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~---------~l~ 304 (377) ..+..+..++..+.. .......|+|||+++..+.. .++.+|.|+- ... T Consensus 155 -------------~~~~~i~~~~~~~~~-------~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~tl 211 (298) T protein:vir:94 155 -------------DPNGAIENAVELLTG-------VDADVTGIAINPSFRSALAK---QKDLQGNALFPELKWGATPDTI 211 (298) T ss_pred -------------cHHHHHHHHHHhhhh-------cCCCccEEEEcHHHHHHHHH---hhccCCCeeecCcccCCCCcee Confidence 001111111111110 01123469999998876543 2445555431 012 Q ss_pred CCceEEecCCCCcc------eEEEEeccc-EEEEecceeeEEeechh--------hhhcCcEEEEEEEEEcCEEecccce Q lcl|Aclame:pro 305 HGITILESLAVETG------KAIAFVANR-YDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKAKDNHTA 369 (377) Q Consensus 305 ~~~~v~~s~~~~~~------~ii~gd~s~-y~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~dg~~~~~~af 369 (377) +|+||+.++++|.+ .++||||++ |.++.+++++++++++. .|.+|++.||+.+|+|+++.+|+|| T Consensus 212 ~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~ 291 (298) T protein:vir:94 212 NGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKF 291 (298) T ss_pred cceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccce Confidence 57889999988753 488999998 55899999999887642 5899999999999999999999999 Q ss_pred EEEEeec Q lcl|Aclame:pro 370 ALLTLAG 376 (377) Q Consensus 370 ~~l~~~a 376 (377) ++|+-+. 
T Consensus 292 ~~l~~~t 298 (298) T protein:vir:94 292 ARVTEAN 298 (298) T ss_pred EEEEecC Confidence 9999888 No 98 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=7.3e-45 Score=262.54 Aligned_cols=271 Identities=14% Similarity=0.002 Sum_probs=198.2 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEeecc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQ 157 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~ 157 (377) +. +.++++|++||++++++|++.+++.++|+++|+++|++ +..++|+.++.+.+.|++|.++.+ +++++|+++++.+ T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKS-STTGEFDFVTSTP 78 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccc-cccceeeEEEEee Confidence 33 45577899999999999999999999999999999987 579999999999999998776654 6789999999999 Q ss_pred eeEEEeehhhHHHH---hcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeee---eccccccccccccccccccc Q lcl|Aclame:pro 158 FKLTAFVVIPKDAL---KFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLL---KDLSQPTVDQSTGRDITTYK 231 (377) Q Consensus 158 ~k~~~~~~iS~ell---~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil---~~~~~~~~~~~~~~~~~~~~ 231 (377) +|++++++||+||| .|+.++|++||+++|++++++++|++|++|+|++++.++. +.....+...+....... 
T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~-- 156 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIA-- 156 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccc-- Confidence 99999999999999 4778999999999999999999999999999977655442 222211111111111000 Q ss_pred hhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccc--------- Q lcl|Aclame:pro 232 TDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTV--------- 302 (377) Q Consensus 232 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~--------- 302 (377) .....+..++...... .... ....|+|||.++..+.. .++.+|.|+.. T Consensus 157 ---------------~~~~~i~~~~~~~~~~----~~~~-~~~~~vmn~~~~~~L~~---lkd~~G~~l~~~~~~~~~~~ 213 (311) T protein:vir:99 157 ---------------NPDLAIEAAVGLLVAN----GHPT-PVNGLALHPSIAWGLST---ARYTDGRKKFPELGLGIGVS 213 (311) T ss_pred ---------------hhHHHHHHHHHHHhhh----ccCC-CccEEEEcHHHHHHHHh---hhccCCCeeecCcccCCCCc Confidence 0000111111110000 0001 12249999998766532 34566665410 Q ss_pred cCCCceEEecCCCCc----------------ceEEEEeccc-EEEEecceeeEEeechh-------hhhcCcEEEEEEEE Q lcl|Aclame:pro 303 LPHGITILESLAVET----------------GKAIAFVANR-YDAFMATASTIEEYDQT-------FAMEDLQLYLTKNY 358 (377) Q Consensus 303 l~~~~~v~~s~~~~~----------------~~ii~gd~s~-y~~~~~~~~~i~~~~~~-------~f~~~~~~~~~~~r 358 (377) ..+|+||+.++.+|. ..+++|||++ +.++.+++++++++++. 
.|.+|++.||+.+| T Consensus 214 ~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r 293 (311) T protein:vir:99 214 SFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIV 293 (311) T ss_pred eecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEe Confidence 225778888877652 1268899998 55889999999887653 38999999999999 Q ss_pred EcCEEecccceEEEEeec Q lcl|Aclame:pro 359 FYGKAKDNHTAALLTLAG 376 (377) Q Consensus 359 ~dg~~~~~~af~~l~~~a 376 (377) +|+++.+++++++++.+| T Consensus 294 ~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 294 YGWYVFTDRFVVIENAVA 311 (311) T ss_pred ecceecChhHeeeecccC Confidence 999999988888888888 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.4e-38 Score=228.05 Aligned_cols=281 Identities=16% Similarity=0.084 Sum_probs=191.7 Q ss_pred cHHHHHHHHHHHh-ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-C-CceEEEEEcCC----cceeeecccc Q lcl|Aclame:pro 68 TAEEIKFFNDIDK-NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT-S-LRLKALTAETS----GTAVWGDIFG 140 (377) Q Consensus 68 t~~e~~~~~~~~~-~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~-~-~~~~~p~~~~~----~~a~w~~e~~ 140 (377) -++.|+.++.... ..++.+|||++|+++ +++++.+++.+++|++++++++ + ...++|....+ +...|.++. 
T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~- 78 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTK- 78 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCC- Confidence 2233444442222 234557999999887 5799999999999999999865 3 35778765432 233454433 Q ss_pred cccccccccceeEeecceeEEEeehhhHHHHhcCHH--HHHHHHHHHHHHHHHHHhhcceeeccCC--------Ccceee Q lcl|Aclame:pro 141 EIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPK--WLKQFITEQLKEAIAVALELAIVKGNGL--------LQPVGL 210 (377) Q Consensus 141 ~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~--~~~~~l~~~la~~~a~~~~~a~l~G~G~--------~~P~Gi 210 (377) +..++++++|++++|.+|++...++||+|+|+|+.+ ||+++|.+.|++++++.++.+|++|+|+ ++|.|| T Consensus 79 ~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 79 VAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred ccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 333568899999999999999999999999999976 9999999999999999999999999994 478999 Q ss_pred eeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc Q lcl|Aclame:pro 211 LKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF 290 (377) Q Consensus 211 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~ 290 (377) ++..+...+....... 
..+ .+.+..++..+ ..+.+...++.+|+||+.+...+...+ T Consensus 159 l~~a~~~~~~~~~~~~----------------~~~---~~~~~~l~~sl----~~~yr~~~~~~~~~m~~~t~~~~r~~l 215 (314) T protein:vir:41 159 MKLAGNQYTDAEPEDE----------------NWP---LNLFDGMMDEL----DTRYLQLKPRMKFYVSNEIYNGYRKQL 215 (314) T ss_pred hhhcccceeecCcccc----------------ccH---HHHHHHHHHhc----CchhhcCCCceEEEecHHHHHHHHHHH Confidence 9765433222111110 011 11122222111 111123345789999998875544322 Q ss_pred cccCC--------CCccccccCCCceEEecCCCC-----cceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEE Q lcl|Aclame:pro 291 TSRNQ--------FGEYVTVLPHGITILESLAVE-----TGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKN 357 (377) Q Consensus 291 ~~~~~--------~G~~~~~l~~~~~v~~s~~~~-----~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~ 357 (377) ..... .|.+.++ +|+||+..++|| ++.|+||||+.|+++++..+.+.+. ....++++.|.+.. T Consensus 216 ~~~~~~l~~~~~~~~~~~~l--~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~--~~a~~~~~~~~~~~ 291 (314) T protein:vir:41 216 LVRETGLGDSALIGATGLQY--DGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPK--RDAAMRRTEYIASL 291 (314) T ss_pred hccCCcccchhhhCCCCcee--cceeeEecccccccCCCCceEEEechhheEEEeeceeEEeec--ccCcCCeEEEEEEE Confidence 21111 1233333 467777777664 5679999999998888777766654 45679999999999 Q ss_pred EEcCEEecccceEEEEe---ecC Q lcl|Aclame:pro 358 YFYGKAKDNHTAALLTL---AGG 377 (377) Q Consensus 358 r~dg~~~~~~af~~l~~---~a~ 377 (377) |+|+.+.+.+|.++..+ +|| T Consensus 292 r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 292 RADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EeceEEEEcCcEEEEEeeccCCC Confidence 99999998877776654 355 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=4e-37 Score=220.12 Aligned_cols=279 Identities=15% Similarity=0.067 Sum_probs=184.1 Q ss_pred cccHHHH---HHHHHHHh-ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc--eEEEEEc-C---Ccceee Q 
lcl|Aclame:pro 66 ELTAEEI---KFFNDIDK-NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR--LKALTAE-T---SGTAVW 135 (377) Q Consensus 66 ~lt~~e~---~~~~~~~~-~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~--~~~p~~~-~---~~~a~w 135 (377) .|+-+.- +.++.... ..++.+||+++|++. +++++.+++.||+|++|++++..+. ..++... + .....| T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~ 79 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDE 79 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCccccccccc Confidence 2222211 11111111 234557899888775 5689999999999999998764433 3343321 1 122345 Q ss_pred ecccccccccccccceeEeecceeEEEeehhhHHHHhcCHH--HHHHHHHHHHHHHHHHHhhcceeeccCC------Ccc Q lcl|Aclame:pro 136 GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPK--WLKQFITEQLKEAIAVALELAIVKGNGL------LQP 207 (377) Q Consensus 136 ~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~--~~~~~l~~~la~~~a~~~~~a~l~G~G~------~~P 207 (377) .++. +..++++++|+++++.++++.+.+.||+++|+|+.+ |||+||.+++++++++.++.+|++|+|+ ++| T Consensus 80 ~~~~-~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~ 158 (315) T protein:vir:41 80 TGQK-LAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMS 158 (315) T ss_pred ccCc-CCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcccccc Confidence 5443 344567899999999999999999999999999964 9999999999999999999999999985 467 Q ss_pred eeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhc Q lcl|Aclame:pro 208 VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE 287 (377) Q Consensus 208 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~ 287 (377) .||++................... .+.+..+...+. .+.+....+.+|+||+.+...+. 
T Consensus 159 ~G~l~~a~~~~~~~~~~~~a~~~~-----------------~d~l~~l~~sl~----~~yr~~~~~~~~imn~~t~~~~r 217 (315) T protein:vir:41 159 DGWLKLASEKLTESDVDPEAEDWP-----------------MNLFDTMIESLP----TPYRNNLPNMKFYVTWDIYRAYR 217 (315) T ss_pred ccceeccccccccccccccccccc-----------------HHHHHHHHHhcC----hHHhhcCCceEEEEcHHHHHHHH Confidence 899986543322211111111100 011111111110 11122235789999999876543 Q ss_pred ccccccCCCC-----------ccccccCCCceEEecCCCC-----cceEEEEecccEEEEecceeeEEeechhhhhcCcE Q lcl|Aclame:pro 288 AKFTSRNQFG-----------EYVTVLPHGITILESLAVE-----TGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQ 351 (377) Q Consensus 288 ~~~~~~~~~G-----------~~~~~l~~~~~v~~s~~~~-----~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~ 351 (377) .. ++..| ++.+++ |+||+..++|| ++.|+||||++|+++++.++.++++.+. .++.+ T Consensus 218 kl---k~~~g~~lw~~~~~~g~~~tl~--G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a--~~~~~ 290 (315) T protein:vir:41 218 DA---LKGRETGLGDQALTGANSILYD--GRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDA--EMRLT 290 (315) T ss_pred HH---hccCCCccccchhhcCCCceec--ccceEecccccccCCCCccEEEecccceEEEeccccEEEeeecC--CCCce Confidence 32 22333 334444 56667777664 5669999999999999999999887654 46778 Q ss_pred EEEEEEEEcCEEecccc--eEEEEe Q lcl|Aclame:pro 352 LYLTKNYFYGKAKDNHT--AALLTL 374 (377) Q Consensus 352 ~~~~~~r~dg~~~~~~a--f~~l~~ 374 (377) .|....|+|+...+.++ .+++|+ T Consensus 291 ~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 291 KYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EEEEEEEeceeEEeccceeEeeeeC Confidence 89999999998887766 666677 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=4.2e-36 Score=214.54 Aligned_cols=290 Identities=14% Similarity=0.111 Sum_probs=195.1 Q ss_pred cccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeecccc Q lcl|Aclame:pro 62 
DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFG 140 (377) Q Consensus 62 ~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e~~ 140 (377) +..+.+...-++......-..++.++|++||+++.++|++.+++.++++++++++++++ ...+|....++.+.|..+++ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~ 80 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEG 80 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccccccccc Confidence 11111111111111111112345678899999999999999999999999999999864 56788776666667765433 Q ss_pred -cccccccccceeEeecceeEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHhhcceeeccCCCcc------eeee Q lcl|Aclame:pro 141 -EIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFITEQLKEAIAVALELAIVKGNGLLQP------VGLL 211 (377) Q Consensus 141 -~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P------~Gil 211 (377) .....++++|+++++.++++.+.++||+|+|+|+. +||+++|.+.++++|+..++.++++|+|+++| .||+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l 160 (321) T protein:vir:31 81 EWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFI 160 (321) T ss_pred ccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhh Confidence 33445779999999999999999999999999985 59999999999999999999999999998665 6887 Q ss_pred eccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc Q lcl|Aclame:pro 212 KDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT 291 (377) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 291 (377) +.+.........+.... + .+.+..+...+ .......++.+|+||+.+...+...+. 
T Consensus 161 ~~a~~~~~~~~~~~~~~---------------~----~d~l~~l~~~l-----~~~yr~~~~~v~im~~~~~~~~~~~l~ 216 (321) T protein:vir:31 161 TVAEGDVETIDAADDIL---------------D----NDLVIRTIAGL-----DSKYRARMNPALIVSEDQLLSYHYTLT 216 (321) T ss_pred hhhcccccccccccccc---------------C----HHHHHHHHHhc-----cHhHhcCCCeEEEechHHHHHHHHHHh Confidence 65433222111111100 0 01111111111 111223457899999988654443322 Q ss_pred ccCC--------CCccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhc-CcEEEEE--EEEEc Q lcl|Aclame:pro 292 SRNQ--------FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAME-DLQLYLT--KNYFY 360 (377) Q Consensus 292 ~~~~--------~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~-~~~~~~~--~~r~d 360 (377) ..+. .|.+.+ .+|+||+.+++||++.++|+||+.+.++.+.++.+.++.+..... ....|+. ..++| T Consensus 217 ~~~~~~~~~~l~~~~~~t--l~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (321) T protein:vir:31 217 DRDTPLGDNVIMGEADVN--PFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDD 294 (321) T ss_pred cCCCccccchhhcccccc--ccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecc Confidence 2221 112222 258889999999999999999999999999999998876654332 2344554 45677 Q ss_pred CEEecccceEEEE-eecC Q lcl|Aclame:pro 361 GKAKDNHTAALLT-LAGG 377 (377) Q Consensus 361 g~~~~~~af~~l~-~~a~ 377 (377) ..+-+.+|+++++ +.-. 
T Consensus 295 ~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 295 FAIENTEAVVLAEGLGDP 312 (321) T ss_pred eeEeccccEEEEecCCcc Confidence 8889999999998 3333 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=99.96 E-value=2.3e-31 Score=188.58 Aligned_cols=340 Identities=14% Similarity=0.087 Sum_probs=190.2 Q ss_pred CCccHH---HHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHH---H----HHHHHHHHHHH-HHHhcccccc--c Q lcl|Aclame:pro 1 MAINLK---ELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGD---E----ILAKNEEEMER-MFDLRDKNRE--L 67 (377) Q Consensus 1 m~~~~~---~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~~~~~~~~~-~~~~~~~~~~--l 67 (377) ++++.. +..+..+......++........+..+.++...+.+.+ + ..++...+.+. .......... - T Consensus 131 ~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~~~~~ 210 (517) T protein:vir:97 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccccccccch Confidence 221110 00000000000000000000000111111111111100 0 00000000000 0000000000 0 Q ss_pred cHHHHHHH------HHHHh-----------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCC Q lcl|Aclame:pro 68 TAEEIKFF------NDIDK-----------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETS 130 (377) Q Consensus 68 t~~e~~~~------~~~~~-----------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~ 130 (377) ..+....+ ..... .....-+|+++|+.+...|...+...+++++++++.+++ ...+|..... 
T Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~-~~~~~~~~~~ 289 (517) T protein:vir:97 211 ATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP-TLVVGGDNAL 289 (517) T ss_pred hhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeecccc-ceeeeccccc Confidence 00111110 00000 011223789999999999999999999999998876543 4567777766 Q ss_pred cceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHhhcceeeccCCCc Q lcl|Aclame:pro 131 GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKW----LKQFITEQLKEAIAVALELAIVKGNGLLQ 206 (377) Q Consensus 131 ~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~----~~~~l~~~la~~~a~~~~~a~l~G~G~~~ 206 (377) ..+.|+.+ ++.+++++++|+++++.++++++++++|++||+|+.+| |++||.++|+++|+++++.+||+|+|++. T Consensus 290 ~~a~~~~e-G~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~ 368 (517) T protein:vir:97 290 TQGTGHTT-GTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGV 368 (517) T ss_pred ceeeeeec-CCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCc Confidence 77778764 55667889999999999999999999999999999888 99999999999999999999999999874 Q ss_pred -ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhh Q lcl|Aclame:pro 207 -PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT 285 (377) Q Consensus 207 -P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~ 285 (377) +.|+++......... .. .+ . ...+.+..+.. ......+..|+|||.++.. 
T Consensus 369 ~~~gi~~~a~~~~~~~---~~-~~-~---------------~~~d~i~~l~~---------a~~~a~~a~~vmn~~t~~~ 419 (517) T protein:vir:97 369 SETQIYPVVGDAWATN---VT-GT-T---------------NIQELLEKLSV---------ATPKAADSTLVIHRNDLAA 419 (517) T ss_pred cccccccccccccccc---cc-cc-c---------------hHHHHHHHHHH---------HhhhccCCEEEECHHHHHH Confidence 568875322111100 00 00 0 01111111110 0011124679999999766 Q ss_pred hcccccccCCCCcccccc---------CCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEE Q lcl|Aclame:pro 286 LEAKFTSRNQFGEYVTVL---------PHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTK 356 (377) Q Consensus 286 ~~~~~~~~~~~G~~~~~l---------~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~ 356 (377) +. ..++.+|+|+-.- .+|+..+ .+.++.+...+++++.|.++.+.++++..+-+ ..+++..|... T Consensus 420 I~---klKD~~G~Yl~~~~~~~~~~~~l~G~~~~-~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd--~~~n~~~f~~~ 493 (517) T protein:vir:97 420 IR---FLKDKNGNYVFPVGVSNQTIATHFGFNRL-VQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTI--LVENNKEYLFE 493 (517) T ss_pred HH---HhhcCCCCeeccCcCCcccccccCCcccc-ccccccCceeEeeccccEEEeecceeeeeeee--cccCceeEeee Confidence 64 3467888886211 1222111 12334455667778889998888887644322 34788899999 Q ss_pred EEEcCEEecccceEEEEee---cC Q lcl|Aclame:pro 357 NYFYGKAKDNHTAALLTLA---GG 377 (377) Q Consensus 357 ~r~dg~~~~~~af~~l~~~---a~ 377 (377) +|++|.+..+++|++..+. 
|| T Consensus 494 ~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 494 MPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred eeeccccccccceEEEEEcCCCCC Confidence 9999999999999998875 55 No 103 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.91 E-value=3.4e-28 Score=171.18 Aligned_cols=321 Identities=12% Similarity=0.089 Sum_probs=155.3 Q ss_pred CCccH-----HHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHH---HHH---------HHHHHHHHHHHhccc Q lcl|Aclame:pro 1 MAINL-----KELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDE---ILA---------KNEEEMERMFDLRDK 63 (377) Q Consensus 1 m~~~~-----~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~---------~~~~~~~~~~~~~~~ 63 (377) ..++. ++.....+..+...+.........+.....+.......+. ... ....+.+.... . T Consensus 118 ~~vks~~~~~e~~~~~~e~~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~~~~~~~~~~~e~r~~~~---~ 194 (480) T protein:vir:40 118 TKVREENKGEQEQMGANETQEIMKQAIEAGVKVRELEAKVEELNKEREELKKEREASIPSEKPEDAERKFMRELGS---K 194 (480) T ss_pred hhhhhhhhhhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhccccchhhhhhHHHHHHHH---H Confidence 11111 0000000000000000000000000000000000000000 000 00000000000 0 Q ss_pred cccccHHHHHHHHHHH---hccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccc Q lcl|Aclame:pro 64 NRELTAEEIKFFNDID---KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFG 140 (377) Q Consensus 64 ~~~lt~~e~~~~~~~~---~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~ 140 (377) .+. ..+..++.... ..+....+++ +|+.+.+.+.......+++...+++...++. ...|+++.. 
T Consensus 195 ~~~--~~e~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~----------~~~~~~e~~ 261 (480) T protein:vir:40 195 MAE--MPEQGFLREFANGADLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAEDGVD----------DTFISGTFK 261 (480) T ss_pred hcc--chhhhhhhhhhhhccccccccccc-cccchhhheeechhhhhhhhhcceeeecccc----------ceeeeeeee Confidence 000 01111111111 1122233443 5555566655556666666666555433322 233433221 Q ss_pred ccccc-ccccceeEeec---ceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeecc--CCCcceeeeecc Q lcl|Aclame:pro 141 EIKGQ-LKQAFKEQDFS---QFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGN--GLLQPVGLLKDL 214 (377) Q Consensus 141 ~~~~~-~~~~f~~i~l~---~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~--G~~~P~Gil~~~ 214 (377) +...+ ...++.+.++. .+++++++++|.++|+|+. +|++||.++|++.|+.+++.+||+|+ |++.|.||.+.. T Consensus 262 ~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~ 340 (480) T protein:vir:40 262 AGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTAT 340 (480) T ss_pred cccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeec Confidence 11111 11234444444 5788999999999999986 89999999999999999999999995 455688876432 Q ss_pred ccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccC Q lcl|Aclame:pro 215 SQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN 294 (377) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~ 294 (377) ...+. ..+. ...+.. ++..+ ......+.+.|+|||.|...+.. 
.++ T Consensus 341 ~~~~~------~~~~----------------~d~id~---L~~al------~~~y~~~a~~~vmn~~t~~~I~k---lKD 386 (480) T protein:vir:40 341 DGWTK------QIEY----------------TDLFEG---ITDAV------AECSISDAITIVMSPQTFAELRK---AKG 386 (480) T ss_pred ccccc------cchh----------------HHHHHH---HHHhh------hHHhhCCCCEEEECHHHHHHHHH---hhc Confidence 21110 0000 011111 11111 01111234468888888665532 356 Q ss_pred CCCccc-----------cccCCCceEEe-cCCCCcceEEEEecccE-EEEecceeeEEeechhhhhcCcEEEEEEEEEcC Q lcl|Aclame:pro 295 QFGEYV-----------TVLPHGITILE-SLAVETGKAIAFVANRY-DAFMATASTIEEYDQTFAMEDLQLYLTKNYFYG 361 (377) Q Consensus 295 ~~G~~~-----------~~l~~~~~v~~-s~~~~~~~ii~gd~s~y-~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg 361 (377) .+|.|+ +++ |.||+. +..+|.+..++|.++.| .+++++ .+. .+..-+..++..|....|++| T Consensus 387 ~~G~Yi~q~~~~~~~~~~ll--G~pvv~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~--~~~~~~~~~~~~~~~e~~v~g 461 (480) T protein:vir:40 387 TDGHSRFNELATKEQIAQSF--GAVNLETRVWMPKDEVAVYNHDEYVLIGDLN-VEN--YNDFDLRYNVEQWLSETLVGG 461 (480) T ss_pred CCCCeeccCcccccCcceec--ccceeeeeccccCCcceeeeCCccEEEEecc-cce--ecccccccchhhhhhhhhhce Confidence 666664 344 455554 56788777777776665 567664 333 333334577778999999999 Q ss_pred EEecccceEEEEeecC Q lcl|Aclame:pro 362 KAKDNHTAALLTLAGG 377 (377) Q Consensus 362 ~~~~~~af~~l~~~a~ 377 (377) .+..|+|+++|+++++ T Consensus 462 ~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 462 SIRGKNRSAYLKKKGS 477 (480) T ss_pred eeEccccEEEEEeccC Confidence 9999999999999999 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.91 E-value=3.3e-26 Score=160.29 Aligned_cols=253 Identities=15% Similarity=0.083 Sum_probs=186.5 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----cCCc-eEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN----TSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 
~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~----~~~~-~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +..++++.+..++|+.+++.|++.+++.+.+.+++++.. .+|+ ++||+....+.+.|+.|+++. +.++++|+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i-~~~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAI-PMTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcc-cccccccceE Confidence 555566677889999999999999999988888877532 2343 899998888899999876555 5678999999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++.+++++..+++|+++..++..|+.+++.+.+++++++.+|+.++..- +. ....... ..++ T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~---------~~---a~~~~~~---~~t~--- 141 (272) T protein:vir:98 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDAL---------SK---STQTVEA---TATV--- 141 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---ccccccc---ccCH--- Confidence 9999999999999999999999999999999999999999999988521 11 0000000 0011 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc--c--cccCC-------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK--F--TSRNQ-------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~--~--~~~~~-------~G~~~~~ 302 (377) +.+.+....+ ... ......|+|||.++..+... . ...+. 
+|...++ T Consensus 142 ----------------d~i~da~~~l---~~~----~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i 198 (272) T protein:vir:98 142 ----------------DGVSKALDIF---NDE----DDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEV 198 (272) T ss_pred ----------------HHHHHHHHHH---hcc----CCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhh Confidence 1111111111 000 11234799999988766421 1 11111 1222233 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|+||+.|+++|++++++++...+.+..+++++++.+++. .++...+++.+|++.++.+|++++++|+++- T Consensus 199 --~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 199 --LGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDI--TKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred --cCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeecccc--ccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 5789999999999999998888888888899998877664 4677899999999999999999999999877 No 105 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.91 E-value=3.3e-26 Score=160.29 Aligned_cols=253 Identities=15% Similarity=0.083 Sum_probs=186.5 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----cCCc-eEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN----TSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~----~~~~-~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +..++++.+..++|+.+++.|++.+++.+.+.+++++.. .+|+ ++||+....+.+.|+.|+++. 
+.++++|+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i-~~~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAI-PMTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcc-cccccccceE Confidence 555566677889999999999999999988888877532 2343 899998888899999876555 5678999999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++.+++++..+++|+++..++..|+.+++.+.+++++++.+|+.++..- +. ....... ..++ T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~---------~~---a~~~~~~---~~t~--- 141 (272) T protein:vir:30 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDAL---------SK---STQTVEA---TATV--- 141 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---ccccccc---ccCH--- Confidence 9999999999999999999999999999999999999999999988521 11 0000000 0011 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc--c--cccCC-------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK--F--TSRNQ-------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~--~--~~~~~-------~G~~~~~ 302 (377) +.+.+....+ ... ......|+|||.++..+... . ...+. 
+|...++ T Consensus 142 ----------------d~i~da~~~l---~~~----~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i 198 (272) T protein:vir:30 142 ----------------DGVSKALDIF---NDE----DDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEV 198 (272) T ss_pred ----------------HHHHHHHHHH---hcc----CCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhh Confidence 1111111111 000 11234799999988766421 1 11111 1222233 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|+||+.|+++|++++++++...+.+..+++++++.+++. .++...+++.+|++.++.+|++++++|+++- T Consensus 199 --~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 199 --LGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDI--TKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred --cCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeecccc--ccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 5789999999999999998888888888899998877664 4677899999999999999999999999877 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.72 E-value=4.9e-19 Score=120.95 Aligned_cols=254 Identities=15% Similarity=0.070 Sum_probs=179.7 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe-c---CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN-T---SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~-~---~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.-+-.++|+-+.+.+.+.+++...+.+++++.+ . +| .+++|+....+++.|+.+++.+ +..+.++++. 
T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i-~~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKI-PTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcc-ccccccccee Confidence 455555566789999999999999988877778876643 2 23 4789998766788888766555 4567899999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+..+.++++....+..|+.+.+.+.++.++++.+|+.++..-.+.. ... ...... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~------------~~~--~~~~~~---- 141 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTV--NADITK---- 141 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccc--cccccC---- Confidence 99999998889999999999999999999999999999999998875321110 000 000000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc----cccCC-------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF----TSRNQ-------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~----~~~~~-------~G~~~~~ 302 (377) .+.+.+.+ ..+ +. . .. ...+++|||..+..++... ..... +|...+. 
T Consensus 142 -----------~d~i~dA~----~~l---~d--~-~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:93 142 -----------LNGLQSAI----DKF---ND--E-DL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred -----------HHHHHHHH----HHh---hh--c-cC-CccEEEeCHHHHHHHHhhhhhcccccccccccceeeccccee Confidence 11111111 111 11 0 11 2246889999987776321 11111 1222233 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+.+|.++++++....+.+....++.++..++.. +....+++..+++.++++|+++++++.++| T Consensus 200 --~G~~Vi~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 200 --LGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAS--TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred --cCeeEEEcCCCCcceEEEEeCCeEEEEecCCcccccccchh--hcccEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 47889999999999998888887777777788887766544 345689999999999999999999999999 No 107 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.63 E-value=1.1e-17 Score=113.57 Aligned_cols=260 Identities=15% Similarity=0.079 Sum_probs=174.1 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe-c---CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN-T---SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~-~---~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +...++.-+..++|+-|++.+.+.+++...+.+++.+.. . +| .+++|.....+.+.+..+.+.+. ..+.++++. 
T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~-~~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAID-YSALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCc-cccccccee Confidence 444445556789999999999999988877777776543 2 23 47899987667777776655554 457889999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeecc-CCCcceeeeeccccccccccccccccccch Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGN-GLLQPVGLLKDLSQPTVDQSTGRDITTYKT 232 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~-G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 232 (377) ++..++.+..+.++++....+..|+.+.+.++++.++++.+|+.++..- |.. ..........+. T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~-------------~~~~~~~t~~~~-- 144 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTT-------------LEVKGAINIGLI-- 144 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------cccccccccchh-- Confidence 9988888888899999999999999999999999999999999887531 111 000000000000 Q ss_pred hhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc----ccccC--CC-----Ccccc Q lcl|Aclame:pro 233 DKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK----FTSRN--QF-----GEYVT 301 (377) Q Consensus 233 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~----~~~~~--~~-----G~~~~ 301 (377) ......+.+....+. ....+ . ..+++|||..+..+... ++... 
++ |...+ T Consensus 145 -------------~~~~~~~~da~~~l~--~~~~~---~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~ 205 (278) T protein:vir:80 145 -------------DKIENTFTDAPDAIE--DESIT---T-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGE 205 (278) T ss_pred -------------hhHHHHHHHHHHhhc--ccCCC---c-ccEEEECHHHHHHHHhhhhhhccccccccccceeecccee Confidence 001111222111110 00111 1 22578999888766422 11111 11 22223 Q ss_pred ccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 302 VLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 302 ~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) . .|++|+.|+++|.++.++..-..+..+..+++.++..++.. +....+++.++++.++++|+++|+++..|| T Consensus 206 ~--~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 206 L--LGWEIVRTKKLADGNALAVKAGALKTFLKRNLLAESGRDMD--HKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred e--cceeEEEcCCCCcceEEEEeccceeeeecCCcccccccchh--hccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 3 47899999999999877666665656666777877665543 455689999999999999999999999999 No 108 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.61 E-value=3.8e-17 Score=110.61 Aligned_cols=253 Identities=15% Similarity=0.090 Sum_probs=169.7 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-C---C-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT-S---L-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~-~---~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +..+.+.-...++|+-|.+.|.+.+.+...+.+++.+-+. + | .+++|.....+++.++.+++++. ..+.++++. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~-~~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEIS-LDKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccC-hhhcCCcce Confidence 4444455566789999999999999888888888776442 2 3 37899987777888877666654 566889999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+..+.++++....+..|+.+.+.++++..+++.+|+.++..- +. ...... ...++ T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l---------~~---~~~~~~---~~~~~--- 141 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAA---------KT---TSQTVS---TKANV--- 141 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---cccccc---ccccH--- Confidence 9999999888999999999999999999999999999999999886421 00 000000 00000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccc---cCCC-------Ccccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTS---RNQF-------GEYVTVL 303 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~---~~~~-------G~~~~~l 303 (377) +.+.+....+ ... .. ...+++|||.++..++..... .+.. |.+.+. 
T Consensus 142 ----------------d~i~~A~~~l---gd~---~~-~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~- 197 (272) T protein:vir:36 142 ----------------DGVQAALDIF---NDE---DA-QAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV- 197 (272) T ss_pred ----------------HHHHHHHHHh---hhc---CC-CceEEEEcHHHHHHHhcccccccccccccccceeeecccee- Confidence 1111111111 000 01 124688999998877543211 1111 222222 Q ss_pred CCCceEEecCCCCcceEEEEe--c--ccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 304 PHGITILESLAVETGKAIAFV--A--NRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 304 ~~~~~v~~s~~~~~~~ii~gd--~--s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+++|.++.++.. | ..+.++..++++++..++.. +....+++.+++..++++|+++|+++.++= T Consensus 198 -~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~--~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 198 -LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred -cCeeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccccchh--hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 57899999999988743221 1 12334455677777665543 445589999999999999999999999988 No 109 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.58 E-value=1.7e-16 Score=107.08 Aligned_cols=254 Identities=16% Similarity=0.104 Sum_probs=171.9 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.-...++|+-+++.+.+.++....+.+++++-+. +| .+++|.....+++....+...+ +..+.+++.. 
T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i-~~~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKI-PVDQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcC-chhhccccee Confidence 4444455567899999999999998888777777765432 23 4789987755666655555555 4556888999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+..+.++.+....+..|+.+.+.+.++.++++.+|+.++.-- +.. +.. ......++ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l---------~~a---~~~--~~~~~~~~--- 142 (274) T protein:vir:96 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEAL---------KGA---TLT--VEADITKL--- 142 (274) T ss_pred EEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHH---------hcC---CCC--cCcccccH--- Confidence 9988888888899999999888999999999999999999999887421 100 000 00000011 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc----ccccCC-------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK----FTSRNQ-------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~----~~~~~~-------~G~~~~~ 302 (377) +.+.+....+. .. .. ....++|||..+..+... +..... +|.+.+. 
T Consensus 143 ----------------d~i~dA~~~l~---d~---~~-~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~ 199 (274) T protein:vir:96 143 ----------------DGLQTAIDKFN---DE---DL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA 199 (274) T ss_pred ----------------HHHHHHHHHhc---cc---CC-CceEEEeCHHHHHHHHhcccccccccccccccceeeccccee Confidence 11111111111 10 11 234688999988766432 111111 1223333 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+++|.++.++.....+.+....++.++..++.. +....+++.++++.++++|+++++|+.+++ T Consensus 200 --~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 200 --LGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS--RKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred --cCeeEEEcCCCCcceEEEEeCcceeeeecCCcccccccchh--hcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 47789999999999987776666666667777777655443 456689999999999999999999999999 No 110 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.55 E-value=7.8e-16 Score=103.41 Aligned_cols=254 Identities=16% Similarity=0.081 Sum_probs=173.7 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.-...++|+-|.+.+.+.++....+.+++++-+. +| .+++|.....+++..+.++.++. ..+.++++. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc-cccccccee Confidence 4444555567899999999999988877666777766432 24 47899877666777666555554 557888999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+.-+.++.+....+..|+.+.+.+.++.++++.+|+.++.--. .. .... ..... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~---------~a---~~~~--~~~~~----- 140 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALM---------GA---KLTV--NADIT----- 140 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---Cccc--ccccc----- Confidence 99999988889999999988888999999999999999999998874211 10 0000 00000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc----cccCC-------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF----TSRNQ-------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~----~~~~~-------~G~~~~~ 302 (377) ..+.+.+.+ ..+ +.. .. ....++|||..+..++... ...+. +|.+.+. 
T Consensus 141 ----------~~d~i~dA~----~~l---~d~---~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:94 141 ----------KLNGLQSAI----DKF---NDE---DL-EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred ----------CHHHHHHHH----HHh---hcc---CC-CceEEEeCHHHHHHHHhhhhhhccccCcccccceecccccee Confidence 011111111 111 110 11 2246789999887776421 11111 1222233 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+.+|.++.++.....+.++...++.++..++... ....+++..++..++++|+++++++.++| T Consensus 200 --~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 200 --LGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred --cCeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchhh--cccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 478999999999999887777766667777888887766543 34588999999999999999999999999 No 111 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.55 E-value=7.8e-16 Score=103.41 Aligned_cols=254 Identities=16% Similarity=0.081 Sum_probs=173.7 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.-...++|+-|.+.+.+.++....+.+++++-+. +| .+++|.....+++..+.++.++. ..+.++++. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc-cccccccee Confidence 4444555567899999999999988877666777766432 24 47899877666777666555554 557888999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+.-+.++.+....+..|+.+.+.+.++.++++.+|+.++.--. .. .... ..... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~---------~a---~~~~--~~~~~----- 140 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALM---------GA---KLTV--NADIT----- 140 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---Cccc--ccccc----- Confidence 99999988889999999988888999999999999999999998874211 10 0000 00000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc----cccCC-------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF----TSRNQ-------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~----~~~~~-------~G~~~~~ 302 (377) ..+.+.+.+ ..+ +.. .. ....++|||..+..++... ...+. +|.+.+. 
T Consensus 141 ----------~~d~i~dA~----~~l---~d~---~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:97 141 ----------KLNGLQSAI----DKF---NDE---DL-EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred ----------CHHHHHHHH----HHh---hcc---CC-CceEEEeCHHHHHHHHhhhhhhccccCcccccceecccccee Confidence 011111111 111 110 11 2246789999887776421 11111 1222233 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+.+|.++.++.....+.++...++.++..++... ....+++..++..++++|+++++++.++| T Consensus 200 --~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 200 --LGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred --cCeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchhh--cccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 478999999999999887777766667777888887766543 34588999999999999999999999999 No 112 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.54 E-value=4.1e-16 Score=104.96 Aligned_cols=254 Identities=17% Similarity=0.118 Sum_probs=175.5 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe-c---CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN-T---SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~-~---~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.-...++|+-|.+.+.+.+.+...+.+++.+-+ + +| .+.+|.....+++.++.+..++. ..+.++++. 
T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~-~~~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIP-VDKIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccC-cccccccee Confidence 444445556678999999999999999988888887644 2 34 37999887777888776666654 566889999 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) +...++.+..+.++.+....+..|+.+.+.+.++..+++.+++.++. .++..... ......++ T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~---------~l~~~~~~-----~~~~~~t~--- 142 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLE---------ALRGTKLT-----VSADIGTL--- 142 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHH---------HHhccccc-----ccccccCH--- Confidence 99999999999999999999989999999999999999999997763 11110000 00000111 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc----ccccC--C-----CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK----FTSRN--Q-----FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~----~~~~~--~-----~G~~~~~ 302 (377) +.+......+ .. ... ...+++|||..+..++.. +.... + +|++.+. 
T Consensus 143 ----------------d~i~~A~~~l---gd---~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (276) T protein:vir:10 143 ----------------AGLEAAIDTF---DD---EDL-EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEA 199 (276) T ss_pred ----------------HHHHHHHHHh---cc---ccC-cccEEEEcHHHHHHHHHhccccccccccccccceecccccee Confidence 1111111111 00 011 124678999998777432 11111 1 1222232 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+.+|.++.++..-..+.++...++.++..++.. +....+++.+++..++++++.+++++.++| T Consensus 200 --~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~--~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (276) T protein:vir:10 200 --LGAVIVRSKKLDEGEAILAKRGAVKLITKRDFFLETDRDPS--TKTTALYSDKHYVAYLYDESKAVKVTKGAG 270 (276) T ss_pred --cceeEEEcCCCCcceEEEEeccceeeeecCCceeecccchh--hcccEEEEeeEEEEEEEcCcceEEEecCCc Confidence 57899999999999987666555556667788887776654 345688999999999999999999999999 No 113 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.54 E-value=4.7e-16 Score=104.60 Aligned_cols=292 Identities=13% Similarity=0.068 Sum_probs=182.5 Q ss_pred HHHHH--HhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCC Q lcl|Aclame:pro 54 MERMF--DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETS 130 (377) Q Consensus 54 ~~~~~--~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~ 130 (377) +-+.. ..+...+.++.+. =+-.+.+.+-.+.+.+.|......|||.+.+.++|++...+.++. 
+...+++.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~l 77 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQF---PELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVL 77 (330) T ss_pred CceecCCccccceeehhccc---cccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecC Confidence 00000 0111111221100 011122334445678899999999999999999999999887774 45789999889 Q ss_pred cceeeecccccccccccccceeEeecceeEEEeehhhHHHH--hcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC-Ccc Q lcl|Aclame:pro 131 GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDAL--KFGPKWLKQFITEQLKEAIAVALELAIVKGNGL-LQP 207 (377) Q Consensus 131 ~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell--~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-~~P 207 (377) +.+.|...++..++....+|.+++...+.+.+.+.|...+. ..+..|...+-.+...++++..++..||||+++ +++ T Consensus 78 p~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F 157 (330) T protein:vir:94 78 GDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSF 157 (330) T ss_pred CcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccc Confidence 99999887766655555689999999999999999999995 556788999999999999999999999999965 678 Q ss_pred eeeeeccccccccccccc-cccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccC-ceEEEeccchhhh Q lcl|Aclame:pro 208 VGLLKDLSQPTVDQSTGR-DITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG-QVKLLLNPEDRWT 285 (377) Q Consensus 208 ~Gil~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~n~~~~~~ 285 (377) .|+++.+.......+.+. ...+ ++.+ +.+-... ....+ ...|+||+...-. 
T Consensus 158 ~GL~~~~~~~q~i~tg~~gg~~T---------------~d~L-DeLl~~v-----------~~~~g~~~~~l~n~a~~r~ 210 (330) T protein:vir:94 158 QGMMGLVAASQTISAGANGGTLT---------------FELL-DQLLDLV-----------KDKDGQVDYLMSSFAMRRK 210 (330) T ss_pred cchhhcCCcccEEecCCCCCCCC---------------HHHH-HHHHHHh-----------cCCCCCCcEEEechhHHHH Confidence 899987654443322111 1111 1111 1111100 00111 2356777665433 Q ss_pred hccccc----------ccCCCCccccccCCCceEEecCCCCcc----------eEEEEec-----ccEEEEec----cee Q lcl|Aclame:pro 286 LEAKFT----------SRNQFGEYVTVLPHGITILESLAVETG----------KAIAFVA-----NRYDAFMA----TAS 336 (377) Q Consensus 286 ~~~~~~----------~~~~~G~~~~~l~~~~~v~~s~~~~~~----------~ii~gd~-----s~y~~~~~----~~~ 336 (377) +.+..- ..+..|..+... .|+|++.++.+|.+ .|++..| .+-+.+.. .|+ T Consensus 211 I~a~~R~~~~~~v~~~~~~~~G~~v~~~-~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~gl 289 (330) T protein:vir:94 211 YFSLLRALGGAAIGEVMTLPSGRQIPTY-RGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGL 289 (330) T ss_pred HHHHHHhccCCCCCCcccccCCCEEeee-CCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcc Confidence 332211 112233332111 37888888887753 2544333 23455553 366 Q ss_pred eEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEE-eecC Q lcl|Aclame:pro 337 TIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLT-LAGG 377 (377) Q Consensus 337 ~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~-~~a~ 377 (377) ++..-.+ --.++..-|+..+++...+.+++|+.+|+ +.=| T Consensus 290 sVr~~G~-~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 290 RVQNVGA-KENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred eeeeCCC-ccccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 6533111 11356677899999999999999998885 5666 No 114 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.50 E-value=1.4e-15 Score=102.04 Aligned_cols=254 Identities=17% Similarity=0.073 Sum_probs=170.7 Q ss_pred 
Hhc-cCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeeccccccccccccccee Q lcl|Aclame:pro 79 DKN-VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKE 152 (377) Q Consensus 79 ~~~-~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~ 152 (377) +.. +.+.-...++|+-|.+.+.+.+++...+.+++++-+. +| .+++|.....+++.++.+++++. ..+.++++ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~-~~~lt~~~ 79 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIP-IDLIETKK 79 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcc-hhhcccce Confidence 222 2233445788999999999999998888888876543 23 47899887767777776666654 55788999 Q ss_pred EeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccch Q lcl|Aclame:pro 153 QDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKT 232 (377) Q Consensus 153 i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 232 (377) .+...++.+..+.++++....+..|+.+...+.++..+++.+|+.++.--++ . .... .....++ T Consensus 80 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~---------a---~~~~--~~~~~~~-- 143 (275) T protein:vir:96 80 RQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQG---------A---TLKV--EADITKL-- 143 (275) T ss_pred eeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc---------c---cccc--cccccCH-- Confidence 9999999998899999999888788899999999999999999987731111 0 0000 0000111 Q ss_pred hhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc---cccC-C-------CCcccc Q lcl|Aclame:pro 233 DKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF---TSRN-Q-------FGEYVT 301 (377) Q Consensus 233 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~---~~~~-~-------~G~~~~ 301 (377) +.+.+....+. .. .. ....++|||..+..++... .... . 
+|.+.+ T Consensus 144 -----------------d~i~dA~~~lg---d~---~~-~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~ 199 (275) T protein:vir:96 144 -----------------AGLQTAIDKFN---DE---DL-EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGE 199 (275) T ss_pred -----------------HHHHHHHHHhc---cc---cC-CccEEEeCHHHHHHHHhcccccccccccccccceeccccce Confidence 11111111111 00 11 2246889999887774321 1111 1 122222 Q ss_pred ccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 302 VLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 302 ~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) . .|++|+.|+.+|.++.++..-..+.++...++.++..++.. +....+++.+++..++++|+++++++.+.+ T Consensus 200 ~--~G~~Vi~s~~~p~~t~~i~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 200 A--LGAIIVRSNKIKEGEAILAKRGAVKLITKRDFFLETERHAS--HKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred e--cCeeEEEeCCCCcceEEEEeccceeeeecCCcccccccchh--hcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 2 57899999999999876655444555666777777666543 455689999999999999999999999988 No 115 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.43 E-value=2e-14 Score=95.71 Aligned_cols=254 Identities=17% Similarity=0.082 Sum_probs=167.4 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.=...++|+-|++.+.+.+.....+.+++.+-+. +| .+++|.....+++....+...+. ..+.+.++. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccc-hhhccccee Confidence 3333344456789999999999988877777777665432 24 47899877666676665555553 456788888 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+..+.++.+-...+..|+.+.+.+.++.++++.+|+.++.--.+. .... .....++ T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a------------~~~~--~~~~~~~--- 142 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA------------KLTV--EADITKL--- 142 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccc--cccccCH--- Confidence 8888888888899999888887889999999999999999999876311110 0000 0000011 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc--c-ccCC--------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF--T-SRNQ--------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~--~-~~~~--------~G~~~~~ 302 (377) . .+...+..+. .. .. ...+++|||..+..++... . .... +|.+.+. 
T Consensus 143 ------------d----~i~~A~~~lg---d~---~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:96 143 ------------T----GLQTAIDKFN---DE---DL-EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEA 199 (274) T ss_pred ------------H----HHHHHHHHhc---cc---cc-cccEEEeCHHHHHHHHhhccccccccccccccceecccccee Confidence 1 1111111111 00 11 2246789999888776432 1 1111 1222232 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+.+|.++.++.-...+..+...++.++..++.. +....+++.+++..++++|++.|+++...| T Consensus 200 --~G~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:96 200 --LGAVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS--TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred --cCeEEEEeCCCCCceEEEEeccceeeeecCCcccccccccc--cccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 47889999999998865544444445556777777766544 456689999999999999999999999999 No 116 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.43 E-value=2e-14 Score=95.71 Aligned_cols=254 Identities=17% Similarity=0.082 Sum_probs=167.4 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.=...++|+-|++.+.+.+.....+.+++.+-+. +| .+++|.....+++....+...+. ..+.+.++. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~-~~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccc-hhhccccee Confidence 3333344456789999999999988877777777665432 24 47899877666676665555553 456788888 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+..+.++.+-...+..|+.+.+.+.++.++++.+|+.++.--.+. .... .....++ T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a------------~~~~--~~~~~~~--- 142 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA------------KLTV--EADITKL--- 142 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccc--cccccCH--- Confidence 8888888888899999888887889999999999999999999876311110 0000 0000011 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc--c-ccCC--------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF--T-SRNQ--------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~--~-~~~~--------~G~~~~~ 302 (377) . .+...+..+. .. .. ...+++|||..+..++... . .... +|.+.+. 
T Consensus 143 ------------d----~i~~A~~~lg---d~---~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:95 143 ------------T----GLQTAIDKFN---DE---DL-EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEA 199 (274) T ss_pred ------------H----HHHHHHHHhc---cc---cc-cccEEEeCHHHHHHHHhhccccccccccccccceecccccee Confidence 1 1111111111 00 11 2246789999888776432 1 1111 1222232 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|++|+.|+.+|.++.++.-...+..+...++.++..++.. +....+++.+++..++++|++.|+++...| T Consensus 200 --~G~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:95 200 --LGAVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS--TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred --cCeEEEEeCCCCCceEEEEeccceeeeecCCcccccccccc--cccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 47889999999998865544444445556777777766544 456689999999999999999999999999 No 117 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.40 E-value=3e-14 Score=94.72 Aligned_cols=254 Identities=17% Similarity=0.084 Sum_probs=168.0 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +....+.-...++|+-|.+.+.+.+.....+.+++.+-.. +| .+++|.....+++....+...+ +..+.+.++. 
T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i-~~~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKI-PTDILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcc-chhhccccee Confidence 4444445566789999999999988777666677665321 24 4789987766667666655555 4456788888 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ++..++.+.-+.++.+....+..|+.+.+.+.++.++++.+|+.++.--.+. ... ......+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a------------~~~--~~~~a~~---- 141 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLT--VNADITK---- 141 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------ccc--ccccccC---- Confidence 8888888888999998888887889999999999999999999887421110 000 0000011 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc---cccCC--------CCccccc Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF---TSRNQ--------FGEYVTV 302 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~---~~~~~--------~G~~~~~ 302 (377) .+ .+.+....+ +.. .. ...+++|||..+..++... ..... +|.+.+. 
T Consensus 142 -----------~d----~i~dA~~~l---gd~---~~-~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:12 142 -----------LN----GLQSAIDKF---NDE---DL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred -----------HH----HHHHHHHHh---ccc---cc-cccEEEeCHHHHHHHHhhhhhhccccccccccceecccceee Confidence 11 111111111 100 11 2245789999887776431 11111 1222222 Q ss_pred cCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 303 LPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|.+|+.|+.+|.++.++.-...+..+...++.++..++... ....+++.+++..++++|+..++++.++| T Consensus 200 --~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:12 200 --LGAIIVRSNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred --cCeeEEEeCCCCcceEEEEeccceeeeecCCceeccccchhh--cccEEEeeeEEEEEEEcCCceEEEEcCCc Confidence 578999999999988654444444455567788877766543 44588999999999999999999998888 No 118 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.26 E-value=2.8e-12 Score=83.90 Aligned_cols=336 Identities=15% Similarity=0.090 Sum_probs=180.7 Q ss_pred CCccHHHHH------HHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHH Q lcl|Aclame:pro 1 MAINLKELP------KYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKF 74 (377) Q Consensus 1 m~~~~~~l~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~ 74 (377) |.-=+++|+ .+.+++++++.+++++++..+.. ...-++.++.. 
+.-..+.....+..-+.+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~k~lr~~me~~et~~e~~----~~~~~~~~~e~-----el~E~f~Kmm~G~~p~~eV~-- 69 (393) T protein:vir:79 1 MENWLKQLKESGFTETQVQEQKSLRTRMERGETLAEAD----ANKLALNEEET-----QILESFAKMMEGETPTNEVN-- 69 (393) T ss_pred CchHHHHHHhccCchhHHHHHHHHHHHhhhhhhhhhhh----hhhhhcchhHH-----HHHHHHHHHhcCCCchhhee-- Confidence 554455553 34456666666666554432221 11111111100 11111111111222222211 Q ss_pred HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEEcCCcceeeeccccccccc--ccccc Q lcl|Aclame:pro 75 FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT-SLR-LKALTAETSGTAVWGDIFGEIKGQ--LKQAF 150 (377) Q Consensus 75 ~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~-~~~-~~~p~~~~~~~a~w~~e~~~~~~~--~~~~f 150 (377) .+..-++.++..+||..++.-+.+..+.....-++...+.+ .|. ..+|-. +.--+.-++|.++.++. ...+| T Consensus 70 ---~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~-g~~Ra~~IgEGgE~~~~sld~~T~ 145 (393) T protein:vir:79 70 ---LREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSI-GIMRAYDVAEGQEIPEDSIDWQTH 145 (393) T ss_pred ---hhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccch-heeeeccccccccccccchhhhcC Confidence 22334556678999999999998866655544455444444 232 333321 12223334455555433 23579 Q ss_pred eeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCC-Cc--ceeeeeccccccccccccccc Q lcl|Aclame:pro 151 KEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGL-LQ--PVGLLKDLSQPTVDQSTGRDI 227 (377) Q Consensus 151 ~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-~~--P~Gil~~~~~~~~~~~~~~~~ 227 (377) +.+++...|.+..+.+|+|+++||..|+.+++.....+++++..+.-++++.-+ ++ ..|+.+.. ....++-.. 
T Consensus 146 dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t----~ahptGr~~ 221 (393) T protein:vir:79 146 ESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNK----LAHTTGLDK 221 (393) T ss_pred CceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCc----cceeecCCc Confidence 999999999999999999999999999999999999999999999999998754 33 44554322 222222111 Q ss_pred cccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc-----ccccCCCCccc-- Q lcl|Aclame:pro 228 TTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK-----FTSRNQFGEYV-- 300 (377) Q Consensus 228 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~-----~~~~~~~G~~~-- 300 (377) .+ ...+++ ....+.+.+..++... + ...+++|+|.. |.+.+. .+..+..|+|. T Consensus 222 ~~-----~qNGTl---SleDllDm~~av~~~h----------y-t~svi~MHPLA-Wnv~AKna~me~~~~na~gN~~~~ 281 (393) T protein:vir:79 222 NG-----VQNDTF---SAEDFLDLIIAVMANE----------Y-TPSDLMMHPLA-WTVFAKNELMGSLQANPYGNYPAK 281 (393) T ss_pred cc-----cccccc---cHHHHHHHHHHHhccc----------C-CcceEEEcCch-hhhhhhhhhhcceeeccccccCcc Confidence 11 111111 1122333333332211 1 22467788754 332211 11122223321 Q ss_pred --------------cccCCCceEEecCCCCcce------EEEEecccE-EEEecceeeEEeechhhhhcCcEEEEEEEEE Q lcl|Aclame:pro 301 --------------TVLPHGITILESLAVETGK------AIAFVANRY-DAFMATASTIEEYDQTFAMEDLQLYLTKNYF 359 (377) Q Consensus 301 --------------~~l~~~~~v~~s~~~~~~~------ii~gd~s~y-~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~ 359 (377) .-|||+..|++|+.+|=++ ++..|-... ++-.+.+++.++.++. 
..|..-++-++|+ T Consensus 282 ~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk--~rdiq~iKl~ERY 359 (393) T protein:vir:79 282 GAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK--ARGLQNIKMIERY 359 (393) T ss_pred ccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc--cccceeeeeeeee Confidence 2366788999999998332 233333332 2234456666666554 3678889999999 Q ss_pred cCEEecc-cceEE---EEeecC Q lcl|Aclame:pro 360 YGKAKDN-HTAAL---LTLAGG 377 (377) Q Consensus 360 dg~~~~~-~af~~---l~~~a~ 377 (377) +..+.+. +|+.+ ++++-. T Consensus 360 G~gvLn~gkaiavakNI~~~k~ 381 (393) T protein:vir:79 360 GIGILNEGKAIAVAKNISMDKS 381 (393) T ss_pred ceeeeeCCceEEEEecceeecc Confidence 9877776 44443 334433 No 119 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.25 E-value=1.3e-12 Score=85.75 Aligned_cols=315 Identities=13% Similarity=0.083 Sum_probs=164.1 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC- Q lcl|Aclame:pro 42 MGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL- 120 (377) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~- 120 (377) +..+ ...++..+....+....... .++- +|.+++++...++++.+++.+++++.++++++.. 
T Consensus 1 ~~~~---------------~~~~~~~n~~~~~i~k~~it-~~~l-~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~ 63 (360) T protein:vir:99 1 MSSN---------------STIDSVRNQNMNSLSQKDIG-LAEL-DGFQLPVDVTEEFLERMQKGVQILGMADTMTLARL 63 (360) T ss_pred Ccch---------------hHHHHHhhhHHHHHHhhhcc-cccc-CceeecHHHHHHHHHHHhhccchhhhcceeecccc Confidence 0000 00000011111111111111 1122 4568888999999999999999999999988753 Q ss_pred ceEEEEEcCCcce-eeecccccccccccccceeEeec-ceeEEEeehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 121 RLKALTAETSGTA-VWGDIFGEIKGQLKQAFKEQDFS-QFKLTAFVVIPKDALKFG----PKWLKQFITEQLKEAIAVAL 194 (377) Q Consensus 121 ~~~~p~~~~~~~a-~w~~e~~~~~~~~~~~f~~i~l~-~~k~~~~~~iS~ell~ds----~~~~~~~l~~~la~~~a~~~ 194 (377) ...++...-+.-. .-..|++...+..+++...+.+. .+++.....++.+-+++. ...+++.|.+.|++++++-+ T Consensus 64 ~~ei~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dl 143 (360) T protein:vir:99 64 EMEVPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDL 143 (360) T ss_pred cccccccccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHH Confidence 3344432211110 11123333333344555666663 345666667777776654 33577999999999999999 Q ss_pred hcceeeccCC---------Ccc-----eeeeeccccccccccccc--------cccccchhh---hhhhhhhccChHH-H Q lcl|Aclame:pro 195 ELAIVKGNGL---------LQP-----VGLLKDLSQPTVDQSTGR--------DITTYKTDK---EAIADLSDLDPDT-A 248 (377) Q Consensus 195 ~~a~l~G~G~---------~~P-----~Gil~~~~~~~~~~~~~~--------~~~~~~~~~---~~~~~l~~~~~~~-~ 248 (377) +...++|+.. +.| .|+++....-....-.+. +..+...+. .+..... .++.. 
- T Consensus 144 e~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~ 222 (360) T protein:vir:99 144 GLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGS-GNPQPVD 222 (360) T ss_pred HHHHhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccc-cccccch Confidence 9999998743 222 488776532211000000 000000000 0000000 00000 0 Q ss_pred HHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCC-Cc--cc---cccCCCceEEecCCCCcceEEE Q lcl|Aclame:pro 249 VELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQF-GE--YV---TVLPHGITILESLAVETGKAIA 322 (377) Q Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~-G~--~~---~~l~~~~~v~~s~~~~~~~ii~ 322 (377) ...+..+++.+-..+.. .-..+..|+|+|.+.-.....+.....+ |. .. ...++|+|++.-+.+|++.++| T Consensus 223 ~~lf~~~~~~Lp~kyr~---~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~ml 299 (360) T protein:vir:99 223 TSLFNETIQTLDSRYRE---SDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMF 299 (360) T ss_pred HHHHHHHHHhcchhhhc---CcccceEEEccCchHHHHHHHHhccCcccchhheecccccccceeeeEEcCCCCCCceEE Confidence 11122222222111110 0011458999998765444444332211 10 11 1234688899999999999999 Q ss_pred EecccEEEEecceeeEEeechhhhh-cCc--EEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 323 FVANRYDAFMATASTIEEYDQTFAM-EDL--QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 323 gd~s~y~~~~~~~~~i~~~~~~~f~-~~~--~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) =+++..+.+....++|+.+.+.... +.. 
+.+.....+|....+.+|.++++=--- T Consensus 300 T~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~ 357 (360) T protein:vir:99 300 TDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLET 357 (360) T ss_pred eccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCC Confidence 9999999999999999876543222 222 223334567787888889988762211 No 120 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.07 E-value=1.4e-11 Score=80.11 Aligned_cols=251 Identities=14% Similarity=0.048 Sum_probs=160.8 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +.. +.-...++|+-|.+-+.+.+.+...+.+++.+-+. +| .+.+|.....+++.-+.++.++. ..+.++++- T Consensus 1 Ma~--T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~-~~~lt~~~~ 77 (270) T protein:vir:95 1 MTQ--TKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMD-TTQMSMTTT 77 (270) T ss_pred CCc--eehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccc-hhhcccchh Confidence 222 22334679999999999999888888888876443 24 37899877666766444555554 456788888 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) ....++.+.-+.++.+-...+..|..+.+.+.++..+++++++.++. .++..... ... .. 
T Consensus 78 ~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~---------~l~~a~~~---~~~---~~----- 137 (270) T protein:vir:95 78 KVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIA---------ELNKSKQT---ATV---SA----- 137 (270) T ss_pred eeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHH---------Hhcccccc---ccc---cc----- Confidence 88889988889999998887767788999999999999999997762 11111100 000 00 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccc---cCCC-----CccccccCC Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTS---RNQF-----GEYVTVLPH 305 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~---~~~~-----G~~~~~l~~ 305 (377) +...+.+.+. ..+.. .....+++|||.++..+...... ..++ |.+.+. . T Consensus 138 ----------t~~~~~dA~~-------~lgd~----~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~--~ 194 (270) T protein:vir:95 138 ----------DATGILDAIE-------VFNSE----NDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEI--V 194 (270) T ss_pred ----------CHHHHHHHHH-------Hhccc----cCCCcEEEEcHHHHHHHHhhhcccccccccchhccccccee--c Confidence 0011111111 11111 11123588999998877532211 1111 333333 4 Q ss_pred CceEEecC-CCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 306 GITILESL-AVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 306 ~~~v~~s~-~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |++|+.++ ..++++.++.-..-..++...++.++..++.. +....+.+.+++..+++++..++++|++-. 
T Consensus 195 G~~Viv~s~~~~~~~~~l~~~gAi~~~~~~~~~vEtdRd~~--~~~d~i~~~~~y~v~~~~~skvv~~t~~~a 265 (270) T protein:vir:95 195 GVSDIVKSKRVSENTAFLQRYGAMEIVNKKKPEAYTDFDIL--KRTHLLSTNYHYSVNLKDETGVVKVTFKPS 265 (270) T ss_pred ceeEEEeCCCCCceeEEEEeccceeeeecCCceeeeccchh--hcccEEEeeeEEEEEEEccceEEEEEecCC Confidence 67776655 45566655544444555666777887776654 455578899999999999999999997655 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.07 E-value=3.8e-11 Score=77.74 Aligned_cols=270 Identities=13% Similarity=0.086 Sum_probs=158.5 Q ss_pred cccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEEcCCcceeeecc---- Q lcl|Aclame:pro 64 NRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDI---- 138 (377) Q Consensus 64 ~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~~~~~~~a~w~~e---- 138 (377) -..||-.| .+.+.+......|||.+.+.|.|++...+.++.| ...+.+....+.+.+... 
T Consensus 1 mpaltLae---------------a~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~ 65 (310) T protein:vir:97 1 MASVTLAE---------------SAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTF 65 (310) T ss_pred CcccchHH---------------HhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccc Confidence 11122222 2346777889999999999999999999887765 456666654444332111 Q ss_pred cccccccccccceeEeecceeEEEeehhhHHHHhc--C-HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcc-eeeeecc Q lcl|Aclame:pro 139 FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--G-PKWLKQFITEQLKEAIAVALELAIVKGNGLLQP-VGLLKDL 214 (377) Q Consensus 139 ~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s-~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P-~Gil~~~ 214 (377) ..+...++..+|++++...+-+.+.+.|.+.+.+- + ..+...+=.+..++++..+.+..||||+.+++| .|+++.+ T Consensus 66 ~~~g~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~ 145 (310) T protein:vir:97 66 SGAGAGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLC 145 (310) T ss_pred cCCCccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcC Confidence 11122346688999999999999999999876542 3 445555556778899999999999999986655 5999876 Q ss_pred cccccccc-ccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc-- Q lcl|Aclame:pro 215 SQPTVDQS-TGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT-- 291 (377) Q Consensus 215 ~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~-- 291 (377) .......+ +.....+ +..+-..++.+ +........++|||.+.-.+.+..- T Consensus 146 ~~~q~i~~~~~gg~~t---------------~d~LDeLl~~v-----------~~~~g~p~~~l~~~~~~r~i~A~~R~~ 199 (310) T protein:vir:97 146 ASGQKATTGATGSAIS---------------FAILDELMDLV-----------VDKDGQVDYLTMHARTLRSYKALLRAL 199 (310) T ss_pred CccceeecCCCCCCCC---------------HHHHHHHHHHH-----------hcCCCCCCEEEecHHHHHHHHHHHHHh Confidence 54333221 1111111 11111111111 0011122368899976433322111 Q ss_pred --------ccCCCCccccccCCCceEEecCCCCcc----------eEE---EEecc--cEEEEec----ceeeEEeechh 
Q lcl|Aclame:pro 292 --------SRNQFGEYVTVLPHGITILESLAVETG----------KAI---AFVAN--RYDAFMA----TASTIEEYDQT 344 (377) Q Consensus 292 --------~~~~~G~~~~~l~~~~~v~~s~~~~~~----------~ii---~gd~s--~y~~~~~----~~~~i~~~~~~ 344 (377) ..+..|..+... .|+|++.++.+|.+ .|+ ||+-+ +-+++.. .|+++..-.+. T Consensus 200 ~~~g~~~~~~~~~G~~v~~~-~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~ 278 (310) T protein:vir:97 200 GGASINEVVELPSGAEVPAY-SGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGES 278 (310) T ss_pred cCCCCCCccccCCCCEEeee-CCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcc Confidence 123344443222 37899998888753 244 44422 3344432 24554332110 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEE-eec Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLT-LAG 376 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~-~~a 376 (377) =.++..-|+..+++.-.+.+++|+++|. +.- T Consensus 279 -~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 279 -EDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred -cCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 0245567888999999999999998875 222 No 122 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.03 E-value=1e-10 Score=75.38 Aligned_cols=343 Identities=15% Similarity=0.162 Sum_probs=172.3 Q ss_pred CCc-----cHHHHHHHHHH---HHHHHHHHHhccC-------------HHHHHHHHHHHHHHHHH---HHHH--H---HH Q lcl|Aclame:pro 1 MAI-----NLKELPKYREA---VAELSAKISAGAT-------------PEEQEKLFEAAFTTMGD---EILA--K---NE 51 (377) Q Consensus 1 m~~-----~~~~l~~~~~~---~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~---~~~~--~---~~ 51 (377) |.+ +--+++++-+. .++....++..+. ..+..+.+.+..-.+.+ +... + .. 
T Consensus 1 ~~~s~~~~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~~Kgk 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhhcccc Confidence 222 11122222211 2222222221110 11111111111111111 0000 0 00 Q ss_pred HHH----------HHHHHhccccccccHHHHHHHHHHHhccCC--CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|Aclame:pro 52 EEM----------ERMFDLRDKNRELTAEEIKFFNDIDKNVGG--KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS 119 (377) Q Consensus 52 ~~~----------~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~--s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~ 119 (377) .++ ...+... .....+.+-+.++.+-..+.+. .+....+|.-+...|-+.+..+.++++..+|.+++ T Consensus 81 ~~mtefLkT~~A~~~fa~~l-~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p 159 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVL-KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) T ss_pred hhHHHhhhhHHHHHHHHHHH-HhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCC Confidence 000 0000000 1112223444444433333332 45566889999999999999999999999998885 Q ss_pred CceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHh--cCHHHHHHHHHHHHHHHHHH-Hhhc Q lcl|Aclame:pro 120 LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALK--FGPKWLKQFITEQLKEAIAV-ALEL 196 (377) Q Consensus 120 ~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~--ds~~~~~~~l~~~la~~~a~-~~~~ 196 (377) +-+.. .......-.|+.-.|..+.++..+|..-++.|.-++.+..+.+-..+ ++.-.|..||.++|.+.+-. ..+. 
T Consensus 160 ~l~V~-~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~ 238 (400) T protein:vir:93 160 ALLVS-RSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDL 238 (400) T ss_pred ceeee-cchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhh Confidence 44322 22233345676667888888888999999999877777666433332 23345799999999999996 5799 Q ss_pred ceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhcc-ChHHHHHHHHHHHHhhhhhhhhhhhcccCceE Q lcl|Aclame:pro 197 AIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDL-DPDTAVELLVPVMKHLSVNDKKHPLKIAGQVK 275 (377) Q Consensus 197 a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (377) +++-|+|++...++-+..........+ . ...++.. +...+...+.++. .|. ...++. T Consensus 239 Aii~GdG~Ngf~~~dk~t~Ik~I~~dt-~-----------kt~~a~~~~~qdl~E~~~d~~---------~~~-aad~~~ 296 (400) T protein:vir:93 239 ALVEGDGTNGFKSIDKEADVKKIKKIT-T-----------KAKSAGKTPFADAIEEAVDFV---------RPT-AGRRYL 296 (400) T ss_pred heeecccccccCCCcchhhhhhhhhhh-h-----------hhhhcCCccHHHHHHHHHhhh---------hhc-cCCcee Confidence 999999988666653221111111100 0 0001111 1122223322221 122 345677 Q ss_pred EEeccchhhhhcccccccCCCCccc-----------cccCCCceEEec-CCCCcceEEEEecccEEEEecceeeEEeech Q lcl|Aclame:pro 276 LLLNPEDRWTLEAKFTSRNQFGEYV-----------TVLPHGITILES-LAVETGKAIAFVANRYDAFMATASTIEEYDQ 343 (377) Q Consensus 276 ~~~n~~~~~~~~~~~~~~~~~G~~~-----------~~l~~~~~v~~s-~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~ 343 (377) .+|.|...+.+ ..+ +..+|.+. +-+|++--|+.+ ..++...|++ |=..|+ ++..+...+. 
T Consensus 297 Iv~s~d~~A~L-~~l--k~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-Dek~~i----~~~~~~t~~s 368 (400) T protein:vir:93 297 IVKAEDRKALL-DEL--RQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLV-DQKYHI----DMQDLTKVDA 368 (400) T ss_pred EEeccchHHHH-HHh--cCCcceeeeeeccccchhhhhcccceeeeeccCCCCCceeee-ehhhhc----cccCceeccc Confidence 77887665544 222 33444433 223333222222 2333333444 444333 3333444444 Q ss_pred hhhhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 344 TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 344 ~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) -.+..++..+-....+.|-+.-+++-++++++ T Consensus 369 f~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 369 FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeeccceEEeeeeeccceecccceeeEeeC Confidence 44455666677778899999999999999999 No 123 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.91 E-value=7.1e-11 Score=76.23 Aligned_cols=217 Identities=16% Similarity=0.091 Sum_probs=141.7 Q ss_pred ceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 113 INFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAV 192 (377) Q Consensus 113 ~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~ 192 (377) -+-++.+-.+++|.. 
.+++.-+.|..+++ ....++++-+...++.+.-+.|+.+-...+..|..+...+.++.+|++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~-~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCC-hhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 233444556889865 45666566555554 456889999999999999999999998888888899999999999999 Q ss_pred HhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccC Q lcl|Aclame:pro 193 ALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAG 272 (377) Q Consensus 193 ~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (377) ++|+.++. .+....... ....++....++...+ .. .. .. T Consensus 78 kvD~di~~------------~~~~a~l~~---~~~~t~d~i~~A~~~f----------------------gd---e~-~~ 116 (231) T protein:vir:73 78 KVDDDLLK------------AAKTTSQTV---STKANVDGVQAALDIF----------------------ND---ED-AQ 116 (231) T ss_pred hhhHHHHH------------hhccccccc---cccccHHHHHHHHHHh----------------------cc---cc-cc Confidence 99998773 011000000 0011111111111110 10 01 12 Q ss_pred ceEEEeccchhhhhcccccc---cC--C-----CCccccccCCCceEEecCCCCcceEEEEe----cccEEEEecceeeE Q lcl|Aclame:pro 273 QVKLLLNPEDRWTLEAKFTS---RN--Q-----FGEYVTVLPHGITILESLAVETGKAIAFV----ANRYDAFMATASTI 338 (377) Q Consensus 273 ~~~~~~n~~~~~~~~~~~~~---~~--~-----~G~~~~~l~~~~~v~~s~~~~~~~ii~gd----~s~y~~~~~~~~~i 338 (377) ..+++|||.+++.+...... .+ + +|.+... .|++|+.|+.+|.+..+... 
..-..+....++.+ T Consensus 117 ~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i--~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~v 194 (231) T protein:vir:73 117 AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADV--LGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQV 194 (231) T ss_pred ceEEEEcchHHHhhhhccchhhhhhhhccceeeecccceE--cceEEEEcCCCCCCceeeeeEEeeccceeeeeccccee Confidence 34688999999887653211 11 1 1223333 57899999999988765432 12245666778888 Q ss_pred EeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 339 EEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 339 ~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +..++.. .....+.+.+++..+++++..+|++++++- T Consensus 195 EtdRd~~--~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 195 ETDRDIV--TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred ecccccc--ccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 8766543 445578899999999999999999999988 No 124 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.89 E-value=1.2e-10 Score=74.93 Aligned_cols=254 Identities=15% Similarity=0.107 Sum_probs=135.3 Q ss_pred CCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEEcCCcceeeecccccccccccccceeEeeccee Q lcl|Aclame:pro 85 KDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----NTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFK 159 (377) Q Consensus 85 s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~----~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k 159 (377) -.-..++|+.|+.++++.++..+.+.+++..- ...| .+.||+......+....+++... 
..+.+...+++...+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccC-ccccccceEEEEEee Confidence 11124689999999999999998888876431 1123 47888876555555544344332 234455555555543 Q ss_pred E-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhh Q lcl|Aclame:pro 160 L-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIA 238 (377) Q Consensus 160 ~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (377) . +.-+.|+..-...+..+++++ .+..+++++.++|..++. .+.. ......... .. T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~---------~~~~---a~~~~~~~~-~~---------- 135 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVD---NGTALTGSA-PT---------- 135 (273) T ss_pred eeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHH---------HHhc---ccccccccc-cc---------- Confidence 3 333456664444455678884 567889999999987652 1100 000000000 00 Q ss_pred hhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc---ccccC---CCCcccc---ccCCCceE Q lcl|Aclame:pro 239 DLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK---FTSRN---QFGEYVT---VLPHGITI 309 (377) Q Consensus 239 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~---~~~~~---~~G~~~~---~l~~~~~v 309 (377) ++...++.+..++..+ +....| ..+ .+++++|..+..+... ..... ..+.+.. 
....|++| T Consensus 136 -----~~~~~~~~i~~a~~~l--d~~~vP--~~~-R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v 205 (273) T protein:vir:10 136 -----DADDAFDLIAKALKEL--TKANVP--NVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARI 205 (273) T ss_pred -----chhHHHHHHHHHHHHh--hhcCCC--cCC-CEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEE Confidence 1111112222222211 111122 233 5578999888766432 11111 1111111 11257899 Q ss_pred EecCCCCcce---EEEEecccEEEEecceeeEEee-chhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 310 LESLAVETGK---AIAFVANRYDAFMATASTIEEY-DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 310 ~~s~~~~~~~---ii~gd~s~y~~~~~~~~~i~~~-~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.|+++|.+. ++.|--+-..... +...++.. ++.+| -..+++.+.++.++++|+++++|+-++- T Consensus 206 ~~s~~lp~~~~~~~~~~~~~A~~~a~-q~~~~e~~r~~~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 206 VESNNLRDTDDEQFVAFHPSAAAYVS-QIDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEecccccCCccEEEEEeccceeeee-eeehhhcccCCCcc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 9999998643 4444333222221 11223222 22233 3368899999999999999999998888 No 125 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.89 E-value=1.2e-10 Score=74.93 Aligned_cols=254 Identities=15% Similarity=0.107 Sum_probs=135.3 Q ss_pred CCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEEcCCcceeeecccccccccccccceeEeeccee Q lcl|Aclame:pro 85 KDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----NTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFK 159 (377) Q Consensus 85 s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~----~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k 159 (377) -.-..++|+.|+.++++.++..+.+.+++..- ...| .+.||+......+....+++... 
..+.+...+++...+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccC-ccccccceEEEEEee Confidence 11124689999999999999998888876431 1123 47888876555555544344332 234455555555543 Q ss_pred E-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhh Q lcl|Aclame:pro 160 L-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIA 238 (377) Q Consensus 160 ~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (377) . +.-+.|+..-...+..+++++ .+..+++++.++|..++. .+.. ......... .. T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~---------~~~~---a~~~~~~~~-~~---------- 135 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVD---NGTALTGSA-PT---------- 135 (273) T ss_pred eeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHH---------HHhc---ccccccccc-cc---------- Confidence 3 333456664444455678884 567889999999987652 1100 000000000 00 Q ss_pred hhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc---ccccC---CCCcccc---ccCCCceE Q lcl|Aclame:pro 239 DLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK---FTSRN---QFGEYVT---VLPHGITI 309 (377) Q Consensus 239 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~---~~~~~---~~G~~~~---~l~~~~~v 309 (377) ++...++.+..++..+ +....| ..+ .+++++|..+..+... ..... ..+.+.. 
....|++| T Consensus 136 -----~~~~~~~~i~~a~~~l--d~~~vP--~~~-R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v 205 (273) T protein:vir:10 136 -----DADDAFDLIAKALKEL--TKANVP--NVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARI 205 (273) T ss_pred -----chhHHHHHHHHHHHHh--hhcCCC--cCC-CEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEE Confidence 1111112222222211 111122 233 5578999888766432 11111 1111111 11257899 Q ss_pred EecCCCCcce---EEEEecccEEEEecceeeEEee-chhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 310 LESLAVETGK---AIAFVANRYDAFMATASTIEEY-DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 310 ~~s~~~~~~~---ii~gd~s~y~~~~~~~~~i~~~-~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.|+++|.+. ++.|--+-..... +...++.. ++.+| -..+++.+.++.++++|+++++|+-++- T Consensus 206 ~~s~~lp~~~~~~~~~~~~~A~~~a~-q~~~~e~~r~~~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 206 VESNNLRDTDDEQFVAFHPSAAAYVS-QIDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEecccccCCccEEEEEeccceeeee-eeehhhcccCCCcc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 9999998643 4444333222221 11223222 22233 3368899999999999999999998888 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.85 E-value=1.5e-10 Score=74.47 Aligned_cols=254 Identities=14% Similarity=0.088 Sum_probs=135.9 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEEcCCcceeeecccccccccccccceeE Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----NTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~----~~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) +. -..++|+.|+.++++.++....+.++++.- ...| .+.+|+......+....+.+... 
..+.+...+ T Consensus 1 MA------~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~ 73 (273) T protein:vir:79 1 MA------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) T ss_pred Cc------chhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccC-ccccccceE Confidence 11 123689999999999999998887776432 1124 47899876555555544444333 334566666 Q ss_pred eecceeE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccch Q lcl|Aclame:pro 154 DFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKT 232 (377) Q Consensus 154 ~l~~~k~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 232 (377) ++...+. +.-+.|+..-...+..+++++ .+..+++++.++|+.++. .+... ........ .. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~vD~~i~~---------~~~~a---~~~~~~~~-~~---- 135 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVDN---GTALTGSA-PS---- 135 (273) T ss_pred EEEEeeecccceeeccHHHHhhcccHHHH-HHHHHHHHHHHHHHHHHH---------HHhhc---cccccccc-cc---- Confidence 6666553 344456664444456788875 567889999999986542 11000 00000000 00 Q ss_pred hhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc---ccccCC---CCcccc---cc Q lcl|Aclame:pro 233 DKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK---FTSRNQ---FGEYVT---VL 303 (377) Q Consensus 233 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~---~~~~~~---~G~~~~---~l 303 (377) ++....+.+..+...+ +... ....+ .+++++|..+..++.. ...... ++.+.+ .. 
T Consensus 136 -----------~~~~~~~~i~~a~~~l--d~~~--vP~~~-R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~ 199 (273) T protein:vir:79 136 -----------DADDAFDLIASALKEL--TKAN--VPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN 199 (273) T ss_pred -----------chhhHHHHHHHHHHHh--hhcc--CCccC-cEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeE Confidence 0111111222222111 1111 12234 4578899887766432 111111 111111 11 Q ss_pred CCCceEEecCCCCcce---EEEEecccEEEEecceeeEEeec-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 304 PHGITILESLAVETGK---AIAFVANRYDAFMATASTIEEYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 304 ~~~~~v~~s~~~~~~~---ii~gd~s~y~~~~~~~~~i~~~~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+|++|+.|+.+|.+. ++.|--+-..... +...++... +.+| -..+++.++++.++++|+++++|+-++- T Consensus 200 ~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~-~~~~~e~~r~~~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 200 LLGARIVESNNLRDTDDEQFVAFHPSAAAYVS-QIDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EeceEEEecccccccCceEEEEEeccceeeee-ehhhhhcccCcccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 2578999999998643 3333222222211 222233221 2223 3378899999999999999999998888 No 127 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.64 E-value=1.8e-09 Score=68.57 Aligned_cols=288 Identities=12% Similarity=0.059 Sum_probs=144.1 Q ss_pred HHhccccccccHHHHHHHHHHHhccCCCCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcce Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKDKF--KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTA 133 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg--~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a 133 (377) ......+..+ ..+...+.+++. .+-=+.|..++.+..+..+.+++++++.++. |+ +.+|+... 
..+ T Consensus 1 ~a~~~~~~~~---------~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~-~~~ 70 (347) T protein:vir:88 1 MANATGGQQI---------GANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR-TKG 70 (347) T ss_pred CCCcccchhh---------hccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecc-eee Confidence 0000000000 011122222233 2333899999999999999999999987754 44 67886543 334 Q ss_pred eeeccccccc-ccccccceeEeecceeE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccCC--- Q lcl|Aclame:pro 134 VWGDIFGEIK-GQLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNGL--- 204 (377) Q Consensus 134 ~w~~e~~~~~-~~~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G~--- 204 (377) ........+. +..++..++++|...++ +.-..|.+-=.-.+..|+.+.+.++.++++++..|++++. +... T Consensus 71 ~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:88 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) T ss_pred eeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 4333222222 12345677766666554 2333444444444567899999999999999999998752 1110 Q ss_pred --CcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 205 --LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 205 --~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) .-+.|+-... ....+++..... ...++...++.+..+...+ +.. .....+ .+++++|.. 
T Consensus 151 ~~~~~~g~~~~~---~~~~~~~~~~~~-----------~~~~~~~~~~~i~~a~~~L--de~--~VP~~g-R~~vv~P~~ 211 (347) T protein:vir:88 151 SNENIAGLGQAV---VLNIGAAADLVD-----------VEARGKAILKGLTLARARL--TKN--YVPAGD-RRFYCAPED 211 (347) T ss_pred cccccCCccccc---cccccccccccc-----------hhhhHHHHHHHHHHHHHHH--hhc--CCCCCC-CEEEeCHHH Confidence 0011211100 000011111000 0011222222222222221 112 222334 466789987 Q ss_pred hhhhccccccc----CCCCcccc---ccCCCceEEecCCCCcce---E----------------------EEEecccE-- Q lcl|Aclame:pro 283 RWTLEAKFTSR----NQFGEYVT---VLPHGITILESLAVETGK---A----------------------IAFVANRY-- 328 (377) Q Consensus 283 ~~~~~~~~~~~----~~~G~~~~---~l~~~~~v~~s~~~~~~~---i----------------------i~gd~s~y-- 328 (377) |..|+...... +..+.+.. .-..|++|+.|+++|.+. . +.+|+++- T Consensus 212 y~~Ll~~~~~~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~ 291 (347) T protein:vir:88 212 YSAILSALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVG 291 (347) T ss_pred HHHHhcchhhhhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEE Confidence 77665322111 11111111 011477888898887421 0 22345441 Q ss_pred EE--------EecceeeEEeechh-hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 329 DA--------FMATASTIEEYDQT-FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 329 ~~--------~~~~~~~i~~~~~~-~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+ +.-.++.++...+. +|.. 
.+++.+-++.++++|++.+++++++- T Consensus 292 l~~~~~a~g~v~~~d~~~e~~r~~~~~~d---~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 292 LFNHRSAVGTVKLKDMALERARRPEFQAD---QIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred EEechhhhhheecccceeeeeechhhHHH---HhhhhhhhcCceeccceEEEEEeCCC Confidence 11 11233345544332 3333 68899999999999999999988877 No 128 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.63 E-value=2.1e-09 Score=68.17 Aligned_cols=284 Identities=13% Similarity=0.043 Sum_probs=149.0 Q ss_pred HhccccccccHHHHHHHHHHHhccCCCCCceecc-HHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcceee Q lcl|Aclame:pro 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLP-EETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVW 135 (377) Q Consensus 59 ~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP-~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~w 135 (377) -.......++ ...-+++++-+-++ +.+..+|.+..+..+.++++.++.++. |+ ..||+. +...+.. T Consensus 1 m~~~~~~~~t----------~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~ 69 (334) T protein:vir:80 1 MTYPAANTHT----------RPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAG 69 (334) T ss_pred CCCCcCCCcc----------ccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeee Confidence 0000001110 00112222323444 899999999999999999999998875 43 688865 4555555 Q ss_pred ecccccccccccccceeEeecceeE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccCCCcce-- Q lcl|Aclame:pro 136 GDIFGEIKGQLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNGLLQPV-- 208 (377) Q Consensus 136 ~~e~~~~~~~~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G~~~P~-- 208 (377) ...+.++ +.+..+.++.+|....+ +.-..|..-=--.+..|+.+.+.+++++++++..|++++. |-....|. 
T Consensus 70 ~~~g~~l-~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~ 148 (334) T protein:vir:80 70 RKAGEEL-VVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHL 148 (334) T ss_pred ecCCCCC-CCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 5444444 33445667766666553 3333444433345667899999999999999999997742 22111111 Q ss_pred ------eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 209 ------GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 209 ------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) |+.......... .. ...++..+...+..++..+ .....|.......+.+++|.. T Consensus 149 ~~~~~~G~~~~~~~~g~~----~~--------------~~~~~~~l~~a~~~a~~~L--~e~dvp~~~~~~R~~vv~P~~ 208 (334) T protein:vir:80 149 KPAFHDGILLPSTISGLA----AD--------------AAADADVLVAAHRQGVEAM--VFRDLGDQLMSEGVTLLDPVI 208 (334) T ss_pred cccccCCcceeecccccc----cc--------------hhhhHHHHHHHHHHHHHHH--HhcCCCCCcCCceEEEeChHH Confidence 111111000000 00 0111222222221122211 112223222234567889988 Q ss_pred hhhhccc--cccc---CC--CCccccc---cCCCceEEecCCCCcce-----------EEEEecccEE-EEe-cce---- Q lcl|Aclame:pro 283 RWTLEAK--FTSR---NQ--FGEYVTV---LPHGITILESLAVETGK-----------AIAFVANRYD-AFM-ATA---- 335 (377) Q Consensus 283 ~~~~~~~--~~~~---~~--~G~~~~~---l~~~~~v~~s~~~~~~~-----------ii~gd~s~y~-~~~-~~~---- 335 (377) |+.|+.. +... +. ...|... ...|++|+.|+++|... ++-|||+.-. +.. ++. 
T Consensus 209 y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~ 288 (334) T protein:vir:80 209 FSFLLEHDRLMNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISA 288 (334) T ss_pred HHHHhcccccccceeccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEE Confidence 8877643 2211 11 1112211 12488999999999542 4566776532 222 222 Q ss_pred ----eeEEeec-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 336 ----STIEEYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 336 ----~~i~~~~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +..+..+ +.+|.. .+.+++-++.++++|+|.++++++.- T Consensus 289 ~~~~~~~e~~~~~~~~~d---~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 289 QVHPVSAQFWEEKKDFGH---YLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred EEeecceeeeechhhHHH---HHHHHHHcCCceeccceEEEEEEeee Confidence 2222222 222222 33455667889999999999999999 No 129 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=98.61 E-value=6.1e-09 Score=65.63 Aligned_cols=331 Identities=14% Similarity=0.104 Sum_probs=149.1 Q ss_pred CCccH--------HHHHHHHHHHHHHHHHH-HhccCHHHHHHHHHHHHHHHHHHHHH---------------------HH Q lcl|Aclame:pro 1 MAINL--------KELPKYREAVAELSAKI-SAGATPEEQEKLFEAAFTTMGDEILA---------------------KN 50 (377) Q Consensus 1 m~~~~--------~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------~~ 50 (377) |-... ..+...+++..+....- ...+..+...+. ....+.++.+... +. 
T Consensus 22 ~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~-~~~~~~~~~E~Rs~~~~i~~~~~~~r~~p~~~~vey 100 (410) T protein:vir:83 22 KESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQ-AQEVNRIAFETRSKGQAVDAAISAMRGSPVGTEVEY 100 (410) T ss_pred hheeeeccccccccccccchhhhccccccccCcccchhhhhHH-HHHHHHHHHHHHHHHHHHHhhhccCcCCCCCCCccc Confidence 11100 00001111000000000 000000001100 0001111100000 11 Q ss_pred HHHHHHHHHhccccccccHHHHHH---HHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEE Q lcl|Aclame:pro 51 EEEMERMFDLRDKNRELTAEEIKF---FNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALT 126 (377) Q Consensus 51 ~~~~~~~~~~~~~~~~lt~~e~~~---~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~~~~p~ 126 (377) ++.-+-........+ -..+-.+. +..+.....+.+-...||+++....|+.+....++.++..-.|..| .+.+|+ T Consensus 101 RSaGE~lkal~~~~~-Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v 179 (410) T protein:vir:83 101 RSAGEYMLDMWNSAQ-GNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPI 179 (410) T ss_pred ccHHHHHHHHhccCC-chHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEee Confidence 111110111100000 00111111 2223333333344446888899999999999999999876677766 578888 Q ss_pred EcCCcceee-e------cccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhc--- Q lcl|Aclame:pro 127 AETSGTAVW-G------DIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALEL--- 196 (377) Q Consensus 127 ~~~~~~a~w-~------~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~--- 196 (377) .+...+..- + .|.+.. +..+.+|+..+-..+.++++..+|++.++-|.+.+.+...+.|..+.+.+-+. 
T Consensus 180 ~t~~~tV~~q~~~~kqa~EGd~L-~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vr 258 (410) T protein:vir:83 180 VSQRPAVGLQGVAGGASDEKTEL-DSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVG 258 (410) T ss_pred ecccccccccccccccccccccc-cccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHH Confidence 766554321 1 122222 34556677777788899999999999999999999999999998888777665 Q ss_pred ceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 197 AIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 197 a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) ++|.++=++ ..+.. ..+.+.+ +. +++..+..+-.. .+ .-+...+ T Consensus 259 a~L~~t~t~-------------~~a~~---~~Tad~~----~~--------~i~da~~~v~da-----~~---~~~~~~i 302 (410) T protein:vir:83 259 AALASTSTG-------------AVGYG---NATADNV----AS--------AIWQAAGAVYTA-----VK---GMGRLVI 302 (410) T ss_pred HHHHHhhhh-------------hhhhh---hccHHHH----HH--------HHHHHHHHHhhh-----hc---cceeeeE Confidence 344332110 00000 0011111 11 111111110000 00 0011122 Q ss_pred EeccchhhhhcccccccCCC------------CccccccCCCceEEecCCCCcceEEEEecccEEEEecce--eeEEeec Q lcl|Aclame:pro 277 LLNPEDRWTLEAKFTSRNQF------------GEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATA--STIEEYD 342 (377) Q Consensus 277 ~~n~~~~~~~~~~~~~~~~~------------G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~--~~i~~~~ 342 (377) .+.|...-+..+.....++. 
|+.+.....++||+..+..+++++.|.|-.-.......+ +.+.-.+ T Consensus 303 ~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~ 382 (410) T protein:vir:83 303 AIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTAAIECFEQRVGTLQVVEPS 382 (410) T ss_pred EechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccceeeeeecCCceeEeeCCc Confidence 33333322222211111111 122333446889999999999999999888765555443 5555444 Q ss_pred hhhhhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 343 QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 343 ~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) -....++-.+|.+ -.+..+++++=|.=+ T Consensus 383 i~nLt~~ySgY~a-----~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 383 VFGLQVAYAGYFS-----TLVVNEDAIVPLVGS 410 (410) T ss_pred hhhhhhhheeeee-----eccccccceeeeccC Confidence 4455554434432 234555555444333 No 130 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.61 E-value=5.8e-09 Score=65.74 Aligned_cols=290 Identities=12% Similarity=0.026 Sum_probs=142.4 Q ss_pred HHhccccccccHHHHHHHHHHHhccCCCCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcc Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKDKF---KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGT 132 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg---~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~ 132 (377) ......++.++ .+-..+++.|. ..| +.|..++.+..+..+.+++++++.+.. |+ +.||+... .. 
T Consensus 1 ~~~~~~~~~~~---------t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~-~t 69 (347) T protein:vir:33 1 MANIQGGQQIG---------TNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR-TK 69 (347) T ss_pred CCCCccCcccc---------cccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccc-ee Confidence 11111111110 00011112222 234 899999999999999999999987654 44 67887644 33 Q ss_pred eeeeccccccc-ccccccceeEeec--ceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhccee-----eccCC Q lcl|Aclame:pro 133 AVWGDIFGEIK-GQLKQAFKEQDFS--QFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIV-----KGNGL 204 (377) Q Consensus 133 a~w~~e~~~~~-~~~~~~f~~i~l~--~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l-----~G~G~ 204 (377) +.......++. ...+.+..+.+|. ..++.. ..|.+-=--++..|+.+.+.++.+.++++..|+.|+ .+... T Consensus 70 ~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~-~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~ 148 (347) T protein:vir:33 70 AAYLKPGENLDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLP 148 (347) T ss_pred eeeecCCCCCCCCCCCCccceEEEEechhhhhh-HHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 34333233322 2233556665554 333332 233333223456789999999999999999999886 12222 Q ss_pred Ccceeeeecc---ccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccc Q lcl|Aclame:pro 205 LQPVGLLKDL---SQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPE 281 (377) Q Consensus 205 ~~P~Gil~~~---~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 281 (377) ..|.+..... ........+..... +. ..++..+++.+..+...+.. . .....+ .+++++|. 
T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~tg~~~--d~---------~~~a~~i~~~i~~a~~~Lde--~--~VP~~g-R~~vv~P~ 212 (347) T protein:vir:33 149 DGSNENIEGLGKPTVLTLVKPTTGSLT--DP---------VELGKAIIAQLTIARASLTK--N--YVPAAD-RTFYTTPD 212 (347) T ss_pred ccccccccccccccccccccccccccc--ch---------hhhHHHHHHHHHHHHHHHhh--c--CCCccC-cEEEeCHH Confidence 2221111000 00000000000000 00 01122222333222222221 1 222345 45778998 Q ss_pred hhhhhcccc--cccCCC--Cccccc---cCCCceEEecCCCCcceE----------------------EEEecccE---- Q lcl|Aclame:pro 282 DRWTLEAKF--TSRNQF--GEYVTV---LPHGITILESLAVETGKA----------------------IAFVANRY---- 328 (377) Q Consensus 282 ~~~~~~~~~--~~~~~~--G~~~~~---l~~~~~v~~s~~~~~~~i----------------------i~gd~s~y---- 328 (377) .|..|+... ...+.. +.+... ...|++|+.|+++|.+.+ .-++|+.. T Consensus 213 ~y~~Ll~~~~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~ 292 (347) T protein:vir:33 213 NYSAILAALMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLF 292 (347) T ss_pred HHHHHhccccccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeee Confidence 877665322 111111 111111 125788999999885321 11222211 Q ss_pred ----E--EEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 329 ----D--AFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 329 ----~--~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) . 
...-.++.++...+....-| .+++.+.++.++++|++.+.|++..= T Consensus 293 ~h~~A~g~v~~~~~~~e~~r~~~~~~d--~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 293 QHRSAVGTVKLKDLALERARRANYQAD--QIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred ecchhheeeeeeceeeeeccchhhhhH--hhhhhhhcCCceecccceEEEecCCC Confidence 0 12223345555433322222 67889999999999999999998877 No 131 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.61 E-value=1.5e-08 Score=63.50 Aligned_cols=291 Identities=12% Similarity=0.033 Sum_probs=140.0 Q ss_pred HHhccccccccHHHHHHHHHHHhccCCCCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcce Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKDKF--KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTA 133 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg--~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a 133 (377) ......++.+.. | -..+.+.+. .+-=+.|..++.+..+..|.+++++++.++. |+ +.||+... ..+ T Consensus 1 ma~~~~~~~~~t--~-------~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~-~t~ 70 (347) T protein:vir:15 1 MANIQGGQQIGT--N-------QGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR-TKA 70 (347) T ss_pred CCccccCCcccc--c-------cccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc-eee Confidence 111111111100 0 001111111 1223788999999999999999999887754 44 67887654 334 Q ss_pred eeeccccccc-ccccccceeEeec--ceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeec--cC----- Q lcl|Aclame:pro 134 VWGDIFGEIK-GQLKQAFKEQDFS--QFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKG--NG----- 203 (377) Q Consensus 134 ~w~~e~~~~~-~~~~~~f~~i~l~--~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G--~G----- 203 (377) .....+.++. ...+.+.++.+|. ..++.. 
..|.+-=-..+..|+.+.+.++.+.++++..|+.|+.= .+ T Consensus 71 ~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~-~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~ 149 (347) T protein:vir:15 71 AYLKPGENLDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPD 149 (347) T ss_pred eeeccCCCCCCCCCCCccceEEEEechhhhhh-HHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 4333332321 2233556665554 444333 23322222345678999999999999999999988620 00 Q ss_pred -CCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 204 -LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 204 -~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) ...+.+-.-..........++..... + ......+++.+..+...+ +... ....+ .+++++|.. T Consensus 150 ~~~~~~~~~g~~~~~~~~~~~~~~~~~--~---------~~~~~~i~d~~~~a~~~L--de~~--VP~~g-R~~vv~P~~ 213 (347) T protein:vir:15 150 ASNENIEGLGKPTVLTLVKPTTGDLTD--P---------VELGKAIIAQLTIARASL--TKNY--VPAAD-RTFYTTPDN 213 (347) T ss_pred cccccccccCccccccccccccccchh--h---------hhHHHHHHHHHHHHHHHH--hhcC--CCccC-CEEEeCHHH Confidence 00100000000000000000000000 0 001112222222222211 1112 22345 457789988 Q ss_pred hhhhccccccc--CCCC--cccc---ccCCCceEEecCCCCcceE----------------------EEEeccc------ Q lcl|Aclame:pro 283 RWTLEAKFTSR--NQFG--EYVT---VLPHGITILESLAVETGKA----------------------IAFVANR------ 327 (377) Q Consensus 283 ~~~~~~~~~~~--~~~G--~~~~---~l~~~~~v~~s~~~~~~~i----------------------i~gd~s~------ 327 (377) |..|+...... +..| .+.. ....|++|+.|+++|.+.+ +-++|+. 
T Consensus 214 y~~LL~~~~~~~~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~ 293 (347) T protein:vir:15 214 YSAILAALMPNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQ 293 (347) T ss_pred HHHHhcccccccccccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeee Confidence 87776432211 1111 1111 1125889999999884211 1112221 Q ss_pred --EEE--EecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 328 --YDA--FMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 328 --y~~--~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ..+ +...++.++.+.+....-| .+++.+.++.++++|++.+.+.+..= T Consensus 294 h~~A~g~v~~~~~~~e~~~~~~~~~d--~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 294 HRSAVGTVKLKDLALERARRANYQAD--QIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred ccceeeeeEeeceeeeecccchhhhh--hhehhhhcCCceeccccEEEEecCCC Confidence 112 2223445555543332223 67888899999999999999988877 No 132 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.60 E-value=5.9e-09 Score=65.68 Aligned_cols=286 Identities=11% Similarity=0.010 Sum_probs=143.6 Q ss_pred HHhccccccccHHHHHHHHHHHhccCCCCCce--eccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcce Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKDKFK--LLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTA 133 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~--lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a 133 (377) ......++.+. .+...+.++|.. +-=+.+..++.+.....+.+++++++..+. |+ ..+|+... 
..+ T Consensus 1 ma~~~~~~~~~---------t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~-~~~ 70 (347) T protein:vir:94 1 MANMNGGQQMG---------KDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGR-TKA 70 (347) T ss_pred CCccccccccc---------cccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccc-eeE Confidence 11011111110 000112222221 223899999999999999999999987754 44 68886543 344 Q ss_pred eeecccccccc-cccccceeEeecceeE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccC---- Q lcl|Aclame:pro 134 VWGDIFGEIKG-QLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNG---- 203 (377) Q Consensus 134 ~w~~e~~~~~~-~~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G---- 203 (377) ..+..+.+... ..+++.++.++..-++ +.-..|..-=--.+..|+.+.+.++.++++++..|++|+. +.. T Consensus 71 ~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:94 71 AYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTA 150 (347) T ss_pred eeeecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 43333333322 2346677766655443 2323343332334567899999999999999999998752 111 Q ss_pred -----CCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 204 -----LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 204 -----~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .+.|.|....+ ...+..... ...++...++.+..+...+. 
....| ..+ .++++ T Consensus 151 ~~~~~~g~~~~~~v~i-------~~~~~~~~~----------~~~~~~~~~d~i~~a~~~Ld--e~dVP--~~~-R~~vv 208 (347) T protein:vir:94 151 NNENIAGLGKAHVLEV-------GDQATLQGD----------QVKLGQAIIAQLTLARAKLT--GNYVP--SSD-RVFYT 208 (347) T ss_pred cccccccCCcceeEee-------ecccccccc----------ccccHHHHHHHHHHHHHHhh--hcCCC--CCC-CEEEe Confidence 11111111000 000000000 01122223333333322221 11222 334 45667 Q ss_pred ccchhhhhcccccccCCCC----ccc---cccCCCceEEecCCCCcce-------------------------EEEEecc Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFG----EYV---TVLPHGITILESLAVETGK-------------------------AIAFVAN 326 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G----~~~---~~l~~~~~v~~s~~~~~~~-------------------------ii~gd~s 326 (377) .|..|+.|+........+. .+. -....|++|+.|+++|.+. -+=+||+ T Consensus 209 ~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~ 288 (347) T protein:vir:94 209 TPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALD 288 (347) T ss_pred ChHHHHHHHHhhcccccccccccccccceeEEeeceEEEEcCccccccCccccccccccccccccccccccccccccccc Confidence 8988887764211111111 111 0111578899999988421 0113443 Q ss_pred cE--EE--------EecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 327 RY--DA--------FMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 327 ~y--~~--------~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +- .+ +.-.++.++...+..+..+ .+.+++=++.++.+|++.+.+.+++- T Consensus 289 ~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~~~~--~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 289 NVVGLFNHRSAVGTVKLKDMALERARRANFQAD--QIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ceEEEEechhhhhhhhhcccceeeeechhhhhh--hhhhhhhhcCcccccceeEEEEecCC Confidence 31 11 2223555555544433333 56788888999999999988777766 No 133 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.58 
E-value=4.2e-09 Score=66.49 Aligned_cols=287 Identities=14% Similarity=0.047 Sum_probs=146.4 Q ss_pred cccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcceeeeccccc Q lcl|Aclame:pro 64 NRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVWGDIFGE 141 (377) Q Consensus 64 ~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~w~~e~~~ 141 (377) -..+++.-|-.+ .++.++-...| +.+..++.+.+...+.++++.++.++. |+ ..+|+. +...+.....+.+ T Consensus 1 ms~~~~~tr~~~-----~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~ 73 (335) T protein:vir:63 1 MSFLNDLTRPNY-----AGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEE 73 (335) T ss_pred CCCcccchhhhc-----ccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcC Confidence 111111111111 22233333344 999999999999999999999988875 33 678876 3445554433333 Q ss_pred ccccccccceeEeecceeEE-EeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhccee----eccCCCcceeeeecc-- Q lcl|Aclame:pro 142 IKGQLKQAFKEQDFSQFKLT-AFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIV----KGNGLLQPVGLLKDL-- 214 (377) Q Consensus 142 ~~~~~~~~f~~i~l~~~k~~-~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l----~G~G~~~P~Gil~~~-- 214 (377) + ..+.+..++.++..-.+- +-..|-.----++..|+.+.+.+++++++++..|++++ .+-+..-|.++-... T Consensus 74 l-~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:63 74 L-ERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred c-CCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCC Confidence 3 334456677666555432 22333333333466789999999999999999999764 232221111110000 Q ss_pred ccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccC Q lcl|Aclame:pro 215 SQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN 294 (377) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~ 294 (377) ........++.... .++..+...+..++..+. 
....|.......+.+++|..|+.|+...-..| T Consensus 153 G~~~~~~~tg~~~~--------------~~~~~l~~a~~~a~~~L~--e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n 216 (335) T protein:vir:63 153 GVLEKLDLTGLTAK--------------QAADKIVRMHRRVVETFI--DRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMN 216 (335) T ss_pred CcceeeeeccCccc--------------ccHHHHHHHHHHHHHHHH--hccCCCcccCceEEEeChHHHHHHhccccccc Confidence 00000000000000 012222222222222111 11222211234567889988887764311111 Q ss_pred -----CCC--ccc---cccCCCceEEecCCCCcce-----------EEEEecccEE-EEec---------ceeeEEeech Q lcl|Aclame:pro 295 -----QFG--EYV---TVLPHGITILESLAVETGK-----------AIAFVANRYD-AFMA---------TASTIEEYDQ 343 (377) Q Consensus 295 -----~~G--~~~---~~l~~~~~v~~s~~~~~~~-----------ii~gd~s~y~-~~~~---------~~~~i~~~~~ 343 (377) .+| .|. -....|++|+.|+++|.+. .+-||++... +... .++..+...+ T Consensus 217 ~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~ 296 (335) T protein:vir:63 217 VEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWED 296 (335) T ss_pred cccccccccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeec Confidence 122 122 1122588999999998432 3445665432 2222 2222222222 Q ss_pred -hhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 344 -TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 344 -~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ..|.. 
.+.+++-++.++.+|+|.+++++++= T Consensus 297 ~~~~~~---~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 297 NEKFSW---VLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred cchhhH---HhHHHHHcCCcccccceEEEEEEcCC Confidence 22322 44566668899999999999998654 No 134 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.57 E-value=5e-09 Score=66.10 Aligned_cols=281 Identities=15% Similarity=0.075 Sum_probs=145.0 Q ss_pred cccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcceeeeccccc Q lcl|Aclame:pro 64 NRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVWGDIFGE 141 (377) Q Consensus 64 ~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~w~~e~~~ 141 (377) -..++..-|-. ..++.++-...| +.+..++.+.....+.++++.++.++. |+ ..+|+. +...+.....+.+ T Consensus 1 ms~~~~~t~~~-----~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~ 73 (335) T protein:vir:78 1 MSFLNDLTRPN-----YAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEE 73 (335) T ss_pred CCccccccccc-----cccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcc Confidence 00011111110 122233333334 899999999999999999999988875 43 688866 4445554433333 Q ss_pred ccccccccceeEeecceeEE-EeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhccee----eccCCCcce-------- Q lcl|Aclame:pro 142 IKGQLKQAFKEQDFSQFKLT-AFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIV----KGNGLLQPV-------- 208 (377) Q Consensus 142 ~~~~~~~~f~~i~l~~~k~~-~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l----~G~G~~~P~-------- 208 (377) . ..+.+..++.++..-.+- +-..|-.----++..|+.+.+.+++++++++..|++++ .+.+...|. 
T Consensus 74 l-~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 74 L-ERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred c-CCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 3 334456677666555432 22233333233566889999999999999999999765 222221111 Q ss_pred eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcc Q lcl|Aclame:pro 209 GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEA 288 (377) Q Consensus 209 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~ 288 (377) |+..... .++... ..++..+...+..+...+ .....|-......+.+++|..|+.|+. T Consensus 153 G~~~~~~------~tg~~~--------------~~~~~~l~~a~~~a~~~l--~ekdvP~~~~~~rv~vv~P~~y~~Ll~ 210 (335) T protein:vir:78 153 GVLEKLD------LTGLTA--------------KEAAEKIVRMHRRVVETF--IERDLGDAVYSEGLTPMSPRVFSLLLE 210 (335) T ss_pred Ccceeee------eccccc--------------cccHHHHHHHHHHHHHHH--HhccCCCCCCCccEEEeChHHHHHHhc Confidence 2111000 000000 011121222222222111 112223222234577889988887754 Q ss_pred c--ccc---cCCCCc--ccc---ccCCCceEEecCCCCcce-----------EEEEeccc-EEEEec---------ceee Q lcl|Aclame:pro 289 K--FTS---RNQFGE--YVT---VLPHGITILESLAVETGK-----------AIAFVANR-YDAFMA---------TAST 337 (377) Q Consensus 289 ~--~~~---~~~~G~--~~~---~l~~~~~v~~s~~~~~~~-----------ii~gd~s~-y~~~~~---------~~~~ 337 (377) . +.. .+.+|. |.. ....|++|+.|+++|.+. ..-+|++. ..+... .++. 
T Consensus 211 ~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~ 290 (335) T protein:vir:78 211 HDKLMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQ 290 (335) T ss_pred ccccccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecc Confidence 3 221 112221 221 122588999999999542 22235543 222221 2223 Q ss_pred EEeech-hhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 338 IEEYDQ-TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 338 i~~~~~-~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) -+...+ ..|.. .+.+++-++.++++|+|.+++++++- T Consensus 291 ~e~~~~~~~~~~---~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 291 AKLWEDHDQFSW---VLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred cceeeccchhhH---hhhHHHHcCCcccCcceEEEEEecCC Confidence 333322 22322 44566668899999999999998876 No 135 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.56 E-value=7e-09 Score=65.31 Aligned_cols=292 Identities=13% Similarity=0.016 Sum_probs=141.2 Q ss_pred HHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcceee Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVW 135 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~w 135 (377) .......+..+...+... .+++++-...| +.+..++.+.....+.+++++++.++. |+ .++|+.. ...+.. 
T Consensus 1 ma~~~~~~~~n~~~~~~~-----~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG-~~~~~~ 73 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDV-----MAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLG-RTQAAY 73 (344) T ss_pred CccccccccCCcccCCcc-----CCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeec-eeEEEe Confidence 000000000000000000 00000111133 889999999999999999999988875 43 6788763 344444 Q ss_pred eccccccccc-ccccceeEeecceeE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccCC----- Q lcl|Aclame:pro 136 GDIFGEIKGQ-LKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNGL----- 204 (377) Q Consensus 136 ~~e~~~~~~~-~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G~----- 204 (377) ...+.+.... .++.-++++|..-++ +.-..|..-=--.+..|+.+.+.++.++++++..|++++. +... T Consensus 74 ~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~ 153 (344) T protein:vir:10 74 LAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYN 153 (344) T ss_pred eecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Confidence 4433333321 245556644444332 2222333322234567899999999999999999987742 1111 Q ss_pred CcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhh Q lcl|Aclame:pro 205 LQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRW 284 (377) Q Consensus 205 ~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 284 (377) ..|.|.-+..... ........... ...+..+++.+..++..+. 
.+.....+ .+.+++|..|+ T Consensus 154 ~~~~g~~~~~~~~----~~~~~~~~t~~---------~~~~~~~~~~i~~a~~~Ld----e~~VP~~g-R~~vv~P~~y~ 215 (344) T protein:vir:10 154 ENITGLGTATVIE----TTQDKTTLTDQ---------VALGKEIIAALTKARAALT----KNYVPSSD-RVFYCDPDSYS 215 (344) T ss_pred cccccccccceee----cccccccccch---------hhhHHHHHHHHHHHHHHHh----hcCCCccC-CEEEeChHHHH Confidence 1122211100000 00000000000 0011122222222222221 11223445 45678998887 Q ss_pred hhcccccc--cC--CCCccccc---cCCCceEEecCCCCcce---------------------EEEEecccEE------- Q lcl|Aclame:pro 285 TLEAKFTS--RN--QFGEYVTV---LPHGITILESLAVETGK---------------------AIAFVANRYD------- 329 (377) Q Consensus 285 ~~~~~~~~--~~--~~G~~~~~---l~~~~~v~~s~~~~~~~---------------------ii~gd~s~y~------- 329 (377) .|+..... .+ +++.+... ...|++|+.|+++|.+. ...++|+.-. T Consensus 216 ~Ll~~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~ 295 (344) T protein:vir:10 216 AILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRS 295 (344) T ss_pred HHhhcccccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechh Confidence 66532211 11 11112111 12478999999887421 0112444321 Q ss_pred ---EEecceeeEEeec-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 330 ---AFMATASTIEEYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 330 ---~~~~~~~~i~~~~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .+.-.++.++.+. +.+|.. 
.+++.+-++.++++|++.+++.++.- T Consensus 296 A~~~v~~~~~~~e~~r~~~~~~d---~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 296 AVGTVKLRDLALERARRANFQAD---QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hhhhhhhccceeecccchhHHHH---HHHHHhhcccceecccceEEEEeecC Confidence 2222344556543 334443 67788999999999999977777777 No 136 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.56 E-value=4.7e-09 Score=66.26 Aligned_cols=288 Identities=10% Similarity=-0.022 Sum_probs=143.4 Q ss_pred HHhccccccccHHHHHHHHHHHhccCCCC--Cc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCc Q lcl|Aclame:pro 58 FDLRDKNRELTAEEIKFFNDIDKNVGGKD--KF--KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSG 131 (377) Q Consensus 58 ~~~~~~~~~lt~~e~~~~~~~~~~~~~s~--gg--~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~ 131 (377) ....... .+. ....+...+ |. .+-=+.+..++.+.....+.++++++++++. |+ .++|+. +.. T Consensus 1 ~~~~~~~------~~~----~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~ 69 (345) T protein:vir:22 1 MASMTGG------QQM----GTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRT 69 (345) T ss_pred Ccccccc------hhc----ccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cce Confidence 0000000 000 000000000 11 2333889999999999999999999998875 44 678876 444 Q ss_pred ceeeecccccccc-ccccccee--EeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccC- Q lcl|Aclame:pro 132 TAVWGDIFGEIKG-QLKQAFKE--QDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNG- 203 (377) Q Consensus 132 ~a~w~~e~~~~~~-~~~~~f~~--i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G- 203 (377) .+.....+.+... ..+++.++ |+++..++..+ .|..-=--.+..|+.+.+.++.++++++..|++++. +.. 
T Consensus 70 ~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~ 148 (345) T protein:vir:22 70 QAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNV 148 (345) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 5554443444322 23466777 55554444432 233222234567899999999999999999997762 110 Q ss_pred ----CCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 204 ----LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 204 ----~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) ++.|.|+-+................ ......+++.+..+...+ + .+.....+ .+.+++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~g~~~t~~-------------~~~~~~~~~ai~~a~~~L--d--e~~VP~~~-R~~vv~ 210 (345) T protein:vir:22 149 ESKYNENIEGLGTATVIETTQNKAALTDQ-------------VALGKEIIAALTKARAAL--T--KNYVPAAD-RVFYCD 210 (345) T ss_pred ccccccccccccccccccccccccccccc-------------ccCHHHHHHHHHHHHHHh--h--hcCCCccC-CEEEeC Confidence 1223322111111111101000000 001122222222222221 1 12233344 457789 Q ss_pred cchhhhhcccccccC----CCCcccc---ccCCCceEEecCCCCcce-----------------------E--------- Q lcl|Aclame:pro 280 PEDRWTLEAKFTSRN----QFGEYVT---VLPHGITILESLAVETGK-----------------------A--------- 320 (377) Q Consensus 280 ~~~~~~~~~~~~~~~----~~G~~~~---~l~~~~~v~~s~~~~~~~-----------------------i--------- 320 (377) |..|+.|+......+ +++.+.. ....|++|++|+++|.+. . 
T Consensus 211 P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 290 (345) T protein:vir:22 211 PDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIG 290 (345) T ss_pred hHHHHHHhccccccccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEE Confidence 988876653221111 1111111 112478899988876311 0 Q ss_pred EEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 321 i~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +|+-.+-...+...++.++.+.+.....| .+++.+-++.++++|+|.++|+++=- T Consensus 291 l~~h~~A~~~v~~~~~~~e~~r~~~~~~d--~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 291 LFMHRSAVGTVKLRDLALERARRANFQAD--QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred EEEehhheeeeeeecceeeeeechhHHHH--HHHHHHhcCCcccccceeEEEEEeeC Confidence 11111111223333455555543332223 67888889999999999999888877 No 137 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.53 E-value=4.2e-09 Score=66.50 Aligned_cols=282 Identities=14% Similarity=0.077 Sum_probs=141.2 Q ss_pred ccccccccHHHHHHHHHHHhccCCCCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEEcCCcceee Q lcl|Aclame:pro 61 RDKNRELTAEEIKFFNDIDKNVGGKDKF---KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAVW 135 (377) Q Consensus 61 ~~~~~~lt~~e~~~~~~~~~~~~~s~gg---~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~-~~~~p~~~~~~~a~w 135 (377) ......++.- +..+...+.+++. .+.=+.+..+|.+..+..|.+++++++.++. | .+.||+... ..+.. 
T Consensus 1 ~~~~~~~~~~-----~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~-~~~~~ 74 (332) T protein:vir:78 1 MTTLSNFSLP-----NQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK-LSAGY 74 (332) T ss_pred CcccccccCC-----ccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccc-eeEee Confidence 0000001000 0011122222332 1333899999999999999999999987764 4 378887643 33333 Q ss_pred ecccccccccccccceeEee--cceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccCCCccee Q lcl|Aclame:pro 136 GDIFGEIKGQLKQAFKEQDF--SQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNGLLQPVG 209 (377) Q Consensus 136 ~~e~~~~~~~~~~~f~~i~l--~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G~~~P~G 209 (377) ...+.......+++-.+++| +..++.. ..|..-=-..+..|+.+.+.++.++++++..|+.++. +-..+-|.+ T Consensus 75 ~~~g~~l~~~~~~~~~~~~l~ID~~ky~~-~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~ 153 (332) T protein:vir:78 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVT 153 (332) T ss_pred ecCCCCCCCCCCCCCceEEEEEehhhhhH-HHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccc Confidence 33233332222334444444 4333333 2332211224566899999999999999999987752 111111111 Q ss_pred eeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc Q lcl|Aclame:pro 210 LLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) Q Consensus 210 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 289 (377) ..-. ...... ++.. ..++..+++.+..+...+.. +.....+ .+++++|..|+.|+.. 
T Consensus 154 ~~~g--~~~~~~-~~~~---------------~~~~~~~~~~i~~a~~~Lde----~~VP~~g-R~~vv~P~~y~~Ll~~ 210 (332) T protein:vir:78 154 GEPG--GFHVNI-GAGN---------------TNDAQAIVDGFFEAAAVLDE----RSAPQEG-RVAVLSPRQYYSLISS 210 (332) T ss_pred cccc--cccccc-CCcc---------------ccCHHHHHHHHHHHHHHHhh----cCCCccC-CEEEeCHHHHHHHHhh Confidence 1000 000000 0000 01222233333333332221 1223344 4566799888777541 Q ss_pred ----ccc---cCCCCccccc----cCCCceEEecCCCCcce--------------EEEEecccEE--EEec--------c Q lcl|Aclame:pro 290 ----FTS---RNQFGEYVTV----LPHGITILESLAVETGK--------------AIAFVANRYD--AFMA--------T 334 (377) Q Consensus 290 ----~~~---~~~~G~~~~~----l~~~~~v~~s~~~~~~~--------------ii~gd~s~y~--~~~~--------~ 334 (377) +.. .+.+|..... ...|++|+.|+++|... .+-|||++.. +.-+ . T Consensus 211 ~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~ 290 (332) T protein:vir:78 211 VDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSV 290 (332) T ss_pred cCceeeeeeccccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeee Confidence 111 1223332211 12488999999998421 2344555421 1112 2 Q ss_pred eeeEEee----chhhhhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 335 ASTIEEY----DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 335 ~~~i~~~----~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) ++.+++. ++.+|. 
..+++.+.++.++++|++.++|+-+ T Consensus 291 ~~~~~~t~~~~~~~~~~---d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 291 APTIQTTSGDFNVQYQG---DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred ccchhhhhcccchhhhH---hhhhhhhhhcCceecccceEEEeeC Confidence 3333321 233333 2678888999999999999999999 No 138 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.49 E-value=4.5e-08 Score=60.86 Aligned_cols=288 Identities=9% Similarity=-0.030 Sum_probs=140.4 Q ss_pred ccHHHHHHHHHHHhc-cCCCCCceecc-HHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEEcCCcceeeecccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN-VGGKDKFKLLP-EETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAVWGDIFGEI 142 (377) Q Consensus 67 lt~~e~~~~~~~~~~-~~~s~gg~lvP-~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~-~~~~p~~~~~~~a~w~~e~~~~ 142 (377) ++. -|..... .++++.-.-+. +.+..++.+.....+.++++.++.++. | ..++|+.. ...+.....+.+. T Consensus 1 ms~-----~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG-~~~~~~~~~G~~l 74 (364) T protein:vir:10 1 MSN-----PNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIG-ETELQVLSPGKSP 74 (364) T ss_pred CCC-----cccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeee-eeEEeeeccCccc Confidence 000 0001111 11111112233 889999999999999999999998875 3 37888763 3444444322232 Q ss_pred cccccccceeEeecceeEE-EeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHhhcceee---ccCCCcceeeeeccccc Q lcl|Aclame:pro 143 KGQLKQAFKEQDFSQFKLT-AFVVIPKDALKFGPKW-LKQFITEQLKEAIAVALELAIVK---GNGLLQPVGLLKDLSQP 217 (377) Q Consensus 143 ~~~~~~~f~~i~l~~~k~~-~~~~iS~ell~ds~~~-~~~~l~~~la~~~a~~~~~a~l~---G~G~~~P~Gil~~~~~~ 217 (377) ..+.+..++.+|..-.+- +-..|-.----++.+| +.+.+.+++++++++..|++++. --+..+-.+..+... 
T Consensus 75 -d~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~-- 151 (364) T protein:vir:10 75 -DASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPR-- 151 (364) T ss_pred -CCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCc-- Confidence 344566677666555432 2222222112234566 78999999999999999998742 000000000000000 Q ss_pred cccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc--cccc-- Q lcl|Aclame:pro 218 TVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK--FTSR-- 293 (377) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~~-- 293 (377) ....+........... ...++..+...+......+ + .+-....+ .+.+++|..|+.++.. +... T Consensus 152 ~~~~g~~i~~~~~a~~-------~~~~~~~l~~ai~~a~~~L--d--EkdVP~~~-R~~vv~P~~y~~Ll~~~~lvn~d~ 219 (364) T protein:vir:10 152 VAGHGFSIHIVGLASS-------FLTSPQYMMAAIEMAMEQQ--T--EQEVDTSE-LCGLMPWTAFNCLRDADRIVDKSY 219 (364) T ss_pred ccCCcceeeecccCcc-------hhhhHHHHHHHHHHHHHHH--h--hcCCCccc-cEEEeChHHHHHHhcCCccccccc Confidence 0000000000000000 0011122222222222211 1 22233344 5677899888777643 2211 Q ss_pred --CCCCccccc---cCCCceEEecCCCCcc---------------------eE--EEEecccE--EEEec--------ce Q lcl|Aclame:pro 294 --NQFGEYVTV---LPHGITILESLAVETG---------------------KA--IAFVANRY--DAFMA--------TA 335 (377) Q Consensus 294 --~~~G~~~~~---l~~~~~v~~s~~~~~~---------------------~i--i~gd~s~y--~~~~~--------~~ 335 (377) .+.|.|... ...|++|+.|+++|.. .- ..+|++.. 
.++.+ .+ T Consensus 220 ~~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~ 299 (364) T protein:vir:10 220 TIAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTIS 299 (364) T ss_pred cccCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEec Confidence 122334321 2258899999999831 00 12455432 22222 34 Q ss_pred eeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 336 STIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 336 ~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +..+...+.....+ .+.+++=++.++.+|+|+++++.++. T Consensus 300 ~t~e~~~~~~~~~~--~ida~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 300 ITGDIFYEKKEKTW--YIDTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred ceeeeeeccceeee--eeeeehcccCcccCccceEEEEecCC Confidence 44444433221111 33466668899999999999998887 No 139 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.34 E-value=1.9e-08 Score=62.91 Aligned_cols=283 Identities=12% Similarity=0.072 Sum_probs=139.0 Q ss_pred hccc-cccccHHHHHHHHHHHhccCCCCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCccee Q lcl|Aclame:pro 60 LRDK-NRELTAEEIKFFNDIDKNVGGKDKF--KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAV 134 (377) Q Consensus 60 ~~~~-~~~lt~~e~~~~~~~~~~~~~s~gg--~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~ 134 (377) +... .+.+. .+...+.+++. .+-=+.|..+++...+..+.+++++++.++. |+ +.||+.. ...+. 
T Consensus 1 m~~~~~~~~~---------t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG-~~tv~ 70 (347) T protein:vir:94 1 MANVPGQKIG---------TDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMG-RTSGV 70 (347) T ss_pred CCCCCccccc---------cccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEeccc-ceeee Confidence 0000 00000 00111222222 2223889999999999999999999988764 44 6788764 34444 Q ss_pred eeccccccccc-cccccee--EeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee--c--cC-CC- Q lcl|Aclame:pro 135 WGDIFGEIKGQ-LKQAFKE--QDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK--G--NG-LL- 205 (377) Q Consensus 135 w~~e~~~~~~~-~~~~f~~--i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~--G--~G-~~- 205 (377) ....+...... .+.+=.+ |+++..++.. ..|..-=--.+..|+.+.+.++.+.++++..|++|+. . .+ ++ T Consensus 71 ~~t~G~~l~~~~~~~~~~e~~itID~~~~~~-~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~ 149 (347) T protein:vir:94 71 YLAPGERLSDKRKGIKHTEKVITIDGLLTAD-VMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAA 149 (347) T ss_pred eecCCCCcCCCCCCCCcceEEEEecchhhhh-HHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 43333333111 1233444 4444443322 2222211223456899999999999999999997752 1 11 11 Q ss_pred ---cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 206 ---QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 206 ---~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) .+.|+-.. ............+. ..++..+++.+..++..+. .. .....+ .+.+++|.. 
T Consensus 150 ~~~~~~g~~~~---s~~~~~~~~~~~~~-----------~~~~~~~~~~i~~a~~~Ld--e~--~VP~~~-R~~vv~P~~ 210 (347) T protein:vir:94 150 SNENIAGLGTA---SVLEVGKKADLDTP-----------AKLGEAIIGQLTIARAKLT--SN--YVPAGD-RYFYTTPDN 210 (347) T ss_pred cccccCCCccc---ceeeccccccccch-----------hhhHHHHHHHHHHHHHHHh--hc--CCCCCC-cEEEeCHHH Confidence 11121100 00000000000000 0112222333322222221 11 122234 456789988 Q ss_pred hhhhcccccccCC---------CCccccccCCCceEEecCCCCcce-----------E---------------EEEeccc Q lcl|Aclame:pro 283 RWTLEAKFTSRNQ---------FGEYVTVLPHGITILESLAVETGK-----------A---------------IAFVANR 327 (377) Q Consensus 283 ~~~~~~~~~~~~~---------~G~~~~~l~~~~~v~~s~~~~~~~-----------i---------------i~gd~s~ 327 (377) |+.|+......+. +|.-.++ .|++|+.|+++|.+. + +-+||++ T Consensus 211 ~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i--~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~ 288 (347) T protein:vir:94 211 YSAILAALMPNAANYAALIDPETGNIRNV--MGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDN 288 (347) T ss_pred HHHHhccchhhhhhccccccccccceEEE--eceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccc Confidence 8766533221111 1221122 578899999998421 1 1122322 Q ss_pred E-EE---------EecceeeEEeech-hhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 328 Y-DA---------FMATASTIEEYDQ-TFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 328 y-~~---------~~~~~~~i~~~~~-~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) - .+ +...+++++...+ .+|. 
| .+++.+-++.++++|++.++|+.++- T Consensus 289 ~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~-d--~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 289 VVGLFSHRSAVGTVKLRDLALERDRDVDAQG-D--LIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred eeEEEeehhhhhhhhcccccccchhchhhHH-H--HhhhhhhhcCcccccceeEEEEecCC Confidence 1 11 1122334554433 3333 3 78899999999999999999999877 No 140 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.34 E-value=1.3e-08 Score=63.73 Aligned_cols=277 Identities=10% Similarity=0.001 Sum_probs=134.5 Q ss_pred HHHHHHHHh-ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe---cCC-ceEEEEEcCCcceeeecccccccccc Q lcl|Aclame:pro 72 IKFFNDIDK-NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN---TSL-RLKALTAETSGTAVWGDIFGEIKGQL 146 (377) Q Consensus 72 ~~~~~~~~~-~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~---~~~-~~~~p~~~~~~~a~w~~e~~~~~~~~ 146 (377) ..+.|.+.. .-+++.....||+-++.+|++.++....+.++++-.+ .+| .++||+.. .+.+.-....+.+. -. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~-~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVG-VQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCccc-cc Confidence 011111000 0112223346899999999999998888888765432 234 47899764 44454443333332 23 Q ss_pred cccceeEeeccee-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeecc--CCCcceeeeeccccccccccc Q lcl|Aclame:pro 147 KQAFKEQDFSQFK-LTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGN--GLLQPVGLLKDLSQPTVDQST 223 (377) Q Consensus 147 ~~~f~~i~l~~~k-~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~--G~~~P~Gil~~~~~~~~~~~~ 223 (377) +.+-.++++...+ .+.-+.|+..-...+..|+.+.+.++.++++++++|+.++.-- +++++.+. ...... 
T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~-------~~~~~~ 151 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQN-------VFSSSN 151 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCc-------cccCcc Confidence 3444555555533 2444667775555567899999999999999999998876321 11111110 000000 Q ss_pred cccccccchhhhhhhhhhccChH-HHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc--ccccCCCCc-- Q lcl|Aclame:pro 224 GRDITTYKTDKEAIADLSDLDPD-TAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK--FTSRNQFGE-- 298 (377) Q Consensus 224 ~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~~~~~G~-- 298 (377) ..... ++. ...+.+..+... ++... ....+ .+++++|..+..|+.. +...+..|+ T Consensus 152 ~~~t~---------------~~~~~~~~~i~~a~~~--Lde~~--VP~~g-R~lvv~P~~~~~Ll~~~~~~~~~~~g~~~ 211 (341) T protein:vir:94 152 GAITG---------------NGQAFSFAVFLAARRL--LLEAD--VPEEK-IVLLISPGQESALFTIPQFISKDFINNAP 211 (341) T ss_pred ccccC---------------chhhhhHHHHHHHHHH--HhhcC--CCccC-CEEEeCHHHHHHHhhchhhhhhhccccch Confidence 00000 000 001111111111 11111 22334 4567899887776532 111111111 Q ss_pred ccc---ccCCCceEEecCCCCcceEEE---------------------------EecccE--E------EEecceeeEE- Q lcl|Aclame:pro 299 YVT---VLPHGITILESLAVETGKAIA---------------------------FVANRY--D------AFMATASTIE- 339 (377) Q Consensus 299 ~~~---~l~~~~~v~~s~~~~~~~ii~---------------------------gd~s~y--~------~~~~~~~~i~- 339 (377) +.. ...+|++|+.|+++|.+.... 
+|++.+ + ++..+.+..+ T Consensus 212 l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~ 291 (341) T protein:vir:94 212 IAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDW 291 (341) T ss_pred hheeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchh Confidence 111 112588899999888543110 111111 1 1111100100 Q ss_pred --------eechhhhh--cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 --------EYDQTFAM--EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 --------~~~~~~f~--~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ...+..|. .-...+++.+-++.++.+|++.|.|..+|- T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 292 AAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred hhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcC Confidence 00011111 112356688888999999999887777666 No 141 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.26 E-value=9.7e-08 Score=59.04 Aligned_cols=294 Identities=11% Similarity=0.071 Sum_probs=142.3 Q ss_pred ccHHHHHHHHHHHhccCCCCCc-----eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcceeeeccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVGGKDKF-----KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVWGDIF 139 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg-----~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~w~~e~ 139 (377) ++..-...+-.....+....|| .+-=+.|..++.+..+..+.+++++++.++. |+ +++|+.. 
...+.....+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG-~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTG-RMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeee-eeEEeeecCC Confidence 1100000000000000000111 2333889999999999999999999988765 44 6788763 3444433322 Q ss_pred cccc--ccccccce--eEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee----ccCCCcceeee Q lcl|Aclame:pro 140 GEIK--GQLKQAFK--EQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK----GNGLLQPVGLL 211 (377) Q Consensus 140 ~~~~--~~~~~~f~--~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G~~~P~Gil 211 (377) .++- +..+.+-. .++++..++..+ .|..-=--.+..|+.+.+.++.++++++.+|++++. |-....|.+.- T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~-~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~ 158 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSA-FVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSAT 158 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 2221 11122333 355555444432 233222234567899999999999999999997752 22222222111 Q ss_pred eccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc-- Q lcl|Aclame:pro 212 KDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK-- 289 (377) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~-- 289 (377) .....+......++.. ......++..+++.+..+...+.. +.....++ +.+++|..|+.++.. 
T Consensus 159 ~~~~~Gg~~i~~~sg~----------~~~~~~ta~~~~~ai~~a~~~Lde----~~VP~~~R-~~vv~P~~y~~Ll~~~d 223 (375) T protein:vir:10 159 NFVEPGGTQIRVGSGT----------NESDAFTASALVNAFYDAAAAMDE----KGVSSQGR-CAVLNPRQYYALIQDIG 223 (375) T ss_pred cccccCcceeeecccc----------ccccccCHHHHHHHHHHHHHHHhh----cCCCCCCC-EEEeChHHHHHHHhcCC Confidence 1000000000000000 000011233334444333332221 22234454 467899887666532 Q ss_pred ---ccccC--CCCcccc---ccCCCceEEecCCCCcce-------------------------------------EEEEe Q lcl|Aclame:pro 290 ---FTSRN--QFGEYVT---VLPHGITILESLAVETGK-------------------------------------AIAFV 324 (377) Q Consensus 290 ---~~~~~--~~G~~~~---~l~~~~~v~~s~~~~~~~-------------------------------------ii~gd 324 (377) +...+ ++|.+.. ....|++|+.|+.+|... -+-+| T Consensus 224 ~~~~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d 303 (375) T protein:vir:10 224 SNGLVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTN 303 (375) T ss_pred ccceeeecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeecccccccccc Confidence 22111 1221111 123588899988887321 12224 Q ss_pred c---cc-E---------EEEecceeeEEeec-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 325 A---NR-Y---------DAFMATASTIEEYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 325 ~---s~-y---------~~~~~~~~~i~~~~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) | ++ . ..+.-.++.++++. 
+..-.+-...+.+.+=++..+.+|+|.|.|+..|- T Consensus 304 ~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 304 AELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred ccccCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 4 21 1 11122345555542 12223334467788899999999999999998855 No 142 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.25 E-value=2.8e-08 Score=62.03 Aligned_cols=274 Identities=12% Similarity=0.034 Sum_probs=136.6 Q ss_pred HHhccCCCCCceec-cHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEEcCCcceeeecccccc-cccccccceeE Q lcl|Aclame:pro 78 IDKNVGGKDKFKLL-PEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTAVWGDIFGEI-KGQLKQAFKEQ 153 (377) Q Consensus 78 ~~~~~~~s~gg~lv-P~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~-~~~~p~~~~~~~a~w~~e~~~~-~~~~~~~f~~i 153 (377) ......++++-.+| |+.++.+|..-+.+......++++...+ | .++||....-....+. ..+.+ .++.+.+=-.+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~-~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRP-EQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccccccccccc-CCCCcccccCCCceEEE Confidence 11122345554555 9999999998887776656666654432 4 4788865443333332 22222 12222222356 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceee--ccCCCcceeeeeccccccccccccccccccc Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVK--GNGLLQPVGLLKDLSQPTVDQSTGRDITTYK 231 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~--G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 231 (377) .+...|+.++. |+.+..+ ...+|.+...++.+++++...|+.+.. -+|..+-.++ +......+. 
..-.+ T Consensus 80 ~IDq~KYfaf~-VdDD~~Q-a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~------~~p~vin~~-~~~iv 150 (322) T protein:vir:31 80 ILRDEVYAGNA-ISKKLRQ-DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQ------NDPNVINGV-PHRFV 150 (322) T ss_pred EEehhhhhccc-cchhHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc------CCcceecCC-cccee Confidence 66777777665 7886655 568999999999999999999986632 1222110000 000000000 00000 Q ss_pred hhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh---------c--ccccccCCCCc-- Q lcl|Aclame:pro 232 TDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL---------E--AKFTSRNQFGE-- 298 (377) Q Consensus 232 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~---------~--~~~~~~~~~G~-- 298 (377) -+..++...++.+..+...+ +.......+ .+.+++|.-+..| . +..+....+|. T Consensus 151 --------~~gt~~~~ay~~lv~l~~kL----dkanVP~~g-R~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~ 217 (322) T protein:vir:31 151 --------GTGTDQTMDVTDFSRVNYVM----TQSKMPMGG-MIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAP 217 (322) T ss_pred --------ccCCCchhhHHHHHHHHHHh----ccccCCCCC-eEEEeCchhhhhhhhhhhhhhhhccccccccccccchh Confidence 01112223334443333222 112223334 4556778654322 1 11111222332 Q ss_pred --cccccCCCceEEecCCCCcce--EEEE-e--------cccE----------EEEeccee-eEEee-chhhhhcCcEEE Q lcl|Aclame:pro 299 --YVTVLPHGITILESLAVETGK--AIAF-V--------ANRY----------DAFMATAS-TIEEY-DQTFAMEDLQLY 353 (377) Q Consensus 299 --~~~~l~~~~~v~~s~~~~~~~--ii~g-d--------~s~y----------~~~~~~~~-~i~~~-~~~~f~~~~~~~ 353 (377) +.-.-.+|..|+.|+.+++++ +..| | .+.+ ++.-++.| .-+.. 
++.+| -..+ T Consensus 218 g~~~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~---~d~~ 294 (322) T protein:vir:31 218 DMQFVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYND---DLNT 294 (322) T ss_pred hHHHHHHHhceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCcccc---ccce Confidence 111122577889999886433 2222 1 1111 11111111 10111 11122 2368 Q ss_pred EEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 354 LTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 354 ~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |+.+|++-++..+|.+++|.-+|- T Consensus 295 ~~~~~~g~g~~r~e~l~~~~a~~~ 318 (322) T protein:vir:31 295 ATTARWGNGLVRDENLVCVLANAD 318 (322) T ss_pred eeeeeecceeecccceEEEEeccc Confidence 999999999999999999999998 No 143 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.18 E-value=2e-07 Score=57.28 Aligned_cols=286 Identities=12% Similarity=-0.000 Sum_probs=128.0 Q ss_pred HHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec---CC-ceEEEEEcCCcc Q lcl|Aclame:pro 57 MFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT---SL-RLKALTAETSGT 132 (377) Q Consensus 57 ~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~---~~-~~~~p~~~~~~~ 132 (377) +......+... ...-..++.-.+||+.++.+|++.+++...+.++++.... .| .++||+.. .+. 
T Consensus 1 ~~~~~~~~~~~-----------~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~ 68 (381) T protein:vir:80 1 MATIQGTGGYK-----------GSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAA 68 (381) T ss_pred Cceeccccccc-----------CcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cce Confidence 00000000000 0001111223478999999999999988888887765433 23 47888764 456 Q ss_pred eeeecccccccccccccceeEeecceeE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeecc--CCCccee Q lcl|Aclame:pro 133 AVWGDIFGEIKGQLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGN--GLLQPVG 209 (377) Q Consensus 133 a~w~~e~~~~~~~~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~--G~~~P~G 209 (377) +..+.+.+.+. ..+.+..++++...+. +.-+.|+..-...+..|+.+.+.+.++.++++..|+.++.-- ....+.+ T Consensus 69 a~d~~~g~~i~-~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~ 147 (381) T protein:vir:80 69 VYDKQPQTPVN-LQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQ 147 (381) T ss_pred eeeecCCCccc-ccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 66665554443 3445666666665443 334577776666667899999999999999999999886421 1111110 Q ss_pred e-eeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcc Q lcl|Aclame:pro 210 L-LKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEA 288 (377) Q Consensus 210 i-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~ 288 (377) . ..... ........ ...+.......+..+..++..+ +....| ..+ .+++++|..+..|+. 
T Consensus 148 ~~~t~~~--~i~~~~~~------------~~~t~~~~~~t~~~i~~a~~~L--de~~VP--~eg-R~lvv~P~~~~~Ll~ 208 (381) T protein:vir:80 148 RIYSYDT--TLGDGTVN------------AHLTGTPAPLTYAALLLAKQKL--DEADVP--QEG-RIVMVSPAQYIDLLS 208 (381) T ss_pred ccccccc--cccccccc------------cccccchhhHHHHHHHHHHHHH--hhcCCC--cCC-cEEEeCHHHHHHHhh Confidence 0 00000 00000000 0000001111122222222211 111222 234 467789988877653 Q ss_pred cccccCC---------CCccccccCCCceEEecCCCCcceEEEEeccc-EEEEecceeeEEeechhhhhcCcEEEEEEEE Q lcl|Aclame:pro 289 KFTSRNQ---------FGEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTKNY 358 (377) Q Consensus 289 ~~~~~~~---------~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~-y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r 358 (377) .....+. +|...+ .+|++|+.|+.+|.+.+..-.... .-......++-.. ....|..+-...+.... T Consensus 209 ~~~~~~ad~~~~~~l~~G~Ig~--i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~-~~g~~s~~a~av~~~k~ 285 (381) T protein:vir:80 209 INQFISVDFSQVKPVTSGVVGT--ILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSP-YLPDQAGTANVVNTGSA 285 (381) T ss_pred chhhhhhhhccchhhhceeeeE--EcceEEEeecccccccccceeeeccccccccccccccc-cccccccceeeeeeeee Confidence 2111111 122222 268889999999864321000000 0000001111000 11112233334455555 Q ss_pred EcCEEecc-cceEEEE-----eecC Q lcl|Aclame:pro 359 FYGKAKDN-HTAALLT-----LAGG 377 (377) Q Consensus 359 ~dg~~~~~-~af~~l~-----~~a~ 377 (377) +|.++... 
..+-+.+ -+.| T Consensus 286 yd~~~~~~~~~~~~~~g~~~~~~~~ 310 (381) T protein:vir:80 286 SDLAVSLSYFGLPVFSGAGATAADG 310 (381) T ss_pred eceeeeeeeccceeeecceeeecCC Confidence 55544322 1111111 1111 No 144 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.11 E-value=1.9e-07 Score=57.43 Aligned_cols=242 Identities=17% Similarity=0.161 Sum_probs=130.5 Q ss_pred HHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEEcCCccee Q lcl|Aclame:pro 57 MFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAV 134 (377) Q Consensus 57 ~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~~~p~~~~~~~a~ 134 (377) +...... .+|-.|. ...+-|......|||.+.+.++|+..+.+.... ......+.++-+.+. T Consensus 1 m~~~~~~--~~TL~e~--------------Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~ 64 (328) T protein:vir:95 1 MAVKGLT--ALTLADW--------------GKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSAT 64 (328) T ss_pred CCccccc--cccHHHH--------------HhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCce Confidence 0000000 1111111 011345557789999999999999999998763 347888999999999 Q ss_pred eecccccccccccccceeEeecceeEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHhhcceeeccCCCcceee-- Q lcl|Aclame:pro 135 WGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGL-- 210 (377) Q Consensus 135 w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gi-- 210 (377) |..-+...+ .++.++.+++-...-+.+.+.|.+.+.+... 
.++.+.=.....++++..+...||||+.+..|.++ T Consensus 65 fR~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~G 143 (328) T protein:vir:95 65 WRLLNYGVQ-PSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMG 143 (328) T ss_pred eeecCCccC-cccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcc Confidence 987776664 6778999999999999999999999997762 23334445568999999999999999987666644 Q ss_pred -eecc---cccc-----ccccccccccccc-------------h-hhh------hhh--hhhcc---------------- Q lcl|Aclame:pro 211 -LKDL---SQPT-----VDQSTGRDITTYK-------------T-DKE------AIA--DLSDL---------------- 243 (377) Q Consensus 211 -l~~~---~~~~-----~~~~~~~~~~~~~-------------~-~~~------~~~--~l~~~---------------- 243 (377) -+.. +... .+.+++...+..+ + ..+ .++ ++..+ T Consensus 144 L~~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~ 223 (328) T protein:vir:95 144 LSSRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDN 223 (328) T ss_pred hhhhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeee Confidence 2322 1100 0111111111100 0 000 000 00000 Q ss_pred -------------------------ChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccC---- Q lcl|Aclame:pro 244 -------------------------DPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN---- 294 (377) Q Consensus 244 -------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~---- 294 (377) ....+++.|.. 
....-|....++.+|+||..-.-.+..+...++ T Consensus 224 Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~-------a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~ 296 (328) T protein:vir:95 224 GLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVK-------ALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAI 296 (328) T ss_pred eeEEcCcccEEEEecCcccccccccChhhHHHHHHH-------HHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceee Confidence 11111222111 112234556678888888654433332222111 Q ss_pred ----CCCccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEE Q lcl|Aclame:pro 295 ----QFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) Q Consensus 295 ----~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~ 352 (377) -.|..++.+ .|+||...+++--. +..++ T Consensus 297 ~~~~~~g~~~t~~-~gipir~~dai~~t-----------------------------E~~vv 328 (328) T protein:vir:95 297 SVKETEGEWWTSF-RGVPIRETDALLET-----------------------------EARVV 328 (328) T ss_pred eeeccCCcceeEE-CCeEEEEEeeeecC-----------------------------ccccC Confidence 122222222 34555444433211 11111 No 145 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.09 E-value=2.6e-07 Score=56.67 Aligned_cols=245 Identities=9% Similarity=-0.003 Sum_probs=116.7 Q ss_pred hceeEecCCceEEEEEcCCcceeeeccccccc-cccccccee--EeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHH Q lcl|Aclame:pro 112 VINFKNTSLRLKALTAETSGTAVWGDIFGEIK-GQLKQAFKE--QDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKE 188 (377) Q Consensus 112 ~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~-~~~~~~f~~--i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~ 188 (377) +++-+.-+...++|+. +...+....-+.++. ...+..=++ |+++..++..+. 
|..-=--.+..|+.+...++.++ T Consensus 1 ~vr~i~~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~-VdDiD~~qa~~Dlr~e~s~~~G~ 78 (324) T protein:vir:99 1 MTRTITSGKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVL-IYDIEDAMNHYDVRSEYSTQMGE 78 (324) T ss_pred CeeeeecCceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhh-hhhHHHHhcCccchhHHHHHHHH Confidence 3332222234788876 344444443233321 112234444 555555544432 22222223567899999999999 Q ss_pred HHHHHhhcceee----ccCCCcceeeeecc-ccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 189 AIAVALELAIVK----GNGLLQPVGLLKDL-SQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVND 263 (377) Q Consensus 189 ~~a~~~~~a~l~----G~G~~~P~Gil~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 263 (377) ++++.+|++++. +-....|.+--+.. .+.+.....+.... . ...++..+++.+..+...+.. T Consensus 79 aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~--~---------~~~~~~~~~dai~~a~~~Lde-- 145 (324) T protein:vir:99 79 ALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKE--D---------PAKYGTQVIQALTYARAAFAK-- 145 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccccccc--c---------cccCHHHHHHHHHHHHHHHhh-- Confidence 999999987751 00000000000000 00000000000000 0 011233333433333322221 Q ss_pred hhhhhcccCceEEEeccchhhhhcccccccC----CCCccccc---cCCCceEEecCCCCcceE---------------- Q lcl|Aclame:pro 264 KKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN----QFGEYVTV---LPHGITILESLAVETGKA---------------- 320 (377) Q Consensus 264 ~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~----~~G~~~~~---l~~~~~v~~s~~~~~~~i---------------- 320 (377) +.....+ .+.+++|..|+.|+......+ +.|.+... ...|++|+.|+++|.... 
T Consensus 146 --~~VP~~g-R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~ 222 (324) T protein:vir:99 146 --KYIPAGD-RTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPA 222 (324) T ss_pred --cCCCCCC-CEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccc Confidence 2223345 457789988776543221111 12222221 115889999999985311 Q ss_pred ---------EEEecccE----------EEEecceeeEEeec-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 321 ---------IAFVANRY----------DAFMATASTIEEYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 321 ---------i~gd~s~y----------~~~~~~~~~i~~~~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +-+|++.- ..+.-.++..+.+. +.+|.. .+++.+-++.++.+|+|.+++++.+| T Consensus 223 ~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d---~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 223 TGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQAD---QIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred ccccccccccccccCceeEEEEehhheEEEeeecceecceechhhHHH---hhhhhhhhcCcccccceEEEEEEccC Confidence 12333321 12222333444443 333333 57788888999999999999999999 No 146 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.05 E-value=4e-07 Score=55.64 Aligned_cols=280 Identities=9% Similarity=-0.034 Sum_probs=140.7 Q ss_pred ccHHHHHHHHHHHhcc--CCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-c-eEEEEEcCCcceeeecccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNV--GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-R-LKALTAETSGTAVWGDIFGEI 142 (377) Q Consensus 67 lt~~e~~~~~~~~~~~--~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~-~~~p~~~~~~~a~w~~e~~~~ 142 (377) +|. .+.....+ +..+--.+.=+.+..++.+.....+.++++..++++.+ + .++|+. 
+...+.....+.++ T Consensus 1 Ms~-----~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~l 74 (400) T protein:vir:10 1 MST-----PNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (400) T ss_pred CCC-----CccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCc Confidence 000 00000000 00011124457889999999999999999999998754 3 678876 44555554433333 Q ss_pred cccccccceeEeecceeE-EEeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHhhcceee----c-c-CCCcc----eee Q lcl|Aclame:pro 143 KGQLKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKW-LKQFITEQLKEAIAVALELAIVK----G-N-GLLQP----VGL 210 (377) Q Consensus 143 ~~~~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~-~~~~l~~~la~~~a~~~~~a~l~----G-~-G~~~P----~Gi 210 (377) ..+.+..++..|..-.+ .+-..|..----++.+| +.+.+.+++++++++.+|++++. + - -+..| -|+ T Consensus 75 -dg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~ 153 (400) T protein:vir:10 75 -AATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVK 153 (400) T ss_pred -CCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcc Confidence 34456666666655443 23333332222344577 78999999999999999997752 1 0 01122 222 Q ss_pred eeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc- Q lcl|Aclame:pro 211 LKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK- 289 (377) Q Consensus 211 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~- 289 (377) -...+... .+.... ...++..+...+......+ .....| .++.+.++.|..|..++.. 
T Consensus 154 ~~g~s~~v----~~~~~~------------~~~~~~~l~~A~~~A~~~L--dEkdVP---~~d~vvl~pp~~Ys~Ll~~d 212 (400) T protein:vir:10 154 GHGFSVNV----EVNEGE------------ALVNPQYVMAAVEFALEQQ--LEQEVD---ISDVAILMPWRYFNVLRDAD 212 (400) T ss_pred ccccceee----cccccc------------cccCHHHHHHHHHHHHHHH--HhcCCC---ccceEEEcCHHHHHHHHhCC Confidence 11111000 000000 0012222222222222222 112222 2456666666554444322 Q ss_pred -ccccC----CCCcccc---ccCCCceEEecCCCCcce-------E----------EEEecccEE--EEecceeeE---- Q lcl|Aclame:pro 290 -FTSRN----QFGEYVT---VLPHGITILESLAVETGK-------A----------IAFVANRYD--AFMATASTI---- 338 (377) Q Consensus 290 -~~~~~----~~G~~~~---~l~~~~~v~~s~~~~~~~-------i----------i~gd~s~y~--~~~~~~~~i---- 338 (377) +.... ++|.|.. +...|++|+.|+++|... + +-||++.-. ++.+..+-+ T Consensus 213 kLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~ 292 (400) T protein:vir:10 213 RIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSI 292 (400) T ss_pred cccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEee Confidence 21111 2233432 223689999999998421 1 236776532 233332221 Q ss_pred ----Eee-chhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 339 ----EEY-DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 339 ----~~~-~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.. +...|.. 
.+.+++-++..+.+|+|.++++.+-+ T Consensus 293 ~lt~~~~~d~r~~~~---~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 293 DVIGDIFYEKKEKTY---YIDTFMSEGAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred ccccccccchhhHHH---HHHHHHHhCCcccchhheEEEEecCC Confidence 111 2222222 33466678889999999999999988 No 147 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=97.95 E-value=7.7e-06 Score=48.61 Aligned_cols=271 Identities=13% Similarity=0.086 Sum_probs=140.8 Q ss_pred ccHHHHHHHHHHHhccCCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeEe-cCC---ceEEEEEcCCcceeeecccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVGGKDKFKLLP--EETMVQVFDDLVAEHPLLKVINFKN-TSL---RLKALTAETSGTAVWGDIFG 140 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg~lvP--~~~~~~Ii~~~~~~s~l~~~~~v~~-~~~---~~~~p~~~~~~~a~w~~e~~ 140 (377) |+-+ ..++.|-+++. +.+.+.|++.....-..+.++.+.. .+. ...+++....+.+.|....+ T Consensus 1 ~~~~-----------~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~ 69 (296) T protein:vir:10 1 MGVD-----------KADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYT 69 (296) T ss_pred Cccc-----------chhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCc Confidence 1111 11222223332 3345566655544444444444332 221 23556666677888876554 Q ss_pred cccccccccceeEeecceeEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccc Q lcl|Aclame:pro 141 EIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFG---PKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQP 217 (377) Q Consensus 141 ~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds---~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~ 217 (377) ...+..+..+.......+.++.-+.++.+=|+.+ ..++..--....+++++..+|+.+++|+...+-.|+||.+... 
T Consensus 70 ~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~ 149 (296) T protein:vir:10 70 DDLPLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNIN 149 (296) T ss_pred cccceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCc Confidence 4334555677788888888888888876666433 5678888888999999999999999999877788999986653 Q ss_pred cccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCC Q lcl|Aclame:pro 218 TVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFG 297 (377) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G 297 (377) ...+.. .+. ++....+.+..++..+..... ...+...++++|..+..+.... +..| T Consensus 150 ~~~~~~-----~W~------------~~t~i~~Di~~~~~~l~~~s~----g~~~p~~l~L~p~~~~~L~~~~---~~~~ 205 (296) T protein:vir:10 150 NVVSGG-----SWS------------QPTTAVSDITSLLDIIETSTN----GQHRATHLLLPTTARRIMQNLV---PGTS 205 (296) T ss_pred cccccC-----Ccc------------CHHHHHHHHHHHHHHHHHhhC----ceecceeEEeCHHHHHHHhhcc---CCCC Confidence 322111 111 111122222222222211111 1223446778887665543221 2222 Q ss_pred c----cccccCCCceEEecCCCC----c--ceEEEEecc--cEEEEecceeeEEeechhhhhcCcEEEEEEEEEc-CEEe Q lcl|Aclame:pro 298 E----YVTVLPHGITILESLAVE----T--GKAIAFVAN--RYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFY-GKAK 364 (377) Q Consensus 298 ~----~~~~l~~~~~v~~s~~~~----~--~~ii~gd~s--~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d-g~~~ 364 (377) . ++.-...++.++..+.+. . +.+++.+-+ ...+.....+.... .+. ..-...++...|+. ..+. 
T Consensus 206 ~t~l~~ik~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~-~e~--~~l~~~~~~~~~~~Gv~i~ 282 (296) T protein:vir:10 206 VSYGEFFRQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALP-AQP--KDLHFKIPVTSKATGLIVY 282 (296) T ss_pred ccHHHHHHHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeec-ccc--cCceEEEeeEeeEEEEEEE Confidence 1 111111233333322221 1 123444433 22334334444322 122 12233566788886 5778 Q ss_pred cccceEEE---Eee Q lcl|Aclame:pro 365 DNHTAALL---TLA 375 (377) Q Consensus 365 ~~~af~~l---~~~ 375 (377) .|.|++++ |.+ T Consensus 283 ~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 283 RPLTMAVMKGITFA 296 (296) T ss_pred CCceeEEEeeeecC Confidence 89999998 677 No 148 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.95 E-value=3.5e-07 Score=55.95 Aligned_cols=264 Identities=12% Similarity=-0.013 Sum_probs=140.1 Q ss_pred ccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC--Cce---EEEEEcCCcceeeeccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS--LRL---KALTAETSGTAVWGDIFGE 141 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~---~~p~~~~~~~a~w~~e~~~ 141 (377) ++.++ +-....+-+..+--+|.+++-..+.+-..++...+..|++ +.+ ++|..+...++.-+.|+++ T Consensus 1 M~~e~--------nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~ 72 (303) T protein:vir:10 1 MSAEN--------NLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDV 72 (303) T ss_pred CCCCc--------CCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcc Confidence 11100 0111122233444567777666666666677777888875 345 4555555566666665555 Q ss_pred ccccccccce---eEeecceeEEEeehhhHHHHhcC-HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccc Q lcl|Aclame:pro 142 IKGQLKQAFK---EQDFSQFKLTAFVVIPKDALKFG-PKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQP 217 (377) Q Consensus 142 
~~~~~~~~f~---~i~l~~~k~~~~~~iS~ell~ds-~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~ 217 (377) + +.++.+.. ..++..+|++.-+ |.|-++.| --+-.+.-.+.|...++.++++.|+.= ++..+. T Consensus 73 I-plskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~---------lktaT~- 139 (303) T protein:vir:10 73 I-PLTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFET---------LKSAIE- 139 (303) T ss_pred c-chhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---------Hhhccc- Confidence 5 46666543 4777788887744 99998543 334567778889999999999988741 100000 Q ss_pred cccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCC-- Q lcl|Aclame:pro 218 TVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ-- 295 (377) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~-- 295 (377) +...+... ......+...++.... ..........+++..|||.|.++++........ T Consensus 140 t~~~t~~t----------------~~s~~glq~Al~~~~~-----kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t 198 (303) T protein:vir:10 140 NGKRTNKT----------------KLSAENLQGALSKGRA-----NLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGA 198 (303) T ss_pred ccccccce----------------eecHHHHHHHHHhhhh-----hccccccccccEEEEEchHHHHHHhhcCCcchhhh Confidence 00000000 0001111111111100 011112234578999999999987654332211 Q ss_pred -CC-ccccccCCCceEEecCCCCcceEEEE---ecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEc---------- Q lcl|Aclame:pro 296 -FG-EYVTVLPHGITILESLAVETGKAIAF---VANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFY---------- 360 (377) Q Consensus 296 -~G-~~~~~l~~~~~v~~s~~~~~~~ii~g---d~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d---------- 360 (377) -| +|+..+ +|..++.|..+|+|+++.= +..-+++..++++. 
..-.+..|++++.+..+.- T Consensus 199 ~fG~n~L~nf-LG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~----~~f~~t~D~tglIGv~h~~~~~~~t~eT~ 273 (303) T protein:vir:10 199 QFGVNLLTPY-VGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELS----RAFAFATDATGFVGVLHDIQPQRLTSDTI 273 (303) T ss_pred hhhhhhhhhh-hcceEEEeccCCCceEEEeeccceEEEEecCchhhh----hhhhhccccccceEEEeccccceeeehhH Confidence 13 233211 3556788999999997643 23334455544332 2223445666666654321 Q ss_pred ---CE---EecccceEEEEeecC Q lcl|Aclame:pro 361 ---GK---AKDNHTAALLTLAGG 377 (377) Q Consensus 361 ---g~---~~~~~af~~l~~~a~ 377 (377) |- |-..+++++.+|+++ T Consensus 274 ~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 274 YASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred hHhHHHhcccccceEEEEEEecc Confidence 11 334578999999999 No 149 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=97.87 E-value=1.3e-06 Score=52.89 Aligned_cols=253 Identities=10% Similarity=-0.022 Sum_probs=137.2 Q ss_pred HhccCCCCCceeccH---HHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEEcCCcceeeecccccccccccccce-- Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPE---ETMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAVWGDIFGEIKGQLKQAFK-- 151 (377) Q Consensus 79 ~~~~~~s~gg~lvP~---~~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~-- 151 (377) +++.+.+..--++|. ++.+++-+.+.+...++...+..|++ ..+++|...-.+++.-+.|++++ +.+..+.. 
T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~I-plskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETI-PLSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCccc-chhhheeeee Confidence 333322222224433 34444444445555566666888875 46899998877788777665555 46666654 Q ss_pred -eEeecceeEEEeehhhHHHHhcC-HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccc Q lcl|Aclame:pro 152 -EQDFSQFKLTAFVVIPKDALKFG-PKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITT 229 (377) Q Consensus 152 -~i~l~~~k~~~~~~iS~ell~ds-~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~ 229 (377) ..++..+|+..- +|.|-++.| --+-.+.-.+.|..+++.++++.|+.-=. .++.... +. T Consensus 80 ~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lk------------tat~t~t-g~---- 140 (295) T protein:vir:99 80 KDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLK------------TKPTKVK-GV---- 140 (295) T ss_pred eeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhc------------cCceeee-hh---- Confidence 366677777763 499988544 33456778889999999999998885111 0000000 00 Q ss_pred cchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc--ccCC--CC-ccccccC Q lcl|Aclame:pro 230 YKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT--SRNQ--FG-EYVTVLP 304 (377) Q Consensus 230 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~--~~~~--~G-~~~~~l~ 304 (377) . ....++.....+... ......+.++.+||.|.+.++.... 
+..+ -| +|+.-+ T Consensus 141 --~---------------lq~a~a~~~~al~~f----~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nf- 198 (295) T protein:vir:99 141 --G---------------LQKALSASWAKLATF----NEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNF- 198 (295) T ss_pred --h---------------HHHHHHHhhhhhhhc----ccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhhh- Confidence 0 000001000000000 1112246788999999988764432 2222 22 343221 Q ss_pred CCce-EEecCCCCcceEEEE---ecccEEEEec-ceeeEEeechhhhhcCcEEEEEEEEEc-------------CE---E Q lcl|Aclame:pro 305 HGIT-ILESLAVETGKAIAF---VANRYDAFMA-TASTIEEYDQTFAMEDLQLYLTKNYFY-------------GK---A 363 (377) Q Consensus 305 ~~~~-v~~s~~~~~~~ii~g---d~s~y~~~~~-~~~~i~~~~~~~f~~~~~~~~~~~r~d-------------g~---~ 363 (377) +|.. ++.|..+|+|++|.= +..-+++... +++. .--.+..|++++.+..+.- |- | T Consensus 199 LG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~----~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfp 274 (295) T protein:vir:99 199 LGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLG----GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFA 274 (295) T ss_pred hccceEEEcccCCCceEEEeeccceEEEEecCCchhhh----hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcc Confidence 4664 888999999997643 2333334443 3333 1112345555665554321 11 3 Q ss_pred ecccceEEEEeecC Q lcl|Aclame:pro 364 KDNHTAALLTLAGG 377 (377) Q Consensus 364 ~~~~af~~l~~~a~ 377 (377) -..+++++.+|+++ T Consensus 275 E~~dgiv~~tI~~~ 288 (295) T protein:vir:99 275 EIPEGVVEATIEAA 288 (295) T ss_pred cccceEEEEEEecC Confidence 34578999999887 No 150 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.84 E-value=5.5e-07 Score=54.89 Aligned_cols=280 Identities=9% Similarity=-0.035 Sum_probs=138.3 Q ss_pred ccHHHHHHHHHHHhccC-C-CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-c-eEEEEEcCCcceeeecccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNVG-G-KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-R-LKALTAETSGTAVWGDIFGEI 142 (377) Q Consensus 67 
lt~~e~~~~~~~~~~~~-~-s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~-~-~~~p~~~~~~~a~w~~e~~~~ 142 (377) +|. .+.....+. + .+-=.+.=+.+..++.+.....+.++++..++++.+ + .++|+. +...+.....+.+. T Consensus 1 Ms~-----~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~l 74 (401) T protein:vir:70 1 MST-----PNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (401) T ss_pred CCC-----CccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCc Confidence 000 000000000 0 011124447889999999999999999999998754 3 688876 34455444333333 Q ss_pred cccccccceeEeecceeEE-EeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHhhcceee-----cc----CC-Ccceee Q lcl|Aclame:pro 143 KGQLKQAFKEQDFSQFKLT-AFVVIPKDALKFGPKW-LKQFITEQLKEAIAVALELAIVK-----GN----GL-LQPVGL 210 (377) Q Consensus 143 ~~~~~~~f~~i~l~~~k~~-~~~~iS~ell~ds~~~-~~~~l~~~la~~~a~~~~~a~l~-----G~----G~-~~P~Gi 210 (377) ..+.+..++..|..-.+- +-..|-.----++.+| +.+.+.+++++++++..|+.++. |- +. ..|.|. T Consensus 75 -d~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~ 153 (401) T protein:vir:70 75 -AATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVK 153 (401) T ss_pred -CCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcC Confidence 344566777655544432 2222222112244566 78999999999999999986632 10 00 111111 Q ss_pred eeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc- Q lcl|Aclame:pro 211 LKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK- 289 (377) Q Consensus 211 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~- 289 (377) -.... ....+. ......++..+...+...+..+ +....| .++++.++.|..|..++.. 
T Consensus 154 ~~G~~----i~v~~~------------~~~~~~~~~~l~~ai~dA~~~L--dEkdVP---~~r~vvl~pp~~Ys~Ll~~d 212 (401) T protein:vir:70 154 GHGFS----INVEVA------------EGEALVNPQYVMAAVEFALEQQ--LEQEVD---ISDVAILMPWRYFNVLRDAD 212 (401) T ss_pred CCceE----Eecccc------------ccccccCHHHHHHHHHHHHHHH--HhcCCC---ccceEEEcCHHHHHHHHhcC Confidence 10000 000000 0001123333333333333322 222223 2466666666554444322 Q ss_pred -cccc----CCCCcccc---ccCCCceEEecCCCCcce---------------E--EEEecccE--EEEecceeeE---- Q lcl|Aclame:pro 290 -FTSR----NQFGEYVT---VLPHGITILESLAVETGK---------------A--IAFVANRY--DAFMATASTI---- 338 (377) Q Consensus 290 -~~~~----~~~G~~~~---~l~~~~~v~~s~~~~~~~---------------i--i~gd~s~y--~~~~~~~~~i---- 338 (377) +... .++|.|.. +...|++|+.++++|.+. . +-|||+.- .++.+..+-+ T Consensus 213 ~L~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~ 292 (401) T protein:vir:70 213 RIVDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSI 292 (401) T ss_pred cccchhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEee Confidence 2111 12344442 233689999999998521 1 12566542 2223332221 Q ss_pred ----Eeechh-hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 339 ----EEYDQT-FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 339 ----~~~~~~-~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +..++. .|.. 
.+.+++-++..+.+|+|.++++.+-+ T Consensus 293 ~lt~~~~~d~r~~~~---~id~~~a~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 293 DVTGDIFYEKKEKTY---YIDTFMAEGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred ccccchhhhhhhhHH---HHHHHHHhCCcccchhheEEEeecCc Confidence 112222 2222 23366667889999999999988776 No 151 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=97.80 E-value=1.1e-06 Score=53.34 Aligned_cols=280 Identities=10% Similarity=-0.017 Sum_probs=138.0 Q ss_pred ccHHHHHHHHHHHhc-cCCCCCceec-cHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEEcCCcceeeecccccc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN-VGGKDKFKLL-PEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVWGDIFGEI 142 (377) Q Consensus 67 lt~~e~~~~~~~~~~-~~~s~gg~lv-P~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~-~~~p~~~~~~~a~w~~e~~~~ 142 (377) ++. -|..... .++++.-.-+ =+.+..++.+.....+.++++.++.++. |+ .++|+.. ...+.....+.+. T Consensus 1 Ms~-----~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG-~~~a~y~~~G~~l 74 (402) T protein:vir:97 1 MST-----PNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLG-ETELQVLAPGQSP 74 (402) T ss_pred CCC-----cccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEe-eeEEeeecccccc Confidence 000 0001111 1111111223 3889999999999999999999998875 33 6888763 3444444332232 Q ss_pred cccccccceeEeecceeEE-EeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHhhcceee-----cc----CC-Ccceee Q lcl|Aclame:pro 143 KGQLKQAFKEQDFSQFKLT-AFVVIPKDALKFGPKW-LKQFITEQLKEAIAVALELAIVK-----GN----GL-LQPVGL 210 (377) Q Consensus 143 ~~~~~~~f~~i~l~~~k~~-~~~~iS~ell~ds~~~-~~~~l~~~la~~~a~~~~~a~l~-----G~----G~-~~P~Gi 210 (377) ..+.+..++.+|..-.+- +-..|-.----++.+| +.+.+.+++++++++..|+.++. |- +. ..|.+. 
T Consensus 75 -dg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~ 153 (402) T protein:vir:97 75 -NATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) T ss_pred -CCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccc Confidence 344566666655554432 2222222111234566 78999999999999999997742 10 00 011111 Q ss_pred eeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc- Q lcl|Aclame:pro 211 LKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK- 289 (377) Q Consensus 211 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~- 289 (377) -.......+ .+... ...++..+...+...+..+. .+-....+ .+.+++|..|+.++.. T Consensus 154 ~~g~s~~~~--~t~~~--------------a~~~~~~l~~ai~~a~~~Ld----EkdVP~~d-Rv~vv~P~~y~~Ll~~~ 212 (402) T protein:vir:97 154 GHGFSINVN--VTESE--------------ALANPQYVMAAVEYALEQQL----EQEVDISD-VAIMMPWKFFNALRDAD 212 (402) T ss_pred ccccccccc--cccch--------------hhcCHHHHHHHHHHHHHHHH----hcCCCccc-cEEEeChHHHHHHhhcc Confidence 110000000 00000 01122222232222222221 12233345 4678899877766533 Q ss_pred -ccc----cCCCCcccc---ccCCCceEEecCCCCcce---------------E--EEEeccc--EEEEecceeeE---- Q lcl|Aclame:pro 290 -FTS----RNQFGEYVT---VLPHGITILESLAVETGK---------------A--IAFVANR--YDAFMATASTI---- 338 (377) Q Consensus 290 -~~~----~~~~G~~~~---~l~~~~~v~~s~~~~~~~---------------i--i~gd~s~--y~~~~~~~~~i---- 338 (377) +.. ..+.|.|.. ....|++|+.|+++|..- . +-||++. .+++.+..+-. 
T Consensus 213 rl~n~d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~ 292 (402) T protein:vir:97 213 RIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTI 292 (402) T ss_pred cccchhhccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEee Confidence 111 123343432 223588999999998521 1 2256653 22333332221 Q ss_pred ----Eee-chhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 339 ----EEY-DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 339 ----~~~-~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.. +...|.. .+.+++-++..+.+|+|..++++.-| T Consensus 293 ~vT~~~~~d~r~~~~---~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (402) T protein:vir:97 293 EVTGDIFYEKKEKTY---YIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) T ss_pred ccccchhhchhHHHH---HHHHHHHhCCcccCccceEEEEEecc Confidence 111 1112222 23355667888999999999988765 No 152 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.79 E-value=1.9e-05 Score=46.45 Aligned_cols=290 Identities=15% Similarity=0.115 Sum_probs=141.0 Q ss_pred cccccccHHHHHHHHHH-H----hccCCCCCceecc---HHHHHHHHHHHHhhhhhhhhceeEe-cC-C--ceEEEEEcC Q lcl|Aclame:pro 62 DKNRELTAEEIKFFNDI-D----KNVGGKDKFKLLP---EETMVQVFDDLVAEHPLLKVINFKN-TS-L--RLKALTAET 129 (377) Q Consensus 62 ~~~~~lt~~e~~~~~~~-~----~~~~~s~gg~lvP---~~~~~~Ii~~~~~~s~l~~~~~v~~-~~-~--~~~~p~~~~ 129 (377) +........|....... . ......+.|++.- +.+.+.|++.....-..+.++.+.. .+ + ...+.+... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~ 80 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDK 80 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeecc Confidence 11111111111111110 0 0111112243333 3344456665555545555555432 22 2 134556666 Q ss_pred CcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHhhcceeeccCCCc Q lcl|Aclame:pro 130 SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFG---PKWLKQFITEQLKEAIAVALELAIVKGNGLLQ 206 (377) Q Consensus 130 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds---~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~ 206 (377) .+.+.|.+..+...+..+..+.......+.++.-+.+|..=|+.+ ..++..--....++++++.+|+.+++|+...+ T Consensus 81 ~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g 160 (319) T protein:vir:10 81 VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHK 160 (319) T ss_pred ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc Confidence 778888865544334555667777778888887777776655433 56678888889999999999999999998878 Q ss_pred ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh Q lcl|Aclame:pro 207 PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) Q Consensus 207 P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~ 286 (377) -.|++|.+..............+ ..+....+.+..++..+..... 
.......++++|..+..+ T Consensus 161 ~~GLlN~p~~~~~~~~~~~~~~t-------------~t~~~i~~di~~~~~~l~~~s~----g~~~p~~L~L~p~~~~~L 223 (319) T protein:vir:10 161 IVSVFNHPNITKITSGKWIDVST-------------MKPETAEAELTQAIETIETITR----GQHRATNILIPPSMRKVL 223 (319) T ss_pred ceeEEeCCCceeeecCCCCCccc-------------cCHHHHHHHHHHHHHHHHHhcC----ceeeceEEEecHHHHHhh Confidence 89999987654433222111111 1223333333333333222111 122445788888876555 Q ss_pred cccccccCCCCc----cccccCCCceEEecCCCC----cc--eEEEEeccc-E-EEEecceeeEEeechhhhhcCcEEEE Q lcl|Aclame:pro 287 EAKFTSRNQFGE----YVTVLPHGITILESLAVE----TG--KAIAFVANR-Y-DAFMATASTIEEYDQTFAMEDLQLYL 354 (377) Q Consensus 287 ~~~~~~~~~~G~----~~~~l~~~~~v~~s~~~~----~~--~ii~gd~s~-y-~~~~~~~~~i~~~~~~~f~~~~~~~~ 354 (377) .... +..|. ++.....++.++..+.+. .+ .+++...+. + .+.....+..... +.+-. ...+. T Consensus 224 ~~~~---~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~~~l--~~~~~ 297 (319) T protein:vir:10 224 AIRM---PETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QPKDL--HFKVP 297 (319) T ss_pred hccc---CCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeee-eecCc--eEEEe Confidence 3221 22231 111111233444333222 12 133333322 2 2333333332211 21111 11233 Q ss_pred EEEEEcC-EEecccceEEEEeecC Q lcl|Aclame:pro 355 TKNYFYG-KAKDNHTAALLTLAGG 377 (377) Q Consensus 355 ~~~r~dg-~~~~~~af~~l~~~a~ 377 (377) ...|+.| .+..|.|+++++ | T Consensus 298 ~~~r~~Gv~i~~P~ai~~~d---G 318 (319) T protein:vir:10 298 CTSKCTGLTIYRPMTIVLIT---G 318 (319) T ss_pred eeeeeEEEEEEccceeEeee---c Confidence 4566654 456778877765 4 No 153 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.78 E-value=2.9e-06 Score=50.94 Aligned_cols=264 Identities=15% Similarity=-0.011 Sum_probs=139.3 Q ss_pred cccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceE-EEEEcCCcceeeecc Q lcl|Aclame:pro 62 
DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLK-ALTAETSGTAVWGDI 138 (377) Q Consensus 62 ~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~-~p~~~~~~~a~w~~e 138 (377) +.-..--+|+. -....+-+...--+|.+++-..+.+-..++...+..|++. .++ +|......++.-+.| T Consensus 1 ~~~~~~~~e~n--------lt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaE 72 (296) T protein:vir:98 1 MVTSRTYPEEN--------LIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) T ss_pred CCCccccCcCC--------CcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccC Confidence 00000000000 1112233333445677777666666666777778888863 464 354555666766665 Q ss_pred ccccccccccccee---EeecceeEEEeehhhHHHHhcC-HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecc Q lcl|Aclame:pro 139 FGEIKGQLKQAFKE---QDFSQFKLTAFVVIPKDALKFG-PKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDL 214 (377) Q Consensus 139 ~~~~~~~~~~~f~~---i~l~~~k~~~~~~iS~ell~ds-~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~ 214 (377) ++++ +.+..+... .++..+|++.- +|.|-++.| --+-.+.-.+.|...++.++++.|+.-=. T Consensus 73 Ge~I-plskvt~~~~~t~t~~ikK~rK~--tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~Lk----------- 138 (296) T protein:vir:98 73 GEVI-PLSKVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALK----------- 138 (296) T ss_pred Cccc-chhhheeeecceEEEEeeccccc--cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHh----------- Confidence 5555 466666543 66677777765 499998544 33456777889999999999998874110 Q ss_pred ccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc-ccCceEEEeccchhhhhccccc-- Q lcl|Aclame:pro 215 SQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK-IAGQVKLLLNPEDRWTLEAKFT-- 291 (377) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~n~~~~~~~~~~~~-- 291 (377) .++.. ... +......++. ..+..+.. ...+ ...+.+..+||.|.++++.... 
T Consensus 139 -taT~t--~~~---t~~~lQ~Ala--------~~~~~l~~-----------~feded~~~~V~FVnP~D~a~ylg~a~it 193 (296) T protein:vir:98 139 -TGTGT--QDA---LGAGLQGALA--------SAWGKLQV-----------LFEDYGSERAIVFANSLDVAEYIAKAGIT 193 (296) T ss_pred -cccce--eee---chhhHHHHHH--------HHhhhhhh-----------hccccCCCceEEEEehHHHHHHhcCCccc Confidence 00000 000 0000000000 00011111 1111 1246889999999988764332 Q ss_pred ccCCC-Ccccc-ccCCCceEEecCCCCcceEEEE---ecccEEEEecceeeEEeechhhhhcCcEEEEEEEEE------- Q lcl|Aclame:pro 292 SRNQF-GEYVT-VLPHGITILESLAVETGKAIAF---VANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYF------- 359 (377) Q Consensus 292 ~~~~~-G~~~~-~l~~~~~v~~s~~~~~~~ii~g---d~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~------- 359 (377) ..+.- ++|.. .| |..++.|..+|+|+++.= +..-+++..+.| +....-.+..|++++.+..+. T Consensus 194 ~qt~fG~tyl~nfL--G~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~---~l~~~f~~~~d~tglIGv~h~~~~~~~t 268 (296) T protein:vir:98 194 TQTAFGLTYLVDFT--GTVIISTNDVTKGEIWATVPENIIFAYINPNNS---ELAKEFNLYGDPTGYIGMNHFQENTTLT 268 (296) T ss_pred hhheechhhhhhcc--ccEEEEcCcCCCceEEEeeecceEEEeeccccc---chhhhhccccccccceEEEeccccceee Confidence 22222 34554 43 567899999999997643 233344444422 111222244455666555432 Q ss_pred ------cCE---EecccceEEEEeecC Q lcl|Aclame:pro 360 ------YGK---AKDNHTAALLTLAGG 377 (377) Q Consensus 360 ------dg~---~~~~~af~~l~~~a~ 377 (377) -|- |-..+++++.+|++| T Consensus 269 ~eT~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 269 IQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred ehhHhHhHHHhcccccceEEEEEecCC Confidence 111 334578999999999 No 154 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.74 E-value=1.6e-06 Score=52.38 Aligned_cols=283 Identities=13% Similarity=-0.027 Sum_probs=138.1 Q ss_pred ccHHHHHHHHHHHhccCCCCCce-ec------cHHHHHHHHHHHHhhhhhhhhceeE-ec-CCceEEEEEcC---Cccee Q lcl|Aclame:pro 67 
LTAEEIKFFNDIDKNVGGKDKFK-LL------PEETMVQVFDDLVAEHPLLKVINFK-NT-SLRLKALTAET---SGTAV 134 (377) Q Consensus 67 lt~~e~~~~~~~~~~~~~s~gg~-lv------P~~~~~~Ii~~~~~~s~l~~~~~v~-~~-~~~~~~p~~~~---~~~a~ 134 (377) +++- ..-.+..+|+. .| |+-+-+.|.+.++..-..-.+.+.. .. ++-+.+-.... ..++. T Consensus 1 ~~~~--------~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e 72 (318) T protein:vir:10 1 MTAP--------TGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVA 72 (318) T ss_pred CCCC--------CcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHh Confidence 0000 00011122222 11 5555556666554443322233332 22 22334433221 23444 Q ss_pred eecccccccccccccceeEee-cceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeec Q lcl|Aclame:pro 135 WGDIFGEIKGQLKQAFKEQDF-SQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKD 213 (377) Q Consensus 135 w~~e~~~~~~~~~~~f~~i~l-~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~ 213 (377) -+.|.++.+ ...+.++.-.+ ..+|.+.-+.||+|++..+..+..+-....+++.|++..|+..+. .|.. T Consensus 73 ~VaEggEiP-~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~d---------al~s 142 (318) T protein:vir:10 73 DVAEFGEIP-VSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKA---------LLQS 142 (318) T ss_pred hccCccccc-ccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHH---------HHhc Confidence 456667765 55677877777 557999999999999999999999999999999999999986653 1111 Q ss_pred cccccccccccccc-cccc-hhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc Q lcl|Aclame:pro 214 LSQPTVDQSTGRDI-TTYK-TDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT 291 (377) Q Consensus 214 ~~~~~~~~~~~~~~-~~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 291 (377) ......++..+... .... ....+......+.+.... .. .........+.- -.++|||.+...+..... 
T Consensus 143 a~t~~~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~---a~------~~~~~~~~GY~p-dtIVlhP~~~~~l~~n~~ 212 (318) T protein:vir:10 143 PIVPTLAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYP---AG------VGSSDEYFGFIP-DTIVMHYALLPILMDNEN 212 (318) T ss_pred cccccccCCcCCCCcccccccchhhhhhhhhhhhhhhh---hh------hhhhhhccCccc-eeeEECHHHHHHHhcchh Confidence 11111111111000 0000 001111111111110000 00 000001111111 257889988665532211 Q ss_pred -----ccCCC-----Ccccccc---CCCceEEecCCCCcceEEEEecccE-EEEecceeeEEeec-h---hhhhcC-cEE Q lcl|Aclame:pro 292 -----SRNQF-----GEYVTVL---PHGITILESLAVETGKAIAFVANRY-DAFMATASTIEEYD-Q---TFAMED-LQL 352 (377) Q Consensus 292 -----~~~~~-----G~~~~~l---~~~~~v~~s~~~~~~~ii~gd~s~y-~~~~~~~~~i~~~~-~---~~f~~~-~~~ 352 (377) ..+.+ ..|...+ .+|+.|+.|+.+|.+++++-+-... ++.+...++..... | ..-.++ .-. T Consensus 213 ~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~ 292 (318) T protein:vir:10 213 FMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYR 292 (318) T ss_pred hhhhhhccchhhhhcccccccccceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhh Confidence 11111 1122222 3688999999999999887775543 34566666655432 1 111122 223 Q ss_pred EEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 353 ~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .|+.++---.+.+|+|...||==.. 
T Consensus 293 ~~~~~~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 293 ADASHKRALAVDQPKAALWLTGIVT 317 (318) T ss_pred eehheeeeeeeeCcceeEEEeeccC Confidence 4455555667888888887762222 No 155 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.73 E-value=2.4e-05 Score=45.93 Aligned_cols=276 Identities=10% Similarity=-0.035 Sum_probs=142.8 Q ss_pred ccCCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeE-ecC-C--ceEEEEEcCCcceeeecccccccccccccceeEe Q lcl|Aclame:pro 81 NVGGKDKFKLLP--EETMVQVFDDLVAEHPLLKVINFK-NTS-L--RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQD 154 (377) Q Consensus 81 ~~~~s~gg~lvP--~~~~~~Ii~~~~~~s~l~~~~~v~-~~~-~--~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~ 154 (377) -.++++|.+++- +.+.+.|++.+.+.-..|.++.+. +.+ + ...+......+.+.|....+...+..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 455555553332 244556666666665566655443 222 1 2356666667788887655443345556677777 Q ss_pred ecceeEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccc Q lcl|Aclame:pro 155 FSQFKLTAFVVIPKDALKFG---PKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYK 231 (377) Q Consensus 155 l~~~k~~~~~~iS~ell~ds---~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 231 (377) ...+.++.-+.++..=|+.+ ..++..--....+++++..+|+.+++|+...+-.|++|................... 
T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~ 160 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSK 160 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccccc Confidence 78888887777776655433 567888888999999999999999999988788999998765443322211111100 Q ss_pred hhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCc----ccc-ccCCC Q lcl|Aclame:pro 232 TDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGE----YVT-VLPHG 306 (377) Q Consensus 232 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~----~~~-~l~~~ 306 (377) + ...++...++.+..+...+.... ........++++|..+..+...+. .+..|. ++. ..+ + T Consensus 161 -w-------~~~t~~ei~~di~~~~~~l~~~s----~g~~~p~~L~L~p~~~~~L~~~~~-~~~~~~tvl~~l~~~~~-~ 226 (301) T protein:vir:80 161 -W-------EKKTAEQIIDEIGEAHTKITVLP----GYGTASLKLCLPPKQFELINKKRY-SNEDSRSVLKVLQDNAW-F 226 (301) T ss_pred -c-------ccCCHHHHHHHHHHHHHHHHHhc----CceecccEEEecHHHHHhhhhccc-cCCCCeeHHHHHHHHcC-c Confidence 0 11233333333333333222111 112344678889987766542221 122221 111 111 2 Q ss_pred ceEEecCCCC----cce--EEEEecc--cEEEEecceeeEEeechhhhhcCc-EEEEEEEEEcC-EEecccceEEEEeec Q lcl|Aclame:pro 307 ITILESLAVE----TGK--AIAFVAN--RYDAFMATASTIEEYDQTFAMEDL-QLYLTKNYFYG-KAKDNHTAALLTLAG 376 (377) Q Consensus 307 ~~v~~s~~~~----~~~--ii~gd~s--~y~~~~~~~~~i~~~~~~~f~~~~-~~~~~~~r~dg-~~~~~~af~~l~~~a 376 (377) ..++..+.+. .++ +++..-+ ...+.....+.... -+. ++. 
.....+.|+.| .+..|.|+++++ T Consensus 227 ~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~-~e~---~~~~~~~~~~~r~~Gv~i~~P~ai~~~~--- 299 (301) T protein:vir:80 227 SAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHP-EEY---SFPRTKVPFEERTAGVVVRFPAAIVRVD--- 299 (301) T ss_pred ceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCceeeec-cee---cCceeEeeeeeeeEEEEEEccceEEEEe--- Confidence 2333322221 111 2222221 22233323332211 111 222 12234666654 677788888775 Q ss_pred C Q lcl|Aclame:pro 377 G 377 (377) Q Consensus 377 ~ 377 (377) | T Consensus 300 G 300 (301) T protein:vir:80 300 G 300 (301) T ss_pred c Confidence 4 No 156 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.70 E-value=1.7e-06 Score=52.23 Aligned_cols=246 Identities=13% Similarity=0.087 Sum_probs=126.1 Q ss_pred hcc-ccccccHHHHHHHHHHHhccCCCCCceeccHH-HHHHHHHHHHhhhhhhhhceeEecC--CceEEEEEcCCcceee Q lcl|Aclame:pro 60 LRD-KNRELTAEEIKFFNDIDKNVGGKDKFKLLPEE-TMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAVW 135 (377) Q Consensus 60 ~~~-~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~-~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~~~p~~~~~~~a~w 135 (377) +.. ....+|-.|... . +-|.. +...|+|.+.+.++|+..+.+.... 
......+.++-+.+.| T Consensus 1 m~~~~~~~~TL~e~Ak-------~-------~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~f 66 (331) T protein:vir:98 1 MPTLSTTNPTLADVAA-------R-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW 66 (331) T ss_pred CCccccCcccHHHHHH-------h-------cCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchh Confidence 110 001111111110 0 11322 3567999999999999999988643 3345678888899999 Q ss_pred ecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHH---HHHHHHHHHHHHHHhhcceeeccCCCcceee-- Q lcl|Aclame:pro 136 GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLK---QFITEQLKEAIAVALELAIVKGNGLLQPVGL-- 210 (377) Q Consensus 136 ~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~---~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gi-- 210 (377) ..-+...+ .+++++.+++-...-+.+.+.|.+.+.+... +.. +.-.+.+.++++..+...||+|+-+..|.++ T Consensus 67 R~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~G 144 (331) T protein:vir:98 67 RKLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMG 144 (331) T ss_pred hccCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhcc Confidence 87666664 6778999999999999999999999888643 344 4455668899999999999999976666544 Q ss_pred -eecc---cc----cc-ccccccccccccc-----------------------hhhhhhhhhhccChHHH---------- Q lcl|Aclame:pro 211 -LKDL---SQ----PT-VDQSTGRDITTYK-----------------------TDKEAIADLSDLDPDTA---------- 248 (377) Q Consensus 211 -l~~~---~~----~~-~~~~~~~~~~~~~-----------------------~~~~~~~~l~~~~~~~~---------- 248 (377) -+.. +. .. .+.+++...+..+ .+.. 
...+..+++..| T Consensus 145 L~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g-~~~~~~~~G~~y~~y~~~~~w~ 223 (331) T protein:vir:98 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLG-EDTLIDAAGGRYQGYRTHYKWD 223 (331) T ss_pred chhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecC-ceeeecCCCCeeeEEEEEEEee Confidence 2211 11 01 1111111111100 0000 000001000000 Q ss_pred --------------HH-----------HHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCC-------- Q lcl|Aclame:pro 249 --------------VE-----------LLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ-------- 295 (377) Q Consensus 249 --------------~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~-------- 295 (377) .+ ...++..........-|....++.+|+||..-.-.+..+...++. T Consensus 224 ~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~ 303 (331) T protein:vir:98 224 IGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEE 303 (331) T ss_pred eeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeee Confidence 00 001111112222233345567888899997654434333222211 Q ss_pred -CCccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEE Q lcl|Aclame:pro 296 -FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) Q Consensus 296 -~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~ 352 (377) .|..++.+ .|+||...+++--. 
+..++ T Consensus 304 ~~g~~~t~~-~gipir~~dai~~t-----------------------------E~~Vv 331 (331) T protein:vir:98 304 IAGKKVVAF-DGIPCRRTDALLLT-----------------------------EARVV 331 (331) T ss_pred cCCcceeEE-CCeeEEEeeeeecC-----------------------------ccccC Confidence 12222222 24454444433211 11111 No 157 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.70 E-value=1.7e-06 Score=52.23 Aligned_cols=246 Identities=13% Similarity=0.087 Sum_probs=126.1 Q ss_pred hcc-ccccccHHHHHHHHHHHhccCCCCCceeccHH-HHHHHHHHHHhhhhhhhhceeEecC--CceEEEEEcCCcceee Q lcl|Aclame:pro 60 LRD-KNRELTAEEIKFFNDIDKNVGGKDKFKLLPEE-TMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAVW 135 (377) Q Consensus 60 ~~~-~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~-~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~~~p~~~~~~~a~w 135 (377) +.. ....+|-.|... . +-|.. +...|+|.+.+.++|+..+.+.... ......+.++-+.+.| T Consensus 1 m~~~~~~~~TL~e~Ak-------~-------~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~f 66 (331) T protein:vir:10 1 MPTLSTTNPTLADVAA-------R-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW 66 (331) T ss_pred CCccccCcccHHHHHH-------h-------cCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchh Confidence 110 001111111110 0 11322 3567999999999999999988643 3345678888899999 Q ss_pred ecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHH---HHHHHHHHHHHHHHhhcceeeccCCCcceee-- Q lcl|Aclame:pro 136 GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLK---QFITEQLKEAIAVALELAIVKGNGLLQPVGL-- 210 (377) Q Consensus 136 ~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~---~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gi-- 210 (377) ..-+...+ .+++++.+++-...-+.+.+.|.+.+.+... +.. 
+.-.+.+.++++..+...||+|+-+..|.++ T Consensus 67 R~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~G 144 (331) T protein:vir:10 67 RKLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMG 144 (331) T ss_pred hccCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhcc Confidence 87666664 6778999999999999999999999888643 344 4455668899999999999999976666544 Q ss_pred -eecc---cc----cc-ccccccccccccc-----------------------hhhhhhhhhhccChHHH---------- Q lcl|Aclame:pro 211 -LKDL---SQ----PT-VDQSTGRDITTYK-----------------------TDKEAIADLSDLDPDTA---------- 248 (377) Q Consensus 211 -l~~~---~~----~~-~~~~~~~~~~~~~-----------------------~~~~~~~~l~~~~~~~~---------- 248 (377) -+.. +. .. .+.+++...+..+ .+.. ...+..+++..| T Consensus 145 L~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g-~~~~~~~~G~~y~~y~~~~~w~ 223 (331) T protein:vir:10 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLG-EDTLIDAAGGRYQGYRTHYKWD 223 (331) T ss_pred chhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecC-ceeeecCCCCeeeEEEEEEEee Confidence 2211 11 01 1111111111100 0000 000001000000 Q ss_pred --------------HH-----------HHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCC-------- Q lcl|Aclame:pro 249 --------------VE-----------LLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ-------- 295 (377) Q Consensus 249 --------------~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~-------- 295 (377) .+ ...++..........-|....++.+|+||..-.-.+..+...++. 
T Consensus 224 ~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~ 303 (331) T protein:vir:10 224 IGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEE 303 (331) T ss_pred eeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeee Confidence 00 001111112222233345567888899997654434333222211 Q ss_pred -CCccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEE Q lcl|Aclame:pro 296 -FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) Q Consensus 296 -~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~ 352 (377) .|..++.+ .|+||...+++--. +..++ T Consensus 304 ~~g~~~t~~-~gipir~~dai~~t-----------------------------E~~Vv 331 (331) T protein:vir:10 304 IAGKKVVAF-DGIPCRRTDALLLT-----------------------------EARVV 331 (331) T ss_pred cCCcceeEE-CCeeEEEeeeeecC-----------------------------ccccC Confidence 12222222 24454444433211 11111 No 158 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.70 E-value=1.7e-06 Score=52.23 Aligned_cols=246 Identities=13% Similarity=0.087 Sum_probs=126.1 Q ss_pred hcc-ccccccHHHHHHHHHHHhccCCCCCceeccHH-HHHHHHHHHHhhhhhhhhceeEecC--CceEEEEEcCCcceee Q lcl|Aclame:pro 60 LRD-KNRELTAEEIKFFNDIDKNVGGKDKFKLLPEE-TMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAVW 135 (377) Q Consensus 60 ~~~-~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~-~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~~~p~~~~~~~a~w 135 (377) +.. ....+|-.|... . +-|.. +...|+|.+.+.++|+..+.+.... 
......+.++-+.+.| T Consensus 1 m~~~~~~~~TL~e~Ak-------~-------~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~f 66 (331) T protein:vir:10 1 MPTLSTTNPTLADVAA-------R-------MTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTW 66 (331) T ss_pred CCccccCcccHHHHHH-------h-------cCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchh Confidence 110 001111111110 0 11322 3567999999999999999988643 3345678888899999 Q ss_pred ecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHHH---HHHHHHHHHHHHHHhhcceeeccCCCcceee-- Q lcl|Aclame:pro 136 GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLK---QFITEQLKEAIAVALELAIVKGNGLLQPVGL-- 210 (377) Q Consensus 136 ~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~---~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gi-- 210 (377) ..-+...+ .+++++.+++-...-+.+.+.|.+.+.+... +.. +.-.+.+.++++..+...||+|+-+..|.++ T Consensus 67 R~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~G 144 (331) T protein:vir:10 67 RKLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMG 144 (331) T ss_pred hccCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhcc Confidence 87666664 6778999999999999999999999888643 344 4455668899999999999999976666544 Q ss_pred -eecc---cc----cc-ccccccccccccc-----------------------hhhhhhhhhhccChHHH---------- Q lcl|Aclame:pro 211 -LKDL---SQ----PT-VDQSTGRDITTYK-----------------------TDKEAIADLSDLDPDTA---------- 248 (377) Q Consensus 211 -l~~~---~~----~~-~~~~~~~~~~~~~-----------------------~~~~~~~~l~~~~~~~~---------- 248 (377) -+.. +. .. .+.+++...+..+ .+.. 
...+..+++..| T Consensus 145 L~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g-~~~~~~~~G~~y~~y~~~~~w~ 223 (331) T protein:vir:10 145 LTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLG-EDTLIDAAGGRYQGYRTHYKWD 223 (331) T ss_pred chhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecC-ceeeecCCCCeeeEEEEEEEee Confidence 2211 11 01 1111111111100 0000 000001000000 Q ss_pred --------------HH-----------HHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCC-------- Q lcl|Aclame:pro 249 --------------VE-----------LLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQ-------- 295 (377) Q Consensus 249 --------------~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~-------- 295 (377) .+ ...++..........-|....++.+|+||..-.-.+..+...++. T Consensus 224 ~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~ 303 (331) T protein:vir:10 224 IGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEE 303 (331) T ss_pred eeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeee Confidence 00 001111112222233345567888899997654434333222211 Q ss_pred -CCccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEE Q lcl|Aclame:pro 296 -FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) Q Consensus 296 -~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~ 352 (377) .|..++.+ .|+||...+++--. 
+..++ T Consensus 304 ~~g~~~t~~-~gipir~~dai~~t-----------------------------E~~Vv 331 (331) T protein:vir:10 304 IAGKKVVAF-DGIPCRRTDALLLT-----------------------------EARVV 331 (331) T ss_pred cCCcceeEE-CCeeEEEeeeeecC-----------------------------ccccC Confidence 12222222 24454444433211 11111 No 159 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=97.61 E-value=4e-06 Score=50.20 Aligned_cols=241 Identities=14% Similarity=0.179 Sum_probs=130.3 Q ss_pred hc-cccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec--CCceEEEEEcCCcceeee Q lcl|Aclame:pro 60 LR-DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT--SLRLKALTAETSGTAVWG 136 (377) Q Consensus 60 ~~-~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~--~~~~~~p~~~~~~~a~w~ 136 (377) +. .....+|-.|.. ..+-|......|+|.+.+.++|++.+++... +......+.++-|.+.|. T Consensus 1 m~~~~~~a~TL~e~A--------------Kr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR 66 (330) T protein:vir:10 1 MATLSTNNPTMADVA--------------KRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWR 66 (330) T ss_pred CCcCCCCcccHHHHH--------------hhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhh Confidence 11 011112222211 1134455677899999999999999888743 223345667788899998 Q ss_pred cccccccccccccceeEeecceeEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHhhcceeeccCCCcce---eee Q lcl|Aclame:pro 137 DIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFITEQLKEAIAVALELAIVKGNGLLQPV---GLL 211 (377) Q Consensus 137 ~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~---Gil 211 (377) .-+...+ +++.++.+++-..+-+.+.+.|-+.+.+.+. -++...-.+...+++.+.+...||||+-...|. 
|+- T Consensus 67 ~lN~g~~-~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~ 145 (330) T protein:vir:10 67 KLYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) T ss_pred hcCCccc-cccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchh Confidence 7776664 5679999999999999999999999987642 223444566789999999999999998766665 443 Q ss_pred eccc---c----cccc-cccccccccc-------------ch-hhh------hhh--hhh-------------------- Q lcl|Aclame:pro 212 KDLS---Q----PTVD-QSTGRDITTY-------------KT-DKE------AIA--DLS-------------------- 241 (377) Q Consensus 212 ~~~~---~----~~~~-~~~~~~~~~~-------------~~-~~~------~~~--~l~-------------------- 241 (377) +... . ...+ .+++...+.. ++ ..+ ..+ ++. T Consensus 146 kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~ 225 (330) T protein:vir:10 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) T ss_pred hhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeee Confidence 3221 1 0011 1111111000 00 000 000 010 Q ss_pred -----------------------ccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccC---- Q lcl|Aclame:pro 242 -----------------------DLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN---- 294 (377) Q Consensus 242 -----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~---- 294 (377) .+....+++.+ ....+.-|....++.+|+||..-.-.+..+...++ T Consensus 226 Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm-------~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l 298 (330) T protein:vir:10 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKYM-------IMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNL 298 (330) T ss_pred eeEEeCcccEEEEeecccccCCCCccHHHHHHHH-------HHHHHhccCCCCCcceeeechHHHHHHHHHHhhccccee Confidence 11111222222 12223445666788999999865544433332222 Q ss_pred ----CCCccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEE Q lcl|Aclame:pro 295 ----QFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 
(377) Q Consensus 295 ----~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~ 352 (377) ..|..++.+ .|+||...+++-... ..++ T Consensus 299 ~~~~~~g~~~t~~-~gipir~~Dail~tE-----------------------------~~vv 330 (330) T protein:vir:10 299 TWETVSGERVMTF-DGIPVQRTDALLNTE-----------------------------SRVV 330 (330) T ss_pred eeeecCCeeeEEE-CCeEEEEEeeeecCc-----------------------------cccC Confidence 223333322 355555554442211 1111 No 160 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=97.48 E-value=4e-05 Score=44.71 Aligned_cols=262 Identities=7% Similarity=-0.078 Sum_probs=127.9 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhc---------eeEe--cCCc-eEEEEEcCC-cceeeeccccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVI---------NFKN--TSLR-LKALTAETS-GTAVWGDIFGEIKGQ 145 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~---------~v~~--~~~~-~~~p~~~~~-~~a~w~~e~~~~~~~ 145 (377) +. ++.-.-.++|+-|.+-+.+.+.+.+.+++.. .... .+|+ +.+|....- +++.-+.+.+++. . 
T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~-~ 77 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV-P 77 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc-h Confidence 22 3334567889988888877777776664422 1221 2343 688877542 4554444455544 3 Q ss_pred ccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccc Q lcl|Aclame:pro 146 LKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGR 225 (377) Q Consensus 146 ~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~ 225 (377) .+.+.++-.-..++.+.-..++.+-..-+..|....+.+.+++.+++..++.+|.-= .|++.............+ T Consensus 78 ~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l-----~g~~~~~~~~~~~~dvsa 152 (324) T protein:vir:59 78 QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAEL-----AGVFSNDDMKDNKLDISG 152 (324) T ss_pred hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhhhccccccceeeeec Confidence 344444444444444444456665555566677888999999999999888766310 111111000000000000 Q ss_pred cccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc---cccCCCCccccc Q lcl|Aclame:pro 226 DITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF---TSRNQFGEYVTV 302 (377) Q Consensus 226 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~---~~~~~~G~~~~~ 302 (377) ... +..+...+.+.+ .+.++. ...-.+|+|||.++..+...- ..+..+|..... 
T Consensus 153 ~~~------------~~~s~~~l~~A~-------~~~GD~----~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~ 209 (324) T protein:vir:59 153 TAD------------GIYSAETFVDAS-------YKLGDH----ESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFP 209 (324) T ss_pred ccc------------ceecHHHHHHHH-------HHhCCc----ccCcEEEEEchHHHHHHHHhhhhhhccccccCceee Confidence 000 001111111111 111111 123357999999988876321 112222221111 Q ss_pred cCCCceEEecCCCCcce----------EEEEecccEEEEe-cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEE Q lcl|Aclame:pro 303 LPHGITILESLAVETGK----------AIAFVANRYDAFM-ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAAL 371 (377) Q Consensus 303 l~~~~~v~~s~~~~~~~----------ii~gd~s~y~~~~-~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~ 371 (377) -..|++|+.++.||... .+|+.-. ..+.. +.++.++..++. ..++..+....++ +++|..+.. T Consensus 210 ~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~~GA-i~~~~~~~~v~vE~dRd~--~~g~~~l~~r~~~---~~~p~G~s~ 283 (324) T protein:vir:59 210 TYMNKRVIVDDSMPVETLEDGTKVFTSYLFGAGA-LGYAEGQPEVPTETARNA--LGSQDILINRKHF---VLHPRGVKF 283 (324) T ss_pred eecccEEEEeCCCCccccCCCCceEEEEEEecCe-EEEeecCCCcceecccCc--cccceEEEEeeEE---EeEeeeEEe Confidence 12578999999988421 2333222 11111 223444444443 3555566666665 566666666 Q ss_pred EEee-cC Q lcl|Aclame:pro 372 LTLA-GG 377 (377) Q Consensus 372 l~~~-a~ 377 (377) -.-+ +| T Consensus 284 ~~~~~~~ 290 (324) T protein:vir:59 284 TENAMAG 290 (324) T ss_pred cccccCC Confidence 4433 23 No 161 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=97.22 E-value=2.8e-05 Score=45.56 Aligned_cols=340 Identities=17% Similarity=0.194 Sum_probs=143.6 Q ss_pred CCccHHHHHHHHHHHHHHHH---HHH-------------hccCHHHHHHHHHHHHHHHH---HH-----------HHH-H Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSA---KIS-------------AGATPEEQEKLFEAAFTTMG---DE-----------ILA-K 49 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~---~~~-------------~~~~~~~~~~~~~~~~~~~~---~~-----------~~~-~ 49 (377) |.+ -+|.+....+.++.+ .++ 
+.....+..+.+.+..-++. .+ .+. + T Consensus 1 mnk--pdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN~~eE~~KGK~kMt~ 78 (393) T protein:vir:16 1 MNK--PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTN 78 (393) T ss_pred CCC--cchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhcchhhHHHHH Confidence 433 233222222222222 111 11111111111111100000 00 000 0 Q ss_pred ---HHHHHHHHHHhccccccccHHHHHHHHHHHhccC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eE Q lcl|Aclame:pro 50 ---NEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LK 123 (377) Q Consensus 50 ---~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~~-~~ 123 (377) .+.+....+.....+. -+.+-++++.+-..+.+ .++-...+|..+.-.|-..+..+.|+++...|.+.+.- ++ T Consensus 79 ~iesq~A~~eF~~vL~~N~-G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~ 157 (393) T protein:vir:16 79 FIESQNAVTEFFDVLKKNS-GKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS 157 (393) T ss_pred HHhhHHHHHHHHHHHhccC-CchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHH Confidence 0001111111111112 22355666665544333 36777899999999999999999999997776655532 22 Q ss_pred EEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhh-HHHH---hcCHHHHHHHHHHHHHHHHH-HHhhcce Q lcl|Aclame:pro 124 ALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIP-KDAL---KFGPKWLKQFITEQLKEAIA-VALELAI 198 (377) Q Consensus 124 ~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS-~ell---~ds~~~~~~~l~~~la~~~a-~~~~~a~ 198 (377) ....+.. .|. +...|..+.+...+|..-++.+-. .++..| -++. .+|-..+-.||..+|+++|. 
+..+.|+ T Consensus 158 ~s~~s~~-eAq-~HkdGqTK~eqa~~~~~~Tl~~~~--VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~Al 233 (393) T protein:vir:16 158 RSFDSAN-EAQ-VHKDGQTKTEQAATLTIDTLEPVM--VYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 233 (393) T ss_pred hhhhhhh-hhh-hhccCCccccceeeeeeechhHHH--HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 2223222 333 334566666555566655555543 333333 2333 34444568999999999999 8999999 Q ss_pred eeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 199 l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) +-|+|++....+-+..........+... ...+.-.++ ..+....++ -..+.|+.++++ T Consensus 234 V~GDG~N~f~~~DK~advK~I~k~Ttka--------ksagktpfa---daieeavdf-----------vrptagrryliv 291 (393) T protein:vir:16 234 VEGDGTNGFKSIDKEADVKKIKKITTKA--------KSAGKTPFA---DAIEEAVDF-----------VRPTAGRRYLIV 291 (393) T ss_pred heecCCCCccchhhHHHHHHHHHHhhhh--------hhcCCCchh---HHHHHHHhh-----------hccCCCceEEEE Confidence 9999998655553322211111111000 000000111 112222222 123456677777 Q ss_pred ccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceEEEEeccc-----------EEEEecceeeEEeechhhhh Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR-----------YDAFMATASTIEEYDQTFAM 347 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~-----------y~~~~~~~~~i~~~~~~~f~ 347 (377) ...+.-.++..+..-+.+.... +-..... ....+..+.+++.--|+ |.+.+ +++. .-+.--|. 
T Consensus 292 ktedrkalldelrqatananvr-iknddte--iasevgvdeiivytgskalkptvlvdqkyhidm-qdlt--kvdafewk 365 (393) T protein:vir:16 292 KTEDRKALLDELRQATANANVR-IKNDDTE--IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDM-QDLT--KVDAFEWK 365 (393) T ss_pred eccchHHHHHHHHhhhccCcee-eeccchh--hhhhcCcceeeeeeccccccceeeeccccccch-hhhh--hhhhheec Confidence 6665443332221111221110 0000000 00011112222221111 22211 1111 11111133 Q ss_pred cCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 348 EDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 348 ~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) .+..-+..-..--|-+-...|=++++++ T Consensus 366 tnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 366 TNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred cCCceEEEeecccCcceeeccceeEeeC Confidence 3333333444445656666666777777 No 162 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=97.06 E-value=5.5e-05 Score=43.95 Aligned_cols=342 Identities=17% Similarity=0.201 Sum_probs=142.4 Q ss_pred CCc-----cHHHHHHHHHH---HHHHHHHHHhcc-------------CHHHHHHHHHHHH----------HHHHHHHHH- Q lcl|Aclame:pro 1 MAI-----NLKELPKYREA---VAELSAKISAGA-------------TPEEQEKLFEAAF----------TTMGDEILA- 48 (377) Q Consensus 1 m~~-----~~~~l~~~~~~---~~~~~~~~~~~~-------------~~~~~~~~~~~~~----------~~~~~~~~~- 48 (377) |.+ +--+++++-+. .++....++..+ ...+..+.+.+.. +.+.+..+. 
T Consensus 1 mriS~~~~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhh Confidence 222 11122222111 122222222111 0111111111111 011000000 Q ss_pred -HH------HHHHHHHHHhccccccccHHHHHHHHHHHhccC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|Aclame:pro 49 -KN------EEEMERMFDLRDKNRELTAEEIKFFNDIDKNVG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS 119 (377) Q Consensus 49 -~~------~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~ 119 (377) ++ ++.....+.....+. -+.+-++++.+-....+ .++-...+|..+.-.|-..+..+.|+++...|.+.+ T Consensus 81 ~kMt~~i~sq~A~~eF~~vL~~N~-G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~ 159 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVLKKNS-GKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) T ss_pred HHHHHHHhhHHHHHHHHHHHhccC-CchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccch Confidence 00 001111111111111 22355666655444333 367778999999999999999999999977776555 Q ss_pred Cc-eEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhh-HHHH---hcCHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 120 LR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIP-KDAL---KFGPKWLKQFITEQLKEAIA-VA 193 (377) Q Consensus 120 ~~-~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS-~ell---~ds~~~~~~~l~~~la~~~a-~~ 193 (377) .- ++....+. ..|. +..+|..+.+...+|..-++.+-.+ ++..| -++. .+|-..+-.||..+|+++|. +. 
T Consensus 160 ~~~V~~s~~s~-~~Aq-~HkdGqTK~eqa~~~~~~Tl~~~~V--Y~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~ 235 (400) T protein:vir:93 160 ALLVSRSFDSA-NEAQ-VHKDGQTKTEQAATLTIDTLEPVMV--YKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 235 (400) T ss_pred hhhHHhhhhhh-hhhh-hhccCCccccceeeeeeechhHHHH--HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Confidence 32 22222222 2333 3345666665556666666655433 33333 2333 34444568999999999999 88 Q ss_pred hhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCc Q lcl|Aclame:pro 194 LELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQ 273 (377) Q Consensus 194 ~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (377) .+.+++-|+|++....+-+..........+... ...+.-.++ ..+....++ -..+.|+ T Consensus 236 Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttka--------ksagktpfa---daieeavdf-----------vrptagr 293 (400) T protein:vir:93 236 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKA--------KSAGKTPFA---DAIEEAVDF-----------VRPTAGR 293 (400) T ss_pred HHhhhheecCCCCccchhhHHHHHHHHHHhhhh--------hhcCCCchh---HHHHHHHhh-----------hccCCCc Confidence 999999999998655553322211111111000 000000111 111222222 1234566 Q ss_pred eEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceEEEEeccc-----------EEEEecceeeEEeec Q lcl|Aclame:pro 274 VKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR-----------YDAFMATASTIEEYD 342 (377) Q Consensus 274 ~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~-----------y~~~~~~~~~i~~~~ 342 (377) .++++...|.-.++..+..-+.+.. +..-... --....+..+.+++.--|+ |.+.+ +++. 
.-+ T Consensus 294 rylivktedrkalldelrqatanah-vrikndd--aeiasevgvdeiivytgskalkptvlvdqkyhidm-qdlt--kvd 367 (400) T protein:vir:93 294 RYLIVKTEDRKALLDELRQATANAH-VRIKNDD--AEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDM-QDLT--KVD 367 (400) T ss_pred eEEEEeccchHHHHHHHHhhccccc-eEeecch--hhhhhhcCcceeeeeeccccccceeeeccccccch-hhhh--hhh Confidence 7777766654433322211112211 0000000 0000001112222221111 22211 1111 111 Q ss_pred hhhhhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 343 QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 343 ~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) .--|..+..-+..-..--|-+-...|=++++++ T Consensus 368 afewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 368 AFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hheeccCCceEEEeecccCcceeeccceeEeeC Confidence 111333333333444445666666666777777 No 163 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=96.87 E-value=4.7e-05 Score=44.31 Aligned_cols=244 Identities=12% Similarity=0.076 Sum_probs=125.0 Q ss_pred HHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec--CCceEEEEEcCCccee Q lcl|Aclame:pro 57 MFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT--SLRLKALTAETSGTAV 134 (377) Q Consensus 57 ~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~--~~~~~~p~~~~~~~a~ 134 (377) +... ....+|-.|... .+-|......|+|.+.+.++|++.+.+... +......+.++-|.+. 
T Consensus 1 m~~~--~~~a~TL~E~Ak--------------r~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~ 64 (335) T protein:vir:73 1 MALI--GQTLPSLLDIYN--------------RTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPV 64 (335) T ss_pred CCcC--CCCchhHHHHHh--------------hcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCch Confidence 0000 011122222110 122344566799999999999999888743 2223456677888999 Q ss_pred eecccccccccccccceeEeecceeEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHhhcceeeccCCCcce---e Q lcl|Aclame:pro 135 WGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFITEQLKEAIAVALELAIVKGNGLLQPV---G 209 (377) Q Consensus 135 w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~---G 209 (377) |..-+...+ +++.++.+++-..+-+.+.+.|-+.|.+.+. -++.+.-.+...+++.+.+...||||+-+..|. | T Consensus 65 fR~lN~g~~-~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdG 143 (335) T protein:vir:73 65 WRRYNQGVQ-PTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMG 143 (335) T ss_pred hhhcCCccc-cccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccc Confidence 987666664 5679999999999999999999998886543 223455556688999999999999998766665 4 Q ss_pred eeecc---cccc-------cc-ccccccccc-------------cc----------hhh------hhhh----------- Q lcl|Aclame:pro 210 LLKDL---SQPT-------VD-QSTGRDITT-------------YK----------TDK------EAIA----------- 238 (377) Q Consensus 210 il~~~---~~~~-------~~-~~~~~~~~~-------------~~----------~~~------~~~~----------- 238 (377) +-+.. +... .+ .+++...+. .+ .|. 
++.+ T Consensus 144 L~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~ 223 (335) T protein:vir:73 144 LAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFK 223 (335) T ss_pred hhhhhcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeee Confidence 42221 1100 11 111111100 00 000 0000 Q ss_pred -------------------hhhc----c-ChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccC Q lcl|Aclame:pro 239 -------------------DLSD----L-DPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN 294 (377) Q Consensus 239 -------------------~l~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~ 294 (377) .+.. + ....+++.|...+ .....|....++.+|+||..-.-.+..+...++ T Consensus 224 w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~-----~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~ 298 (335) T protein:vir:73 224 WDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAY-----YARDVAMLGDGKEVIYANKTIHAWLHKQAMNAK 298 (335) T ss_pred eeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHH-----HHHhccCCCCCceEEEechHHHHHHHHHHhccC Confidence 0000 0 0011112221111 011224445677899999754433333322221 Q ss_pred --------CCCccccccCCCceEEecCCCCcce-EEEE Q lcl|Aclame:pro 295 --------QFGEYVTVLPHGITILESLAVETGK-AIAF 323 (377) Q Consensus 295 --------~~G~~~~~l~~~~~v~~s~~~~~~~-ii~g 323 (377) ..|..++.+ .|+||...+++--.. .+.. 
T Consensus 299 n~~l~~~~~~g~~~t~~-~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 299 NVNLTIEEYGGKKIVSF-LGIPIRRVDAILNTESAVTA 335 (335) T ss_pred ceeeeeeccCCceeEEE-CCeEEEEEeeeecCcccccC Confidence 123333322 255555544432111 1111 No 164 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=96.72 E-value=0.00039 Score=39.30 Aligned_cols=269 Identities=7% Similarity=-0.076 Sum_probs=124.2 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe---------cCC-ceEEEEEcCC-cceeeeccc-ccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN---------TSL-RLKALTAETS-GTAVWGDIF-GEIKGQL 146 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~---------~~~-~~~~p~~~~~-~~a~w~~e~-~~~~~~~ 146 (377) +....+.-...++|+-|.+-+.+...+.+.|++..-+.+ .+| .+++|....- +++.-+.+. ..+. .. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~-~~ 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALE-TG 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccc-hh Confidence 443344455678999888877777766666655322222 234 3688976532 444322222 2333 22 Q ss_pred cccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccccc Q lcl|Aclame:pro 147 KQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRD 226 (377) Q Consensus 147 ~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~ 226 (377) +.+-++-.-..++.+.-..++.+-..-+..|..+.+.+.+++...+..++.++. .-.|+++.............. 
T Consensus 80 ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla-----~l~gvf~~~~~~~~~~~~~~~ 154 (330) T protein:vir:10 80 KITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIA-----TLNGIFATGTAGEKGALEETH 154 (330) T ss_pred hcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHH-----HHHhhhhhhhcccchhhhhhh Confidence 333333333333344444555555555666778889999998888877776553 111333221111110000000 Q ss_pred ccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc-c--ccCC--CCcccc Q lcl|Aclame:pro 227 ITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF-T--SRNQ--FGEYVT 301 (377) Q Consensus 227 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~-~--~~~~--~G~~~~ 301 (377) .. ....+. ...+...+.+.. .+.++. ...-.+|+|||.++..++..- . .+.. ++.+.+ T Consensus 155 ~~---~~~~~~---a~~s~~~l~~A~-------~~~GD~----~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~ 217 (330) T protein:vir:10 155 VS---DQSKAS---TGIDAGMVLDAK-------QLLGDS----ADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIPT 217 (330) T ss_pred ee---cccccc---cccCHHHHHHHH-------HHhccc----cccceEEEEcHHHHHHHHHhhhhhhhcccccCccccc Confidence 00 000000 001111111111 111111 112357999999998876431 1 1122 223333 Q ss_pred ccCCCceEEecCCCCcce-----EEEEeccc-EEEEe-cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEE- Q lcl|Aclame:pro 302 VLPHGITILESLAVETGK-----AIAFVANR-YDAFM-ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLT- 373 (377) Q Consensus 302 ~l~~~~~v~~s~~~~~~~-----ii~gd~s~-y~~~~-~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~- 373 (377) . .|.+|+.++.+|... .+|+.-.- |.-+. ...+.+++.++. ..++..+....++ +++|..+..-. 
T Consensus 218 ~--~G~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~--~~g~~~l~~r~~~---~~hp~G~s~~~~ 290 (330) T protein:vir:10 218 Y--LGYRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREA--AKGNDMIYTRRAL---VMHPYGVKWTGA 290 (330) T ss_pred c--cceEEEEeCCCCCCCCceeEEEEecCceeeecccCCccccccccCCc--cccceEEEEeeEE---Eeeeeeeeeccc Confidence 2 478999999998422 23332221 11111 112334444443 2455455555553 56666666553 Q ss_pred -ee-cC Q lcl|Aclame:pro 374 -LA-GG 377 (377) Q Consensus 374 -~~-a~ 377 (377) ++ +| T Consensus 291 ~~~~~~ 296 (330) T protein:vir:10 291 EVDAGN 296 (330) T ss_pred ccccCc Confidence 22 33 No 165 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=96.68 E-value=0.00042 Score=39.11 Aligned_cols=299 Identities=15% Similarity=0.122 Sum_probs=136.8 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHH-------HhccCCCCCceecc--HHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 42 MGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDI-------DKNVGGKDKFKLLP--EETMVQVFDDLVAEHPLLKV 112 (377) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~-------~~~~~~s~gg~lvP--~~~~~~Ii~~~~~~s~l~~~ 112 (377) +.. ....+.+..++..+.... .....+..+.+++. +.+.+.|++.....-..+.+ T Consensus 1 ~~~----------------~~~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~ 64 (329) T protein:vir:79 1 MRG----------------NIMSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRV 64 (329) T ss_pred Ccc----------------chhhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhh Confidence 000 000111111111111000 00111111222322 33456677665555555555 Q ss_pred ceeEe-cC-C--ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcC---HHHHHHHHHHH Q lcl|Aclame:pro 113 INFKN-TS-L--RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFG---PKWLKQFITEQ 185 (377) Q Consensus 113 ~~v~~-~~-~--~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds---~~~~~~~l~~~ 185 (377) +.+.. 
.+ + ...+.+....+.+.|.+..+...+..+..+..-....+.++.-+.++..=|+-+ ..++..--... T Consensus 65 i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~a 144 (329) T protein:vir:79 65 FPVTSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANA 144 (329) T ss_pred cccccCCCCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHH Confidence 54432 22 1 235666666778888865443333445556666666677777677765544433 56788888899 Q ss_pred HHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 186 LKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKK 265 (377) Q Consensus 186 la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 265 (377) .+.+++..+|+-+++|++..+-.|+||.+...+........ ..+ ...++...++.+..+......... T Consensus 145 A~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~-~~w----------~~kt~~ei~~di~~~~~~l~~~s~- 212 (329) T protein:vir:79 145 AQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNN-AAG----------TGKKPETAQDELEQAIEKIETLTN- 212 (329) T ss_pred HHHHHHHhhccEEEeecccccceeeecCCCccccccCCCCC-ccc----------cccCHHHHHHHHHHHHHHHHHhcC- Confidence 99999999999999999887889999987764433221111 011 112333333333333333222111 Q ss_pred hhhcccCceEEEeccchhhhhcccccccCCCCc----cccccCCCceEEecCCCC----c--ceEEEEeccc-E-EEEec Q lcl|Aclame:pro 266 HPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGE----YVTVLPHGITILESLAVE----T--GKAIAFVANR-Y-DAFMA 333 (377) Q Consensus 266 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~----~~~~l~~~~~v~~s~~~~----~--~~ii~gd~s~-y-~~~~~ 333 (377) .......++++|..+..+.... +..|. ++.-...++.++..+.+. . +.+++.+.+. + .+... 
T Consensus 213 ---g~~~p~~L~Lpp~~~~~L~~~~---~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp 286 (329) T protein:vir:79 213 ---GQHRANMILIPPSMRKVLMVRM---PETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIP 286 (329) T ss_pred ---ceecccEEEecHHHHHHhhccc---CCCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecC Confidence 1223356778887654443211 12231 221111233333322221 1 1234444332 2 22222 Q ss_pred ceeeEEeechhhhhcCcEEEEEEEEEcC-EEecccceEEEE-eecC Q lcl|Aclame:pro 334 TASTIEEYDQTFAMEDLQLYLTKNYFYG-KAKDNHTAALLT-LAGG 377 (377) Q Consensus 334 ~~~~i~~~~~~~f~~~~~~~~~~~r~dg-~~~~~~af~~l~-~~a~ 377 (377) ..+.... .+.+-.. ..+....|+.| .+..|.|+++++ |--| T Consensus 287 ~~~~~l~-~q~~~~~--~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 287 EAFNMLT-AQPKDLH--FKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred cceeeee-ceecCce--EEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 3333211 1211111 12224555543 455667766654 2223 No 166 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=96.64 E-value=6.7e-05 Score=43.46 Aligned_cols=299 Identities=17% Similarity=0.211 Sum_probs=135.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccC--CCCCceeccHHHHHHHHHHHHhhhhhhhhcee Q lcl|Aclame:pro 38 AFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINF 115 (377) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v 115 (377) +-+-+.. ++.....+.....+.. 
+.+-++++++-..+.+ .++-...+|+.+...|-..+..+.|+++...| T Consensus 1 mtn~ies------q~A~~eF~~vL~~N~G-~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHV 73 (318) T protein:vir:86 1 MTNFIES------QNAVTEFFDVLKKNSG-KSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHV 73 (318) T ss_pred Ccchhhh------hHHHHHHHHHHhccCC-chhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeee Confidence 0000000 0011111111111111 2255666666544433 36777899999999999999999999997777 Q ss_pred EecCCc-eEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhh-HHHH---hcCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 116 KNTSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIP-KDAL---KFGPKWLKQFITEQLKEAI 190 (377) Q Consensus 116 ~~~~~~-~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS-~ell---~ds~~~~~~~l~~~la~~~ 190 (377) .+.+.- ++....+ ++.+.- ..+|..+.+...+|..-++.+-.+ ++.-| -++. .+|-..+-.||..+|+++| T Consensus 74 T~~~~~~V~~s~~s-~AeAq~-HkdGqTK~eqa~~~~~~Tl~~~~V--Y~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~ 149 (318) T protein:vir:86 74 TNVGALLVSRSFDS-SAEAQV-HKDGQTKTEQAATLTIDTLEPVMV--YKLQSLAERVKRLQMSYSELYNLIVAELTQAI 149 (318) T ss_pred ccchhhhhhhhhhh-hhhhhh-hccCCccccceeeeeeechhHHHH--HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Confidence 655532 2222222 234443 346666665555666656555433 33333 2333 3444456899999999999 Q ss_pred H-HHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 191 A-VALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) Q Consensus 191 a-~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (377) . +..+.+++-|+|.+....+-+..........+... +..++-.+ .+.+....++ -.. 
T Consensus 150 vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttka--------ksagttpf---anaieeavdf-----------vrp 207 (318) T protein:vir:86 150 VNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKA--------KSAGTTPF---ANAIEEAVDF-----------VRP 207 (318) T ss_pred HHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhh--------hccCCCch---hhHHHHHHhh-----------hcc Confidence 9 88999999999988655553322211111111000 00000011 1112222222 123 Q ss_pred ccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceEEEEeccc-----------EEEEecceeeE Q lcl|Aclame:pro 270 IAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR-----------YDAFMATASTI 338 (377) Q Consensus 270 ~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~-----------y~~~~~~~~~i 338 (377) ..|+.++++...+.-.++..+..-+.+.. +.+-.-... ....+..+.+++.--|+ |.+.+ +++. T Consensus 208 tagrrylivkaedrkalldelrqatanah-vriknddte--iasevgvdeiivytgskalkptvlvdqkyhidm-qdlt- 282 (318) T protein:vir:86 208 TAGRRYLIVKAEDRKALLDELRQATANAH-VRIKNDDTE--IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDM-QDLT- 282 (318) T ss_pred CCCceEEEEeecchHHHHHHHHhhcccce-eEEeccchh--hhhhcCcceeeeeeccccccceeeeccceecch-hhhh- Confidence 45666777666554433322211112211 000000000 00011112222221111 22211 1111 Q ss_pred EeechhhhhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 339 EEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 339 ~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) .-+.--|..+..-+..-..--|-+-..+|=++++++ T Consensus 283 -kvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 283 -KVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred -hhhcceeccCCceEEEeecccCcceeecCceeEEeC Confidence 111111333333333334444556666666777777 No 167 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=96.49 E-value=0.00051 Score=38.63 Aligned_cols=256 Identities=11% Similarity=0.051 Sum_probs=106.8 Q ss_pred 
CCCceeccHHHHHHHHHHHHhhhhhhhhceeE---ecC---C-ceEEEEEcCCcceeeecc----ccccccccccccee- Q lcl|Aclame:pro 85 KDKFKLLPEETMVQVFDDLVAEHPLLKVINFK---NTS---L-RLKALTAETSGTAVWGDI----FGEIKGQLKQAFKE- 152 (377) Q Consensus 85 s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~---~~~---~-~~~~p~~~~~~~a~w~~e----~~~~~~~~~~~f~~- 152 (377) -..-.++|+-++.++++.++....+.+++..- ..+ | .++||+... ..+.+... .+....-.+.+-.. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccccceE Confidence 11234889999999999999998887776432 222 3 378876543 33333211 11111112233333 Q ss_pred -EeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccc Q lcl|Aclame:pro 153 -QDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYK 231 (377) Q Consensus 153 -i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 231 (377) +++..++..+ +.|+.+-...+..++...+.+...++++.++|..++. .=.+.|.+.. ..... T Consensus 80 ~~~id~~k~~~-~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~-~~~~a~~~~~--------~~~~~------- 142 (392) T protein:vir:99 80 PVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRD-MIVGAPYEAA--------GAVHE------- 142 (392) T ss_pred EEEEeeeeecc-eeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHH-HHhccccccc--------ccccc------- Confidence 4444444444 3455555444567777777788899999999987652 1111111100 00000 Q ss_pred hhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc--ccccCC----------CCcc Q lcl|Aclame:pro 232 TDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK--FTSRNQ----------FGEY 299 (377) Q Consensus 232 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~~~~----------~G~~ 299 (377) .++...++.+..+...+ +....| .+| .+++.|..+..+... +..... +|.. 
T Consensus 143 -----------~~~~~~~~~i~~a~~~L--~~~~vP---~~R-~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~v 205 (392) T protein:vir:99 143 -----------VAPDEFFKGVNGARRAL--NELYIP---QGR-VLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARL 205 (392) T ss_pred -----------cChhhhHHHHHHHHHHH--hhcCCC---CCC-EEEEcHHHHHHHhcccceeecccccchhhhhhhccee Confidence 01111122222222211 111223 244 566788777665422 111111 1222 Q ss_pred ccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCc--EEEEEEEEEcCEEeccc-------ceE Q lcl|Aclame:pro 300 VTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDL--QLYLTKNYFYGKAKDNH-------TAA 370 (377) Q Consensus 300 ~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~--~~~~~~~r~dg~~~~~~-------af~ 370 (377) ..+ +|.+|+.+.++|.+..+.+..+.+....+...............+. +..+...-.++...-+. ++. T Consensus 206 g~i--~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~ 283 (392) T protein:vir:99 206 GRI--YGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLK 283 (392) T ss_pred eee--eeeEEEeecccccccceeeeccccccccccccccccccceeEEecccceecceeecccceeeccccccceeEEEE Confidence 122 5678888988887765544333222221111000000000000000 00000111111111000 000 Q ss_pred EEEeecC Q lcl|Aclame:pro 371 LLTLAGG 377 (377) Q Consensus 371 ~l~~~a~ 377 (377) .++..+| T Consensus 284 ~v~~~~~ 290 (392) T protein:vir:99 284 VVEDPNG 290 (392) T ss_pred EEeeccc Confidence 1111111 No 168 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=96.42 E-value=0.00051 Score=38.62 Aligned_cols=277 Identities=11% Similarity=0.009 Sum_probs=121.6 Q ss_pred HHHHHHHhc---cCCCCCceeccHHHHHHHHHHHH-hhhhhhhhceeEecCCc-e--EEEEEcCCcce-----eeecccc Q lcl|Aclame:pro 73 KFFNDIDKN---VGGKDKFKLLPEETMVQVFDDLV-AEHPLLKVINFKNTSLR-L--KALTAETSGTA-----VWGDIFG 140 (377) Q Consensus 73 ~~~~~~~~~---~~~s~gg~lvP~~~~~~Ii~~~~-~~s~l~~~~~v~~~~~~-~--~~p~~~~~~~a-----~w~~e~~ 140 (377) 
-.++..+.. -++.-.-..| +++.+++....+ ..+.|++.++..+-.++ . ..+....-... .-....+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 79 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADG 79 (322) T ss_pred CcccceeeeeeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCc Confidence 001111110 0000011122 556666655444 44677887775443322 1 22221111110 0000001 Q ss_pred ---cccccccccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeecc-CCCcceeeeecccc Q lcl|Aclame:pro 141 ---EIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGN-GLLQPVGLLKDLSQ 216 (377) Q Consensus 141 ---~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~-G~~~P~Gil~~~~~ 216 (377) ......+.....+.+..+ +....|.+.-+.....|..+...+..+.+++++.|..|+.|- |... .|- .+ T Consensus 80 ~~dtp~~~~~~~~r~~~~~d~--~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~----~g 152 (322) T protein:vir:10 80 TYPTPVNNKPFAKRRTNVDTY--DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKG----TG 152 (322) T ss_pred ccCCCccccccceEEEeeccc--ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccc----cc Confidence 111122234445555555 444577776666667888999999999999999999887632 3211 000 00 Q ss_pred ccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc--cccC Q lcl|Aclame:pro 217 PTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF--TSRN 294 (377) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~--~~~~ 294 (377) ..+......... .....++ .+.+-.+... ++... ....++.+.+++|..+..|+... 
+..+ T Consensus 153 t~v~~~ss~~i~------~g~~g~t-------~~kl~~a~~~--l~~~d--vp~d~~R~~vv~p~~~~~LL~d~~~ts~D 215 (322) T protein:vir:10 153 QPVEFLATQEIG------DGTKPIS-------FDYVTEITER--FLENE--IEPEVSKVIVIGPTQARKLLQITEATSAD 215 (322) T ss_pred cccccCCCcccc------cCccchh-------HHHHHHHHHH--HHhcC--CCCCCCeEEEeCHHHHHHHhcchhhhhhh Confidence 000000000000 0000000 0111111111 11111 22234446778888877665321 1111 Q ss_pred C--------CCccccccCCCceEEecCCCCcc------------------eEEEEecccEEEEecceeeEEeechhhhhc Q lcl|Aclame:pro 295 Q--------FGEYVTVLPHGITILESLAVETG------------------KAIAFVANRYDAFMATASTIEEYDQTFAME 348 (377) Q Consensus 295 ~--------~G~~~~~l~~~~~v~~s~~~~~~------------------~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~ 348 (377) - +|.-.++ +|..++.++.+|.. +.+++--+....+...++..+.+.... .. T Consensus 216 ~~~~~~l~~~G~ig~~--lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~-~~ 292 (322) T protein:vir:10 216 YTSAMDLQSKGIITNW--MGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPS-AS 292 (322) T ss_pred cccchhhhhcCeeeee--eeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCC-cc Confidence 1 1322233 56778888877631 122222233333333444444322111 11 Q ss_pred CcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 349 DLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 349 ~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) .-.-+++.+-++.++++|+.+|.+...=- T Consensus 293 ~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 293 FAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred hhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 12235677889999999999999999777 No 169 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=96.34 E-value=0.00024 Score=40.42 Aligned_cols=287 Identities=10% Similarity=0.015 Sum_probs=124.6 Q ss_pred HHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEEcCCcceeeecccccccccccccceeEe Q lcl|Aclame:pro 76 
NDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQD 154 (377) Q Consensus 76 ~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~-~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~ 154 (377) ++.-..+=.+-.....-+++.+.|...-....|+.+++-..... ..+.|+...-...+.-.-.||+..+.......... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 00000000011122345677888887777778888876544433 34567654433222111112221111111111111 Q ss_pred ecce-eEEEeehhhHHHHhcCHHHHHHHHH---HHHHHHHHHHhhcceeecc-----CC----Ccceeeeeccccccccc Q lcl|Aclame:pro 155 FSQF-KLTAFVVIPKDALKFGPKWLKQFIT---EQLKEAIAVALELAIVKGN-----GL----LQPVGLLKDLSQPTVDQ 221 (377) Q Consensus 155 l~~~-k~~~~~~iS~ell~ds~~~~~~~l~---~~la~~~a~~~~~a~l~G~-----G~----~~P~Gil~~~~~~~~~~ 221 (377) -+.- =+...+.||..+..-+.......+. ..=...+.+-+|.+||+|. |. .+.-||++.+....... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~ 160 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLG 160 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceec Confidence 1111 1223344555544433332222222 3334457788999999985 11 23457765543322211 Q ss_pred cccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccc----c--CC Q lcl|Aclame:pro 222 STGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTS----R--NQ 295 (377) Q Consensus 222 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~----~--~~ 295 (377) ..+.............+....... +.+..+++..-..+ ...+ .++++|...-.+-..... . .. 
T Consensus 161 ~~g~~~~~~~~~~~t~~t~~~lte----~~l~~~l~~i~~~G------g~~~-~i~v~a~~k~~i~~~~~~~~~~i~~~~ 229 (317) T protein:vir:88 161 ANGVAPVGDGSNTGTAGDLRLLTE----DMLLNASESIWRNG------GQAN-SIQTSSSIKKAISKNMKGRATEITLDA 229 (317) T ss_pred cCccccccCCCccccccccccccH----HHHHHHHHHHHhcC------CCCC-EEEeChHHHHHHHHHhcCCceeEEEcc Confidence 111110000000000000000111 11222222111110 1112 235676543222111000 0 00 Q ss_pred -C---C----ccccccCCCceEEecCCCCcceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEeccc Q lcl|Aclame:pro 296 -F---G----EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNH 367 (377) Q Consensus 296 -~---G----~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~ 367 (377) . | .|.+-+| -+.++.+.++|++++++.|++++-+..=..+..+.+...+ |..-+.....+.-++.+++ T Consensus 230 ~~~~~g~~v~~~~tdfG-~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laKtG---d~~k~~i~~E~tLe~~N~~ 305 (317) T protein:vir:88 230 SDNRIAQTVDVYESDFG-KYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAKTG---DSEKRQLLVEYTFRVNNEK 305 (317) T ss_pred cCeEEEEEEEEEEeCCe-EEEEEeCCCCCCCeEEEEcccccceeecccceeeccCCCc---ccceeEEEEEEEEEEcCcc Confidence 0 1 1223333 2577889999999999999998755443455544443332 3334556677778899999 Q ss_pred ceEEEEe-ecC Q lcl|Aclame:pro 368 TAALLTL-AGG 377 (377) Q Consensus 368 af~~l~~-~a~ 377 (377) |..++.. 
+++ T Consensus 306 a~a~i~~l~~~ 316 (317) T protein:vir:88 306 SGALIRDVVAQ 316 (317) T ss_pred ceeEEEEeccc Confidence 9988874 444 No 170 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=96.03 E-value=0.0011 Score=36.79 Aligned_cols=260 Identities=9% Similarity=-0.025 Sum_probs=120.6 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe---------cCC-ceEEEEEcC-Ccceeeeccccccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN---------TSL-RLKALTAET-SGTAVWGDIFGEIKGQLK 147 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~---------~~~-~~~~p~~~~-~~~a~w~~e~~~~~~~~~ 147 (377) +. ++.-.-.++|+-|.+-+.+...+.+.+++..-+.+ -+| .+.+|.... ++++.-+.+..++..+ + T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~-k 77 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVN-N 77 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchh-e Confidence 22 33345678899887777776666666654221111 134 368897654 2455444445554432 3 Q ss_pred ccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccc---ccccccc Q lcl|Aclame:pro 148 QAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQP---TVDQSTG 224 (377) Q Consensus 148 ~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~---~~~~~~~ 224 (377) .+-++-.-..+..+.-..++.+-..-+.-|..+.+.++++...++..++.+|.- -.|++...... ....+.. 
T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gv~~~~~~~~~~~~d~t~~ 152 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSV-----LKGVMGVTKIANSKVYDQTKV 152 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhchhhcccceeccccc Confidence 333333333333343355566544445567788899999999999888876631 01111100000 0000000 Q ss_pred ccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhccc-C-ceEEEeccchhhhhcccc---cccCCCC-- Q lcl|Aclame:pro 225 RDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIA-G-QVKLLLNPEDRWTLEAKF---TSRNQFG-- 297 (377) Q Consensus 225 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~n~~~~~~~~~~~---~~~~~~G-- 297 (377) ... . .......+.+.+ .+. -+.. . -.+|+||+..+..++..- ..+..+| T Consensus 153 ~~~---------~---~~is~~~l~~A~-------~~~-----GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~ 208 (351) T protein:vir:15 153 SPS---------E---PMFGAKGFTGAI-------GLM-----GDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGAT 208 (351) T ss_pred ccc---------c---cccCHHHHHHHH-------HHh-----ccccccceEEEEEChHHHHHHHhhhhhhhccccccCc Confidence 000 0 001111111111 111 1111 1 257899999988776321 1122222 Q ss_pred ccccccCCCceEEecCCCCcc----------eEEEEecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEeccc Q lcl|Aclame:pro 298 EYVTVLPHGITILESLAVETG----------KAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNH 367 (377) Q Consensus 298 ~~~~~l~~~~~v~~s~~~~~~----------~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~ 367 (377) .+.+. .|++|+.++.||.. ..+||.-. +.+.. ++..+++.++.....++.......++ ++||. 
T Consensus 209 ~i~t~--~G~~VivdD~~p~~~~~~~~~~ytsyl~~~GA-i~~~~-~~~~ve~~rd~~~~~g~d~l~~r~~~---~~hp~ 281 (351) T protein:vir:15 209 PFEAY--NGLRIVLDDDIEIDLTDKTKPVSTSYIFAPGA-VRYST-NMRSTETKYDPLINGGQDVIVQKRVG---TIHVA 281 (351) T ss_pred cccee--cceEEEEcCCCccccCCCCCceeEEEEEecce-eeeec-CCcCcceeecccCCCCceEEEEeeee---eeeee Confidence 22222 47899999999842 12333211 11111 22233443433333444334343333 57777 Q ss_pred ceEEEE---eecC Q lcl|Aclame:pro 368 TAALLT---LAGG 377 (377) Q Consensus 368 af~~l~---~~a~ 377 (377) .+..-. .++| T Consensus 282 G~s~~~~~~~~~~ 294 (351) T protein:vir:15 282 GTSIKASFSPSKA 294 (351) T ss_pred eeeecccccccCc Confidence 776542 2344 No 171 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=95.22 E-value=0.0025 Score=34.84 Aligned_cols=286 Identities=14% Similarity=0.127 Sum_probs=134.0 Q ss_pred ccccccHHHHHHHHHHH---hccCCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeEec-C-C--ceEEEEEcCCcce Q lcl|Aclame:pro 63 KNRELTAEEIKFFNDID---KNVGGKDKFKLLP--EETMVQVFDDLVAEHPLLKVINFKNT-S-L--RLKALTAETSGTA 133 (377) Q Consensus 63 ~~~~lt~~e~~~~~~~~---~~~~~s~gg~lvP--~~~~~~Ii~~~~~~s~l~~~~~v~~~-~-~--~~~~p~~~~~~~a 133 (377) ..-...++..+.-.... ....++.|-+++. 
+.+.+.|++.....-.-+.++.+..- + + ...++.....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a 80 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIA 80 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccce Confidence 00000011111111111 1222233344443 34455566544444333444333221 1 1 2356666677888 Q ss_pred eeecccccccccccccceeEeecceeEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceee Q lcl|Aclame:pro 134 VWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFG---PKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGL 210 (377) Q Consensus 134 ~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds---~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gi 210 (377) .|++..+...+..+..++......+.++..+.+|..=|+-+ ..++..--....+.++...+|+.+++|+...+-.|+ T Consensus 81 ~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GL 160 (314) T protein:vir:10 81 QIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVSV 160 (314) T ss_pred eeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeE Confidence 88875544334556677888888888888888876655433 567888888999999999999999999987788999 Q ss_pred eeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccc Q lcl|Aclame:pro 211 LKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF 290 (377) Q Consensus 211 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~ 290 (377) ||.+......+... + +.+...++.+..++..+.....+ ......+++.|..+..+.. 
T Consensus 161 lN~p~v~~~~~~~~-----W------------aT~~ei~~Di~~~~~~l~~~s~g----~~~p~~l~Lpp~~~~~L~~-- 217 (314) T protein:vir:10 161 FDQPNINNVVATPN-----W------------SVPQNAIDDVTAMIDAVESSTQG----LHHVTDILLPASARRVMQG-- 217 (314) T ss_pred eecCCCccccCCCC-----c------------ccHHHHHHHHHHHHHHHHHhcCc----cccceeEEecHHHHHhhcc-- Confidence 99876533222111 1 11222223333333322221111 1122356777775543322 Q ss_pred cccCCCCc----cccccCCCceEEecCCCC----cce--EEEEeccc-E-EEEecceeeEEeechhhhhcCcEEEEEEEE Q lcl|Aclame:pro 291 TSRNQFGE----YVTVLPHGITILESLAVE----TGK--AIAFVANR-Y-DAFMATASTIEEYDQTFAMEDLQLYLTKNY 358 (377) Q Consensus 291 ~~~~~~G~----~~~~l~~~~~v~~s~~~~----~~~--ii~gd~s~-y-~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r 358 (377) +. +..|. |+.--..++.+...+.+. .++ +++.+-+. + .+.....+.... -+.+-. ...+....| T Consensus 218 ~~-~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-~e~~~~--~~~~~~~~r 293 (314) T protein:vir:10 218 LV-PQTNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLP-AQPKDL--HFRYPVTSK 293 (314) T ss_pred cc-cCCCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeec-ceecCc--eEEEcceee Confidence 11 11221 111111233333322221 111 22222221 2 222222332211 111111 112234566 Q ss_pred Ec-CEEecccceEEEE---ee Q lcl|Aclame:pro 359 FY-GKAKDNHTAALLT---LA 375 (377) Q Consensus 359 ~d-g~~~~~~af~~l~---~~ 375 (377) +. 
..+..|.|+++++ .+ T Consensus 294 ~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 294 ATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred eEEEEEECcceeEeeeeeecC Confidence 65 4566778888653 33 No 172 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=94.88 E-value=0.0016 Score=35.90 Aligned_cols=288 Identities=9% Similarity=0.021 Sum_probs=132.9 Q ss_pred hccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhhh----ceeEecCCc--eEEEEEcC-Ccc Q lcl|Aclame:pro 60 LRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKV----INFKNTSLR--LKALTAET-SGT 132 (377) Q Consensus 60 ~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~----~~v~~~~~~--~~~p~~~~-~~~ 132 (377) +.. ..|++-....+. +.+..+.+.+-..++|+.. .++.+.+|+ +..|..-. ... T Consensus 1 mp~--~~lsel~t~tl~-----------------~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~ 61 (321) T protein:vir:34 1 MPF--PNISDIITTTIE-----------------SRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSN 61 (321) T ss_pred CCC--chHHHHHHHHHH-----------------hhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcc Confidence 111 122222222111 1122233333344444332 345556664 55666654 778 Q ss_pred eeeecccccccccccccceeEeecceeEEEeehhhHH-HHhcCH-HHHHHHHH---HHHHHHHHHHhhcceee-ccC--C Q lcl|Aclame:pro 133 AVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKD-ALKFGP-KWLKQFIT---EQLKEAIAVALELAIVK-GNG--L 204 (377) Q Consensus 133 a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~e-ll~ds~-~~~~~~l~---~~la~~~a~~~~~a~l~-G~G--~ 204 (377) +.|............-.|.+-+|..+.+++-+.||-. +|+.+. ..+..+|. +..-+.++..++..+.. |+| . 
T Consensus 62 ~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~ 141 (321) T protein:vir:34 62 GGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGG 141 (321) T ss_pred eeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhcccccccc Confidence 9998766666544455899999999999988888753 554432 22223333 33345566667776665 664 3 Q ss_pred Ccceeeeeccc----cccccccccccccccchhhhhhhhhhc-cChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 205 LQPVGLLKDLS----QPTVDQSTGRDITTYKTDKEAIADLSD-LDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 205 ~~P~Gil~~~~----~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) .+..|+---+. .++..+...++ +..+......... .++.+....+..+-.. ..+..+.+-+++- T Consensus 142 ~~i~GL~~lv~~~p~tGtvGGIdra~---~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~--------~~Rg~~~PDlii~ 210 (321) T protein:vir:34 142 RAINGLDGAVPVDPTVGTYGGINRAL---WPFWRSQVEDMAAVATINTIQPAMTKLWSR--------CVRGADMPDLIMS 210 (321) T ss_pred chhhhhhhhcccCCCCceeccccccc---hhhhhhhhhhhhhcccHHHHHHHHHHHHHh--------hccCCCCccEEEe Confidence 35556532222 12222212222 2222221111111 2333333333322221 2223334444444 Q ss_pred cchhhhhccc-------ccccCCCCccccccC-CCceEEecC----CCCcceEEEEecccEEEEecceeeEEeechhhhh Q lcl|Aclame:pro 280 PEDRWTLEAK-------FTSRNQFGEYVTVLP-HGITILESL----AVETGKAIAFVANRYDAFMATASTIEEYDQTFAM 347 (377) Q Consensus 280 ~~~~~~~~~~-------~~~~~~~G~~~~~l~-~~~~v~~s~----~~~~~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~ 347 (377) ..++|..-.. +.+-.......+.|- .+..|+.++ .+|++..+|=|=++..+...++-.+......++. 
T Consensus 211 ~~~~y~~y~~s~q~~qR~~~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~ 290 (321) T protein:vir:34 211 GNDAWTTYSNSLQVLQRFTSAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRA 290 (321) T ss_pred chHHHHHHHHhhheeeeecccccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCccccc Confidence 4555443211 111111111112222 367788776 6888888888888777777666666665554433 Q ss_pred -cCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 348 -EDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 348 -~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) -||....-..-+-|.++-.++..=..+.+- T Consensus 291 ~~NqdA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 291 AFNQDAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred ccchhHHhhhhhhhheeeeecccceeEEeeC Confidence 233333333333344433333322222222 No 173 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=94.85 E-value=0.0033 Score=34.16 Aligned_cols=296 Identities=8% Similarity=0.032 Sum_probs=140.7 Q ss_pred ccccccHHHHHHHHHHHhc------cCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCccee Q lcl|Aclame:pro 63 KNRELTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAV 134 (377) Q Consensus 63 ~~~~lt~~e~~~~~~~~~~------~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~ 134 (377) ..+.|+.+-|..|+..... 
..+....+.|-+.+...+.+.+++.|-+++.++++++.- +-++.....++-++ T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iag 80 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTG 80 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceee Confidence 3344666666666655432 222334567777889999999999999999999998862 34555555555544 Q ss_pred eecccccccccccccceeEeecceeEEEeehhhHHHHhc-C----HHHHHHHHHHHHHHHHHHHhhcceeeccCC----- Q lcl|Aclame:pro 135 WGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF-G----PKWLKQFITEQLKEAIAVALELAIVKGNGL----- 204 (377) Q Consensus 135 w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d-s----~~~~~~~l~~~la~~~a~~~~~a~l~G~G~----- 204 (377) -... + ..+ .++..+.-.+..++.--=+.|+.+.|+. + .++|...+++.+.++++.-+-.--++|+-. T Consensus 81 rtdt-~-R~~-r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td 157 (341) T protein:vir:27 81 RKAG-G-RFT-KQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTD 157 (341) T ss_pred ccCC-C-cee-cccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCC Confidence 4332 2 211 1234555555555555556677777742 2 478999999999999988777777787641 Q ss_pred --Ccce------eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 205 --LQPV------GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 205 --~~P~------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) ..|. |++......+....-.... ...-+..+. .++..+..+....-.........+.+. 
T Consensus 158 ~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~---------~~~g~~gdy----~nLDAlV~D~~~~lI~~~~~~d~dLVv 224 (341) T protein:vir:27 158 PSANPLGQDVNEGWIAFVKNRKASQVVDVDV---------YFDETNGDY----RTLDAMASDIINNQIHPMFRNDPRLTV 224 (341) T ss_pred hhhcccccccchhHHHHHHhhcccceeccce---------eeccCCCcc----ccHHHHHHHHHhcccChHHhcCCCEEE Confidence 1232 4443221111000000000 000001111 111111111110001111222334555 Q ss_pred EeccchhhhhcccccccCC------CCccccccCCCceEEecCCCCcceEEEEecccEEEEe-cceee--EEeechh-hh Q lcl|Aclame:pro 277 LLNPEDRWTLEAKFTSRNQ------FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFM-ATAST--IEEYDQT-FA 346 (377) Q Consensus 277 ~~n~~~~~~~~~~~~~~~~------~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~y~~~~-~~~~~--i~~~~~~-~f 346 (377) +|-..-...-...+..... .++-++-..-|+|.+.-+++|++.+++=-|+..-|.. .+... +.-.++. ++ T Consensus 225 ivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 304 (341) T protein:vir:27 225 FVGSGLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRS 304 (341) T ss_pred EEchhhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccc Confidence 5543211111111111100 0111111113778888999999998766555432211 11111 1111111 11 Q ss_pred hcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 347 MEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 347 ~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ...+-+|+. 
+-+ |+ ...=.|..+++++| T Consensus 305 e~yes~YvV-Edy-g~-~~~~~~~~vkl~~~ 332 (341) T protein:vir:27 305 KTHTGAWKV-TQW-VC-WKRSPLTTQKKSTS 332 (341) T ss_pred cchhhhhee-ehh-hh-hhhccccccccCcc Confidence 111224433 222 22 33345777888888 No 174 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=93.49 E-value=0.0074 Score=32.27 Aligned_cols=274 Identities=14% Similarity=0.080 Sum_probs=126.7 Q ss_pred CCCCceeccH--HHHHHHHHHHHhhhhhhhhceeEecC----CceEEEEEcCCccee--eecccccccccccccceeEee Q lcl|Aclame:pro 84 GKDKFKLLPE--ETMVQVFDDLVAEHPLLKVINFKNTS----LRLKALTAETSGTAV--WGDIFGEIKGQLKQAFKEQDF 155 (377) Q Consensus 84 ~s~gg~lvP~--~~~~~Ii~~~~~~s~l~~~~~v~~~~----~~~~~p~~~~~~~a~--w~~e~~~~~~~~~~~f~~i~l 155 (377) -+...+++.+ .+.++|.+.....-..+.++.+.+.. -...+...+..+.+. |....+...+..+..+++-.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 2223333321 22333333222222233333322111 123445555555666 866554444455666777777 Q ss_pred cceeEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHhhcceeeccCC-Ccceeeeeccccccccccccccccccc Q lcl|Aclame:pro 156 SQFKLTAFVVIPKDALKFG---PKWLKQFITEQLKEAIAVALELAIVKGNGL-LQPVGLLKDLSQPTVDQSTGRDITTYK 231 (377) Q Consensus 156 ~~~k~~~~~~iS~ell~ds---~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~ 231 (377) ..+.++.-+.+|.+=|+-+ ..++.+--.+...+++...+|+..++|+-. 
..-.|++|.+...............+ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w- 159 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKV- 159 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCcc- Confidence 7777776666665544322 345667666777788999999999999743 35789999877764433222111111 Q ss_pred hhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCc---ccc-----cc Q lcl|Aclame:pro 232 TDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGE---YVT-----VL 303 (377) Q Consensus 232 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~---~~~-----~l 303 (377) ...++...++.+..+.........+ ..-...+++.|+.+..+.... ..+.+.. |+. .- T Consensus 160 ---------~~~T~~eI~~di~~~~~~i~~~s~~----~~~p~tl~Lpp~~~~~l~~~~-~~~~~~Tvl~~l~~n~~~~~ 225 (304) T protein:vir:52 160 ---------QAMDFDKAVAFFKEIFLKGMEKTKR----IEAPNTFAIDSLDLAHLALVQ-RANTDTTALEFLTKHLSAAA 225 (304) T ss_pred ---------ccCCHHHHHHHHHHHHHHHHhccCc----eecCceEEeCHHHHHHHhhcc-CCCCCchHHHHHHHhccccc Confidence 1123333333333333332211111 111123555665443332111 1111211 110 01 Q ss_pred CCCceEEe--cCCCC---cc--eEEEEecccEEEEecceeeEEeechhhhhcCcEEEE--EEEEEcC-EEecccceEEEE Q lcl|Aclame:pro 304 PHGITILE--SLAVE---TG--KAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYL--TKNYFYG-KAKDNHTAALLT 373 (377) Q Consensus 304 ~~~~~v~~--s~~~~---~~--~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~--~~~r~dg-~~~~~~af~~l~ 373 (377) +.|+.+.. ..... .| .+++.+-+.=.+.+.-.+.+..... -.++...|. ++.|+.| .+..|.|++++. 
T Consensus 226 g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~--q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D 303 (304) T protein:vir:52 226 GRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDA--QPKGLLAFESGLRMAFGGVTFMEPDSALYVD 303 (304) T ss_pred CCcceEEEecccccccCCCCceEEEEEecChhheEEecCccccccch--hhcCCceEEecceeeeeeEEEEccceeeeec Confidence 12333322 22111 12 2455555432222222333333322 113432332 6777776 677889999999 Q ss_pred e Q lcl|Aclame:pro 374 L 374 (377) Q Consensus 374 ~ 374 (377) . T Consensus 304 ~ 304 (304) T protein:vir:52 304 Y 304 (304) T ss_pred C Confidence 9 No 175 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=91.49 E-value=0.016 Score=30.49 Aligned_cols=301 Identities=17% Similarity=0.192 Sum_probs=134.4 Q ss_pred HHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHh--ccCCCCCceeccHHHHHHH Q lcl|Aclame:pro 22 ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK--NVGGKDKFKLLPEETMVQV 99 (377) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~--~~~~s~gg~lvP~~~~~~I 99 (377) +.+ -.+-..+..+-++.+.+ +..+.+. +.+++.-.. 
..+.++.-+-+|..+...| T Consensus 1 mtn---fiesqnavteffdvlkk---nsgksei-----------------knawnaklaengvtitdttfqlprklvesi 57 (318) T protein:vir:94 1 MTN---FIESQNAVTEFFDVLKK---NSGKSEI-----------------KNAWNAKLAENGVTITDTTFQLPRKLVESI 57 (318) T ss_pred Ccc---chhhhhhHHHHHHHHhc---ccChhhh-----------------hhhhhhhhhhCCceeecchhhhHHHHHHhh Confidence 000 00000111111111110 0011122 223333222 2333455566888888888 Q ss_pred HHHHHhhhhhhhhceeEecCCceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHH--HHhcCHHH Q lcl|Aclame:pro 100 FDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKD--ALKFGPKW 177 (377) Q Consensus 100 i~~~~~~s~l~~~~~v~~~~~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~e--ll~ds~~~ 177 (377) -..+...+|+++...+.+++.-+--.....+..+.-+ -.|..++++..++.--++.|-.++.+-.+... -|++|-.. T Consensus 58 ntallntnpvfkvfhvtnvgallvsrsfdssneaqvh-kdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyse 136 (318) T protein:vir:94 58 NTALLNTNPVFKVFHVTNVGALLVSRSFDSSNEAQVH-KDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSE 136 (318) T ss_pred hhhhccCCcceeeeeehhhhheeeeccccccchhhhh-cccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHH Confidence 8888888899888777766543321122334444433 34555566666777667777666655544443 35778778 Q ss_pred HHHHHHHHHHHHHHHHh-hcceeeccCCCcceeeeeccccccccccc-cccccccchhhhhhhhhhccChHHHHHHHHHH Q lcl|Aclame:pro 178 LKQFITEQLKEAIAVAL-ELAIVKGNGLLQPVGLLKDLSQPTVDQST-GRDITTYKTDKEAIADLSDLDPDTAVELLVPV 255 (377) Q Consensus 178 ~~~~l~~~la~~~a~~~-~~a~l~G~G~~~P~Gil~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 255 (377) +-..|..++.++|..++ |-+++-|+|++..+.|-+..........+ .+... 
+.-.++ ..+....++ T Consensus 137 lynlivaeltqaivnkivdlalvegdgtngfksidkeadvkkikkittkaksa---------gktpfa---daieeavdf 204 (318) T protein:vir:94 137 LYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSA---------GKTPFA---DAIEEAVDF 204 (318) T ss_pred HHHHHHHHHHHHHHhhhhheeeeecCCcchhhhhchhhhHHHHHHhhhhhhhc---------CCCchh---HHHHHHHhh Confidence 88999999999998775 66889999998777775433221111110 00000 000111 112222222 Q ss_pred HHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCccccccCCCceEEecCCCCcceEEEEecc--------- Q lcl|Aclame:pro 256 MKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVAN--------- 326 (377) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s--------- 326 (377) -....|+.++++...|.-.++..+..-+.+.... +-..... ....+..+.|++.--| T Consensus 205 -----------vrptagrrylivktedrkalldelrqatananvr-iknddte--iasevgvdeiivytgskavkptvlv 270 (318) T protein:vir:94 205 -----------VRPTAGRRYLIVKTEDRKALLDELRQATANANVR-IKNDDTE--IASEVGVDEIIVYTGSKAVKPTVLV 270 (318) T ss_pred -----------hccCCCceEEEEeccchHHHHHHHHhhhcccceE-Eeccchh--hhhhcCcceeEEeeccccccceeEe Confidence 1234566677776665443332221111221100 0000000 0000111222222111 Q ss_pred --cEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEee Q lcl|Aclame:pro 327 --RYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) Q Consensus 327 --~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~ 375 (377) +|.+.+ +++. 
.-+.--|..+..-+..-..--|-+-..+|=++++++ T Consensus 271 dqkyhidm-qdlt--kvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 271 DQKYHIDM-QDLT--KVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ccceecch-hhhh--hhhceeeccCCceEEEEecccCcceeecCceeEEeC Confidence 122222 1111 111111333333333334444556666666777777 No 176 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=90.76 E-value=0.019 Score=30.00 Aligned_cols=254 Identities=10% Similarity=-0.070 Sum_probs=113.0 Q ss_pred cCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-----CC-ceEEEEEcCCcceeeecccccccccccccce--eE Q lcl|Aclame:pro 82 VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT-----SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFK--EQ 153 (377) Q Consensus 82 ~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~-----~~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~--~i 153 (377) ...-+...+-|+-++.++++.+++..++.++|..-.- .| .++||+......... .... -.+.+=. .+ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg----~~~~-~~~~te~~v~l 75 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASG----RTLV-KQPMVDQTIPF 75 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeeccc----CCcc-ccccccceEEE Confidence 2333345566999999999999999998887754211 13 567877432221111 1111 1122223 35 Q ss_pred eecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchh Q lcl|Aclame:pro 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) Q Consensus 154 ~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) +|+-+|... +.++.+-+..+..++...+.+..+.+++..+|..++. +++... ....+..... 
T Consensus 76 ~id~~k~~~-~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~---------l~~~a~---~~~gt~gt~~----- 137 (418) T protein:vir:10 76 KIAYQEHVG-LEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLAL---------TLKKAF---HSSGTPGVRP----- 137 (418) T ss_pred EEecccccc-eeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhcc---cccccCCcCc----- Confidence 555555444 4455554445567787777788999999999987663 111110 0000000000 Q ss_pred hhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc-ccCCCCc---ccc---ccCCC Q lcl|Aclame:pro 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT-SRNQFGE---YVT---VLPHG 306 (377) Q Consensus 234 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~-~~~~~G~---~~~---~l~~~ 306 (377) . .++.+..+...+ + .......++..++++|..+..+..... ..+..|. |.. ...+| T Consensus 138 ------------~-~~~~i~~a~~~L--d--~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~G 200 (418) T protein:vir:10 138 ------------G-AFIDFANAGAKQ--T--TYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAA 200 (418) T ss_pred ------------c-hHHHHHHHHHHH--H--hcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeec Confidence 0 011122221111 1 112233455567789987765542211 1111111 111 12367 Q ss_pred ceEEecCCCCcceEEEEeccc-E-EEEe-cceeeEEe-----echhhhhcC-cEEEEEEEE---EcCEE-ecccceEEEE Q lcl|Aclame:pro 307 ITILESLAVETGKAIAFVANR-Y-DAFM-ATASTIEE-----YDQTFAMED-LQLYLTKNY---FYGKA-KDNHTAALLT 373 (377) Q Consensus 307 ~~v~~s~~~~~~~ii~gd~s~-y-~~~~-~~~~~i~~-----~~~~~f~~~-~~~~~~~~r---~dg~~-~~~~af~~l~ 373 (377) ..|+.|+++|..+. |.+.. . +.+- ..+-.+.. +.......| ...|-+..- +...+ -+.+-|++.. 
T Consensus 201 F~V~~S~nip~~ta--g~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~ 278 (418) T protein:vir:10 201 YEVYESQNLPKHTV--GDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLE 278 (418) T ss_pred eEEEEecCCCcccc--cccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEe Confidence 78999999985431 22221 1 1111 11111111 111112222 222322211 11111 1334565553 Q ss_pred e----ecC Q lcl|Aclame:pro 374 L----AGG 377 (377) Q Consensus 374 ~----~a~ 377 (377) - ++| T Consensus 279 ~~~~~~~~ 286 (418) T protein:vir:10 279 DVDTDAGG 286 (418) T ss_pred eccccccC Confidence 2 222 No 177 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=89.79 E-value=0.024 Score=29.43 Aligned_cols=346 Identities=10% Similarity=-0.008 Sum_probs=122.6 Q ss_pred CCccHHHHHHHHHHHHHHHHH--HHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHH- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAK--ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFND- 77 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~- 77 (377) |.++-|+|.++|.-+.+-... +.+.-.+..-.+.+|.....+.++. +++........++.|++.+...... 
T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~e~~~~~l~e~~~~~~~~~ 74 (529) T protein:vir:10 1 MSLKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDP------VYRDDKLIEAFGQSLMEAEVAGDHGY 74 (529) T ss_pred CccchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhccc------ccchhhhhhhhhhccchhhccccccc Confidence 999999999988765442221 1111111111122222222221111 1111111111112222222210000 Q ss_pred ---HHhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------E--EEEEcC-------------- Q lcl|Aclame:pro 78 ---IDKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------K--ALTAET-------------- 129 (377) Q Consensus 78 ---~~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~--~p~~~~-------------- 129 (377) ....++++ +.. +.+...++..+|. .-+-.+++-|.||++.. + ++-... T Consensus 75 ~~~~ia~s~~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~ 150 (529) T protein:vir:10 75 DPTNIAAGQSS-GAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAP 150 (529) T ss_pred ccccccccccc-ccc---ccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCcccccccccccc Confidence 00111111 110 1112222222221 11223344444443211 0 000000 Q ss_pred ---------------------------------Ccceeeec--------------------------------------- Q lcl|Aclame:pro 130 ---------------------------------SGTAVWGD--------------------------------------- 137 (377) Q Consensus 130 ---------------------------------~~~a~w~~--------------------------------------- 137 (377) .....|.. 
T Consensus 151 dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~ 230 (529) T protein:vir:10 151 DAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGEL 230 (529) T ss_pred cccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccc Confidence 00000000 Q ss_pred ---------ccccc----cccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 138 ---------IFGEI----KGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVA 193 (377) Q Consensus 138 ---------e~~~~----~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~ 193 (377) ..++. ...+...|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..- T Consensus 231 ~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlE 310 (529) T protein:vir:10 231 AEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLE 310 (529) T ss_pred cccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHH Confidence 00000 011123466777777766543 45888888873 4679999999999999999 Q ss_pred hhcceeeccC-C-Cc--ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 194 LELAIVKGNG-L-LQ--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) Q Consensus 194 ~~~a~l~G~G-~-~~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (377) +|+.||. += + .+ -.|+-.... ....+.......+ .....+....+..++-......+...+. 
T Consensus 311 INReii~-~i~~~a~~~~~g~~~~~~-------~~~gv~d~~~~~d------~~~~~~~~e~~~~L~~~i~~~an~I~~~ 376 (529) T protein:vir:10 311 INREVID-WINYTAQVGKSGWTQTVG-------SAAGVFDFQDPID------VRGARWAGESYKALLIQIDKEANEIARQ 376 (529) T ss_pred hhHHHHH-Hhhhhceeeeeeeecccc-------ccccceecccccc------ccccchhHHHHHHHHHHHHHHHHHHHHh Confidence 9999986 20 0 11 112210000 0000000000000 0001111111112222222222222222 Q ss_pred c---cCceEEEeccchhhhhc--------------ccccccCCCCccccccCCCceEEecCCCCcceEEEEecc--cEEE Q lcl|Aclame:pro 270 I---AGQVKLLLNPEDRWTLE--------------AKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVAN--RYDA 330 (377) Q Consensus 270 ~---~~~~~~~~n~~~~~~~~--------------~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s--~y~~ 330 (377) + .++ .+++.|.-..-|. ......+..+.+...|.-+++|+.+++.+.+-+++|--. .|.. T Consensus 377 T~rg~~n-~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~ 455 (529) T protein:vir:10 377 TGRGAGN-FIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDA 455 (529) T ss_pred hccccce-EEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCccccc Confidence 2 233 3445553222111 000001112234445555778888888887767666321 1110 Q ss_pred Ee-----cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEee---------------cC Q lcl|Aclame:pro 331 FM-----ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA---------------GG 377 (377) Q Consensus 331 ~~-----~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~---------------a~ 377 (377) +. ....-....|...|.. 
.+-.+.|+ |-.++| |+.-+-- +| T Consensus 456 glfy~PYv~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP--~~~~~~~~~~~r~~~g~~~~~~ag 516 (529) T protein:vir:10 456 GIYYCPYVALTPLRGSDPKNFQP---VMGFKTRY-AIGVNP--FAESRTQAPTSRISNGMPGAHSVG 516 (529) T ss_pred ceeeccccccccccccCCCcccc---eeeeeeee-ceeecC--ccccccccccccccCCcchhhhcC Confidence 00 0011111233333433 33344555 334555 3221111 11 No 178 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=88.64 E-value=0.031 Score=28.85 Aligned_cols=344 Identities=12% Similarity=0.026 Sum_probs=125.0 Q ss_pred CCccHH-HHHHHHHHHHHHHHHHHhccC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHH---H Q lcl|Aclame:pro 1 MAINLK-ELPKYREAVAELSAKISAGAT--PEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIK---F 74 (377) Q Consensus 1 m~~~~~-~l~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~ 74 (377) |+|+-+ +|.++|+-+.+-.. +.+-.+ +.--.+.+|.....++++. +++........+.-|++.+.. . T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~-~~~i~~~~~~~~a~~~enq~~~~~~~~------~~~~~~~~~~~~~~l~e~~~~~~~~ 73 (521) T protein:vir:10 1 MTIKTKAELLNKWKPLLEGEG-LPEIANSKQAIIAKIFENQEKDFQTAP------EYKDEKIAQAFGSFLTEAEIGGDHG 73 (521) T ss_pred CCcchhHHHHHhhhhhhccCC-CCccccchhhhhhhhhhhhhhhhhhcc------ccchhHHHHHHhhhhhhhcccCccc Confidence 999764 47888875544311 100001 1111122222222221111 111111111111111111000 0 Q ss_pred HHH-HHhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------EEEEEcCC-------------- Q lcl|Aclame:pro 75 FND-IDKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------KALTAETS-------------- 130 (377) Q Consensus 75 ~~~-~~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~~p~~~~~-------------- 130 (377) .+. ....++++ +.. +.+...++..+|. .-+-.+++-|.||++.. +.-..... 
T Consensus 74 ~~~~~i~es~~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ 149 (521) T protein:vir:10 74 YNATNIAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYG 149 (521) T ss_pred cccccccccccc-ccc---ccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcc Confidence 000 00011111 111 1222333333332 12234556666665421 11110000 Q ss_pred cceeeeccc--------------------------------------------------------------------ccc Q lcl|Aclame:pro 131 GTAVWGDIF--------------------------------------------------------------------GEI 142 (377) Q Consensus 131 ~~a~w~~e~--------------------------------------------------------------------~~~ 142 (377) +++.|.+.. +.. T Consensus 150 ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~Gms 229 (521) T protein:vir:10 150 PDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMA 229 (521) T ss_pred ccccccccccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccc Confidence 000000000 000 Q ss_pred ----------cccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHhhcceeec Q lcl|Aclame:pro 143 ----------KGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVALELAIVKG 201 (377) Q Consensus 143 ----------~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~~~~a~l~G 201 (377) ...+...|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..-+|+.||. 
T Consensus 230 Ta~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~- 308 (521) T protein:vir:10 230 TSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD- 308 (521) T ss_pred hhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhh- Confidence 011123477777777777653 45888988873 46799999999999999999999883 Q ss_pred cC-CC-c--ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc---ccCce Q lcl|Aclame:pro 202 NG-LL-Q--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK---IAGQV 274 (377) Q Consensus 202 ~G-~~-~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 274 (377) += .. + -.|+.+. ............+.+ .....+....+..++-...+..+..... ..+++ T Consensus 309 ~i~~sa~~~~~g~t~~-------~~~~~G~~d~~~~~d------~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~ 375 (521) T protein:vir:10 309 WINYSAQVGKSGMTLT-------PGSKAGVFDFQDPID------IRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNF 375 (521) T ss_pred hhhheeeeeeeeeeec-------cCccccceecccccc------cccchHHHHHHHHHHHHHHHHHHHHHHhcccccceE Confidence 20 00 0 1122100 000000000000000 0001111111112222222222222222 22333 Q ss_pred EEEeccchhhhhccc---------------ccccCCCCccccccCCCceEEecCCCCcceEEEEeccc--E-----EEEe Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAK---------------FTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR--Y-----DAFM 332 (377) Q Consensus 275 ~~~~n~~~~~~~~~~---------------~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~--y-----~~~~ 332 (377) +++.|.-.. ++.. ...-+....+...|.-+++|+.+++.+.+-+++|--.. 
+ +.== T Consensus 376 -~i~S~~Va~-~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPY 453 (521) T protein:vir:10 376 -IIASRNVVN-VLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPY 453 (521) T ss_pred -EEEchHHHH-HHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccc Confidence 345553322 2211 00001111233445557788888888877676663211 1 1000 Q ss_pred cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEE-------EeecC Q lcl|Aclame:pro 333 ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALL-------TLAGG 377 (377) Q Consensus 333 ~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l-------~~~a~ 377 (377) .....+...|...|.. .+-.+.|+ |-.++| |+.- .|.+| T Consensus 454 v~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP--~~~~~~~~~~~~i~~~ 499 (521) T protein:vir:10 454 VALTPLRGSDPKNFQP---VMGFKTRY-GIGINP--FAESAAQAPASRIQSG 499 (521) T ss_pred cccccccccCCccccc---eeeeeeee-ceeecC--cccccCCccceeeccc Confidence 0111112234444443 34445565 445566 3332 12222 No 179 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=84.08 E-value=0.063 Score=27.16 Aligned_cols=179 Identities=16% Similarity=0.187 Sum_probs=82.0 Q ss_pred EEEeehhhHHHH-----hcCHHHHHHHHHHHHHHHHHHHhhcceee----ccCCCcceeeeecccccccccccccccccc Q lcl|Aclame:pro 160 LTAFVVIPKDAL-----KFGPKWLKQFITEQLKEAIAVALELAIVK----GNGLLQPVGLLKDLSQPTVDQSTGRDITTY 230 (377) Q Consensus 160 ~~~~~~iS~ell-----~ds~~~~~~~l~~~la~~~a~~~~~a~l~----G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~ 230 (377) +-.+ -+|.-++ -++..|+.+...++++++++...|+.++. +.....|..- ...+.......+. 
T Consensus 1 iD~l-L~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~--~~~g~~~~~~a~~----- 72 (221) T protein:vir:17 1 MDDL-LVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTG--QDGGFSVNIGAGN----- 72 (221) T ss_pred CCcc-hhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc--cccCcceeccccc----- Confidence 1111 2333333 24678899999999999999999998753 2211112100 0000000000000 Q ss_pred chhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccc----cccc---CCCCcccc-- Q lcl|Aclame:pro 231 KTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK----FTSR---NQFGEYVT-- 301 (377) Q Consensus 231 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~----~~~~---~~~G~~~~-- 301 (377) .-++..+++.+..+...+ +. +.....+ .+++++|..|+.++.. +... +.+|.... T Consensus 73 -----------t~~~~~l~dai~~a~~~L--de--kdVP~~g-R~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~ 136 (221) T protein:vir:17 73 -----------TNNAQAIVDGFFEAAAVL--DE--RSAPMDG-RVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGK 136 (221) T ss_pred -----------cCCHHHHHHHHHHHHHHH--hh--cCCCCCC-CEEEeCcHHHHHHHHhcCcceeeeecccccccccccc Confidence 012222333333322222 11 2222334 4566789888877632 1111 12222111 Q ss_pred --ccCCCceEEecCCCCc--ceEEEEecccEEEEecceeeEEeechhhhhcCcEEEEE-EEEEcCEEecccceEEEEeec Q lcl|Aclame:pro 302 --VLPHGITILESLAVET--GKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLT-KNYFYGKAKDNHTAALLTLAG 376 (377) Q Consensus 302 --~l~~~~~v~~s~~~~~--~~ii~gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~-~~r~dg~~~~~~af~~l~~~a 376 (377) ....|++|+.|+++|. ++-+..+.+.+. 
.....+ ..||+ +.-.=|.+.+++|...+++=+ T Consensus 137 ~i~~v~G~~V~~SnnlP~~~gt~~~~~ag~~~--------~~~~~~-------~~yr~~fs~~~glv~~~~Avgtvkl~~ 201 (221) T protein:vir:17 137 GLYVNAGIRIYKSNVLASLYGTNLVTDPGDAT--------TSGENN-------GSYRPAITDRAGLVFHKEAADTVEVLL 201 (221) T ss_pred eeeeecCcEEEEeccCCcccccccccCCcccc--------cccccc-------ccccccccceEEEEEcchheeeeeeec Confidence 1124889999999995 221111111110 000000 01111 111228899999999999887 Q ss_pred C Q lcl|Aclame:pro 377 G 377 (377) Q Consensus 377 ~ 377 (377) - T Consensus 202 ~ 202 (221) T protein:vir:17 202 P 202 (221) T ss_pred C Confidence 7 No 180 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=82.53 E-value=0.076 Score=26.72 Aligned_cols=295 Identities=13% Similarity=0.040 Sum_probs=131.2 Q ss_pred ccHHHHHHHHHHHhc-----c-----CCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEEcCCccee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN-----V-----GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS--LRLKALTAETSGTAV 134 (377) Q Consensus 67 lt~~e~~~~~~~~~~-----~-----~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~--~~~~~p~~~~~~~a~ 134 (377) |+.+-|..|+..... + ...+.-+.|.+.+...+.+.+++.|-+++.++++++. +........++..++ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 666666666654331 1 1222347898999999999999999999999998874 222233222222111 Q ss_pred eecccccccccccccceeEeecceeEEEeehhhHHHHhc--CHHH-HHHHHHHHHHHHHHHHhhcceeeccCC----Ccc Q lcl|Aclame:pro 135 WGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKW-LKQFITEQLKEAIAVALELAIVKGNGL----LQP 207 (377) Q Consensus 135 w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~-~~~~l~~~la~~~a~~~~~a~l~G~G~----~~P 207 (377) -....+...+... 
-+.-.+..++.--=..|+.+.|+. ..+| |..-+++.+.++++.-+=.--+||+-. ..| T Consensus 81 r~~t~~~~~~~~~--~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nP 158 (343) T protein:vir:98 81 AHDRRTPIQQRWT--RQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDP 158 (343) T ss_pred ccccCCCcccccc--CCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCc Confidence 1111111111110 111123333333345677777753 1356 888999999998887666666777632 234 Q ss_pred e------eeeecccccccccc-ccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEecc Q lcl|Aclame:pro 208 V------GLLKDLSQPTVDQS-TGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNP 280 (377) Q Consensus 208 ~------Gil~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~ 280 (377) . |+|......+.... +..... ......+ ...+ +.++..+..+. ..............+.+|-. T Consensus 159 llqDVN~GWLQ~~Re~ap~rVm~~~~~~---~~~~~~G--~ggd----y~NLDalV~D~-~~~I~~~~~~d~dLVvivG~ 228 (343) T protein:vir:98 159 NLADVNKGWIQFVRENKATQILTQGATS---GEIRLFG--EGAD----YVNLDELAYDL-KQGLDARHRDAGDLVFLVGA 228 (343) T ss_pred chhhcchHHHHHHHhcchhhhhccceec---cceeEec--CCCC----cccHHHHHHHH-HhcCchHHhcCCCEEEEEch Confidence 3 44322111111000 000000 0000000 0111 11122222221 11122222233455555554 Q ss_pred chhhhhcccccccCCCCc-c--------c--cccCCCceEEecCCCCcceEEEEec---ccEEE--Eecceee-EE-eec Q lcl|Aclame:pro 281 EDRWTLEAKFTSRNQFGE-Y--------V--TVLPHGITILESLAVETGKAIAFVA---NRYDA--FMATAST-IE-EYD 342 (377) Q Consensus 281 ~~~~~~~~~~~~~~~~G~-~--------~--~~l~~~~~v~~s~~~~~~~ii~gd~---s~y~~--~~~~~~~-i~-~~~ 342 (377) .-..+-...+ .+..+. + . +-..=|+|.+.-+++|++.+++=-| |=|+- ..|.-+. .. +.. 
T Consensus 229 dLla~~~~~l--~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 306 (343) T protein:vir:98 229 DLVAKEASLV--YKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKA 306 (343) T ss_pred hhhhhhhhhh--hhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 2211111111 111121 0 0 0011267888889999999876544 44442 1111121 11 111 Q ss_pred -hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 343 -QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 343 -~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+... ..+|..-.+--+.++..-.+++.. -+| T Consensus 307 ie~y~s~-Ne~YvVEd~~~~a~iE~i~v~~~~-~~g 340 (343) T protein:vir:98 307 VRDSYYR-NEAYAVEDCGKFMAVDFTKVKLSS-GKG 340 (343) T ss_pred ccchhhh-cceeeeeccccEEEeeeeeeeecC-CCC Confidence 112222 336655544445566655554443 223 No 181 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=77.12 E-value=0.13 Score=25.48 Aligned_cols=251 Identities=10% Similarity=-0.014 Sum_probs=106.7 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe-----c--CC-ceEEEEEcCCcce---eeeccccccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN-----T--SL-RLKALTAETSGTA---VWGDIFGEIKGQLK 147 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~-----~--~~-~~~~p~~~~~~~a---~w~~e~~~~~~~~~ 147 (377) +. ..-..++|+-+++++++.+++..++.+++..-. . .| .++||+....... .+.. .+...++.. 
T Consensus 1 MA----Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~-t~~~~~~l~ 75 (423) T protein:vir:10 1 MA----NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDI-TGKSKNSLI 75 (423) T ss_pred Cc----cccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCccc-Ccccccccc Confidence 11 111228999999999999999999888876421 1 13 3567664422111 1110 111111111 Q ss_pred ccceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccc Q lcl|Aclame:pro 148 QAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDI 227 (377) Q Consensus 148 ~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~ 227 (377) -.--.+.|+.+|..++-.=+.|+. .+..+++++++.. .++++..+|..+.......-+..+ ...+... T Consensus 76 e~~v~l~id~~k~~a~~v~d~E~~-l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~v----------gt~~t~~ 143 (423) T protein:vir:10 76 SAKATGEVGNYITVAVEYRQIEEA-LKLNQLDQILVPI-NERMVTDLETELALFMMKHGALSL----------GSPNTPI 143 (423) T ss_pred cceEEEEecceeeeeeeeChHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccccc----------ccccccc Confidence 112356677777666554455554 5677888877555 678999999977532211111100 0000000 Q ss_pred cccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc---ccCCCC--cccc- Q lcl|Aclame:pro 228 TTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT---SRNQFG--EYVT- 301 (377) Q Consensus 228 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~---~~~~~G--~~~~- 301 (377) .. ++.+..+.+.+ +....|. .+ ..++++|..+..++.... .....+ .|.. 
T Consensus 144 ~a-------------------~~~~a~a~~~L--~~~~vP~--~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~ 199 (423) T protein:vir:10 144 KK-------------------WSDVAQTASFL--KDLGINS--GE-NYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENA 199 (423) T ss_pred cc-------------------HHHHHHHHHHH--hhccCCc--CC-CEEEeCHHHHHHHhhhhhhhccccccchHHHHhc Confidence 00 11111111111 1111222 23 456889987776643211 111111 1111 Q ss_pred ---ccCCCceEEecCCCCcceEEEEecc------cEEEEecc--------ee---eEEeechhhhhc-CcEEEEEEEEEc Q lcl|Aclame:pro 302 ---VLPHGITILESLAVETGKAIAFVAN------RYDAFMAT--------AS---TIEEYDQTFAME-DLQLYLTKNYFY 360 (377) Q Consensus 302 ---~l~~~~~v~~s~~~~~~~ii~gd~s------~y~~~~~~--------~~---~i~~~~~~~f~~-~~~~~~~~~r~d 360 (377) ...+|..++.|+++|..+ -|+.. -....... +. ....+. .++.+ +. .+...+ T Consensus 200 ~i~G~~~GFdi~~Sn~vp~~T--~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~-~g~l~~GD----~~t~aG 272 (423) T protein:vir:10 200 QISGNFGGIRALMSNGLASRT--QGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASK-KGFLKVGD----QLQFDD 272 (423) T ss_pred ccceeecceEEEEecCCcccc--cccccceeeeeeeeEEEecccccccccccceeecccee-ceeEEecc----eEeecc Confidence 122577888898887421 11111 11110000 00 000110 01111 11 122222 Q ss_pred CEEecc-cceEEEEeecC Q lcl|Aclame:pro 361 GKAKDN-HTAALLTLAGG 377 (377) Q Consensus 361 g~~~~~-~af~~l~~~a~ 377 (377) ...+|+ ...++.+-+.| T Consensus 273 v~~v~~~tk~~l~~~~~~ 290 (423) T protein:vir:10 273 THWLNQQSKQTLYNGASA 290 (423) T ss_pred eeeecccccceeecccCC Confidence 233333 22333344445 No 182 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=71.76 E-value=0.19 Score=24.52 Aligned_cols=349 Identities=10% Similarity=0.004 Sum_probs=122.8 Q ss_pred ccHHHHHHHHHHHHHHHHH--HHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------ccccccHHHH Q lcl|Aclame:pro 3 INLKELPKYREAVAELSAK--ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRD--------KNRELTAEEI 72 (377) Q Consensus 3 
~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~lt~~e~ 72 (377) |..|+|.++|.-+.+-... +.+.-.+..-.+.+|.....+.++. -..++....... ....|.+.+. T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~ 76 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNE----GGVYTDQVVVNSMVDVKGRIEEARLAEANI 76 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhc----ccccchhhhhhhhhccccchhhcccccccc Confidence 7778888888755442220 1110000111122222222221110 011111111100 1111222111 Q ss_pred HHHHHH----HhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------EEEEEcCC--------- Q lcl|Aclame:pro 73 KFFNDI----DKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------KALTAETS--------- 130 (377) Q Consensus 73 ~~~~~~----~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~~p~~~~~--------- 130 (377) ..-+.. ...++++ +.. +.+...++..+|. .-+-.+++-|.||++.. +--+.... T Consensus 77 ~~~~g~~~~~ia~s~~s-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf 152 (534) T protein:vir:10 77 GGDHGYDATKIASGETS-GSI---TNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAF 152 (534) T ss_pred ccccccccccccccccc-ccc---ccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccc Confidence 100000 0001111 110 1122233333331 12234456666665321 11110000 Q ss_pred -----cceeeecc------------------------------------------------------------------- Q lcl|Aclame:pro 131 -----GTAVWGDI------------------------------------------------------------------- 138 (377) Q Consensus 131 -----~~a~w~~e------------------------------------------------------------------- 138 (377) +++.|.+. 
T Consensus 153 ~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~ 232 (534) T protein:vir:10 153 HPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANG 232 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccc Confidence 00001000 Q ss_pred ------------cccc----cccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 139 ------------FGEI----KGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIA 191 (377) Q Consensus 139 ------------~~~~----~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a 191 (377) .++. ...++..|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|. T Consensus 233 ~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEIm 312 (534) T protein:vir:10 233 YAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIM 312 (534) T ss_pred cceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHH Confidence 0000 001123466666666666542 45888888873 46789999999999999 Q ss_pred HHhhcceeeccCCCcceeeeeccccccccccc--cccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 192 VALELAIVKGNGLLQPVGLLKDLSQPTVDQST--GRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) Q Consensus 192 ~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (377) .-+|+.||.= |.+..........+ ....+.++.... ......++....+..++-...+..+..... 
T Consensus 313 lEINReii~~--------l~~~a~~~k~~~~~~~~~~~G~~d~~~~----~~~~~~~~~~e~~~~L~~~i~~~an~i~~~ 380 (534) T protein:vir:10 313 HEINREMVLW--------INATAKVGKTGWTNMHGGKAGVFDFQDT----KDIRGARWAGESYKALVVQIDKEANEIARQ 380 (534) T ss_pred HHhhHHHHHH--------Hhhhhheeecccccccccccceeeeecc----ccccchhHHHHHHHHHHHHHHHHHHHHHHh Confidence 9999988751 11111111111000 000010000000 000112223333333333333333322222 Q ss_pred cc---CceEEEeccchhhhhccccccc------------C---CCCccccccCCCceEEecCCCCcceEEEEeccc---- Q lcl|Aclame:pro 270 IA---GQVKLLLNPEDRWTLEAKFTSR------------N---QFGEYVTVLPHGITILESLAVETGKAIAFVANR---- 327 (377) Q Consensus 270 ~~---~~~~~~~n~~~~~~~~~~~~~~------------~---~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~---- 327 (377) +. ++ .+++.|.-..-|. ...+. + ..+.+...|.-+++|+.+++.+.+-+++|.-.. T Consensus 381 T~rg~~n-~~v~S~~Va~~L~-~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 458 (534) T protein:vir:10 381 TGRGQGN-FIICSRNVAAALG-HTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMD 458 (534) T ss_pred hcccccc-EEEEchhHHHHHh-hccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccc Confidence 22 33 3445554322221 11100 1 111233444457788888888876666663211 Q ss_pred ---EEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEeccc---------------------------ceEEEEeecC Q lcl|Aclame:pro 328 ---YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNH---------------------------TAALLTLAGG 377 (377) Q Consensus 328 ---y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~---------------------------af~~l~~~a~ 377 (377) |+.==.....+...|...|.. .+-.+.|+. 
-.++|= =|+.|.++.= T Consensus 459 ~glfyaPYv~l~~~~~~dp~sfqP---~~g~~tRY~-l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 459 AGLYYCPYVALTPLRGTDPKNFQP---VLGFKTRYG-VKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cceeeccccccccccccCCccccc---eeeeeeeec-eeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 110000111112223333333 233334442 233331 1222222222 No 183 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=71.10 E-value=0.2 Score=24.41 Aligned_cols=259 Identities=10% Similarity=-0.027 Sum_probs=111.5 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-------CC-ceEEEEEcCCcceeeeccc--ccccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT-------SL-RLKALTAETSGTAVWGDIF--GEIKGQLKQ 148 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~-------~~-~~~~p~~~~~~~a~w~~e~--~~~~~~~~~ 148 (377) +. ..--..+|+.++.++++.+++..++.++++.-.. .| .++||+.........-... +...++..- T Consensus 1 Ma----N~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MP----NNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cc----cchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 11 1111247999999999999999888887754221 12 4667754322222221111 111111111 Q ss_pred cceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccccccc Q lcl|Aclame:pro 149 AFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDIT 228 (377) Q Consensus 149 ~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~ 228 (377) .--.+.++.+|..++-.=..|+. ....++++++... .++++..+|..++.- ..+. ......+..... 
T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~-~~~~----------a~~~~gt~~t~~ 143 (423) T protein:vir:17 77 GKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHF-MMNN----------GALSLGSPNTPI 143 (423) T ss_pred ceeEEEeeceeeeeeeecHHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHH-Hhhc----------cccccccCCccc Confidence 22357777777777665556655 4466788877655 688999999876631 1110 000000000000 Q ss_pred ccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc--ccCCCC---cccc-- Q lcl|Aclame:pro 229 TYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT--SRNQFG---EYVT-- 301 (377) Q Consensus 229 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~--~~~~~G---~~~~-- 301 (377) . .++.+..+.+.+. ....| ..+ ..++++|..+..++.... .....+ .|.. T Consensus 144 ~------------------a~~~i~~a~~~Ld--~~~vP--~~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~ 200 (423) T protein:vir:17 144 T------------------KWSDVAQTASFLK--DLGVN--EGE-NYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQ 200 (423) T ss_pred c------------------cHHHHHHHHHHHH--hccCC--cCC-CEEEeChHHHHHHhccccceecccccchHHHhhcc Confidence 0 0111222222111 11122 233 456789987766643211 111111 1221 Q ss_pred --ccCCCceEEecCCCCcceEE-EEe--------c-ccEEEEec----ceeeEEeechhhhh--cCcEEEEEE---EEEc Q lcl|Aclame:pro 302 --VLPHGITILESLAVETGKAI-AFV--------A-NRYDAFMA----TASTIEEYDQTFAM--EDLQLYLTK---NYFY 360 (377) Q Consensus 302 --~l~~~~~v~~s~~~~~~~ii-~gd--------~-s~y~~~~~----~~~~i~~~~~~~f~--~~~~~~~~~---~r~d 360 (377) ...+|..++.|+++|..+.. ++- . ........ .++.........+. .|.+.|-+. .+.. 
T Consensus 201 i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:17 201 IPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQT 280 (423) T ss_pred ceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccc Confidence 12257789999999854311 110 0 00000000 01111111111111 344433332 2222 Q ss_pred CEEe------cccceEEEE----eecC Q lcl|Aclame:pro 361 GKAK------DNHTAALLT----LAGG 377 (377) Q Consensus 361 g~~~------~~~af~~l~----~~a~ 377 (377) +.++ ..+-|++.. .++| T Consensus 281 k~v~~~~~t~~~~~~~v~~~~~~~a~~ 307 (423) T protein:vir:17 281 KQALYNGATPISFTATVTADANSDSSG 307 (423) T ss_pred cccccccccccceEEEEEecccccccC Confidence 3222 334555542 1223 No 184 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=70.34 E-value=0.21 Score=24.30 Aligned_cols=295 Identities=12% Similarity=0.065 Sum_probs=134.7 Q ss_pred ccHHHHHHHHHHHh------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeeecc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDK------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) Q Consensus 67 lt~~e~~~~~~~~~------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e 138 (377) |..+-|..|+.... 
...+....+.|.+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-..- T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 66666666655432 2233445678889999999999999999999999998862 345655555554443221 Q ss_pred --cccccccccc-cceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCC-------Cc Q lcl|Aclame:pro 139 --FGEIKGQLKQ-AFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGL-------LQ 206 (377) Q Consensus 139 --~~~~~~~~~~-~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-------~~ 206 (377) .++..+ .++ ..+.-.+..++.---..|+.+.|+. ...+|..-+++.+.++++.-+-.--+||+-. .. T Consensus 81 ~~~~~R~~-~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~n 159 (338) T protein:vir:11 81 TGDGVRKP-RDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAAN 159 (338) T ss_pred CCCCcccc-ccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhC Confidence 112221 222 4455555566555566788888862 3457999999999999888777666777641 12 Q ss_pred ce------eeeecccccccccc-ccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEec Q lcl|Aclame:pro 207 PV------GLLKDLSQPTVDQS-TGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLN 279 (377) Q Consensus 207 P~------Gil~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n 279 (377) |. |+|......+.... +....... ..++.-...+. .++..+..+..-.-...........+.+|. 
T Consensus 160 PllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~----i~i~~g~~gdy----~nLDalV~d~~~~lI~~~~~~d~dLVvivG 231 (338) T protein:vir:11 160 PLLQDVNIGWFQQYRNNAPARVLKEGKTTGK----VVVGNGADADY----KNLDALVFDVVSSLIDPWHRRDPGLVVILG 231 (338) T ss_pred cCccccchhHHHHHHhhhhhhhhhcccccce----eeecCCCCCcc----ccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 32 44322111110000 00000000 00000000111 111111111110011111222345566666 Q ss_pred cchhhhhcccccccCCC------Cccc--cccCCCceEEecCCCCcceEEEEeccc---EEEE--eccee-eEE-eec-h Q lcl|Aclame:pro 280 PEDRWTLEAKFTSRNQF------GEYV--TVLPHGITILESLAVETGKAIAFVANR---YDAF--MATAS-TIE-EYD-Q 343 (377) Q Consensus 280 ~~~~~~~~~~~~~~~~~------G~~~--~~l~~~~~v~~s~~~~~~~ii~gd~s~---y~~~--~~~~~-~i~-~~~-~ 343 (377) ..-...-...+...... ++-+ +-..=|+|.+.-+++|++.+++=-|+. |+-. .|.-+ +.. +.. | T Consensus 232 ~dLladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 311 (338) T protein:vir:11 232 RELVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIE 311 (338) T ss_pred hhhhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccccc Confidence 43211111111111000 1111 111127788889999999987654444 4321 11111 111 011 1 Q ss_pred hhhhcCcEEEEEEEEEcCEEecccceEEEE Q lcl|Aclame:pro 344 TFAMEDLQLYLTKNYFYGKAKDNHTAALLT 373 (377) Q Consensus 344 ~~f~~~~~~~~~~~r~dg~~~~~~af~~l~ 373 (377) .+...+ .+|..-.+--+.+++. +.+.. 
T Consensus 312 ~y~s~N-e~YvVEd~~~~a~ien--i~~~~ 338 (338) T protein:vir:11 312 NYESSN-DAYVVEDYGLGCLVEN--IEVAE 338 (338) T ss_pred chhhhc-cceeeeccccEEEeec--ceecC Confidence 112222 2443322222233321 12222 No 185 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=68.05 E-value=0.24 Score=23.95 Aligned_cols=347 Identities=10% Similarity=-0.022 Sum_probs=143.9 Q ss_pred CC------ccHHHHHHHHHHHHHHHH---------------HHHh-ccC-HHHHHHHHHHHHHHHH-----------HHH Q lcl|Aclame:pro 1 MA------INLKELPKYREAVAELSA---------------KISA-GAT-PEEQEKLFEAAFTTMG-----------DEI 46 (377) Q Consensus 1 m~------~~~~~l~~~~~~~~~~~~---------------~~~~-~~~-~~~~~~~~~~~~~~~~-----------~~~ 46 (377) .+ ++.+-+.+.++.+..+.+ .+.+ +.+ ++.+...++.+...-. ... T Consensus 222 AP~~De~airAq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~ 301 (652) T protein:vir:79 222 APVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGN 301 (652) T ss_pred CCcCchhHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEeecc Confidence 11 111111111111111111 0101 111 1112222222210000 000 Q ss_pred HHHHHHHHHHHHHhcccc------c---cccH------------------HHHHHHHHHHhccCCCCCceeccHHHHHHH Q lcl|Aclame:pro 47 LAKNEEEMERMFDLRDKN------R---ELTA------------------EEIKFFNDIDKNVGGKDKFKLLPEETMVQV 99 (377) Q Consensus 47 ~~~~~~~~~~~~~~~~~~------~---~lt~------------------~e~~~~~~~~~~~~~s~gg~lvP~~~~~~I 99 (377) ....+...++....+.+. + .++- .......... 
.+++++.+.++-...-..+ T Consensus 302 g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~-~hsTsDFp~IL~~~~nk~l 380 (652) T protein:vir:79 302 GNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAF-THSTSDFGNILLDVANKAI 380 (652) T ss_pred chhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHHHHh-hcCcchHHHHHHHHHHHHH Confidence 011112222222211110 0 0010 0001111111 1456666655544444444 Q ss_pred HHHHHhhh-hhhhhceeEecCC--ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHH Q lcl|Aclame:pro 100 FDDLVAEH-PLLKVINFKNTSL--RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPK 176 (377) Q Consensus 100 i~~~~~~s-~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~ 176 (377) .+..+... ..+..|+..+++- ..+.....+-+...-+.|.|+.+-.+ ..=..-++...+++..+.||++.+-.-+. T Consensus 381 ~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t-~~e~~e~~~l~tyG~~~~iTRqaiINDDL 459 (652) T protein:vir:79 381 LQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVT-TGDKQATIALATYGELFSITRQAIINDDL 459 (652) T ss_pred HHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceee-ecCccceeeeecccCeeeeehheeeccch Confidence 44444433 3556666555431 12333334556666777888876432 23355678889999999999998865578 Q ss_pred HHHHHHHHHHHHHHHHHhhcc---eeeccCC-C-cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHH Q lcl|Aclame:pro 177 WLKQFITEQLKEAIAVALELA---IVKGNGL-L-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVEL 251 (377) Q Consensus 177 ~~~~~l~~~la~~~a~~~~~a---~l~G~G~-~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 251 (377) ++..-|-..++++-++.+++. +|.++.+ . --+.++.....++.....+.. ...+ .. 
T Consensus 460 ~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~------------------~~~l-~~ 520 (652) T protein:vir:79 460 NMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMD------------------VASL-DK 520 (652) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccccCC------------------HHHH-HH Confidence 888889999999988888873 3444432 1 223344112222211110000 0111 11 Q ss_pred HHHHHHhhhhhhhhhhhcccCceEEEeccchhhh---hcccccccCCC---CccccccCCCceEEecCCCCcce---EEE Q lcl|Aclame:pro 252 LVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWT---LEAKFTSRNQF---GEYVTVLPHGITILESLAVETGK---AIA 322 (377) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~---~~~~~~~~~~~---G~~~~~l~~~~~v~~s~~~~~~~---ii~ 322 (377) ....|.. -+.+ ..++..... +|++.|.-... +.......+.+ |..--+.++ ..+|.++.+.++. -++ T Consensus 521 ar~aM~~-Qk~g-~~~l~i~P~-~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~-~~~i~eprL~~~s~~~wyl 596 (652) T protein:vir:79 521 ARQLMRV-QKEG-ERHLNIRPA-FVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDF-ATVIAEPRLDDNSQTTFYL 596 (652) T ss_pred HHHHHHH-hccC-Ccccccccc-EEEecchhHHHHHHHhccCCCcccccccccccccccc-cccccccccCCCCcccEEE Confidence 1111211 1122 223333333 34444432221 11111111111 111000011 1445555443221 222 Q ss_pred E-eccc--E---EEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEEe Q lcl|Aclame:pro 323 F-VANR--Y---DAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTL 374 (377) Q Consensus 323 g-d~s~--y---~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~ 374 (377) + +-.. + ++.-..+..|+. 
+..|..|-+-||+..-++.+++|-.+++..+- T Consensus 597 aa~~~~dtiev~yL~G~~~P~ie~--~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 597 AASKGSDTIEVAYLNGVDTPYIDQ--MEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred ecCCCCCeEEEEEecCCCCCeeee--cCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 2 2111 1 222234455544 33699998889999999999999999887776 No 186 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=67.45 E-value=0.25 Score=23.87 Aligned_cols=301 Identities=15% Similarity=0.065 Sum_probs=126.9 Q ss_pred HHHHHHHHHHHHHHHHHhccccccc-------cHHHHHHHHHHHhc----cCCCCCceeccHHHHH----HHHHHHHhhh Q lcl|Aclame:pro 43 GDEILAKNEEEMERMFDLRDKNREL-------TAEEIKFFNDIDKN----VGGKDKFKLLPEETMV----QVFDDLVAEH 107 (377) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~l-------t~~e~~~~~~~~~~----~~~s~gg~lvP~~~~~----~Ii~~~~~~s 107 (377) .++. .......+.+-.| +.+-+..-...... .+++++| ||..+.+ ++++.+...- T Consensus 1 ~~~~--------~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~ 70 (336) T protein:vir:78 1 MRDA--------QRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPM 70 (336) T ss_pred CchH--------HHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcc--hHHHHHHhcccceeeehhhhh Confidence 0000 0000111111112 22211111111111 1222222 5554332 3333333332 Q ss_pred hhhhhceeEecC----CceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHh---cCHHHHHH Q lcl|Aclame:pro 108 PLLKVINFKNTS----LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALK---FGPKWLKQ 180 (377) Q Consensus 108 ~l~~~~~v~~~~----~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~---ds~~~~~~ 180 (377) ....+..+...+ ..+.+++....+.+.+.+..... +..+..-+..+-..+.+..-+.++.+=+. 
....++.+ T Consensus 71 ~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~ 149 (336) T protein:vir:78 71 KAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSD-GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) T ss_pred hhhhhcccccCCCccccEEEEeeeecceeeEEeecccCC-CeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHH Confidence 333333333322 23467777777777777533333 45566666666677777777777744332 23567888 Q ss_pred HHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhh Q lcl|Aclame:pro 181 FITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLS 260 (377) Q Consensus 181 ~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 260 (377) --....++++.+.+|+-.++|++..+-.|++|.+........++.... ...++..++.+..+...+. T Consensus 150 ~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~-------------~~T~~~I~~Di~~~~~~l~ 216 (336) T protein:vir:78 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG-------------SPAVEAVVNEVVTLFQVLQ 216 (336) T ss_pred HHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCccc-------------ccCHHHHHHHHHHHHHHHH Confidence 888899999999999999999988889999998765433222111100 1122222333333333222 Q ss_pred hhhhhhhhcccCceEEEeccchhhhhcccccccCCCCc-cccccC--CC-ceEEecCCCCcceEEEEecccEEEEecc-- Q lcl|Aclame:pro 261 VNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGE-YVTVLP--HG-ITILESLAVETGKAIAFVANRYDAFMAT-- 334 (377) Q Consensus 261 ~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~-~~~~l~--~~-~~v~~s~~~~~~~ii~gd~s~y~~~~~~-- 334 (377) ....+. ....-...+++-|..+ ..+.. .+..|. ....|- || +.++..+...+ .-|+..+.+..... 
T Consensus 217 ~qt~g~-~~~~~~~tL~Lp~~~~-~~L~~---~n~~g~tv~~~lk~n~Pnl~i~t~pel~~---Agg~~~~~~~~~~~~~ 288 (336) T protein:vir:78 217 TQSQGI-ITQEAVLHMGLPPTAM-SDLSK---TNQYGLSAAAKLKEIFPKLEFVTIPEYDT---ASGRLVQLWAPRVEGK 288 (336) T ss_pred HhcCCe-eeeccceEEEechHHH-HhccC---CCccCccHHHHHHHhcCccEEEEcccccc---cCcceEEEEEeeccCC Confidence 221111 1111123445544432 22221 233332 111111 22 44443322211 01222222222211 Q ss_pred -eeeEEeechh---hhhc--CcEEEEEEEEEcCEE-ecccceEEEEeecC Q lcl|Aclame:pro 335 -ASTIEEYDQT---FAME--DLQLYLTKNYFYGKA-KDNHTAALLTLAGG 377 (377) Q Consensus 335 -~~~i~~~~~~---~f~~--~~~~~~~~~r~dg~~-~~~~af~~l~~~a~ 377 (377) -.++.....- .... -.....+..|..|-+ ..|-||+.++ += T Consensus 289 ~t~~~~~p~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~--GI 336 (336) T protein:vir:78 289 DTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) T ss_pred cceeeecchhhhccceeecCceeEeccccceeeeeeeccchheeec--cC Confidence 1222111110 0111 122233567776644 3456666543 11 No 187 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=67.30 E-value=0.25 Score=23.84 Aligned_cols=302 Identities=9% Similarity=-0.027 Sum_probs=139.3 Q ss_pred ccccccHHHHHHHHHHHhcc--------CCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcc Q lcl|Aclame:pro 63 KNRELTAEEIKFFNDIDKNV--------GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGT 132 (377) Q Consensus 63 ~~~~lt~~e~~~~~~~~~~~--------~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~ 132 (377) ..+.|+.+-|..|+...... 
......+.|.+.+...+.+.+++.|-+++.++++++.- +-++....+++- T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~i 80 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLY 80 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCccc Confidence 34456667777666543211 12245678999999999999999999999999998862 335555555554 Q ss_pred eeeecccccccccccccceeEeecceeEEEeehhhHHHHhc-C----HHHHHHHHHHHHHHHHHHHhhcceeeccCCC-- Q lcl|Aclame:pro 133 AVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF-G----PKWLKQFITEQLKEAIAVALELAIVKGNGLL-- 205 (377) Q Consensus 133 a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d-s----~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~-- 205 (377) ++-.. ... +......+.-.+..++.--=..|+.+.|+. + ..+|..-+++.+.++++.-.=.--+||+-.. T Consensus 81 agrt~-tr~--~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 157 (358) T protein:vir:78 81 TGRKK-GGR--FKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADD 157 (358) T ss_pred ceecC-CCc--cccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccC Confidence 44332 222 122234455555555555556788888863 2 2369999999999998877666667776321 Q ss_pred -----cce------eeeeccccccccccc-cccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCc Q lcl|Aclame:pro 206 -----QPV------GLLKDLSQPTVDQST-GRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQ 273 (377) Q Consensus 206 -----~P~------Gil~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (377) .|. |+|...........- ....+. 
...++.-...+ +.++..+..+..............+ T Consensus 158 Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~----~i~ig~g~~Gd----y~NLDalV~D~~~~lI~~~~~~d~d 229 (358) T protein:vir:78 158 TDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGE----KIYFDPDGKGE----YKTLDEMASDLINTTIDPLFQQDPR 229 (358) T ss_pred CChhhCcCccccchHHHHHHHhhchhhhhccccccC----ceeecCCCCCc----cccHHHHHHHHHhccCChHHhcCCC Confidence 232 443221111110000 000000 00000000011 1112222222111111112223344 Q ss_pred eEEEeccchhhhhcccccccCC------CCccc-cccCCCceEEecCCCCcceEEEEec---ccEEE--Eecceee-EE- Q lcl|Aclame:pro 274 VKLLLNPEDRWTLEAKFTSRNQ------FGEYV-TVLPHGITILESLAVETGKAIAFVA---NRYDA--FMATAST-IE- 339 (377) Q Consensus 274 ~~~~~n~~~~~~~~~~~~~~~~------~G~~~-~~l~~~~~v~~s~~~~~~~ii~gd~---s~y~~--~~~~~~~-i~- 339 (377) .+.+|-..-...-...+..... .++-+ ..+ =|+|.+.-+++|++.+++=-| |=|+- ..|.-+. .. T Consensus 230 LVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~k~i-GGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~ 308 (358) T protein:vir:78 230 LVVLVGTDLVAAAQAKLYSEATKPSEQIAAQQLAKSI-AGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQD 308 (358) T ss_pred EEEEEchhhhhHHhhhHhhcCCCcHHHHHHHHHHHHh-CCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc Confidence 5655554321111111111110 01111 112 267888889999999876544 44442 1111111 11 Q ss_pred eec-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 340 EYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 340 ~~~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +.. 
|.+...+ .+|..-.+--+.+++.-.+.+..-.|- T Consensus 309 r~riE~y~s~N-e~YvVEd~~~~a~iE~i~v~~~~~pa~ 346 (358) T protein:vir:78 309 SKSFDNQYWRM-EGYALGEHKAYGGFEEADIEIGADPAV 346 (358) T ss_pred cccccchhhhc-ceeeeeccccEEEEeeeeeeeCCCCCc Confidence 111 1122222 255443333344444333333221111 No 188 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=66.27 E-value=0.27 Score=23.70 Aligned_cols=257 Identities=12% Similarity=0.027 Sum_probs=110.5 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----c-C--C-ceEEEEEcCCcceeeecccccccccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN----T-S--L-RLKALTAETSGTAVWGDIFGEIKGQLKQAF 150 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~----~-~--~-~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f 150 (377) +. ..--..||+-++.++++.++...++-++++.-. . . | .++||+.............+......+..= T Consensus 1 MA----N~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MA----NNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cc----cchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 11 111124799999999999999999888876421 1 1 2 356776543222222111111111111221 Q ss_pred --eeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccccccc Q lcl|Aclame:pro 151 --KEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDIT 228 (377) Q Consensus 151 --~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~ 228 (377) -.+.|+.+|..++-.=..|... +..++++++...+ ++++..+|..++.--=.+-|. .+ ...+.... 
T Consensus 77 ~~v~l~id~~k~~a~~v~d~e~~l-~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~----~v------gt~~t~~~ 144 (423) T protein:vir:35 77 AKATGKVGKYITVAVEWTQIEEAL-KLNQLDQILSPIH-ERMVTDLETELAHFMMNNGAL----SL------GSPNTAIK 144 (423) T ss_pred ceeeEEeccceeccceeCHHHHHh-hHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcccc----cc------ccccCCcc Confidence 3466677766665444455444 4667888777664 668888888776310000010 00 00000000 Q ss_pred ccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc--ccCCC-C--cccc-- Q lcl|Aclame:pro 229 TYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT--SRNQF-G--EYVT-- 301 (377) Q Consensus 229 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~--~~~~~-G--~~~~-- 301 (377) .++.+..+...+ +....| . ++..++++|..+..++.... ..... + .|.. T Consensus 145 -------------------~~~~i~~a~~~L--d~~~vP--~-~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~ 200 (423) T protein:vir:35 145 -------------------KWADVAQTASFI--KDIGIK--T-GENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQ 200 (423) T ss_pred -------------------hHHHHHHHHHHH--HHhcCC--c-CCCEEEeCHHHHHHHhccccceeccccchhHHHhhcc Confidence 011222222221 111222 2 23456789987766542211 11111 1 1211 Q ss_pred --ccCCCceEEecCCCCcceE-------EE-----------EecccEEEEecceeeEEeechhhhhcCcEEEEEEEEEc- Q lcl|Aclame:pro 302 --VLPHGITILESLAVETGKA-------IA-----------FVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFY- 360 (377) Q Consensus 302 --~l~~~~~v~~s~~~~~~~i-------i~-----------gd~s~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d- 360 (377) ...+|..++.|+++|..+. .. .+.+.+.+.. .+..+... 
...-..|...|-+..-++ T Consensus 201 i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~-~~~~~~~~-g~l~~GD~~t~aGv~~v~~ 278 (423) T protein:vir:35 201 ISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVAL-TGATPSKT-GFLKAGDQLKFTSTHWLNQ 278 (423) T ss_pred ceeeecceEEEEcCCCccccccccccceeeccccccccccccccccceeee-eeeeeccC-CcEEecceEEeeeeeeccc Confidence 1235778999999985431 10 0111111111 01111111 111123444443432221 Q ss_pred --CE------EecccceEEE----EeecC Q lcl|Aclame:pro 361 --GK------AKDNHTAALL----TLAGG 377 (377) Q Consensus 361 --g~------~~~~~af~~l----~~~a~ 377 (377) +. -.+..-|+++ +.++| T Consensus 279 ~t~~~~~~~~t~~~~~~~V~~~~~~~a~g 307 (423) T protein:vir:35 279 QSKQTLYNGSTAMSFTATVLEETNSTASG 307 (423) T ss_pred cccceeecccCCceeEEEEeccccccccC Confidence 11 1233456665 23344 No 189 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=65.74 E-value=0.28 Score=23.63 Aligned_cols=259 Identities=12% Similarity=-0.020 Sum_probs=112.6 Q ss_pred HhccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe---c----CC-ceEEEEEcCCcceeeeccc--ccccccccc Q lcl|Aclame:pro 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKN---T----SL-RLKALTAETSGTAVWGDIF--GEIKGQLKQ 148 (377) Q Consensus 79 ~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~---~----~~-~~~~p~~~~~~~a~w~~e~--~~~~~~~~~ 148 (377) +. ..--..+|+.++.++++.+++..++.++++.-. . .| .++|++............. 
+...++..- T Consensus 1 Ma----N~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MP----NNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cc----cchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 11 111123799999999999999999888875421 1 13 3567765432222222111 111122222 Q ss_pred cceeEeecceeEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeecccccccccccccccc Q lcl|Aclame:pro 149 AFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDIT 228 (377) Q Consensus 149 ~f~~i~l~~~k~~~~~~iS~ell~ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~ 228 (377) .--.+.++.+|..++-.=+.|+. ...-++++++... .++++..+|..++.- ..+.+. ....+.+.... T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~-~~~~~~---------~~~gt~~t~~~ 144 (423) T protein:vir:10 77 GKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHF-MMNNGA---------LSLGSPNTPIT 144 (423) T ss_pred ceeEEEeeceeeeeeeechHHHh-cChhhHHHHHHHH-HHHHHHHHHHHHHHH-Hhhccc---------cccccCCcccc Confidence 22347777777776655556655 4456788877655 688999999987631 111000 00000000000 Q ss_pred ccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhccccc--ccCCCC---cccc-- Q lcl|Aclame:pro 229 TYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT--SRNQFG---EYVT-- 301 (377) Q Consensus 229 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~--~~~~~G---~~~~-- 301 (377) . ++.+..+.+.+ +....| ..+ ...+++|..+..++.... .....+ .|.. 
T Consensus 145 a-------------------~~~i~~a~~~L--d~~~vP--~~~-R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~ 200 (423) T protein:vir:10 145 K-------------------WSDVAQTASFL--KDLGVN--EGE-NYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQ 200 (423) T ss_pred h-------------------HHHHHHHHHHH--HhccCC--cCC-CEEEeChHHHHHHhccccceecccccchhhhhhcc Confidence 0 11112221111 111122 233 456789987766542211 111111 1221 Q ss_pred --ccCCCceEEecCCCCcceEE-EEe----cccEEE-----EecceeeEEe----echhhhh--cCcEEEEEE---EEEc Q lcl|Aclame:pro 302 --VLPHGITILESLAVETGKAI-AFV----ANRYDA-----FMATASTIEE----YDQTFAM--EDLQLYLTK---NYFY 360 (377) Q Consensus 302 --~l~~~~~v~~s~~~~~~~ii-~gd----~s~y~~-----~~~~~~~i~~----~~~~~f~--~~~~~~~~~---~r~d 360 (377) ...+|..++.|+++|..+.. ++- -..+.+ .......+.. -....+. .|...|-+. .+.. T Consensus 201 i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:10 201 IPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQT 280 (423) T ss_pred ceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccc Confidence 12257789999999864311 110 000111 1111112211 1111222 233333332 2223 Q ss_pred CEEe------cccceEEEEee---c-C Q lcl|Aclame:pro 361 GKAK------DNHTAALLTLA---G-G 377 (377) Q Consensus 361 g~~~------~~~af~~l~~~---a-~ 377 (377) ..++ ..+-|+++.-+ + | T Consensus 281 k~~~~~~~t~~~~~~~v~a~~~~~~~g 307 (423) T protein:vir:10 281 KQALYNGATPISFTATVTADANSDSGG 307 (423) T ss_pred cccccccccCcceEEEEEeeeeeccCC Confidence 3222 33456655322 2 2 No 190 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=64.88 E-value=0.29 Score=23.51 Aligned_cols=347 Identities=9% Similarity=-0.017 Sum_probs=120.1 Q ss_pred CCccHHHHHHHHHHHHHHHHH--HHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHH--- Q lcl|Aclame:pro 1 
MAINLKELPKYREAVAELSAK--ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFF--- 75 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~--- 75 (377) |.|+.|+|.++|.-+.+-... +.+.-.+..-.+.+|.....+.++. .++........++.|++.+..-. T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~e~~~~~l~~~~~~~~~~~ 74 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDP------VYRDDKLIEAFGQSLMEAEVAGDHGY 74 (529) T ss_pred CcccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhcc------ccchhhhhhhhhcccchhhccccccc Confidence 999999999999866443220 1111111111122222211111110 00000000011111211111000 Q ss_pred HH-HHhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------E--EEEEcCC------------- Q lcl|Aclame:pro 76 ND-IDKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------K--ALTAETS------------- 130 (377) Q Consensus 76 ~~-~~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~--~p~~~~~------------- 130 (377) +. ....++.+ +.. +.+...++..+|. .-+-.+++-|.||++.. + ++-.... T Consensus 75 ~~~~i~est~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~P 150 (529) T protein:vir:10 75 DPTNIAAGQSS-GAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAP 150 (529) T ss_pred ccccccccccc-ccc---cccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCcccccccccccccccc Confidence 00 00011111 000 1111122222221 11223344444443210 0 0000000 Q ss_pred -----------------------------------------------------------------------------cce Q lcl|Aclame:pro 131 -----------------------------------------------------------------------------GTA 133 (377) Q Consensus 131 -----------------------------------------------------------------------------~~a 133 (377) +.. 
T Consensus 151 da~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~ 230 (529) T protein:vir:10 151 DAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKL 230 (529) T ss_pred ccccccccccccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccc Confidence 000 Q ss_pred e-e----ecccccc----cccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 134 V-W----GDIFGEI----KGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVA 193 (377) Q Consensus 134 ~-w----~~e~~~~----~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~ 193 (377) . . ....++. ...+...|.+..|...|..+- ...|-||.+|- ..|.|++|.+-|+..|..- T Consensus 231 ~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlE 310 (529) T protein:vir:10 231 AEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLE 310 (529) T ss_pred cccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHH Confidence 0 0 0000000 011223477777777776543 45888888873 4678999999999999999 Q ss_pred hhcceeeccC----CCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 194 LELAIVKGNG----LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) Q Consensus 194 ~~~a~l~G~G----~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (377) +|+.||.=-= .++-.|+-+.. +.+.......+.+ .....+....+..++-......+...+. 
T Consensus 311 INReii~~l~~~a~~~k~~g~~~~~--------~~~Gv~d~~~~~~------~~~~~~~~e~~k~L~~~i~~~an~I~~~ 376 (529) T protein:vir:10 311 INREVIDWINYTAQVGKSGWTKTDG--------SASGVFDFQDPID------VRGARWAGESYKALLIQIDKEANEIARQ 376 (529) T ss_pred hhHHHHHhHhhhhhhhhcccccccc--------cccceeecccCcc------ccccchHHHHHHHHHHHHHHHHHHHHHh Confidence 9998875110 00111110000 0000000000000 0011111111122222222222222222 Q ss_pred c---cCceEEEeccchhhhhccc---------------ccccCCCCccccccCCCceEEecCCCCcceEEEEecc--cEE Q lcl|Aclame:pro 270 I---AGQVKLLLNPEDRWTLEAK---------------FTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVAN--RYD 329 (377) Q Consensus 270 ~---~~~~~~~~n~~~~~~~~~~---------------~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s--~y~ 329 (377) + .++ .+++.|.-.. ++.. ....+..+.+...|.-+++|+.+++.+.+-+++|--. .|. T Consensus 377 T~rg~~n-~vi~S~~Va~-~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 454 (529) T protein:vir:10 377 TGRGAGN-FIIASRNVVS-ALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLD 454 (529) T ss_pred hccccce-EEEEchHHHH-HHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccc Confidence 2 233 3445553222 2211 0000111224445555778888888887766666321 111 Q ss_pred EEe-----cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccc--------------------------eEEEEeecC Q lcl|Aclame:pro 330 AFM-----ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHT--------------------------AALLTLAGG 377 (377) Q Consensus 330 ~~~-----~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~a--------------------------f~~l~~~a~ 377 (377) .+. .....+...|...|.. 
.+-.+.|+ |-.++|=+ |++|.++.= T Consensus 455 ~glfy~PYv~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 455 AGIYYCPYVALTPLRGSDPKNFQP---VMGFKTRY-AIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred cceeeccccccccccccCCCcccc---eeeeeeee-ceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 000 0011112233333333 23334444 22344411 122222211 No 191 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=57.12 E-value=0.44 Score=22.53 Aligned_cols=302 Identities=10% Similarity=0.021 Sum_probs=135.3 Q ss_pred ccHHHHHHHHHHHhc------cC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~ 136 (377) |+.+-|..|+..... .. .....+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 666666666554321 11 1223567878889999999999999999999998862 3456555555544432 Q ss_pred ccc--ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCC----C--- Q lcl|Aclame:pro 137 DIF--GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGL----L--- 205 (377) Q Consensus 137 ~e~--~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~----~--- 205 (377) ... .+..+......+.-.+..++.-.-..|+.+.|+. ...+|..-+++.+.++++.-+-.--+||+-. 
+ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 211 1121122233444555555555556788887752 2367999999999999887776666777641 1 Q ss_pred cce------eeeecccccccccc-cccc--ccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 206 QPV------GLLKDLSQPTVDQS-TGRD--ITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 206 ~P~------Gil~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) .|. |+|......+.... +... ..........++ ...+. .++..+..+..-.-.........+.+. T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G--~~gdy----~NLDAlV~D~~~~lI~~~~~~d~dLVv 234 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVG--KNGDY----ENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeC--CCCCc----ccHHHHHHHHHhccCChHHhcCCCEEE Confidence 233 44422111110000 0000 000000000000 01111 111111111111111111223345666 Q ss_pred EeccchhhhhcccccccC-CC-----Ccccc--ccCCCceEEecCCCCcceEEEEec---ccEEE--Eeccee-eEE-ee Q lcl|Aclame:pro 277 LLNPEDRWTLEAKFTSRN-QF-----GEYVT--VLPHGITILESLAVETGKAIAFVA---NRYDA--FMATAS-TIE-EY 341 (377) Q Consensus 277 ~~n~~~~~~~~~~~~~~~-~~-----G~~~~--~l~~~~~v~~s~~~~~~~ii~gd~---s~y~~--~~~~~~-~i~-~~ 341 (377) +|...-...-...+.... .. ++-+. -..=|+|.+.-+++|++.+++=-| |=|+- ..|.-+ +.. +. 
T Consensus 235 ivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 314 (355) T protein:vir:98 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKD 314 (355) T ss_pred EEchhhhHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccc Confidence 666432111111111111 00 11111 111277888899999998876544 44432 111111 111 11 Q ss_pred c-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 342 D-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 342 ~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) . |.+...+ .+|..-.+--+.+++ .+.+....+- T Consensus 315 rie~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~ 348 (355) T protein:vir:98 315 RVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAP 348 (355) T ss_pred cccchhhhc-ceeeeeccccEEEee--ceeeeCCCCC Confidence 1 1122222 255433333333333 3333322211 No 192 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=56.48 E-value=0.46 Score=22.46 Aligned_cols=348 Identities=10% Similarity=0.012 Sum_probs=119.0 Q ss_pred CCccHHHHHHHHHHHHHHHHH---HHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHH--- Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAK---ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKF--- 74 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~--- 74 (377) |..+ ++|.++|.-+.+-.+. +...-.+..-.+.+|.....+.++. 
.++........+.-|++.|..- T Consensus 1 ~~~~-~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~------~~~~~~~~~~~~~~l~ea~~~~~~~ 73 (524) T protein:vir:98 1 MSKK-NELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDP------VYRDEKIVESFGGFLAEAEIAGDHN 73 (524) T ss_pred Ccch-HHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCc------cccchHHHHhhhccccccccccccc Confidence 7665 4666666655432111 1111111111122222111111111 1111111111222232222100 Q ss_pred HHHH-HhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------E--EEEEcCC-c---------- Q lcl|Aclame:pro 75 FNDI-DKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------K--ALTAETS-G---------- 131 (377) Q Consensus 75 ~~~~-~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~--~p~~~~~-~---------- 131 (377) +... ...++++ +.. +.+...++..+|. .-+-.+++-|.||++.. + ++-.+.. + T Consensus 74 ~~~~~i~~s~~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~ 149 (524) T protein:vir:98 74 YDQTNIASGKSS-GAI---TNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFH 149 (524) T ss_pred cccccccccccc-ccc---ccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccc Confidence 0000 0001111 110 1112222222221 11223444555544321 1 1110000 0 Q ss_pred -----ceeeec-------------------------------------------c------------------------- Q lcl|Aclame:pro 132 -----TAVWGD-------------------------------------------I------------------------- 138 (377) Q Consensus 132 -----~a~w~~-------------------------------------------e------------------------- 138 (377) ++.|.+ . 
T Consensus 150 ~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~ 229 (524) T protein:vir:98 150 PMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEIS 229 (524) T ss_pred cccccccccCCccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecc Confidence 000000 0 Q ss_pred cc------c----ccccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 139 FG------E----IKGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVALELA 197 (377) Q Consensus 139 ~~------~----~~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~~~~a 197 (377) .+ + ....+...|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..-+|+- T Consensus 230 ~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINRe 309 (524) T protein:vir:98 230 VGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINRE 309 (524) T ss_pred cccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 00 0 0011233477777777776653 45888888873 46799999999999999999999 Q ss_pred eeeccCC-C--cceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcc---c Q lcl|Aclame:pro 198 IVKGNGL-L--QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKI---A 271 (377) Q Consensus 198 ~l~G~G~-~--~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 271 (377) ||.=--. . +-.|+-+... +.+.......+.+ ....++....+..++-......+...+.+ . 
T Consensus 310 ii~~i~~~a~~~~~g~t~~~~-------~~~G~~dl~~~~d------~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~ 376 (524) T protein:vir:98 310 IVDLINYTAQVGKSGFTQTVG-------SKAGSFDFQDPVD------IRGARWAGESYKALLIQIDKEANEIARQTGRGA 376 (524) T ss_pred HHHHHhhhheeceeecccccc-------cccceeecccccc------ccccchhHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 8831000 1 1122211100 0000000000000 00111112122222222222222222222 2 Q ss_pred CceEEEeccchhhhhc-------cccccc----CCC---CccccccCCCceEEecCCCCcceEEEEeccc--E-----EE Q lcl|Aclame:pro 272 GQVKLLLNPEDRWTLE-------AKFTSR----NQF---GEYVTVLPHGITILESLAVETGKAIAFVANR--Y-----DA 330 (377) Q Consensus 272 ~~~~~~~n~~~~~~~~-------~~~~~~----~~~---G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~--y-----~~ 330 (377) ++ .+++.|.-..-|. +..... +.+ ..+...|.-+++|+.+++.+.+-+++|--.. + +. T Consensus 377 ~n-~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfya 455 (524) T protein:vir:98 377 GN-FIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYA 455 (524) T ss_pred cc-EEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeec Confidence 33 3455654221111 100000 111 1223334446788888888877666663211 1 10 Q ss_pred EecceeeEEeechhhhhcCcEEEEEEEEEcCEEeccc--------------------------ceEEEEeecC Q lcl|Aclame:pro 331 FMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNH--------------------------TAALLTLAGG 377 (377) Q Consensus 331 ~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~--------------------------af~~l~~~a~ 377 (377) ==.....+...|...|.. 
.+-.+.|+ |-.++|= =|++|.++.= T Consensus 456 PYv~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 456 PYVALTPLRGSDPKNFQP---VMGFKTRY-GIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred cccccccccccCCccccc---eeeeeeee-ceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 000111112233333333 23334444 2244441 1222222222 No 193 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=55.56 E-value=0.48 Score=22.35 Aligned_cols=297 Identities=9% Similarity=0.031 Sum_probs=132.7 Q ss_pred ccHHHHHHHHHHHh------ccCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeeecc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDK------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) Q Consensus 67 lt~~e~~~~~~~~~------~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e 138 (377) |+.+-|..|+.... .....+..+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 66666666655432 2233445677888899999999999999999999998862 345555555544443211 Q ss_pred c-ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCCC-------cce Q lcl|Aclame:pro 139 F-GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGLL-------QPV 208 (377) Q Consensus 139 ~-~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~-------~P~ 208 (377) . ....+..-..++.-.+..++.-.=..|+.+.|+. ...+|..-+++.+.++++.-.=.--+||+-.. .|. 
T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 1 1121111124444455555554445677887752 34579999999999988876666667776321 232 Q ss_pred ------eeeecccccccccc-ccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccc Q lcl|Aclame:pro 209 ------GLLKDLSQPTVDQS-TGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPE 281 (377) Q Consensus 209 ------Gil~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~ 281 (377) |+|......+.... .....+... ....+ +..+. .++..+..+..-.-...........+.+|-.. T Consensus 161 lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~--i~~~G--~ggdy----~NLDalV~d~~~~lId~~~~~d~dLVvivG~d 232 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGK--ITVGG--AGADY----GNLDALVYDITNHLVEPWYAEDPDLVVVCGRN 232 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccce--eEecc--CCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 44322111110000 000000000 00000 00111 11111111111000111122233455555543 Q ss_pred hhhhhcccccccCC------CCcccc--ccCCCceEEecCCCCcceEEEEec---ccEEE--Eecceee-EE-eec-hhh Q lcl|Aclame:pro 282 DRWTLEAKFTSRNQ------FGEYVT--VLPHGITILESLAVETGKAIAFVA---NRYDA--FMATAST-IE-EYD-QTF 345 (377) Q Consensus 282 ~~~~~~~~~~~~~~------~G~~~~--~l~~~~~v~~s~~~~~~~ii~gd~---s~y~~--~~~~~~~-i~-~~~-~~~ 345 (377) -...-...+..... .++-+. -..=|.|.+.-+++|++.+++=-| |=|+- ..|.-+. .. +.. 
|.+ T Consensus 233 Lla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 312 (339) T protein:vir:79 233 LLSDKYFPLVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIENY 312 (339) T ss_pred hhhhHhhhHhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccch Confidence 21111111111100 011111 011267888889999999876544 44432 1111111 11 111 111 Q ss_pred hhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 346 AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 346 f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ...+ .+|..-.+.-+.+++ - +++..+ T Consensus 313 ~s~N-e~YvVEd~~~~a~iE--n---i~~~~a 338 (339) T protein:vir:79 313 ESSN-DAYVIEDLACAAMAE--N---IALAAA 338 (339) T ss_pred hhcc-ceeeeeccccEEEee--e---eecccC Confidence 2222 245433333333333 1 222222 No 194 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=54.37 E-value=0.51 Score=22.21 Aligned_cols=340 Identities=9% Similarity=0.013 Sum_probs=119.1 Q ss_pred CCccHHHHHHHHHHHHHHHHH--HHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAK--ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDI 78 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~ 78 (377) |.|+-|+|.++|.-+.+-... +.+.-.+..-.+.+|.....+.++. .++. ..|.+.+-+.+.+. 
T Consensus 1 ~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~------~~~~--------~~~~e~~~~~l~e~ 66 (529) T protein:vir:10 1 MSLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDP------VYRD--------DKLIEAFGQSLMEA 66 (529) T ss_pred CccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhccc------ccch--------hhhhhhhhccchhh Confidence 999999999998765443220 1111111111122222111111110 0100 11111111112111 Q ss_pred H------------hccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------EEEEEcCC------- Q lcl|Aclame:pro 79 D------------KNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------KALTAETS------- 130 (377) Q Consensus 79 ~------------~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~~p~~~~~------- 130 (377) . ..++.+ +.. +.+...++..+|. .-+-.+++-|.||++.. +.-..... T Consensus 67 ~~~~~~~~~~~~i~~st~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~e 142 (529) T protein:vir:10 67 EVAGDHGYDPTNIAAGQSS-GAI---TNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKE 142 (529) T ss_pred ccccccccccccccccccc-ccc---cccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCcccccccc Confidence 1 001111 110 1111122222221 11223344454443211 10000000 Q ss_pred ---------------------------------------------------------------cce-eee---------- Q lcl|Aclame:pro 131 ---------------------------------------------------------------GTA-VWG---------- 136 (377) Q Consensus 131 ---------------------------------------------------------------~~a-~w~---------- 136 (377) ..+ .+. 
T Consensus 143 af~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~ 222 (529) T protein:vir:10 143 AFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLIN 222 (529) T ss_pred cccccccccccccccccccccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccc Confidence 000 000 Q ss_pred ----------------cccccc----cccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHH Q lcl|Aclame:pro 137 ----------------DIFGEI----KGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQ 185 (377) Q Consensus 137 ----------------~e~~~~----~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~ 185 (377) ...++. ...+...|.+..|...|..+- ...|-||.+|- ..|.|++|.+- T Consensus 223 ~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNI 302 (529) T protein:vir:10 223 AAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGI 302 (529) T ss_pred cccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHH Confidence 000000 011223477777777776543 45888988873 46789999999 Q ss_pred HHHHHHHHhhcceeeccC----CCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 186 LKEAIAVALELAIVKGNG----LLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSV 261 (377) Q Consensus 186 la~~~a~~~~~a~l~G~G----~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 261 (377) |+..|..-+|+.||.=-= .++-.|+-+. ...+.......+.+ .....+....+..++-.... 
T Consensus 303 LStEImlEINReii~~l~~~a~~~~~~~~~~~--------~~~~Gv~d~~~~~~------~~~~~~~~e~~~~L~~~i~~ 368 (529) T protein:vir:10 303 LANEVMLEINREVIDWINYTAQVGKSGWTKTD--------GSASGVFDFQDPID------VRGARWAGESYKALLIQIDK 368 (529) T ss_pred HHHHHHHHhhHHHHHHHhhhhhhhcccccccc--------ccccceeecccCcc------ccccchHHHHHHHHHHHHHH Confidence 999999999998874110 0000111000 00000000000000 00111111111222222222 Q ss_pred hhhhhhhcc---cCceEEEeccchhhhhc-------ccc----c---ccCCCCccccccCCCceEEecCCCCcceEEEEe Q lcl|Aclame:pro 262 NDKKHPLKI---AGQVKLLLNPEDRWTLE-------AKF----T---SRNQFGEYVTVLPHGITILESLAVETGKAIAFV 324 (377) Q Consensus 262 ~~~~~~~~~---~~~~~~~~n~~~~~~~~-------~~~----~---~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd 324 (377) ..+...+.+ .++ .+++.|.-..-|. |.. . ..+..+.+...|.-+++|+.+++.+.+-+++|- T Consensus 369 ~an~I~~~T~rg~~n-~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 447 (529) T protein:vir:10 369 EANEIARQTGRGAGN-FIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGY 447 (529) T ss_pred HHHHHHHhhccccce-EEEEchHHHHHHHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEE Confidence 222222222 233 3445553222111 000 0 001112244455557788888888776666663 Q ss_pred cc--cEEEEe-----cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccc--------------------------eEE Q lcl|Aclame:pro 325 AN--RYDAFM-----ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHT--------------------------AAL 371 (377) Q Consensus 325 ~s--~y~~~~-----~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~a--------------------------f~~ 371 (377) -. .|..+. ..+..+...|...|.. 
.+-.+.|+ |-.++|=+ |++ T Consensus 448 KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~ 523 (529) T protein:vir:10 448 RGANNLDAGIYYCPYVALTPLRGFDPKNFQP---VMGFKTRY-AIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRR 523 (529) T ss_pred eCCcccccceeeccccccccccccCCCcccc---eeeeeeee-ceeecCccccccccccccccCCcchhhhcCccceeEE Confidence 21 011000 0011111223333333 23334444 22334311 122 Q ss_pred EEeecC Q lcl|Aclame:pro 372 LTLAGG 377 (377) Q Consensus 372 l~~~a~ 377 (377) |.++.= T Consensus 524 ~~Vk~l 529 (529) T protein:vir:10 524 VWVKGL 529 (529) T ss_pred eeeccC Confidence 222211 No 195 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=53.16 E-value=0.54 Score=22.07 Aligned_cols=300 Identities=13% Similarity=0.035 Sum_probs=122.7 Q ss_pred HHHHHHHHHHHHHHHHHhccccccc-------cHHHHHHHHHHHhc----cCCCCCceeccHHHHHHHH-HHHHhhhhhh Q lcl|Aclame:pro 43 GDEILAKNEEEMERMFDLRDKNREL-------TAEEIKFFNDIDKN----VGGKDKFKLLPEETMVQVF-DDLVAEHPLL 110 (377) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~l-------t~~e~~~~~~~~~~----~~~s~gg~lvP~~~~~~Ii-~~~~~~s~l~ 110 (377) .++. .......+.+-.| +.+-+..-...... .+++++| ||..+.+-|- ..++-..+-+ T Consensus 1 ~~~~--------~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g--~~~~l~~~i~p~~~~~~~~~~ 70 (336) T protein:vir:10 1 MRDA--------QRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPM 70 (336) T ss_pred CchH--------HHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcc--hHHHHHhhcCcceeeeeechh Confidence 0000 0000111111112 22211111111111 1122222 5554433221 2223333334 Q ss_pred hhceeEecC--C-----ceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHh---cCHHHHHH Q lcl|Aclame:pro 111 KVINFKNTS--L-----RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALK---FGPKWLKQ 180 (377) Q Consensus 111 ~~~~v~~~~--~-----~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~---ds~~~~~~ 180 (377) +...++|++ | .+.+++....+.+.+.+..... 
+..+..-+.-.-..+.+..-+.++.+=+. ....++.+ T Consensus 71 ~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~-P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~ 149 (336) T protein:vir:10 71 KAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSD-GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) T ss_pred chhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCC-cceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHH Confidence 444444433 2 2356666666666666433333 34444444445556666766777744332 23567888 Q ss_pred HHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhh Q lcl|Aclame:pro 181 FITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLS 260 (377) Q Consensus 181 ~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 260 (377) --....++++.+.+|+-.++|++..+-.|++|.+........++.... ...+...++.+..+...+. T Consensus 150 ~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~-------------~~T~~eI~~Di~~~~~~l~ 216 (336) T protein:vir:10 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG-------------SPAVEAVVNEVVTLFQVLQ 216 (336) T ss_pred HHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCccc-------------ccCHHHHHHHHHHHHHHHH Confidence 888899999999999999999998889999998766433222111100 1112222333333333222 Q ss_pred hhhhhhhhcccCceEEEeccchhhhhcccccccCCCCc-cccccC--CC-ceEEecCCCCcceEEEEecccEEEEec--- Q lcl|Aclame:pro 261 VNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGE-YVTVLP--HG-ITILESLAVETGKAIAFVANRYDAFMA--- 333 (377) Q Consensus 261 ~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~-~~~~l~--~~-~~v~~s~~~~~~~ii~gd~s~y~~~~~--- 333 (377) ....+. ....-...+++-|.-+ ..+.. .+..|. ....|- || +.++..+...+ .-|+..+.+.... 
T Consensus 217 ~qt~g~-i~~~~~~tL~Lp~~~~-~~L~~---~n~~g~tv~~~lk~n~Pnl~i~t~pel~~---Agg~~~~~~~~~~~~~ 288 (336) T protein:vir:10 217 TQSQGI-ITQEAVLHMGLPPTAM-SDLSK---TNQYGLSAAAKLKEIFPKLEFVTIPEYDT---ASGRLVQLWAPRVEGK 288 (336) T ss_pred HhcCCe-eeeccceEEEechHHH-HhccC---CCccCccHHHHHHHhCCccEEEEcccccc---cCCceEEEEEecccCC Confidence 221111 1111122344444432 22221 233332 111111 22 44444332211 0122222222111 Q ss_pred ceeeEEeechhh----hhcC--cEEEEEEEEEcCEEe-cccceEEEEeecC Q lcl|Aclame:pro 334 TASTIEEYDQTF----AMED--LQLYLTKNYFYGKAK-DNHTAALLTLAGG 377 (377) Q Consensus 334 ~~~~i~~~~~~~----f~~~--~~~~~~~~r~dg~~~-~~~af~~l~~~a~ 377 (377) .-.++.. .+.+ .... .....+..|..|.++ .|-||+.++ += T Consensus 289 ~t~~~~~-P~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~--GI 336 (336) T protein:vir:10 289 DTATCGF-TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQML--GV 336 (336) T ss_pred cceeeec-ChhhhccceeecCceeEeccccceeeeeeeccchheeec--cC Confidence 1122211 1111 1111 222335667766443 456665543 11 No 196 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=52.16 E-value=0.56 Score=21.96 Aligned_cols=290 Identities=10% Similarity=0.027 Sum_probs=131.6 Q ss_pred ccHHHHHHHHHHHhcc----C------CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCccee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKNV----G------GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAV 134 (377) Q Consensus 67 lt~~e~~~~~~~~~~~----~------~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~ 134 (377) +..+-|..|+...... 
+ +..--+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 5556666665544311 1 1112467888889999999999999999999998862 34566555555444 Q ss_pred eeccc--ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCCC----- Q lcl|Aclame:pro 135 WGDIF--GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGLL----- 205 (377) Q Consensus 135 w~~e~--~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~----- 205 (377) -..-. +...+..-...+.-.+..++.-.=..|+.+.|+. ...+|..-+++.+.++++.-.=.--+||+-.. T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 32211 1222222234455555555555556688887752 34579999999999988877666667776321 Q ss_pred --cce------eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEE Q lcl|Aclame:pro 206 --QPV------GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLL 277 (377) Q Consensus 206 --~P~------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (377) .|. |+|......+....-..... .....++ ...+. 
.++..+..+....-.........+.+.+ T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~---~~~i~iG--~~gdy----~NLDalV~D~~~~lI~~~~~~d~dLVvi 231 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGEST---DNQVLVG--KGQEY----ANLDALVMDATEELIDEWHRDDTDLVVI 231 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhccccee---ccceeec--CCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEE Confidence 232 44432111110000000000 0000000 01111 1111111111100011112223445555 Q ss_pred eccchhhhhcccccccCC------CCcccc--ccCCCceEEecCCCCcceEEEEe---cccEEE--Eecceee-EE---- Q lcl|Aclame:pro 278 LNPEDRWTLEAKFTSRNQ------FGEYVT--VLPHGITILESLAVETGKAIAFV---ANRYDA--FMATAST-IE---- 339 (377) Q Consensus 278 ~n~~~~~~~~~~~~~~~~------~G~~~~--~l~~~~~v~~s~~~~~~~ii~gd---~s~y~~--~~~~~~~-i~---- 339 (377) |-..-...-...+..... .++-+. -..=|.|.+.-+++|++.+++=- +|=|+- ..|.-+. .. T Consensus 232 vG~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 311 (342) T protein:vir:10 232 TGRKLLADKYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDR 311 (342) T ss_pred EchhhhHHHHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 554322111111111100 011110 01126788888999999887654 444442 1111111 10 Q ss_pred -----eechhhhhcCcEEEEEEEEEcCEEeccc Q lcl|Aclame:pro 340 -----EYDQTFAMEDLQLYLTKNYFYGKAKDNH 367 (377) Q Consensus 340 -----~~~~~~f~~~~~~~~~~~r~dg~~~~~~ 367 (377) .+.+.+..+|--.+-++. 
+.++.+|+ T Consensus 312 ie~y~s~Ne~YvVEd~~~~a~iE--~i~i~~~~ 342 (342) T protein:vir:10 312 IETYESENIDYVVEDYGCAALIE--NITLKDKE 342 (342) T ss_pred ccchhhhccceeeeccccEEEee--cceecCCC Confidence 011222222221221221 34555555 No 197 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=52.12 E-value=0.56 Score=21.95 Aligned_cols=346 Identities=12% Similarity=0.031 Sum_probs=123.2 Q ss_pred CCccHH-HHHHHHHHHHHHHHHHHhcc--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHH---H Q lcl|Aclame:pro 1 MAINLK-ELPKYREAVAELSAKISAGA--TPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIK---F 74 (377) Q Consensus 1 m~~~~~-~l~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~ 74 (377) |+|+.+ +|.++|.-+.+-.. +.+-. .+.--.+.+|..+..++++. +++........+.-|++.+.. . T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~-~~~i~~~~~~~~a~~~enq~~~~~~~~------~~~~~~~~~~~~~~l~e~~~~~~~~ 73 (521) T protein:vir:72 1 MTIKTKAELLNKWKPLLEGEG-LPEIANSKQAIIAKIFENQEKDFQTAP------EYKDEKIAQAFGSFLTEAEIGGDHG 73 (521) T ss_pred CCcchhHHHHHhhhhhhccCC-CCccccchhhhhhhhhhhhhhhhhhcc------cccchHHHHHHhhhhhhhcccCccc Confidence 999764 47888875544311 11100 11111122222222221111 111111111111111110000 0 Q ss_pred HHH-HHhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------EEEEEcCC-------------- Q lcl|Aclame:pro 75 FND-IDKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------KALTAETS-------------- 130 (377) Q Consensus 75 ~~~-~~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~~p~~~~~-------------- 130 (377) .+. ....++++ +.. +.+...++..+|. .-+-.+++-|.||++.. +.-..... 
T Consensus 74 ~~~~~iaes~~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~ 149 (521) T protein:vir:72 74 YNATNIAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYG 149 (521) T ss_pred cCcccccccccc-ccc---ccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcc Confidence 000 00001111 111 1222233333332 11233455566654321 11110000 Q ss_pred cceeeec------------------------------------------------------------------------- Q lcl|Aclame:pro 131 GTAVWGD------------------------------------------------------------------------- 137 (377) Q Consensus 131 ~~a~w~~------------------------------------------------------------------------- 137 (377) +.+.|.+ T Consensus 150 ~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~ 229 (521) T protein:vir:72 150 PDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMA 229 (521) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccc Confidence 0000000 Q ss_pred -cccc----ccccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHhhcceeec Q lcl|Aclame:pro 138 -IFGE----IKGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVALELAIVKG 201 (377) Q Consensus 138 -e~~~----~~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~~~~a~l~G 201 (377) ..++ ....++..|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..-+|+-||. 
T Consensus 230 Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~- 308 (521) T protein:vir:72 230 TSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD- 308 (521) T ss_pred hhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhh- Confidence 0000 0011123467777777666543 45888888873 46799999999999999999999883 Q ss_pred cC-CC-c--ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc---ccCce Q lcl|Aclame:pro 202 NG-LL-Q--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK---IAGQV 274 (377) Q Consensus 202 ~G-~~-~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 274 (377) .= .. + -.|+.+. ............+.+ .....+....+..++-...+..+..... ..+++ T Consensus 309 ~i~~sa~~g~~g~t~~-------~~~~~G~~d~~~~~d------~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~ 375 (521) T protein:vir:72 309 WINYSAQVGKSGMTLT-------PGSKAGVFDFQDPID------IRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNF 375 (521) T ss_pred hhhheeeeeeeeeeec-------cCccccceecccccc------cccchHHHHHHHHHHHHHHHHHHHHHHhcccccceE Confidence 20 00 0 1122100 000000000000000 0001111111112222222222222222 22333 Q ss_pred EEEeccchhhhhcccc------cccC---------CCCccccccCCCceEEecCCCCcceEEEEeccc--E-----EEEe Q lcl|Aclame:pro 275 KLLLNPEDRWTLEAKF------TSRN---------QFGEYVTVLPHGITILESLAVETGKAIAFVANR--Y-----DAFM 332 (377) Q Consensus 275 ~~~~n~~~~~~~~~~~------~~~~---------~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~--y-----~~~~ 332 (377) +++.|.-.. ++... ..++ ....|...|.-+++|+.+++.+.+-+++|.-.. 
+ +.== T Consensus 376 -~i~S~~Va~-~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPY 453 (521) T protein:vir:72 376 -IIASRNVVN-VLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPY 453 (521) T ss_pred -EEEchHHHH-HHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccc Confidence 445553322 22110 0011 011234455557788888888877677663211 1 1000 Q ss_pred cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEE-----EEeecC Q lcl|Aclame:pro 333 ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAAL-----LTLAGG 377 (377) Q Consensus 333 ~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~-----l~~~a~ 377 (377) .....+...|...|.. .+-.+.|+ |-.++|=+-.. -.|.+| T Consensus 454 v~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP~~~~~~~~~a~~i~~~ 499 (521) T protein:vir:72 454 VALTPLRGSDPKNFQP---VMGFKTRY-GIGINPFAESAAQAPASRIQSG 499 (521) T ss_pred cccccccccCCccccc---eeeeeeee-ceeecCcccccCcccceeecCc Confidence 0111112234434443 33345555 33555522111 122333 No 198 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=51.69 E-value=0.57 Score=21.90 Aligned_cols=302 Identities=11% Similarity=0.030 Sum_probs=135.6 Q ss_pred ccHHHHHHHHHHHhc------cC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~ 136 (377) |+.+-|..|+..... .. .....+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-. 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 666666666554321 11 1224567878889999999999999999999998863 3456665555554432 Q ss_pred ccc--ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCC----C--- Q lcl|Aclame:pro 137 DIF--GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGL----L--- 205 (377) Q Consensus 137 ~e~--~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~----~--- 205 (377) .-. .+..+......+.-.+..++.-.-..|+.+.|+. ...+|..-+++.+.++++.-+-.--+||+-. + T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 211 1122222233455555555555556788887752 2357999999999999887776666777641 1 Q ss_pred cce------eeeecccccccccc-cccc--ccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEE Q lcl|Aclame:pro 206 QPV------GLLKDLSQPTVDQS-TGRD--ITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) Q Consensus 206 ~P~------Gil~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (377) .|. |+|......+.... +... ...... ..+..-...+. .++..+..+..-.-.........+.+. 
T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~--~~i~~G~~gdy----~NLDAlV~d~~~~lI~~~~~~d~dLVv 234 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVS--AVIRVGKNGDY----ENLDALVMDGTNTLIDEIYQDDPKLVA 234 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhcccccccccccc--ceeeecCCCCc----ccHHHHHHHHHhccCChHHhcCCCEEE Confidence 233 44422111110000 0000 000000 00000001111 111111111110011111223345666 Q ss_pred EeccchhhhhcccccccCC------CCcccc--ccCCCceEEecCCCCcceEEEEec---ccEEEE--eccee-eEE-ee Q lcl|Aclame:pro 277 LLNPEDRWTLEAKFTSRNQ------FGEYVT--VLPHGITILESLAVETGKAIAFVA---NRYDAF--MATAS-TIE-EY 341 (377) Q Consensus 277 ~~n~~~~~~~~~~~~~~~~------~G~~~~--~l~~~~~v~~s~~~~~~~ii~gd~---s~y~~~--~~~~~-~i~-~~ 341 (377) +|...-...-...+..... .++-+. -..=|+|.+.-+++|++.+++=-| |=|+-. .|.-+ +.. +. T Consensus 235 ivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 314 (355) T protein:vir:18 235 IVGRKLLADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENPKKD 314 (355) T ss_pred EEchhhhHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccc Confidence 6664321111111111110 011111 111277888899999998876544 434321 11111 111 11 Q ss_pred c-hhhhhcCcEEEEEEEEEcCEEecccceEEEE------eecC Q lcl|Aclame:pro 342 D-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLT------LAGG 377 (377) Q Consensus 342 ~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~------~~a~ 377 (377) . |.+...+ .+|..-.+--+.+++ .+.+.. 
.++| T Consensus 315 rie~y~s~N-e~YvVEd~~~~a~ie--ni~~~~~~~~~~~~~g 354 (355) T protein:vir:18 315 RVENYESMN-IDYVVEAYAAGCLLE--NITLGDFTAPAAPEGG 354 (355) T ss_pred cccchhhhc-ceeeeeccccEEEEe--eeeecCCCCcccccCC Confidence 1 1122222 245333333233332 333332 2233 No 199 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=50.83 E-value=0.6 Score=21.81 Aligned_cols=304 Identities=10% Similarity=-0.005 Sum_probs=132.6 Q ss_pred ccHHHHHHHHHHHhc------cC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~ 136 (377) |+.+-|..|+..... .. .....+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 666666666554321 11 1224567888889999999999999999999998863 3456655555544432 Q ss_pred ccc--ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCCC------- Q lcl|Aclame:pro 137 DIF--GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGLL------- 205 (377) Q Consensus 137 ~e~--~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~------- 205 (377) ... ....+..-..++.-.+..++.-.=..|+.+.|+. ...+|..-+++.+.++++.-+=.--+||+-.. 
T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 111 1221111123444455555554445677777752 23578999999999988876666667776321 Q ss_pred cce------eeeeccccccccccccc-cccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 206 QPV------GLLKDLSQPTVDQSTGR-DITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 206 ~P~------Gil~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .|. |+|......+....-.. ....-......+..-...+ +.++..+..+....-.........+.+.+| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gd----y~NLDalV~D~~~~lI~~~~~~d~dLVviv 236 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGD----YASLDALVMDATNNLIEPWYQEDPDLVVIV 236 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCC----cccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 232 44422111110000000 0000000000000000011 111111111111000111122234555555 Q ss_pred ccchhhhhcccccccCCCCcc--------cc--ccCCCceEEecCCCCcceEEEEe---cccEEEE--ecceee-EE-ee Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQFGEY--------VT--VLPHGITILESLAVETGKAIAFV---ANRYDAF--MATAST-IE-EY 341 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~~G~~--------~~--~l~~~~~v~~s~~~~~~~ii~gd---~s~y~~~--~~~~~~-i~-~~ 341 (377) -..-...-...+ .+....+ +. -..=|+|.+.-+++|++.+++=- +|=|+-. .|.-+. .. +. 
T Consensus 237 G~dLla~k~~~l--~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 314 (357) T protein:vir:60 237 GRQLLADKYFPI--VNREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLD 314 (357) T ss_pred chhhhhHHhhhH--hhcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccc Confidence 543211111111 1111111 10 01126788888999999987654 4444421 111111 11 11 Q ss_pred c-hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 342 D-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 342 ~-~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) . |.+...+ .+|..-.+--+.+++.=.++-..-.|+ T Consensus 315 riE~y~s~N-e~YvVEd~~~~a~iE~i~~~~~~~pa~ 350 (357) T protein:vir:60 315 RVENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAK 350 (357) T ss_pred cccchhhhc-ceeeeeccccEEEeeeeeeccCccccc Confidence 1 1122222 255433333334443211211111111 No 200 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=48.59 E-value=0.66 Score=21.56 Aligned_cols=312 Identities=11% Similarity=0.011 Sum_probs=128.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHH-HHHHHHH Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEI-KFFNDID 79 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~-~~~~~~~ 79 (377) |++++.-.. .+++.+ +---.+ ......++.+-. -++.... 
T Consensus 1 ~~~~~~~~~-----~~~l~~-~g~~~~---------------------------------~~~~~~~~~~~~~~a~d~~~ 41 (339) T protein:vir:94 1 MSINNDRTD-----IKQLEK-VGIIFD---------------------------------GYSPKSISSEVSAYAMDAVN 41 (339) T ss_pred CceechHHH-----HHHHHh-hceeec---------------------------------cchhhhcchhhHhhhccccc Confidence 222211000 000000 000000 000000110000 0000000 Q ss_pred ---hccCCCCCce--eccHHHHHHHHHHHHhhhhhhhhceeEecCC----ceEEEEEcCCcceeeecccccc-ccccccc Q lcl|Aclame:pro 80 ---KNVGGKDKFK--LLPEETMVQVFDDLVAEHPLLKVINFKNTSL----RLKALTAETSGTAVWGDIFGEI-KGQLKQA 149 (377) Q Consensus 80 ---~~~~~s~gg~--lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~----~~~~p~~~~~~~a~w~~e~~~~-~~~~~~~ 149 (377) ...+..+.|. ..++.+.+.|++...+.-..+.++.+.+.+. .+.+++.+..+.+.|.+..... ....+.. T Consensus 42 ~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~ 121 (339) T protein:vir:94 42 LTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVN 121 (339) T ss_pred cccccccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccce Confidence 0111222222 1334445666666666555666666555442 3578888888888887543333 1222345 Q ss_pred ceeEeecceeEEEeehhhHHHHh--cCHHHHHHHHHHHHHHHHHHHhhcceeeccCCCcceeeeeccccccccccccccc Q lcl|Aclame:pro 150 FKEQDFSQFKLTAFVVIPKDALK--FGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDI 227 (377) Q Consensus 150 f~~i~l~~~k~~~~~~iS~ell~--ds~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~ 227 (377) |.+.++..+..+-... ..|+-. ....++.+--.....+++...+|+..++|+-..+-.|++|++...+....+. 
T Consensus 122 ~~~~~v~~~~~g~~y~-~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~--- 197 (339) T protein:vir:94 122 FESRQNYRYQTWTEYG-DLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATV--- 197 (339) T ss_pred eeEEeEEEEEEEEeec-HHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCC--- Confidence 6666655555443332 333332 2356788888899999999999999999987777899999876543221111 Q ss_pred cccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhhcccccccCCCCc-cccccC-- Q lcl|Aclame:pro 228 TTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGE-YVTVLP-- 304 (377) Q Consensus 228 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~-~~~~l~-- 304 (377) .+ ...++...++.+..+...+.....+ ....+-...+++.|.-+ ..+.. .+..|. ....+- T Consensus 198 -~W----------a~kT~~eI~~Di~~~~~~l~~~s~g-~~~~~~~~~L~LP~~~~-~~L~~---~n~~~~Tvl~~lk~n 261 (339) T protein:vir:94 198 -NW----------ATAAPEDIANDVVAMVGRLISQSGG-LITGQERMVMALAPSAL-NNVNR---TNNFGLSAGAKIAQT 261 (339) T ss_pred -Cc----------ccCCHHHHHHHHHHHHHHHHHhcCC-eeeeccCcEEEecHHHH-Hhccc---CCcCCccHHHHHHHh Confidence 11 0122333333333333332222111 11111223455555433 22221 233332 111110 Q ss_pred C-CceEEecCCCCcceEEEEecccEEEEe---cceeeEEeechhhh---hcC--cEEEEEEEEEcC-EEecccceEEEEe Q lcl|Aclame:pro 305 H-GITILESLAVETGKAIAFVANRYDAFM---ATASTIEEYDQTFA---MED--LQLYLTKNYFYG-KAKDNHTAALLTL 374 (377) Q Consensus 305 ~-~~~v~~s~~~~~~~ii~gd~s~y~~~~---~~~~~i~~~~~~~f---~~~--~~~~~~~~r~dg-~~~~~~af~~l~~ 374 (377) + ++.++..+...+. -|+....++.. ..-..+.......+ ... 
.....+..|..| -+..|.||+.++ T Consensus 262 ~pnl~i~~~~el~~a---~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~- 337 (339) T protein:vir:94 262 YPNIQFVAVPEFDTA---SGRLVQLWVPEVNGQPTGEVAFAEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQEL- 337 (339) T ss_pred cCCcEEEEccccccC---CCceEEEEEEeccCCcceEEEcchhhhccccEEcCceEEecceeeeeeEEEEccceeeeee- Confidence 2 2444433222110 11111111111 11112211111111 111 222346677665 455678877664 Q ss_pred ecC Q lcl|Aclame:pro 375 AGG 377 (377) Q Consensus 375 ~a~ 377 (377) | T Consensus 338 --G 338 (339) T protein:vir:94 338 --G 338 (339) T ss_pred --c Confidence 4 No 201 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=47.83 E-value=0.69 Score=21.47 Aligned_cols=308 Identities=13% Similarity=0.036 Sum_probs=121.2 Q ss_pred HHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhcc----CCCCCceeccHHHHH----HHHHHHHhhhhhhhhce Q lcl|Aclame:pro 43 GDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNV----GGKDKFKLLPEETMV----QVFDDLVAEHPLLKVIN 114 (377) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~----~~s~gg~lvP~~~~~----~Ii~~~~~~s~l~~~~~ 114 (377) .++...-+.-+ +-.+-.......++.+-...-......+ +++++ -+|..+.+ .+++.+...-....++. 
T Consensus 1 ~~~~~~~~~l~-~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~--~~~~~l~~~i~p~~~~~~~~~~~~~~l~p 77 (336) T protein:vir:36 1 MRDAQRIQNLA-RAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVG 77 (336) T ss_pred CchHHHHHHHh-hcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCc--chHHHHHHhhccceEeeecchhhhhhhcc Confidence 00000000000 0000000011112222111111111111 12222 25654443 33333333323333333 Q ss_pred eEecC----CceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhh-HHHHh--cCHHHHHHHHHHHHH Q lcl|Aclame:pro 115 FKNTS----LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIP-KDALK--FGPKWLKQFITEQLK 187 (377) Q Consensus 115 v~~~~----~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS-~ell~--ds~~~~~~~l~~~la 187 (377) +...+ ....+++....+.+.+.+..... +..+..-...+-..+.+..-+.++ .|+.. -...++.+--....+ T Consensus 78 v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~ 156 (336) T protein:vir:36 78 ESKKGDWTTLVAAFITAEPTTKVATYGDYSSD-GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred ccccCCccceeEEEeeeeceeeEEEeeccCCC-ceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 33222 13456776666777776433333 445544444455566667667776 44443 235667788888899 Q ss_pred HHHHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 188 EAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHP 267 (377) Q Consensus 188 ~~~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 267 (377) +++.+.+|+-.++|++..+-.|++|++.........+.... ...+...++.+..+...+.....+. 
T Consensus 157 ~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~-------------~~t~~ei~~Di~~~~~~l~~qt~G~- 222 (336) T protein:vir:36 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG-------------SPAVEAVVNEVVALFQVLQTQSQGI- 222 (336) T ss_pred HHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCccc-------------ccCHHHHHHHHHHHHHHHHHhcCCe- Confidence 99999999999999988889999998765432221111110 0112222233333333222221111 Q ss_pred hcccCceEEEeccchhhhhcccccccCCCCccc-cccC--C-CceEEecCCCCcceEEEEecccEEEEecc---eeeEEe Q lcl|Aclame:pro 268 LKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV-TVLP--H-GITILESLAVETGKAIAFVANRYDAFMAT---ASTIEE 340 (377) Q Consensus 268 ~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~-~~l~--~-~~~v~~s~~~~~~~ii~gd~s~y~~~~~~---~~~i~~ 340 (377) .+......++|-|+. ...+.. .+..|.-+ ..+- | ++.++..+-.... -|+..+++..... ..++.. T Consensus 223 i~~~~~~tL~LP~~~-~~~Ls~---~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a---~g~~~~l~~~~~~~~~t~~~~~ 295 (336) T protein:vir:36 223 ITQEDVLRMGLPPTA-MSDLSK---TNQYGLAAAAKLKDIFPKLEFVTIPEYDTA---SGRLVQLWAPRVEGKDTATCGF 295 (336) T ss_pred eeeccccEEEechHH-HHhccC---CCccCccHHHHHHHhcCccEEEEccccccC---CCceEEEEEEecCCCcceeeec Confidence 111112345555443 222221 23333211 1110 2 2444333222110 1222122211111 122211 Q ss_pred echhh---hhcC--cEEEEEEEEEcCEE-ecccceEEEEeecC Q lcl|Aclame:pro 341 YDQTF---AMED--LQLYLTKNYFYGKA-KDNHTAALLTLAGG 377 (377) Q Consensus 341 ~~~~~---f~~~--~~~~~~~~r~dg~~-~~~~af~~l~~~a~ 377 (377) ...-. .... 
.....+..|..|.+ ..|-||+.++ += T Consensus 296 p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~--GI 336 (336) T protein:vir:36 296 TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) T ss_pred chhhhccceeecCceeEeccccceeeeeeeccchheeee--cC Confidence 11000 0111 12233556666643 3456666543 11 No 202 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=44.65 E-value=0.8 Score=21.12 Aligned_cols=294 Identities=13% Similarity=0.022 Sum_probs=118.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHH--hccCCCCCceeccHHHHHHHHHHHH Q lcl|Aclame:pro 27 TPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDID--KNVGGKDKFKLLPEETMVQVFDDLV 104 (377) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~--~~~~~s~gg~lvP~~~~~~Ii~~~~ 104 (377) ..+++. ......+..++-.+.|+..- +..+-.+++++=-+.+.++|..+.. T Consensus 1 ~~~~~n---------------------------~~~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~ 53 (464) T protein:vir:80 1 MTEKKN---------------------------TERQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTW 53 (464) T ss_pred CCcchh---------------------------hHhhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeee Confidence 000000 00000001111112222111 1112235666666666666654433 Q ss_pred hhh--hhhhhceeEecCCce-EEE---EEcCCcceeeecccccccccccccceeEeecceeEEE--eehhhHHHHhcCHH Q lcl|Aclame:pro 105 AEH--PLLKVINFKNTSLRL-KAL---TAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTA--FVVIPKDALKFGPK 176 (377) Q Consensus 105 ~~s--~l~~~~~v~~~~~~~-~~p---~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~--~~~iS~ell~ds~~ 176 (377) ... .+++-..+.+..+.+ +|- .....+.+.++.|.+- ++.+++.+.......+=+.. .+.+-..|.+ +.. 
T Consensus 54 ~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~-~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn-~~~ 131 (464) T protein:vir:80 54 ADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGV-APISDPNLRQKTVNMKYVSDTKNMSIATGLVN-NIE 131 (464) T ss_pred cccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccc-cccCCCceEEEEEEeeeeecceeeeeehhhhc-chh Confidence 322 344555556655432 332 2333456677765554 46788999998887664432 2334444444 466 Q ss_pred HHHHHHHHHHHHHHHHHhhcceeeccCC-----C-----cceeeeeccccccccccccccccccchhhhhhhhhhccChH Q lcl|Aclame:pro 177 WLKQFITEQLKEAIAVALELAIVKGNGL-----L-----QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPD 246 (377) Q Consensus 177 ~~~~~l~~~la~~~a~~~~~a~l~G~G~-----~-----~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 246 (377) |-+..+.+.-...++..++.+.++|+-. + |..||.+-+....+....+....- .....+.......++. T Consensus 132 d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~-~~ln~Aa~~i~~~fGt 210 (464) T protein:vir:80 132 DPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTE-ALLNQASVLVGKGYGT 210 (464) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCH-HHHhhhhhhhhcccCC Confidence 7788888888889999999999999832 1 334665544444433333322210 0000000000000000 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccchhhhh-----cccccccCCCCccccccCCCceEEecCCCC---cc Q lcl|Aclame:pro 247 TAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL-----EAKFTSRNQFGEYVTVLPHGITILESLAVE---TG 318 (377) Q Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~-----~~~~~~~~~~G~~~~~l~~~~~v~~s~~~~---~~ 318 (377) ..+ ++|.+.+..++ .++......+|+-. 
..|++++-+.+..-+ .+ T Consensus 211 -----~TD---------------------~~lp~~v~a~f~n~~l~~q~~~~~~n~~~~-~~G~~v~~f~sa~G~i~L~~ 263 (464) T protein:vir:80 211 -----PTD---------------------AYMPIGVQADFVNQQLDRQVQVISDNGQNA-TMGFNVKGFNSARGFIRLHG 263 (464) T ss_pred -----hhh---------------------cccchhHHHHHHhhhcCceeEEEcCCCCcc-eeeeecccccccccceeccC Confidence 000 11122111111 11111111121111 112222111111000 01 Q ss_pred eEEEE-----ecccEE---EEecceeeEEeechh--hhhc-C---cEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 319 KAIAF-----VANRYD---AFMATASTIEEYDQT--FAME-D---LQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 319 ~ii~g-----d~s~y~---~~~~~~~~i~~~~~~--~f~~-~---~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) ..+.. |++.-. .-....++..++... .|.. + ..-|+....-+..=-.|-..+-.+++++ T Consensus 264 s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~ 336 (464) T protein:vir:80 264 STVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVASVVIDDK 336 (464) T ss_pred ccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccceeEEEEEEECCCCccccceeeeeeecCc Confidence 11112 111100 111223344444332 2332 2 2346655544433334444677777777 No 203 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=41.60 E-value=0.92 Score=20.78 Aligned_cols=348 Identities=12% Similarity=0.045 Sum_probs=120.3 Q ss_pred ccHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHH---HHHHHH Q lcl|Aclame:pro 3 INLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIK---FFNDID 79 (377) Q Consensus 3 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~~~~~~ 79 (377) |+.|+|.++|.-+.+-.. ..++...-+...+......+++++.... +++..........-|++.+-. .+.... 
T Consensus 1 ~~~~~l~~kw~p~l~~~~--~~~i~~~~~~~i~~~~~en~~~~~~~~~--~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~ 76 (519) T protein:vir:10 1 MKKNALVQKWSALLENEA--LPEIVGASKQAIIAKIFENQEQDILTAP--EYRDEKISEAFGSFLTEAEIGGDHGYDATN 76 (519) T ss_pred CchhHHHHHhHHhhcccc--cchhhhhhhHHHHHHHHHHHHHHhhhcc--cccchHHHHHHhhhcchhccCCccccCccc Confidence 778899998886654111 1111101111111111111111111100 000000000011111111100 000000 Q ss_pred hccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------EEEEEcCC--------------cceeee Q lcl|Aclame:pro 80 KNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------KALTAETS--------------GTAVWG 136 (377) Q Consensus 80 ~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~~p~~~~~--------------~~a~w~ 136 (377) -..+.+.|+. ..+...++..+|. .-+-.+++-|.||++.. +.-..... +.+.|- T Consensus 77 i~~~~~t~~v---~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fS 153 (519) T protein:vir:10 77 IAAGQTSGAV---TQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFS 153 (519) T ss_pred cccccccccc---cccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccC Confidence 0000011111 1223333333322 12234556666665321 11111000 000000 Q ss_pred cc--------------------------------------------------------------------cc------c- Q lcl|Aclame:pro 137 DI--------------------------------------------------------------------FG------E- 141 (377) Q Consensus 137 ~e--------------------------------------------------------------------~~------~- 141 (377) +. 
++ + T Consensus 154 G~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEa 233 (519) T protein:vir:10 154 GQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAEL 233 (519) T ss_pred ccccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhc Confidence 00 00 0 Q ss_pred ---ccccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHhhcceeeccCC-Cc Q lcl|Aclame:pro 142 ---IKGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVALELAIVKGNGL-LQ 206 (377) Q Consensus 142 ---~~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-~~ 206 (377) ....+...|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..-+|+.||.=-.. .+ T Consensus 234 l~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~ 313 (519) T protein:vir:10 234 QEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQ 313 (519) T ss_pred cccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhh Confidence 0001122366666666666543 45888888873 467999999999999999999999841111 11 Q ss_pred --ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc---ccCceEEEeccc Q lcl|Aclame:pro 207 --PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK---IAGQVKLLLNPE 281 (377) Q Consensus 207 --P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~n~~ 281 (377) -.|+-.. .+..+.+.....+.+ -...++....+..++-...+..+...+. ..++ .+++.|. 
T Consensus 314 ~~~~g~t~~-------~~~~aGv~d~~~~~d------~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn-~ii~S~~ 379 (519) T protein:vir:10 314 VGKSGMTNT-------VGAKAGVFDFQDPID------IRGARWAGESFKALLFQIDKEAAEIARQTGRGAGN-FIIASRN 379 (519) T ss_pred cceeecccC-------cccccceeecccccc------cccchHHHHHHHHHHHHHHHHHHHHHHhhcccccc-EEEEchH Confidence 1121100 000000000000000 0011111111112222222222222222 2233 3445553 Q ss_pred hhhhhcccc--------------cccCCCCccccccCCCceEEecCCCCcceEEEEecc-------cEEEEecceeeEEe Q lcl|Aclame:pro 282 DRWTLEAKF--------------TSRNQFGEYVTVLPHGITILESLAVETGKAIAFVAN-------RYDAFMATASTIEE 340 (377) Q Consensus 282 ~~~~~~~~~--------------~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s-------~y~~~~~~~~~i~~ 340 (377) -..-|.... ...+....+...|.-+++|+.+++.+.+-+++|--. -|+.==.....+.. T Consensus 380 Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~ 459 (519) T protein:vir:10 380 VVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRG 459 (519) T ss_pred HHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccc Confidence 222111110 000111223344555778888888887666666321 11100001111122 Q ss_pred echhhhhcCcEEEEEEEEEcCEEecccceEEEE-------eecC Q lcl|Aclame:pro 341 YDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLT-------LAGG 377 (377) Q Consensus 341 ~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~-------~~a~ 377 (377) .|...|.. .+-.+.|+ |-.++| |+-.. 
+.-| T Consensus 460 ~dp~sfqP---~~g~~tRY-~l~~NP--~~~~~~~~~~~~i~~g 497 (519) T protein:vir:10 460 SDPKNFQP---VMGFKTRY-GIGINP--FADPAAQAPTKRIQNG 497 (519) T ss_pred cCCccccc---eeeeeeee-ceeecC--cccccccCccceeccC Confidence 33333433 33344555 334555 32111 1111 No 204 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=41.40 E-value=0.93 Score=20.76 Aligned_cols=304 Identities=11% Similarity=0.006 Sum_probs=131.9 Q ss_pred ccHHHHHHHHHHHhc------cC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~ 136 (377) |+.+-|..|+..... .. .....+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 666666666554321 11 1224567888889999999999999999999998863 3456655555544432 Q ss_pred ccc--ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCCC------- Q lcl|Aclame:pro 137 DIF--GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGLL------- 205 (377) Q Consensus 137 ~e~--~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~------- 205 (377) ... ....+..-..++.-.+..++.-.=..|+.+.|+. ...+|..-+++.+.++++.-+=.--+||+-.. 
T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 211 1221111123444455555544445677777752 23578999999999988876666667776321 Q ss_pred cce------eeeecccccccccc-ccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 206 QPV------GLLKDLSQPTVDQS-TGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 206 ~P~------Gil~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .|. |+|......+.... .......-......+..-...+. .++..+..+..............+.+.+| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy----~NLDalV~D~~~~lI~~~~~~d~dLVviv 236 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDY----ASLDALVMDATNNLIEPWYQEDPDLVVIV 236 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 232 44421111110000 00000000000000000000111 11111111111000111122234455555 Q ss_pred ccchhhhhcccccccCC------CCcccc--ccCCCceEEecCCCCcceEEEEe---cccEEEE--ecceee-EE-eec- Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQ------FGEYVT--VLPHGITILESLAVETGKAIAFV---ANRYDAF--MATAST-IE-EYD- 342 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~------~G~~~~--~l~~~~~v~~s~~~~~~~ii~gd---~s~y~~~--~~~~~~-i~-~~~- 342 (377) -..-...-...+..... .++-+. -..=|+|.+.-+++|++.+++=- +|=|+-. .|.-+. .. +.. 
T Consensus 237 G~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 316 (357) T protein:vir:20 237 GRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRV 316 (357) T ss_pred chhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccc Confidence 54321111111111110 011110 01126788888999999987654 4444421 111111 11 111 Q ss_pred hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 343 QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 343 ~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+...+ .+|..-.+--+.+++. +.+...++. T Consensus 317 E~y~s~N-e~YvVEd~~~~a~iE~--i~~~~~~~p 348 (357) T protein:vir:20 317 ENYESMN-IDYVVEDYAAGCLVEK--IKVGDFSTP 348 (357) T ss_pred cchhhhc-ceeeeeccccEEEeee--eeeccccCC Confidence 1122222 2554333333334332 222221111 No 205 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=41.07 E-value=0.94 Score=20.72 Aligned_cols=304 Identities=10% Similarity=0.007 Sum_probs=131.0 Q ss_pred ccHHHHHHHHHHHhc------cC--CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeee Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VG--GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~--~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~ 136 (377) |+.+-|..|+..... .. .....+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-. 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 666666666554321 11 1224567888889999999999999999999998863 3456655555544432 Q ss_pred ccc--ccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCCC------- Q lcl|Aclame:pro 137 DIF--GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGLL------- 205 (377) Q Consensus 137 ~e~--~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~------- 205 (377) ... ....+..-..++.-.+..++.-.=..|+.+.|+. ...+|..-+++.+.++++.-+=.--+||+-.. T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 211 1221111123444455555544445677777752 23578999999999988876666667776321 Q ss_pred cce------eeeeccccccccccccc-cccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEe Q lcl|Aclame:pro 206 QPV------GLLKDLSQPTVDQSTGR-DITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) Q Consensus 206 ~P~------Gil~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (377) .|. |+|......+....-.. ....-......+..-...+. 
.++..+..+..............+.+.+| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy----~NLDalV~D~~~~lI~~~~~~d~dLVviv 236 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDY----ASLDALVMDATNNLIEPWYQEDPDLVVIV 236 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 232 44422111110000000 00000000000000000111 11111111111000111122234455555 Q ss_pred ccchhhhhcccccccCC------CCcccc--ccCCCceEEecCCCCcceEEEEe---cccEEEE--ecceee-EE-eec- Q lcl|Aclame:pro 279 NPEDRWTLEAKFTSRNQ------FGEYVT--VLPHGITILESLAVETGKAIAFV---ANRYDAF--MATAST-IE-EYD- 342 (377) Q Consensus 279 n~~~~~~~~~~~~~~~~------~G~~~~--~l~~~~~v~~s~~~~~~~ii~gd---~s~y~~~--~~~~~~-i~-~~~- 342 (377) -..-...-...+..... .++-+. -..=|+|.+.-+++|++.+++=- +|=|+-. .|.-+. .. +.. T Consensus 237 G~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 316 (357) T protein:vir:56 237 GRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRV 316 (357) T ss_pred chhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccc Confidence 54321111111111110 011110 01126788888999999987654 4444421 111111 11 111 Q ss_pred hhhhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 343 QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 343 ~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) |.+...+ .+|..-.+.-+.+++. 
..+...++- T Consensus 317 E~y~s~N-e~YvVEd~~~~a~iE~--i~i~~~~~~ 348 (357) T protein:vir:56 317 ENYESMN-IDYVVEDYAAGCLVEK--IKVGDFSTP 348 (357) T ss_pred cchhhhc-ceeeeeccccEEEeee--eeeccCCCC Confidence 1122222 2453333332333332 111111111 No 206 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=40.58 E-value=0.96 Score=20.67 Aligned_cols=346 Identities=10% Similarity=0.002 Sum_probs=116.1 Q ss_pred CCccHHHHHHHHHHHHHHHH--HHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHH---H Q lcl|Aclame:pro 1 MAINLKELPKYREAVAELSA--KISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKF---F 75 (377) Q Consensus 1 m~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~---~ 75 (377) |+. .|+|.++|.-+.+-.. .+.+.-.+..-.+.+|.....+.++. .++........+.-|++.+..- + T Consensus 1 ~~~-~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~------~~~~~~~~~~~~~~l~ea~~~~~~~~ 73 (528) T protein:vir:80 1 MKT-TKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDP------IYKDEKVVEAFGGFIAEAEVAGDHGY 73 (528) T ss_pred Ccc-hHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccc------cccchHHHHhhhhhccccccccccCC Confidence 544 4666666654433111 11111111111122222222221111 1111111111111121111000 0 Q ss_pred HH-HHhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------E--EEEEcC------------Cc Q lcl|Aclame:pro 76 ND-IDKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------K--ALTAET------------SG 131 (377) Q Consensus 76 ~~-~~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~--~p~~~~------------~~ 131 (377) +. ...+++++ +.. +.+...++..+|. .-+-.+++-|.||+|.. + ++-... 
.+ T Consensus 74 ~~~~i~es~~t-~~v---~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~ 149 (528) T protein:vir:80 74 DASQIAAGQTT-GAI---TNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAP 149 (528) T ss_pred ccccccccccc-ccc---ccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCcccccccccccccccc Confidence 00 00000100 000 1112222222221 11223445555554320 1 100000 00 Q ss_pred ce------------------------------------------------------------------------------ Q lcl|Aclame:pro 132 TA------------------------------------------------------------------------------ 133 (377) Q Consensus 132 ~a------------------------------------------------------------------------------ 133 (377) ++ T Consensus 150 da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~ 229 (528) T protein:vir:80 150 DAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKL 229 (528) T ss_pred ccccccccccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccc Confidence 00 Q ss_pred eee-----ccccc----ccccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 134 VWG-----DIFGE----IKGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVA 193 (377) Q Consensus 134 ~w~-----~e~~~----~~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~ 193 (377) .-+ ...++ ....+...|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..- T Consensus 230 ~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlE 309 (528) T protein:vir:80 230 AEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLE 309 (528) T ss_pred cccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHH Confidence 000 00000 0011233477777777776643 45888888873 4688999999999999999 Q ss_pred hhcceeeccCCC-c--ceeeeecccc--ccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 194 LELAIVKGNGLL-Q--PVGLLKDLSQ--PTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPL 268 (377) Q Consensus 194 
~~~a~l~G~G~~-~--P~Gil~~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (377) +|+.||.=-... + -.|+...+.. +..+-....+. ...++.......++-...+..+...+ T Consensus 310 INReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~---------------~g~r~~~e~~k~L~~~i~~~an~I~~ 374 (528) T protein:vir:80 310 INREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDT---------------RGARWAGESFKSLIYQIDKEAAEIAR 374 (528) T ss_pred hhHHHHhhhhheeeeeeeeeeeccccccceeeccccccc---------------cccchhHHHHHHHHHHHHHHHHHHHH Confidence 999996310100 0 0121110000 00000000000 00011111111122222222222222 Q ss_pred c---ccCceEEEeccchhhhhcccc-------------cccC-CCCccccccCCCceEEecCCCCcceEEEEecc----- Q lcl|Aclame:pro 269 K---IAGQVKLLLNPEDRWTLEAKF-------------TSRN-QFGEYVTVLPHGITILESLAVETGKAIAFVAN----- 326 (377) Q Consensus 269 ~---~~~~~~~~~n~~~~~~~~~~~-------------~~~~-~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s----- 326 (377) . ..++ .+++.|.-..-|...- ...+ ....|...|.-+++|+.+++.+.+-+++|--. T Consensus 375 ~T~~~~gn-~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~ 453 (528) T protein:vir:80 375 QTGRGAGN-FVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMD 453 (528) T ss_pred hhcccccc-EEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccc Confidence 2 2233 3445543322221100 0001 11123344445678888888887666666321 Q ss_pred --cEEEEecceeeEEeechhhhhcCcEEEEEEEEEcCEEecccc--------------------------eEEEEeecC Q lcl|Aclame:pro 327 --RYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHT--------------------------AALLTLAGG 377 (377) Q Consensus 327 --~y~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~a--------------------------f~~l~~~a~ 377 (377) -|+-==..+.-....|...|.. 
.+-.+.|+ |-.++|=+ |++|.++.= T Consensus 454 ~glfy~PYv~l~~~~~~dp~sfqP---~~g~~tRY-~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 454 AGIYYAPYVALTPLRATDPQSFHP---VLGFKTRY-GIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cceeecccccceeeEeeCCccccc---eeeeeeee-ceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 1110000111122334434443 23334444 33444411 122222211 No 207 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=40.46 E-value=0.97 Score=20.66 Aligned_cols=295 Identities=12% Similarity=0.046 Sum_probs=132.7 Q ss_pred ccHHHHHHHHHHHhc------cCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeeecc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e 138 (377) |+.+-|..|+..... .......+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 666666666554321 122334566777889999999999999999999998862 345555555554443221 Q ss_pred -cccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCC-------Ccce Q lcl|Aclame:pro 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGL-------LQPV 208 (377) Q Consensus 139 -~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-------~~P~ 208 (377) .....+..-...+.-.+..++.---..|+.+.|+. ...+|..-+++.+.++++.-+-.--+||+-. ..|. 
T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 11221111223455555555555556788888862 3457999999999999887776666777641 1232 Q ss_pred ------eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 209 ------GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 209 ------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) |+|......+....-.....+ .....++ ...+. .++..+..+..-.-.........+.+.+|-..- T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~--~~~i~iG--~~gdy----~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:10 161 LQDVNIGWLQQYRERAAQRVLHEGAKQ--AGKVLVG--KAGDY----ENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) T ss_pred ccccchhHHHHHHhcchhhhhcccccc--Ccceeec--CCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 444322111100000000000 0000000 00111 111111111110001111222334555554322 Q ss_pred hhhhcccccccCCCCcc--------cc--ccCCCceEEecCCCCcceEEEEeccc---EEEE--eccee-eEE-eec-hh Q lcl|Aclame:pro 283 RWTLEAKFTSRNQFGEY--------VT--VLPHGITILESLAVETGKAIAFVANR---YDAF--MATAS-TIE-EYD-QT 344 (377) Q Consensus 283 ~~~~~~~~~~~~~~G~~--------~~--~l~~~~~v~~s~~~~~~~ii~gd~s~---y~~~--~~~~~-~i~-~~~-~~ 344 (377) ...-. ....+....+ .. -..=|+|.+.-+++|++.+++=-|+. |+-. .|.-+ +.. +.. |. 
T Consensus 233 ladk~--~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:10 233 LHDKY--FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hhHHh--hHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccc Confidence 11111 1111111111 11 01127788889999999987655544 4321 11111 110 010 11 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +...+ .+|..-.+.-+.+++ -+++..+ T Consensus 311 y~s~N-e~YvVEd~~~~a~ie-----nI~~~~a 337 (337) T protein:vir:10 311 YESSN-DAYVVEDFGCGCVAE-----NIELAAA 337 (337) T ss_pred hhhcc-ceeeeeccccEEEEe-----ceeecCC Confidence 11222 244332222222222 1222222 No 208 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=38.05 E-value=1.1 Score=20.39 Aligned_cols=295 Identities=12% Similarity=0.036 Sum_probs=131.2 Q ss_pred ccHHHHHHHHHHHhc------cCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeeecc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e 138 (377) |+.+-|..|+..... .......+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-... 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC Confidence 666666666654321 223344667888889999999999999999999998862 345555555444433221 Q ss_pred -cccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCCC-------cce Q lcl|Aclame:pro 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGLL-------QPV 208 (377) Q Consensus 139 -~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~~-------~P~ 208 (377) .....+..-...+.-.+..++.--=..|+.+.|+. ...+|..-+++.+.++++.-.=.--+||+-.. .|. T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 11121111123444445555444445677887752 34579999999999988877666667776321 232 Q ss_pred ------eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 209 ------GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 209 ------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) |+|......+....-.....+ .....++ ...+. 
.++..+..+..-.-.........+.+.+|-..- T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~--~~~i~iG--~~gdy----~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dL 232 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQ--AGKVLIG--KAGDY----ENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) T ss_pred ccccchHHHHHHHhcchhhhhcccccc--CCceeec--CCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 444222111100000000000 0000000 01111 111111111110001111222344555555432 Q ss_pred hhhhcccccccCCCCcc--------cc--ccCCCceEEecCCCCcceEEEEec---ccEEE--Eecceee-EE-eec-hh Q lcl|Aclame:pro 283 RWTLEAKFTSRNQFGEY--------VT--VLPHGITILESLAVETGKAIAFVA---NRYDA--FMATAST-IE-EYD-QT 344 (377) Q Consensus 283 ~~~~~~~~~~~~~~G~~--------~~--~l~~~~~v~~s~~~~~~~ii~gd~---s~y~~--~~~~~~~-i~-~~~-~~ 344 (377) ...-.. ...+....+ +. -..=|.|.+.-+++|++.+++=-| |=|+- ..|.-+. .. +.. |. T Consensus 233 ladk~~--~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:78 233 LHDKYF--PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hHHHHH--HHHhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccc Confidence 111111 111111111 11 011267888889999999876544 44432 1111111 10 010 11 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +...+ .+|..-.+.-+.+++ -+++..+ T Consensus 311 y~s~N-e~YvVEd~~~~a~iE-----nI~~~~a 337 (337) T protein:vir:78 311 YESSN-DAYVVEDFGCGCVAE-----NIELAAA 337 (337) T ss_pred hhhcc-ceeeeeccccEEEEe-----ceeecCC Confidence 11222 244332222222222 1222222 No 209 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=36.14 E-value=1.2 Score=20.17 Aligned_cols=310 Identities=13% Similarity=0.029 Sum_probs=120.9 Q ss_pred HHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHhccC--CCCCceeccHHHH----HHHHHHHHhhhhhhhhceeE 
Q lcl|Aclame:pro 43 GDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVG--GKDKFKLLPEETM----VQVFDDLVAEHPLLKVINFK 116 (377) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~--~s~gg~lvP~~~~----~~Ii~~~~~~s~l~~~~~v~ 116 (377) .++...-+.-+ +-.+-.......++.+-...-........ .+.+...+|..+. ..+++.+...-....++.+. T Consensus 1 ~~~~~~~~~l~-~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~ 79 (336) T protein:vir:10 1 MRDAQRIQNLA-RAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGES 79 (336) T ss_pred CchHHHHHHHh-hcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhcccc Confidence 00000000000 00000001111122211111111111111 1111223554333 23333333332233333333 Q ss_pred ecC----CceEEEEEcCCcceeeecccccccccccccceeEeecceeEEEeehhh-HHHHh--cCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 117 NTS----LRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIP-KDALK--FGPKWLKQFITEQLKEA 189 (377) Q Consensus 117 ~~~----~~~~~p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS-~ell~--ds~~~~~~~l~~~la~~ 189 (377) ..+ ....+++....+.+.+.+..... +..+..-...+-..+.+..-+.++ .|+-. ....++.+--....+++ T Consensus 80 t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~a 158 (336) T protein:vir:10 80 KKGDWTTLVAAFITAEPTTKVATYGDYSSD-GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) T ss_pred ccCCccceeEEEeeeeceeeEEEeeccCCC-ceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHH Confidence 322 13456776666777776433333 445544444455566677667777 44432 33567888888999999 Q ss_pred HHHHhhcceeeccCCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhc Q lcl|Aclame:pro 190 IAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) Q Consensus 190 ~a~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (377) +.+.+|+-.++|++..+-.|++|++.........+.... ...+...++.+..+...+.....+. .. 
T Consensus 159 le~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~-------------~~t~eei~~Di~~~~~~l~~qs~G~-i~ 224 (336) T protein:vir:10 159 LAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG-------------SPAVEAVVNEVVALFQVLQTQSQGI-IT 224 (336) T ss_pred HHHhhCcEEEEeccccceEEEEeCCCCccccccCCCccc-------------ccCHHHHHHHHHHHHHHHHHhcCCe-ec Confidence 999999999999988889999998766432221111110 0111222233333333222211111 11 Q ss_pred ccCceEEEeccchhhhhcccccccCCCCccc-cccC--C-CceEEecCCCCcceEEEEecccEEEEecc---eeeEEeec Q lcl|Aclame:pro 270 IAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV-TVLP--H-GITILESLAVETGKAIAFVANRYDAFMAT---ASTIEEYD 342 (377) Q Consensus 270 ~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~-~~l~--~-~~~v~~s~~~~~~~ii~gd~s~y~~~~~~---~~~i~~~~ 342 (377) ......++|-|+. ...+.. .+..|.-+ ..+- | ++.++..+-.... -|+..+++..... ..++.... T Consensus 225 ~~~~~tL~LP~~~-~~~Ls~---~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a---~G~~~~l~~~~~~~~~t~~~~~p~ 297 (336) T protein:vir:10 225 QEDVLRMGLPPTA-MSDLSK---TNQYGLAAAAKLKDIFPKLEFVTIPEYDTA---SGRLVQLWAPRVEGKDTATCGFTE 297 (336) T ss_pred ccCcceEEecHHH-HHhccC---CCccCccHHHHHHHhcCccEEEEccccccC---CCceEEEEEEecCCCcceeeecch Confidence 1112344555443 222221 23333211 1110 2 2444333222110 1221122211111 12221111 Q ss_pred hhh---hhcC--cEEEEEEEEEcCEE-ecccceEEEEeecC Q lcl|Aclame:pro 343 QTF---AMED--LQLYLTKNYFYGKA-KDNHTAALLTLAGG 377 (377) Q Consensus 343 ~~~---f~~~--~~~~~~~~r~dg~~-~~~~af~~l~~~a~ 377 (377) .-. .... 
.....+..|..|.+ ..|-||+.++ += T Consensus 298 ~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~--GI 336 (336) T protein:vir:10 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) T ss_pred hhhccceeecCceeEeccccceeeeeeeccchheeee--cC Confidence 000 0111 12233556666643 3456666543 11 No 210 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=34.74 E-value=1.3 Score=20.01 Aligned_cols=295 Identities=12% Similarity=0.044 Sum_probs=132.1 Q ss_pred ccHHHHHHHHHHHhc------cCCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEEcCCcceeeecc Q lcl|Aclame:pro 67 LTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) Q Consensus 67 lt~~e~~~~~~~~~~------~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~~~~v~~~~~--~~~~p~~~~~~~a~w~~e 138 (377) |+.+-|..|+..... .....-.+.|-+.+...+.+.+++.|-+++.++++++.- +-++....+++-++-... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 666666666654332 122233566777889999999999999999999998862 345555555554443221 Q ss_pred -cccccccccccceeEeecceeEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHhhcceeeccCC-------Ccce Q lcl|Aclame:pro 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKF--GPKWLKQFITEQLKEAIAVALELAIVKGNGL-------LQPV 208 (377) Q Consensus 139 -~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d--s~~~~~~~l~~~la~~~a~~~~~a~l~G~G~-------~~P~ 208 (377) .....+..-...+.-.+..++.---..|+.+.|+. ...+|..-+++.+.++++.-+-.--+||+-. ..|. 
T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 11221111223455555555555556788888862 3457999999999999887776666777641 1233 Q ss_pred ------eeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 209 ------GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 209 ------Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) |+|......+....-.....+ .....++ ...+. .++..+..+..-.-.........+.+.+|-..- T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~--~~~i~iG--~~gdy----~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQ--AGKVLVG--KAGDY----ENLDALVMDIVSSMIDPWFQEDTGLVAICGREL 232 (337) T ss_pred ccccchhHHHHHHhcchhhhhcccccc--Ccceeec--CCCCc----ccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 444322111100000000000 0000000 01111 111111111110001111222334555554322 Q ss_pred hhhhcccccccCCCCcc--------cc--ccCCCceEEecCCCCcceEEEEeccc---EEE--Eeccee-eEE-eec-hh Q lcl|Aclame:pro 283 RWTLEAKFTSRNQFGEY--------VT--VLPHGITILESLAVETGKAIAFVANR---YDA--FMATAS-TIE-EYD-QT 344 (377) Q Consensus 283 ~~~~~~~~~~~~~~G~~--------~~--~l~~~~~v~~s~~~~~~~ii~gd~s~---y~~--~~~~~~-~i~-~~~-~~ 344 (377) ...-. ....+....+ .. -..=|+|.+.-+++|++.+++=-|+. |+- ..|.-+ +.. +.. |. 
T Consensus 233 ladk~--~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:79 233 LHDKY--FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hhHHh--hHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccc Confidence 11111 1111111111 11 01127788889999999987655544 432 111111 110 010 11 Q ss_pred hhhcCcEEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 345 FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 345 ~f~~~~~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) +...+ .+|..-.+.-+.+++ . +++... T Consensus 311 y~s~N-e~YvVEd~~~~a~ie--n---I~~~~a 337 (337) T protein:vir:79 311 YESSN-DAYVVEDFGCGCVAE--N---IELAAA 337 (337) T ss_pred hhhcc-ceeeeeccccEEEEe--c---eeecCC Confidence 11222 244332222222222 1 122222 No 211 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=275 Identities=14% Similarity=0.045 Sum_probs=103.6 Q ss_pred HHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhh--hce--eEecCC-ceEE Q lcl|Aclame:pro 50 NEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK--VIN--FKNTSL-RLKA 124 (377) Q Consensus 50 ~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~--~~~--v~~~~~-~~~~ 124 (377) +..+.++.-. .-.|+ -+ .| ...+..-+- ++=.+....+++.+.....+-. 
.++ +.-.+| .++| T Consensus 1 ~~~~~~~~~~----~~~~~--~~-~~----~~~~~~~nt-~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkI 68 (319) T protein:vir:94 1 MNKTIKNATG----MLKLN--LQ-HF----ANKSVEPGQ-TLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTV 68 (319) T ss_pred CCcccccccc----eeEee--hh-hh----hccCCCcch-HHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEE Confidence 0000000000 00010 01 11 001111111 2222333333443333322221 111 222344 5899 Q ss_pred EEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHH--HHHHHHHHHHHHHHHhhcceeecc Q lcl|Aclame:pro 125 LTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWL--KQFITEQLKEAIAVALELAIVKGN 202 (377) Q Consensus 125 p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~--~~~l~~~la~~~a~~~~~a~l~G~ 202 (377) |.....+-..+-...+-.....+.+...++|...+.-.+..=.-+ .+.+...+ ...+.+.....++-.+|...+.-- T Consensus 69 p~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skl 147 (319) T protein:vir:94 69 MKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) T ss_pred eeecccccccccCCCCcccCCcccceeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 988765544443222222233334555566665555444321111 12222222 223333344444444554333210 Q ss_pred CCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 203 GLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 203 G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) -. .. ......+ ....+.+..+..++..+.. .+.| . +.+++|+|.. 
T Consensus 148 a~--------~a---~~~~~~~------------------~t~~n~y~~i~~a~~~Lde--~~VP---~-~Rvl~Vtp~~ 192 (319) T protein:vir:94 148 AR--------NK---AKHLTVG------------------TGSDAQYDAVLDVSVELDE--IKAP---E-NRVLFVSPTF 192 (319) T ss_pred Hh--------hc---ccccccc------------------cCHHHHHHHHHHHHHHHHh--cCCC---C-CcEEEeCHHH Confidence 00 00 0000000 0112222333333332211 1112 2 3566788876 Q ss_pred hhhhcccc-cccCC--------CCccccccCCCceEEec--CCCCcceEEEEecccEEE-EecceeeEEeechhhhhcCc Q lcl|Aclame:pro 283 RWTLEAKF-TSRNQ--------FGEYVTVLPHGITILES--LAVETGKAIAFVANRYDA-FMATASTIEEYDQTFAMEDL 350 (377) Q Consensus 283 ~~~~~~~~-~~~~~--------~G~~~~~l~~~~~v~~s--~~~~~~~ii~gd~s~y~~-~~~~~~~i~~~~~~~f~~~~ 350 (377) +..|.... ...+. +|.-.++ -|++|+.. ..++.-.+++|-.+-... ..-..+++-...+..|. T Consensus 193 ~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i--dG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~a--- 267 (319) T protein:vir:94 193 YKGIKKFVIALPQGDTRQQVLGKGVQGEL--DGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFG--- 267 (319) T ss_pred HHHHHhhhhhhccccccccceeeeeceee--cCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccccc--- Confidence 65442111 01111 1211122 35666553 444444466665543221 11123333221222232 Q ss_pred EEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 351 QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 351 ~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) -.|++..++|.++.++++..+...+.- T Consensus 268 ~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 268 TLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eeeeeeeeeeeEEeccccceEEEeecC Confidence 378899999999999998777764443 No 212 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=275 Identities=14% Similarity=0.045 Sum_probs=103.6 Q ss_pred HHHHHHHHHHhccccccccHHHHHHHHHHHhccCCCCCceeccHHHHHHHHHHHHhhhhhhh--hce--eEecCC-ceEE Q lcl|Aclame:pro 50 
NEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK--VIN--FKNTSL-RLKA 124 (377) Q Consensus 50 ~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~s~gg~lvP~~~~~~Ii~~~~~~s~l~~--~~~--v~~~~~-~~~~ 124 (377) +..+.++.-. .-.|+ -+ .| ...+..-+- ++=.+....+++.+.....+-. .++ +.-.+| .++| T Consensus 1 ~~~~~~~~~~----~~~~~--~~-~~----~~~~~~~nt-~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkI 68 (319) T protein:vir:97 1 MNKTIKNATG----MLKLN--LQ-HF----ANKSVEPGQ-TLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTV 68 (319) T ss_pred CCcccccccc----eeEee--hh-hh----hccCCCcch-HHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEE Confidence 0000000000 00010 01 11 001111111 2222333333443333322221 111 222344 5899 Q ss_pred EEEcCCcceeeecccccccccccccceeEeecceeEEEeehhhHHHHhcCHHHH--HHHHHHHHHHHHHHHhhcceeecc Q lcl|Aclame:pro 125 LTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWL--KQFITEQLKEAIAVALELAIVKGN 202 (377) Q Consensus 125 p~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~~--~~~l~~~la~~~a~~~~~a~l~G~ 202 (377) |.....+-..+-...+-.....+.+...++|...+.-.+..=.-+ .+.+...+ ...+.+.....++-.+|...+.-- T Consensus 69 p~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skl 147 (319) T protein:vir:97 69 MKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) T ss_pred eeecccccccccCCCCcccCCcccceeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 988765544443222222233334555566665555444321111 12222222 223333344444444554333210 Q ss_pred CCCcceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcccCceEEEeccch Q lcl|Aclame:pro 203 GLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) Q Consensus 203 G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 282 (377) -. .. ......+ ....+.+..+..++..+.. .+.| . +.+++|+|.. 
T Consensus 148 a~--------~a---~~~~~~~------------------~t~~n~y~~i~~a~~~Lde--~~VP---~-~Rvl~Vtp~~ 192 (319) T protein:vir:97 148 AR--------NK---AKHLTVG------------------TGSDAQYDAVLDVSVELDE--IKAP---E-NRVLFVSPTF 192 (319) T ss_pred Hh--------hc---ccccccc------------------cCHHHHHHHHHHHHHHHHh--cCCC---C-CcEEEeCHHH Confidence 00 00 0000000 0112222333333332211 1112 2 3566788876 Q ss_pred hhhhcccc-cccCC--------CCccccccCCCceEEec--CCCCcceEEEEecccEEE-EecceeeEEeechhhhhcCc Q lcl|Aclame:pro 283 RWTLEAKF-TSRNQ--------FGEYVTVLPHGITILES--LAVETGKAIAFVANRYDA-FMATASTIEEYDQTFAMEDL 350 (377) Q Consensus 283 ~~~~~~~~-~~~~~--------~G~~~~~l~~~~~v~~s--~~~~~~~ii~gd~s~y~~-~~~~~~~i~~~~~~~f~~~~ 350 (377) +..|.... ...+. +|.-.++ -|++|+.. ..++.-.+++|-.+-... ..-..+++-...+..|. T Consensus 193 ~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i--dG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~~a--- 267 (319) T protein:vir:97 193 YKGIKKFVIALPQGDTRQQVLGKGVQGEL--DGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFG--- 267 (319) T ss_pred HHHHHhhhhhhccccccccceeeeeceee--cCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccccc--- Confidence 65442111 01111 1211122 35666553 444444466665543221 11123333221222232 Q ss_pred EEEEEEEEEcCEEecccceEEEEeecC Q lcl|Aclame:pro 351 QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) Q Consensus 351 ~~~~~~~r~dg~~~~~~af~~l~~~a~ 377 (377) -.|++..++|.++.++++..+...+.- T Consensus 268 ~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 268 TLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eeeeeeeeeeeEEeccccceEEEeecC Confidence 378899999999999998777764443 No 213 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=23.40 E-value=2.3 Score=18.59 Aligned_cols=345 Identities=11% Similarity=0.007 Sum_probs=117.8 Q ss_pred CCcc--HHHHHHHHHHHHHHHHHHHhccC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHH-- Q lcl|Aclame:pro 1 
MAIN--LKELPKYREAVAELSAKISAGAT--PEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKF-- 74 (377) Q Consensus 1 m~~~--~~~l~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~-- 74 (377) |+.. -|+|.++|.-+.+-.. +..-.+ +.--.+.+|.....+.++. ++++.......+.-|++.+..- T Consensus 1 ~~~~~~~e~l~~kw~p~l~~~~-~~~~~~~~~~~~a~l~enq~~~~~~~~------~~~~~~~~~~~~~~l~ea~~~~~~ 73 (522) T protein:vir:69 1 MTTIKTKAQLVDKWKELLEGEG-LPEIANSKQAIIAKIFENQEKDFEVSP------EYKDEKIAQAFGSFLTEAEIGGDH 73 (522) T ss_pred CCccchHHHHHHhhHHHhcCCC-CCccccchhhhhhhhhhhhhHHhhccc------ccchhHHHHhhhhhhhhhcccccc Confidence 6653 2556666655433211 000000 0011112222111111111 1222221111222222221100 Q ss_pred -HHH-HHhccCCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------E--EEEEcC------------ Q lcl|Aclame:pro 75 -FND-IDKNVGGKDKFKLLPEETMVQVFDDLVA---EHPLLKVINFKNTSLRL------K--ALTAET------------ 129 (377) Q Consensus 75 -~~~-~~~~~~~s~gg~lvP~~~~~~Ii~~~~~---~s~l~~~~~v~~~~~~~------~--~p~~~~------------ 129 (377) ++. ....++++ +.. +.+...++-.+|. .-+-.+++-|.||++.. + ++-... T Consensus 74 ~~~~~~i~es~~t-~~v---~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~n 149 (522) T protein:vir:69 74 GYNAQNIAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMY 149 (522) T ss_pred CCCcccccccccc-ccc---ccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCcccccccccc Confidence 000 00011111 110 1222222222211 12233455555554321 1 110000 Q ss_pred Ccceeee------------------------------------------------------------------------- Q lcl|Aclame:pro 130 SGTAVWG------------------------------------------------------------------------- 136 (377) Q Consensus 130 ~~~a~w~------------------------------------------------------------------------- 136 (377) .+.+.|. 
T Consensus 150 eadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~Gm 229 (522) T protein:vir:69 150 APDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGM 229 (522) T ss_pred ccccccccccccccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeecccc Confidence 0000000 Q ss_pred -ccccc----ccccccccceeEeecceeEEEe-------ehhhHHHHhcC----HHHHHHHHHHHHHHHHHHHhhcceee Q lcl|Aclame:pro 137 -DIFGE----IKGQLKQAFKEQDFSQFKLTAF-------VVIPKDALKFG----PKWLKQFITEQLKEAIAVALELAIVK 200 (377) Q Consensus 137 -~e~~~----~~~~~~~~f~~i~l~~~k~~~~-------~~iS~ell~ds----~~~~~~~l~~~la~~~a~~~~~a~l~ 200 (377) ...++ ....+...|.+..|...|..+- ...|-||.+|= ..|.|++|.+-|+..|..-+|+.||. T Consensus 230 sTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~ 309 (522) T protein:vir:69 230 ATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD 309 (522) T ss_pred chhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHh Confidence 00000 0011122466666666666543 45888988873 46789999999999999999998883 Q ss_pred ccC--CCc--ceeeeeccccccccccccccccccchhhhhhhhhhccChHHHHHHHHHHHHhhhhhhhhhhhcc---cCc Q lcl|Aclame:pro 201 GNG--LLQ--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKI---AGQ 273 (377) Q Consensus 201 G~G--~~~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 273 (377) += +.+ -.|+.+- .+...+.++..... 
.....++....+..++-...+..+.....+ .++ T Consensus 310 -~i~~sa~~~~~g~t~~---------~~~~~Gv~Dl~~~~----~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n 375 (522) T protein:vir:69 310 -WINYSAQVGKSGMTNI---------VGSKAGVFDFQDPI----DIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGN 375 (522) T ss_pred -hhhhhheeeccccccc---------cccccceeeccccc----ccccchhHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 20 001 0111100 00000000000000 000011212222222222222222222222 233 Q ss_pred eEEEeccchhhhhc--------------ccccccCCCCccccccCCCceEEecCCCCcceEEEEeccc--E-----EEEe Q lcl|Aclame:pro 274 VKLLLNPEDRWTLE--------------AKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR--Y-----DAFM 332 (377) Q Consensus 274 ~~~~~n~~~~~~~~--------------~~~~~~~~~G~~~~~l~~~~~v~~s~~~~~~~ii~gd~s~--y-----~~~~ 332 (377) .+++.|.-..-|. .....-+....+...|.-+++|+.+++.+.+-+++|.-.. + +-== T Consensus 376 -~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPY 454 (522) T protein:vir:69 376 -FIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPY 454 (522) T ss_pred -EEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccc Confidence 3445553322221 0000001111233445556788888888877666663211 1 1000 Q ss_pred cceeeEEeechhhhhcCcEEEEEEEEEcCEEecccceEEEE-------e---------ecC Q lcl|Aclame:pro 333 ATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLT-------L---------AGG 377 (377) Q Consensus 333 ~~~~~i~~~~~~~f~~~~~~~~~~~r~dg~~~~~~af~~l~-------~---------~a~ 377 (377) .....+...|...|.. .+-.+.|+ |-.++| |+.-. | ++| T Consensus 455 v~l~~~~~~dp~sfqP---~~g~~tRY-~l~vNP--~~~~~~~~~~~ri~~g~p~~~~~~~ 509 (522) T protein:vir:69 455 VALTPLRGSDPKNFQP---VMGFKTRY-GIGVNP--FAESSLQAPGARIQSGMPSILNSLG 509 (522) T ss_pred cccccccccCCccccc---eeeeeeee-ceeecC--cccccCCcccceeecccchhhcccC Confidence 0111112233333433 33344555 334444 32211 1 111 Done!